A framework to enable multimodal models to operate a computer.
conditional import for voice
J
Josh Bickett committed
b011fa84edbc5567b31e8fb6d5e86abdb55d6764
Parent: ef5921f
A framework to enable multimodal models to operate a computer.