Multimodal-voice-assistant
PublicThis project is a multi-modal AI voice assistant that uses OpenAI's GPT-4, audio processing with WhisperModel, speech recognition, clipboard extraction, and image processing to respond to user prompts.
Creat:2024-06-22T10:02:42
Update:2025-03-21T00:19:57
6
Stars
0
Stars Increase