AIbase

Multimodal-voice-assistant

Public

This project is a multi-modal AI voice assistant that uses OpenAI's GPT-4, audio processing with WhisperModel, speech recognition, clipboard extraction, and image processing to respond to user prompts.

Creat2024-06-22T10:02:42
Update2025-03-21T00:19:57
6
Stars
0
Stars Increase

Related projects