CLIP + FFT/DWT/RGB = text to image/video
Stable Diffusion web UI
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
Open Source Computer Vision Library
Real-time face swap and one-click video deepfake from a single image
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
One minute of voice data is enough to train a good TTS model (few-shot voice cloning)
Port of OpenAI's Whisper model in C/C++
A deep learning toolkit for Text-to-Speech, battle-tested in research and production
Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, macOS and Windows.
A generative speech model for daily dialogue.