[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis
An Open Source Machine Learning Framework for Everyone
Stable Diffusion web UI
21 Lessons, Get Started Building with Generative AI: https://microsoft.github.io/generative-ai-for-beginners/
Tensors and dynamic neural networks in Python with strong GPU acceleration
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
Open Source Computer Vision Library
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
One minute of voice data is enough to train a good TTS model (few-shot voice cloning)
Port of OpenAI's Whisper model in C/C++
A deep learning toolkit for Text-to-Speech, battle-tested in research and production