??? ?? One-shot video tuning with Stable Diffusion
Generating Robotic Simulation Tasks via Large Language Models
? Scalable embedding, reasoning, ranking for images and sentences with CLIP
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
Effortless data labeling with AI support from Segment Anything and other awesome models.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
? Create Disco Diffusion artworks in one line
Unofficial implementation of Image Super-Resolution via Iterative Refinement by Pytorch
OpenMMLab Pre-training Toolbox and Benchmark