Google open-sourced Stitch's core format, DESIGN.md, a machine-readable protocol addressing brand consistency in AI Agent UI generation. It combines YAML design tokens (e.g., colors, fonts) with plain-text annotations, giving AI clear design guidelines and logical support to generate brand-aligned UIs while adhering to WCAG accessibility rules.
Microsoft launches the upgraded image generation model MAI-Image-2-Efficient, which focuses on low cost and high efficiency. The model provides production-grade quality at nearly half the price, making it suitable for enterprises to quickly generate product photos and UI prototypes. The generation speed has improved by 22%, and overall efficiency has increased fourfold, with a significant reduction in costs.
Apple introduces two ML studies: SQUIRE enhances control and fine-tuning of AI-generated UIs with GPT-4o and a slot query representation, while the other improves image safety review to address current tech challenges.
Anthropic is developing a persistent agent called Conway for Claude, creating an independent, always-on intelligent environment. It will have an independent UI, going beyond traditional chat interfaces to run as an agent workspace. Conway can directly operate the browser, connect external tools, and integrate code capabilities to enable deep task processing and code execution.
Quick text-to-image conversion and editing, supporting poster design, UI simulation, and product visual design.
A versatile AI image generator launched by OpenAI, featuring extremely high text accuracy and pixel-level UI design capabilities.
AI-powered user testing platform
An AI-assisted visual UI development tool that helps AI accurately understand your UI modification intentions through precise DOM information and natural language descriptions.
Bytedance | Input: $3.5 per million tokens | Output: $12 per million tokens | Context length: 128
Baidu
-
unsloth
JanusCoder-8B is an open-source code intelligence foundation model built on Qwen3-8B, aiming to establish a unified visual programming interface. This model is trained on JANUSCODE-800K (the largest multimodal code corpus to date) and can handle various visual programming tasks, including data visualization, interactive web UI, and code-driven animation.
Disty0
This project is the result of 4-bit (UINT4, SVD rank of 32) quantization of Tencent's Hunyuan Image 3.0 model. It uses the SDNQ quantization method and is specifically designed for text-to-image generation tasks.
Tesslate
UIGENT-30B-Lora-200 is a large language model fine-tuned from Qwen3-Coder-30B-A3B-Instruct. It is optimized for text generation tasks and is particularly good at handling HTML-related content.
DevQuasar
This is the quantized version of the Tesslate/UIGEN-FX-4B-Preview model, aiming to make large language models easier to deploy and use through quantization technology, realizing the concept of 'Making knowledge accessible to everyone'.
QuantFactory
UIGEN-FX-4B-Preview is a UI generation model with 4 billion parameters, specifically optimized for front-end development. It can perform like a front-end engineer in 22 frameworks. This model focuses on generating semantic HTML code, with excellent layout rhythm, spacing control, and component combination capabilities, suitable for quickly generating single-file web pages.
enacimie
This is a GGUF format model converted from Tesslate/WEBGEN-4B-Preview, specifically designed for web page generation and UI design tasks. The model uses the Q4_K_M quantization format, which reduces the model size while maintaining good performance.
macpaw-research
This is a computer vision model fine-tuned based on Ultralytics/YOLO11, specifically designed to detect UI elements in macOS application screenshots. It is part of the Screen2AX project, dedicated to generating accessibility metadata using computer vision technology.
The quantized version of UIGEN-X-8B is committed to making knowledge accessible to the public. This model is optimized based on Tesslate/UIGEN-X-8B to provide more efficient inference performance.
electroglyph
This is a quantized model based on ONNX, which is the uint8 quantized version of Qwen/Qwen3-Embedding-0.6B. It reduces the model size while maintaining retrieval performance.
yujiepan
A quantized version of the ByteDance-Seed/UI-TARS-1.5-7B model that uses GPTQ for 4-bit weight quantization and 16-bit activation quantization. It handles image-text-to-text tasks with efficient inference performance.
Mungert
UI-TARS-1.5-7B is a multimodal model based on advanced technology, which performs excellently in tasks such as image-text conversion. It adopts an innovative quantization method and can maintain high accuracy at extremely low bit rates.
LZXzju
UI-R1-E-3B is an efficient GUI positioning model fine-tuned based on Qwen2.5-VL-3B-Instruct, specializing in visual question-answering tasks, particularly excelling at locating and identifying operational elements in user interface screenshots.
Lucy-in-the-Sky
This is the GGUF quantized version of the ByteDance-Seed/UI-TARS-1.5-7B model, specifically optimized for GUI understanding and multimodal tasks. The model uses 4-bit quantization technology to significantly reduce the model size and memory requirements while maintaining performance.
UIGEN-T2 is a UI generation model fine-tuned from Qwen2.5-Coder-7B-Instruct, specializing in generating HTML and Tailwind CSS code.
mlx-community
UI-TARS-1.5-7B-6bit is a vision-language model converted to the MLX format, supporting image understanding and text generation tasks.
UI-TARS-1.5-7B-4bit is a multimodal model focused on image-text-to-text conversion tasks, supporting the English language.
parasail-ai
UI-TARS is the next-generation native GUI intelligent agent model, which has human-like perception, reasoning, and action capabilities, and can seamlessly interact with the graphical user interface (GUI).
UI-TARS is the next-generation native GUI agent model with human-like perception, reasoning, and action capabilities, enabling seamless interaction with the graphical user interface (GUI).
prithivMLmods
Raptor-X5-UIGEN is a large language model based on the Qwen 2.5 14B multimodal architecture, specializing in UI design, minimalist coding, and content-intensive development, with enhanced reasoning capabilities and structured response generation.
UIGEN-T1.5 is an advanced Transformer UI generation model fine-tuned from Qwen2.5-Coder-32B-Instruct, specifically designed for generating modern and unique front-end user interfaces.
An MCP server that provides shadcn/ui component integration for AI workflows, supporting React, Svelte, and Vue frameworks. It includes functions for accessing component source code, examples, and metadata.
Changesets is an automated tool to help manage the version control and release process of multi-package or single-package codebases.
The Magic Component Platform (MCP) is an AI-driven UI component generation tool that helps developers quickly create modern UI components through natural language descriptions and supports integration with multiple IDEs.
assistant-ui is an open-source TypeScript/React library for quickly building production-grade AI chat interfaces, providing composable UI components, streaming responses, accessibility, etc., and supporting multiple AI backends and models.
An ADB-based Android device control server that provides device management functions through the MCP protocol, supporting ADB command execution, screenshot capture, UI layout analysis, and application package management.
The AI Development Assistant MCP Server is an AI-based code development toolkit that provides functions such as code architecture generation, UI screenshot analysis, and code review, specifically designed for Cursor.
napi is a tool developed by NanoAPI, aiming to automatically analyze software architecture complexity, support extracting functional modules from the codebase, provide both CLI and UI operation methods, and be compatible with mainstream CI/CD platforms to help developers optimize architecture design.
The iOS Simulator MCP Server is a tool for interacting with the iOS simulator via the Model Context Protocol (MCP), supporting functions such as obtaining simulator information, controlling UI interactions, and checking UI elements.
The official ModelContextProtocol (MCP) server for Magic UI, providing details on implementing a rich set of UI components and effects, and supporting integration with multiple clients.
Claude Mobile is an MCP server that supports controlling Android, iOS, Desktop, and Aurora OS devices through natural language, providing unified automated operations, intelligent screenshots, permission management, and UI interaction functions.
A Go process inspection tool that provides analysis of goroutine status, memory statistics, and binary information, supporting terminal UI and HTTP API
The MCP service for Flux UI component reference, providing query functions for component documentation and examples
Zeigo web dashboard UI and API testing framework based on Playwright
An MCP server that provides automation capabilities for mobile devices, supporting interaction with mobile devices through structured UI data.
A TypeScript MCP server that provides reference information for shadcn/ui components
An MCP server implementation designed for Starwind UI, providing component development assistance tools
Data Agents is a GenAI-based data engineering agent platform that automates data workflows through multi-agent collaboration, supports multiple AI backends and n8n integration, and provides a modern UI and Docker deployment solution.
swagger-mcp is a tool for dynamically generating API tools. By parsing the JSON file of Swagger UI, it provides runtime-generated tool definitions for MCP clients. It supports multiple authentication methods and can be integrated with local or cloud-based LLMs.
UIThub MCP Server is a GitHub repository content interaction service designed for Claude, which can intelligently retrieve and analyze repository content through the API.
An MCP server that analyzes website UIs and provides precise repair instructions to AI programming assistants, addressing the problem of messy UI updates when using tools such as Copilot and Cursor.