AR glasses use the MLLM-SC framework for semantic processing, generating attention heatmaps in about 10 ms to prioritize key targets and suppress background data. The system filters for task-relevant multimodal data, optimizes transmission, frees roughly 30% of 6G bandwidth, and improves device-edge-server collaboration.
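The reported mechanism is attention-guided filtering: keep the regions the model attends to and drop the rest before transmission. Below is a minimal sketch under that assumption, with a 70% keep ratio mirroring the ~30% bandwidth saving; the function, shapes, and threshold are illustrative, not the framework's actual code.

```python
import numpy as np

def select_regions(frame: np.ndarray, attention: np.ndarray, keep_ratio: float = 0.7):
    """Zero out low-attention pixels so only salient regions are transmitted."""
    threshold = np.quantile(attention, 1.0 - keep_ratio)
    mask = attention >= threshold
    return frame * mask[..., None]  # broadcast the 2-D mask over RGB channels

frame = np.random.rand(64, 64, 3)   # stand-in camera frame
attention = np.random.rand(64, 64)  # stand-in MLLM attention heatmap
reduced = select_regions(frame, attention)
print(f"pixels kept: {np.count_nonzero(reduced.any(-1))}/{64 * 64}")
```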
Warner Bros. Discovery uses AWS Graviton processors and Amazon SageMaker AI instances to optimize its AI/ML infrastructure, achieving cost savings and performance improvements for personalized content experiences.
Kimi Linear, a hybrid linear-attention architecture from Moonshot AI, outperforms traditional attention in both long- and short-range processing and in reinforcement learning settings. Its Kimi Delta Attention (KDA) adds gating to improve RNN-style memory efficiency, and the model interleaves three KDA layers with one Multi-head Latent Attention (MLA) layer.
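A toy sketch of that 3:1 interleaving, assuming a simple repeating pattern (the function and layer names are illustrative, not Moonshot AI's code):

```python
def build_layer_schedule(num_layers: int, kda_per_mla: int = 3) -> list[str]:
    """Return a layer-type schedule with kda_per_mla KDA layers per MLA layer."""
    return [
        "MLA" if (i + 1) % (kda_per_mla + 1) == 0 else "KDA"
        for i in range(num_layers)
    ]

# ['KDA', 'KDA', 'KDA', 'MLA', 'KDA', 'KDA', 'KDA', 'MLA']
print(build_layer_schedule(8))
```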
OpenAI is working with the estate of Dr. Martin Luther King Jr. to govern Sora's use of his likeness, pausing generations that depict him at the estate's request and strengthening protections for historical figures.
A tool designed for AI/ML model monitoring and management.
A GPU cloud service that markets itself as the world's cheapest, aimed at empowering self-hosted AI/ML development.
MLGym is a novel framework and benchmark for advancing AI research agents.
FlashMLA is a high-efficiency MLA decoding kernel optimized for Hopper GPUs, suitable for variable-length sequence services.
ai-sage
GigaChat3-10B-A1.8B is an efficient dialogue model in the GigaChat series. Built on a Mixture-of-Experts (MoE) architecture, it has 10 billion total parameters with 1.8 billion active per token, and adopts Multi-head Latent Attention (MLA) and Multi-token Prediction (MTP) to raise inference throughput and generation speed. The model is trained on 20T tokens of diverse data and supports 10 languages including Chinese, suiting dialogue scenarios that require quick responses.
mlx-community
An MLX-format conversion of Mistral AI's Ministral-3-3B-Instruct-2512 instruction-tuned model. It is a 3B-parameter large language model optimized for instruction following and dialogue tasks, with multilingual support; the MLX format lets it run efficiently on Apple Silicon devices.
An MLX-format conversion of Kimi-Linear-48B-A3B-Instruct, optimized for Apple Silicon devices such as the Mac Studio. It is a 48-billion-parameter large language model that supports instruction following and suits local inference and conversation tasks.
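Either conversion can be run locally with the mlx-lm package; a minimal sketch follows, where the repo id is an assumption based on the naming above (substitute the actual model card id):

```python
from mlx_lm import load, generate

# Repo id is an assumed example, not necessarily the exact card above.
model, tokenizer = load("mlx-community/Kimi-Linear-48B-A3B-Instruct-4bit")

messages = [{"role": "user", "content": "Explain MLA attention in one sentence."}]
prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

# generate() accepts the templated prompt and returns the completion text.
print(generate(model, tokenizer, prompt=prompt, max_tokens=128))
```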
ExaltedSlayer
Gemma 3 is a lightweight open multimodal model from Google. This version is the 12B-parameter instruction-tuned, quantization-aware-trained variant, converted to the MLX framework's MXFP4 format. It accepts text and image input, generates text output, offers a 128K context window, and supports over 140 languages.
kyr0
An automatic speech recognition model optimized for Apple Silicon devices. Conversion to the MLX framework and FP8 quantization enable fast on-device transcription on Apple hardware. The model is fine-tuned for verbatim accuracy, making it particularly suitable for scenarios requiring high-precision transcription.
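On-device transcription with a model like this typically goes through the mlx-whisper package; a sketch, where the repo id is a stand-in assumption rather than this exact fine-tune:

```python
import mlx_whisper

# path_or_hf_repo points at an MLX-converted Whisper checkpoint;
# the repo id below is an assumed example.
result = mlx_whisper.transcribe(
    "meeting.wav",
    path_or_hf_repo="mlx-community/whisper-large-v3-mlx",
)
print(result["text"])
```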
ubergarm
A GGUF-quantized release of the ai-sage/GigaChat3-10B-A1.8B-bf16 model, offering quantization options from high-precision Q8_0 down to the extremely compressed smol-IQ1_KT to fit different hardware budgets. It supports a 32K context length, uses the MLA architecture, and is optimized for dialogue.
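A hedged sketch of loading one of these quants with llama-cpp-python; note that the _KT quant types generally target the ik_llama.cpp fork, so mainline llama.cpp bindings may only load the standard quants. The model path is an assumed local file name:

```python
from llama_cpp import Llama

llm = Llama(
    model_path="GigaChat3-10B-A1.8B-Q8_0.gguf",  # assumed local file name
    n_ctx=32768,  # the card reports a 32K context length
)
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Introduce yourself in one sentence."}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```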
This model is an 8-bit quantized version converted from allenai/Olmo-3-7B-Instruct, specifically optimized for the Apple MLX framework. It is a large language model with 7 billion parameters, supporting instruction following and dialogue tasks.
A 4-bit quantized version of VibeThinker-1.5B, optimized for Apple Silicon via the MLX framework. It is a dense language model with 1.5 billion parameters, designed specifically for mathematical reasoning and algorithmic coding problems.
GigaChat3-10B-A1.8B-base is the pre-trained base model of the GigaChat series, using a Mixture-of-Experts (MoE) architecture with 10 billion total parameters and 1.8 billion active. It integrates Multi-Head Latent Attention (MLA) and Multi-Token Prediction (MTP), giving it high inference throughput.
McG-221
An MLX-format model converted from summykai/gemma3-27b-abliterated-dpo with mlx-lm 0.28.3. It is a 27B-parameter Gemma 3 large language model fine-tuned with DPO (Direct Preference Optimization) and optimized to run efficiently on Apple Silicon via the MLX framework.
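The conversion step these cards describe maps to mlx-lm's convert() API; a sketch, with the output path and bit width as illustrative choices:

```python
from mlx_lm import convert

convert(
    "summykai/gemma3-27b-abliterated-dpo",  # source Hugging Face repo
    mlx_path="gemma3-27b-mlx-4bit",          # output directory (assumed name)
    quantize=True,                           # write a quantized MLX checkpoint
    q_bits=4,                                # 4-bit weights
)
```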
inferencerlabs
Kimi-K2-Thinking 3.825bit MLX is a quantized text-generation model. Different quantization methods yield different measured perplexities; the q3.825bit quantization reaches a perplexity of 1.256.
An MLX-format conversion of the Falcon-H1-34B-Instruct instruction-tuned model, optimized for Apple Silicon (M-series chips). The original model was converted with the mlx-lm tool into an 8-bit quantized, MLX-compatible format for efficient local inference on macOS devices.
Ali-Yaser
A fine-tuned version of meta-llama/Llama-3.3-70B-Instruct, trained on the mlabonne/FineTome-100k dataset of roughly 100k instruction samples. Fine-tuning used the Unsloth and Hugging Face TRL libraries, and the model targets English.
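A minimal sketch of that recipe using TRL's SFTTrainer directly (the card used Unsloth, which wraps a similar flow). Hyperparameters are omitted, the ShareGPT-to-messages mapping is an assumption based on FineTome-100k's published format, and a 70B model would need multi-GPU or PEFT in practice:

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("mlabonne/FineTome-100k", split="train")

# FineTome-100k uses ShareGPT-style records ({"from": ..., "value": ...});
# map them to the {"role": ..., "content": ...} messages TRL expects.
role_map = {"system": "system", "human": "user", "gpt": "assistant"}
def to_messages(row):
    return {"messages": [
        {"role": role_map[m["from"]], "content": m["value"]}
        for m in row["conversations"]
    ]}
dataset = dataset.map(to_messages, remove_columns=dataset.column_names)

trainer = SFTTrainer(
    model="meta-llama/Llama-3.3-70B-Instruct",  # base model from the card
    train_dataset=dataset,
    args=SFTConfig(output_dir="llama33-finetome"),
)
trainer.train()
```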
Leohan
A text-generation model built on the MLX library, focused on natural language processing tasks and offering developers an efficient text-generation solution.
A text-generation model implemented with the MLX library, supporting inference under multiple quantization schemes, with distributed-computing capability and efficient execution on Apple hardware.
Kimi-K2-Thinking in MLX format, converted by mlx-community from moonshotai's original model using mlx-lm 0.28.4; the conversion preserves the original model's chain-of-thought reasoning ability.
Marvis-AI
A text-to-speech model converted from Marvis-AI/marvis-tts-100m-v0.2 and optimized for the MLX framework. It uses 6-bit quantization and is tuned for Apple Silicon hardware, providing efficient speech synthesis.
Qwen3-Coder-480B-A35B-Instruct is a large code-generation model with 480 billion total parameters (35 billion active), here in an 8.5-bit quantization optimized for the MLX framework. It is designed specifically for code-generation tasks and runs efficiently on machines with sufficient memory.
catalystsec
This project produces a 4-bit quantization of the MiniMax-M2 model using the DWQ method from the mlx-lm library. The result is a lightweight version of MiniMax-M2 that significantly reduces model size while maintaining good performance.
This is a 6-bit quantized version converted from the Kimi-Linear-48B-A3B-Instruct model, optimized for the Apple MLX framework. The model retains the powerful instruction-following ability of the original model, while significantly reducing storage and computational requirements through quantization technology, making it suitable for efficient operation on Apple hardware.
An audio transcription MCP service based on MLX Whisper, supporting transcription of local files, Base64 audio, and YouTube videos, optimized for Apple M-series chips.
An MCP server for the MLflow Prompt Registry that enables access to and management of prompt templates stored in MLflow.
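For context, the registry calls such a server wraps look roughly like the following, assuming MLflow's prompt registry API in recent MLflow releases; the prompt name and template are illustrative:

```python
import mlflow

# Register a versioned prompt template (double braces mark variables).
mlflow.register_prompt(
    name="summarize",
    template="Summarize the following text in {{ num_sentences }} sentences: {{ text }}",
)

# Load version 1 back and fill in its variables.
prompt = mlflow.load_prompt("prompts:/summarize/1")
print(prompt.format(num_sentences=2, text="MLflow tracks ML experiments..."))
```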
An MCP service implementation based on the Balldontlie API, providing query functions for player, team, and game information in the NBA, NFL, and MLB.
A server project based on the Model Context Protocol (MCP) that provides access to baseball statistical data through the MLB Stats API and the pybaseball library, including data sources such as Statcast, Fangraphs, and Baseball Reference, and supports data visualization.
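A compact sketch of what one tool in such a server could look like, using the official Python MCP SDK's FastMCP helper plus pybaseball; the tool name and arguments are assumptions, not this project's actual API:

```python
from mcp.server.fastmcp import FastMCP
from pybaseball import statcast

mcp = FastMCP("baseball-stats")

@mcp.tool()
def statcast_window(start_dt: str, end_dt: str) -> str:
    """Return Statcast pitch-level data for a date range as CSV text."""
    df = statcast(start_dt=start_dt, end_dt=end_dt)
    return df.head(100).to_csv(index=False)  # cap the payload size

if __name__ == "__main__":
    mcp.run()  # stdio transport by default, as Claude Desktop expects
```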
PromptLab is an intelligent system that expands basic user queries into optimized AI system prompts via MLflow integration, providing dynamic template matching and parameter extraction.
A server project compatible with the MCP protocol, providing atomic-scale simulation functions through ASE, pymatgen, and machine learning interatomic potentials (MLIPs). It is currently under active development.
A Python-based ML model provenance management service built with FastAPI and SQLAlchemy, providing dataset management, entity tracking, activity logging, agent management, and provenance relationship tracking.
This project provides a natural language interaction interface for MLflow through the Model Context Protocol (MCP), allowing users to query and manage machine learning experiments and models in English. It includes server-side and client-side components.
This project provides MCP protocol support for the MLflow Prompt Registry, enabling retrieval and management of prompt templates from MLflow; it is mainly used to conveniently invoke preset prompts from Claude Desktop.
This project provides a Model Context Protocol (MCP) service for MLflow through a natural language interface, simplifying the management and query of machine learning experiments and models.
An MLB data service based on the MCP protocol, providing comprehensive access to baseball statistical data, including team standings, schedules, player information, etc., and supporting AI application integration.
An implementation of the MCP service for the MLflow Prompt Registry, supporting retrieval and management of prompt templates from MLflow and making it easy to invoke preset workflows from Claude Desktop.
MCP Servers is a collection of servers and services for the Model Context Protocol (MCP), aiming to ease the integration and deployment of various AI/ML models and services. The project adopts a modular architecture with standardized communication and a scalable design, and includes various server types such as weather services.
The Cloudera ML Model Control Protocol (MCP) is a Python toolkit that provides functions for integrating with the Cloudera Machine Learning platform, including services such as file management, job scheduling, model management, and experiment tracking.
A document search assistant integrated with Claude AI. It enhances Claude's document retrieval capabilities through the MCP server and supports intelligent search and explanation of documents for multiple AI/ML libraries.
This project provides interaction functions for MLflow through a natural language interface. It includes server-side and client-side components, supports querying experiments, model registration, and system information, and simplifies MLflow management operations.
An MLB data API encapsulation service based on the MCP framework, providing functions such as schedule query, game result query, team information query, and player query.
This is an MCP server based on FastAPI, providing baseball data query functions from MLB and Fangraphs through the pybaseball library, including player data, team statistics, and league leaderboards.
A secure MCP server implementation for executing controlled command-line operations, providing comprehensive security features, including command whitelisting, path validation, and execution control.
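The whitelist-plus-path-validation pattern it describes can be sketched as follows; the allowed command set, sandbox root, and helper name are illustrative assumptions, not the project's actual configuration:

```python
import shlex
import subprocess

ALLOWED_COMMANDS = {"ls", "cat", "grep"}  # assumed whitelist
ALLOWED_ROOT = "/srv/sandbox"             # assumed path jail

def run_controlled(command_line: str) -> str:
    """Run a command only if it passes whitelist and path checks."""
    argv = shlex.split(command_line)
    if not argv or argv[0] not in ALLOWED_COMMANDS:
        raise PermissionError(f"command not whitelisted: {argv[:1]}")
    for arg in argv[1:]:
        # Reject absolute paths that escape the sandbox root.
        if arg.startswith("/") and not arg.startswith(ALLOWED_ROOT):
            raise PermissionError(f"path outside sandbox: {arg}")
    result = subprocess.run(argv, capture_output=True, text=True, timeout=10)
    return result.stdout

print(run_controlled("ls ."))
```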