Four Mac Studios Overcome Cloud Clusters! Apple Teams Up with LM Studio to Run Trillion-Parameter Large Models Locally

AIbase基地

Published inAI News · 4 min read · Jun 22, 2026

At the recently concluded WWDC2026, LM Studio and Apple delivered a remarkable technical demonstration—successfully running Moonshot's 10-trillion parameter large model, Kimi K2.6, using only four Mac Studios in a cluster. This achievement shattered the conventional belief that "trillion-parameter models must rely on cloud GPU clusters," making it a reality for consumer-grade hardware to support cutting-edge AI computing power.

Kimi K2.6 has a total parameter scale of 1 trillion, using a MoE architecture with 32 billion activated parameters. It supports long context, multimodal input, and agent task processing. During this demonstration, four Mac Studios were connected through Apple's memory sharing and interconnection technologies to form a cluster, with a total unified memory of approximately 1.5TB, sufficient to meet the inference requirements of this massive model. Previous developer tests showed that under similar configurations, Kimi K2.6 could achieve a generation speed of about 28 tokens/s, while consuming far less power than traditional GPU solutions.

Connecting directly from iPhone to local cluster, data never leaves the premises

More notably, the demonstration also showcased LM Studio's LM Link remote access feature. Users can securely remotely connect to the Mac Studio cluster from their MacBook Neo laptop or iPhone, interact in real-time with the running model, and all data and communication are processed locally without going through the cloud.

LM Link has been updated into LM Studio's Mac application and Locally AI's iOS application, supporting end-to-end encrypted connections. This design allows users to access cluster-level AI computing power at any time, even with lightweight devices, without worrying about privacy leaks. Combined with Apple's Thunderbolt 5 RDMA and other multi-device memory sharing technologies, the entire ecosystem is rapidly forming a closed-loop in AI local deployment.

This collaboration sends a clear signal: deploying trillion-parameter large models locally is no longer an unreachable lab concept but is becoming an engineering reality on developers' desks. As Apple's hardware connectivity continues to evolve, the boundaries of consumer devices carrying large-scale AI inference are expected to be further expanded.

LMStudio KimiK2.6 MacStudio AIComputingPower

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Apple and LM Studio Achieve a Breakthrough Collaboration: Four Mac Studios Successfully Run Trillion-Parameter Large Model

At WWDC, LM Studio and Apple ran Moonshot AI's Kimi K2.6 model on a cluster of four Mac Studios. The Mixture-of-Experts model has one trillion parameters, demonstrating Apple Silicon's potential for massive AI workloads.....

Jun 22, 2026

240

Amazon AI Computing Power New Strategy: Self-developed Trainium Chip May Start Spot Sales

Amazon is in talks to sell its AI chip Trainium externally, shifting from cloud-only to hardware sales for data centers, intensifying AI compute competition from cloud leasing to direct chip sales.....

Jun 22, 2026

140

Locally Run Trillion-Parameter Models: Apple and LM Studio Team Up to Unlock the Full Potential of Mac Studio

At WWDC 2026, LM Studio and Apple demoed Moonshot AI's trillion-parameter MoE model Kimi K2.6 running smoothly on a cluster of four Mac Studios. This challenges the assumption that large models require the cloud, proving consumer hardware can handle cutting-edge AI and marking a milestone for local deployment.....

Jun 22, 2026

240

Bezos Invests 400 Million Dollars to Lead Investment, British AI Unicorn CuspAI's Valuation Soars to 2.6 Billion Dollars

Bezos personally invested 400 million dollars, and the UK AI startup CuspAI saw its valuation surge to 2.6 billion dollars. The company uses generative AI in materials science, rapidly simulating new materials on demand with its 'reverse design' technology, disrupting traditional R&D models.

Jun 17, 2026

400

NVIDIA Joins the AI Debt Boom, Massive Financing May Intensify the Global Computing Power Arms Race

Nvidia plans to issue at least $20 billion in bonds across seven tranches with maturities from 2 to 30 years, offering yields up to 0.9 percentage points above U.S. Treasuries for the longest term. Underwritten by JPMorgan and other banks, proceeds will fund general corporate purposes, including AI computing expansion, joining a debt issuance trend.....

Jun 16, 2026

390

Ultra-Fast Programming Experience: Kimi K2.7 Code High-Speed Version Now Officially Launched

On June 15, Moonshot AI launched the Kimi K2.7 Code model high-speed version, available to Beta program members, API developers, and commercial users. This version maintains the original model logic but boosts output speed by 5-6 times through technical optimization, notably enhancing response efficiency for short-context tasks, marking a new acceleration in AI coding tools.....

Jun 16, 2026

320

Kimi K2 Series Model API Discontinued, Users Are Advised to Migrate to the New Version

Moonshot AI officially announced that the API for the Kimi K2 series model will cease maintenance on May 25th. Users are required to migrate to the latest model, kimi-k2.6, to obtain ongoing support and enhanced multimodal capabilities. The K2 series includes multiple versions, which were renowned for their trillions of parameters since their release in July of last year and have now reached the end of their lifecycle.

May 26, 2026

490

Cursor Releases Composer 2.5 Encoding Model: Competing with GPT-5.5 and Opus 4.7 at Extremely Low Cost

Cursor released a major upgrade to its AI coding model, Composer 2.5, built on Moonshot AI's open-source Kimi K2.5. With 25 times the training scale of its predecessor and 85% of computing resources allocated to additional training and reinforcement learning, it achieves a breakthrough in core performance, significantly enhancing AI programming efficiency and cost-effectiveness.....

May 19, 2026

430

Saying Goodbye to AI That Only Chats: Bai Ling Large Model Opens Source Ring-2.6-1T Focuses on Real Complex Task Loops

Bailing Large Model open-sources its trillion-parameter flagship model, Ring-2.6-1T, addressing execution deficiencies in real production environments. It shifts to end-to-end Agent workflows, software engineering, and scientific analysis tasks. Three breakthroughs: enhanced Agent execution, achieving state-of-the-art open-source performance on PinchBench and ClawEval benchmarks.....

May 15, 2026

750

Ant Bailing Launches the Ring-2.6-1T Trillion-Level Thinking Model with Customizable Inference Intensity

Ant BaiLing releases the trillion-scale flagship reasoning model Ring-2.6-1T, designed for complex scenarios like Agent workflows, engineering development, and scientific analysis. It features an adjustable Reasoning Effort mechanism, breaking the fixed ratio between reasoning power and resource consumption to balance cost and efficiency. Offers high and xhigh reasoning modes, with high mode optimized for high-frequency Agent collaboration, ensur....

May 9, 2026

2.3k

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Four Mac Studios Overcome Cloud Clusters! Apple Teams Up with LM Studio to Run Trillion-Parameter Large Models Locally

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Apple and LM Studio Achieve a Breakthrough Collaboration: Four Mac Studios Successfully Run Trillion-Parameter Large Model

Amazon AI Computing Power New Strategy: Self-developed Trainium Chip May Start Spot Sales

Locally Run Trillion-Parameter Models: Apple and LM Studio Team Up to Unlock the Full Potential of Mac Studio

Bezos Invests 400 Million Dollars to Lead Investment, British AI Unicorn CuspAI's Valuation Soars to 2.6 Billion Dollars

NVIDIA Joins the AI Debt Boom, Massive Financing May Intensify the Global Computing Power Arms Race

Ultra-Fast Programming Experience: Kimi K2.7 Code High-Speed Version Now Officially Launched

Kimi K2 Series Model API Discontinued, Users Are Advised to Migrate to the New Version

Cursor Releases Composer 2.5 Encoding Model: Competing with GPT-5.5 and Opus 4.7 at Extremely Low Cost

Saying Goodbye to AI That Only Chats: Bai Ling Large Model Opens Source Ring-2.6-1T Focuses on Real Complex Task Loops

Ant Bailing Launches the Ring-2.6-1T Trillion-Level Thinking Model with Customizable Inference Intensity

AI News Recommendations

Apple and LM Studio Achieve a Breakthrough Collaboration: Four Mac Studios Successfully Run Trillion-Parameter Large Model

Amazon AI Computing Power New Strategy: Self-developed Trainium Chip May Start Spot Sales

Locally Run Trillion-Parameter Models: Apple and LM Studio Team Up to Unlock the Full Potential of Mac Studio

Bezos Invests 400 Million Dollars to Lead Investment, British AI Unicorn CuspAI's Valuation Soars to 2.6 Billion Dollars

NVIDIA Joins the AI Debt Boom, Massive Financing May Intensify the Global Computing Power Arms Race

Ultra-Fast Programming Experience: Kimi K2.7 Code High-Speed Version Now Officially Launched

Kimi K2 Series Model API Discontinued, Users Are Advised to Migrate to the New Version

Cursor Releases Composer 2.5 Encoding Model: Competing with GPT-5.5 and Opus 4.7 at Extremely Low Cost

Saying Goodbye to AI That Only Chats: Bai Ling Large Model Opens Source Ring-2.6-1T Focuses on Real Complex Task Loops

Ant Bailing Launches the Ring-2.6-1T Trillion-Level Thinking Model with Customizable Inference Intensity