Meituan LongCat-Flash-Lite Launches: 4.5 Billion Activated Parameters with Performance Comparable to Large Models
Meituan's LongCat team introduces LongCat-Flash-Lite, a new model using an 'embedding expansion' paradigm to overcome MoE architecture bottlenecks. Research shows expanding embedding layers outperforms adding experts, improving Pareto frontiers and addressing diminishing returns and high communication costs.....