Meituan Launches Native Multimodal LongCat-Next: Visual and Speech Achieve Bottom-Level Unification
Meituan launches LongCat-Next, a native multimodal AI model that uses DiNA technology to unify images, audio, and text into discrete tokens, enabling deep integration of multimodal modeling for enhanced perception of the physical world.....