OmniLMM-12B is a large multimodal model built on the EVA02-5B vision encoder and the Zephyr-7B-β language model, which are connected by a perceiver resampler layer. It is trained with a progressive curriculum learning strategy and is described as offering strong performance, trustworthy behavior, and real-time multimodal interaction.
Multimodal
Transformers
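
The resampler connection between the vision encoder and the language model can be illustrated with a small sketch. This is not OmniLMM's actual code: it assumes a single cross-attention step with no learned projections, and the names `resample`, `latents`, `DIM`, and `NUM_LATENTS` are hypothetical. It shows only the core idea, that a fixed set of query vectors attends over however many patch features the vision encoder emits, so the language model always receives the same number of visual tokens.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def resample(latents, features):
    """Cross-attend each latent query to all visual features.

    latents:  list of num_latents vectors (hypothetical learned queries)
    features: list of num_patches vectors from a vision encoder
    Returns num_latents vectors, regardless of how many patches came in.
    """
    dim = len(latents[0])
    scale = 1.0 / math.sqrt(dim)
    outputs = []
    for query in latents:
        # Scaled dot-product score between this query and every patch feature.
        scores = [scale * sum(q * k for q, k in zip(query, feat)) for feat in features]
        weights = softmax(scores)
        # The attention-weighted average of patch features becomes one output token.
        token = [sum(w * feat[i] for w, feat in zip(weights, features)) for i in range(dim)]
        outputs.append(token)
    return outputs


rng = random.Random(0)
DIM, NUM_LATENTS = 8, 4  # toy sizes, not OmniLMM's real dimensions
latents = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_LATENTS)]

# Two inputs with different numbers of patch features.
small_image = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(16)]
large_image = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(64)]

tokens_small = resample(latents, small_image)
tokens_large = resample(latents, large_image)
print(len(tokens_small), len(tokens_large))  # prints "4 4"
```

Both inputs yield four output tokens, which is the property that lets a resampler bridge an encoder with a variable-length output and a language model with a fixed visual token budget. A real resampler would add learned query, key, and value projections, multiple heads, and usually several stacked layers.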