MAVIS is a mathematical visual instruction tuning model designed for multimodal large language models (MLLMs). It enhances MLLMs' capabilities in visual mathematical problem-solving by improving visual encoding of mathematical graphs, graph-language alignment, and mathematical reasoning skills. The model includes two newly curated datasets, a mathematical visual encoder, and a mathematical MLLM, achieving leading performance in the MathVerse benchmark test through a three-phase training paradigm.