UIUC and Tsinghua University have jointly released Magicoder, a new large language model that, with only 7 billion parameters, rivals top models in code generation. The model is fully open source: its code, weights, and data are all available. Magicoder uses the OSS-INSTRUCT method to generate diverse, realistic, and controllable coding instruction data, underscoring the importance of realistic data in instruction tuning. It performs strongly in evaluations covering Python, other programming languages, and data science libraries. Notably, on the DS-1000 benchmark, Magicoder improves by 8.3 percentage points. The release marks a significant step forward for code generation.