ARC-AGI is a dataset designed to test whether an artificial intelligence system possesses the ability of abstract and reasoning like a human. It consists of 400 training tasks and 400 evaluation tasks, each stored in JSON format and including input-output pairs. This dataset can be used as a benchmark for artificial intelligence, program synthesis, or psychological intelligence testing.