Three document layout analysis models (nano, small, medium) based on the YOLOv11 architecture, fine-tuned on the DocLayNet dataset, can accurately detect 11 types of layout elements such as text, tables, and charts in documents, suitable for document understanding and information extraction tasks.
Computer VisionEnglish