A fashion object detection model fine-tuned based on YOLOS-tiny, specifically designed to detect 7 categories of fashion items such as bags, bottoms, dresses, etc. Trained on the ModaNet and Fashionpedia datasets, achieving an mAP of 0.697.
Computer Vision
TransformersEnglish