vision-transformer-classifier
Image classification pipeline for the CIFAR-100 dataset using a Vision Transformer (ViT) model, as described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale" (Dosovitskiy et al., 2021).
Created: 2025-05-30T13:39:05
Updated: 2025-05-30T13:44:18
Stars: 0
Stars increase: 0
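
The description names a ViT applied to CIFAR-100. In the cited paper, the model's first step is to split each image into fixed-size, non-overlapping patches and flatten them into a token sequence. The following is a minimal pure-Python sketch of that step only, not this repository's code. The function name `patchify` and the patch size of 4 are assumptions chosen for illustration; CIFAR-100 images are 32 x 32 RGB.

```python
def patchify(image, patch_size):
    """Split an H x W x C image (nested lists) into flattened, non-overlapping patches.

    Hypothetical helper for illustration; patches are returned in row-major order.
    """
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")
    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten one patch_size x patch_size x C block into a single vector.
            patch = [
                channel
                for row in image[top:top + patch_size]
                for pixel in row[left:left + patch_size]
                for channel in pixel
            ]
            patches.append(patch)
    return patches


# A blank 32 x 32 RGB image standing in for one CIFAR-100 sample.
image = [[[0.0, 0.0, 0.0] for _ in range(32)] for _ in range(32)]

# patch_size=4 is an assumed value, not taken from the repository.
patches = patchify(image, patch_size=4)
num_patches = len(patches)   # (32 / 4) ** 2 patches
patch_dim = len(patches[0])  # 4 * 4 * 3 values per patch
```

In a full ViT, each flattened patch would then be linearly projected to the model dimension, a class token prepended, and position embeddings added before the Transformer encoder. A smaller patch size yields a longer sequence, which is one reason small patches are a common choice for 32 x 32 inputs.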