Model Card

  • Model Name: Food Type Image Detection Vision Transformer
  • Original Model: Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224.
  • Model Type: Image Classification
  • Model Architecture: Vision Transformer (ViT)
  • Fine-tuning:
    • Fine-tuned on the Food Image Classification Dataset, using 12 of its 35 food varieties
    • Optimizer: AdamW
    • Epochs: 20
  • Model Performance: Achieved 96.23% accuracy across all food varieties in the Food Image Classification Dataset
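
The card does not show how to run the classifier, so the following is a minimal sketch, assuming the checkpoint loads through the standard `transformers` image-classification pipeline under the repository id `ewanlong/food_type_image_detection`. The helper names `load_classifier` and `best_label` and the image path `food.jpg` are illustrative and not part of the original card.

```python
from typing import Dict, List

# Repository id as it appears on the Hub page for this model.
MODEL_ID = "ewanlong/food_type_image_detection"


def load_classifier(model_id: str = MODEL_ID):
    """Build an image-classification pipeline (downloads weights on first use)."""
    # Imported lazily so best_label() stays usable without transformers installed.
    from transformers import pipeline

    return pipeline("image-classification", model=model_id)


def best_label(predictions: List[Dict]) -> str:
    """Return the label with the highest score from pipeline-style output.

    The pipeline returns a list of {"label": str, "score": float} dicts.
    """
    if not predictions:
        raise ValueError("no predictions to choose from")
    return max(predictions, key=lambda p: p["score"])["label"]


if __name__ == "__main__":
    classifier = load_classifier()
    # "food.jpg" is a hypothetical local image path; replace it with your own file.
    predictions = classifier("food.jpg")
    print(best_label(predictions))
```

The pipeline handles resizing and normalisation to the 224x224 input the ViT backbone expects, so no manual preprocessing should be needed in this sketch.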
  • Weights Format: Safetensors
  • Model Size: 85.8M parameters
  • Tensor Type: F32
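
The parameter count and tensor type together give a rough estimate of how much memory the weights occupy. The sketch below is back-of-the-envelope arithmetic only: it assumes every parameter is stored as a 4-byte F32 value and ignores file headers, activations, and optimizer state. The function name `weight_size_mb` is illustrative.

```python
PARAM_COUNT = 85_800_000  # 85.8M params, as reported on the card (rounded)
BYTES_PER_F32 = 4         # a 32-bit float occupies 4 bytes


def weight_size_mb(param_count: int, bytes_per_param: int = BYTES_PER_F32) -> float:
    """Approximate size of the raw weights in megabytes (1 MB = 10**6 bytes)."""
    return param_count * bytes_per_param / 1_000_000


if __name__ == "__main__":
    print(f"~{weight_size_mb(PARAM_COUNT):.1f} MB of F32 weights")
```

Under these assumptions the weights come to roughly 343 MB.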