richardr1126's picture
Upload ONNX optimized RoBERTa model with quantization
d7c0836 verified
metadata
license: mit
tags:
  - text-classification
  - onnx
  - quantized
  - roberta
  - tweet-topic
base_model: cardiffnlp/tweet-topic-21-multi

This is an ONNX-optimized version of cardiffnlp/tweet-topic-21-multi for efficient inference.

Model Description

This repository contains:

  • model.onnx: Regular ONNX exported model
  • model_quantized.onnx: INT8 dynamically quantized model for faster inference with minimal accuracy loss

The model is optimized for zero-shot classification tasks across multiple languages.

Acknowledgments

This ONNX optimization was created for efficient deployment in production environments. Special thanks to the original model authors and the Hugging Face Optimum team.