--- license_name: qwen-research license_link: https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct/blob/main/LICENSE language: - en benchmarks: - ChatDoc/OCRFlux-bench-single - ChatDoc/OCRFlux-bench-cross - ChatDoc/OCRFlux-pubtabnet-single - ChatDoc/OCRFlux-pubtabnet-cross base_model: - Qwen/Qwen2.5-VL-3B-Instruct library_name: transformers --- # OCRFlux-3B This is a preview release of the OCRFlux-3B model that's fine tuned from Qwen2.5-VL-3B-Instruct using the our private document datasets and some data from [olmOCR-mix-0225](https://huggingface.co/datasets/allenai/olmOCR-mix-0225) dataset. Quick links: - 🛠️ [Code](https://github.com/chatdoc-com/OCRFlux) ## Usage The best way to use this model is via the [OCRFlux toolkit](https://github.com/chatdoc-com/OCRFlux). The toolkit comes with an efficient inference setup via vllm that can handle millions of documents at scale.