Visual Language Models

updated about 7 hours ago

Collection of OpenVINO-optimized models for visual-language assistance.

  • OpenVINO/Phi-3.5-vision-instruct-int4-ov

    Image-Text-to-Text • Updated Jul 22, 2025 • 1.07k

  • OpenVINO/Phi-3.5-vision-instruct-int8-ov

    Image-Text-to-Text • Updated Mar 18, 2025 • 17 • 2

  • OpenVINO/Phi-3.5-vision-instruct-fp16-ov

    Image-Text-to-Text • Updated Aug 21, 2025 • 6 • 1

  • OpenVINO/InternVL2-1B-int4-ov

    Image-Text-to-Text • Updated Jan 23, 2025 • 52

  • OpenVINO/InternVL2-1B-int8-ov

    Image-Text-to-Text • Updated Jan 23, 2025 • 6

  • OpenVINO/InternVL2-1B-fp16-ov

    Image-Text-to-Text • Updated Jan 23, 2025 • 3

  • OpenVINO/InternVL2-2B-int4-ov

    Image-Text-to-Text • Updated Jan 23, 2025 • 11

  • OpenVINO/InternVL2-2B-int8-ov

    Image-Text-to-Text • Updated Jan 23, 2025 • 4

  • OpenVINO/InternVL2-2B-fp16-ov

    Image-Text-to-Text • Updated Jan 23, 2025 • 7

  • OpenVINO/gemma-3-4b-it-int4-ov

    Image-Text-to-Text • Updated Dec 16, 2025 • 110

  • OpenVINO/gemma-3-4b-it-fp16-ov

    Image-Text-to-Text • Updated Dec 16, 2025 • 7

  • OpenVINO/gemma-3-4b-it-int8-ov

    Image-Text-to-Text • Updated Dec 15, 2025 • 7
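
The repository names in this collection appear to follow a `<model>-<precision>-ov` pattern, with `int4`, `int8`, and `fp16` variants per model. As a minimal sketch, assuming that pattern holds for every entry, the IDs can be split into base model and weight precision to see which variants are available. The helper names `split_repo_id` and `group_by_model` are hypothetical and are not part of any Hugging Face or OpenVINO API.

```python
from collections import defaultdict

# Repository IDs copied from the collection listing.
REPO_IDS = [
    "OpenVINO/Phi-3.5-vision-instruct-int4-ov",
    "OpenVINO/Phi-3.5-vision-instruct-int8-ov",
    "OpenVINO/Phi-3.5-vision-instruct-fp16-ov",
    "OpenVINO/InternVL2-1B-int4-ov",
    "OpenVINO/InternVL2-1B-int8-ov",
    "OpenVINO/InternVL2-1B-fp16-ov",
    "OpenVINO/InternVL2-2B-int4-ov",
    "OpenVINO/InternVL2-2B-int8-ov",
    "OpenVINO/InternVL2-2B-fp16-ov",
    "OpenVINO/gemma-3-4b-it-int4-ov",
    "OpenVINO/gemma-3-4b-it-fp16-ov",
    "OpenVINO/gemma-3-4b-it-int8-ov",
]


def split_repo_id(repo_id):
    """Split 'Org/<model>-<precision>-ov' into (model, precision).

    Assumes the naming pattern seen in the listing; raises ValueError
    if the name does not end in '-ov'.
    """
    name = repo_id.split("/", 1)[1]           # drop the organization prefix
    stem, suffix = name.rsplit("-", 1)        # peel off the trailing 'ov'
    if suffix != "ov":
        raise ValueError(f"unexpected repository name: {repo_id}")
    model, precision = stem.rsplit("-", 1)    # peel off the precision tag
    return model, precision


def group_by_model(repo_ids):
    """Map each base model name to a sorted list of its precisions."""
    groups = defaultdict(list)
    for repo_id in repo_ids:
        model, precision = split_repo_id(repo_id)
        groups[model].append(precision)
    return {model: sorted(precisions) for model, precisions in groups.items()}


if __name__ == "__main__":
    for model, precisions in group_by_model(REPO_IDS).items():
        print(f"{model}: {', '.join(precisions)}")
```

Splitting from the right keeps hyphens inside model names such as `gemma-3-4b-it` intact, which a left-to-right split would break.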