OpenGVLab
/

InternVL-Chat-V1-2

Image-Text-to-Text

feature-extraction

Model card Files Files and versions

Metrics Training metrics Community

czczup commited on Feb 13, 2024

Commit

f54d62f

·

verified ·

1 Parent(s): 6f600ec

Update README.md

Files changed (1) hide show

README.md +0 -4

README.md CHANGED Viewed

@@ -17,10 +17,6 @@ datasets:
 InternVL scales up the ViT to _**6B parameters**_ and aligns it with LLM.
-It is _**the largest open-source vision/vision-language foundation model (14B)**_ to date, achieving _**32 state-of-the-art**_ performances on a wide range of tasks such as visual perception, cross-modal retrieval, multimodal dialogue, etc.
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/64119264f0f81eb569e0d569/4SynvLt2qH8JXFQVI_fmv.png)
 ## Model Details
 - **Model Type:** vision large language model, multimodal chatbot
 - **Model Stats:**

 InternVL scales up the ViT to _**6B parameters**_ and aligns it with LLM.
 ## Model Details
 - **Model Type:** vision large language model, multimodal chatbot
 - **Model Stats:**