osunlp
/

UGround-V1-7B

Image-Text-to-Text

text-generation-inference

Model card Files Files and versions

BoyuNLP commited on Jan 3

Commit

8788a59

·

verified ·

1 Parent(s): 3fd987c

Update README.md

Files changed (1) hide show

README.md +20 -8

README.md CHANGED Viewed

@@ -22,21 +22,34 @@ UGround is a storng GUI visual grounding model trained with a simple recipe. Che
 - [x] Model Weights
 - [ ] Code
   - [ ] Inference Code of UGround
   - [x] Offline Experiments
     - [x] Screenspot (along with referring expressions generated by GPT-4/4o)
     - [x] Multimodal-Mind2Web
     - [x] OmniAct
   - [ ] Online Experiments
-    - [ ] Mind2Web-Live
-    - [ ] AndroidWorld
-- [ ] Data
   - [ ] Data Examples
   - [ ] Data Construction Scripts
-  - [ ] Guidance of Open-source Data
 - [x] Online Demo (HF Spaces)
@@ -111,13 +124,12 @@ messages = format_openai_template(description, base64_image)
 completion = await client.chat.completions.create(
     model=args.model_path,
     messages=messages,
-    temperature=0
 )
-```
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6500870f1e14749e84f8f887/u5bXFxxAWCXthyXWyZkM4.png)

 - [x] Model Weights
+- [x] Qwen2-VL-based V1
 - [ ] Code
   - [ ] Inference Code of UGround
   - [x] Offline Experiments
     - [x] Screenspot (along with referring expressions generated by GPT-4/4o)
     - [x] Multimodal-Mind2Web
     - [x] OmniAct
+    - [ ] Android Control
   - [ ] Online Experiments
+    - [ ] Mind2Web-Live-SeeAct-V
+    - [ ] AndroidWorld-SeeAct-V
+- [ ] Data-V1
   - [ ] Data Examples
   - [ ] Data Construction Scripts
+  - [ ] Guidance of Open-source Data
+- [ ] Data-V1.1
 - [x] Online Demo (HF Spaces)
+## Models
+Initial UGround-V1:
+UGround-V1-2B (Qwen2-VL): https://huggingface.co/osunlp/UGround-V1-2B
+UGround-V1-7B (Qwen2-VL): https://huggingface.co/osunlp/UGround-V1-7B
+UGround-V1-72B (Qwen2-VL): Coming Soon
+UGround-V1.1-2B (Qwen2-VL): Coming Soon
+UGround-V1.1-7B (Qwen2-VL): Coming Soon
+UGround-V1.1-72B (Qwen2-VL): Coming Soon
 completion = await client.chat.completions.create(
     model=args.model_path,
     messages=messages,
+    temperature=0  # Remember to set temperature to ZERO!
 )
+# The output will be in the range of [0,999), which is compatible with the original Qwen2-VL
+```
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6500870f1e14749e84f8f887/u5bXFxxAWCXthyXWyZkM4.png)