Update README.md
Browse files
README.md
CHANGED
|
@@ -22,21 +22,34 @@ UGround is a storng GUI visual grounding model trained with a simple recipe. Che
|
|
| 22 |
|
| 23 |
|
| 24 |
- [x] Model Weights
|
|
|
|
| 25 |
- [ ] Code
|
| 26 |
- [ ] Inference Code of UGround
|
| 27 |
- [x] Offline Experiments
|
| 28 |
- [x] Screenspot (along with referring expressions generated by GPT-4/4o)
|
| 29 |
- [x] Multimodal-Mind2Web
|
| 30 |
- [x] OmniAct
|
|
|
|
| 31 |
- [ ] Online Experiments
|
| 32 |
-
- [ ] Mind2Web-Live
|
| 33 |
-
- [ ] AndroidWorld
|
| 34 |
-
- [ ] Data
|
| 35 |
- [ ] Data Examples
|
| 36 |
- [ ] Data Construction Scripts
|
| 37 |
-
- [ ] Guidance of Open-source Data
|
|
|
|
| 38 |
- [x] Online Demo (HF Spaces)
|
| 39 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 40 |
|
| 41 |
|
| 42 |
|
|
@@ -111,13 +124,12 @@ messages = format_openai_template(description, base64_image)
|
|
| 111 |
completion = await client.chat.completions.create(
|
| 112 |
model=args.model_path,
|
| 113 |
messages=messages,
|
| 114 |
-
temperature=0
|
| 115 |
)
|
| 116 |
|
| 117 |
-
|
| 118 |
-
|
| 119 |
-
|
| 120 |
|
|
|
|
| 121 |
|
| 122 |
|
| 123 |

|
|
|
|
| 22 |
|
| 23 |
|
| 24 |
- [x] Model Weights
|
| 25 |
+
- [x] Qwen2-VL-based V1
|
| 26 |
- [ ] Code
|
| 27 |
- [ ] Inference Code of UGround
|
| 28 |
- [x] Offline Experiments
|
| 29 |
- [x] Screenspot (along with referring expressions generated by GPT-4/4o)
|
| 30 |
- [x] Multimodal-Mind2Web
|
| 31 |
- [x] OmniAct
|
| 32 |
+
- [ ] Android Control
|
| 33 |
- [ ] Online Experiments
|
| 34 |
+
- [ ] Mind2Web-Live-SeeAct-V
|
| 35 |
+
- [ ] AndroidWorld-SeeAct-V
|
| 36 |
+
- [ ] Data-V1
|
| 37 |
- [ ] Data Examples
|
| 38 |
- [ ] Data Construction Scripts
|
| 39 |
+
- [ ] Guidance of Open-source Data
|
| 40 |
+
- [ ] Data-V1.1
|
| 41 |
- [x] Online Demo (HF Spaces)
|
| 42 |
|
| 43 |
+
## Models
|
| 44 |
+
|
| 45 |
+
Initial UGround-V1:
|
| 46 |
+
UGround-V1-2B (Qwen2-VL): https://huggingface.co/osunlp/UGround-V1-2B
|
| 47 |
+
UGround-V1-7B (Qwen2-VL): https://huggingface.co/osunlp/UGround-V1-7B
|
| 48 |
+
UGround-V1-72B (Qwen2-VL): Coming Soon
|
| 49 |
+
UGround-V1.1-2B (Qwen2-VL): Coming Soon
|
| 50 |
+
UGround-V1.1-7B (Qwen2-VL): Coming Soon
|
| 51 |
+
UGround-V1.1-72B (Qwen2-VL): Coming Soon
|
| 52 |
+
|
| 53 |
|
| 54 |
|
| 55 |
|
|
|
|
| 124 |
completion = await client.chat.completions.create(
|
| 125 |
model=args.model_path,
|
| 126 |
messages=messages,
|
| 127 |
+
temperature=0 # Remember to set temperature to ZERO!
|
| 128 |
)
|
| 129 |
|
| 130 |
+
# The output will be in the range of [0,999), which is compatible with the original Qwen2-VL
|
|
|
|
|
|
|
| 131 |
|
| 132 |
+
```
|
| 133 |
|
| 134 |
|
| 135 |

|