johannhartmann's Collections
Computer Use Models
Updated 13 days ago
ByteDance-Seed/UI-TARS-72B-DPO • Image-Text-to-Text • 73B • Updated Jan 25 • 2.27k • 147
ByteDance-Seed/UI-TARS-7B-DPO • Image-Text-to-Text • 8B • Updated Jan 25 • 1.32k • 221
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 480 • 1.7k
jadechoghari/Ferret-UI-Llama8b • Image-Text-to-Text • 8B • Updated Jan 8 • 273 • 68
microsoft/GUI-Actor-7B-Qwen2.5-VL • Image-Text-to-Text • 8B • Updated Aug 9 • 957 • 24
showlab/ShowUI-2B • Updated Mar 11 • 2.25k • 269
Zery/CUA_World_State_Model • Image-Text-to-Text • Updated Aug 7 • 9 • 4
microsoft/Fara-7B • Image-Text-to-Text • 8B • Updated 7 days ago • 31.5k • 425
Qwen/Qwen2.5-Omni-7B • Any-to-Any • 11B • Updated Apr 30 • 138k • 1.83k
Hcompany/Holo2-30B-A3B • Image-Text-to-Text • 31B • Updated 17 days ago • 1.61k • 36
Hcompany/Holo2-4B • Image-Text-to-Text • 4B • Updated 24 days ago • 2.71k • 16
Hcompany/Holo2-8B • Image-Text-to-Text • 9B • Updated 24 days ago • 606 • 15
AskUI/PTA-1 • Image-Text-to-Text • 0.3B • Updated Nov 28, 2024 • 799 • 97
OS-Copilot/OS-Atlas-Base-7B • Image-Text-to-Text • 8B • Updated Nov 19, 2024 • 964 • 42
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Feb 6 • 1.55M • 1.24k
xlangai/OpenCUA-72B • Image-Text-to-Text • 73B • Updated 26 days ago • 213 • 3
xlangai/OpenCUA-32B • Image-Text-to-Text • 33B • Updated Aug 18 • 569 • 25
xlangai/OpenCUA-7B • Image-Text-to-Text • 8B • Updated 25 days ago • 22.5k • 21
xlangai/Jedi-7B-1080p • Image-Text-to-Text • 8B • Updated Jun 18 • 111 • 29
xlangai/Jedi-3B-1080p • Image-Text-to-Text • 4B • Updated Jun 18 • 117 • 17
Qwen/Qwen3-VL-8B-Instruct • Image-Text-to-Text • 9B • Updated Oct 15 • 1.89M • 522
Qwen/Qwen3-VL-8B-Thinking • Image-Text-to-Text • 9B • Updated 12 days ago • 225k • 151