Ashley
TRASHLEY
ยท
AI & ML interests
None yet
Recent Activity
replied to
prithivMLmods's
post
about 14 hours ago
Try CUA GUI Operator ๐ฅ๏ธ Space, the demo of some interesting multimodal ultra-compact Computer Use Agent (CUA) models in a single app, including Fara-7B, UI-TARS-1.5-7B, and Holo models, to perform GUI localization tasks.
โ CUA-GUI-Operator [Demo]: https://huggingface.co/spaces/prithivMLmods/CUA-GUI-Operator
โ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
Other related multimodal spaces
โ Qwen3-VL: https://huggingface.co/spaces/prithivMLmods/Qwen3-VL-HF-Demo
โ Multimodal-VLM-v1.0: https://huggingface.co/spaces/prithivMLmods/Multimodal-VLM-v1.0
โ Vision-to-VibeVoice-en: https://huggingface.co/spaces/prithivMLmods/Vision-to-VibeVoice-en
I have planned to add Chrome sandboxes to streamline it and turn it into a browser based CUA multimodal tool, which will be added to the same space soon.
To know more about it, visit the app page or the respective model page!
upvoted
a
paper
25 days ago
Intelligence per Watt: Measuring Intelligence Efficiency of Local AI
liked
a model
4 months ago
ByteDance-Seed/Tar-1.5B