Ngodi
mrngodi
ยท
AI & ML interests
RNN, CNN, LLM, LVM,
Recent Activity
liked
a model
about 5 hours ago
Qwen/Qwen3-Coder-480B-A35B-Instruct
published
a model
10 months ago
mrngodi/CMT_tinyllama_1.1B-Chat-v1
replied to
JustinLin610's
post
about 1 year ago
Finally, Qwen1.5-110B is out! With weights and demo!
Blog: https://qwenlm.github.io/blog/qwen1.5-110b/
Demo: https://huggingface.co/spaces/Qwen/Qwen1.5-110B-Chat-demo
Base: https://huggingface.co/Qwen/Qwen1.5-110B
Chat: https://huggingface.co/Qwen/Qwen1.5-110B-Chat
This model has some specific features:
* GQA
* 32K token context length
* Multilingual support
We feel good about its performance on benchmarks, including those for base models and chat models, but we still need more of your testing and feedback to help us know its capabilities and limitations!
Additionally, the base model has not learned chatml tokens. Yeah if you use chatml format, you need to be careful about it!
Enjoy and stay tuned for Qwen2!