Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
34
73
108
Li Dong
unilm
Follow
shuyuej's profile picture
yun645's profile picture
alicntny's profile picture
50 followers
·
18 following
AI & ML interests
Language Model Pre-Training
Recent Activity
authored
a paper
about 14 hours ago
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
authored
a paper
about 14 hours ago
Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts
authored
a paper
about 14 hours ago
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge
View all activity
Organizations
unilm
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
an
article
about 21 hours ago
view article
Article
Differential Transformer V2
about 21 hours ago
•
6