Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs Paper β’ 2506.17080 β’ Published Jun 20, 2025 β’ 6
deepseek-ai/DeepSeek-V3.2-Speciale Text Generation β’ 685B β’ Updated Dec 1, 2025 β’ 26.7k β’ 640
view article Article Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained β Whatβs Really Changing in Transformers? Apr 4, 2025 β’ 16
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) Jan 19, 2025 β’ 40
Running on CPU Upgrade 186 LLM Hallucination Leaderboard π 186 View and filter LLM hallucination leaderboard
intfloat/multilingual-e5-large-instruct Feature Extraction β’ 0.6B β’ Updated Jul 10, 2025 β’ 1.34M β’ β’ 598