fieryTransition's Collections
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Paper
• 2403.05530
• Published
• 65
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Paper
• 2404.00399
• Published
• 42
Rho-1: Not All Tokens Are What You Need
Paper
• 2404.07965
• Published
• 94
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper
• 2406.08464
• Published
• 71
Building Math Agents with Multi-Turn Iterative Preference Learning
Paper
• 2409.02392
• Published
• 16
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale
Paper
• 2505.03005
• Published
• 36
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds
Paper
• 2508.14879
• Published
• 69
Stronger Normalization-Free Transformers
Paper
• 2512.10938
• Published
• 21
DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Paper
• 2210.07558
• Published
• 1
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper
• 2512.20605
• Published
• 62