ServiceNow-AI/apriel_textrls130_dynsample_gspo_datamixfixed_pro7kresp12k_bs128mbs64_length Updated 2 days ago
Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation Paper • 2510.04373 • Published Oct 5
ColMate: Contrastive Late Interaction and Masked Text for Multimodal Document Retrieval Paper • 2511.00903 • Published Nov 2
Challenging Common Assumptions about Catastrophic Forgetting Paper • 2207.04543 • Published Jul 10, 2022
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models Paper • 2109.05093 • Published Sep 10, 2021 • 1
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models Paper • 2201.05966 • Published Jan 16, 2022 • 1
Unifying Autoregressive and Diffusion-Based Sequence Generation Paper • 2504.06416 • Published Apr 8 • 3
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9 • 35
Optimizing What Matters: AUC-Driven Learning for Robust Neural Retrieval Paper • 2510.00137 • Published Sep 30 • 2
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation Paper • 2509.25716 • Published Sep 30 • 3
GRAFT: GRaPH and Table Reasoning for Textual Alignment -- A Benchmark for Structured Instruction Following and Visual Reasoning Paper • 2508.15690 • Published Aug 21 • 8
Modular Techniques for Synthetic Long-Context Data Generation in Language Model Training and Evaluation Paper • 2509.01185 • Published Sep 1