view article Article BigCodeArena: Judging code generations end to end with code executions Oct 7 • 18
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9 • 35
Privacy-Preserving Tabular Synthetic Data Generation Using TabularARGN Paper • 2508.06647 • Published Aug 8 • 16
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data Paper • 2501.12012 • Published Jan 21 • 9
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 429
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12 • 473
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism Paper • 2407.10457 • Published Jul 15, 2024 • 24
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer Paper • 2403.13570 • Published Mar 20, 2024 • 3
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 95