DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels Paper • 2602.11715 • Published 10 days ago • 5 • 3
Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm Paper • 2602.11543 • Published 10 days ago • 4 • 4
NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models Paper • 2602.06694 • Published 15 days ago • 15 • 5
SimpleGPT: Improving GPT via A Simple Normalization Strategy Paper • 2602.01212 • Published 20 days ago • 3 • 4
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published Jan 20 • 47 • 5