LLM Training Datasets Collection A collection of datasets for training LLMs. • 125 items • Updated Dec 2, 2025 • 28
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 54
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26, 2025 • 177
LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes Paper • 2311.13384 • Published Nov 22, 2023 • 53