TiTan Collection Smaller models, fine tuned on generating titles and tags. • 9 items • Updated Sep 16 • 2
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 175
Taiwan-pretrain-llm-zh_tw-corpus Collection 本清冊收集用於訓練 繁體中文資料的資料集。特別適合需要自行訓練語言模型者使用 • 8 items • Updated Mar 6, 2024 • 6
Extending Context Window of Large Language Models via Positional Interpolation Paper • 2306.15595 • Published Jun 27, 2023 • 53