reinforcement-learning
THU-KEG/LongWriter-Zero-32B Text Generation • 33B • Updated Jul 3 • 57 • • 110
Datasets for general finetuning
argilla/distilabel-capybara-dpo-7k-binarized Viewer • Updated Jul 16, 2024 • 7.56k • 2.43k • 182
Locutusque/function-calling-chatml Viewer • Updated Jul 16, 2024 • 113k • 216 • 174
pints-ai/Expository-Prose-V1 Viewer • Updated Aug 12, 2024 • 6.67M • 194 • 19