distributed/optimized-gpt2-250m-convergence-test-v1 Text Generation • 0.3B • Updated Sep 24, 2024 • 3
distributed/optimized-gpt2-250m-convergence-test-v2 Text Generation • 0.3B • Updated Sep 24, 2024 • 1