tomg-group-umd/Gemstone-1024x28_cooldown
Text Generation
•
0.5B
•
Updated
•
10
AI security & privacy, algorithmic bias, foundations of ML
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Gemstones: A Model Suite for Multi-Faceted Scaling Laws