Memorization Dynamics in Knowledge Distillation for Language Models Paper • 2601.15394 • Published Jan 21 • 3
Memorization Dynamics in Knowledge Distillation for Language Models Paper • 2601.15394 • Published Jan 21 • 3
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 250