September 24, 2024
Optimizing Knowledge Distillation in Large Language Models via Recursive Multi-Modal...
Henry McKinleigh, Jacob Mcallister, Oliver Johansson, et al.