August 27, 2024
Novel Token-Level Recurrent Routing for Enhanced Mixture-of-Experts Performance
Ethan Pedicir, Lucas Miller, Liam Robinson, et al.