October 14, 2024
Optimizing Human-Controlled Preference Alignment in Large Language Models via Dense T...
Leopold Farmer, Vincenzo Rosales, Olivier Anderton, et al.