Physics of Language Models: Part 3.3,
Knowledge Capacity Scaling Laws
arXiv paper: https://arxiv.org/abs/2404.05405
Authors: Zeyuan Allen-Zhu and Yuanzhi Li
YouTube video: TBD
Twitter link for discussions: click here
Slide show (best viewed on a computer)
Slide show (best viewed on a computer)