ICML 2024 tutorial for project overview
SSRN paper: https://ssrn.com/abstract=5250617 (last updated April 2024)
(arxiv link is deprecated)
Authors: Zeyuan Allen-Zhu and Yuanzhi Li
a short version live-streamed in ICML 2024 tutorial; longer video under plan.
Twitter link for discussions: click here
@inproceedings{AL2024-knowledge3,
author = {{Allen-Zhu}, Zeyuan and Li, Yuanzhi},
title = {{Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws}},
booktitle = {Proceedings of the 13th International Conference on Learning Representations},
series = {ICLR~'25},
month = apr,
year = 2025,
note = {Full version available at \url{https://ssrn.com/abstract=5250617}}
}