Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
arXiv paper: https://arxiv.org/abs/2309.14316 (last updated July 2024)
Authors: Zeyuan Allen-Zhu and Yuanzhi Li
Slide show (best viewed on a computer)
@inproceedings{AL2023-knowledge1,
author = {{Allen-Zhu}, Zeyuan and Li, Yuanzhi},
title = {{Physics of Language Models: Part 3.1, Knowledge Storage and Extraction}},
booktitle = {Proceedings of the 41st International Conference on Machine Learning},
series = {ICML~'24},
month = jul,
year = {2024},
note = {Full version available at \url{http://arxiv.org/abs/2309.14316}}
}