Physics of Language Models: Part 1
Learning Hierarchical Language Structures

arXiv paper: https://arxiv.org/abs/2305.13673 (last updated June 2024)
Authors: Zeyuan Allen-Zhu and Yuanzhi Li

@article{AL2023-cfg,
  author = {{Allen-Zhu}, Zeyuan and Li, Yuanzhi},
  title = {{Physics of Language Models: Part 1, Learning Hierarchical Language Structures}},
  journal = {ArXiv e-prints},
  year = 2023,
  month = may,
  volume = {abs/2305.13673},
  note = {Full version available at \url{http://arxiv.org/abs/2305.13673}}
}