Physics of Language Models: Part 2.2,
How to Learn From Mistakes on Grade-School Math Problems
arXiv paper: https://arxiv.org/abs/2408.16293
Authors: Tian Ye, Zicheng Xu, Yuanzhi Li and Zeyuan Allen-Zhu
Code Release for iGSM data generator (and the box-over-box example): https://github.com/facebookresearch/iGSM
Slide show (best viewed on a computer)
Slide show (best viewed on a computer)
@inproceedings{YXLA2024-gsm2,
author = {Ye, Tian and Xu, Zicheng and Li, Yuanzhi and {Allen-Zhu}, Zeyuan},
title = {{Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems}},
booktitle = {Proceedings of the 13th International Conference on Learning Representations},
series = {ICLR~'25},
month = apr,
year = 2025,
note = {Full version available at \url{http://arxiv.org/abs/2408.16293}}
}