Tutorial II has begun (Part 4.1a, 4.1b, 4.2)
Part 4.2 is currently released as a fully reproducible code package, with all technical details in the accompanying YouTube talk.
A paper PDF may follow if time permits; please cite using the BibTeX below.
Author: Zeyuan Allen-Zhu
Code release for Part 4.2: https://github.com/facebookresearch/PhysicsLM4
Model release for Part 4.2: Hugging Face (only for 16 Transformer models)
@misc{Allen2025-resonate,
title = {{Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality}},
author = {{Allen-Zhu}, Zeyuan},
year = {2025},
url = {https://physics.allen-zhu.com/part-4-architecture-design/part-4-2},
note = {Code released at \url{https://github.com/facebookresearch/PhysicsLM4}},
}