μpscaling small models: Principled warm starts and hyperparameter transfer

* indicates equal contribution or alphabetic order. The arXiv version is preferred for the most up-to-date content.
