Cool Link
[2604.05091] MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU arxiv.org

arxiv.org/abs/2604.05091