Reader /
Discussion

[2604.05091] MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

arxiv.org
/ pin · @ user · Ctrl+Enter
0 threads

No discussions yet

More from arxiv.org

Discover