Zhangzhe's Blog
The projection of my life.
Home
Tags
Categories
Search
0%
Tensor Parallelism
Category
2024
08-21
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism