pytorch-transformer-distributed
Public
Distributed training (multi-node) of a Transformer model
collective-communication, data-parallelism, deep-learning, distributed-data-parallel, distributed-training, gradient-accumulation, machine-learning, model-parallelism, pytorch, tutorial
Created: 2023-12-08T08:52:38
Updated: 2025-03-25T04:53:42
https://www.youtube.com/watch?v=toUSzwR0EV8
Stars: 76
Stars Increase: 1