PyTorch profiling data reveals optimization strategies for MoE layers in training and inference
https://github.com/deepseek-ai/profile-data
#performanceprofiling #deeplearning #moearchitecture #pytorch #parallelcomputing
0
0
0
0