原创 Facebook推出數據並行訓練算法FSDP:採用更少的GPU,更高效地訓練更大數量級的模型

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ