Add Autoconfig and Coordinated_Optimizer implementations for Tensor Parallel Autosharding #7895
| Job | Run time |
|---|---|
| | 2m 35s |
| | 22m 0s |
| | 29m 56s |
| | 22m 54s |
| | 6m 49s |
| | 5m 58s |
| | 3m 40s |
| Total | 1h 33m 52s |