Commit 09ea5dad authored by Hi Press's avatar Hi Press
Browse files


parent 452e0719
......@@ -276,7 +276,8 @@ MXNet training script is as follows.
>>> python3 --numprocess 2 --servers host1:1,host2:1 --model vgg19 --comp-threshold 262144 --comp-alg terngrad --horovodrun
# Reproduce the baselines' results
One can use commands at this [repo]( to reproduce the end-to-end training throughput in our SOSP'21 paper.
# References
[1] Frank Seide, Hao Fu, Jasha Droppo, Gang Li, and Dong Yu. 1-bit stochastic gradient descent and its application to data-parallel distributed training of speech dnns. InFifteenth Annual Conference of the International Speech Communication Association, 2014.
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment