DeepSpeed
| Original author(s) | Microsoft Research |
|---|---|
| Developer(s) | Microsoft |
| Initial release | May 18, 2020 |
| Stable release | v0.5.10 / January 14, 2022 |
| Repository | github.com/microsoft/DeepSpeed |
| Written in | Python, CUDA, C++ |
| Type | Software library |
| License | MIT License |
| Website | deepspeed.ai |
DeepSpeed is an open-source deep learning optimization library for PyTorch.[1] The library is designed to reduce computing power and memory use and to train large distributed models with better parallelism on existing computer hardware.[2][3] DeepSpeed is optimized for low-latency, high-throughput training. It includes the Zero Redundancy Optimizer (ZeRO) for training models with one trillion or more parameters.[4] Features include mixed-precision training; single-GPU, multi-GPU, and multi-node training; and custom model parallelism. The DeepSpeed source code is licensed under the MIT License and available on GitHub.[5]
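In practice, these features are enabled through a JSON-style configuration passed to `deepspeed.initialize`, which wraps an ordinary PyTorch model in a DeepSpeed engine. The snippet below is a minimal sketch of that pattern; the placeholder model, batch size, and learning rate are illustrative choices rather than values from this article, and it assumes a single CUDA-capable GPU.

```python
import torch
import deepspeed

# Illustrative configuration: ZeRO stage 2 partitions optimizer states and
# gradients across data-parallel workers; fp16 enables mixed-precision training.
ds_config = {
    "train_batch_size": 8,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

# Placeholder model; any torch.nn.Module works here.
model = torch.nn.Linear(1024, 1024)

# deepspeed.initialize wraps the model in a DeepSpeed engine that manages
# the optimizer, loss scaling, and distributed communication.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

for step in range(10):
    # With fp16 enabled the engine holds half-precision parameters,
    # so inputs are created in half precision on the engine's device.
    inputs = torch.randn(8, 1024, device=model_engine.device, dtype=torch.half)
    loss = model_engine(inputs).float().pow(2).mean()  # dummy loss
    model_engine.backward(loss)  # handles loss scaling and gradient reduction
    model_engine.step()          # optimizer step and gradient zeroing
```

Such scripts are typically started with the `deepspeed` command-line launcher, which sets up the distributed environment for multi-GPU and multi-node runs.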
The DeepSpeed team claimed up to a 6.2x throughput improvement, 2.8x faster convergence, and 4.6x less communication.[6]
References
- ↑ "Microsoft Updates Windows, Azure Tools with an Eye on The Future". May 22, 2020. https://uk.pcmag.com/news-analysis/127085/microsoft-updates-windows-azure-tools-with-an-eye-on-the-future.
- ↑ Yegulalp, Serdar (February 10, 2020). "Microsoft speeds up PyTorch with DeepSpeed". https://www.infoworld.com/article/3526449/microsoft-speeds-up-pytorch-with-deepspeed.html.
- ↑ "Microsoft unveils "fifth most powerful" supercomputer in the world". https://www.neowin.net/news/microsoft-unveils-fifth-most-powerful-supercomputer-in-the-world/.
- ↑ "Microsoft trains world's largest Transformer language model". February 10, 2020. https://venturebeat.com/2020/02/10/microsoft-trains-worlds-largest-transformer-language-model/.
- ↑ "microsoft/DeepSpeed". July 10, 2020. https://github.com/microsoft/DeepSpeed.
- ↑ "DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression" (in en-US). 2021-05-24. https://www.microsoft.com/en-us/research/blog/deepspeed-accelerating-large-scale-model-inference-and-training-via-system-optimizations-and-compression/.
Further reading
- Rajbhandari, Samyam; Rasley, Jeff; Ruwase, Olatunji; He, Yuxiong (2019). "ZeRO: Memory Optimizations Toward Training Trillion Parameter Models". arXiv:1910.02054. https://arxiv.org/pdf/1910.02054.pdf.
External links
- AI at Scale - Microsoft Research
- GitHub - microsoft/DeepSpeed
- ZeRO & DeepSpeed: New system optimizations enable training models with over 100 billion parameters - Microsoft Research
Original source: https://en.wikipedia.org/wiki/DeepSpeed.