Software:SqueezeNet

From HandWiki
Revision as of 14:24, 9 February 2024 by Raymond Straus (talk | contribs) (simplify)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Deep neural network for image classification, released 2016
SqueezeNet
Original author(s)Forrest Iandola, Song Han, Matthew W. Moskewicz, Khalid Ashraf, Bill Dally, Kurt Keutzer
Initial release22 February 2016; 8 years ago (2016-02-22)
Stable release
v1.1 (June 6, 2016; 8 years ago (2016-06-06))
Repositorygithub.com/DeepScale/SqueezeNet
TypeDeep neural network
LicenseBSD license

In computer vision, SqueezeNet is the name of a deep neural network for image classification that was released in 2016. SqueezeNet was developed by researchers at DeepScale, University of California, Berkeley, and Stanford University. In designing SqueezeNet, the authors' goal was to create a smaller neural network with fewer parameters while achieving competitive accuracy.[1]

Framework support for SqueezeNet

SqueezeNet was originally released on February 22, 2016.[2] This original version of SqueezeNet was implemented on top of the Caffe deep learning software framework. Shortly thereafter, the open-source research community ported SqueezeNet to a number of other deep learning frameworks. On February 26, 2016, Eddie Bell released a port of SqueezeNet for the Chainer deep learning framework.[3] On March 2, 2016, Guo Haria released a port of SqueezeNet for the Apache MXNet framework.[4] On June 3, 2016, Tammy Yang released a port of SqueezeNet for the Keras framework.[5] In 2017, companies including Baidu, Xilinx, Imagination Technologies, and Synopsys demonstrated SqueezeNet running on low-power processing platforms such as smartphones, FPGAs, and custom processors.[6][7][8][9]

As of 2018, SqueezeNet ships "natively" as part of the source code of a number of deep learning frameworks such as PyTorch, Apache MXNet, and Apple CoreML.[10][11][12] In addition, 3rd party developers have created implementations of SqueezeNet that are compatible with frameworks such as TensorFlow.[13] Below is a summary of frameworks that support SqueezeNet.

Framework SqueezeNet Support References
Apache MXNet Native [11]
Apple CoreML Native [12]
Caffe2 Native [14]
Keras 3rd party [5]
MATLAB Deep Learning Toolbox Native [15]
ONNX Native [16]
PyTorch Native [10]
TensorFlow 3rd party [13]
Wolfram Mathematica Native [17]

Relationship to AlexNet

SqueezeNet was originally described in a paper entitled "SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size."[18] AlexNet is a deep neural network that has 240 MB of parameters, and SqueezeNet has just 5 MB of parameters. This small model size can more easily fit into computer memory and can more easily be transmitted over a computer network. However, it's important to note that SqueezeNet is not a "squeezed version of AlexNet." Rather, SqueezeNet is an entirely different DNN architecture than AlexNet.[19] What SqueezeNet and AlexNet have in common is that both of them achieve approximately the same level of accuracy when evaluated on the ImageNet image classification validation dataset.

Relationship to Deep Compression

Model compression (e.g. quantization and pruning of model parameters) can be applied to a deep neural network after it has been trained.[20] In the SqueezeNet paper, the authors demonstrated that a model compression technique called Deep Compression can be applied to SqueezeNet to further reduce the size of the parameter file from 5 MB to 500 KB.[18] Deep Compression has also been applied to other DNNs, such as AlexNet and VGG.[21]

Offshoots of SqueezeNet

Some of the members of the original SqueezeNet team have continued to develop resource-efficient deep neural networks for a variety of applications. A few of these works are noted in the following table. As with the original SqueezeNet model, the open-source research community has ported and adapted these newer "squeeze"-family models for compatibility with multiple deep learning frameworks.

DNN Model Application Original

Implementation

Other

Implementations

SqueezeDet[22][23] Object Detection

on Images

TensorFlow[24] Caffe,[25] Keras[26][27][28]
SqueezeSeg[29] Semantic

Segmentation

of LIDAR

TensorFlow[30]
SqueezeNext[31] Image

Classification

Caffe[32] TensorFlow,[33] Keras,[34]

PyTorch[35]

SqueezeNAS[36][37] Neural Architecture Search

for Semantic Segmentation

PyTorch[38]

In addition, the open-source research community has extended SqueezeNet to other applications, including semantic segmentation of images and style transfer.[39][40][41]

References

  1. Ganesh, Abhinav. "Deep Learning Reading Group: SqueezeNet". KDnuggets. https://www.kdnuggets.com/2016/09/deep-learning-reading-group-squeezenet.html. 
  2. "SqueezeNet". 2016-02-22. https://github.com/DeepScale/SqueezeNet. 
  3. Bell, Eddie (2016-02-26). "An implementation of SqueezeNet in Chainer". https://github.com/ejlb/squeezenet-chainer. 
  4. Haria, Guo (2016-03-02). "SqueezeNet for MXNet". https://github.com/hariag/SqueezeNet. 
  5. 5.0 5.1 Yang, Tammy (2016-06-03). "SqueezeNet Keras Implemenation". https://github.com/DT42/squeezenet_demo. 
  6. Chirgwin, Richard (2017-09-26). "Baidu puts open source deep learning into smartphones". The Register. https://www.theregister.co.uk/2017/09/26/baidu_mobile_deep_learning/. 
  7. Bush, Steve (2018-01-25). "Neural network SDK for PowerVR GPUs". Electronics Weekly. https://www.electronicsweekly.com/news/products/software-products/neural-network-sdk-powervr-gpus-2018-01/. 
  8. Yoshida, Junko (2017-03-13). "Xilinx AI Engine Steers New Course". EE Times. https://www.eetimes.com/document.asp?doc_id=1331467&print=yes. 
  9. Boughton, Paul (2017-08-28). "Deep learning computer vision algorithms ported to processor IP". Engineer Live. https://www.engineerlive.com/content/deep-learning-computer-vision-algorithms-ported-processor-ip. 
  10. 10.0 10.1 "squeezenet.py". https://github.com/pytorch/vision/blob/master/torchvision/models/squeezenet.py. 
  11. 11.0 11.1 "squeezenet.py". https://github.com/apache/incubator-mxnet/blob/6dcb0ebb39558d83df4d9fd0338a46f775b94bd6/python/mxnet/gluon/model_zoo/vision/squeezenet.py. 
  12. 12.0 12.1 "CoreML". https://developer.apple.com/machine-learning/. 
  13. 13.0 13.1 Poster, Domenick. "Tensorflow implementation of SqueezeNet". https://github.com/vonclites/squeezenet. 
  14. Inkawhich, Nathan. "SqueezeNet Model Quickload Tutorial". https://github.com/caffe2/caffe2/blob/c63037fd27493d0db72debd8202776124fa46cb8/caffe2/python/tutorials/Model_Quickload.ipynb. 
  15. "SqueezeNet for MATLAB Deep Learning Toolbox". Mathworks. https://www.mathworks.com/help/deeplearning/ref/squeezenet.html. Retrieved 2018-10-03. 
  16. Fang, Lu. "SqueezeNet for ONNX". https://github.com/onnx/models/tree/master/squeezenet. 
  17. "SqueezeNet V1.1 Trained on ImageNet Competition Data". https://resources.wolframcloud.com/NeuralNetRepository/resources/SqueezeNet-V1.1-Trained-on-ImageNet-Competition-Data. 
  18. 18.0 18.1 Iandola, Forrest N; Han, Song; Moskewicz, Matthew W; Ashraf, Khalid; Dally, William J; Keutzer, Kurt (2016). "SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size". arXiv:1602.07360 [cs.CV].
  19. "SqueezeNet". http://www.shortscience.org/paper?bibtexKey=journals/corr/IandolaMAHDK16#unmesh. 
  20. Gude, Alex (2016-08-09). "Lab41 Reading Group: Deep Compression". https://gab41.lab41.org/lab41-reading-group-deep-compression-9c36064fb209. 
  21. Han, Song (2016-11-06). "Compressing and regularizing deep neural networks". https://www.oreilly.com/ideas/compressing-and-regularizing-deep-neural-networks. 
  22. Wu, Bichen; Wan, Alvin; Iandola, Forrest; Jin, Peter H.; Keutzer, Kurt (2016). "SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving". arXiv:1612.01051 [cs.CV].
  23. Nunes Fernandes, Edgar (2017-03-02). "Introducing SqueezeDet: low power fully convolutional neural network framework for autonomous driving". https://theintelligenceofinformation.wordpress.com/2017/03/02/introducing-squeezedet-low-power-fully-convolutional-neural-network-framework-for-autonomous-driving/. 
  24. Wu, Bichen (2016-12-08). "SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving". https://github.com/BichenWuUCB/squeezeDet. 
  25. Kuan, Xu (2017-12-20). "Caffe SqueezeDet". https://github.com/XuSen123/caffe-squeezeDet. 
  26. Padmanabha, Nischal (2017-03-20). "SqueezeDet on Keras". https://github.com/omni-us/squeezedet-keras. 
  27. Ehmann, Christopher (2018-05-29). "Fast object detection with SqueezeDet on Keras". https://medium.com/omnius/fast-object-detection-with-squeezedet-on-keras-5cdd124b46ce. 
  28. Ehmann, Christopher (2018-05-02). "A deeper look into SqueezeDet on Keras". https://medium.com/omnius/a-deeper-look-into-squeezedet-on-keras-13f107d0dd32. 
  29. Wu, Bichen; Wan, Alvin; Yue, Xiangyu; Keutzer, Kurt (2017). "SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud". arXiv:1710.07368 [cs.CV].
  30. Wu, Bichen (2017-12-06). "SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud". https://github.com/BichenWuUCB/SqueezeSeg. 
  31. Gholami, Amir; Kwon, Kiseok; Wu, Bichen; Tai, Zizheng; Yue, Xiangyu; Jin, Peter; Zhao, Sicheng; Keutzer, Kurt (2018). "SqueezeNext: Hardware-Aware Neural Network Design". arXiv:1803.10615 [cs.CV].
  32. Gholami, Amir (2018-04-18). "SqueezeNext". https://github.com/amirgholami/SqueezeNext. 
  33. Verhulsdonck, Tijmen (2018-07-09). "SqueezeNext Tensorflow: A tensorflow Implementation of SqueezeNext". https://github.com/Timen/squeezenext-tensorflow. 
  34. Sémery, Oleg (2018-09-24). "SqueezeNext, implemented in Keras". https://github.com/osmr/imgclsmob/tree/master/keras_/kerascv/models. 
  35. Lu, Yi (2018-06-21). "SqueezeNext.PyTorch". https://github.com/luuuyi/SqueezeNext.PyTorch. 
  36. Shaw, Albert; Hunter, Daniel; Iandola, Forrest; Sidhu, Sammy (2019). "SqueezeNAS: Fast neural architecture search for faster semantic segmentation". arXiv:1908.01748 [cs.LG].
  37. Yoshida, Junko (2019-08-25). "Does Your AI Chip Have Its Own DNN?". EE Times. https://www.eetimes.com/document.asp?doc_id=1335063. 
  38. Shaw, Albert (2019-08-27). "SqueezeNAS". https://github.com/ashaw596/squeezenas. 
  39. Treml, Michael (2016). "Speeding up Semantic Segmentation for Autonomous Driving". NIPS MLITS Workshop. https://openreview.net/pdf?id=S1uHiFyyg. 
  40. Zeng, Li (2017-03-22). "SqueezeNet Neural Style on PyTorch". https://github.com/lizeng614/SqueezeNet-Neural-Style-Pytorch. 
  41. Wu, Bichen; Keutzer, Kurt (2017). "The Impact of SqueezeNet". https://aspire.eecs.berkeley.edu/wiki/_media/eop/2017/bichen_talk_slides.pdf.