
An FPGA-Oriented Quantization Scheme for MobileNet-SSD

  • Yuxuan Xie
  • Bing Liu
  • Lei Feng
  • Xipeng Li
  • Danyin Zou
Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 157)

Abstract

The rising popularity of mobile devices that perform object detection calls for methods to implement such algorithms efficiently on mobile hardware. Deep learning achieves state-of-the-art results in object detection, but it demands a large amount of computation and memory, while mobile devices are often resource-limited because of their small size. The FPGA is a device known for its parallelism, and many efforts have been made to implement deep neural networks on FPGAs. After investigation, we chose to implement MobileNet-SSD on an FPGA, because this network is designed for mobile devices and its size and computational cost are relatively small. Implementing the network on an FPGA still poses challenges, such as the large demand for resources and the low latency required by mobile devices. In this paper, we present a quantization scheme for object detection networks targeting FPGAs, together with a process that simulates the FPGA on a PC to help predict the performance of networks on the FPGA. In addition, we propose an FPGA-based integer-only inference, which greatly reduces resource cost. We adopt the Dynamic Fixed Point method and improve it for object detection networks in order to quantize MobileNet-SSD, an object detection network well suited to embedded systems. With these improvements, our scheme outperforms Ristretto.
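As a reading aid, the following is a minimal sketch of per-tensor dynamic fixed-point quantization in the spirit of the scheme described above; it is not the authors' implementation, and the function names are illustrative. A shared power-of-two exponent (the fractional length) is chosen from the tensor's largest magnitude, values are rounded and saturated to signed integers, and a dequantization step supports the kind of PC-side simulation mentioned in the abstract.

```python
import numpy as np

def dynamic_fixed_point_quantize(x, total_bits=8):
    """Quantize a tensor to dynamic fixed point: one power-of-two scale
    (the fractional length) is chosen per tensor so the largest magnitude
    fits in the signed integer range."""
    max_abs = np.max(np.abs(x))
    # Integer length needed for the largest magnitude, plus a sign bit.
    int_bits = int(np.ceil(np.log2(max_abs))) + 1 if max_abs > 0 else 1
    frac_bits = total_bits - int_bits          # shared exponent for the tensor
    scale = 2.0 ** frac_bits
    # Round to the nearest representable value and saturate.
    q_min, q_max = -(2 ** (total_bits - 1)), 2 ** (total_bits - 1) - 1
    q = np.clip(np.round(x * scale), q_min, q_max).astype(np.int32)
    return q, frac_bits

def dequantize(q, frac_bits):
    """Recover an approximate float tensor, e.g. for simulating the
    quantized network on a PC before deploying to the FPGA."""
    return q.astype(np.float64) / (2.0 ** frac_bits)

# Example: quantize a small weight tensor to 8 bits and measure the error.
w = np.random.randn(4, 4)
q, fl = dynamic_fixed_point_quantize(w, total_bits=8)
print("fractional length:", fl, "max error:", np.max(np.abs(w - dequantize(q, fl))))
```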

Keywords

Quantization · FPGA · MobileNet-SSD

References

  1. Lee, A.: Comparing deep neural networks and traditional vision algorithms in mobile robotics. Swarthmore College (2015)
  2. Chen, X., Peng, X., Li, J.-B., Peng, Y.: Overview of deep kernel learning based techniques and applications. J. Netw. Intell. 1(3), 83–98 (2016)
  3. Howard, A.G., Zhu, M., Chen, B., et al.: MobileNets: efficient convolutional neural networks for mobile vision applications (2017). arXiv:1704.04861
  4. Iandola, F.N., Han, S., Moskewicz, M.W., et al.: SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5 MB model size (2016). arXiv:1602.07360
  5. Yin, P., Zhang, S., Xin, J., et al.: Training ternary neural networks with exact proximal operator (2016). arXiv:1612.06052
  6. Rastegari, M., Ordonez, V., Redmon, J., et al.: XNOR-Net: ImageNet classification using binary convolutional neural networks. In: European Conference on Computer Vision, pp. 525–542. Springer, Cham (2016)
  7. Chen, T., Du, Z., Sun, N., Wang, J., et al.: DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning. In: ASPLOS, vol. 49, no. 4, pp. 269–284. ACM (2014)
  8. Kuang, F.-J., Zhang, S.-Y.: A novel network intrusion detection based on support vector machine and tent chaos artificial bee colony algorithm. J. Netw. Intell. 2(2), 195–204 (2017)
  9. Fan, C., Ding, Q.: ARM-embedded implementation of H.264 selective encryption based on chaotic stream cipher. J. Netw. Intell. 3(1), 9–15 (2018)
  10. Gysel, P.: Ristretto: hardware-oriented approximation of convolutional neural networks (2016). arXiv:1605.06402
  11. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift (2015). arXiv:1502.03167
  12. Liu, B., Zou, D., Feng, L., Feng, S., Fu, P., Li, J.: An FPGA-based CNN accelerator integrating depthwise separable convolution. Electronics 8, 281 (2019)

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  1. Harbin Institute of Technology, Harbin, China
