Human Tracking for Facility Surveillance

Wen, Shin-Yi; Yen, Yu; Chen, Albert Y.

doi:10.1007/978-3-030-17798-0_27

Shin-Yi Wen¹⁶,
Yu Yen¹⁶ &
Albert Y. Chen¹⁶

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 944))

Included in the following conference series:

Science and Information Conference

2244 Accesses
2 Citations

Abstract

This research provides two main changes based on Detect-And-Track. To improve the Multi-Object Tracking Accuracy (MOTA) while keeping the lightweight of the original approach, this paper proposes a gradient approach to obtain higher MOTA. We use the location of two previous frames of the same identified person to calculate the gradient for the location prediction of the current frame. Then, the predicted and the detected locations are compared. We also compare the current and previous detections. With a weighted combination for matching, we increase the MOTA score and improve the results of Detect-And-Track. Moreover, this research replaces cosine distance, the original feature extractor, with Euclidean distance. By doing so, feature extraction can match Intersection over Union (IoU) better. The weighted combination, which consists of IoU and Euclidean distance, provides a better MOTA than Detect-And-Track. In addition, a greedy approach facilitates a higher MOTA when implement with IoU and Euclidean distance. This weighted combination utility is superior than the combination of IoU and cosine distance, achieving 56.1% MOTA in total on the validation data of PoseTrack ICCV’17 dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bishop, G., Welch, G.: An Introduction to the Kalman Filter, p. 80 (2001)
Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20, 273–297 (1995)
MATH Google Scholar
Girdhar, R., Gkioxari, G., Torresani, L., Paluri, M., Tran, D.: Detect-and-track: efficient pose estimation in videos. In: CVPR (2018)
Google Scholar
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (2015)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2014)
Google Scholar
He, K., Gkioxari, G., Dollar, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (2017)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
Google Scholar
Iandola, F.N., Han, S., Moskewicz, M.W., Ashraf, K., Dally, W.J., Keutzer, K.: SqueezeNet: alexnet-level accuracy with \(50\times \) fewer parameters and \(<\) 0.5 Mb model size. In: ICLR (2017)
Google Scholar
Iqbal, U., Milan, A., Gall, J.: PoseTrack: joint multi-person pose estimation and tracking. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017). https://arxiv.org/abs/1611.07727
Kuhn, H.W.: The Hungarian method for the assignment problem. Nav. Res. Logist. Q. 2(1–2), 83–97 (1955). https://doi.org/10.1002/nav.3800020109
Article MathSciNet MATH Google Scholar
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: 2016 YOLO You only look once: unified, real-time object detection. In: CVPR (2016)
Google Scholar
Redmon, J., Farhadi, A.: YOLO9000: Better, faster, stronger. In: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 (2017)
Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. (2017)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (ICRL) (2015)
Google Scholar
Uijlings, J.R., Van De Sande, K.E., Gevers, T., Smeulders, A.W.: Selective search for object recognition. Int. J. Comput. Vis. 104, 154–171 (2013)
Article Google Scholar

Download references

Author information

Authors and Affiliations

National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei, 10617, Taiwan
Shin-Yi Wen, Yu Yen & Albert Y. Chen

Authors

Shin-Yi Wen
View author publications
You can also search for this author in PubMed Google Scholar
Yu Yen
View author publications
You can also search for this author in PubMed Google Scholar
Albert Y. Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Albert Y. Chen .

Editor information

Editors and Affiliations

Saga University, Saga, Saga, Japan
Kohei Arai
The Science and Information (SAI) Organization, Bradford, West Yorkshire, UK
Supriya Kapoor

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wen, SY., Yen, Y., Chen, A.Y. (2020). Human Tracking for Facility Surveillance. In: Arai, K., Kapoor, S. (eds) Advances in Computer Vision. CVC 2019. Advances in Intelligent Systems and Computing, vol 944. Springer, Cham. https://doi.org/10.1007/978-3-030-17798-0_27

Download citation

DOI: https://doi.org/10.1007/978-3-030-17798-0_27
Published: 24 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-17797-3
Online ISBN: 978-3-030-17798-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics