Visual Video Analytics for Interactive Video Content Analysis

Schöning, Julius; Heidemann, Gunther

doi:10.1007/978-3-030-03402-3_23

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 886)

Included in the following conference series:

Future of Information and Communication Conference

Abstract

Reasoning is an essential processing step for any data analysis task, yet it requires high-level semantic and contextual understanding, e.g., for the identification of entities. In developing an architecture for visual video analytics (VVA), we integrate human knowledge for highly accurate video content analysis, extracting information through a tight coupling of automatic video analysis algorithms on the one hand and visualization and user interaction on the other. For accurate video content analysis, our semi-automatic VVA architecture effectively understands and identifies regular and irregular behavior in real-world datasets. The VVA architecture is described in terms of both (i) its interactive information extraction and representation and (ii) its content-based reasoning process. We give an overview of existing techniques for information extraction and representation, and propose two interactive applications for reasoning. One application uses 3D object representations to provide adaptive playback based on object parts selected in the 3D viewer. The other allows the user to formulate a proposition about the video using all extracted objects and information. If the proposition is correct, the corresponding frames of the video are highlighted. Based on a user study, relevant open topics for increasing the performance of video content analysis and VVA are discussed.


Author information

Authors and Affiliations

Institute of Cognitive Science, Osnabrück University, Osnabrück, Germany
Julius Schöning & Gunther Heidemann

Corresponding author

Correspondence to Julius Schöning.

Editor information

Editors and Affiliations

Faculty of Science and Engineering, Saga University, Saga, Japan
Kohei Arai
The Science and Information (SAI) Organization, London, UK
Supriya Kapoor
The Science and Information (SAI) Organization, Bradford, UK
Rahul Bhatia

About this paper

Cite this paper

Schöning, J., Heidemann, G. (2019). Visual Video Analytics for Interactive Video Content Analysis. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Advances in Information and Communication Networks. FICC 2018. Advances in Intelligent Systems and Computing, vol 886. Springer, Cham. https://doi.org/10.1007/978-3-030-03402-3_23

DOI: https://doi.org/10.1007/978-3-030-03402-3_23
Published: 06 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03401-6
Online ISBN: 978-3-030-03402-3
eBook Packages: Intelligent Technologies and Robotics (R0)
