Abstract
In this paper, we describe MongoIE, an Open Information Extraction (Open IE) system for the Mongolian language. We present the characteristics of the language and, after analyzing the available preprocessing tools, describe the features used to build the system. We have implemented two different approaches: (1) rule-based and (2) classification-based. We describe both, analyze their errors, and present their results. To the best of our knowledge, this is the first attempt at building an Open IE system for Mongolian. We conclude by suggesting possible future improvements and directions.
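To make the rule-based approach concrete, the following minimal sketch shows what a single extraction rule over part-of-speech-tagged tokens might look like. It is not MongoIE's actual rule set: the tag names (`N`, `ADJ`, `V`), the helper functions `noun_phrase` and `extract_triple`, and the example tokens (English glosses arranged in verb-final order, as in Mongolian clauses) are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Token = Tuple[str, str]        # (word, coarse POS tag); the tag set is hypothetical
Triple = Tuple[str, str, str]  # (argument 1, relation, argument 2)


def noun_phrase(tokens: List[Token], start: int) -> Optional[Tuple[str, int]]:
    """Read zero or more ADJ tokens followed by one N token, beginning at `start`.

    Returns the phrase text and the index after it, or None if no phrase matches.
    """
    i = start
    words: List[str] = []
    while i < len(tokens) and tokens[i][1] == "ADJ":
        words.append(tokens[i][0])
        i += 1
    if i < len(tokens) and tokens[i][1] == "N":
        words.append(tokens[i][0])
        return " ".join(words), i + 1
    return None


def extract_triple(tokens: List[Token]) -> Optional[Triple]:
    """Apply one toy rule for a verb-final clause: noun phrase, noun phrase, verb."""
    first = noun_phrase(tokens, 0)
    if first is None:
        return None
    arg1, i = first
    second = noun_phrase(tokens, i)
    if second is None:
        return None
    arg2, i = second
    # The clause must end with exactly one verb, which becomes the relation.
    if i == len(tokens) - 1 and tokens[i][1] == "V":
        return (arg1, tokens[i][0], arg2)
    return None


# Glossed example in verb-final order; the tags are assumed to come from a POS tagger.
clause = [("Bat", "N"), ("new", "ADJ"), ("book", "N"), ("read", "V")]
print(extract_triple(clause))
```

A classification-based extractor would replace the hand-written pattern with a trained model that scores candidate triples, typically using features derived from the same kind of tagged input.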
Notes
- 1. One of the autonomous regions of China.
- 2.
- 3.
- 4. Available at: https://bit.ly/2nClF3q.
Acknowledgements
This work was supported by an Ernst Mach-Stipendien (Eurasia-Pacific Uninet) grant funded by the Austrian Agency for International Cooperation in Education and Research (OeAD-GmbH) and the Centre for International Cooperation and Mobility (ICM).
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Lkhagvasuren, G., Rentsendorj, J. (2020). Open Information Extraction for Mongolian Language. In: Pan, JS., Li, J., Tsai, PW., Jain, L. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. Smart Innovation, Systems and Technologies, vol 157. Springer, Singapore. https://doi.org/10.1007/978-981-13-9710-3_31
DOI: https://doi.org/10.1007/978-981-13-9710-3_31
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9709-7
Online ISBN: 978-981-13-9710-3
eBook Packages: Intelligent Technologies and Robotics (R0)