Using Word Embeddings for Ontology Enrichment

Abstract

Word embeddings, distributed word representations in a reduced linear space, show a lot of promise for accomplishing Natural Language Processing (NLP) tasks in an unsupervised manner. In this study, we investigate if the success of word2vec, a Neural Networks based word embeddings algorithm, can be replicated in an aggluginative language like Turkish. Turkish is more challenging than languages like English for complex NLP tasks because of her rich morphology. We picked ontology enrichment, again a relatively harder NLP task, as our test application. Firstly, we show how ontological relations can be extracted automaticaly from Turkish Wikipedia to construct a gold standard. Then by running experiments we show that the word vector representations produced by word2vec are useful to detect ontological relations encoded in Wikipedia. We propose a simple but yet effective weakly supervised ontology enrichment algorithm where for a given word a few know ontologically related concepts coupled with similarity scores computed via word2vec models can result in discovery of other related concepts. We argue how our algorithm can be improved and augmented to make it a viable component of an ontoloy learning and population framework.

Authors and Affiliations

İzzet Pembeci*| Muğla Sıtkı Koçman University. Department of Computer Engineering

Keywords

Related Articles

BAT algorithm for Cryptanalysis of Feistel cryptosystems

Recent cryptosystems constitute an effective task for cryptanalysis algorithms due to their internal structure based on nonlinearity. This problem can be formulated as NP-Hard. It has long been subject to various attacks...

AIR: An Agent for Robust Image Matching and Retrieval

This paper presents a novel scheme coined AIR (Agent for Image Recognition), acting as an agent, to oversee the image matching and retrieval processes. Firstly, neighboring keypoints within close spatial proximity are ex...

Cloud Computing Environments Which Can Be Used in Health Education

At the present time, it is known that cloud computing technologies began to be used widely in information technology. The purpose of this study is to provide information about cloud technologies that can be used in healt...

An Artificial Neural Network Model for Wastewater Treatment Plant of Konya

In this study, modelling of Konya wastewater treatment plant was studied by using artificial neural network with different architectures in Matlab software. All data were obtained from wastewater treatment plant of Konya...

Solution for the Travelling Salesman Problem with a Microcontrollerbased Instantaneous System

The travelling salesman problem (TSP) is one of the most frequently researched combinational optimization problems. Despite its trivial definition, the problem is very difficult to solve. Therefore, it is categorized as...

Download PDF file
  • EP ID EP799
  • DOI 10.18201/ijisae.58806
  • Views 478
  • Downloads 25

How To Cite

İzzet Pembeci* (2016). Using Word Embeddings for Ontology Enrichment. International Journal of Intelligent Systems and Applications in Engineering, 4(3), 49-56. https://www.europub.co.uk/articles/-A-799