Using Word Embeddings for Ontology Enrichment
Journal Title: International Journal of Intelligent Systems and Applications in Engineering - Year 2016, Vol 4, Issue 3
Abstract
Word embeddings, distributed word representations in a reduced linear space, show considerable promise for accomplishing Natural Language Processing (NLP) tasks in an unsupervised manner. In this study, we investigate whether the success of word2vec, a neural-network-based word embedding algorithm, can be replicated in an agglutinative language like Turkish. Turkish is more challenging than languages like English for complex NLP tasks because of its rich morphology. We picked ontology enrichment, itself a relatively hard NLP task, as our test application. First, we show how ontological relations can be extracted automatically from Turkish Wikipedia to construct a gold standard. Then, through experiments, we show that the word vector representations produced by word2vec are useful for detecting ontological relations encoded in Wikipedia. We propose a simple yet effective weakly supervised ontology enrichment algorithm in which, for a given word, a few known ontologically related concepts coupled with similarity scores computed via word2vec models can lead to the discovery of other related concepts. We discuss how our algorithm can be improved and augmented to make it a viable component of an ontology learning and population framework.
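The weakly supervised idea described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the scoring rule (mean cosine similarity to the target word and its known related concepts), the function names `cosine` and `rank_candidates`, and the three-dimensional toy vectors are all assumptions made for illustration. In practice the vectors would come from a word2vec model trained on a Turkish corpus.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def rank_candidates(vectors, target, seeds, top_n=3):
    """Rank unseen words by their mean similarity to the target and seed words.

    Hypothetical scoring rule: the abstract only says that known related
    concepts are coupled with word2vec similarity scores.
    """
    known = {target, *seeds}
    scores = {}
    for word, vec in vectors.items():
        if word in known:
            continue  # only score words not already known to be related
        sims = [cosine(vec, vectors[s]) for s in seeds]
        sims.append(cosine(vec, vectors[target]))
        scores[word] = sum(sims) / len(sims)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_n]

# Made-up toy vectors standing in for trained word2vec embeddings.
toy_vectors = {
    "meyve": [0.9, 0.1, 0.0],    # "fruit" (target word)
    "elma": [0.8, 0.2, 0.1],     # "apple" (known related concept)
    "armut": [0.85, 0.15, 0.05], # "pear" (known related concept)
    "kiraz": [0.8, 0.1, 0.1],    # "cherry" (candidate)
    "araba": [0.0, 0.1, 0.9],    # "car" (candidate)
}

ranked = rank_candidates(toy_vectors, "meyve", ["elma", "armut"])
for word, score in ranked:
    print(f"{word}: {score:.3f}")
```

With these toy vectors, the candidate that points in the same direction as the seed concepts ("kiraz") ranks above the unrelated one ("araba"). A real system would also need a similarity threshold or a cutoff to decide which top-ranked candidates to accept.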
Authors and Affiliations
İzzet Pembeci* | Muğla Sıtkı Koçman University, Department of Computer Engineering