Efficient calculation of sentence semantic similarity: a proposed scheme based on machine learning approaches and NLP te

Journal Title: Scientific Journal of Review - Year 2014, Vol 3, Issue 3

Abstract

Sentence semantic similarity plays a crucial role in applications such as Machine Translation, Information Retrieval, Question Answering and Multi-document Summarization. Because natural language expression is highly variable, detecting sentence semantic similarity is not a trivial task. This paper combines Natural Language Processing (NLP) and machine learning techniques in a proposed scheme for sentence semantic similarity. In the first part of the scheme, the NLP section, several sets of linguistic features are extracted: string-based, semantic-based, Named Entity-based and syntax-based. In the second part, machine learning algorithms are used to build classification models on the extracted features. Experimental results for the first part indicate that the extracted features are valid for sentence semantic similarity. In the second part, a comparison of different classification algorithms suggests that KNN is the most successful. Overall, the experimental results indicate that the proposed approach can improve the performance of sentence semantic similarity detection, especially in terms of accuracy.
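The two-part scheme described in the abstract (feature extraction followed by classification) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not list the individual features, so the two string-based features here (word-level Jaccard overlap and a character-level match ratio from Python's `difflib`) are stand-ins, the semantic-based, Named Entity-based and syntax-based features are omitted, the function names `tokenize`, `string_features` and `knn_predict` are hypothetical, and the toy sentence pairs are invented rather than taken from the paper's data.

```python
from collections import Counter
from difflib import SequenceMatcher


def tokenize(sentence):
    """Lowercase whitespace tokenization; a stand-in for a real NLP pipeline."""
    return sentence.lower().split()


def string_features(s1, s2):
    """Two illustrative string-based features for a sentence pair."""
    t1, t2 = set(tokenize(s1)), set(tokenize(s2))
    union = t1 | t2
    # Word-level Jaccard overlap: shared tokens divided by all distinct tokens.
    jaccard = len(t1 & t2) / len(union) if union else 0.0
    # Character-level similarity ratio in [0, 1] from the standard library.
    char_ratio = SequenceMatcher(None, s1.lower(), s2.lower()).ratio()
    return (jaccard, char_ratio)


def knn_predict(train, query, k=3):
    """Majority vote among the k nearest training vectors (squared Euclidean distance)."""
    nearest = sorted(
        train,
        key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], query)),
    )[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy pairs labeled 1 (similar) or 0 (dissimilar); not from the paper.
pairs = [
    ("the cat sat on the mat", "the cat sat on a mat", 1),
    ("he bought a new car", "he bought a new car today", 1),
    ("she likes green tea", "she likes green tea a lot", 1),
    ("the cat sat on the mat", "stocks fell sharply on monday", 0),
    ("he bought a new car", "rain is expected this weekend", 0),
    ("she likes green tea", "the committee postponed its vote", 0),
]
train = [(string_features(a, b), label) for a, b, label in pairs]

# Classify an unseen pair using its feature vector.
prediction = knn_predict(
    train, string_features("a dog ran in the park", "a dog ran in a park"), k=3
)
print(prediction)
```

A real system along the lines of the abstract would add the other feature families to the vector and would likely scale the features before computing distances, since KNN is sensitive to feature ranges.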

Authors and Affiliations

M. Roostaee | Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran
S. M. Fakhrahmad | Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran
M. H. Sadreddini* | Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran
A. Khalili | Department of Computer Science and Engineering and IT, School of Electrical Engineering and Computer, Shiraz, Iran

  • EP ID EP89
  • DOI 10.14196/sjr.v3i3.1259

How To Cite

M. Roostaee, S. M. Fakhrahmad, M. H. Sadreddini*, A. Khalili (2014). Efficient calculation of sentence semantic similarity: a proposed scheme based on machine learning approaches and NLP te. Scientific Journal of Review, 3(3), 94-106. https://www.europub.co.uk/articles/-A-89