Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5

Abstract

Abstract : Arabic multi-word expressions are the combinations of two or more terms that associated with each other as one concept. The process of extracting such expressions is challenging especially when the length of the combination is getting longer. Recently, researchers attempted to extract nested noun compounds whichconsists of two or more combinations of nouns. Such extraction process requires comprehensive analysis using linguistic and statistical approaches. However, the process of extraction in the state of the art have extended to include 4-gra and 5-gram candidates. This paper aims to combine the extraction of nested noun compound andcollocation in order to extend the process of extraction to include 6-gram and 7-gram. For this manner, a linguistic approach comprises of various kinds of pattern has been used, as well as, three statistical measures have been utilized including NC-value, LLR and PMI. Results shown that the proposed method has the ability to extend the extraction to include longer candidates.

Authors and Affiliations

Maryam Yaseen Al-Mashhadani , Luma Adnan Al-Sagban

Keywords

Related Articles

 Monitoring Wireless Sensor Network using Android based Smart Phone Application

 Abstract: Wireless Sensor Network application’s is use in detection of natural calamities like forest fire detection, flood detection, , earth quick early detection ,snow detection, traffic congestion and various o...

ICT for service delivery in Rural India –scope, challenges and present scenario

Abstract: The present era of globalization is based on knowledge and information as it directly affects the economic, social, cultural and political activities of all the regions of the world. Governments worldwide haver...

An Analytical Study of Genetic Algorithm for Generating Frequent Itemset and Framing Association Rules At Various Support Levels

Abstract: In customary, frequent itemsets are propogated from large data sets by employing association rule mining algorithms like Apriori, Partition, Pincer-Search, Incremental and Border algorithm etc., which gains ino...

 Performance Evaluation of a Distributed System Based UponFault Tree Analysis

 Abstract: Distributed Systems is the study of geographically distant processors, connected to one anotherthrough intermediate devices such as routers and/or switches. Simulation provides an insight into the behavio...

Download PDF file
  • EP ID EP154478
  • DOI -
  • Views 77
  • Downloads 0

How To Cite

Maryam Yaseen Al-Mashhadani, Luma Adnan Al-Sagban (2016). Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach. IOSR Journals (IOSR Journal of Computer Engineering), 18(5), 64-69. https://www.europub.co.uk/articles/-A-154478