Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5

Abstract

Abstract : Arabic multi-word expressions are the combinations of two or more terms that associated with each other as one concept. The process of extracting such expressions is challenging especially when the length of the combination is getting longer. Recently, researchers attempted to extract nested noun compounds whichconsists of two or more combinations of nouns. Such extraction process requires comprehensive analysis using linguistic and statistical approaches. However, the process of extraction in the state of the art have extended to include 4-gra and 5-gram candidates. This paper aims to combine the extraction of nested noun compound andcollocation in order to extend the process of extraction to include 6-gram and 7-gram. For this manner, a linguistic approach comprises of various kinds of pattern has been used, as well as, three statistical measures have been utilized including NC-value, LLR and PMI. Results shown that the proposed method has the ability to extend the extraction to include longer candidates.

Authors and Affiliations

Maryam Yaseen Al-Mashhadani , Luma Adnan Al-Sagban

Keywords

Related Articles

 Protecting Attribute Disclosure for High Dimensionality and Preserving Publishing of Microdata

 : Generalization and Bucketization, have been designed for privacy preserving microdata publishing. Recent work has shown that generalization loses considerable amount of information, especially for high-dimen...

An Enhanced Authentication System Using Face and Fingerprint Technologies

Abstract: The primary aim of this paper is to develop an enhanced authentication system using a CascadedLink Feed-Forward Neural Networks. In the end, the system overcomes some limitations of face recognition and fingerp...

 Virtualization: A Sustainable Resource Management Strategy inComputing Practices

 Abstract: Many computing practitioners are challenged with resource inefficiencies and insufficienciesemanating from poor management strategy. In order to reduce complexity and risk while improvingproductivity, pra...

 A Study of Geographic Adaptive Fidelity Routing Protocol in Wireless Sensor Network

 Abstract: The Energy consumption is a great challenge in the field on Wireless Sensor Network (WSN). It affects the performance of the whole network. There are two basic functions of nodes in the WSN. First is to c...

Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach

Abstract : Arabic multi-word expressions are the combinations of two or more terms that associated with each other as one concept. The process of extracting such expressions is challenging especially when the length of t...

Download PDF file
  • EP ID EP154478
  • DOI -
  • Views 115
  • Downloads 0

How To Cite

Maryam Yaseen Al-Mashhadani, Luma Adnan Al-Sagban (2016). Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach. IOSR Journals (IOSR Journal of Computer Engineering), 18(5), 64-69. https://www.europub.co.uk/articles/-A-154478