BAAC: Bangor Arabic Annotated Corpus
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2018, Vol 9, Issue 11
Abstract
This paper describes the creation of the new Bangor Arabic Annotated Corpus (BAAC) which is a Modern Standard Arabic (MSA) corpus that comprises 50K words manually annotated by parts-of-speech. For evaluating the quality of the corpus, the Kappa coefficient and a direct percent agreement for each tag were calculated for the new corpus and a Kappa value of 0.956 was obtained, with an average observed agreement of 94.25%. The corpus was used to evaluate the widely used Madamira Arabic part-of-speech tagger and to further investigate compression models for text compressed using part-of-speech tags. Also, a new annotation tool was developed and employed for the annotation process of BAAC.
Authors and Affiliations
Ibrahim S Alkhazi, William J. Teahan
Survey on Human Activity Recognition based on Acceleration Data
Human activity recognition is an important area of machine learning research as it has many utilization in different areas such as sports training, security, entertainment, ambient-assisted living, and health monitoring...
Novel Intra-Prediction Framework for H.264 Video Compression using Decision and Prediction Mode
With the increasing usage of multimedia contents and advancement of the communication devices (along with services), there is a heavy demand of an effective multimedia compression protocol. In this regards, H.264 has bee...
Internal Model Control of A Class of Continuous Linear Underactuated Systems
This paper presents an Internal Model Control (IMC) structure designed for a class of continuous linear underactuated systems. The study treats the case of Minimum Phase (MP) systems and those whose zero dynamics are not...
Investigation of Adherence Degree of Agile Requirements Engineering Practices in Non-Agile Software Development Organizations
Requirements are critical for the success of software projects. Requirements are practically difficult to produce, as the hardest stage of building a software system is to decide what the system should do. Moreover, requ...
Software Effort Prediction using Statistical and Machine Learning Methods
Accurate software effort estimation is an important part of software process. Effort is measured in terms of person months and duration. Both overestimation and underestimation of software effort may lead to risky conseq...