DISCOVERY OF ALIASES NAME FROM THE WEB

Journal Title: International Journal of Management, IT and Engineering - Year 2012, Vol 2, Issue 8

Abstract

An individual is typically referred to by numerous name aliases on the web. Accurate identification of the aliases of a given personal name is useful in various web-related tasks such as information retrieval, sentiment analysis, personal name disambiguation, and relation extraction. We propose a method to extract aliases of a given personal name from the web. Given a personal name, the proposed method first extracts a set of candidate aliases. Second, it ranks the extracted candidates according to the likelihood that each candidate is a correct alias of the given name. We propose a novel, automatically extracted lexical pattern-based approach to efficiently extract a large set of candidate aliases from snippets retrieved from a web search engine. We define numerous ranking scores to evaluate candidate aliases using three approaches: lexical pattern frequency, word co-occurrences in an anchor text graph, and page counts on the web. To construct a robust alias detection system, we integrate the different ranking scores into a single ranking function using ranking support vector machines. We evaluate the proposed method on three data sets: an English personal names data set, an English place names data set, and a Japanese personal names data set. The proposed method outperforms numerous baselines and previously proposed name alias extraction methods, achieving a statistically significant mean reciprocal rank (MRR) of 0.67. Experiments carried out using location names and Japanese personal names suggest the possibility of extending the proposed method to extract aliases for different types of named entities and for different languages. Moreover, the aliases extracted using the proposed method are successfully utilized in an information retrieval task and improve recall by 20 percent in a relation detection task.
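
For readers unfamiliar with the evaluation measure named in the abstract, the following is a minimal sketch of how a mean reciprocal rank is commonly computed, assuming the standard definition: for each queried name, take the reciprocal of the rank at which the first correct alias appears (0 if none appears), then average over all queried names. The function names and the placeholder data are illustrative assumptions and are not taken from the paper.

```python
from typing import Iterable, Sequence, Set, Tuple


def reciprocal_rank(ranked: Sequence[str], correct: Set[str]) -> float:
    """Return 1/rank of the first correct alias in `ranked`, or 0.0 if none."""
    for position, candidate in enumerate(ranked, start=1):
        if candidate in correct:
            return 1.0 / position
    return 0.0


def mean_reciprocal_rank(
    results: Iterable[Tuple[Sequence[str], Set[str]]]
) -> float:
    """Average the reciprocal rank over all (ranked list, correct set) pairs."""
    pairs = list(results)
    if not pairs:
        return 0.0
    return sum(reciprocal_rank(r, c) for r, c in pairs) / len(pairs)


# Hypothetical placeholder data: one ranked candidate list per queried name,
# paired with the set of aliases considered correct for that name.
example_results = [
    (["alias_a", "noise_1", "noise_2"], {"alias_a"}),  # first hit at rank 1
    (["noise_3", "alias_b", "noise_4"], {"alias_b"}),  # first hit at rank 2
    (["noise_5", "noise_6", "noise_7"], {"alias_c"}),  # no correct alias
]

mrr = mean_reciprocal_rank(example_results)
print(f"MRR = {mrr:.2f}")
```

Under this definition, a value of 0.67 would mean that the first correct alias tends to appear near the top of the ranked list, on average between the first and second positions.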

Authors and Affiliations

N. Thilagavathy, T. Balakumaran, P. Ragu and R. Ranjith kumar

How To Cite

N. Thilagavathy, T. Balakumaran, P. Ragu and R. Ranjith kumar (2012). DISCOVERY OF ALIASES NAME FROM THE WEB. International Journal of Management, IT and Engineering, 2(8), -. https://www.europub.co.uk/articles/-A-18501