A Relevant Document Information Clustering Algorithm for Web Search Engine 

Abstract

Search engines are the Hub of Information, The advances in computing and information storage have provided vast amount of Data, the users of World Wide Web is increasingly day by day, It is become more difficult to users get the required information according to their interests. The IR community has explored document clustering as an alternative method of organizing retrieval results, so by using clustering concept we can find the grouped relevant documents. The purpose of clustering is to partitioning the set of entities into different groups called clusters. These groups may consistent in terms of similarity of its members. As the name suggests, the representative based clustering techniques uses some form of representation for each cluster. Thus every group has a member that represents it. The main use is to increase the efficiency of the algorithm and to decrease the cost of the algorithm. Clustering process is done by using k-means partitioning algorithms and Hierarchical clustering algorithms but there are lot of disadvantages, it works very slow and it is not applicable for large databases. So fast greedy k-means algorithm is used it overcomes the drawbacks of k-means algorithm and it is very much accurate and efficient. So we introduce an efficient method to calculate the distortion for this algorithm. This helps the users to find the relevant documents more easily than by relevance ranking.  

Authors and Affiliations

Y. Suresh Babu , Mr. K. Venkat Mutyalu, , Mr. Y. A. Siva Prasad,

Keywords

Related Articles

A review of vertical handoff algorithms based on Multi Attribute Decision Method

Abstract - Next Generation Wireless Networks (NGWN) consists of heterogeneous networks with the support for vertical handoff. Hence vertical handoff algorithms (VHA) are the key components of NGWN. The vertical handoff...

SALT & PEPPER NOISE REMOVAL USING FUZZY BASED ADAPTIVE FILTER 

This paper is based on a novel of filter which includes detection and removal of salt & pepper noise using fuzzy based adaptive filter. Once the detection stage detects the noisy pixels, they are passed on to t...

Highly Secure Distributed Authentication and Intrusion Detection with DataFusion in MANET 

Continuous user-to-device authentication is a challenging task in high security mobile adhoc networks (MANETs). This paper provides distributed combined authentication and intrusion detection with data fusion in su...

Determination of Noise Levels in Using AMS Features of Noisy Speech Signal and Their Comparison  

Great difficulty in recognizing speech is under a noisy background. The signal to noise ratio plays a very important role in speech recognition techniques. The signal to noise ratio is the ratio of the signal estim...

Model for Intrusion Detection System with Data Mining 

Today internet has become very popular medium to communicate between users publicly, due to this, lots of intruder has spread across the internet that perform malicious activity and attack to destroy useful informa...

Download PDF file
  • EP ID EP136123
  • DOI -
  • Views 99
  • Downloads 0

How To Cite

Y. Suresh Babu, Mr. K. Venkat Mutyalu, , Mr. Y. A. Siva Prasad, (2012). A Relevant Document Information Clustering Algorithm for Web Search Engine . International Journal of Advanced Research in Computer Engineering & Technology(IJARCET), 1(8), 16-20. https://www.europub.co.uk/articles/-A-136123