AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION

Abstract

 Nowadays, in many text mining applications, information is present in the form of text documents. Text document contains various types of information such as side information or metadata. The different types of information such as document provenance information, title of the document, links in the document, user-access behavior from web logs, or other non-textual attributes treated as side information contained into the text document. Such attributes contains a large amount of information for clustering purposes. It is difficult to estimate the importance of this sideinformation when text document contains some of the information is noisy. In such cases, to avoid the low quality of mining process we need a principled way to perform the text mining, to maximize the advantages from using this side information. Conformation to that, this paper represents solution to the use of side information for clustering by hierarchical algorithm which then extends to the classification problem on real data sets.

Authors and Affiliations

Kiran V. Gaidhane

Keywords

Related Articles

 AN APPLICATIONS OF CONTROLLED JUMP MODEL IN FINANCE

 The purpose of this paper is to identify the problem formulation of controlled model with jump process.

 EXPERIMENTAL ANALYSIS OF FIBER METAL LAMINATE WITH ALUMINIUM ALLOY FOR AIRCRAFT STRUCTURES

 The objective is to examine the methods of testing on FIBER/METAL LAMINATE (FML) WITH ALUMINIUM ALLOY to obtain empirical estimates of Load capacity under various loads.FML is the combination of metal and polymer...

 High-Speed Pool of Aggregated Data in Multi-hop Wireless Network

 Data aggregation is a key functionality in wireless sensor networks (WSNs). Focuses on data aggregation scheduling problem to minimize the delay (or latency). The project proposes an efficient distributed algorith...

 BRANCH AND BOUND TECHNIQUE FOR SINGLE MACHINE SCHEDULING PROBLEM USING TYPE-2 TRAPEZOIDAL FUZZY NUMBERS

 This paper deals branch and bound technique to solve single machine scheduling problem involving two processing times along with due date using Type-2 Trapezoidal fuzzy numbers. Our aim is to obtained optimal sequ...

DESIGNING AND ANALYSING DIFFERENT SHAPES OF MEMS BASED ELECTROSTATICALLY CONTROLLED MICROMIRRORS

in adaptive optics and point - to - point communication, various micro mirror devices have been used to reshape the wavefront of a propagating beam to compensate for aberrations in the beam path. Such MEMS mirrors...

Download PDF file
  • EP ID EP112504
  • DOI 10.5281/zenodo.58632
  • Views 105
  • Downloads 0

How To Cite

Kiran V. Gaidhane (30).  AN EFFICIENT APPROACH FOR TEXT MINING USING SIDE INFORMATION. International Journal of Engineering Sciences & Research Technology, 5(7), 1137-1148. https://www.europub.co.uk/articles/-A-112504