SentiTFIDF – Sentiment Classification using Relative Term Frequency Inverse Document Frequency
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2014, Vol 5, Issue 2
Abstract
Sentiment Classification refers to the computational techniques for classifying whether the sentiments of text are positive or negative. Statistical Techniques based on Term Presence and Term Frequency, using Support Vector Machine are popularly used for Sentiment Classification. This paper presents an approach for classifying a term as positive or negative based on its proportional frequency count distribution and proportional presence count distribution across positively tagged documents in comparison with negatively tagged documents. Our approach is based on term weighting techniques that are used for information retrieval and sentiment classification. It differs significantly from these traditional methods due to our model of logarithmic differential term frequency and term presence distribution for sentiment classification. Terms with nearly equal distribution in positively tagged documents and negatively tagged documents were classified as a Senti-stop-word and discarded. The proportional distribution of a term to be classified as Senti-stop-word was determined experimentally. We evaluated the SentiTFIDF model by comparing it with state of art techniques for sentiment classification using the movie dataset.
Authors and Affiliations
Kranti Ghag, Ketan Shah
A Non-Linear Regression Modeling is used for Asymmetry Co-Integration and Managerial Economics in Iraqi Firms
This paper analyzes the cost asymmetry through managerial expectations in a nonlinear regression function. Two development determinants, asymmetry co-integration and managerial expectations are also considered. The resul...
Implementation, Verification and Validation of an OpenRISC-1200 Soft-core Processor on FPGA
An embedded system is a dedicated computer system in which hardware and software are combined to per-form some specific tasks. Recent advancements in the Field Programmable Gate Array (FPGA) technology make it possible t...
A New Message Encryption Method based on Amino Acid Sequences and Genetic Codes
As the use of technology is increasing rapidly, the amount of shared, sent, and received information is also increas-ing in the same way. As a result, this necessitates the need for finding techniques that can save and s...
Security Risk Scoring Incorporating Computers' Environment
A framework of a Continuous Monitoring System (CMS) is presented, having new improved capabilities. The system uses the actual real-time configuration of the system and environment characterized by a Configuration Manage...
Performance Improvement of Threshold based Audio Steganography using Parallel Computation
Audio steganography is used to hide secret information inside audio signal for the secure and reliable transfer of information. Various steganography techniques have been proposed and implemented to ensure adequate secur...