Identification of Fake Contents Using Text-mining Techniques
Journal Title: International Journal of Innovations in Science and Technology - Year 2024, Vol 6, Issue 4
Abstract
In recent years, social media users have become increasingly concerned about sharing content that may be unpleasant or harmful. The widespread use of platforms like Facebook and Twitter has contributed significantly to this growing awareness. The primary objective of our approach is to accelerate and automate the detection of offensive content posted on these platforms, making it simpler to take the necessary actions and filter harmful communications. A benchmark dataset, OLID 2019 (Offensive Language Identification Dataset), is available online to aid in this task. Our study focuses on identifying whether a tweet is offensive. We compared several feature extraction methods and model-building algorithms, and the comparative analysis showed that decision trees were the most effective model. Decision trees applied to the normalized dataset resulted in an 84% improvement in the Macro F1 score, which aligns with previous research. In conclusion, a real-time system could be developed across multiple social media platforms to detect and evaluate objectionable posts, enabling timely interventions that promote healthier online behavior and foster a positive societal impact.
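Since the abstract reports its result as a Macro F1 score, a minimal sketch of how that metric is computed may help. It is an illustration only, not the authors' evaluation code: the function name `macro_f1`, the label strings "OFF" and "NOT", and the toy predictions are all assumptions made for this example.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Return the unweighted mean of the per-class F1 scores."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    labels = sorted(set(y_true) | set(y_pred))
    if not labels:
        return 0.0

    # Count true positives, false positives and false negatives per class.
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, guess in zip(y_true, y_pred):
        if truth == guess:
            tp[truth] += 1
        else:
            fp[guess] += 1
            fn[truth] += 1

    # F1 for one class is 2*TP / (2*TP + FP + FN); a class with no
    # true or predicted instances contributes 0.
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)

    # Macro averaging weights every class equally, regardless of its size.
    return sum(scores) / len(scores)


# Hypothetical toy labels (assumed for illustration, not taken from OLID).
y_true = ["OFF", "NOT", "NOT", "OFF", "NOT", "NOT"]
y_pred = ["OFF", "NOT", "OFF", "NOT", "NOT", "NOT"]
score = macro_f1(y_true, y_pred)
print(f"Macro F1: {score:.3f}")
```

Because each class counts equally, the smaller offensive class influences the score as much as the larger non-offensive class, which is why macro averaging is a common choice for imbalanced tasks of this kind.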
Authors and Affiliations
Saqlain Sajjad, Hafiz Muhammad Ghazi, Muhammad Asgher Nadeem, Muhammad Irfan Habib, Muhammad Salman Saeed, Syed Ali Hasnain Naqvi, Zeeshan Ahmad Arfeen, Isheeaq Naeem, Muhammad Irfan
Performance Evaluation of Fuzzy Logic-Based RPL Objective Functions
Introduction: This paper evaluates different fuzzy logic-based approaches implemented in the Routing Protocol for Low-Power and Lossy Networks (RPL) across different topologies. Importance: Th...
Addressing Class Imbalance in Credit Card Fraud Detection: A Hybrid Deep Learning Approach
The rise of credit card fraud is a global concern, demanding reliable detection methods that can overcome challenges with imbalanced datasets and limited exploration of hybrid modeling approaches. This study introduces...
https://journal.50sea.com/index.php/IJIST/article/view/967/1548
Spectral power analysis was employed to assess the Fractal Dimension (FD) and explore fractal scaling using Hurst increment ranges and second-order moment relations in the context of urban population trends. This resea...
AI-Driven Weed Classification for Improved Cotton Farming in Sindh, Pakistan
This study presents the combination of artificial intelligence and IoT in precision agriculture, focusing on weed detection and cotton plant monitoring in Sindh, Pakistan. The uniqueness lies in creati...
A Large Language Model-based Web Application for Contextual Document Conversation
The emergence of Large Language Models (LLMs), such as ChatGPT, Gemini, and Claude, has ushered in a new era of natural language processing, enabling rich textual interactions with computers. However, despite the capab...