A Semantic Approach for Outlier Detection in Big Data Streams
Journal Title: Webology - Year 2019, Vol 16, Issue 1
Abstract
In recent years, the world faced a big revolution in data generation and collection technologies. The volume, velocity and veracity of data have changed drastically and led to new types of challenges related to data analysis, modeling and prediction. One of the key challenges is related to the semantic analysis of textual data especially in big data streams settings. The existing solutions focus on either topic analysis or the sentiment analysis. Moreover, the semantic outlier detection over data streams as one of the key problems in data mining and data analysis fields has less focus. In this paper, we introduce a new concept of semantic outlier through which the topic of the textual data is considered as the primary content of the data stream while the sentiment is considered as the context in which the data has been generated and affected. Also, we propose a framework for semantic outlier detection in big data streams which incorporates the contextual detection concepts. The advantage of the proposed concept is that it incorporates both topic and sentiment analysis into one single process; while at the same time the framework enables the implementation of different algorithms and approaches for semantic analysis.
Authors and Affiliations
Hussien Ahmad and Salah Dowaji
Teens, librarians, and social networking: What librarians need to know?
This book is a collection of wide-ranging, informative and provocative chapters discussing the use of social networks to serve teens, both online and in the library. Comprehensive surveys on this topic are being discusse...
Social networking tools and research information systems: Do they compete?
Current developments in the area of research information have led to two different kinds of systems dealing with research-related information: Social networking tools for researchers have reached public attention through...
Techniques for text classification: Literature review and current trends
Automated classification of text into predefined categories has always been considered as a vital method to manage and process a vast amount of documents in digital forms that are widespread and continuously increasing....
American libraries and the Internet: The social construction of Web appropriation and use
One can hardly find any aspect of human life that has not been affected one way or another by the Internet. Most of the Internet’s impact is because of the changes it has brought about in the areas of communication and...
Marketing of Library and Information Services in Global Era: A Current Approach
This paper deals with the marketing of library and information services in the global era. It discusses about the marketing concept of today's library and information centers covering various topics such as management...