Dublin Core
Title
Using Exploratory Data Analysis and Big Data Analytics for Detecting Anomalies
in Cloud Computing
in Cloud Computing
Abstract
– While leveraging cloud computing for large-scale distributed applications allows
seamless scaling, many companies struggle following up with the amount of data generated in terms
of efficient processing and anomaly detection, which is a necessary part of the management of
modern applications. As the record of user behavior, weblogs surely become the research item
related to anomaly detection. Many anomaly detection methods based on automated log analysis
have been proposed. However, not in the context of big data applications where anomalous behavior
needs to be detected in understanding phases prior to modeling a system for such use. Big Data
Analytics often ignores anomalous point due to high volume of data. To address this problem, we
propose a complemented methodology for Big Data Analytics – the Exploratory Data Analysis,
which assists in gaining insight into data relationships without the classical hypothesis modeling. In
that way, we can gain better understanding of the patterns and spot anomalies. Results show that
Exploratory Data Analysis facilitates anomaly detection and the CRISP-DM Business
Understanding phase, making it one of the key steps in the Data Understanding phase.
seamless scaling, many companies struggle following up with the amount of data generated in terms
of efficient processing and anomaly detection, which is a necessary part of the management of
modern applications. As the record of user behavior, weblogs surely become the research item
related to anomaly detection. Many anomaly detection methods based on automated log analysis
have been proposed. However, not in the context of big data applications where anomalous behavior
needs to be detected in understanding phases prior to modeling a system for such use. Big Data
Analytics often ignores anomalous point due to high volume of data. To address this problem, we
propose a complemented methodology for Big Data Analytics – the Exploratory Data Analysis,
which assists in gaining insight into data relationships without the classical hypothesis modeling. In
that way, we can gain better understanding of the patterns and spot anomalies. Results show that
Exploratory Data Analysis facilitates anomaly detection and the CRISP-DM Business
Understanding phase, making it one of the key steps in the Data Understanding phase.
Keywords
Cloud Computing, Big Data, Data Mining, Anomaly Detection
Identifier
2637-2835
DOI
10.14706/JONSAE2021320