Anomaly Detection in Microservices Using Ensemble Methods

Salvado, Nuno Miguel da Silva

Publicação

Anomaly Detection in Microservices Using Ensemble Methods

2025-12-02Dissertação de mestrado

datacite.subject.fos	Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática
dc.contributor.advisor	Freire, Mário Marques
dc.contributor.author	Salvado, Nuno Miguel da Silva
dc.date.accessioned	2025-12-12T12:07:49Z
dc.date.available	2025-12-12T12:07:49Z
dc.date.issued	2025-12-02
dc.date.submitted	2025-10-13
dc.description.abstract	Anomaly detection represents a critical factor in ensuring the reliability and resilience of microservice-based systems, where failures can rapidly propagate and compromise overall service availability. This dissertation investigates the application of classical Machine Learning (ML) algorithms and ensemble methods for anomaly detection in microservices, using the TraceRCA dataset as a representative benchmark. The work begins with a systematic literature review, which categorizes traditional and ML-based approaches to anomaly detection, identifying key research gaps and datasets. Building on this foundation, a complete experimental pipeline was developed, including preprocessing, feature engineering, and anomaly labeling, followed by the evaluation of multiple baseline classifiers such as Logistic Regression (LogReg), Support Vector Machine (SVM), Decision Tree (DT), K-Nearest Neighbors (KNN), Multilayer Perceptron (MLP), and Gaussian Naïve Bayes (GNB). To enhance predictive performance, ensemble techniques including Random Forest (RF), eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Histogram-based Gradient Boosting (HGBM) were implemented and compared against baselines. The evaluation considered both predictive accuracy and resource efficiency, measuring metrics such as F1-score, precision, recall, accuracy, Receiver Operating Characteristic – Area Under the Curve (ROC-AUC), as well as execution time, Random Access Memory (RAM) consumption, and Central Processing Unit (CPU) utilization. The experimental results demonstrate that ensemble models consistently outperform baselines, with boosting-based methods (XGBoost, LightGBM, HGBM) achieving the highest predictive performance, while RF offered stable results with moderate computational overhead. These findings highlight the trade-offs between accuracy and efficiency, underlining the importance of careful algorithm selection according to deployment constraints. This research contributes by providing a comprehensive benchmark of ML and ensemble methods for anomaly detection in microservices, bridging the gap between predictive performance and practical applicability in real-world environments.	eng
dc.identifier.tid	204079268
dc.identifier.uri	http://hdl.handle.net/10400.6/19574
dc.language.iso	eng	por
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.subject	Anomaly Detection	por
dc.subject	Ensemble Methods	por
dc.subject	Machine Learning	por
dc.subject	Microservices	por
dc.subject	Performance Evaluation	por
dc.title	Anomaly Detection in Microservices Using Ensemble Methods	por
dc.type	master thesis	por
dspace.entity.type	Publication
thesis.degree.name	2º Ciclo em Engenharia Informática	por

Ficheiros

Principais

A mostrar 1 - 1 de 1

Nome:: 11513_29166.pdf
Tamanho:: 3.62 MB
Formato:: Adobe Portable Document Format

Ver/Abrir

Coleções

FE - DI | Dissertações de Mestrado e Teses de Doutoramento