Publication
Anomaly Detection in Microservices Using Ensemble Methods
| datacite.subject.fos | Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática | |
| dc.contributor.advisor | Freire, Mário Marques | |
| dc.contributor.author | Salvado, Nuno Miguel da Silva | |
| dc.date.accessioned | 2025-12-12T12:07:49Z | |
| dc.date.available | 2025-12-12T12:07:49Z | |
| dc.date.issued | 2025-12-02 | |
| dc.date.submitted | 2025-10-13 | |
| dc.description.abstract | Anomaly detection represents a critical factor in ensuring the reliability and resilience of microservice-based systems, where failures can rapidly propagate and compromise overall service availability. This dissertation investigates the application of classical Machine Learning (ML) algorithms and ensemble methods for anomaly detection in microservices, using the TraceRCA dataset as a representative benchmark. The work begins with a systematic literature review, which categorizes traditional and ML-based approaches to anomaly detection, identifying key research gaps and datasets. Building on this foundation, a complete experimental pipeline was developed, including preprocessing, feature engineering, and anomaly labeling, followed by the evaluation of multiple baseline classifiers such as Logistic Regression (LogReg), Support Vector Machine (SVM), Decision Tree (DT), K-Nearest Neighbors (KNN), Multilayer Perceptron (MLP), and Gaussian Naïve Bayes (GNB). To enhance predictive performance, ensemble techniques including Random Forest (RF), eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Histogram-based Gradient Boosting (HGBM) were implemented and compared against baselines. The evaluation considered both predictive accuracy and resource efficiency, measuring metrics such as F1-score, precision, recall, accuracy, Receiver Operating Characteristic – Area Under the Curve (ROC-AUC), as well as execution time, Random Access Memory (RAM) consumption, and Central Processing Unit (CPU) utilization. The experimental results demonstrate that ensemble models consistently outperform baselines, with boosting-based methods (XGBoost, LightGBM, HGBM) achieving the highest predictive performance, while RF offered stable results with moderate computational overhead. These findings highlight the trade-offs between accuracy and efficiency, underlining the importance of careful algorithm selection according to deployment constraints. This research contributes by providing a comprehensive benchmark of ML and ensemble methods for anomaly detection in microservices, bridging the gap between predictive performance and practical applicability in real-world environments. | eng |
| dc.identifier.tid | 204079268 | |
| dc.identifier.uri | http://hdl.handle.net/10400.6/19574 | |
| dc.language.iso | eng | por |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | |
| dc.subject | Anomaly Detection | por |
| dc.subject | Ensemble Methods | por |
| dc.subject | Machine Learning | por |
| dc.subject | Microservices | por |
| dc.subject | Performance Evaluation | por |
| dc.title | Anomaly Detection in Microservices Using Ensemble Methods | por |
| dc.type | master thesis | por |
| dspace.entity.type | Publication | |
| thesis.degree.name | 2º Ciclo em Engenharia Informática | por |
Files
Original bundle
1 - 1 of 1
