Task failures in decentralized Internet of Things (IoT)-edge computing environments not only lead to inefficiencies, increased latency, and resource wastage but can also introduce system instability and cause application malfunctions. These failures may arise due to network disruptions, resource constraints, or inefficient task scheduling, ultimately affecting the overall reliability and performance of IoT-edge systems. This study presents a novel Long Short-Term Memory (LSTM)-based Federated Learning (FL) framework for proactive task failure prediction, ensuring adaptive scheduling and efficient resource utilization. Unlike existing conventional methods, our approach personalizes failure prediction per device, addressing heterogeneous execution characteristics while preserving data privacy. By integrating LSTM with FL, we improve the failure detection accuracy and reduce unnecessary task executions. We first trained all models using Federated Learning (FL) and then conducted a comparative analysis of Convolutional Neural Networks (CNN), Gated Recurrent Units (GRU), and LSTM. Our findings show that LSTM achieves the highest accuracy and F1 score, while CNN excels in recall and energy efficiency. These insights validate the effectiveness of our FL-based failure prediction framework and highlight the advantages of model personalization for dynamic decentralized IoT-edge environments.
Decentralized IoT-Edge Computing: An LSTM-Based Federated Learning Framework for Personalized Task Failure Prediction
Ali N.;Aloi G.;Gravina R.;Savaglio C.;Fortino G.
2025-01-01
Abstract
Task failures in decentralized Internet of Things (IoT)-edge computing environments not only lead to inefficiencies, increased latency, and resource wastage but can also introduce system instability and cause application malfunctions. These failures may arise due to network disruptions, resource constraints, or inefficient task scheduling, ultimately affecting the overall reliability and performance of IoT-edge systems. This study presents a novel Long Short-Term Memory (LSTM)-based Federated Learning (FL) framework for proactive task failure prediction, ensuring adaptive scheduling and efficient resource utilization. Unlike existing conventional methods, our approach personalizes failure prediction per device, addressing heterogeneous execution characteristics while preserving data privacy. By integrating LSTM with FL, we improve the failure detection accuracy and reduce unnecessary task executions. We first trained all models using Federated Learning (FL) and then conducted a comparative analysis of Convolutional Neural Networks (CNN), Gated Recurrent Units (GRU), and LSTM. Our findings show that LSTM achieves the highest accuracy and F1 score, while CNN excels in recall and energy efficiency. These insights validate the effectiveness of our FL-based failure prediction framework and highlight the advantages of model personalization for dynamic decentralized IoT-edge environments.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


