Enhancing Service Reliability via Graph Reinforcement Learning: Real-Time Dependency Mapping and Failure Prediction

Rameshbabu Lakshmanasamy

doi:10.53469/jrse.2026.08(05).06

Enhancing Service Reliability via Graph Reinforcement Learning: Real-Time Dependency Mapping and Failure Prediction

Authors

Rameshbabu Lakshmanasamy IEEE Member, USA

DOI:

https://doi.org/10.53469/jrse.2026.08(05).06

Keywords:

Graph Reinforcement Learning, service reliability, failure prediction, site reliability engineering, dynamic dependency mapping

Abstract

In large-scale distributed systems with numerous workflows and microservices, traditional service dependency mapping approaches rely on static graphs that fail to capture real-time changes, leading to delayed incident detection and prolonged downtime. This research explores Graph Reinforcement Learning (GRL) as a dynamic solution for modeling inter-service dependencies and predicting failure propagation in real time. By leveraging real-time telemetry data and historical incidents, GRL continuously updates dependency graphs, reducing Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR). The paper further discusses implementation challenges, including computational complexity and scalability, and proposes solutions such as hierarchical clustering and distributed processing. The findings suggest that GRL significantly enhances system resilience, making it a valuable tool for modern reliability engineering.

Downloads

Published

2026-05-21

How to Cite

Lakshmanasamy, R. (2026). Enhancing Service Reliability via Graph Reinforcement Learning: Real-Time Dependency Mapping and Failure Prediction. Journal of Research in Science and Engineering, 8(5), 38–45. https://doi.org/10.53469/jrse.2026.08(05).06

Download Citation

Issue

Vol. 8 No. 5 (2026)

Section

Articles

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Deprecated: json_decode(): Passing null to parameter #1 ($json) of type string is deprecated in /www/bryanhousepub/ojs/plugins/generic/citations/CitationsPlugin.inc.php on line 49

Enhancing Service Reliability via Graph Reinforcement Learning: Real-Time Dependency Mapping and Failure Prediction

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Current Issue

Browse

Developed By

Information

Make a Submission

Keywords

CONTACT US