WORLD JOURNAL OF INNOVATION AND MODERN TECHNOLOGY (WJIMT)

E-ISSN 2504-4766
P-ISSN 2682-5910
VOL. 7 NO. 2 2023
DOI: https://doi.org/10.56201/wjimt.v7.no2.2023.pg105.108


Fault Tolerance in Distributed Systems (December 2023)

Clarian Makungu


Abstract


This study examines fault tolerance mechanisms in distributed systems and their real-world applications across industries, focusing on how fault tolerance strategies preserve continuous functionality in the face of faults and failures. Real-life instances from social media platforms, financial markets, cloud computing, e-commerce, entertainment, and healthcare illustrate practical implementations. The analysis begins with social media platforms such as Facebook and Twitter, showing how replicating data across multiple servers ensures uninterrupted user experiences. In the financial domain, the study considers stock trading platforms, emphasizing the critical role of redundancy in maintaining transactional integrity during server failures. Cloud computing, a cornerstone of modern infrastructure, is examined to demonstrate how fault tolerance in distributed systems guarantees business continuity for service providers and their clients. The fault tolerance strategies of e-commerce giants are also highlighted, showing their ability to manage surges in transaction volumes without compromising user experience. In entertainment, video streaming services use fault tolerance to adaptively adjust video quality, ensuring uninterrupted viewing even under fluctuating network conditions. The study further examines architectural choices within healthcare systems, showing how a microservices architecture contributes to fault isolation and system stability. Alongside these practical applications, the study underscores the perpetual challenge of balancing fault tolerance, system complexity, and resource utilization. The pursuit of faultlessness in distributed systems remains an ongoing endeavor, vital for ensuring reliabili


Keywords:

Fault Tolerance, Distributed Systems, Resilience, Redundancy, Cloud Computing, Social Media Platforms, Financial Markets, E-commerce, Video Streaming Services, Healthcare Systems, Microservices Arch



