reliability
There are 301 repositories under reliability topic.
alibaba/Sentinel
A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)
dastergon/awesome-sre
A curated list of Site Reliability and Production Engineering resources.
upgundecha/howtheysre
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
hjacobs/kubernetes-failure-stories
Compilation of public failure/horror stories related to Kubernetes
awslabs/aws-well-architected-labs
Hands on labs and code to help you learn, measure, and build using architectural best practices.
codersguild/System-Design
It's just fascinating. How is modern software designed? 🤔 Some design-level considerations for scalability, maintainability eventual consistency, availability & reliability. 👨💻 Interview Prep. 👨💻
chaostoolkit/chaostoolkit
Chaos Engineering Toolkit & Orchestration for Developers
tnballo/high-assurance-rust
A free book about developing secure and robust systems software.
SquadcastHub/awesome-sre-tools
A curated list of Site Reliability and Production Engineering Tools
hynek/stamina
Production-grade retries for Python
mspnp/cloud-design-patterns
Sample implementations for cloud design patterns found in the Azure Architecture Center.
rosehgal/TrashEmail
A hosted disposable email telegram bot; Extremely privacy friendly; Proudly hosted for community.
jxzhangjhu/Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
p-org/PSharp
A framework for rapid development of reliable asynchronous software.
uber/arachne
An always-on framework that performs end-to-end functional network testing for reachability, latency, and packet loss
teivah/designdeck
An Open-Source Collection of 230+ Flash Cards to Help You Succeed in Your System Design Interview and More 💯
traceloop/jest-opentelemetry
Easily run integration tests for your backends
krkn-chaos/krkn
Chaos and resiliency testing tool for Kubernetes with a focus on improving performance under failure conditions. A CNCF sandbox project.
openturns/openturns
Uncertainty treatment library
oldratlee/software-practice-thoughts
📚 🐣 软件实践文集。主题不限,思考讨论有趣有料就好,包含如 系统的模型分析/量化分析、开源漫游者指南、软件可靠性设计实践、平台产品的逻辑与执行… 🥤
OpenIBC/Ohsce
PHP HI-REL SOCKET TCP/UDP/ICMP/Serial .高可靠性PHP通信&控制框架SOCKET-TCP/UDP/ICMP/硬件Serial-RS232/RS422/RS485 AND MORE!
ConnorBrereton/SiteReliabilityEngineering
Notes on Site Reliability Engineering. Leave a 🌟 if you found this useful!
gwsystems/composite
A component-based OS
dastergon/wheel-of-misfortune
A role-playing game for incident management training
irrustible/async-backplane
Simple, Erlang-inspired fault-tolerance framework for Rust Futures.
boschresearch/pylife
a general library for fatigue and reliability
pln-fing-udelar/fast-krippendorff
Fast computation of Krippendorff's alpha agreement measure in Python.
My-Random-Thoughts/QA-Checks-v4
PowerShell scripts to ensure consistent and reliable build quality and configuration for your servers
kvz/nsfailover
Let's Make DNS Outage Suck Less
tuxera/reliance-edge
Transactional power-failsafe filesystem for microcontrollers
SocketSomeone/nestjs-resilience
🛡️ A module for improving the reliability and fault-tolerance of your NestJS applications
zeroc0d3lab/awesome-sre
A curated list of awesome Site Reliability and Production Engineering resources.
Snowflake-Labs/sansshell
A non-interactive daemon for host management
krkn-chaos/cerberus
Guardian of Kubernetes clusters. Tool to monitor clusters health and signal/alert on failures.
BeRo1985/rnl
RNL - Realtime Network Library - The opensource reliable UDP network library
JerryX1110/RPCMVOS
[AAAI22 Oral] Reliable Propagation-Correction Modulation for Video Object Segmentation