Reliability and Availability is most imoortant concern for successful enterprises. It requires understanding of a complex system with all its related parameters. With advent of complex solutions and tooling it is almost impossible to do it manually. Automation is the only panecia to ensure Reliability reliably.
OpsTree follows the principle of More Tech Less People for SRE and NOC services. We have developed a suite of automations and instrumentation to help create required visibility for a complex ecosystem and bring our the exact health status on simple dashboards. As part of this service we setup detailed instrumentation which have zero noise alerting and self healing abilities.
How to test Ansible playbook/role using Molecules with Docker...Read Article
You might be hesitant to incorporate and reap benefits of spot instances...Read Article
Redis Cluster: Architecture, Replication, Sharding and Failover ...Read Article
Drive common standards and practices across projects via Shared Libraries based Jenkins Pipelines...Read Article
Manage your applications way better using Pods. Let’s explore!!...Read Article
Seamless management of Kafka via Kafka Manager...Read Article