Site Reliability Engineer (Observability team)
inDriver
Software Engineering
Astana, Kazakhstan
Posted on May 12, 2025
Responsibilities
- Improvement and support of observability tools
- Improvement of the incident management process
- Maintaining a 99.99% SLA for the product
- Introduction of SRE practices to development teams
Qualifications
Must have:
- Experience with observability tools
- Prometheus-like TSDB, EFK/Loki, Jaeger
- Experience adopting observability tools within a company
- Experience troubleshooting problems in production
- Solid experience with Kubernetes (including various operators)
- Experience with any incident management tool (PagerDuty, Opsgenie, etc.)
Nice to have:
- Experience working with AWS
- Experience building an SRE practice within a company
- Development experience: Python/Go
Conditions & Benefits
- Stable salary, official employment
- Health insurance
- Hybrid work mode and flexible schedule
- Relocation package offered for candidates from other regions
- Access to professional counseling services including psychological, financial, and legal support
- Discount club membership
- Diverse internal training programs
- Partially or fully paid additional training courses
- All necessary work equipment