Nitka Technologies develops software for customers in the US and Europe and brings together about 300 professionals from Eastern Europe, North and South America, Armenia, Georgia and Kazakhstan.
We are looking for an experienced SRE Consultant. The customer is a U.S. bank holding company that provides a range of financial services, including retail banking, commercial banking, wealth management, and trust services.
Responsibilities:
• Implement and develop SRE practices for banking applications and platforms.
• Identify critical services and business-critical user journeys; formalize SLIs, SLOs, and an error budget policy.
• Design and maintain incident management processes: on-call, escalation paths, runbooks, and postmortems that meet technical and regulatory requirements.
• Improve observability: implement metrics, tracing, logging, and dashboards, and raise alert quality.
• Collaborate with development, architecture, and IT operations teams, as well as business stakeholders.
Requirements:
• Deep understanding of SLIs, SLOs, error budgets, burn rate, and SLO-based alerting.
• Experience with incident management: on-call, escalations, postmortems, and runbooks.
• Experience with observability tools such as Prometheus, Grafana, Splunk, Datadog, or similar.
• Familiarity with OpenTelemetry and distributed tracing concepts.
• Experience with cloud platforms (preferably Azure).
• Strong troubleshooting skills across application, infrastructure, and platform layers.
Working conditions:
- Remote work
- Full-time (8 hours/day)
- Attractive USD compensation
- Paid vacation and holidays
Key skills:
- SLI
- SLO
- Prometheus
- Grafana
- Splunk
- Datadog
- OpenTelemetry
- Azure
- English: B1 (Intermediate)