Lead Site Reliability Engineer
- Рівень:
- lead
- Джерело:
- djinni.co
Що робити
- Manage SRE teams;
- Technical excellence of teammates;
- Implement and maintain monitoring solutions using Prometheus, Victoria-Metrics, and Grafana to identify and address performance issues proactively;
- Manage logging infrastructure using Fluent, Fluent-bit, ElasticSearch, and Kibana, ensuring efficient log collection, analysis, and visualization;
- Configure and manage alerting systems like AlertManager and Opsgenie to respond to critical incidents and minimize downtime promptly;
Що очікуємо
- 5+ years of experience in DevOps/SRE roles;
- Strong experience with AWS cloud services;
- Advanced knowledge of Kubernetes and container orchestration;
- Solid understanding of DevOps principles and practices;
- Experience with Helm chart development and maintenance;
Що пропонуємо
- Experience with other cloud providers (GCP, Azure);
- Security certifications (AWS, CKS, etc.);
- Experience with service mesh technologies.
Схожі вакансії
З блогу Trackr
Усі статті →Знайдено через trackr.help/jobs · Канал: @trackrhelp · Бот для персональних сповіщень: @trackrhelpBot

