🔹 Lead stability, scalability, and operational excellence at scale.
Softsich is a young and ambitious international product tech company that develops scalable B2B digital platforms. We’re on the lookout for an experienced DevOps Lead/Head who’s ready to grow with us and shape high-load, data-driven solutions used by international partners.
Your key responsibilities will include:
– Owning and managing cloud infrastructure across production, staging, and development environments. – Designing, implementing, and maintaining CI/CD pipelines and deployment automation. – Ensuring production stability, high availability, and scalability of all platform services. – Configuring and maintaining monitoring, alerting, and observability tooling. – Leading incident response for infrastructure-related issues and driving post-incident reviews. – Planning and executing capacity management to support platform growth. – Optimizing infrastructure costs: tracking spend, eliminating waste, right-sizing resources. – Maintaining disaster recovery procedures, runbooks, and on-call readiness. – Coordinating with development teams on infrastructure requirements and service dependencies. – Managing developer tooling and internal platform services to support engineering productivity. – Ensuring security best practices and compliance requirements across infrastructure. – Continuously improving deployment processes, reliability, and operational efficiency. – Leading, mentoring, and growing a team of infrastructure engineers. – Validating and challenging technical estimates to ensure realistic commitments and early risk visibility. – Reviewing team output for quality, completeness, and alignment with technical standards.
It’s a match if you have:
– 5+ years of experience in infrastructure, DevOps, or platform engineering roles, including at least 2 years in a lead or senior position. – Strong hands-on experience with AWS (EC2, RDS, S3, ECS/EKS, Lambda, VPC, IAM, CloudFront, Route53, ElastiCache, SQS/SNS). – Experience with AWS Organizations (multi-account management, SCPs, consolidated billing). – Proficiency with Infrastructure as Code tools (Terraform, Ansible). – Deep knowledge of Docker and Kubernetes (autoscaling, network policies, cluster management). – Experience with Helm, ArgoCD (GitOps workflows), GitLab CI or GitHub Actions. – Strong monitoring expertise (Datadog, Prometheus, Grafana, ELK, PagerDuty, JSM). – Experience with Cloudflare (CDN, WAF, DDoS mitigation, Zero Trust). – Solid networking fundamentals (DNS, ALB/NLB, VPN, VPC peering, firewalls). – Strong security practices (IAM, Secrets Manager/Vault, CSPM, patch management, compliance readiness). – Experience with high-load systems, load testing (k6, Locust, Gatling), capacity planning. – Experience with databases (RDS PostgreSQL/MySQL, DocumentDB, Redshift, Redis/ElastiCache, Kafka). – Strong Linux administration and scripting skills (Bash, Python). – Proven ability to mentor engineers and validate technical estimates. – Strong communication skills and ability to proactively surface risks.