Job Description
Summary
About The Team
The DevOps group is a flat team of 15 engineers, with each DevOps engineer assigned to a specific R&D group.
Each DevOps engineer is the DevOps leader for that group. They serve as the point of contact for any project, taking part in system design and implementing any needed changes.
Each DevOps engineer is also responsible for a “DevOps project” - an infrastructure project aimed at bringing new capabilities and improvements to our system.
The DevOps group works as a team. They sit together, interact with each other and support each other on a daily basis.
Our infrastructure spans global multi-cloud environments, supporting critical production systems with zero-downtime requirements and a robust disaster recovery (DR) architecture.
We continuously improve our platform with the latest industry standards around availability, security, and scalability.
What You’ll Do
As a DevOps Staff Engineer, you will not be assigned to a specific R&D group. Instead, you will serve as a focal point for the DevOps engineers, helping and supporting them with any issue.
You’ll lead projects that span the DevOps group, push technical discussions forward, and work with each DevOps engineer as needed to solve diverse, complex problems at high scale.
You’ll support multi-region environments and build and maintain tools for automation, deployment, monitoring, and operations.
You’ll troubleshoot and resolve issues in our various environments.
You’ll play a key role in designing and enforcing infrastructure patterns that support zero-downtime deployments, high resilience, and compliance standards.
You’ll collaborate with teams across the company to define and drive forward scalable, production-grade architecture.
You’ll perform periodic on-call duties and take part in emergency response.
What You’ll Bring
10+ years of experience in the industry, including 6+ years of hands-on experience in high-scale SaaS companies or zero-downtime/disaster recovery enterprise environments (e.g., banking, cybersecurity, healthcare, or large-scale cloud platform providers).
5+ years of experience in DevOps roles across a minimum of 2 different companies, with strong hands-on experience in Kubernetes and AWS. Experience with hybrid or multi-cloud architectures is a strong plus.
Experience with on-call duties, managing critical infrastructure and application issues outside business hours to ensure high availability and reliability.
3+ years of experience with CI/CD tools such as GitLab, GitHub Actions, CircleCI, or similar.
2+ years of experience with programming languages such as Python or TypeScript. Strong Linux administration skills, including debugging and Bash scripting.
2+ years of experience with Terraform (experience with Terragrunt is a plus), as well as GitOps systems such as ArgoCD.
2+ years of experience with configuration management tools such as Ansible, Chef, or Puppet, and monitoring and alerting systems such as Datadog, Splunk, New Relic, or Grafana.
Strong understanding of networking concepts, including VPC, service meshes, routing, DNS, TLS, and firewalls.
Production-oriented mindset with a strong sense of ownership over reliability, scalability, and incident response.
Skills
- AWS
- Cybersecurity Solutions
- Development
- Leadership
- Software Architecture
- Software Engineering
- TypeScript