Job Description
Summary
We are seeking a highly skilled Site Reliability Engineer with extensive experience in DevOps, infrastructure optimization, and incident reporting & monitoring. In this role, you will be working alongside other DevOps Engineers, Technical Architect and Developers to optimize performance of the Seedify platform.
Responsibilities:
- Infrastructure & IaC: Manage AWS infrastructure using Terraform/Terragrunt; optionally Pulumi or AWS CDK. Optimize cost, reliability, and scalability.
- Kubernetes Ops: Deploy and maintain Kubernetes clusters with Helm and Kustomize. Architect for high availability and zero downtime.
- CI/CD Automation: Own pipelines in GitHub Actions to improve release velocity.
- Observability: Implement monitoring and alerting using New Relic, Prometheus, Grafana, and OpenTelemetry. Create health dashboards and custom metrics.
- Incident & SLA Management: Define SLAs, lead incident response, and run postmortems to improve reliability.
- Dev Collaboration: Partner with engineers to embed reliability, monitoring, and alerting into the SDLC.
Skills & Qualifications:
- Core Tools: Kubernetes, Helm, Kustomize, Docker, Bash, Ansible.
- Cloud: Strong AWS experience (EC2, S3, EKS, RDS, Lambda, etc.).
- Observability Stack:, Prometheus, Grafana, New Relic, OpenTelemetry.
- CI/CD: ArgoCD, GitHub
- IaC: Terraform, Terragrunt; optional Pulumi/AWS CDK.
- Languages: Optional NodeJS or similar for automation.
- Certifications (optional but preferred): AWS Solutions Architect, Kubernetes Admin/Developer, or other cloud/DevOps certifications.
Experience:
- 3+ years in SRE or related roles.
- Hands-on infra ownership, incident response, and system optimization.
- Designed reliability-focused SDLC integrations and dashboards.
- Soft Skills:
- Collaboration: Works well across engineering, product, and ops.
- Ownership: Drives initiatives end-to-end, especially under pressure.
- Adaptability: Thrives in fast-paced, shifting environments
$3,000 - $4,000 a month
Skills
- AWS
- Communications Skills
- Development
- Software Engineering
- Team Collaboration