Job Description

Summary

As an engineer in the Infrastructure department at Alchemy, you will design, deploy, and continuously improve the infrastructure powering our blockchain developer platform — serving 100+ chains, billions of daily requests, and over $150B in annual transactions.

The Infrastructure team provides the infrastructure, tooling, and expertise needed to allow Alchemy engineers to ship, scale, and operate high-quality products in a fast, safe, and cost-efficient manner.

What You'll Do
  1. Architect and operate scalable, self-healing infrastructure leveraging Kubernetes, Terraform, and cloud-native tools across multi-region deployments.
  2. Drive AI enablement across engineering — ensuring repos, tooling, and workflows are optimized for agentic development with tools like Claude Code, Cursor, and Codex.
  3. Build AI-powered infrastructure tooling and automation (e.g., automated K8s upgrades, IaC plan analysis, cost optimization advisors, MCP servers, n8n workflows).
  4. Build and maintain internal developer platform (IDP) capabilities for self-service deployments, observability, and reliability.
  5. Develop observability frameworks using Prometheus and Grafana for metrics, dashboards, and alerting.
  6. Lead incident management with blameless post-mortems; define and enforce SLIs, SLOs, and error budgets across services.
  7. Design and manage multi-cloud, multi-region network architecture — VPC design, IPAM, DNS (Cloudflare), cross-cloud connectivity, security groups, and edge-proxy/istio gateway configuration.
  8. Collaborate with security teams to embed compliance into infrastructure, including IaC scanning and runtime protection.
  9. Provide technical leadership and mentorship to elevate the team's operational capabilities.
What We're Looking For
  1. 5+ years as an Infrastructure Engineer focused on reliability (SRE, Production Engineer, Platform Engineer).
  2. Experience driving company-wide reliability efforts, including SLO frameworks and error budget policies.
  3. Strong proficiency with observability stacks: OpenTelemetry, Prometheus/Grafana.
  4. Deep experience with cloud infrastructure (AWS/GCP), Kubernetes, and multi-region architectures.
  5. Skilled with Terraform, Helm, and GitOps workflows (e.g., ArgoCD) with an automation-first mindset.
  6. Experience leveraging agentic development tools (Claude Code, Cursor, Codex) and workflow automation (n8n) to accelerate IaC and build internal tooling is a strong plus.
  7. Solid networking fundamentals — VPC design, DNS, IPAM, security groups, cross-cloud connectivity, and service mesh (e.g., Istio) experience is a plus.
  8. Calm and effective incident responder with a focus on systemic improvement.
  9. Strong cross-functional communicator across SRE, security, and product engineering.
  10. Blockchain infrastructure, distributed systems, or high-throughput RPC experience — not required but a plus.

The base salary range for this position is estimated to be between $135,000 - $240,000 annually. Please note this range reflects base salary only, and does not include bonus, equity, or benefits. Your salary will be determined by various factors, including relevant experience, skill set, qualifications, and other business needs.

Skills
  • AWS
  • Cloud Management
  • Communications Skills
  • Development
  • Software Engineering
  • Team Collaboration
© 2026 cryptojobs.com. All right reserved.