Job Description

Summary

As a Senior Site Reliability Engineer, you will play a critical role in architecting, building, and maintaining our infrastructure. You’ll lead initiatives to improve the reliability and scalability of our services, working cross-functionally with development, product, and security teams to ensure our systems are robust and resilient. You'll help establish and advocate for best practices, optimize performance, and proactively troubleshoot complex issues.

This role is ideal for an experienced SRE with a passion for automation, operational excellence, and driving system reliability at scale.

We are headquartered in SoHo, New York City, and this role can be partially or fully remote.

Key Responsibilities

Reliability and Performance:

  1. Design, implement, and maintain systems and processes that enhance the reliability, availability, and performance of our services.
  2. Design, implement, and maintain CI/CD tools and processes to increase reliability.
  3. Design, implement, and maintain cloud constructs to increase reliability.
  4. Develop and manage monitoring, alerting, and incident response strategies to minimize downtime and ensure rapid recovery from incidents.
  5. Conduct root cause analysis of system failures and implement preventative measures.
  6. Optimize system performance and automate repetitive tasks to improve operational efficiency.

Collaboration and Communication:

  1. Work closely with software engineering, infrastructure, and product teams to integrate reliability practices into the development lifecycle.
  2. Advocate for SRE best practices and foster a culture of reliability and operational excellence across the organization.
  3. Communicate effectively with stakeholders, providing regular updates on reliability metrics, incidents, and improvement initiatives.

Innovation and Improvement:

  1. Stay abreast of the latest industry trends and technologies in SRE, reliability, and performance.
  2. Continuously evaluate and improve existing systems and processes to enhance reliability and efficiency.
  3. Drive the adoption of new tools and technologies that can improve operational capabilities.

Qualifications

  1. Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  2. 5+ years of experience in site reliability engineering, DevOps, or a related field.
  3. Strong understanding of reliability engineering principles, practices, and tools.
  4. Proficiency in monitoring and alerting tools (e.g., Prometheus, Grafana, Nagios).
  5. Experience with cloud platforms (AWS, Azure, GCP), containers (Docker), and container orchestration systems (Kubernetes).
  6. Proficiency in scripting and automation tools, such as Python, Bash, Ansible, or Terraform.
  7. Excellent problem-solving skills and the ability to work under pressure in a fast-paced environment.
  8. Strong communication and interpersonal skills, with the ability to influence and lead teams.

Preferred Qualifications

  1. Experience with continuous integration and continuous deployment (CI/CD) practices and tools.
  2. Knowledge of configuration management tools (e.g., Puppet, Chef).
  3. Experience with database management and optimization.
  4. Familiarity with compliance frameworks and security best practices.
  5. Relevant certifications such as AWS Certified DevOps Engineer - Professional, Google Cloud Professional Cloud DevOps Engineer, or equivalent.

Minimum full-time salary of $198,000-$220,000. Disclosure in accordance with New York City's Pay Transparency Law. Full-time employees at Uniswap Labs are also eligible for other compensation elements, including equity, tokens, and benefits, dependent on the position type.

Skills
  • AWS
  • Development
  • Problem Solving
  • Python
  • Software Engineering