Job Description

Summary

We are seeking a new Senior Software Engineer to join the Platform Engineering team of our Custody department. You'll play a key role in shaping our engineering practices, ensuring the reliability and performance of our platform across a multi-cloud environment, and empowering our engineering team to deliver innovative features with speed and efficiency. You will drive improvements in our automation, observability, and overall platform stability.

WHAT YOU'LL DO

  1. Design, build, and maintain scalable and resilient infrastructure across Cloud providers such as Azure, AWS, GCP and IBM Cloud.
  2. Automating work including infrastructure needs, testing, failover solutions, failure mitigation, and much more.
  3. Implement and manage monitoring, alerting, and logging systems to ensure system visibility.
  4. Develop and maintain our CI/CD pipelines using GitLab CI.
  5. Proactively identify and resolve potential performance bottlenecks and reliability issues.
  6. Participate in on-call rotations to address production incidents and provide support for service engineers with customer incidents.
  7. Collaborate closely with development teams to integrate and deploy services efficiently.
  8. Contribute to the development and maintenance of our internal platform tools and services.
  9. Lead the implementation of standard methodologies for DevOps and SRE within the engineering organization.
  10. Document processes and procedures.
  11. Automate any software maintenance processes which previously required a manual procedure.

WHAT YOU'LL BRING

  1. 5+ years’ experience with software engineering, platforming engineering or system operations on high available and high traffic environments
  2. Strong experience with Linux-based infrastructures, Linux/Unix administration
  3. Experience with databases such as PostgreSQL
  4. Experience administering Linux servers as well as docker based infrastructure (like Kubernetes, AKS, etc.) in a highly available environment
  5. Experience of scripting languages such as Go or Bash
  6. Experience with message broker/queue technologies like RabbitMQ, AMQP 1.0
  7. Experience with modern monitoring, logging and observability tools in complex distributed systems such as with Application Insights, Grafana, Elastic stack, Datadog, Prometheus, etc
  8. Practical experience with infrastructure-as-code (with tools like Terraform)
  9. Familiarity with GitOps deployment practices using ArgoCD or Flux
  10. Good understanding of cybersecurity fundamentals and best practices
  11. Troubleshooting skills with the ability to spot issues before they become problems
  12. Excellent problem-solving and communication skills
  13. Committed to processes, with excellent documentation skills and a strong ability to work well in a team!

Skills
  • Cybersecurity Solutions
  • Development
  • Problem Solving
  • Software Engineering
  • SQL
© 2025 cryptojobs.com. All right reserved.