ComplyAdvantage

Site Reliability Engineer

Job Description

Posted on: May 9, 2025

What you will be doing:

Join our dynamic and collaborative technology team as a Site Reliability Engineer! You'll be at the heart of our operations, playing a pivotal role in ensuring the reliability, scalability, and performance of the critical services our customers depend on.

As part of the DevOps team within our Platform tribe, you'll collaborate with fellow SREs and other engineering teams to support the entire Technology organization and the wider company. The Platform tribe is dedicated to building and maintaining the foundational systems, tooling, and services that empower our developers to bring exceptional products to life and keep them running smoothly and securely in production. We're focused on standardizing key areas like cloud infrastructure, deployment pipelines, and observability, allowing product teams to concentrate on their core applications.

The DevOps team is crucial in architecting, building, and operating the tools that underpin our production environment. Recent initiatives include evolving our Internal Developer Portal, architecting a High Availability solution for Nexus, optimizing our observability costs, championing the adoption of Service Level Objectives (SLOs), and migrating to GitLab SaaS.

As a Site Reliability Engineer you will:

  • Design and Build: Architect, implement, and maintain highly available and reliable foundational services for CI/CD pipelines, observability platforms, and our Internal Developer Platform, which are essential for our engineering teams to deliver scalable services daily.
  • Ensure Reliability: Participate in an on-call rotation to respond to and resolve production incidents swiftly. Lead thorough post-incident reviews to identify root causes and implement preventative measures.
  • Automate Infrastructure: Manage and automate our cloud infrastructure using Terraform and Helm, adhering to GitOps best practices.
  • Collaborate Effectively: Partner closely with development and data engineering teams to ensure seamless deployments and provide robust operational support.

Our Tech Stack:

  • Cloud-Based Infrastructure: Fully cloud-based with a Kubernetes-focused tech stack. Compute workloads run in Kubernetes clusters across multiple regions.
  • Development is organised around Kotlin and Python for our backend languages and TypeScript/ES6+React for our frontend stack
  • We make substantial use of relational database technologies, notably Postgres and Yugabyte
  • We use an event-sourced model powered by Kafka for our communication bus and gRPC for our intra-service communication protocol
  • We use modern observability solutions from Grafana Cloud, build with GitLab tooling, and deploy our code using ArgoCD

We have a strong emphasis on engineering excellence and strive to ship the best possible code and the best possible solutions to our customers.

About you:

  • Deep expertise in cloud services (AWS and/or GCP).
  • Significant experience managing and troubleshooting services within Kubernetes environments.
  • Proven track record with CI/CD tooling.
  • Strong proficiency in observability platforms, including monitoring, alerting, and production operations.
  • Hands-on experience codifying infrastructure with Terraform and Helm charts.
  • Excellent incident response and troubleshooting abilities.
  • Proficiency in scripting and automation using Python.
  • Experience working with containerized workloads.
  • Experience collaborating with software engineers to support production cloud-native applications.

Nice to have:

  • Familiarity with ArgoCD, GitLab CI, and the Grafana, Mimir, Loki & Prometheus stack.

Education:

  • BSc/BA degree in computer science, engineering or related discipline OR relevant years of experience in required skills.

What’s in it for you? 

  • Equity, as we want you to have a part of what we are building
  • Private medical insurance, giving you peace of mind while you excel in your career.
  • Unlimited Time Off Policy - A work-life balance and focus on our well-being are critical to keeping us performing at our best 
  • We embrace a hybrid approach that requires employees to be in the office for two days a week. We strongly believe that this approach fosters collaboration and enables the building of meaningful relationships
  • You will also get a new starter budget to kit out your home office 
  • Opportunity to work on innovative projects with smart-minded people keen to share their knowledge and continuously improve 
  • Annual learning budget (prorated based on start date) to drive your performance and career development 

About us:

ComplyAdvantage is the financial industry’s leading source of AI-driven financial crime risk data and detection technology. Our mission is to neutralise the risk of money laundering, terrorist financing, corruption, and other financial crime. 

More than 1000 companies rely on us to understand the risk of who they’re doing business with through the world’s only global, real-time database of people and companies. Our solutions identify thousands of risk events daily from millions of structured and unstructured data points.

We have five global hubs in New York, London, Singapore, Lisbon and Cluj-Napoca and are backed by Goldman Sachs, Ontario Teachers, Index Ventures, and Balderton Capital. 

Since 2014, we have raised over $100 million in funding, and in 2022 alone we grew by over 40% to more than 500 people globally. Over the next 12 months, as our revenue increases, we plan to grow to 600.

 

At ComplyAdvantage, diversity fuels our rocket ship, and our commitment to inclusion across race, gender, age, religion, identity and experience drives us forward every day. We encourage everyone to apply and aspire to consider every application fairly.

We will handle your information in accordance with our Privacy Policy.