Senior DevOps Engineer – Observability and Platform Reliability

Platform SRE – Li-Shuai • Sydney, New South Wales 2000, Australia • Full-time

Role Type

Contract • Full-time • Senior

Description

Our ‘black belt’ specialists are leaders in their domains: platform and cloud champions, delivery-focused engineers, security-first practitioners, automation experts, and advocates for engineering best practice.

With a global presence and strong local expertise, we deliver modern, resilient platforms without compromising on quality. Our multidisciplinary teams solve complex problems at scale, embedding reliability, security, and operational excellence through our most senior technologists.

Empower Your Career with Us

Are you ready to lead a high-performing team that enables engineering excellence through world-class DevOps and platform practices? We are looking for a Senior DevOps Engineer specialising in observability and platform reliability who thrives in fast-paced environments, brings a strong sense of ownership, and is passionate about building scalable, secure, and reliable systems.

Position Overview

We are seeking an experienced Senior DevOps Engineer to own and evolve our cloud and platform capabilities. In this role, you will support the delivery of robust infrastructure, CI/CD platforms, and operational tooling that underpin modern application development and CloudOps at scale.

You will work closely with engineering, security, and delivery teams to uplift reliability, improve deployment velocity, and embed strong operational practices across services. A key pillar of this role is owning our observability strategy – ensuring engineering teams have deep, actionable visibility into the behaviour and health of the systems they build and operate.

Key Responsibilities

  • Assist in the design, implementation, and evolution of cloud infrastructure and DevOps platforms
  • Drive infrastructure-as-code practices across environments using tools such as Terraform or CloudFormation
  • Design and own the observability stack – including distributed tracing, metrics pipelines, and structured logging – to provide deep visibility into system behaviour at scale
  • Define and mature SLIs, SLOs, and error budgets in collaboration with engineering and product teams
  • Lead incident response tooling and post-incident review practices, using observability data to drive measurable reliability improvements
  • Collaborate with DevOps engineers to support modern architectures and deployment models, integrating observability instrumentation into CI/CD pipelines so that every deployment ships with coverage
  • Champion engineering excellence, automation, and continuous improvement across all engagements.

Experience

  • Experience working in autonomous DevOps teams
  • Experience in driving platform engineering initiatives in cloud environments (AWS preferred)
  • Deep hands-on expertise with CI/CD tooling and release automation
  • Proven experience designing and managing infrastructure using infrastructure-as-code
  • Hands-on experience with observability tooling such as OpenTelemetry and Grafana, including building dashboards, alert strategies, and trace analysis
  • Strong understanding of IAM and security controls
  • Solid grounding in SRE fundamentals, including SLOs, error budgets, and reliability-driven incident practice
  • Comfortable working across multiple teams, balancing delivery, reliability, and long-term platform evolution.

Bonus Points For

  • Experience in a serverless environment such as AWS Lambda
  • Strong scripting skills (e.g. Python, Bash)
  • Experience operating platforms in regulated or security-sensitive environments.

What We Offer

  • A collaborative, engineering-led culture with a strong focus on quality and outcomes
  • Opportunities to contribute to meaningful platform initiatives and shape technical direction
  • Competitive compensation
  • The opportunity to work on complex, large-scale systems alongside senior technologists.

If you are ready to lead platform engineering at scale and make a tangible impact on how modern software is built and delivered, we would love to hear from you.