Senior SRE

2 days ago


Melbourne, Victoria, Australia Heidi Health Trading Pty Full time

Who are Heidi?

Heidi is on a mission to half the time it takes to deliver world class care.

We believe that in 2050 every clinician will practice with AI systems that free them from administrative burden and increase the quality and accessibility of care to patients across the world.

Today, we have a suite of tools that modernise documentation. Tomorrow, we'll equip every healthcare org with AI assistants that undo the tediums of clinical & non clinical work

Our team is a potent mosaic of sage, accomplished leaders & brilliant polymaths hungry to prove it. We achieve in 6 months what it takes our competitors 4 years to do.

We've raised our $10M Series A led by Australia's largest VC firm, Blackbird Ventures and in the midst of our next raise, with an ambitious global go-to-market strategy starting with the US & UK.

The Role

As a Senior Site Reliability Engineer at Heidi, you'll be instrumental in establishing and scaling our reliability practices while ensuring robust, secure, and observable systems.

You'll work closely with our engineering team to implement comprehensive monitoring, incident management, and reliability processes for our AI-powered healthcare solutions.

Primary Responsibilities:

Observability & Monitoring

  • Design and implement comprehensive observability strategies using Datadog, or other tooling that you are able to convince us with

  • Implement OpenTelemetry instrumentation across our backend and frontend services

  • Set up real user monitoring (RUM) and application performance monitoring (APM) to ensure end-to-end visibility

  • Create and maintain dashboards that provide meaningful insights for different stakeholders (technical teams, support, management)

  • Monitor and optimise third-party service integrations, particularly for critical services

Incident Management & Response

  • Establish and implement incident management processes from the ground up

  • Evaluate and implement appropriate incident management tools that integrate with our observability stack

  • Create and maintain incident response playbooks and automated runbooks

  • Lead post-incident reviews and foster a blameless culture

  • Implement and maintain on-call rotations and escalation policies

SLA & SLO Management

  • Define and implement SLOs that align with business requirements and customer expectations

  • Set up error budgets and tracking mechanisms

  • Create comprehensive SLA reporting for enterprise customers

  • Design and implement SLI metrics that provide meaningful insights into service health

Cost Optimisation & Efficiency

  • Optimise observability costs through efficient logging and metrics collection

  • Implement log management and retention strategies

  • Fine-tune alerting to minimise alert fatigue while maintaining service reliability

  • Evaluate and recommend cost-effective tooling solutions

Key Requirements:

  • Extensive experience with observability platforms (Datadog preferred) and understanding of observability architecture

  • Strong knowledge of OpenTelemetry and modern instrumentation practices

  • Experience implementing APM and RUM in Python and React/React Native environments

  • Track record of establishing incident management processes and fostering a blameless culture

  • Experience defining and implementing SLAs/SLOs for enterprise customers

  • Strong background in monitoring distributed systems and third-party service integrations

  • Experience with cloud infrastructure (AWS required, Azure and GCP beneficial)

  • Proven track record in implementing SRE practices and reliability improvements

Preferred Qualifications:

  • Experience with chaos engineering practices

  • Knowledge of automated runbook implementation

  • Healthcare industry experience

  • Understanding of HIPAA or similar healthcare compliance frameworks

What we will look for:

  • Problem-solving mindset with a focus on reliability and scalability

  • Strong communication skills to work with cross-functional teams

  • Ability to balance technical requirements with business needs

  • Experience in fast-paced startup environments

  • Dedication to maintaining high standards in a regulated environment

What do we believe in?

  • We create unconventional solutions to difficult problems and we build them fast. We want you to set impossible goals and make them happen, think landing a rocket but the medical version.

  • You'll be surrounded by a world-class team of engineers, medicos and designers to do your best work, inspired by our shared beliefs:

    • We will stop at nothing to improve patient care across the world.

    • We design user experiences for joy and ship them fast.

    • We make decisions in a flat hierarchy that prioritises the truth over rank.

    • We provide the resources for people to succeed and give them the freedom to do it.

Why you will flourish with us ?

  • Flexible hybrid working environment, with 3 days in the office.

  • Additional paid day off for your birthday and wellness days

  • Special corporate rates at Anytime Fitness in Melbourne, Sydney tbc.

  • A generous personal development budget of $500 per annum

  • Learn from some of the best engineers and creatives, joining a diverse team

  • Become an owner, with shares (equity) in the company, if Heidi wins, we all win

  • The rare chance to create a global impact as you immerse yourself in one of Australia's leading healthtech startups

  • If you have an impact quickly, the opportunity to fast track your startup career

#J-18808-Ljbffr
  • Cloud SRE Lead

    2 weeks ago


    Melbourne, Victoria, Australia Commonwealth Bank Full time

    At Commonwealth Bank, we're proud of our people and technology culture. Our SRE team marries both by applying Software Engineering principles to our operational services. We implement the latest industry-wide methodologies around observability practices.Our SRE teams work together to ensure seamless execution of our award-winning banking apps. We're...


  • Melbourne, Victoria, Australia Xero Full time

    Xero is committed to making life better for people in small business, their advisors, and communities around the world. Our purpose sits at the centre of everything we do.We support our people to do the best work of their lives so that they can help small businesses succeed through better tools, information and connections.As a Senior Technical Resource, you...

  • Senior SRE

    3 weeks ago


    Melbourne, Victoria, Australia black Full time

    Who are Heidi?Heidi is on a mission to half the time it takes to deliver world class care.We believe that in 2050 every clinician will practice with AI systems that free them from administrative burden and increase the quality and accessibility of care to patients across the world.Today, we have a suite of tools that modernise documentation. Tomorrow, we'll...

  • Senior SRE

    4 days ago


    Melbourne, Victoria, Australia black Full time

    Who are Heidi?Heidi is on a mission to half the time it takes to deliver world class care.We believe that in 2050 every clinician will practice with AI systems that free them from administrative burden and increase the quality and accessibility of care to patients across the world.Today, we have a suite of tools that modernise documentation. Tomorrow, we'll...


  • Melbourne, Victoria, Australia Nuage Technology Group Full time

    Nuage Technology Group provided pay rangeThis range is provided by Nuage Technology Group. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Base pay rangeA$112.50/hr - A$1,137.00/hrCo-Founder & Contracts Director @ Nuage Technology GroupGreenfield SRE OpportunityDaily Rate ContractNuage Technology Group...


  • Melbourne, Victoria, Australia beBee Careers Full time

    About the RoleWe are seeking a highly skilled Senior Cloud Architect to join our team, bringing expertise and innovation to help clients maximise the value of their cloud environments. As a Cloud Architect, you will focus on designing and implementing cloud systems for security, performance, and automation, ensuring our clients implement best practices in...


  • Melbourne, Victoria, Australia Xero Full time

    At Xero, we believe in empowering our people to do the best work of their lives. As a Senior Technical Resource, you will play a critical role in driving enduring reliability, world-class observability, and high-performing services across our product landscape.You will lead a dedicated product SRE team and provide technical leadership to ensure the...


  • Melbourne, Victoria, Australia Xero Full time

    Xero is seeking a highly technical Senior Technical Resource to join our Product SRE team. As a member of this team, you will be responsible for driving enduring reliability, world-class observability, and high-performing services across our product landscape.You will provide technical leadership to ensure the completion of day-to-day deliverables of a...


  • Melbourne, Victoria, Australia Commonwealth Bank Full time

    We're hiring engineers from across Australia and looking for experts who can partner with senior stakeholders and lead a culture of data-driven reliability. As a Staff Engineer in our SRE team, you'll be a technical leader, designing and implementing large scale solutions.Our teams ensure seamless execution of our award-winning banking apps. We're passionate...


  • Melbourne, Victoria, Australia Commonwealth Bank Full time

    We are accelerating our digital strategy with an ambition to provide customers with one of the best digital experiences globally. Our Site Reliability Engineering (SRE) teams ensure that our systems maintain the highest standards of service outcomes for our customers.As a Staff Engineer in our SRE team, you'll be a technical leader, designing and...