Reliability Engineer

6 days ago


Melbourne, Victoria, Australia Xero Full time

About the Role

Xero is a leading cloud-based accounting and bookkeeping platform that empowers small businesses and their advisors to grow and thrive. As a key member of our Reliability Enablement team, you will play a critical role in ensuring the reliability and robustness of our systems, enabling our customers to achieve their goals.

Key Responsibilities

  • Investigate operational surprises and support teams in post-incident activities to identify root causes and implement corrective actions.
  • Conduct in-depth incident analysis and maximize post-incident learning across the organization to drive reliability improvements.
  • Provide short-term reliability consultancy and enablement engagements, including SLO reviews and facilitating pre-mortems, to ensure our systems meet the highest standards of reliability.
  • Work closely with product engineering portfolios to uplift system reliability and robustness by improving on-call health and observability and addressing operational hotspots.
  • Support the delivery of strategic features and initiatives with reliability and distributed systems expertise, observing and improving rituals and practices relating to production operations, incident response, and incident learning.

Requirements

  • Solid experience in logging, monitoring, and observability of highly distributed systems.
  • Experience leading incident management and response, including critical, complex, and high-severity incidents.
  • Experience with post-incident reviews, incident analysis, and learning from incidents.
  • Experience working in a tech or product company with comparable scale and complexity.
  • Systems thinking and an understanding of how systems and components interact and respond to failure.
  • Proficiency in one or more object-oriented programming languages or experience with infrastructure-as-code.

Preferred Qualifications

  • Experience working with cloud providers such as AWS, Azure, or GCP.
  • Experience designing, developing, and operating distributed systems and large-scale software systems.
  • Strong experience delivering technical initiatives in an operational, site reliability, or platform engineering capacity.
  • The ability to solve engineering challenges outside of your own team, using influence rather than authority to enact change.
  • Demonstrated experience in reliability concepts like capacity management, autoscaling, deployment, and release safety.
  • Experience implementing customer-focused Service Level Objectives (SLOs).
  • Understanding of human factors, safety science, and resilience engineering.

Why Xero?

Xero offers a comprehensive benefits package, including generous paid leave, dedicated paid leave for physical and mental wellbeing, health insurance, life insurance, and income protection. We prioritize our employees' wellbeing and provide a range of programs to support their physical and mental health. Our Employee Resource Groups foster a sense of community and belonging, while our beautiful offices and flexible working arrangements promote work-life balance. Join us in making a difference for small businesses and their advisors around the world.


  • Reliability Engineer

    9 hours ago


    Melbourne, Victoria, Australia Adaps Full time

    Title: Site Reliability Engineer. Job Summary: Adaps is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the quality and reliability of our software products. Key Responsibilities: Design and implement scalable and reliable software systems. Develop and...


  • Melbourne, Victoria, Australia Adaps Full time

    About the Role: We are seeking a highly skilled Reliability Engineer to join our team at Adaps. As a key member of our engineering team, you will play a critical role in ensuring the reliability, availability, and scalability of our software systems. Key Responsibilities: Design and Implement Reliable Systems: Develop and maintain software systems that are highly...


  • Melbourne, Victoria, Australia Xero Full time

    About the Role: We are seeking an experienced Engineering Manager to lead our Site Reliability Engineering (SRE) team at Xero. As a key member of our engineering leadership team, you will be responsible for driving innovation, fostering a collaborative and inclusive team culture, and ensuring the reliability, scalability, and performance of Xero's products and...


  • Melbourne, Victoria, Australia Firesoft People Full time

    About the Role: Firesoft People is seeking a highly skilled Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our customers' enterprise systems. Key Responsibilities: Provide relief and sustainable resolution to issues within infrastructure. Use...


  • Melbourne, Victoria, Australia Firesoft People Full time

    About the Role: Firesoft People is seeking a highly skilled Reliability Engineering Specialist to join our team. As a key member of our Site Reliability Engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our customers' platforms and infrastructure. Key Responsibilities: Provide relief and sustainable resolution...


  • Melbourne, Victoria, Australia Firesoft People Full time

    About the Role: Firesoft People is seeking a skilled Site Reliability Engineer to join our team in Australia. As a key member of our global digital transformations, you will play a crucial role in maintaining and developing the reliability, scalability, and performance of our customers' platforms and infrastructure. Key Responsibilities: Provide relief and...


  • Melbourne, Victoria, Australia Xero Full time

    About the Role: Xero is a leading cloud-based accounting platform that empowers small businesses to thrive. As an Engineering Manager at Xero, you will lead our Site Reliability Engineering (SRE) teams, driving innovation and fostering a collaborative culture. Your expertise will ensure the reliability, scalability, and performance of our products and...


  • Melbourne, Victoria, Australia Fletcher Building Full time

    Job Title: Reliability Engineering Manager. Location: Cheltenham, Victoria. Fletcher Building is seeking a seasoned Reliability Engineering Manager to enhance operational effectiveness at our Cheltenham, Victoria site. This individual will champion the reliability journey over a transformative period,...


  • Melbourne, Victoria, Australia Firesoft People Full time

    About the Role: Firesoft People is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our global digital transformations, you will play a critical role in maintaining and developing the reliability, scalability, and performance of our customers' platforms and infrastructure. Key Responsibilities: Provide relief and sustainable...


  • Melbourne, Victoria, Australia Adaps Full time

    About the Role: We are seeking a highly skilled Reliability Engineering Specialist to join our team at Adaps. As a key member of our engineering team, you will play a critical role in ensuring the reliability, availability, and scalability of our software systems. Key Responsibilities: Design and Implement Reliable Systems: Develop and maintain software systems...


  • Melbourne, Victoria, Australia Fletcher Building Full time

    About the Role: We are seeking a seasoned Reliability Engineering Manager to enhance operational effectiveness at our site. This individual will champion the reliability journey over a transformative period, reporting to the Manufacturing Operations Manager. Key Responsibilities: Lead, mentor, and manage a complex maintenance team, fostering a culture of planning...


  • Melbourne, Victoria, Australia Microsoft Full time

    About the Role: The Azure Kubernetes Service (AKS) team is seeking a skilled Technical Program Manager II to join the AKS Fundamentals team and focus on platform reliability. As a key member of the team, you will define and track success metrics for platform reliability, drive empathy for customer issues, and lead programs to evolve engineering culture around...

  • Engineering Manager

    7 days ago


    Melbourne, Victoria, Australia Xero Full time

    About the Role: Xero is a leading cloud-based accounting and bookkeeping platform that empowers small businesses and their advisors to grow and thrive. As a key member of our Site Reliability Engineering (SRE) team, you will play a critical role in ensuring the reliability, scalability, and performance of our products and platforms. Key Responsibilities: Provide...

  • Engineering Manager

    7 days ago


    Melbourne, Victoria, Australia Xero Full time

    About the Role: Xero is a leading cloud-based accounting and bookkeeping platform that empowers small businesses and their advisors to succeed. As an Engineering Manager at Xero, you will lead and inspire our Site Reliability Engineering (SRE) teams, driving innovation and fostering a collaborative and inclusive team culture. Your expertise will ensure the...

  • Reliability Engineer

    9 hours ago


    Melbourne, Victoria, Australia Xero Full time

    About the Role: Xero is a leading cloud-based accounting platform that empowers small businesses and their advisors to thrive. As a Reliability Engineer in our Tooling and Engineering Health team, you will play a critical role in ensuring the reliability and robustness of our platform. Key Responsibilities: Design and develop robust software components to improve...


  • Melbourne, Victoria, Australia AGL Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our Integrated Energy Technology function. As a key member of our team, you will play a critical role in ensuring the reliability and efficiency of our cloud-native systems. As a Senior Site Reliability Engineer, you will work closely with our Site Reliability Engineering...


  • Melbourne, Victoria, Australia Xero Full time

    About the Role: Xero is a leading cloud-based accounting platform that empowers small businesses and their advisors to succeed. As a Reliability Engineer in our Tooling and Engineering Health team, you will play a critical role in ensuring the reliability and performance of our platform. Key Responsibilities: Design and develop robust software components to...


  • Melbourne, Victoria, Australia Pepperstone EU Limited Full time

    About the Role: We are seeking a highly skilled Senior Cloud Reliability Engineer to join our Platform Engineering Team at Pepperstone EU Limited. As a key member of our team, you will be responsible for driving the design, development, and maintenance of cloud reliability processes, systems, and services. Key Responsibilities: Contribute to the development of...


  • Melbourne, Victoria, Australia Xero Full time

    About the Role: Xero is a leading provider of cloud-based accounting and bookkeeping software for small businesses and their advisors. As a Reliability Engineer - Tooling and Engineering Health, you will play a critical role in ensuring the stability and performance of our platform. Key Responsibilities: Contribute to the development and maintenance of tools and...


  • Melbourne, Victoria, Australia AGL Full time

    About the Role: The Senior Site Reliability Engineer position is a critical engineering role focused on enhancing and maintaining the reliability and operability of our critical services. We are currently seeking an experienced and dedicated engineer to implement site reliability engineering principles and practices. Key Responsibilities: Resolving and reporting...