Lead Site Reliability Engineer

1 month ago


Melbourne, Victoria, Australia Xero Full time

Xero is a beautiful, easy-to-use platform that helps small businesses and their accounting and bookkeeping advisors grow and thrive.

At Xero, our purpose is to make life better for people in small business, their advisors, and communities around the world. This purpose sits at the centre of everything we do. We support our people to do the best work of their lives so that they can help small businesses succeed through better tools, information and connections. Because when they succeed they make a difference, and when millions of small businesses are making a difference, the world is a more beautiful place.

About the team

Xero's Product SRE teams will consist of dedicated, world-class Site Reliability Engineers, embedded into product teams to drive enduring reliability, world-class observability, and high-performing services.

The Lead Engineer will be the most senior technical resource in the team, ensuring teams are empowered to own and drive reliability across the product landscape.

About the role

This position requires a highly technical Lead Engineer with a strong engineering background, deep experience in SRE and a passion for enabling high-performing teams.

As a seasoned and relentless engineer, they will contribute to the company's Product SRE strategy and to the ongoing transformation of the Xero SRE culture. As an expert communicator, they'll manage change and ensure the value of robust systems is communicated clearly across the business.

The person in this role will become an acknowledged authority on the reliability, observability, operability, and performance of the product they are assigned to through continued delivery of high-quality solutions. We're looking for someone who can solve engineering problems beyond their own team and influence others to make changes.

Any experience with reliability concepts such as capacity management, autoscaling, safe deployment and releases, software strategies for reliability, fault tolerance, and graceful failure would be highly beneficial. An understanding of human factors, safety science, and resilience engineering is also valuable.

What you'll do:

  1. Provide technical leadership to ensure completion of the day-to-day deliverables of a dedicated product SRE team. Team members will be highly experienced Site Reliability Engineers with a strong culture of ownership, automation first, and constant quality of delivery.
  2. Build long-term relationships with product engineering teams, ensuring everyone can deliver on system reliability with a theme of continuous improvement.
  3. Champion observability best practices and drive their implementation across products so that impactful events are detected quickly.
  4. Build a culture of continuous improvement so that product reliability keeps improving and the impact of issues is reduced; create and actively monitor quality standards for SRE teams and report regularly on adherence to them.
  5. Build and deliver an Error Budget culture associated with consistent breaches of SLA/SLO.
  6. Provide ongoing training across the business to ensure reliability requirements are well understood and incorporated into product designs.

What you'll bring:

  1. Proven track record in technical leadership roles, with the ability to inspire and empower cross-functional teams to achieve operational excellence and drive continuous improvement.
  2. Extremely technical skillset, with a strong engineering and hands-on SRE background. Demonstrable experience of being the technical authority in a highly technical team.
  3. Deep and proven experience in providing technical leadership and mentoring in world-class embedded SRE teams in a fast-growing company.
  4. Obsessed with delivering a high-quality and highly stable customer experience. Passion for customer-first thinking, with a strong product mindset helping to understand and anticipate customer needs.
  5. Experience of building and delivering an error budget culture associated with consistent breaches of SLA/SLO, coupled with a 24/7 focus on incident response and remediation.
  6. Broad and deep technical understanding of modern cloud technologies (AWS, Azure, GCP) and their incident and problem management practices, particularly for high-growth, high-availability SaaS-based transactional systems.
  7. Proficiency in one or more object-oriented programming languages (C#, JavaScript, Java, Python, etc.) or experience with infrastructure-as-code (e.g. Terraform, CloudFormation).
  8. Experience using observability tooling to monitor the health of a highly distributed system.

Why Xero?

We offer very generous paid leave to use however you'd like (plus statutory holidays); dedicated paid leave to care for your physical and mental wellbeing; an Employee Assistance Program giving you and your family access to mental health care; health insurance, life insurance, and income protection; wellbeing and sports programmes; employee resource groups; 26 weeks of paid parental leave for primary caregivers; an Employee Share Plan; beautiful offices; flexible working; and career development. With these and many other benefits that reflect our human value, you'll do the best work of your life at Xero.
