Cloud Reliability Engineer Lead

7 days ago


Melbourne, Victoria, Australia Microsoft Full time
Overview

The Azure Kubernetes Service (AKS) team is responsible for delivering a managed Kubernetes service on Azure, enabling enterprises to build, deploy, and manage cloud-native applications with high uptime.

We are working to improve the infrastructure reliability and stability of AKS, ensuring our customers experience the best possible availability in the industry.

Job Description

We seek a skilled Cloud Reliability Engineer Lead to join the AKS Fundamentals team and focus on platform reliability. In this role, you will:

  • Define Success Metrics

You will develop and track metrics to measure and report on the reliability experienced by customers in Azure Kubernetes Service.

Analyze Customer Pain Points

You will gather insights from customers about their experience, analyze data, and prioritize customer pain points to inform improvement initiatives.

Launch Improvement Programs

You will design and drive programs to address quality gaps in existing processes and products, aiming to enhance the overall customer experience.

Collaborate Across Teams

You will partner closely with Azure Compute, Networking, Storage, and other teams to drive the roadmap and ensure alignment across Azure Infrastructure services.

About This Role

This position offers an estimated annual salary of $160,000 - $200,000, commensurate with experience, reflecting the critical nature of this role in driving platform reliability and enhancing customer satisfaction.

As a key member of the AKS team, you will be part of a dynamic organization focused on innovation and customer success.



  • Melbourne, Victoria, Australia Microsoft Full time

    OverviewWe are seeking a skilled Cloud Reliability Engineer to lead the Azure Kubernetes Service (AKS) Fundamentals team and focus on platform reliability.Azure Kubernetes Service is a managed container orchestration service that helps enterprises build, deploy, and manage cloud-native applications. To ensure high uptime for our customers, we work on...


  • Melbourne, Victoria, Australia Microsoft Full time

    About the RoleThe Azure Kubernetes Service (AKS) team is seeking a Technical Program Manager II to join the AKS Fundamentals team and focus on platform reliability. In this role, you will define a set of metrics to measure and report on the reliability experienced by customers in Azure Kubernetes Service. You will analyze the pain points of our customers,...


  • Melbourne, Victoria, Australia Microsoft Full time

    About UsMicrosoft Azure is a leading cloud platform that enables businesses to build, deploy, and manage applications with ease.Compensation: The estimated annual salary for this role is around $175,000.Job SummaryWe are seeking an experienced Cloud Reliability Engineer Leader to join our team at Microsoft Azure. As a key member of our Fundamentals team, you...


  • Melbourne, Victoria, Australia Microsoft Full time

    OverviewAzure Kubernetes Service (AKS) delivers a managed Kubernetes service on Azure to make enterprises successful at building, deploying, and managing cloud native applications. The team is responsible for building AKS' features and experiences, improving its fundamentals like reliability, scale, and performance as well as expanding its ecosystem with 1st...


  • Melbourne, Victoria, Australia FIS Australia Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at FIS Australia. As a Site Reliability Engineer, you will be responsible for ensuring the scalability, high availability, and performance of our software applications.Key Responsibilities:Focusing on scalability, high availability, performance, stability, and...

  • Reliability Engineer

    4 weeks ago


    Melbourne, Victoria, Australia Firesoft People Full time

    Role OverviewFiresoft People is seeking a skilled Reliability Engineer - Cloud Infrastructure Specialist to join our team. As a key member of our Site Reliability Engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our customers' platforms and infrastructure.Key ResponsibilitiesDesign, implement, and...


  • Melbourne, Victoria, Australia Firesoft People Full time

    Firesoft People is a leader in digital transformations and we are seeking a highly skilled Cloud Reliability Specialist to join our team. Based anywhere in Australia, this role offers an exciting opportunity to work on cutting-edge projects with a competitive salary of $120,000 - $180,000 per annum.About the RoleAs a Cloud Reliability Specialist, you will be...


  • Melbourne, Victoria, Australia Easygo Full time

    We are seeking a talented Cloud Infrastructure Engineer to join our team at Easygo in Melbourne, Australia. This is an exciting opportunity for someone who is passionate about designing, implementing, and maintaining scalable, reliable, and secure cloud-based infrastructure solutions.As a Cloud Infrastructure Engineer, you will work closely with our...


  • Melbourne, Victoria, Australia TENNIS AUSTRALIA Full time

    OverviewTennis Australia is a leader in creating a playful world through tennis, fostering a diverse and equitable workplace. Our Technology team seeks an experienced Azure-based DevOps Engineer to join our ranks.CompensationThe estimated salary for this role is $120,000 - $150,000 per annum, depending on experience.Job DescriptionWe are seeking a skilled...

  • Reliability Engineer

    3 weeks ago


    Melbourne, Victoria, Australia Xero Full time

    About the RoleXero is a leading cloud-based accounting software company that helps small businesses and their advisors succeed. We're looking for a highly skilled Senior Site Reliability Engineer to join our team and help us deliver a great customer experience through a better understanding of the behavior and operation of our systems.About the TeamThe...


  • Melbourne, Victoria, Australia XPT Software Australia Pty Ltd Full time

    We are seeking a skilled Cloud Engineering Lead to join our team at XPT Software Australia Pty Ltd. This role is responsible for leading the design, implementation, and management of cloud-based systems and infrastructure.Key Responsibilities:Leverage expertise in cloud technologies such as GCP to drive innovation and improvement in our cloud...


  • Melbourne, Victoria, Australia Firesoft People Full time

    Job DescriptionFiresoft People is seeking a skilled Cloud Infrastructure Reliability Specialist to join our team in Australia.Company OverviewWe are a leading provider of digital transformation services, helping companies across the globe maximize growth and deliver business value through next-gen technologies and hyperscale cloud-native services.About the...


  • Melbourne, Victoria, Australia Microsoft Full time

    OverviewThe Azure Kubernetes Service (AKS) team is responsible for delivering a managed Kubernetes service on Azure to enable enterprises to successfully build, deploy, and manage cloud-native applications. The team focuses on building AKS features and experiences, improving its fundamentals like reliability, scalability, and performance, as well as...


  • West Melbourne, Victoria, Australia FIS Full time

    About the Role:As a Site Reliability Engineer at FIS, you will be responsible for ensuring the scalability, high availability, and performance of our cloud-based applications. You will work closely with cross-functional teams to identify and resolve issues, and collaborate with our global team to implement best practices and automation.Key...

  • Reliability Engineer

    4 weeks ago


    Melbourne, Victoria, Australia Xero Full time

    About the RoleXero is seeking a highly skilled Reliability Engineer to join our Reliability Enablement team. As a key member of this team, you will help teams deliver a great customer experience through a better understanding of the behavior and operation of their systems.Key ResponsibilitiesInvestigate operational surprises and support teams in...


  • Melbourne, Victoria, Australia Firesoft People Full time

    Firesoft People is seeking a skilled Reliability Expert for Cloud Infrastructure to join our team based in Australia. With the right candidate, we can offer a highly competitive salary of $120,000 - $150,000 per annum.About the RoleWe are looking for an experienced Site Reliability Engineer who can bridge the gap between developers and IT operations in a...


  • Melbourne, Victoria, Australia Slade Group Full time

    We're seeking a skilled Senior Cloud Engineer to lead cloud transformation initiatives in a large enterprise organisation. Over the last 12 months, there has been significant change in Cloud, Security and Network environment.The ideal candidate will have strong understanding of network architecture, with hands-on experience in hub-and-spoke design, routing,...


  • Melbourne, Victoria, Australia ANZ Full time

    About ANZWe are a bank that aims to make a difference in people's lives. Our purpose is to shape a world where people and communities thrive. We believe that financial wellbeing is key to achieving incredible things, whether it's buying a home, building a business, or saving for something big or small.Your RoleWe are looking for a skilled Site Reliability...


  • Melbourne, Victoria, Australia Amazon Full time

    About the RoleWe are seeking an experienced Cloud Infrastructure Engineer Lead to join our team at Amazon. This is a unique opportunity to shape the future of cloud computing and work with a diverse group of talented individuals.As a Cloud Infrastructure Engineer Lead, you will be responsible for designing, building, and operating large-scale cloud...


  • Melbourne, Victoria, Australia Xero Full time

    About the RoleAs an Engineering Manager at Xero, you'll lead and inspire our SRE tooling teams, with a passion for production operations and the developer experience. You'll be instrumental in driving innovation, fostering a collaborative and inclusive team culture, and ensuring the reliability, scalability, and performance of Xero's products and...