Principal Manager of Customer Reliability Engineering

7 days ago


Sydney, New South Wales, Australia Microsoft Full time

Overview


Are you eager to contribute to one of the most innovative products at Microsoft, dedicated to surpassing customer expectations and enhancing Microsoft's cloud-first approach? If you are enthusiastic about a dynamic environment, passionate about cloud computing technologies, and keen on fostering growth in one of Microsoft's essential businesses, the Azure Customer Experience (CXP) team is the right place for you. The Customer Reliability Engineering (CRE) team within Microsoft Azure offers customers scalable infrastructure and platforms to build, host, and expand service applications via Microsoft's extensive global data centers.

The Azure CXP CRE team is a pivotal segment of Azure Engineering, spearheading exceptional customer reliability initiatives, creating modern customer-first experiences at scale, and integrating profound customer insights and empathy into the broader Azure Engineering framework.

Our teams are dedicated to listening to customers continuously, driving enhancements and capabilities into services, support programs, service incident experiences, community engagements, and beyond.

Our commitment to a "no dead-ends" philosophy guarantees that every customer, regardless of their size or scale, can achieve their full potential through the Microsoft Cloud.

We are seeking a Principal Reliability Engineering Manager who is customer-focused and passionate about the reliability engineering of the Azure platform at scale, including aspects of availability and supportability.

This role entails accountability for enhancing customer experiences on Azure, diagnosing, and troubleshooting mission-critical customer applications developed on the Microsoft Azure platform.

The ideal candidate will exhibit a broad skill set while managing complex, highly available services, possessing a deep understanding of the underlying components (Azure Platform, Azure SDK, Azure Portal), and collaborating directly with customers, customer support, and engineering teams.

Our team is in search of a Principal Reliability Engineering Manager who will help advance a world-class infrastructure that supports a growing array of CXP customer programs.

You will be responsible for delivering essential, customer-facing features and collaborating across various Azure servicing teams to ensure they align with our customers' needs. You will maintain a keen awareness of cost of goods sold (COGS) at scale, developing features efficiently while ensuring the robustness of our infrastructure.



Responsibilities

  • Build and lead a team of reliability engineers in the region, delivering a world-class customer experience. Demonstrated ability to lead teams and collaborate across geographical regions while establishing strong partnerships.
  • Work closely with Engineering/PM to ensure the availability and performance of Live Site and customer satisfaction.
  • Participate in on-call coverage rotation, providing leadership to all customer-facing teams during incidents.
  • Enhance customer experience by analyzing signals from various sources, driving root cause analyses (RCAs), and implementing service improvements involving bug fixes.
  • Foster continuous improvement in the Azure platform by incorporating feedback from both internal and external customers.
  • Identify and implement customer-centric mitigation strategies and playbooks for operations.
  • Engage in the design of next-generation architecture for cloud infrastructure services, focusing on strategic customer support scenarios.
  • Exhibit enthusiasm, self-motivation, and a collaborative spirit.
  • Possess excellent collaboration, organizational, and time management skills.
  • Be data-driven with a focus on achieving business results for projects undertaken.


  • Sydney, New South Wales, Australia Microsoft Full time

    OverviewAre you enthusiastic about contributing to one of Microsoft's most innovative products, dedicated to surpassing customer expectations and promoting Microsoft's cloud-first initiative? If you are excited about a dynamic environment, passionate about cloud computing technologies, and eager to drive growth in a key area of Microsoft's business, the...


  • Sydney, New South Wales, Australia Microsoft Full time

    About the RoleWe are seeking a seasoned Principal Reliability Engineering Manager to join our Azure Customer Experience (CXP) team. As a key member of our organization, you will be responsible for leading a team of reliability engineers in delivering world-class customer experience on the Azure platform.Key ResponsibilitiesBuild and lead a team of...


  • Sydney, New South Wales, Australia eFinancialCareers Ltd. Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at eFinancialCareers Ltd. as we accelerate our digital strategy and provide customers with one of the best digital experiences globally.Key ResponsibilitiesDesign and implement large-scale solutions to ensure seamless execution of our award-winning banking...


  • Sydney, New South Wales, Australia Commonwealth Bank Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our SRE team at Commonwealth Bank. As a technical leader, you will be responsible for designing and implementing large-scale solutions, influencing and engaging senior stakeholders on modern best practices for improving reliability throughout the software development...


  • Sydney, New South Wales, Australia CommBank Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at CommBank. As a technical leader, you will be responsible for designing and implementing large-scale solutions, as well as influencing and engaging senior stakeholders on modern best practices for improving reliability throughout the software development...


  • Sydney, New South Wales, Australia CommBank Full time

    About UsWe're a venture-scaler powered by CommBank, building, buying, and investing in startups that benefit the bank's 15 million customers and beyond.We're a unique blend of corporate and startup, navigating the space between both worlds. We leverage the bank's strategy, scale, and stability while maintaining autonomy to try new things.Our CultureWe're a...


  • Sydney, New South Wales, Australia CommBank Full time

    About UsWe're a venture-scaler powered by CommBank, building, buying, and investing in startups that benefit the bank's 15 million customers and beyond.We're a unique blend of corporate and startup, navigating the space between both worlds. We leverage the bank's strategy, scale, and stability while maintaining autonomy to try new things.Our CultureWe're a...


  • Sydney, New South Wales, Australia CommBank Full time

    About the RoleWe are seeking a highly skilled and experienced Principal Site Reliability Engineer to join our team at CommBank. As a key member of our SRE team, you will play a critical role in ensuring the reliability and performance of our digital systems, enabling us to deliver exceptional customer experiences.Key ResponsibilitiesLead the design and...


  • Sydney, New South Wales, Australia Commonwealth Bank Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at the Commonwealth Bank. As a key member of our SRE team, you will play a critical role in ensuring the reliability and scalability of our digital systems.Key ResponsibilitiesDesign and implement large-scale solutions to improve system reliability and...


  • Sydney, New South Wales, Australia Atlassian Full time

    About the RoleWe are seeking a highly skilled Cloud Reliability Engineer to join our growing Site Reliability Engineering (SRE) teams at Atlassian. As a reliability expert, you will play a key role in scaling our Cloud services and ensuring the highest level of reliability, performance, scalability, and cost efficiency.Key ResponsibilitiesTake ownership of...


  • Sydney, New South Wales, Australia VIRTUE TALENT PTY LTD Full time

    About the RoleVirtue Talent Pty Ltd is seeking a highly experienced Principal Systems Assurance Consultant to join our team. As a key member of our engineering services team, you will play a critical role in delivering high-quality system safety and reliability assurance services to our clients in the rail and transportation industries.Key...


  • Sydney, New South Wales, Australia Atlassian Full time

    About AtlassianAtlassian offers flexible work arrangements, allowing employees to choose their work environment—whether in an office, remotely, or a hybrid model. This flexibility empowers Atlassians to better manage their personal and professional commitments. We are able to hire talent in any country where we maintain a legal presence, and our interview...


  • Sydney, New South Wales, Australia Atlassian Full time

    About AtlassianAt Atlassian, we empower our employees to choose their work environment – whether it be in an office, remotely, or a hybrid of both. This flexibility allows our team members to better manage their personal and professional commitments. We have the capability to hire talent globally, wherever we maintain a legal presence. Our interview and...


  • Sydney, New South Wales, Australia Atlassian Full time

    About AtlassianAt Atlassian, we empower our employees to choose their work environment, whether it be in an office, remotely, or a hybrid approach. This flexibility allows our team members to better manage their personal and professional commitments. We have the capability to hire talent globally in countries where we maintain a legal presence. Our interview...


  • Sydney, New South Wales, Australia Atlassian Full time

    About AtlassianAtlassian provides flexibility in work arrangements, allowing employees to choose between office, remote, or hybrid setups. This approach empowers our team members to balance their professional and personal lives effectively. We are open to hiring talent from any country where we have a legal presence, with virtual interviews and onboarding...


  • Sydney, New South Wales, Australia Atlassian Full time

    About AtlassianAtlassian offers flexible work arrangements, allowing employees to choose their work environment—whether in an office, remotely, or a blend of both. This flexibility empowers Atlassians to better manage their personal and professional commitments. We welcome talent from any country where we maintain a legal presence, and our interview and...


  • Sydney, New South Wales, Australia Atlassian Full time

    About AtlassianAt Atlassian, we empower our employees to choose their work environment, whether that be from home, in an office, or a hybrid of both. This flexibility allows our team members to better manage their personal commitments and aspirations. We welcome talent from any country where we have a legal presence, and our interview and onboarding...


  • Sydney, New South Wales, Australia GoPro, Inc. Full time

    Location: Flexible - This position allows for remote work while maintaining proximity to an office. Overview GoPro, Inc. is on the lookout for an exceptional candidate to fill the role of Principal Mechanical Engineer in a hybrid work setting. This position is pivotal in crafting cutting-edge mechanical design solutions for cameras, mounts, and...


  • Sydney, New South Wales, Australia Cover Genius Ltd Full time

    About Cover Genius LtdCover Genius Ltd is a leading insurtech company that provides innovative insurance solutions to global customers. Our mission is to protect people and businesses from unexpected events, and we're committed to delivering exceptional customer experiences.Job SummaryWe're seeking a highly skilled Site Reliability Engineer to join our team....


  • Sydney, New South Wales, Australia Atlassian Full time

    About the RoleWe are seeking a highly skilled and experienced Principal Software Engineer to join our Atlassian Cloud Storage Engineering (ACSE) team. As a key member of our team, you will play a critical role in designing, implementing, and operating our self-hosted search platform.Key ResponsibilitiesDesign and Implementation: Design and implement new and...