Site Reliability Engineer
5 months ago
Overview
Are you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you!
Azure CXP CRE is a top-level pillar of Azure Engineering that leads world-class customer reliability initiatives. It provides modern, customer-centric experiences at scale and infuses deep customer insights and empathy throughout the Azure Engineering organization. Our teams continuously listen to customers, driving enhancements and new capabilities across services, support programs, incident response, community engagements, and more. Our "no dead-ends" philosophy ensures that every customer, regardless of size or scale, can realize their full potential through the Microsoft Cloud.
Azure CXP CRE is seeking a customer-focused Reliability Engineer passionate about customer reliability engineering, including availability, reliability, resiliency, and uptime at scale for the Azure platform. This role is accountable for improving customer experience on Azure and involves diagnosing and troubleshooting mission-critical customer applications built on the Microsoft Azure platform. The ideal candidate will demonstrate technical breadth while managing complex, highly available services and have a deep understanding of the underlying components (Azure Platform, Azure SDK, Azure Portal). They will work directly with customers, customer support, live site teams, and engineering.
To be successful in this role, you must have a proven track record of customer empathy, an engineering mindset, an aptitude for agility, and technical excellence in site reliability engineering.
Qualifications
Must have service engineering experience in a 24/7/365 enterprise environment.
Desired: Technical expertise in Azure services and capabilities or cloud platforms.
Fluency in one or more automation languages (e.g., PowerShell, CLI).
Strong communication skills that enable you to lead and manage communication with customers, internal Microsoft stakeholders, and third-party vendors.
Understanding of high availability, disaster recovery, business continuity, and performance tuning.
Demonstrates strategic thinking, quantitative and analytical skills, team leadership, and collaboration.
Excellent problem resolution, judgment, negotiation, and decision-making skills.
Desired: Strong knowledge of the Windows platform or Linux, developer tools, and the ability to diagnose and debug user code.
Effectively manage and prioritize multiple tasks according to high-level objectives and projects.
Excellent written and oral communication skills; ability to communicate with a variety of audiences, including high-profile customers, executive management, and engineering teams.
Desired: BS/BA in computer science, engineering, mathematics, or equivalent experience.
Security Screening
The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
#AzCXP
Responsibilities
Participate in an on-call coverage rotation (approximately 15% of the time) for platform communications and security.
Collaborate closely with engineering and product management teams to drive product improvements based on customer feedback.
Improve the customer experience by analyzing signals from various sources and driving root cause analyses (RCAs) and service improvements involving bug fixes.
Drive continuous improvement in the Azure platform by incorporating feedback from internal and external customers.
Identify and drive requirements for enhanced customer resiliency and platform reliability.
Identify and drive the implementation of customer-centric mitigation strategies and playbooks for operations.
Participate in the design of next-generation architecture for cloud infrastructure services, with a focus on strategic customer scenarios.
Be enthusiastic, self-motivated, and a great team player.
Demonstrate excellent collaboration, organizational, and time management skills.
Be data-driven with a focus on achieving business results in projects.
Demonstrate the ability to develop key partnerships.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Rackspace Full time Job Title: Site Reliability Engineer We are seeking a highly skilled Site Reliability Engineer to join our team at Rackspace. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure. Key Responsibilities: Design and implement highly scalable APIs using microservices...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia EFinancialCareers Ltd. Full time Site Reliability Engineer We are seeking a highly skilled Site Reliability Engineer to join our team at EFinancialCareers Ltd. in Sydney, Australia. Key Responsibilities: Collaborate with the development team to ensure smooth operation of software development and continuous support in production. Design and implement automation tools to improve system efficiency...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Palantir Technologies Full time Job Title: Site Reliability Engineer Palantir Technologies is seeking a highly skilled Site Reliability Engineer to join our Database Operations team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our databases and infrastructure. Key Responsibilities: Design and implement scalable...
-
Site Reliability Engineer
1 month ago
Sydney, New South Wales, Australia Cover Genius Ltd Full time About Cover Genius Ltd: Cover Genius Ltd is a leading insurtech company that protects the global customers of the world's largest digital companies. Our award-winning insurance distribution platform, XCover, is integrated with top partners to embed protection for millions of customers worldwide each year. Job Title: Site Reliability Engineer We're seeking a...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Cover Genius Ltd Full time About Cover Genius Ltd: Cover Genius Ltd is a leading insurtech company that protects the global customers of the world's largest digital companies. Our award-winning insurance distribution platform, XCover, is integrated with top partners to embed protection for millions of customers worldwide each year. Job Title: Site Reliability Engineer We're seeking a...
-
Site Reliability Engineer
1 month ago
Sydney, New South Wales, Australia Stake Australia Full time About Stake Australia: Stake Australia is a leading investment platform that provides a seamless and immersive experience for ambitious investors. With a global customer base of 500,000+ investors and over A$3 billion in assets under administration, we're committed to delivering high-quality execution and growth. Job Title: Site Reliability Engineer We're...
-
Site Reliability Engineer
1 month ago
Sydney, New South Wales, Australia DynaTrace Software GmbH Full time About the Role: We're seeking a skilled Site Reliability Engineer to join our team at Dynatrace Software GmbH. As a key member of our engineering team, you'll play a crucial role in ensuring the reliability and efficiency of our cloud-based software intelligence platform. Key Responsibilities: Translate manual tasks into automated processes using your insights...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Microsoft Full time Job Title: Site Reliability Engineer Are you passionate about delivering exceptional customer experiences and advancing Microsoft's cloud-first strategy? We're seeking a skilled Site Reliability Engineer to join our Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team. About the Role: As a Site Reliability Engineer, you will be responsible...
-
Site Reliability Engineer
2 months ago
Sydney, New South Wales, Australia Microsoft Full time Job Title: Site Reliability Engineer Are you passionate about delivering exceptional customer experiences and advancing Microsoft's cloud-first strategy? We're seeking a skilled Site Reliability Engineer to join our Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team. About the Role: As a Site Reliability Engineer, you will be responsible...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Dynatrace Full time About Dynatrace: Dynatrace is a leading software intelligence platform that helps organizations deliver flawless digital experiences. Our mission is to make the world's software work perfectly. Job Description: We're seeking a talented Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Palantir Technologies Full time About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Database Operations team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our databases and related systems. Key Responsibilities: Build expertise on pre-existing systems,...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Citadel Securities Full time Job Title: Site Reliability Engineer About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Citadel Securities. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our distributed systems and applications. Responsibilities: Design and implement scalable and efficient...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Cover Genius Ltd Full time About Cover Genius Ltd: Cover Genius Ltd is a leading insurtech company that protects the global customers of the world's largest digital companies. Our award-winning insurance distribution platform, XCover, is integrated with top partners to embed protection for millions of customers worldwide each year. Job Title: Site Reliability Engineer We are seeking a...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia FIS Australia Full time About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at FIS Australia. As a Site Reliability Engineer, you will be responsible for ensuring the scalability, high availability, and performance of our software applications. Key Responsibilities: Design and implement automation scripts to simplify operations and...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Westbury Partners Full time Site Reliability Engineer Westbury Partners is seeking a highly skilled Site Reliability Engineer to join our team in the APAC region. As a key member of our engineering team, you will play a critical role in ensuring the high availability and low latency of our systems. Responsibilities: Collaborate with the development team to ensure smooth software...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Citadel Securities Full timeJob Description Overview At Citadel Securities, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our distributed systems. Responsibilities * Collaborate with cross-functional teams to design, implement, and deploy...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Servicenow Full time Job Description: We are seeking a highly skilled Junior Reliability Engineer to join our Site Reliability Engineering (SRE) team at Service Now. As a key member of our team, you will play a critical role in maintaining and improving the reliability, scalability, and performance of our infrastructure. Key Responsibilities: Assist in resolving issues within our...
-
Site Reliability Engineer
1 month ago
Sydney, New South Wales, Australia Servicenow Full time Job Description: We are seeking a highly skilled Junior Reliability Engineer to join our Site Reliability Engineering (SRE) team at Service Now. As a key member of our team, you will play a critical role in maintaining and improving the reliability, scalability, and performance of our infrastructure. Key Responsibilities: Assist in resolving issues within our...
-
Site Reliability Engineer
1 month ago
Sydney, New South Wales, Australia Servicenow Full time About the Role: We're seeking a skilled Site Reliability Engineer to join our team at Servicenow. As a key member of our infrastructure team, you'll play a critical role in ensuring the reliability and performance of our cloud-based platform. Key Responsibilities: Assist in resolving issues within our infrastructure and provide sustainable solutions. Work with...
-
Site Reliability Engineer
4 hours ago
Sydney, Australia Microsoft Full time Overview: Are you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you! Azure CXP CRE is a top-level pillar of Azure Engineering that...