Site Reliability Engineer
7 months ago
Overview
Are you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you
Azure CXP CRE is a top-level pillar of Azure Engineering that leads world-class customer reliability initiatives. It provides modern, customer-centric experiences at scale and infuses deep customer insights and empathy throughout the Azure Engineering organization. Our teams continuously listen to customers, driving enhancements and new capabilities across services, support programs, incident response, community engagements, and more. Our "no dead-ends" philosophy ensures that every customer, regardless of size or scale, can realize their full potential through the Microsoft Cloud.
Azure CXP CRE is seeking a customer-focused Reliability Engineer passionate about customer reliability engineering, including availability, reliability, resiliency, and uptime at scale for the Azure platform. This role is accountable for improving customer experience on Azure and involves diagnosing and troubleshooting mission-critical customer applications built on the Microsoft Azure platform. The ideal candidate will demonstrate technical breadth while managing complex, highly available services and have a deep understanding of the underlying components (Azure Platform, Azure SDK, Azure Portal). They will work directly with customers, customer support, live site teams, and engineering.
To be successful in this role, you must have a proven track record of customer empathy, an engineering mindset, an aptitude for agility, and technical excellence in site reliability engineering.
Qualifications
Must have service engineering experience in a 24/7/365 enterprise environment. Desired: Technical expertise in Azure services and capabilities or cloud platforms. Fluency in one or more automation languages (e.g., PowerShell, CLI). Strong communication skills that enable you to lead and manage communication with customers, internal Microsoft stakeholders, and third-party vendors. Understanding of high availability, disaster recovery, business continuity, and performance tuning. Demonstrates strategic thinking, quantitative and analytical skills, team leadership, and collaboration. Excellent problem resolution, judgment, negotiation, and decision-making skills. Desired: Strong knowledge of the Windows platform or Linux, developer tools, and the ability to diagnose and debug user code. Effectively manage and prioritize multiple tasks according to high-level objectives and projects. Excellent written and oral communication skills; ability to communicate with a variety of audiences, including high-profile customers, executive management, and engineering teams. Desired: BS/BA in computer science, engineering, mathematics, or equivalent experience.Security Screening
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
#AzCXP
Responsibilities
Participate in an on-call coverage rotation (approximately 15% of the time) for platform communications and security. Collaborate closely with engineering and product management teams to drive product improvements based on customer feedback. Improve the customer experience by analyzing signals from various sources and driving root cause analyses (RCAs) and service improvements involving bug fixes. Drive continuous improvement in the Azure platform by incorporating feedback from internal and external customers. Identify and drive requirements for enhanced customer resiliency and platform reliability. Identify and drive the implementation of customer-centric mitigation strategies and playbooks for operations. Participate in the design of next-generation architecture for cloud infrastructure services, with a focus on strategic customer scenarios. Be enthusiastic, self-motivated, and a great team player. Demonstrate excellent collaboration, organizational, and time management skills. Be data-driven with a focus on achieving business results in projects. Demonstrate the ability to develop key partnerships. Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.Industry leading healthcareEducational resourcesDiscounts on products and servicesSavings and investmentsMaternity and paternity leaveGenerous time awayGiving programsOpportunities to network and connect-
Site Reliability Engineer
1 month ago
Sydney, Australia Microsoft Full timeOverviewAre you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you!Azure CXP CRE is a top-level pillar of Azure Engineering that...
-
Site Reliability Engineer Supervisor
4 weeks ago
Sydney, New South Wales, Australia VGW Full timeSite Reliability Engineer SupervisorVGW is an interactive entertainment company that harnesses technology and creativity to deliver world-class, free-to-play games.We are seeking an experienced Site Reliability Engineer Supervisor to join our Engineering team in Sydney.This role will focus on ensuring the reliability of our systems as we bring new games to...
-
Site Reliability Engineer
3 days ago
Sydney, New South Wales, Australia Immutable Full timeAbout The RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Immutable. As a key member of our SRE team, you will play a crucial role in shaping our infrastructure, observability, and tooling patterns.You will be responsible for developing and releasing infrastructure as code, creating and maintaining multiple Kubernetes...
-
Senior Site Reliability Engineer
1 month ago
Sydney, Australia Lanson Partners Full timeBuild a culture of reliability and develop robust reliability pattern Software Engineering background Sydney-based, 2 days working from home As a Senior Site Reliability Engineer, you will be working closely with software engineering teams and stakeholders to ensure the health and performance of the infrastructure and software. You will play a key role in...
-
Site Reliability Engineer
2 months ago
Sydney, Australia Talent International Full timeA growing FinTech provider is seeking a Site Reliability Engineer to join their team on a permanent basis. Working in a small, close-knit team based in their office in North Sydney, you will be responsible for the support, maintenance and administration of their cloud platform (AWS) as well as application monitoring (Prometheus), ELK / Elastic Stack and...
-
Senior Site Reliability Engineer
7 months ago
Sydney, Australia Firesoft People Full timeSenior Site Reliability Engineer Join a leading electronic trading firm in a pivotal role as a Site Reliability Engineer (SRE). At our firm, we are passionate about market-making and arbitrage opportunities on a global scale. Technology drives our success, fuelling our unified trading platform and enabling precise micro-decisions. With our agility and...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Firesoft People Full timeJob DescriptionFiresoft People is seeking a Senior Site Reliability Engineer with strong AWS skills to join our FinTech lending specialist team on a full-time basis. This position offers an exceptional salary package of up to $170,000 per year, plus an annual $2,000 allowance for training and certifications.About the RoleWe are looking for an experienced SRE...
-
Site Reliability Engineering Lead
3 weeks ago
Sydney, New South Wales, Australia Google Full timeAbout GoogleAt Google, we empower and support our employees to succeed by fostering a culture of diversity, equity, and inclusion. We believe that when everyone contributes, we can build better technology for everyone.We welcome Indigenous applicants and are committed to reconciliation through our technology, platforms, and people.Check out our...
-
Sydney, Australia Google Full timeinfo_outlineXInfo At Google, we have a vision of empowerment and equitable opportunity for all Aboriginal and Torres Strait Islander peoples and commit to building reconciliation through Google’s technology, platforms and people and we welcome Indigenous applicants. Please see our Reconciliation Action Plan for more information.At Google, we have a vision...
-
Site Reliability Engineering Manager
3 weeks ago
Sydney, New South Wales, Australia IT Operations & Services Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineering Manager to join our IT Operations & Services team at The Star Entertainment Group.Job DescriptionThe successful candidate will be responsible for leading the delivery of property-based IT initiatives, including infrastructure relocations and gaming floor moves. You will work closely...
-
Site Reliability Engineer Supervisor
2 months ago
Sydney, Australia VGW Full timeSite Reliability Engineer Supervisor VGW is an interactive entertainment company, harnessing technology and creativity to deliver world-class, free-to-play games. We have an exciting opportunity to join our Engineering team in Sydney and are currently looking for an Engineering Supervisor to join the team. You'll focus on ensuring the reliability of our...
-
Senior Site Reliability Engineer
3 weeks ago
Sydney, Australia Zip Co Full timeSenior Site Reliability Engineer Experience working in SRE/Devops with Dynatrace and Kubernetes. Work on high impact SRE projects where you’ll own and drive initiatives end to end. Hybrid, flexible working with two team connect days in the office per week. Write your story with Zip Join Zip’s Technology function, responsible for building and...
-
Site Reliability Engineer
5 months ago
Sydney, Australia Palantir Technologies Full timeA World-Changing Company Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role We’re looking for a Site Reliability Engineer...
-
Site Reliability Engineer
7 months ago
Sydney, Australia Freelancer.com Full timeSite Reliability Engineer Sydney, Australia Description About the Role:You will join a small team of versatile infrastructure engineers who are responsible for designing, building, and operating the mission-critical cloud platform powering , , and a number of other businesses within the enterprise. You will work with highly scalable FL/OSS services (Linux,...
-
Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia Firesoft People Full timeFiresoft People is a leading electronic trading firm that relies heavily on technology to drive its success. As a Senior Site Reliability Engineer, you will be the face of technology, working directly with our trading desks to ensure seamless operations.The role requires a talented individual who can contribute across our entire technology platform,...
-
Site Reliability Engineer
1 month ago
Sydney, New South Wales, Australia Citadel Securities Full timeSRE Role OverviewCitadel Securities is looking for a skilled Site Reliability Engineer to join our team. As a crucial member of our SRE team, you will be responsible for ensuring the reliability, availability, and performance of our financial systems. Your primary goal will be to design, implement, and maintain scalable, efficient, and highly available...
-
Site Reliability Engineering Expert
3 weeks ago
Sydney, New South Wales, Australia Dimensional Fund Advisors Full timeAbout Dimensional Fund AdvisorsWe are a forward-thinking organization leveraging cutting-edge technology to engineer scalable and innovative solutions that improve our clients' financial lives.Job DescriptionWe are seeking a seasoned Senior Site Reliability Engineer to join our team, responsible for managing our global investment data technology systems and...
-
Reliability Engineering Specialist
4 weeks ago
Sydney, New South Wales, Australia Macquarie Full timeAt Macquarie, a global financial services group operating in 34 markets, we're seeking an experienced Senior Site Reliability Engineer to join our Engineering Enablers team.We're committed to providing the most reliable products and service in the financial industry. As part of our team, you'll contribute to the delivery of software reliability and help...
-
Site Reliability Engineer
3 weeks ago
Sydney, Australia Atlassian Full timeWorking at AtlassianAtlassians can choose where they work – whether in an office, from home, or a combination of the two. That way, Atlassians have more control over supporting their family, personal goals, and other priorities. We can hire people in any country where we have a legal entity. Interviews and onboarding are conducted virtually, a part of...
-
Senior Site Reliability Engineer
7 months ago
Sydney, Australia Firesoft People Full timeSenior Site Reliability Engineer with strong AWS skills, sought to join a FinTech lending specialist on a full time basis. Up to $170K + Super! Key Points: Exceptional salary package on offer + annual $2K allowance for training/certifications of your choice.Brand new pipeline of project work off the back of a recent partnership with one of the big 4...