
Resilience Engineering Specialist
1 week ago
Observability and Resilience Expert
As a member of our team, you will play a key role in shaping the future of observability and resilient systems. We are looking for a talented individual who is passionate about providing service insights and solutions that enable availability, scalability, and reliability of our Always On customer-facing technology.
Your primary responsibility will be to provide metrics and frameworks that drive forward performance, availability, and resilience of the platform by standardizing and supporting our teams. You will cultivate excellent operational practices, identify automation opportunities, and work closely with diverse teams and technology to surface meaningful insights that drive better outcomes.
Responsibilities:
- Drive service reliability and engineering excellence by implementing and maintaining tooling that surfaces metrics using SLIs, SLOs, and SLAs.
- Drive and maintain event intelligence best practices working with teams to measure outcomes.
- Work in close collaboration across teams to shape the future roadmap to improve reliability and establish strong operational readiness across teams.
- Identify areas for improvement across WooliesX and drive technical change to automate operational outcomes to maximize reliability and minimize recovery time.
- Share your knowledge by giving brown bags, tech talks, and evangelizing technology and best practices.
- Contribute to Root Cause Analysis (RCA) investigations and implement appropriate APM solutions (such as Dynatrace) and automation as necessary.
- Roll out of new tools, technologies, and processes that have high business impact and are used by multiple teams that improve reliability and velocity.
- Contribute to documentation and uplifting of teams.
- Ability engage and collaborate with senior leadership and business stakeholders.
Requirements:
- Current hands-on experience in Site Reliability or Observability engineering.
- Prior experience in implementing SRE/Observability capability in a large-scale organization.
- A solid level of understanding of Observability & Event Intelligence best practice.
- Experience with implementation of Dynatrace across a diverse technology stack and ability to identify edge cases, failure modes, erroneous behavior, specific implementations.
- Strong scripting/programming experience with at least one of the following languages:.NET, Python, Java, Go, C# or similar is beneficial.
- You have hands-on experience with cloud infrastructure (AWS, GCE, Azure, Kubernetes, Docker).
- Formal Certification in one of the following: AWS, GCE, Azure, Kubernetes, Docker is beneficial.
- Experience with implementing Circuit breakers, Resilience frameworks, Fault tolerance, and self-healing mechanisms of services.
- Strong organizational and interpersonal skills, with experience developing and instilling a culture of operational maturity.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of ownership and drive.
Grow with the Group
At our company, we care deeply about creating a workplace where our team members feel valued, respected, and empowered. We are committed to providing equal opportunity regardless of gender identity, ethnicity, disability, sexual orientation, or life stage. We value flexibility, and encourage our team members to work in ways that meet their work/life commitments and support their wellbeing.
We work hard to create a safe and inclusive environment for all, and most importantly, we're all about creating better experiences - for our customers and for each other.
-
Resource and Resilience Specialist
1 week ago
Sydney, New South Wales, Australia beBeeResilience Full time $81,854 - $92,486Job Title: Resource and Resilience SpecialistThis is an exciting opportunity to drive resilience efforts through programs and projects that increase community capacity and build local urban resilience.About the Role:Lead collaboration with stakeholders and partners, developing and implementing programs and initiatives of the Resilience Plan against...
-
Establishing Resilient Systems Specialist
2 weeks ago
Sydney, New South Wales, Australia beBeeReliability Full time $150,000 - $170,000Reliable Infrastructure Specialist">As a Reliable Infrastructure Specialist, you will partner with cross-functional teams to develop and drive the adoption of best practices for infrastructure resilience across our organisation. The role requires close collaboration with all engineering teams as an embedded member to help solve the biggest challenges we...
-
Resilience Expert
7 days ago
Sydney, New South Wales, Australia beBeeRiskSpecialist Full time $161,000 - $199,000Risk Specialist, Technology Resilience JobThe role of a Risk Specialist in our Technology Resilience team is pivotal. We are seeking an expert to join us and assess the management of technology resilience across APRA-regulated entities.This role provides a unique opportunity to gain an industry-wide perspective and provide advice on current and emerging...
-
Technical Resilience Specialist
2 weeks ago
Sydney, New South Wales, Australia beBeeResilience Full time $180,000 - $250,000Job DescriptionAs a Technical Product Owner, you will be responsible for defining and delivering product visions and roadmaps for AIOps capabilities. This includes owning the product vision and roadmap for intelligent alerting, anomaly detection, predictive capacity/latency analytics, incident copilots, automated remediation, runbook orchestration,...
-
Community Resilience Specialist
1 week ago
Sydney, New South Wales, Australia beBeeCommunityResilience Full time $110,266 - $122,058Job Title: Community Resilience SpecialistAbout the Role:Community resilience is a strategic priority for many organizations. In this role, you will be part of a pioneering team that aims to foster social cohesion and community resilience.The Community Resilience Officer provides high-quality policy and program support to ensure the delivery of strategic...
-
Operational Resilience Specialist
1 week ago
Sydney, New South Wales, Australia Iag New Zealand Full timeCreate impact as a **Operational Resilience Specialist**Join the largest insurance group in Australia and New Zealand.**YOUR ROLE**As an Operational Resilience Specialist, you'll work collaboratively with Risk Advisors and First Line practitioners across the group to prepare, mature and safeguard the organisation during times of unexpected events. You'll...
-
Resilience Specialist
1 week ago
Sydney, New South Wales, Australia beBeeEnterprise Full time $80,000 - $150,000Business Resilience ProfessionalWe are seeking an experienced Business Resilience Professional to strengthen our operational resilience across key divisions.Support group resilience activities, including implementing the Operational Resilience PolicyAssist in the implementation and monitoring of the Operational Resilience Policy and supporting service...
-
Resilience Risk Management Specialist
1 week ago
Sydney, New South Wales, Australia beBeeRiskManagement Full time $100,000 - $150,000Operational Resilience Risk ManagerThis is a senior position to shape and strengthen resilience practices within an organization.Key Responsibilities:Deliver key assurance activities in our control testing program to ensure high-quality operations.Collaborate with leaders and engineering teams to build scalable web applications, APIs, and microservices that...
-
Risk Specialist, Technology Resilience
6 days ago
Sydney, New South Wales, Australia Australian Prudential Regulation Authority (Apra) Full time**The role**We're seeking a Risk Specialist to join our Technology Resilience team with the Cross-Industry Risk Division (CRD). This role is pivotal in assessing and advising on the management of technology resilience across APRA-regulated entities.You will gain a unique industry-wide perspective and provide advice concerning current and emerging technology...
-
Resilience Analyst
1 week ago
Sydney, New South Wales, Australia Insignia Financial Ltd Full timeResilience Analyst**Location**:SYDNEY, NSW, AU, 2000MELBOURNE, VIC, AU, 3008**Employment Type**:Permanent Full Time- Take the lead. Drive meaningful change.- Shape and steer risk strategy across the Group.- Permanent full-time opportunity, location agnostic**The opportunity to join our team.**- This role offers the chance to play a vital part in...