Lead Site Reliability Engineer
6 days ago
At Xero, we're here to help you supercharge your business. We do this by automating routine tasks, surfacing actionable insights and connecting businesses with the right data, advisors and apps. When that happens, we're not only making life better for small business, we're building a stronger economy that can change the world.
About the team
Xero's Incident and Problem Management team is part of the Site Reliability Engineering (SRE) organization and is responsible for building, delivering and maintaining robust processes and tooling for incident management.
The team drives enduring reliability at Xero through a robust, consistent and fast response to high-severity incidents. It is responsible for building a world-class process and ensuring that the process matures as the demands of the business grow.
About the role
We're looking for a Lead Engineer to join Xero's Incident and Problem Management team. This position requires an experienced SRE professional with a strong technical background, a passion for building and delivering robust processes, and extensive experience leading the technical response to high-severity cloud issues.
You will drive best practice across the business and contribute to the ongoing transformation of the Xero SRE culture. As an expert communicator, you will lead technical discussions to identify and track the actions that arise during incidents.
Across our SRE function, we're looking for people who are keen to dig into the causes of incidents and proactively examine the potential causes of future incidents, working with engineering teams to remove the risk of each failure scenario and ultimately building playbooks and automation to ensure quick and effective responses. In addition, you will provide ongoing training across the business to ensure the process is well understood and adhered to.
This role will form the backbone of a new team, providing a Technical Duty Officer (TDO) function within the business. TDOs are incident commanders who use SRE skill sets to drive fast mitigation and enduring resolution of impactful events.
What you'll do
Own the incident management process, ensuring it drives enduring reliability across all products and services within Xero.
Provide expert leadership during critical outages, coordinating multiple teams to ensure streamlined decision-making and quick resolution.
Lead and advocate for the transformation to a world-leading SRE organization, promoting SRE principles within the Engineering Department.
Promote a customer-focused approach by addressing and mitigating global customer environment issues, and foster a culture of continuous learning and technical excellence within the SRE team.
Develop and implement scalable process frameworks and observability strategies to ensure rapid problem diagnosis, response, and service reliability.
Collaborate with product teams to thoroughly analyze failures and integrate insights to improve service reliability, scalability, and operational efficiency.
What you'll bring
Previous career experience as a Site Reliability Engineer, in an Operations or Engineering environment
Strong hands-on coding experience (preferably Python) and knowledge of software engineering best practice
Hands-on experience troubleshooting AWS hosted services
Networking knowledge and the ability to troubleshoot TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP issues
Strong communication (oral & written) skills, including the ability to translate technical issues/concepts into agreed actions
Why Xero?
We offer very generous paid leave to use however you'd like (plus statutory holidays), dedicated paid leave to care for your physical and mental wellbeing, and an Employee Assistance Program that gives you and your family access to mental health care. We also provide health insurance, life insurance, and income protection.
We offer wellbeing and sports programmes, employee resource groups, 26 weeks of paid parental leave for primary caregivers, an Employee Share Plan, beautiful offices, flexible working, career development, and many other benefits that reflect our human value
You'll do the best work of your life at Xero
-
Lead Site Reliability Engineer
1 week ago
Sydney, New South Wales, Australia beBeeSRE Full time $125,000 - $175,000 Job Title: Site Reliability Engineering Lead. Site reliability engineering combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. Our site reliability engineers ensure that Google Cloud's services have reliability, uptime appropriate to customer needs and a fast rate of improvement. They also keep...
-
Lead Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Macquarie Bank Limited Full time Lead Site Reliability Engineer | User Access Management Platforms. Our User Access Management team is a key pillar of cybersecurity and provides valuable services to our organization globally. At Macquarie, we value diverse people and empower them to shape possibilities. We are a global financial services group operating in 31 markets with 56 years of...
-
Lead Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia Macquarie Bank Limited Full time Lead Site Reliability Engineer | User Access Management Platforms. Our User Access Management team is a key pillar of cybersecurity and provides valuable services to our organization globally. At Macquarie, we value diverse people and empower them to shape possibilities. We are a global financial services group operating in 31 markets with 56 years of...
-
Lead Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Xero Full time Our Purpose: At Xero, we're here to help you supercharge your business. We do this by automating routine tasks, surfacing actionable insights and connecting businesses with the right data, advisors and apps. When that happens, we're not only making life better for small business, we'll be building a stronger economy that can change the world. About the...
-
Lead Site Reliability Engineer
1 week ago
Sydney, New South Wales, Australia Xero Full time $150,000 - $200,000 per year Our Purpose: At Xero, we're here to help you supercharge your business. We do this by automating routine tasks, surfacing actionable insights and connecting businesses with the right data, advisors and apps. When that happens, we're not only making life better for small business, we'll be building a stronger economy that can change the world. About the...
-
Reliability Site Lead
6 hours ago
Sydney, New South Wales, Australia beBeeLeadership Full time $113,038 - $143,625 Site Leadership Role. This leadership position involves managing a team of Electrical and Mechanical Supervisors, driving safety, reliability, and continuous improvement across the site. Lead and manage site maintenance operations to ensure safe, reliable, and efficient plant performance. Develop and implement preventative maintenance schedules to achieve high...
-
Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia Kindred Group plc Full time Site Reliability Engineer role at Kindred Group plc. TA Partner at FDJ United (formerly known as Kindred) l...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia FIS Full time FIS, Millers Point, New South Wales, Australia. Type of Hire: Experienced (relevant combo of...
-
Site Reliability Engineer
6 days ago
Sydney, New South Wales, Australia Buscojobs Full time Site Reliability Engineer, Sydney, Hybrid. Operations. Job Description: Site Reliability Engineer, IMC Trading | Sydney, Hybrid | Senior Level | Fintech / Software. Role: Ensure reliability and scalability of real-time trading systems. Provide rapid incident response, support and monitor trading platforms, collaborate with tech and trading teams to implement lasting...