
Site Reliability Engineer
1 week ago
Job Description:
As a
Site Reliability Engineer
, you'll be at the heart of ensuring the resilience and observability of the
Kindred Sportsbook Platform
. This isn't just about keeping the lights on; it's about building systems and solutions that thrive under extreme pressure. You'll collaborate with feature teams to extract rich telemetry and structured event data from their code, architect and build highly distributed observability solutions to handle massive data ingestion and develop tools that turn complex signals into actionable insights for technical and business stakeholders alike.
Your mission: to ensure our systems stay fast, scalable, and reliable—even when millions of bets are placed simultaneously during major international sporting events. Whether it's a local derby in Europe, a championship game in North America, or a global tournament, your work will be critical to delivering seamless betting experiences across continents. If you love solving complex and novel problems, architecting scalable and distributed systems, and being a key player in a high-stakes environment, we'd love to hear from you.
Responsibilities:
- Architect, build, and maintain large-scale, distributed telemetry pipelines and observability platforms that provide real-time insight into system performance and reliability.
- Design innovative solutions for telemetry challenges in our uniquely asynchronous and distributed ecosystem, ensuring high visibility across services.
- Act as a subject matter expert, collaborating with development teams to optimise instrumentation, observability tooling, and reliability strategies.
- Drive capacity planning and proactive performance optimisation, always pushing the envelope to anticipate and meet evolving business and customer needs.
- Partner with teams to define and refine four golden signals, service levels, and error budgets, ensuring we measure and improve critical user journeys and business impact.
- Take ownership in high-stakes production incidents, leading deep-dive investigations and implementing long-term solutions to prevent future disruptions.
- Develop, refine, and automate reliability-focused tooling, reducing toil and increasing engineering efficiency across the platform.
Skills and experience:
- Deep expertise in site reliability engineering concepts and practices.
- Advanced knowledge of observability and telemetry data principles, with hands-on experience in designing and implementing solutions at scale.
- Experience with Linux system administration and fundamentals.
- Solid understanding of network fundamentals, with an emphasis on Layer 7 protocols such as HTTP, gRPC, DNS, and TLS.
- Extensive experience with IaaS platforms, both cloud and on-prem.
- Strong experience with containerisation principles, tooling and orchestration.
- Proficiency in one or more of the following: Go, Python, C#, NodeJS or similar programming languages.
- Strong grasp of CI/CD automation and Infrastructure as Code (IaC) principles
Bonus Skills and Experience:
- Experience in fintech style operations.
- Experience working in large scale, low latency asynchronous systems.
- Hands-on experience with the Opentelemetry ecosystem.
- A keen interest in new technologies and industry trends in the SRE and observability space.
- Strong analytical and troubleshooting skills, with a systematic approach to problem-solving.
- Excellent verbal and written communication skills, with the ability to document systems, processes, and troubleshooting steps clearly.
Benefits:
- We are in a fantastic new office near Barangaroo, close to Wynyard station.
- Our office has a sports hub, if you want to challenge a mate to a game of table tennis or darts.
- Fancy a good cup of coffee? We have an in-house barista to get you that perfect cup
- Many social events to take part in (Melbourne Cup is just one of them).
- Great work life balance and flexibility.
- A continued commitment to employee development.
- Life insurance and income protection plans.
- Wellness benefits.
NEXT STEPS:
To apply for this role, you must be in Australia with a valid work visa, permanent residency, or citizenship. This is a permanent full-time role with a hybrid workplace policy that requires two days per week in the office.
-
Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia Kindred Group plc Full timeJoin to apply for the Site Reliability Engineer role at Kindred Group plc3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Kindred Group plcGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Kindred Group plcTA Partner at FDJ United (formerly known as Kindred) l...
-
Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia Kindred Group plc Full timeJoin to apply for the Site Reliability Engineer role at Kindred Group plc3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Kindred Group plcGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Kindred Group plcTA Partner at FDJ United (formerly known as Kindred) l...
-
Site Reliability Engineer
19 hours ago
Sydney, New South Wales, Australia Kindred Group plc Full timeJoin to apply for the Site Reliability Engineer role at Kindred Group plc3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Kindred Group plcGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Kindred Group plcTA Partner at FDJ United (formerly known as Kindred) l...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia FIS Full timeFIS Millers Point, New South Wales, AustraliaJoin or sign in to find your next jobJoin to apply for the Site Reliability Engineer role at FISFIS Millers Point, New South Wales, AustraliaJoin to apply for the Site Reliability Engineer role at FISGet AI-powered advice on this job and more exclusive features.Type Of HireExperienced (relevant combo of work and...
-
Site Reliability Engineer
1 week ago
Sydney, New South Wales, Australia Buscojobs Full timeSite Reliability Engineer Sydney, Hybrid Operations Job Description Site Reliability Engineer IMC Trading | Sydney, Hybrid Senior Level | Fintech / Software Role : Ensure reliability and scalability of real-time trading systems.Provide rapid incident response, support and monitor trading platforms, collaborate with tech and trading teams to implement lasting...
-
Site Reliability Engineer
4 days ago
Sydney, New South Wales, Australia Servicenow Full timeOverviewSite Reliability Engineer - SPP at ServiceNow, Millers Point, New South Wales, Australia.RoleAs a Site Reliability Engineer, you will:Provide relief and sustainable resolution to issues within our infrastructure.Use your experience in software development, systems engineering and networking to proactively prevent repeatable issues.Drive initiatives...
-
Site Reliability Engineer
6 days ago
Sydney, New South Wales, Australia Servicenow Full timeOverviewSite Reliability Engineer - SPP at ServiceNow, Millers Point, New South Wales, Australia.RoleAs a Site Reliability Engineer, you will:Provide relief and sustainable resolution to issues within our infrastructure.Use your experience in software development, systems engineering and networking to proactively prevent repeatable issues.Drive initiatives...
-
Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia Macquarie Group Full timeJoin to apply for the Site Reliability Engineer role at Macquarie Group2 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Macquarie GroupGet AI-powered advice on this job and more exclusive features.Join our world class SRE team providing services for Macquarie Banking and Financial Services. The SRE function...
-
Site Reliability Engineer
1 week ago
Sydney, New South Wales, Australia Macquarie Group Full timeJoin to apply for the Site Reliability Engineer role at Macquarie Group2 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Macquarie GroupGet AI-powered advice on this job and more exclusive features.Join our world class SRE team providing services for Macquarie Banking and Financial Services. The SRE function...
-
Site Reliability Engineer
5 days ago
Sydney, New South Wales, Australia Macquarie Group Full timeJoin to apply for the Site Reliability Engineer role at Macquarie Group2 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Macquarie GroupGet AI-powered advice on this job and more exclusive features.Join our world class SRE team providing services for Macquarie Banking and Financial Services.The SRE function is...