
Staff Site Reliability Engineer Australia
2 weeks ago
Aerospike is thereal-time databaseformission-critical use cases and workloads, includingmachine learning, generative, and agentic AI.Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.Global leaders, includingAdobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair,rely on Aerospike forcustomer 360, fraud detection, real-time bidding,profile stores, recommendation engines,and other use cases.At Aerospike, we dream big and deliver even bigger.
Our mission is to unleash the power of the world's real-time data with a database built for infinite scale, speed, and sustainability.If you're ready to shape the future of data, join us.Staff Site Reliability EngineerAs a Staff Site Reliability Engineer at Aerospike, you'll be a technical leader within our global SRE organization, helping drive reliability, performance, and scalability across our hybrid and multi-cloud environments.
You'll bring deep operational experience and lead by example—mentoring others, designing resilient systems, and championing modern SRE practices across new and legacy platforms.You'll play a key role in shaping the direction of our infrastructure initiatives, from Kubernetes-based platforms like AKS and the Aerospike Kubernetes Operator to existing services in AWS and GCP.
Your impact will span teams and systems as you solve complex problems, influence architecture, and foster a culture of ownership, resilience, and continuous improvement.Key ResponsibilitiesProvide technical leadership across multiple systems and environments, proactively identifying risks, shaping architecture decisions, and improving reliability and performance at scale.Lead key infrastructure efforts including Kubernetes platform expansion (AKS, AKO), and application of SRE principles to legacy systems and new cloud offerings.Define, measure, and enforce reliability standards through SLIs/SLOs, observability tooling, and incident response frameworks.Mentor and guide other SREs by leading design sessions, conducting technical deep dives, and reviewing code, configurations, and infrastructure decisions.Partner with product, engineering, and cloud teams to align reliability goals with delivery objectives.Lead root cause analyses and implement systemic fixes for issues spanning multiple platforms or services.Drive automation-first approaches using IaC, CI/CD pipelines, and scripting to reduce toil and increase deployment confidence.Influence cross-functional roadmaps, identifying areas for innovation, technical debt reduction, and long-term scalability.Participate in the global on-call rotation, bringing senior-level calm and clarity during incidents and escalations.Required Experience8+ years of experience in SRE, DevOps, or infrastructure engineering, including significant time operating production systems at scale.Deep hands-on experience with at least one major public cloud (AWS, GCP, Azure), and working knowledge of the others; Azure experience is a plus.Production experience with Kubernetes, including operating clusters, Helm, operators, and supporting microservices in real-world environments.Strong proficiency in infrastructure-as-code tools such as Terraform and CI/CD automation platforms.Expertise in observability tools and practices (Datadog, Prometheus, Grafana, ELK, etc.) and using them to define SLIs and SLOs.
; DataDog experience is a plusProgramming and scripting ability in one or more languages (Python, Go, Bash, etc.).
Experience with large-scale incident response and post-incident review practices.Proven ability to mentor other engineers and influence technical strategy across multiple teams.Strong communication skills to articulate complex concepts to technical and non-technical stakeholders.Preferred Skills and QualificationsHands-on experience managing and optimizing database deployments and services in production environments, ensuring high availability and performance.Familiarity with Aerospike or other distributed databases is a plus.Kubernetes or cloud certifications (CKA, CKS, AWS/GCP DevOps/Architect) a plus but not requireTrack record of influencing architectural decisions across teams or domains.Aerospike is an Equal Opportunity Employer.
We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.Create a Job AlertInterested in building your career at Aerospike?
Get future opportunities sent straight to your email.Apply for this job*indicates a required fieldFirst Name *Last Name *Email *PhoneResume/CVEnter manuallyAccepted file types: pdf, doc, docx, txt, rtfEnter manuallyAccepted file types: pdf, doc, docx, txt, rtf
#J-18808-Ljbffr
-
Staff Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia Commonwealth Bank Full timeYou are passionate about SRE and systems engineering We are undergoing one of Australia's largest digital transformations Together we can reimagine banking for millions of customers Do work that matters We're accelerating our digital strategy with an ambition to provide customers with one of the best digital experiences of any company globally.Site...
-
Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia Kindred Group plc Full timeJoin to apply for the Site Reliability Engineer role at Kindred Group plc3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Kindred Group plcGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Kindred Group plcTA Partner at FDJ United (formerly known as Kindred) l...
-
Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia Kindred Group plc Full timeJoin to apply for the Site Reliability Engineer role at Kindred Group plc3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Kindred Group plcGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Kindred Group plcTA Partner at FDJ United (formerly known as Kindred) l...
-
Site Reliability Engineer
1 week ago
Sydney, New South Wales, Australia Kindred Group plc Full timeJoin to apply for the Site Reliability Engineer role at Kindred Group plc3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Kindred Group plcGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Kindred Group plcTA Partner at FDJ United (formerly known as Kindred) l...
-
Staff Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia Commonwealth Bank Full timeYou are passionate about SRE and systems engineeringWe are undergoing one of Australia's largest digital transformationsTogether we can reimagine banking for millions of customersDo work that mattersWe're accelerating our digital strategy with an ambition to provide customers with one of the best digital experiences of any company globally. Site Reliability...
-
Staff Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia Safetyculture Full timeSafetyCultureis a global technology company that is helping to transform workplaces around the world.After witnessing the tragedy of workplace incidents as a private investigator,SafetyCultureFounder Luke Anear recruited a team to help him develop a mobile solution for frontline workers.What we have created is a market-leading workplace operations platform...
-
Principal Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia Commonwealth Bank Full timeJoin to apply for the Principal Site Reliability Engineer role at Commonwealth Bank1 week ago Be among the first 25 applicants Join to apply for the Principal Site Reliability Engineer role at Commonwealth Bank Get AI-powered advice on this job and more exclusive features.You are passionate about SRE and systems engineering We are undergoing one of...
-
Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia Macquarie Group Full timeJoin to apply for the Site Reliability Engineer role at Macquarie Group2 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Macquarie GroupGet AI-powered advice on this job and more exclusive features.Join our world class SRE team providing services for Macquarie Banking and Financial Services. The SRE function...
-
Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia Macquarie Group Full timeJoin to apply for the Site Reliability Engineer role at Macquarie Group2 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Macquarie GroupGet AI-powered advice on this job and more exclusive features.Join our world class SRE team providing services for Macquarie Banking and Financial Services. The SRE function...
-
Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia TikTok Full timeSite Reliability Engineer - AML Global Recommendation - USDSSite Reliability Engineer - AML Global Recommendation - USDS2 days ago Be among the first 25 applicantsResponsibilitiesAbout the Team:Site Reliability Engineering (SRE) of the AML (Applied Machine Learning) team combines system engineering and the art of machine learning to develop and run a...