
High Scale System Reliability Expert
5 days ago
Reliability Engineer for High-Scale Systems
We are seeking an experienced engineer to join our team in building high-scale systems that deliver exceptional reliability and performance. This is a unique opportunity to work with a dynamic organization that prioritizes career growth and technical leadership.
The ideal candidate will have a strong background in Site Reliability Engineering, Platform Engineering, or DevOps, with a proven track record of setting and managing Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) for production systems.
Key Responsibilities:
- Define, measure, and deliver against SLIs, SLOs, and SLAs to guarantee system reliability.
- Build playbooks, lead incident response, and run blameless post-mortems to continuously improve resilience.
- Implement advanced monitoring, logging, and alerting tools to detect and resolve issues proactively.
- Identify bottlenecks and optimize performance to ensure seamless scalability with demand.
- Maintain consistency and repeatability by automating infrastructure using Terraform/CloudFormation.
- Own disaster recovery, backup strategies, and fault-tolerant designs that safeguard customer data.
- Work closely with development teams to embed reliability into the software delivery lifecycle.
- Integrate security best practices into infrastructure and operations.
Requirements:
- 5+ years in Site Reliability Engineering, Platform Engineering, or DevOps within high-scale environments.
- Proven track record of setting and managing SLIs/SLOs/SLAs for production systems.
- Experience in incident management: on-call rotations, root cause analysis, and post-mortem culture.
- Deep knowledge of cloud platforms (AWS) and expertise with Infrastructure as Code (Terraform, CloudFormation).
- Strong background in observability tooling: Prometheus, Grafana, CloudWatch, OpenSearch/ELK, or equivalent.
- Proficiency in scripting/automation (Python, Bash, or Go preferred).
- Exposure to resilience engineering: chaos testing, fault injection, and recovery strategies.
- Strong understanding of cloud security and compliance practices.
About Us
We offer a collaborative environment where engineers can thrive, prioritize career growth, and provide opportunities for technical leadership and ownership of reliability domains. Our hybrid flexibility allows employees to work from a central location and remotely, in a setup that fits their lifestyle.
This is a unique opportunity to shape the future of technology and contribute to building high-scale systems that deliver exceptional reliability and performance.
-
Expert System Reliability Professional
5 days ago
Melbourne, Victoria, Australia beBeeReliability Full time $96,836 - $105,068System Reliability ExpertWe are seeking a skilled System Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering, DevOps, operations, or cloud engineering.Job DescriptionThe primary responsibility of this role is to ensure the reliability, scalability, and performance of our systems. This includes...
-
Scaling High-Touch Systems for Maximum Impact
18 hours ago
Melbourne, Victoria, Australia beBeeReliability Full time $185,462 - $251,784Reliability Engineer - Lead Scaling and PerformanceWe're seeking a seasoned Reliability Engineer to spearhead the scaling of our infrastructure, observability, and performance across the business. This high-trust, high-impact role requires a sharp individual who can reimagine how people connect.As a lead in this position, you'll own and evolve SRE...
-
Reliable Systems Expert
2 weeks ago
Melbourne, Victoria, Australia beBeeReliability Full time $108,893About the RoleThis position is focused on ensuring system reliability, scalability, and performance. The primary goal is to maintain production systems that are reliable, performant, and scalable.Key ResponsibilitiesDefine Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) for reliability.Monitor system...
-
Systems Reliability Specialist
5 days ago
Melbourne, Victoria, Australia beBeeReliability Full time $108,571 - $119,893**Reliable Systems Expertise Wanted**We are seeking a seasoned expert in system reliability to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the stability, scalability, and performance of our systems.Key Responsibilities:Design and Implement Reliable SystemsDevelop Service Level Objectives (SLOs), Service Level...
-
Reliable Systems Engineer
1 week ago
Melbourne, Victoria, Australia beBeeSre Full time $130,000 - $180,000We are shaping the future of human connection by building innovative tools for professionals to stay connected.Job DescriptionYou will lead the evolution of our infrastructure as we scale, designing resilient systems and automating deployment processes.You will work alongside existing engineers to drive reliability improvements in partnership with product...
-
Reliable Systems Expert
38 minutes ago
Melbourne, Victoria, Australia beBeeSystemsEngineer Full time $140,000 - $164,000About the Role:This role is responsible for ensuring that all project designs meet operational and maintenance requirements throughout the project lifecycle.The ideal candidate will provide expert leadership in technical reviews, manage requirements traceability, and ensure systems are designed for reliability, functionality, and operator-led outcomes.Key...
-
Expert Systems Architect
3 days ago
Melbourne, Victoria, Australia beBeeBackend Full time $180,000 - $240,000Job Description">As a Staff Backend Engineer, you will lead the design and delivery of core systems across our platform, scaling infrastructure that's fast, reliable, and secure.You will be responsible for building backend systems and APIs that scale with our global growth, leading high-impact initiatives across authentication, permissions, usage tracking,...
-
System Reliability Specialist
5 days ago
Melbourne, Victoria, Australia beBeeReliability Full time $108,571 - $119,893About reliability engineering. Reliability engineering is a discipline that focuses on ensuring the reliability, scalability, and performance of complex systems.Role OverviewWe are seeking a System Reliability Engineer to join our team in Melbourne, Australia. This role will focus on defining Service Level Objectives (SLOs), monitoring system performance,...
-
Operate Large-Scale Systems
3 days ago
Melbourne, Victoria, Australia beBeeSystemAdministration Full time $100,000 - $120,000Job Summary:We are seeking a highly skilled System Operations Manager to oversee the management of large-scale operational systems. This role is ideal for individuals who enjoy providing expert technical support and administering databases and applications.Key Responsibilities:Manage and administer large-scale operational systems efficiently, including...
-
Reliable Systems Specialist
1 week ago
Melbourne, Victoria, Australia beBeeEngineer Full time $108,571 - $119,893Site Reliability EngineerThe primary focus is ensuring system reliability, scalability and performance.Key Responsibilities:Define SLOs, SLIs and SLAs for reliability.Monitor system performance and reduce toil.Capacity planning scaling.Automate reliability improvements.Ensure production systems are reliable, performant and scalable.Required Skills and...