High Scale System Reliability Expert

5 days ago


Melbourne, Victoria, Australia beBeeReliability Full time $120,000 - $150,000

Reliability Engineer for High-Scale Systems

We are seeking an experienced engineer to join our team in building high-scale systems that deliver exceptional reliability and performance. This is a unique opportunity to work with a dynamic organization that prioritizes career growth and technical leadership.

The ideal candidate will have a strong background in Site Reliability Engineering, Platform Engineering, or DevOps, with a proven track record of setting and managing Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) for production systems.

Key Responsibilities:

  • Define, measure, and deliver against SLIs, SLOs, and SLAs to guarantee system reliability.
  • Build playbooks, lead incident response, and run blameless post-mortems to continuously improve resilience.
  • Implement advanced monitoring, logging, and alerting tools to detect and resolve issues proactively.
  • Identify bottlenecks and optimize performance to ensure seamless scalability with demand.
  • Maintain consistency and repeatability by automating infrastructure using Terraform/CloudFormation.
  • Own disaster recovery, backup strategies, and fault-tolerant designs that safeguard customer data.
  • Work closely with development teams to embed reliability into the software delivery lifecycle.
  • Integrate security best practices into infrastructure and operations.

Requirements:

  • 5+ years in Site Reliability Engineering, Platform Engineering, or DevOps within high-scale environments.
  • Proven track record of setting and managing SLIs/SLOs/SLAs for production systems.
  • Experience in incident management: on-call rotations, root cause analysis, and post-mortem culture.
  • Deep knowledge of cloud platforms (AWS) and expertise with Infrastructure as Code (Terraform, CloudFormation).
  • Strong background in observability tooling: Prometheus, Grafana, CloudWatch, OpenSearch/ELK, or equivalent.
  • Proficiency in scripting/automation (Python, Bash, or Go preferred).
  • Exposure to resilience engineering: chaos testing, fault injection, and recovery strategies.
  • Strong understanding of cloud security and compliance practices.

About Us

We offer a collaborative environment where engineers can thrive, prioritize career growth, and provide opportunities for technical leadership and ownership of reliability domains. Our hybrid flexibility allows employees to work from a central location and remotely, in a setup that fits their lifestyle.

This is a unique opportunity to shape the future of technology and contribute to building high-scale systems that deliver exceptional reliability and performance.



  • Melbourne, Victoria, Australia beBeeReliability Full time $96,836 - $105,068

    System Reliability ExpertWe are seeking a skilled System Reliability Engineer to join our team. The ideal candidate will have a strong background in software engineering, DevOps, operations, or cloud engineering.Job DescriptionThe primary responsibility of this role is to ensure the reliability, scalability, and performance of our systems. This includes...


  • Melbourne, Victoria, Australia beBeeReliability Full time $185,462 - $251,784

    Reliability Engineer - Lead Scaling and PerformanceWe're seeking a seasoned Reliability Engineer to spearhead the scaling of our infrastructure, observability, and performance across the business. This high-trust, high-impact role requires a sharp individual who can reimagine how people connect.As a lead in this position, you'll own and evolve SRE...


  • Melbourne, Victoria, Australia beBeeReliability Full time $108,893

    About the RoleThis position is focused on ensuring system reliability, scalability, and performance. The primary goal is to maintain production systems that are reliable, performant, and scalable.Key ResponsibilitiesDefine Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs) for reliability.Monitor system...


  • Melbourne, Victoria, Australia beBeeReliability Full time $108,571 - $119,893

    **Reliable Systems Expertise Wanted**We are seeking a seasoned expert in system reliability to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the stability, scalability, and performance of our systems.Key Responsibilities:Design and Implement Reliable SystemsDevelop Service Level Objectives (SLOs), Service Level...


  • Melbourne, Victoria, Australia beBeeSre Full time $130,000 - $180,000

    We are shaping the future of human connection by building innovative tools for professionals to stay connected.Job DescriptionYou will lead the evolution of our infrastructure as we scale, designing resilient systems and automating deployment processes.You will work alongside existing engineers to drive reliability improvements in partnership with product...

  • Reliable Systems Expert

    38 minutes ago


    Melbourne, Victoria, Australia beBeeSystemsEngineer Full time $140,000 - $164,000

    About the Role:This role is responsible for ensuring that all project designs meet operational and maintenance requirements throughout the project lifecycle.The ideal candidate will provide expert leadership in technical reviews, manage requirements traceability, and ensure systems are designed for reliability, functionality, and operator-led outcomes.Key...


  • Melbourne, Victoria, Australia beBeeBackend Full time $180,000 - $240,000

    Job Description">As a Staff Backend Engineer, you will lead the design and delivery of core systems across our platform, scaling infrastructure that's fast, reliable, and secure.You will be responsible for building backend systems and APIs that scale with our global growth, leading high-impact initiatives across authentication, permissions, usage tracking,...


  • Melbourne, Victoria, Australia beBeeReliability Full time $108,571 - $119,893

    About reliability engineering. Reliability engineering is a discipline that focuses on ensuring the reliability, scalability, and performance of complex systems.Role OverviewWe are seeking a System Reliability Engineer to join our team in Melbourne, Australia. This role will focus on defining Service Level Objectives (SLOs), monitoring system performance,...


  • Melbourne, Victoria, Australia beBeeSystemAdministration Full time $100,000 - $120,000

    Job Summary:We are seeking a highly skilled System Operations Manager to oversee the management of large-scale operational systems. This role is ideal for individuals who enjoy providing expert technical support and administering databases and applications.Key Responsibilities:Manage and administer large-scale operational systems efficiently, including...


  • Melbourne, Victoria, Australia beBeeEngineer Full time $108,571 - $119,893

    Site Reliability EngineerThe primary focus is ensuring system reliability, scalability and performance.Key Responsibilities:Define SLOs, SLIs and SLAs for reliability.Monitor system performance and reduce toil.Capacity planning scaling.Automate reliability improvements.Ensure production systems are reliable, performant and scalable.Required Skills and...