Site Reliability Engineer

19 hours ago


Sydney, New South Wales, Australia Cover Genius Full time $120,000 - $180,000 per year

About The Company
Cover Genius is a Series E Insurtech that protects the global customers of the world's largest digital companies including Booking Holdings, owner of Priceline, Kayak and , Intuit, Hopper, Skyscanner, Ryanair, Turkish Airlines, Descartes ShipRush, Zip and SeatGeek. We're also available at Amazon, Flipkart, eBay, Wayfair and SE Asia's largest company, Shopee.

Our partners integrate with XCover, our award-winning insurance distribution platform, to embed protection for millions of customers worldwide each year. Our team and products have been recognized sed with dozens of awards including by the Financial Times who ranked Cover Genius as the #1 fastest growing company in APAC in 2020. Our diverse team across 20+ countries and many language groups commits itself to diverse cultural programs, in particular "CG Gives" which makes social entrepreneurs out of us all and funds development initiatives in global communities.

Our People are Bold, Authentic, Purposeful and Inspired

Our People are not Perfect, Traditional, Complacent or Cautious

About The Role
As a Site Reliability Engineer on our Technology Team, you will own the reliable operation and continuous improvement of our production systems. Your primary purpose will be to ensure the seamless and secure functioning of our platforms and operations.

To drive success in this role, you will have a strong background in systems engineering and automation, with experience in release processes, observability, security, core network and infrastructure, and datastores and disaster recovery. You should possess excellent problem-solving skills, a keen attention to detail, and a proactive approach to identifying and mitigating potential issues.

As the Site Reliability Engineer, you will be responsible for:

  • Monitoring system health and ensuring operational stability and security
  • Automating and optimizing platform operations
  • Sharing ownership of production workloads with software engineering teams
  • Writing and maintaining technical documentation, including tutorials, guides, and blameless post-mortems
  • Designing and creating information dashboards based on logging and monitoring data
  • Collaborating with software engineers to drive automation, scalability, and efficiency across technology products and platforms
  • Regular collaboration with software engineering teams, security teams, and other relevant stakeholders will be key in ensuring the reliability and efficiency of our production systems are achieved.

Key Responsibilities

  • Analyze, test and modify systems to improve reliability and optimize performance particularly at an architectural/infrastructure level
  • Apply AWS and GCP knowledge and skills to create & maintain cloud infrastructure for software projects
  • Develop and maintain observability tooling and dashboards
  • Implement automation tools and frameworks, CI/CD pipelines, Reduce toil
  • Troubleshoot production issues and coordinate with the development team to streamline code deployments
  • Design, develop and implement software integrations
  • Collaborate with Software Engineers and other team members with the goal of improving engineering tools, systems, procedures and data security
  • Develop and maintain design and troubleshooting documentation and runbooks
  • Optimize and control costs of the company's computing infrastructure

Skills & Experience
What you will bring

  • Understanding of SRE Principles and best practices
  • Experience using & configuring modern observability tools such as Datadog, Elasticsearch, Prometheus, Grafana
  • Experienced with container technology such as Docker and Ideally experienced with using and managing Kubernetes clusters
  • Experience working with infrastructure & configuration as code tools such as Terraform, Cloudformation, Chef, Puppet etc.
  • Comfortable scripting & developing internal tooling with Bash and at least one programming language (e.g. python, go)
  • Experience working with Linux
  • Solid understanding of networking and system architecture
  • Solid understanding of how to deploy, scale and monitor web applications and databases
  • Good knowledge of AWS and/or GCP platforms and associated best practices
  • Bachelor's degree in Computer Science/Engineering, A postgraduate degree and/or record of academic achievement is also desirable

What You Will Have

  • Strong communication and documentation skills
  • Curious and self motivated learner
  • Professional approach
  • Good team member
  • Organisational and time management skills
  • Excellent attention to detail
  • Positive approach to change


  • Sydney, New South Wales, Australia AI Hustler Full time $120,000 - $180,000 per year

    Site Reliability Engineer (SRE) | Daily Rate Contract | Visa Sponsorship AvailableLocation:Sydney (Hybrid or Remote)Type:Contract (Daily Rate)Experience Level:Mid to Senior (5+ years)Stack:Kubernetes, Terraform, CI/CD, Observability, Cloud (AWS/GCP/Azure)Our client is looking for an experienced Site Reliability Engineerto join a high-scale platform team...


  • Sydney, New South Wales, Australia Luminance Full time $120,000 - $180,000 per year

    The Role Luminance's Site Reliability team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a crucial role in incident response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of...


  • Sydney, New South Wales, Australia Luminance Full time $120,000 - $180,000 per year

    The RoleLuminance's Site Reliability team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a crucial role in incident response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of customer...


  • Sydney, New South Wales, Australia Luminance Full time $80,000 - $140,000 per year

    The Role Luminance's Site Reliability team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a crucial role in incident response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of...


  • Sydney, New South Wales, Australia Luminance Full time $120,000 - $180,000 per year

    The RoleLuminance's Site Reliability team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a crucial role in incident response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of customer...


  • Sydney, New South Wales, Australia uniqueHire PTY LTD Full time $91,989 - $160,772 per year

    Hiring: Site Reliability Engineer - SydneyAre you passionate about building reliable, scalable, and automated systems?We're looking for a talented SRE Engineer to join our dynamic team and help ensure the performance, reliability, and resilience of our production environments.Key Responsibilities:Design, build, and maintain cloud infrastructure (AWS /...


  • Sydney, New South Wales, Australia CareCone Group Full time $120,000 - $180,000 per year

    Role: Site Reliability Engineer (Dynatrace + ELK)Location: SydneyPermanent (Fulltime)Job Description:Design, deploy, and maintain reliable, scalable, and high-performance systems.Implement and manage observability solutions using Dynatrace for monitoring and APM.Configure and optimize ELK (Elasticsearch, Logstash, Kibana) for centralized logging and...


  • Sydney, New South Wales, Australia CareCone Group Full time $100,000 - $150,000 per year

    Role:Site Reliability EngineerLocation:Sydney, NSWEmployment Type:PermanentMust Have:Full working rights. No sponsorship available.Immediate joiners (0-1 week)Minimum 4 years of work experience as an SREMandatory Skills:AWSTerraformDynatraceInterested consultants can share their updated resume ator call


  • Sydney, New South Wales, Australia Ticketek Entertainment Group Full time $120,000 - $180,000 per year

    About Ticketek Entertainment Group​Ticketek Entertainment Group is a global fan experience Company that tickets, promotes and delivers incredible live experiences that are impossible to forget.  In a distracted world where nothing beats real human moments, We make life better liveOur Group includes; our Fan Experience Platform (Ticketek) that sells...


  • Sydney, New South Wales, Australia TEG Pty Ltd Full time $120,000 - $180,000 per year

    About Ticketek Entertainment Group​Ticketek Entertainment Group is a global fan experience Company that tickets, promotes and delivers incredible live experiences that are impossible to forget.  In a distracted world where nothing beats real human moments, We make life better liveOur Group includes; our Fan Experience Platform (Ticketek) that sells...