Site Reliability Engineer

21 hours ago


Sydney, New South Wales, Australia Cover Genius Ltd Full time
About the Role

We are seeking a highly skilled Site Reliability Engineer to join our team at Cover Genius Ltd. As a Site Reliability Engineer, you will play a critical role in ensuring the reliable operation of our production systems.

Main Responsibilities
  • Analyze, test, and modify systems to improve reliability and optimize performance, particularly at an architectural/infrastructure level.
  • Develop and maintain observability tooling and dashboards to monitor system health and security.
  • Implement automation tools and frameworks, CI/CD pipelines, and reduce toil to improve efficiency.
  • Troubleshoot production issues and coordinate with the development team to streamline code deployments.
  • Apply AWS and GCP knowledge and skills to create and maintain cloud infrastructure for software projects.
  • Design, develop, and implement software integrations to improve engineering tools and systems.
  • Collaborate with Software Engineers and other team members to improve engineering tools, systems, procedures, and data security.
  • Develop and maintain design and troubleshooting documentation and runbooks.
  • Optimize and control costs of the company's computing infrastructure.
Requirements
  • Understanding of SRE Principles and best practices.
  • Experience using and configuring modern observability tools such as ELK/EFK, Prometheus, Grafana.
  • Comfortable scripting and developing internal tooling with Bash and at least one programming language (e.g., Python, Go).
  • Experience working with infrastructure and configuration as code tools such as Terraform, Cloud Formation, Chef, Puppet, etc.
  • Experienced with container technology such as Docker and ideally experienced with using and managing Kubernetes clusters.
  • Experience working with Linux.
  • Solid understanding of networking and system architecture.
  • Solid understanding of how to deploy, scale, and monitor web applications and databases.
  • Good knowledge of AWS and/or GCP platforms and associated best practices.
  • Bachelor's Degree in Computer Science/Engineering or equivalent practical experience.
  • Strong communication and documentation skills.
  • Curious and self-motivated learner.
  • Professional approach.
  • Good team member.
  • Organizational and time management skills.
  • Excellent attention to detail.
  • Positive approach to change.


  • Sydney, New South Wales, Australia Citadel Securities Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Citadel Securities. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our distributed systems and applications.Responsibilities:Design and implement scalable and efficient...


  • Sydney, New South Wales, Australia Citadel Securities Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Citadel Securities. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our distributed systems and applications.Responsibilities:Design and implement scalable and efficient...


  • Sydney, New South Wales, Australia Audinate Full time

    {"title": "Site Reliability Engineer", "description": "About the RoleWe're seeking a skilled Site Reliability Engineer to join our team at Audinate. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our infrastructure, including our on-prem services and hyperscale cloud services.Key...


  • Sydney, New South Wales, Australia EFinancialCareers Ltd. Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at EFinancialCareers Ltd. in Sydney, Australia.About the RoleAs a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our high-frequency trading systems. You will work closely with the development team to identify and...


  • Sydney, New South Wales, Australia Microsoft Full time

    Job Title: Site Reliability EngineerAre you passionate about delivering exceptional customer experiences and advancing Microsoft's cloud-first strategy? We're seeking a skilled Site Reliability Engineer to join our Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team.About the RoleAs a Site Reliability Engineer, you will be responsible...


  • Sydney, New South Wales, Australia Microsoft Full time

    Job Title: Site Reliability EngineerAre you passionate about delivering exceptional customer experiences and advancing Microsoft's cloud-first strategy? We're seeking a skilled Site Reliability Engineer to join our Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team.About the RoleAs a Site Reliability Engineer, you will be responsible...


  • Sydney, New South Wales, Australia EFinancialCareers Ltd. Full time

    Job Title: Junior Site Reliability EngineerWe are seeking a highly skilled Junior Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly available...


  • Sydney, New South Wales, Australia Palantir Technologies Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Database Operations team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our databases and data platforms.Key ResponsibilitiesDesign and implement scalable systems and...


  • Sydney, New South Wales, Australia Stake Australia Full time

    About Stake AustraliaStake Australia is a leading investment platform that provides a seamless and immersive experience for ambitious investors. With a global customer base of 500,000+ investors and over A$3 billion in assets under administration, we're committed to delivering high-quality execution and continuous improvement.Job Title: Site Reliability...


  • Sydney, New South Wales, Australia TikTok Full time

    About the RoleTikTok is a leading platform for short-form mobile video, and our mission is to inspire creativity and bring joy. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our platform, which is used by millions of Americans every day.Key ResponsibilitiesDesign, build, and maintain highly...


  • Sydney, New South Wales, Australia ServiceNow Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at ServiceNow. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our infrastructure, driving initiatives to improve system design, and promoting a culture of automation and scalability.Key ResponsibilitiesProvide relief...


  • Sydney, New South Wales, Australia Cover Genius Full time

    About the RoleWe're seeking a skilled Site Reliability Engineer to join our team at Cover Genius. As a Site Reliability Engineer, you will play a critical role in ensuring the reliable operation of our production systems.Main ResponsibilitiesAnalyze, test, and modify systems to improve reliability and optimize performance, particularly at an...


  • Sydney, New South Wales, Australia Cover Genius Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Cover Genius. As a Site Reliability Engineer, you will play a critical role in ensuring the reliable operation of our production systems.Key ResponsibilitiesAnalyze, test, and modify systems to improve reliability and optimize performance, particularly at an...


  • Sydney, New South Wales, Australia Cover Genius Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Cover Genius. As a Site Reliability Engineer, you will play a critical role in ensuring the reliable operation of our production systems.Main ResponsibilitiesAnalyze, test, and modify systems to improve reliability and optimize performance, particularly at an...


  • Sydney, New South Wales, Australia Randstad Full time

    Site Reliability EngineerWe are seeking an experienced Site Reliability Engineer to join our team in Sydney. As part of our growing infrastructure and applications team, you will be responsible for maintaining, stabilizing, and scaling our key systems.Key Responsibilities:Lead Infrastructure Development: Implement cutting-edge monitoring, logging, and...


  • Sydney, New South Wales, Australia Cover Genius Ltd Full time

    About Cover Genius LtdCover Genius Ltd is a leading insurtech company that protects the global customers of the world's largest digital companies. Our award-winning insurance distribution platform, XCover, is integrated with our partners to embed protection for millions of customers worldwide each year.Job DescriptionWe are seeking a highly skilled Site...


  • Sydney, New South Wales, Australia Cover Genius Ltd Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Cover Genius Ltd. As a Site Reliability Engineer, you will play a critical role in ensuring the reliable operation of our production systems.Main ResponsibilitiesAnalyze, test, and modify systems to improve reliability and optimize performance, particularly at an...


  • Sydney, New South Wales, Australia Palantir Technologies Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Database Operations team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our databases and infrastructure.Key ResponsibilitiesDesign and implement scalable systems and...


  • Sydney, New South Wales, Australia Audinate Full time

    About the RoleAudinate is seeking a skilled Site Reliability Engineer to join our dynamic engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our infrastructure and services.You will work closely with our software, hardware, and test engineering teams to deliver our...


  • Sydney, New South Wales, Australia Microsoft Full time

    Job Title: Site Reliability EngineerMicrosoft is seeking a highly skilled Site Reliability Engineer to join our Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team. As a key member of our team, you will be responsible for improving customer experience on Azure by analyzing signals from various sources, driving root cause analyses...