Site Reliability Engineer

4 weeks ago


North Sydney, Australia Workday Australia Pty Ltd Full time

About the Role

Are you a creative SRE looking for more opportunities to automate and improve reliability, or an innovative Software Developer that enjoys building solutions to reduce toil and manual effort?

With constant attention and focus on our customers (both internal and external), you will deliver quickly on a wide range of daily tasks - from environment provisioning, performance monitoring, environment troubleshooting, ad-hoc requests and automation efforts; while providing transparency of work being performed.

This role requires a good understanding of Linux systems in a Production Environment as you will be part of a team that writes and maintains scripts (bash, ruby, python) that support public and private cloud environments.

About You

We would love to hear from you if you like trying new techniques and approaches to sophisticated problems, love to learn new technologies, are a natural collaborator and a phenomenal teammate who brings out the best in everyone around you.

You understand that availability of Workday Service is paramount and requires on-call participation, careful planning of changes, detailed runbooks and effective teamwork. If the work performed is manually repeated often, you find a way to automate the task. More so, you deliver

Basic Qualifications

3+ years of experience running and maintaining a 24x7 large-scale production environment, preferably across multiple data centers

Proven expertise with Linux, debug fundamentals and have a solid understanding of how to quickly isolate issues.

Other Qualifications

BS or MS degree in Computer Science, Engineering, or related technical field, or equivalent experience

Experience deploying and operating: Apache Tomcat, HTTPd, MySQL, Java Web Applications preferably with source control

Experience with many tool sets: Chef, Puppet, OSSEC, Splunk, Elasticsearch, Bladelogic, Ansible, JIRA, Confluence, WaveFront, Grafana, Kubernetes, Prometheus

Strong understanding of enterprise level thinking on a few levels; documentation, runbooks, root cause analysis, capacity-trending, bug fixes and scripting

Passionate about monitoring. When false positives show up on your radar you quickly address it. Your inner wish list is to "make monitoring phenomenal again".

Can balance multiple tasks, make the right business decisions and tackle problems while under pressure, and prioritize and organize effectively.

Able to work some nights and weekends is required as part of the on-call support and production update rotation.

Experience with (CentOS, SunOS, Solaris/Linux/DevOps) is a plus.



  • Sydney, Australia Microsoft Full time

    OverviewAre you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you!Azure CXP CRE is a top-level pillar of Azure Engineering that...


  • Sydney, Australia Microsoft Full time

    Overview Are you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you! Azure CXP CRE is a top-level pillar of Azure...


  • Sydney, Australia Microsoft Full time

    OverviewAre you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you!Azure CXP CRE is a top-level pillar of Azure Engineering that...


  • Sydney, Australia Talenza Full time

    Join our esteemed Australian financial services client as a Site Reliability Engineer (SRE). This pivotal role blends Business-As-Usual (BAU) tasks with the exciting challenge of rolling out enterprise-grade observability solutions. Key Responsibilities: Ensure the reliability, performance, and availability of our systems. Develop and implement...


  • Sydney, Australia Talenza Full time

    Join our esteemed Australian financial services client as a Site Reliability Engineer (SRE). This pivotal role blends Business-As-Usual (BAU) tasks with the exciting challenge of rolling out enterprise-grade observability solutions. Key Responsibilities: Ensure the reliability, performance, and availability of our systems. Develop and implement...


  • Sydney, Australia Microsoft Full time

    Overview Are you interested in working on one of Microsoft's most exciting products? Are you passionate about exceeding customer expectations and advancing Microsoft's cloud-first strategy? If so, the Azure Customer Experience (CXP) Customer Reliability Engineering (CRE) Team is the place for you! Azure CXP CRE is a top-level pillar of Azure...


  • Sydney, Australia Firesoft People Full time

    Senior Site Reliability Engineer Join a leading electronic trading firm in a pivotal role as a Site Reliability Engineer (SRE). At our firm, we are passionate about market-making and arbitrage opportunities on a global scale. Technology drives our success, fuelling our unified trading platform and enabling precise micro-decisions. With our agility and...


  • Sydney, Australia Firesoft People Full time

    Senior Site Reliability Engineer Join a leading electronic trading firm in a pivotal role as a Site Reliability Engineer (SRE). At our firm, we are passionate about market-making and arbitrage opportunities on a global scale. Technology drives our success, fuelling our unified trading platform and enabling precise micro-decisions. With our agility and...


  • Sydney, Australia SafetyCulture Full time

    At SafetyCulture, we help businesses get better everyday. As the operational heartbeat of working teams, our technology gives workers a voice and leaders the visibility to make smart decisions. We’re constantly evolving our platform, expanding into sensors/IoT, Scalable and Event-Driven Architecture to name a few, but we believe there’s more to be...


  • Sydney, Australia SafetyCulture Full time

    At SafetyCulture, we help businesses get better everyday. As the operational heartbeat of working teams, our technology gives workers a voice and leaders the visibility to make smart decisions. We’re constantly evolving our platform, expanding into sensors/IoT, Scalable and Event-Driven Architecture to name a few, but we believe there’s more to be...


  • Sydney, Australia Integral Ad Science Full time

    Job Description:Integral Ad Science (IAS) is a global technology and data company that builds verification, optimization, and analytics solutions for the advertising industry and we’re looking for a Senior Site Reliability Engineer to join our Cloud Reliability Engineering team. If you are excited by technology that has the power to handle hundreds of...


  • Sydney, Australia Integral Ad Science Full time

    Job Description:Integral Ad Science (IAS) is a global technology and data company that builds verification, optimization, and analytics solutions for the advertising industry and we’re looking for a Senior Site Reliability Engineer to join our Cloud Reliability Engineering team. If you are excited by technology that has the power to handle hundreds of...


  • Sydney, Australia Google Full time

    info_outlineXInfo At Google, we have a vision of empowerment and equitable opportunity for all Aboriginal and Torres Strait Islander peoples and commit to building reconciliation through Google’s technology, platforms and people and we welcome Indigenous applicants. Please see our Reconciliation Action Plan for more information. At Google, we have a vision...


  • Sydney, Australia VGW Full time

    Senior Site Reliability Engineer VGW is a fast-growing technology company and creator of market-leading online social games. With offices around the globe, we’re on a mission to be the biggest gaming company in the world! Due to major growth we are expanding our Engineering team and currently looking for a Senior Site Reliability Engineer to join the team....


  • Sydney, Australia VGW Full time

    Senior Site Reliability Engineer VGW is a fast-growing technology company and creator of market-leading online social games. With offices around the globe, we’re on a mission to be the biggest gaming company in the world! Due to major growth we are expanding our Engineering team and currently looking for a Senior Site Reliability Engineer to join...


  • Sydney, New South Wales, Australia Dalet Full time

    What you will love about Dalet and why you should be working here Dalet is a media solutions and service provider that places technological innovation and human collaboration at the heart of everything we do, creating powerful tools and products that help our customers tell better stories. With over three decades of innovation, our software solutions enable...


  • Sydney, New South Wales, Australia Dalet Full time

    What you will love about Dalet and why you should be working here Dalet is a media solutions and service provider that places technological innovation and human collaboration at the heart of everything we do, creating powerful tools and products that help our customers tell better stories. With over three decades of innovation, our software solutions enable...


  • Sydney, Australia Q-CTRL Full time

    About usWe hold the key to making quantum technology useful and operate at the highest levels of the emerging quantum sector. Our products are already revolutionizing the way quantum technology is used; From educating the workforce on how quantum computing works with Black Opal, to the industry-first native integration of our  performance-management...


  • Sydney, Australia Q-CTRL Full time

    About usWe hold the key to making quantum technology useful and operate at the highest levels of the emerging quantum sector. Our products are already revolutionizing the way quantum technology is used; From educating the workforce on how quantum computing works with Black Opal, to the industry-first native integration of our  performance-management...


  • Sydney, Australia Freelancer.com Full time

    We are seeking a highly motivated and skilled Mid-Level Site Reliability Engineer (SRE) to join our growing team. You will be responsible for ensuring the reliability, performance, and scalability of our critical infrastructure and applications running primarily on AWS. You will work closely with developers, DevOps engineers, and other stakeholders to...