Site Reliability Engineer
6 hours ago
We are looking for a
Site Reliability Engineer (SRE)
to join our team and ensure the reliability, scalability, and performance of our software systems. This role bridges the gap between software development and IT operations, focusing on automation, monitoring, and incident response to maintain high system uptime and user satisfaction.
Key Responsibilities
- Monitor system performance and availability using tools like Prometheus, Grafana, and ELK stack.
- Build and maintain scalable infrastructure using tools such as Terraform, Ansible, and Kubernetes.
- Automate operational tasks and deployment pipelines (CI/CD).
- Collaborate with development teams to improve system reliability and performance.
- Participate in incident response, root cause analysis, and postmortem documentation.
- Define and maintain service-level objectives (SLOs) and service-level indicators (SLIs).
- Implement disaster recovery and business continuity plans.
- Optimize system performance and resource utilization.
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or related field.
- 3+ years of experience in Site Reliability Engineering, DevOps, or Software Engineering.
- Proficiency in programming languages such as Python, Go, Java, or Ruby.
- Strong understanding of Linux systems and networking fundamentals.
- Experience with cloud platforms (AWS, Azure, GCP).
- Familiarity with containerization and orchestration (Docker, Kubernetes).
- Knowledge of monitoring and alerting tools.
- Excellent problem-solving and communication skills.
Preferred Qualifications
- Experience with distributed systems and microservices architecture.
- Certifications in cloud technologies (e.g., AWS Certified Solutions Architect).
- Experience with security and compliance in production environments.
-
Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia Kindred Group plc Full timeJoin to apply for the Site Reliability Engineer role at Kindred Group plc3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Kindred Group plcGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Kindred Group plcTA Partner at FDJ United (formerly known as Kindred) l...
-
Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia Macquarie Group Full timeJoin to apply for the Site Reliability Engineer role at Macquarie Group2 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Macquarie GroupGet AI-powered advice on this job and more exclusive features.Join our world class SRE team providing services for Macquarie Banking and Financial Services. The SRE function...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia Macquarie Group Full timeJoin to apply for the Site Reliability Engineer role at Macquarie Group2 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Macquarie GroupGet AI-powered advice on this job and more exclusive features.Join our world class SRE team providing services for Macquarie Banking and Financial Services. The SRE function...
-
Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia TikTok Full timeSite Reliability Engineer - AML Global Recommendation - USDSSite Reliability Engineer - AML Global Recommendation - USDS2 days ago Be among the first 25 applicantsResponsibilitiesAbout the Team:Site Reliability Engineering (SRE) of the AML (Applied Machine Learning) team combines system engineering and the art of machine learning to develop and run a...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia ServiceNow Full timeOverviewSite Reliability Engineer - SPP at ServiceNow, Millers Point, New South Wales, Australia.RoleAs a Site Reliability Engineer, you will:Provide relief and sustainable resolution to issues within our infrastructure.Use your experience in software development, systems engineering and networking to proactively prevent repeatable issues.Drive initiatives...
-
Site Reliability Engineer
4 weeks ago
Sydney, New South Wales, Australia ServiceNow Full timeOverviewSite Reliability Engineer - SPP at ServiceNow, Millers Point, New South Wales, Australia.RoleAs a Site Reliability Engineer, you will:Provide relief and sustainable resolution to issues within our infrastructure.Use your experience in software development, systems engineering and networking to proactively prevent repeatable issues.Drive initiatives...
-
Site Reliability Engineer
4 days ago
Sydney, New South Wales, Australia Whizdom Full time $80,000 - $120,000 per yearSite Reliability Engineer – AWS Infrastructure & ObservabilityAbout the client:Our client is a global consultancy delivering scalable, secure cloud infrastructure and reliability engineering solutions across government and enterprise platforms. This role supports AWS-based systems with a focus on automation, observability, and performance.About the role:We...
-
Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia UST Full timeRole Description UST is looking for a Site Reliability Engineer to join our team in Sydney, Australia. The Site Reliability Engineer will play the mission-critical role of ensuring that critical systems are healthy, monitored, automated, and designed to scale. This role will be responsible for responding to production problems, investigating their...
-
Site Reliability Engineer
3 weeks ago
Sydney, New South Wales, Australia UST Full timeRole DescriptionUST is looking for a Site Reliability Engineer to join our team in Sydney, Australia.The Site Reliability Engineer will play the mission-critical role of ensuring that critical systems are healthy, monitored, automated, and designed to scale. This role will be responsible for responding to production problems, investigating their causes, and...
-
Site Reliability Engineer
7 days ago
Sydney, New South Wales, Australia Avance Consulting Full time $120,000 - $180,000 per yearRole OverviewWe are seeking a skilled Site Reliability Engineer (SRE) with expertise in automation and observability. The ideal candidate will have strong proficiency in PowerShell scripting for automation, infrastructure management, and operational efficiency, as well as Power BI experience for building dashboards, metrics, and insights to support...