Systems Reliability Engineer
5 days ago
About us
We're proud to be Australia's #1 Technology Great Place to Work 2025 for the second year running This is in addition to being Great Place to Work Certified 2024. We've also won the 2024 Gallup Exceptional Workplace Award globally
Macquarie Cloud Services are the Australian specialists in cloud services for business and government. Locally owned and operated, with an industry-leading customer service model, we're trusted by our customers to provide the services that enable their business success.
We have made it our challenge to make our people feel good and love the work they do. Because of this, our people are energised and motivated in their work.
We believe that collaboration & team connection is key for success. This role will be based in Sydney with a blended working arrangement of 3 days in our CBD offices & the remaining working from home. #LI-Hybrid
The Opportunity
Be part of a new and growing capability at Macquarie Cloud Services, helping to shape it from the ground up. Join a dedicated team that's driving innovation, automation, and exceptional outcomes for our customers.
As a Systems Reliability Engineer (SRE), you'll work across hybrid projects 60% software development, 40% infrastructure building and automating Azure environments through code. From Infrastructure as Code (IaC) and Identity Management to Data & AI, you'll deliver impactful solutions with strong automation focus that help customers modernise their cloud operations.
If you're passionate about automation, cloud innovation, and making a real impact this is a great opportunity to join Australia's number one MSP. You will get exposure to cutting-edge cloud automation and data pipeline projects.
In this role, you'll bridge the gap between software development and IT operations by designing, building, and automating Azure environments through code. Working in an agile environment, you'll deliver continuous improvements that enhance reliability and align with customer goals and the Azure Well-Architected Framework.
You'll also strengthen our internal capability by improving knowledge bases, runbooks, and delivery templates to drive faster, more consistent outcomes across Macquarie Cloud Services.
Why Us?- We have been awarded Australia's #1 Great Place to Work 2025 for the second year running
- We're Great Place to Work Certified 2024.
- We have been named a Global Winner of the 2024 Gallup Exceptional Workplace Award
- We're the #1 Managed Services Cloud business in Australia.
- We have the highest Net Promoter Score on the ASX, the World's best customer experience & crowned in 2020 at the World Communications Awards.
- You'll make an impact: Enjoy being part of a driven team with a collaborative culture that values decision-makers and action.
- We Invest in you: Accelerate your career through our learning and development opportunities - think Gallup strengths-based training, defined career pathways and fantastic internal mobility opportunities across the group.
- Meet regularly with your assigned customers and the broader Macquarie operational team (HMC & CMND), manage the backlog, execute tasks and log your time.
- Take a site-reliability mindset in implementing well-architected solutions for customers across automation (using an automation-first mindset), monitoring and alerting, performance optimisation, security and coding standards.
- Bring your deep knowledge of the Azure cloud platform and its services.
- Automate through Infrastructure as Code and supplement with scripting to achieve customer business outcomes.
- Integrate IaC and scripting solutions into DevOps pipelines to provide customers with a seamless outcome.
- Maintain the operations manual for customers including clean documentation for delivered solutions.
- Work with the Product & Architecture team to fine-tune customer landing zone designs.
- Iterate and mature our internal knowledgebase, work instructions, project and delivery artefacts.
- Contribute to our common IaC library for use in customer projects.
- At least 5 years' delivering IaaS and PaaS solutions in Microsoft Azure.
- 5–10 years of experience in an SRE, DevOps, or Cloud Automation roles.
- Extensive hands-on experience with Identity, Virtual Machines, Virtual Networking, Storage, and associated end to end service architecture.
- Experience in GitHub & source code management required.
- Demonstrable experience performing on-premises to Azure migrations leveraging native Azure or 3rd party tooling.
- Strong Documentation skills and the ability to write clear Work Instructions for other engineers to follow.
- 2-3 years' experience building Infrastructure as Code for cloud-based solutions.
- Strong experience in Python, PowerShell scripting and Azure Resource Manager template experience.
- CI/CD pipeline automation required.
- Highly desirable: Experience with Azure Databricks, Microsoft Fabric, and data pipeline development.
- Compute: Virtual Machines (Windows & Linux), Virtual Machine Scale Sets, App Services, Web Apps, Function Apps, Logic Apps, Azure Kubernetes Service, Azure Arc.
- Networking: Application Gateway, Azure Bastion, DDoS Protection, Azure DNS, Azure ExpressRoute, Azure Firewall, Azure Front Door, Content Delivery Network, Azure Private Link, Load Balancer, Network Security Groups, Traffic Manager, VPN Gateway, Virtual Network, Virtual WAN, Web Application Firewall (WAF), Network Watcher.
- Storage: Storage Accounts, Storage Explorer, Archive Storage, Azure Backup, Azure Files, Azure NetApp Files, Blob Storage, Disk Storage, Managed Disks, Queue Storage, Table Storage..
- Identity: Entra ID, AAD Connect, Entra Domain Services (AADDS).
- Management and Governance: Automation, Azure Advisor, Azure Blueprints, Azure Monitor, Azure Policy, ARM & Templates, Azure Service Health, Azure Site Recovery, Cloud Shell.
- Integration: API Management.
- Migration: Azure Migrate, Azure Database Migration Service.
- Security: Key Vault, Security Center.
- Database: SQL Server on VMs, Azure Cache for Redis, Azure SQL Database, Azure Database for MySQL, PostgreSQL.
- Data & AI: Azure AI Foundry, Azure AI Services, Microsoft Fabric, Azure Synapse Analytics.
- Scripting & Automation: PowerShell, Terraform (Desired), Bicep (Optional – willing to learn), Git, Common CI/CD platforms.
- Operating Systems: Advanced knowledge of Microsoft Server operating systems, MCSE certified 2003/8 or higher. Strong Unix / Linux knowledge.
- Broad Technical Knowledge: Advanced knowledge of DNS, Mail and WWW services.
- Agile tools: Jira, Azure DevOps, GitHub Issues, ServiceNow.
- Software development: An understanding of application deployment methodologies for common programming languages: .NET Core (C#), Python, Java.
If this excites you, apply now, we'd love to hear from you
-
Reliability Engineer
6 days ago
Sydney, New South Wales, Australia KBR, Inc. Full time $120,000 - $180,000 per yearTitle:Reliability EngineerAt KBR – We do things that matter. We deliver science, technology and engineering solutions to governments and companies around the world. KBR employs approximately 38,000 people worldwide with customers in more than 80 countries and operations in over 29 countries.KBR is proud to work with its customers across the globe to...
-
Reliability Engineer
1 week ago
Sydney, New South Wales, Australia DLA Piper Full time $80,000 - $120,000 per yearDLA Piper is seeking a skilled and forward-thinking Reliability Engineer to join a growing international team. This is a unique opportunity to work at the intersection of software engineering, DevOps, and infrastructure reliability. Supporting large-scale distributed systems across the Aisa-Pacific regions, Europe, the UK, the Middle East and...
-
Reliability Engineer
1 day ago
Sydney, New South Wales, Australia DLA Piper Full time $90,000 - $120,000 per yearDLA Piper is seeking a skilled and forward-thinking Reliability Engineer to join a growing international team. This is a unique opportunity to work at the intersection of software engineering, DevOps, and infrastructure reliability. Supporting large-scale distributed systems across the Asia-Pacific regions, Europe, the UK, the Middle East and...
-
Site Reliability Engineer
20 hours ago
Sydney, New South Wales, Australia N2S Full time $120,000 - $180,000 per yearWe are looking for aSite Reliability Engineer (SRE)to join our team and ensure the reliability, scalability, and performance of our software systems. This role bridges the gap between software development and IT operations, focusing on automation, monitoring, and incident response to maintain high system uptime and user satisfaction.Key...
-
Site Reliability Engineer
6 days ago
Sydney, New South Wales, Australia Uniquehire Full time $91,836 - $160,714 per yearSite Reliability Engineer (SRE)Location: SydneyAbout the RoleWe're seeking an enthusiastic Site Reliability Engineer (SRE) to help design, build, and scale our observability and reliability framework across critical customer-facing applications. This role offers an exciting opportunity to work hands-on with modern monitoring tools and collaborate with...
-
Site Reliability Engineer
1 week ago
Sydney, New South Wales, Australia Cover Genius Full time $120,000 - $180,000 per yearAbout The CompanyCover Genius is a Series E Insurtech that protects the global customers of the world's largest digital companies including Booking Holdings, owner of Priceline, Kayak and , Intuit, Hopper, Skyscanner, Ryanair, Turkish Airlines, Descartes ShipRush, Zip and SeatGeek. We're also available at Amazon, Flipkart, eBay, Wayfair and SE Asia's largest...
-
Site Reliability Engineer
1 week ago
Sydney, New South Wales, Australia uniqueHire PTY LTD Full time $91,989 - $160,772 per yearHiring: Site Reliability Engineer - SydneyAre you passionate about building reliable, scalable, and automated systems?We're looking for a talented SRE Engineer to join our dynamic team and help ensure the performance, reliability, and resilience of our production environments.Key Responsibilities:Design, build, and maintain cloud infrastructure (AWS /...
-
Site Reliability Engineer
1 week ago
Sydney, New South Wales, Australia Zentact Systems Full time $120,000 - $140,000 per yearDesigning and implementing SLIs/SLOs aligned to key customer journeys.Strong knowledge of observability concepts: logs, metrics, traces, SLIs/SLO.Integrating observability tools like Dynatrace, Elastic, and Nagios to provide deep insights into application performance and reliability.Building alerting pipelines via PagerDuty to ensure timely and actionable...
-
Site Reliability Engineer
21 hours ago
Sydney, New South Wales, Australia Ticketek Entertainment Group Full time $120,000 - $180,000 per yearAbout Ticketek Entertainment GroupTicketek Entertainment Group is a global fan experience Company that tickets, promotes and delivers incredible live experiences that are impossible to forget. In a distracted world where nothing beats real human moments, We make life better liveOur Group includes; our Fan Experience Platform (Ticketek) that sells...
-
Site Reliability Engineer
2 weeks ago
Sydney, New South Wales, Australia CareCone Group Full time $120,000 - $180,000 per yearRole: Site Reliability Engineer (Dynatrace + ELK)Location: SydneyPermanent (Fulltime)Job Description:Design, deploy, and maintain reliable, scalable, and high-performance systems.Implement and manage observability solutions using Dynatrace for monitoring and APM.Configure and optimize ELK (Elasticsearch, Logstash, Kibana) for centralized logging and...