Site Reliability engineer Lead
3 weeks ago
Overview
Position Summary:
As a Site Reliability Engineer Lead, you will lead a team of SRE engineers and manage the challenges of scaling our client's digitization program. Your expertise in coding, algorithms, complexity analysis, and large-scale system design will be crucial in building scalable, reliable, durable, and secure applications for our customers and internal users. You will develop highly reliable applications with a customer-first approach while innovating technically and understanding our customers' needs.
Mandatory Skills
- Strong experience in Java, Spring Boot, Node.js, microservices, RDBMS, NoSQL
- Proficiency with AWS services such as EC2, S3, Lambda, IAM, ECS, EKS, SQS, Kinesis
- Observability using Splunk, NewRelic
- Infrastructure as Code using Terraform
- APIs and event-driven approaches
- Security patterns
- Unix/Linux systems administration, with familiarity in Docker
- Strong experience in analyzing and troubleshooting large-scale distributed systems
- Ability to debug and optimize code and automate routine tasks
- Familiarity with containerization and orchestration technologies such as Docker and Kubernetes
- Knowledge of modern software engineering practices and tools - Agile and DevOps
- Strong communication skills and the ability to explain complex technical matters in an easy-to-understand way
- Strong domain knowledge of telecom billing and charging rating systems
Duties and Responsibilities
- Within the Site Reliability Engineering team, collaborate with various development teams and other partner teams to ensure applications’ reliability, efficiency, and performance meet customer needs, while keeping the service operational, scalable, and automated.
- Develop tools and automation to streamline operations and improve system reliability, efficiency, and performance.
- Partner with development teams on feature launches to ensure reliable and scalable functionality for customers.
- Build deep knowledge of production infrastructure to debug distributed systems problems and identify system improvements.
- Operations, SLO, SLA management
- Metrics reporting and progress tracking
- Manage infrastructure costs and optimize resource utilization
- Work with security teams to ensure compliance with security policies and procedures
- Participate in on-call rotations to provide 24/7 support for our systems
- Observability (alarms, monitoring, synthetics)
- Error management
Qualifications & Certifications (Optional)
· Bachelor’s degree in computer science or a related engineering degree
20+ years of IT industry experience
Salary Range
>100,000
Date of Posting
25 September 2025
Next Steps
If you feel this opportunity suits you, or Cognizant is the type of organization you would like to join, we want to have a conversation with you. Please apply directly with us.
For a complete list of open opportunities with Cognizant, visit http://www.cognizant.com/careers. Cognizant is committed to providing Equal Employment Opportunities. Successful candidates will be required to undergo a background check.
#LI-CTSAPAC
#J-18808-Ljbffr
-
Site Reliability engineer Lead
4 days ago
Melbourne, Victoria, Australia Cognizant Full time $120,000 - $180,000 per yearPosition SummaryAs a Site Reliability Engineer Lead, you will have the opportunity to lead a team of SRE engineers and manage the unique challenges of scaling our client's digitization program. Your expertise in coding, algorithms, complexity analysis, and large-scale system design will be crucial in providing scalable, reliable, durable, and secure...
-
Site Reliability engineer Lead
6 days ago
Melbourne VIC, Australia Cognizant Technology Solutions Full time $140,000 - $220,000 per yearPosition Summary:As a Site Reliability Engineer Lead, you will have the opportunity to lead a team of SRE engineers and manage the unique challenges of scaling our client's digitization program. Your expertise in coding, algorithms, complexity analysis, and large-scale system design will be crucial in providing scalable, reliable, durable, and secure...
-
Site Reliability Engineer
3 days ago
Melbourne, Victoria, Australia Salient Group Full time $120,000 - $180,000 per yearSite Reliability Engineer | Scale a Next-Gen SaaS PlatformLocation:Melbourne (Hybrid)AboutSalient is proud to be partnering with a fast-growing fintech scale-up that's tackling one of the world's most pressing challenges: financial crime. Their AI-powered SaaS platform is already trusted by leading banks and financial institutions across Australia, New...
-
Site Reliability Engineer
2 weeks ago
Council of the City of Sydney, Australia TEG Full timeAbout Ticketek Entertainment Group Ticketek Entertainment Group is a global fan experience Company that tickets, promotes and delivers incredible live experiences that are impossible to forget. In a distracted world where nothing beats real human moments, We make life better live! Our Group includes; our Fan Experience Platform (Ticketek) that sells...
-
Site Reliability Engineer
3 days ago
Melbourne, Victoria, Australia Bupa Full time $104,000 - $130,878 per yearAbout the RoleWe are seeking a Site Reliability Engineer (SRE) to own the stability, observability, and reliability of our non-production and production environments that support our mobile app delivery and customers. This role is responsible for ensuring development, integration, pre-production, and production environments remain healthy, available, and...
-
Asset Reliability Engineer
2 days ago
City of Melbourne, Australia John Holland Pty Ltd Full timeAt John Holland, our purpose is simple - we transform lives with everything we do. We/'ve always known that Infrastructure is about people — our customers, our employees, and the communities in which we work every day. That/'s our difference. Deep experience and capability with a genuine care about creating better lives for people along the way. Be part...
-
Site Reliability Engineer
4 days ago
Melbourne, Victoria, Australia Cubewire Full time $120,000 - $180,000 per yearAbout Cubewire:We are ablockchain infrastructure company building enterprise-gradeWeb3 applicationsWeb3 applications, specializing in programmable payments, wallet infrastructure, chain infrastructure and stablecoin solutions— featuring multi-network support, native KYC, policy controls, risk scoring, and compliance frameworks.We're seeking an SRE with...
-
Council of the City of Sydney, Australia Google Inc. Full timeStaff Software Engineer, Site Reliability Engineering, Cloud Google Sydney NSW, Australia Apply At Google, we have a vision of empowerment and equitable opportunity for all Aboriginal and Torres Strait Islander peoples and commit to building reconciliation through Google’s technology, platforms and people and we welcome Indigenous applicants. Please see...
-
Melbourne, Victoria, Australia Airwallex Full time $200,000 - $250,000 per yearAbout AirwallexAirwallex is the only unified payments and financial platform for global businesses. Powered by our unique combination of proprietary infrastructure and software, we empower over 150,000 businesses worldwide – including Brex, Rippling, Navan, Qantas, SHEIN and many more – with fully integrated solutions to manage everything from business...
-
Site Reliability Engineer
2 weeks ago
Melbourne, Victoria, Australia Tata Consultancy Services Full time $120,000 - $180,000 per yearAbout TCS:Join Tata Consultancy Services, Asia Pacific and be part of an organization committed to sustainable development for our future. TCS follows the Tata group philosophy of building sustainable businesses that are rooted in the community and demonstrate care for the environment. Our unique values position us to combine a purpose-driven worldview with...