
Site Reliability Engineer
4 weeks ago
Are you ready for new challenges and new opportunities?
Join our team
Current job opportunities are posted here as they become available.
Subscribe to our RSS feeds to receive instant updates as new positions become available.
NEP is Australia's leading provider of outsourced television production services.
We are always looking for great people to join our team; people with a passion for people and teamwork helping us deliver exceptional results for our clients.
NEP Australia is currently looking for a Site Reliability Engineer to join our team at the Andrew's HUB in Southbank
The principal purpose of this position is to drive proactive improvements in the performance, reliability, and developer experience of the NEP Platform products and services across Kubernetes-based infrastructure running on-premise and in AWS/GCP environments.
Key Responsibilities but not limited to:- Design, implement, and maintain developer-friendly tools to improve productivity, code quality, and deployment efficiency for Kubernetes-based workloads.
- Identify bottlenecks in integration and deployment pipelines and implement enhancements to support faster, more reliable deployments to on-premise and cloud Kubernetes clusters.
- Collaborate with development teams to enable self-service tooling for managing deployments, logs, and infrastructure resources in Kubernetes environments.
- Continuously improve build, test, and deployment automation for Kubernetes infrastructure across on-premise and cloud environments (AWS/GCP).
- Provide better visibility into Kubernetes environments through improved observability tools, dashboards, and metrics.
- Manage and improve Kubernetes orchestration across on-premise infrastructure and AWS/GCP clusters to ensure reliability, scalability, and consistency.
- Enhance observability by implementing robust monitoring, logging, and alerting solutions tailored to Kubernetes workloads using tools like Grafana, Loki or cloud-native tools like CloudWatch (AWS) and Stackdriver (GCP).
- Collaborate with Engineering Leadership to implement reliability engineering practices such as load testing, chaos testing, and recovery mechanisms for Kubernetes services.
- Bachelors or Masters in Computer Engineering (or equivalent experience)
- 2+ years in Software or Systems Engineering
- Automation for scaling using tools like Ansible, Terraform, Helm, and ArgoCD.
- Software development in at least one language such as Go or Python
- Experience in building and maintaining container platforms, such as Kubernetes
- Expert in Observability platforms such as Grafana, Prometheus etc
- Experienced in using and tuning cloud native technology
- Solid understanding of basic Linux and cloud networking (e.g., routing, firewalls, DNS, VPCs, subnets, load balancers).
NEP believes that, ?rst and foremost, the e?orts of our people are what contribute to our successes. We o?er a range of bene?ts that assist our team in their professional development and wellbeing, including:
- Salary continuance insurance
- NEP Days – additional 5 days of leave per year (conditions apply)
- NEP Travel benefits & discounts including Qantas Club Membership
- Discounts through Employment Hero Work app
- Employee Assistance Programme
This is a full-time role and is a unique opportunity for the right person. So if you want to be part of a global company apply today
You must have the right to live and work in Australia to apply for this job.
Only shortlisted candidates will be contacted.
At NEP, we are committed to employing individuals who align with Our Values and meet the requirements of the role. As part of the recruitment process, there are several checks which may be conducted to demonstrate applicants' suitability for a role including police / criminal background checks, right to work checks, and reference checks.
NEP is the largest media technology partner for content producers of live sports, entertainment, and corporate events globally. For more than 35 years, NEP has been delivering innovative products and services that enable clients to make, manage and show the world their content—anywhere, anytime, on any platform.
As a trusted partner working on some of the largest productions in the world, NEP offers a complete set of end-to-end solutions, from content capture to distribution—including a growing portfolio of transformational cloud-based, software-based and virtualized technologies.
• NEP's Live Production solutions range from AV services and live audience enhancements to traditional outside broadcast and cutting-edge centralized and cloud production.
• NEP's Virtual Production solutions start at the creative stage and end with exceptional execution across ICVFX, augmented reality, LED stages and more.
• NEP's Media Processing solutions provide the tools and products our clients need to ingest, edit, store, search, manage and distribute their digital assets to rights holders across multiple platforms.
Headquartered in the United States, NEP has operations in 25 countries with over 4,000+ employees. Together, NEP has supported productions in over 100 countries on all seven continents and is still growing. Clients range from the leaders in sport, music, film and TV, to major corporate brands, agencies, to new content owners and creators all around the world.
Anywhere, anytime, on any platform—we help our clients make, manage, and show the world their content.
#J-18808-Ljbffr-
Site Reliability Engineer
2 weeks ago
Melbourne, Victoria, Australia Salient Group Full time $120,000 - $180,000 per yearSite Reliability Engineer | Scale a Next-Gen SaaS PlatformLocation:Melbourne (Hybrid)AboutSalient is proud to be partnering with a fast-growing fintech scale-up that's tackling one of the world's most pressing challenges: financial crime. Their AI-powered SaaS platform is already trusted by leading banks and financial institutions across Australia, New...
-
Site Reliability Engineer
2 weeks ago
Melbourne, Victoria, Australia BURGEON IT SERVICES Full time $125,000 - $175,000 per yearPosition: Site Reliability Engineer Lead Engineer Location: Melbourne, VIC Duration: 6 months Relevant Exp: 10 years Primary Focus: Ensuring system reliability, scalability, and performance. Key Responsibilities: - Defining SLOs, SLIs, and SLAs for reliability. - Monitoring system performance and reducing toil. - Incident response and root cause...
-
Site Reliability Engineer
4 weeks ago
Melbourne, Victoria, Australia Bupaoptical Full timeAbout the RoleWe are seeking a Site Reliability Engineer (SRE) to own the stability, observability, and reliability of our non-production and production environments that support our mobile app delivery and customers. This role is responsible for ensuring development, integration, pre-production, and production environments remain healthy, available, and...
-
Site Reliability Engineer
2 weeks ago
Melbourne, Victoria, Australia Bupa Full time $104,000 - $130,878 per yearAbout the RoleWe are seeking a Site Reliability Engineer (SRE) to own the stability, observability, and reliability of our non-production and production environments that support our mobile app delivery and customers. This role is responsible for ensuring development, integration, pre-production, and production environments remain healthy, available, and...
-
Site Reliability Engineer
4 weeks ago
Melbourne, Victoria, Australia Infosys Limited Full timeOverviewAbout Us: Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 56 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by...
-
Site Reliability Engineer
4 weeks ago
Melbourne, Victoria, Australia Infosys Limited Full timeOverviewAbout Us: Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 56 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by...
-
Site Reliability engineer Lead
2 weeks ago
Melbourne, Victoria, Australia Cognizant Full time $120,000 - $180,000 per yearPosition SummaryAs a Site Reliability Engineer Lead, you will have the opportunity to lead a team of SRE engineers and manage the unique challenges of scaling our client's digitization program. Your expertise in coding, algorithms, complexity analysis, and large-scale system design will be crucial in providing scalable, reliable, durable, and secure...
-
Senior Site Reliability Engineer
4 weeks ago
Melbourne, Victoria, Australia Culture Amp Full timeSenior Site Reliability Engineer - Data Intelligence - Realtime Analytics Platform We are looking for a Senior Site Reliability Engineer to join our newly formed Realtime Analytics Platform Team. We sit within the Data Intelligence Camp and are responsible for developing an internal platform capability that enables all product teams across Culture Amp to...
-
Senior Site Reliability Engineer
4 weeks ago
Melbourne, Victoria, Australia Culture Amp Full timeSenior Site Reliability Engineer - Data Intelligence - Realtime Analytics PlatformWe are looking for a Senior Site Reliability Engineer to join our newly formed Realtime Analytics Platform Team. We sit within the Data Intelligence Camp and are responsible for developing an internal platform capability that enables all product teams across Culture Amp to...
-
Principal Site Reliability Engineer
4 weeks ago
Melbourne, Victoria, Australia Commonwealth Bank Full timeYou are passionate about SRE and systems engineering We are undergoing one of Australia's largest digital transformations Together we can reimagine banking for millions of customers Do work that matters We're accelerating our digital strategy with an ambition to provide customers with one of the best digital experiences of any company globally.Site...