
Site Reliability Engineer
11 hours ago
NetApp is looking for a Senior TechOps Engineer to join our growing Instaclustr team in Australia. NetApp's Instaclustr offering provides open source as a service, delivering reliability at scale. We manage cutting-edge open-source technologies (Cassandra, Kafka, PostgreSQL, Redis/Valkey, OpenSearch, ClickHouse and Cadence) for our customers around the world.
NetApp Instaclustr makes it easy for our customers to run powerful open-source applications at the highest levels of scale. We have developed a platform that takes care of the whole lifecycle: provisioning infrastructure, installing applications and, most importantly, keeping the applications running reliably in production. Since being founded in 2013, Instaclustr has grown strongly, with over 300 customers worldwide, and over 19,000 nodes under management.
Our Technical Operations Engineers are the frontline team keeping our large fleet of cloud-hosted open-source clusters up and running. Your work will ensure the security, reliability and performance of world-class systems and databases. You will collaborate with our customers' technical teams at globally recognised companies in the gaming, banking and logistics sectors, ranging from big multinationals to emerging start-ups.
The Role
If you have excellent operational knowledge in managing Kafka clusters, look no further.
As a Site Reliability Engineer (Kafka), you are in the frontline team keeping our large fleet of cloud-hosted Kafka clusters up and running. Every day, you will diagnose and solve interesting technical problems, providing Kafka as a Managed Service in a highly automated environment. Our service is relied on by some of the leading global names in Banking and Financial Services, Telecom, IoT and Tech companies that interact with millions of end users.
Skills & Experience
We're looking for smart engineers with exceptional communication skills, a positive attitude, and a passion for IT and learning new things. We expect you to be, or quickly become, proficient in a range of the technologies we use. Successful candidates for this role will:
- Have strong experience in Kafka, and a desire to learn more and develop to a true expert level.
- Ideally already have experience diagnosing operational issues through the analysis of logs and graphs.
- Ideally have past experience with upgrades and migrations of the above-mentioned technologies.
- Have good experience working with one public cloud provider such as AWS, Azure or GCP.
- Preferably have past IT customer service/support experience.
- Have good fundamental computer science and software engineering skills and knowledge, particularly operating system internals, memory management, and networking.
- Have strong knowledge of and experience with Linux, and be comfortable working from the command line (essential).
- Have an exceptional ability to communicate clearly and professionally in written and verbal English (essential).
- Work as part of a team and use your initiative to get things done.
- Be able to follow required processes and procedures.
- Be able to investigate and research issues by reviewing source code.
- Ideally have programming skills in Python or Java and experience with source control using Git.
- Provide expert operational support to our nodes running in the cloud (AWS, Azure and GCP), using technologies such as Linux (Debian), Docker, and languages including Java, Python and Bash.
- Participate in the on-call Level 2 roster.
- Liaise with our customers' engineers in resolving interesting issues related to Kafka usage and other supported technologies.
- Undertake complex cluster operations such as migrations, upgrades and maintenance on our fleet.
- Develop and continually improve our suite of internal automation tools, applications, and processes.
-
Site Reliability Engineer
11 hours ago
Canberra, ACT, Australia NetApp, Inc. Full time $120,000 - $180,000 per year
Job Summary
NetApp is looking for a Senior TechOps Engineer to join our growing Instaclustr team in Australia. NetApp's Instaclustr offering provides open source as a service, delivering reliability at scale. We manage cutting-edge open-source technologies (Cassandra, Kafka, PostgreSQL, OpenSearch, Cadence and ClickHouse) for our customers...
-
Site Reliability
2 weeks ago
Canberra, ACT, Australia Canonical Full time
Site Reliability / Gitops Engineer role at Canonical ...
-
Site Reliability Engineer
2 weeks ago
Canberra, ACT, Australia Canonical Full time
Overview
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud,...
-
Site Reliability Engineering Lead
1 week ago
Canberra, ACT, Australia beBeeSoftware Full time $147,093 - $164,455
Job Opportunity
About the Position
We are seeking a senior engineer to lead our site reliability engineering team in implementing DevOps best practices and driving internal projects. The ideal candidate will have experience with software delivery using infrastructure as code, managing DevOps teams, and understanding complex distributed...
-
Site Reliability Engineer
2 weeks ago
Canberra, ACT, Australia NetApp Full time
Job Summary
Our TechOps Engineers are the frontline team keeping our large fleet of cloud-hosted Apache Kafka, Cassandra, OpenSearch, Cadence, Valkey, ClickHouse and PostgreSQL clusters up and running. Every day you will diagnose and solve challenging and interesting technical problems, providing a service that is relied on by some of the leading global names...
-
Reliability Engineering Lead
1 week ago
Canberra, ACT, Australia beBeeReliability Full time $140,000 - $190,000
About Us
We empower businesses to automate routine tasks, surface actionable insights, and connect with the right data, advisors, and apps. This helps small businesses build a stronger economy.
About the Team
Our Incident and Problem Management team is part of the Site Reliability Engineering organization, responsible for robust process and tooling around...
-
Site Reliability Infrastructure Specialist
2 weeks ago
Canberra, ACT, Australia beBeeEngineer Full time $120,000 - $160,000
Job Overview
We are seeking a skilled and innovative professional to join our team as a Site Reliability Engineer.
Key Responsibilities
Design, deploy, and manage scalable infrastructure solutions using OpenStack, Kubernetes, and software-defined storage technologies.
Implement DevSecOps practices for applications running on the engineered...