Principal Site Reliability Administrator

2 weeks ago


Canberra, Australia opentext Full time

**OPENTEXT - THE INFORMATION COMPANY**

As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management.

**The Opportunity**:
You will join a team of globally located Site Reliability Engineers to design, automate, build, operate, and continuously improve some of the services that back our customer facing SaaS products. You will be responsible for delivering an operating a highly available design that includes security, scalability, monitoring, upgradeability, and data backup and recovery, across non-production and production environments. You will work in a fast-paced organization while quickly learning new skills and creating ways to consistently meet service-level agreements for our global cloud services.

The best person for this role is someone that has a collaborative spirit - in our world, it is not about being a hero and having all the answers, it is about sometimes saying "I don't know" and working on finding solutions rather than starting with an assumption. The team needs someone who can ask questions, learn from others, and turn chaos into order. This role would be a great fit for someone with creative and innovative critical thinking skills. You will develop and implement solutions that operate at scale. Our teams are empowered an expected to improve our products to truly deliver a reliable experience to customers.

**Your Responsibilities Will Include**:
Designing, automating, building, operating, and continuously improving multiple backing services including Oracle, SQL Server, and Postgres.
Migrating VM based deployments to containerized solutions or managed services in AWS, GCP, or Azure
Identifying tactical and strategic opportunities to improve service health, performance, reliability, and telemetry
Contributing to capacity planning and management processes
Supporting the migration of legacy deployments to modernized design patterns
Supporting and responding to service requests that satisfy our OLAs
Supporting incident resolution process for backing services that we are responsible for
Participating in training and information sharing activities
Interacting with third party provider(s) who provide additional expertise and a layer of escalation support for our services

Acting as backup for other team members when necessary
Problem solving and finding solutions to resolve issues
Building repeatable technology design patterns
Learning new technology on your own or in conjunction with an online learning platform
Creating and updating documentation such as operational procedures, change execution plans, and incident write-ups
Will require shift work
On-call rotation is required, as 7x24x365 support is necessary

**Qualifications**:
Bachelor’s Degree in Computer Engineering or related field
8+ years of Information Technology experience, working on large scale enterprise systems
8+ years of experience working within the Linux operating system
3+ years of operations experience for one or more of the backing services that we are supporting (GCP Cloud SQL, AWS RDS and Azure SQL Database)
Intermediate knowledge of private and public cloud infrastructure platforms (VMware/AWS/GCP)
Firsthand experience with configuration of monitoring an alerting tool such as Prometheus, Grafana, Zabbix, Everbridge and/or Pager Duty
Experience with automation or CI/CD tools, such as Terraform, Ansible, and GitLab
Should be extremely detail oriented and meticulous
Strong written and verbal communication skills
Ability to thrive in a fast-paced environment working on projects against strict deadlines
Strong understanding of ITIL principles, certification is a plus
Ability to diagnose and troubleshoot user facing service incidents & outages
Understanding availability and performance monitoring tools and concepts

**Additional Value-Added Qualifications**:
Administrative experience with relational databases such as Oracle, SQL Server, or PostgreSQL
Clustering / load balancing concepts
Awareness an insight into industry trends (technology, methods, and tooling)
Experience with Run Deck or Ansible Tower



  • Canberra, ACT, Australia Opentext Full time

    **OPENTEXT - THE INFORMATION COMPANY**As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management.**The Opportunity**:You will join a team of globally located Site Reliability Engineers to design,...


  • Canberra, Australia opentext Full time

    **OPENTEXT - THE INFORMATION COMPANY** As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management. **The opportunity** As a Site Reliability Engineer (SRE) Principle, you will join a global...


  • Canberra, ACT, Australia Opentext Full time

    **OPENTEXT - THE INFORMATION COMPANY**As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management.**The opportunity**As a Site Reliability Engineer (SRE) Principle, you will join a global team,...


  • Canberra, Australia Open Text Corporation Full time

    **Lead Site Reliability Administrator**: - Req id: 35177- Canberra, ACT, AU**OPENTEXT - THE INFORMATION COMPANY** As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management. **The...

  • Site Reliability

    7 days ago


    Canberra, ACT, Australia Canonical Full time

    Canonical Canberra, Australian Capital Territory, Australia Join or sign in to find your next job Join to apply for the Site Reliability / Gitops Engineer role at Canonical Canonical Canberra, Australian Capital Territory, Australia 1 day ago Be among the first 25 applicants Join to apply for the Site Reliability / Gitops Engineer role at Canonical ...

  • Site Reliability

    7 days ago


    Canberra, ACT, Australia Canonical Full time

    Canonical Canberra, Australian Capital Territory, AustraliaJoin or sign in to find your next jobJoin to apply for the Site Reliability / Gitops Engineer role at CanonicalCanonical Canberra, Australian Capital Territory, Australia1 day ago Be among the first 25 applicantsJoin to apply for the Site Reliability / Gitops Engineer role at CanonicalCanonical is a...


  • Canberra, ACT, Australia Canonical Full time

    Senior Site Reliability / Gitops EngineerCanonical Canberra, Australian Capital Territory, AustraliaJoin or sign in to find your next jobJoin to apply for the Senior Site Reliability / Gitops Engineer role at CanonicalSenior Site Reliability / Gitops EngineerCanonical Canberra, Australian Capital Territory, Australia2 days ago Be among the first 25...


  • Canberra, ACT, Australia beBeeEngineer Full time $120,000 - $160,000

    Job OverviewWe are seeking a skilled and innovative professional to join our team as a Site Reliability Engineer.Key ResponsibilitiesDesign, deploy, and manage scalable infrastructure solutions using OpenStack, Kubernetes, and software-defined storage technologies.Implement devsecops practices for applications running on the engineered...


  • Canberra, n Capital Territory, Australia Kompozition Full time $120,000 - $180,000 per year

    We are seeking a highly skilled Site Reliability Engineer to design, implement, and maintain reliable, scalable, and secure cloud-based systems. Working closely with cross-functional teams, you will be responsible for the stability and performance of our development and production environments, driving continuous improvement across infrastructure, deployment...


  • Canberra, Australia opentext Full time

    **OPENTEXT - THE INFORMATION COMPANY** As the Information Company, our mission at OpenText is to create software solutions and deliver services that redefine the future of digital. Be part of a winning team that leads the way in Enterprise Information Management. **The opportunity** As a Site Reliability Engineer (SRE) Lead, you will join a global team,...