Intermediate Site Reliability Engineer, Database Operations
7 hours ago
GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating human progress. Our platform unites teams and organizations, breaking down barriers and redefining what's possible in software development. Thanks to products like Duo Enterprise and Duo Agent Platform, customers get AI benefits at every stage of the SDLC.
The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier, with all team members expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact. GitLab is where careers accelerate, innovation flourishes, and every voice is valued. Our high-performance culture is driven by our values and continuous knowledge exchange, enabling our team members to reach their full potential while collaborating with industry leaders to solve complex problems. Co-create the future with us as we build technology that transforms how the world develops software.
An overview of this role
You will join our Database Operations team as an Intermediate Site Reliability Engineer, keeping GitLab.com, one of the largest single-tenancy open source SaaS platforms on the internet, running smoothly and reliably. In this role, you'll take ownership of the PostgreSQL database infrastructure that powers millions of developers worldwide, automating operational tasks, improving system performance and reliability, and designing solutions that scale to support hundreds of thousands of concurrent users. You'll work at a unique scale where your decisions directly impact the experience of our customers and the feedback you generate informs product development across GitLab. Over your first year, you'll establish expertise in a core area of database operations, mentor junior team members, and drive projects that deliver measurable improvements to system efficiency and reliability.
You bring both pragmatic operational discipline and software craftsmanship to this role. You're not just responding to incidents—you're designing systems and building automation that prevent them. You'll partner with engineering teams across GitLab to review database changes, optimize performance, and help others succeed through self-service tooling and knowledge sharing. This is hands-on infrastructure work at scale, where your contributions directly shape how reliably and securely GitLab serves the entire platform.
Some examples of projects you could work on:
- Design and implement mature automation for database provisioning, replication, and backup testing using tools like Terraform and Ansible.
- Develop self-service tools and dashboards that empower other teams to manage their own database resources.
- Lead capacity planning and scalability initiatives to ensure GitLab.com continues growing reliably.
- Participate in production incident response and help implement systemic improvements to prevent recurrence.
What you'll do
- Automate operational tasks across all environments—from package updates and configuration changes to provisioning of user-facing services—so manual effort becomes the exception, not the rule.
- Design and maintain PostgreSQL database infrastructure components that allow GitLab.com to scale reliably while supporting hundreds of thousands of concurrent users.
- Respond to production incidents and platform emergencies, working with peer SREs to diagnose and resolve database-related issues quickly and thoroughly.
- Build observability systems that monitor database health, predict capacity needs based on usage patterns, and alert on symptoms rather than outages.
- Develop and ship database performance solutions in collaboration with product and engineering teams, including query optimization, migration reviews, and infrastructure recommendations.
- Create self-service tools and automation—using Terraform, Ansible, Chef, and GitLab ChatOps—that empower engineering teams to manage their own database interactions safely.
- Document decisions, learnings, and operational procedures so that knowledge becomes repeatable actions and eventually becomes automation.
- Participate in regularly scheduled on-call rotations to ensure GitLab.com remains operational during off-hours and weekends when necessary.
What you'll bring
- Hands-on experience running PostgreSQL in high-growth, large production environments, including both self-managed infrastructure and database-as-a-service platforms.
- Expertise with infrastructure automation and configuration management tools such as Ansible, Terraform, Chef, or Puppet to automate operational tasks and drive system reliability.
- Solid understanding of SQL, PL/pgSQL, data modeling, and data structure design; ability to analyze PostgreSQL internals to troubleshoot and optimize systems.
- Experience working in large-scale, distributed SaaS production environments where you've managed reliability, performance, and scalability challenges at significant scale.
- Strong written communication skills and commitment to documentation; you thrive in remote, asynchronous environments and share knowledge effectively across your team.
- Proactive, hands-on approach where you identify issues, take ownership of solutions, and contribute improvements to infrastructure and code.
- Capability to mentor junior team members and develop deep expertise in your domain areas, then share that knowledge to help others grow.
- Backend engineering experience with languages such as Ruby or Go, and/or familiarity with OLAP databases like ClickHouse.
About the team
We are responsible for building, running, and evolving the entire lifecycle of the PostgreSQL database engine that powers GitLab.com. You'll be part of our team focused on owning the reliability, scalability, performance, and security of our database infrastructure and supporting services. GitLab.com is one of the largest single-tenancy open source SaaS sites on the internet, which means your work directly impacts hundreds of thousands of concurrent users worldwide. We operate in a fully distributed, asynchronous environment across multiple regions, collaborating on everything from database automation and infrastructure design to incident response and capacity planning. You'll be solving novel challenges at scale, from implementing observability stacks that predict capacity needs to designing the infrastructure components that allow GitLab to scale reliably. We continuously seek to reduce complexity and improve efficiency by leveraging cloud vendor managed products and services where appropriate, ensuring GitLab.com remains a best-in-class production environment. For more on how we operate, see the Database Operations Team Handbook Page.
How GitLab will support you
- Benefits to support your health, finances, and well-being
- Flexible Paid Time Off
- Team Member Resource Groups
- Equity Compensation & Employee Stock Purchase Plan
- Growth and Development Fund
- Parental leave
- Home office support
Please note that we welcome interest from candidates with varying levels of experience; many successful candidates do not meet every single requirement. Additionally, studies have shown that people from underrepresented groups are less likely to apply to a job unless they meet every single qualification. If you're excited about this role, please apply and allow our recruiters to assess your application.
Country Hiring Guidelines: GitLab hires new team members in countries around the world. All of our roles are remote; however, some roles may carry specific location-based eligibility requirements. Our Talent Acquisition team can help answer any questions about location after starting the recruiting process.
Privacy Policy: Please review our Recruitment Privacy Policy. Your privacy is important to us.
GitLab is proud to be an equal opportunity workplace and is an affirmative action employer. GitLab's policies and practices relating to recruitment, employment, career development and advancement, promotion, and retirement are based solely on merit, regardless of race, color, religion, ancestry, sex (including pregnancy, lactation, sexual orientation, gender identity, or gender expression), national origin, age, citizenship, marital status, mental or physical disability, genetic information (including family medical history), discharge status from the military, protected veteran status (which includes disabled veterans, recently separated veterans, active duty wartime or campaign badge veterans, and Armed Forces service medal veterans), or any other basis protected by law. GitLab will not tolerate discrimination or harassment based on any of these characteristics. See also GitLab's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know during the recruiting process.