Site Reliability Engineering Manager, Core Data
3 days ago
At Google, we have a vision of empowerment and equitable opportunity for all Aboriginal and Torres Strait Islander peoples, and we commit to building reconciliation through Google's technology, platforms and people. We welcome Indigenous applicants. Please see our Reconciliation Action Plan for more information.
Minimum qualifications:
- Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
- 8 years of experience with software development in one or more programming languages.
- 3 years of experience in designing, analyzing, and troubleshooting distributed systems.
- 3 years of experience with managing people or teams.
- 3 years of experience with leading projects.
Preferred qualifications:
- Master's degree in Computer Science or Engineering.
- 8 years of experience with data structures and algorithms.
- 5 years of experience with software development in one or more programming languages.
About the job
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services, both our internally critical and our externally visible systems, have reliability and uptime appropriate to users' needs and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.
Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale that are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large-scale system design.
SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.
To learn more: check out our books on Site Reliability Engineering or read a career profile about why a Software Engineer chose to join SRE.
Caching Site Reliability Engineering (SRE) is a team in Core Data foundations that manages critical business- and user-impacting services.
As a Site Reliability Engineering Manager, you will foster a culture of engineering excellence, drive technical strategy, and develop a high-performing, collaborative team. Your role is pivotal in ensuring our services meet stringent Service Level Objectives (SLOs) and in building resilient, automated production environments.
Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.
Responsibilities
- Lead, mentor, and grow a team of Site Reliability Engineers, guide their professional development, and ensure their success.
- Manage the availability, latency, performance, and efficiency of services.
- Develop and execute the goal and strategy for the Site Reliability Engineering (SRE) team, aligning with organizational goals.
- Participate in, manage and improve on-call rotations spanning multiple continents, ensuring healthy on-call practices and response.
- Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.