
Site Reliability Engineer 3
24 hours ago
The Company
Serving the People Who Serve the People
Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and their constituents together. We are on a mission to support our customers by meeting the needs of their communities and implementing our technology in ways that are equitable and inclusive. Granicus has consistently appeared on the GovTech 100 list over the past 5 years and has been recognized as the best companies to work on BuiltIn.
Over the last 25 years, we have served 5,500 federal, state, and local government agencies and more than 300 million citizen subscribers powering an unmatched Subscriber Network that uses our digital solutions to make the world a better place. With comprehensive cloud-based solutions for communications, government website design, meeting and agenda management software, records management, and digital services, Granicus empowers stronger relationships between government and residents across the U.S., U.K., Australia, New Zealand, and Canada. By simplifying interactions with residents, while disseminating critical information, Granicus brings governments closer to the people they serve—driving meaningful change for communities around the globe.
Want to know more? See more of what we do here .
Summary Description - Site Reliability Engineer (Kubernetes, GCP) Granicus is seeking an experienced and highly skilled Senior Site Reliability Engineer (SRE) to join our SRE team. As a Senior SRE, you will play a pivotal role in ensuring the reliability, scalability, and performance of our services. You will lead efforts in building and maintaining a robust infrastructure, automating processes, and guiding the team to implement best practices in site reliability.
Essential Function
On-call Production Support: Provide production support on a shift according to the team on-call roster.
Work on the customer and internal engineering/implementation team raised tickets while not on-call for production support. For example, a client may request to correct some data on the database server which cannot be done through the web interface.
Work on SREs backlog items.
Monitor and Maintain Systems: Continuously monitor the health and performance of our services, systems, and infrastructure. Respond to alerts and incidents promptly to ensure high availability.
Automate Processes: Develop and maintain automation scripts and tools to streamline operations and reduce manual intervention.
Incident Management: Assist in troubleshooting and resolving incidents, performing root cause analysis, and implementing long-term fixes to prevent recurrence.
System Improvements: Participate in designing and implementing system improvements to enhance reliability, scalability, and performance.
Collaboration: Work closely with software engineers to understand application requirements, provide feedback on design and architecture, and support deployment and release processes.
Documentation: Create and maintain documentation for processes, procedures, and troubleshooting guides to ensure knowledge sharing within the team.
Capacity Planning: Assist in capacity planning activities to anticipate future needs and ensure that our infrastructure can handle growth.
Security: Implement and adhere to security best practices to protect our systems and data.
Knowledge/Skills/Abilities Technical Skills: Good understanding of Linux/Unix systems, networking, and cloud services (AWS, Azure, or Google Cloud). Experience with scripting languages such as Python, Bash, or Ruby.
Education: Bachelor's or Master's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.
Experience: 5+ years of experience in site reliability engineering, system administration, or a similar role, with a proven track record of managing large-scale, high-availability systems
Technical Skills: Expertise in Linux/Unix systems, networking, and cloud services (AWS, Azure, or Google Cloud). Proficiency in scripting languages (Python, Bash, Ruby) and programming languages (Go, Java, C++).
Tools and Technologies: Advanced knowledge of monitoring and logging tools (Prometheus, Grafana, Splunk), configuration management (Ansible, Chef, Puppet), and CI/CD pipelines.
Problem-Solving: Strong analytical and problem-solving skills with the ability to diagnose and resolve complex issues efficiently.
Communication: Excellent verbal and written communication skills, with the ability to convey complex technical concepts to non-technical stakeholders.
Leadership: Demonstrated ability to lead and mentor a team, drive projects to completion, and manage cross-functional initiatives.
Experience/Credentials: 5+ years experience in a SRE, DevOps or Software Engineering role
Certifications: Relevant certifications such as AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or similar.
Knowledge: In-depth understanding of containerization (Docker, Kubernetes) and infrastructure as code (Terraform, CloudFormation).
Experience: Experience with database management (SQL, NoSQL), load balancing, and distributed systems.
Other Job Info
These statements are intended to describe the general nature and level of work being performed by employees assigned to this job. This is not intended to be an exhaustive list of all responsibilities, duties, and skills required of employees assigned to this job.
This role is typically performed on a computer using Zoom or Teams. Individuals will be on camera throughout the day engaging with other employees. The role is typically performed indoors within a home office environment. This role is typically performed while sitting or standing at a desk. The individual will occasionally lift light objects.
Academic Qualifications and Certifications:
Bachelor's degree in computer science, Information Technology, or a related field, or equivalent practical experience
Shift Time :
The position requires flexibility in working hours to cover for any overlap and attend team meetings as needed.
Shift Time: 24/7 on-call, including weekends (typically two weeks every month)
Security Requirement: Responsible for Granicus information security by appropriately preserving the Confidentiality, Integrity, and Availability (CIA) of Granicus information assets in accordance with the company's information security program.
ClosingfromDefault - All locations
Don't have all the skills/experience mentioned above? At Granicus, we are trying to build diverse, inclusive teams. We do not have degree requirements for most of our roles. If you don't meet every requirement above but are excited to learn more, we encourage you to apply. We might just be able to find another role that could be a perfect fit
Security and Privacy Requirements
-Responsible for Granicus information security by appropriately preserving the Confidentiality, Integrity, and Availability (CIA) of Granicus information assets in accordance with the company's information security program.
-Responsible for ensuring the data privacy of our employees and customers, their data, as well as taking all required privacy training in a timely manner, in accordance with company policies.
The Team
- We are a remote-first company with a globally distributed workforce across the United States, Canada, United Kingdom, India, Armenia, Australia, and New Zealand.
The Culture
- At Granicus, we are building a transparent, inclusive, and safe space for everyone who wants to be
a part of our journey.
- A few culture highlights include –Employee Resource Groups to encourage diverse voices
- Coffee with Mark sessions – Our employees get to interact with our CEO on very important and
sometimes difficult issues ranging from mental health to work-life balance and current affairs.
- Microsoft Teams communities focused on wellness, art, furbabies, family, parenting, and more.-=- - We bring in special guests from time to time to discuss issues that impact our employee
population
The Impact
- We are proud to serve dynamic organizations around the globe that use our digital solutions to make the world a better place — quite literally.We have so many powerful success stories that illustrate how our solutions are impacting the world. See more of our impact here .
Granicus is committed to providing equal employment opportunities. All qualified applicants and employees will be considered for employment and advancement without regard to race, color, religion, creed, national origin, ancestry, sex, gender, gender identity, gender expression, physical or mental disability, age, genetic information, sexual or affectional orientation, marital status, status regarding public assistance, familial status, military or veteran status or any other status protected by applicable law.
#J-18808-Ljbffr
-
Lead Site Reliability Engineer
24 hours ago
Perth, Western Australia Talent Full timeTalent Perth, Western Australia, AustraliaJoin or sign in to find your next job Join to apply for the Lead Site Reliability Engineer role at TalentTalent Perth, Western Australia, Australia1 day ago Be among the first 25 applicantsJoin to apply for the Lead Site Reliability Engineer role at TalentPerth based role12 month initial contractsAustralia...
-
Site Reliability
2 weeks ago
Perth, Western Australia Canonical Full timeJoin to apply for the Site Reliability / Gitops Engineer role at Canonical3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability / Gitops Engineer role at CanonicalCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used...
-
Site Reliability
2 weeks ago
Perth, Western Australia Canonical Full timeJoin to apply for the Site Reliability / Gitops Engineer role at Canonical3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability / Gitops Engineer role at CanonicalCanonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used...
-
System Reliability Engineer
24 hours ago
Perth, Western Australia ShiftCare Full time1 week ago Be among the first 25 applicantsGet AI-powered advice on this job and more exclusive features.We're on the lookout for a passionate and exceptional reliability engineer to join our dynamic team and help us transform the homecare industry. Rally with us in creating meaningful experiences for our hyper-growth healthcare startup.Why ShiftCare?We're...
-
Site Reliability Engineer
24 hours ago
Perth, Western Australia Buscojobs Full timeRetail Sector | Sydney (Hybrid)Our client, a leading name in retail, is seeking an experienced Site Reliability Engineer (SRE) to join their team on a contract basis. You will play a key role in fabricating infrastructure, driving cost-reduction initiatives, and supporting critical website and mobile applications that operate at a massive scale.What You'll...
-
Reliability Engineer
24 hours ago
Perth, Western Australia Norton Gold Fields Limited Full timeJob Category: Mining - Engineering & MaintenanceWORK SITE: BinduliROSTER: 8:6 Day Shift Only - 12hrs (Residential or FIFO from Perth)PACKAGE: Salary + Performance Bonus + Residential Allowance + Site Allowance + Family Private Health Insurance or Allowance + Death and TPD Coverage or Allowance + Mental Health Programs + Gym Membership Discount + Other...
-
Reliability Engineer
24 hours ago
Perth, Western Australia Norton Gold Fields Limited Full timeJob Category: Mining - Engineering & MaintenanceWORK SITE:BinduliROSTER:8:6 Day Shift Only – 12hrs (Residential or FIFO from Perth)PACKAGE:Salary + Performance Bonus + Residential Allowance + Site Allowance + Family Private Health Insurance or Allowance + Death and TPD Coverage or Allowance + Mental Health Programs + Gym Membership Discount + Other...
-
Principal Site Reliability Engineer
2 days ago
Perth, Western Australia Buscojobs Full timeYou are passionate about SRE and systems engineeringWe are undergoing one of Australia's largest digital transformationsTogether we can reimagine banking for millions of customersDo work that mattersWe're accelerating our digital strategy with an ambition to provide customers with one of the best digital experiences of any company globally. Site Reliability...
-
Reliability Engineer
24 hours ago
Perth, Western Australia fmgl Full time $90,000 - $120,000 per yearOur Opportunity Work Location: Fortescue's Christmas Creek mine, located on the traditional lands of the Palyku and Nyiyaparli peoples.Roster: 8D/ 6R – FIFO ex PerthWe are seeking an experienced Reliability Engineer to join our team at Christmas Creek on a family-friendly 8/6 roster. In this role, you will provide reliability engineering support,...
-
Reliability Engineer
2 weeks ago
Perth, Western Australia Fortescue Full timeSelect how often (in days) to receive an alert: Work Location: Fortescue's Christmas Creek mine, located on the traditional lands of the Palyku and Nyiyaparli peoples.Roster: 8 D/ 6 R – FIFO ex Perth We are seeking an experienced Reliability Engineer to join our team at Christmas Creek on a family-friendly 8/6 roster.In this role, you will provide...