Software Reliability Engineer
1 week ago
Job Description
Join the team redefining how the world experiences design.
Hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte
Thanks for stopping by. We know job hunting can be a little time consuming and you're probably keen to find out what's on offer, so we'll get straight to the point.
Where and how you can work
Our flagship campus is in Sydney. We also have a campus in Melbourne and co-working spaces in Brisbane, Perth and Adelaide. But you have choice in where and how you work, we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals.
What you'd be doing in this role
As Canva scales change continues to be part of our DNA. But we like to think that's all part of the fun. So this will give you the flavour of the type of things you'll be working on when you start, but this will likely evolve.
At the moment, this role is focused on:
- Being responsible for building and improving our observability platform and tooling, which is used by all Canva engineers.
- Providing technical leadership and expertise to drive pragmatic solutions and achieve impactful design decisions.
- Brainstorming, researching and prototyping to optimize our tracing and exceptions platforms, improve our operational effectiveness and increase reliability.
- Being proactive in improving the tracing user experience and advocating for best practices.
- Finding ways to improve the use of traces and exceptions, providing better insights to our engineers.
- Enhancing our exception workflow to help engineers seamlessly capture errors, gain actionable insights through clear visualizations, and set up high-signal, low-noise alerts.
- Participating in team ceremonies, knowledge sharing and brainstorming sessions.
- Becoming an observability champion, evangelising best practices and guiding other Canvanauts in the observability space.
You're probably a match if
- You are proficient and happy to code in Python, Java or Golang.
- You have deep knowledge and understanding of Computer Engineering fundamentals and first principles.
- You have a solid knowledge of AWS (EC2, EKS, Lambda, SQS, Kinesis, S3) or equivalent.
- You have experience deploying and running containerized workloads on a platform like Kubernetes.
- You have experience with Observability Tooling – having competency with tools like Elasticsearch, Grafana, Sentry, Jaegar Tracing or similar.
- Experience running highly available and reliable distributed systems, with highly scalable data stores.
- You are proficient with infrastructure-as-code - we're a Terraform shop, but strong experience with other IaC tools will do the trick.
Not essential; but helpful experience
- You have experience with OpenTelemetry because it underpins a lot of the infrastructure and tooling that the team owns.
- You have experience writing application code in Java or frontend code in TypeScript, since we also maintain the tracing libraries.
- You have experience building and running monitoring infrastructure at scale. For example, Petabyte-scale Elasticsearch clusters or similar databases.
- You have experience with data handling at scale.
- You have experience with Clickhouse.
- You have experience with data security, data obfuscation and PII detection.
About the team
You'll join The Observability Traces & Exceptions Team, responsible for operational insights inside Canva. Our goal is to provide our development team with world-class tools to view how their services are performing in production. We achieve this by combining industry-leading third-party solutions with our own solutions developed in-house.
We work across the entire stack maintaining our TypeScript and Java tracing libraries, our tracing infrastructure, error reporting libraries and error handling guidelines to name just a few. As we scale all of these areas, we require more sophisticated solutions to ensure that Canva developers can continue to grow without compromising on reliability or availability.
What's in it for you?
Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of magic, connectivity and fun woven throughout life at Canva, too. We also offer a range of benefits to set you up for every success in and outside of work.
Here's a taste of what's on offer:
- Equity packages - we want our success to be yours too
- Inclusive parental leave policy that supports all parents & carers
- An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more
- Flexible leave options that empower you to be a force for good, take time to recharge and supports you personally
Check out for more info.
Other stuff to know
We see AI as a powerful amplifier of creativity and technology at Canva. We're evolving how we assess AI skills in our Technology hiring experience - you'll tackle interactive, real-time challenges that reflect the kind of work we do. In some interviews, you may also be asked to solve a problem using an AI tool to show how you approach challenges with tech by your side. Your recruitment partner will walk you through what to expect. We make hiring decisions based on your experience, skills and passion, as well as how you can enhance Canva and our culture.
When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process. We celebrate all types of skills and backgrounds at Canva, so even if you don't feel like your skills quite match what's listed above - we still want to hear from you
Please note that interviews are conducted virtually.
-
Senior Software Engineer
3 days ago
Sydney, New South Wales, Australia Software At Scale Full time $120,000 - $180,000 per yearOur 'black belt' specialists are leaders in their domains: digital champions, delivery-focused experts, top-tier security professionals, AI thought leaders, and engineering best practice advocates.With a global presence and local expertise, we deliver innovative solutions without compromising on quality. Our multidisciplinary teams provide tailored expertise...
-
Senior Software Engineer
3 days ago
Sydney, New South Wales, Australia Software At Scale Full time $120,000 - $180,000 per yearOur 'black belt' specialists are leaders in their domains: digital champions, delivery-focused experts, top-tier security professionals, AI thought leaders, and engineering best practice advocates.With a global presence and local expertise, we deliver innovative solutions without compromising on quality. Our multidisciplinary teams provide tailored expertise...
-
Senior Software Engineer
1 week ago
Sydney, New South Wales, Australia Software At Scale Full time $120,000 - $180,000 per yearOur 'black belt' specialists are leaders in their domains: digital champions, delivery-focused experts, top-tier security professionals, AI thought leaders, and engineering best practice advocates.With a global presence and local expertise, we deliver innovative solutions without compromising on quality. Our multidisciplinary teams provide tailored expertise...
-
Senior Software Engineer
5 days ago
Sydney, New South Wales, Australia LEAP Legal Software Full timeAbout LEAPLEAP is the leading provider of Legal Practice Management Solutions in the world and is part of ATI – one of the largest international LegalTech companies. For more than 30 years, our curiosity and commitment to continual improvement has kept us reimagining productivity tools for lawyers and their staff to support our guiding purpose, to 'Help...
-
Site Reliability Engineer
3 days ago
Sydney, New South Wales, Australia N2S Full time $120,000 - $180,000 per yearWe are looking for aSite Reliability Engineer (SRE)to join our team and ensure the reliability, scalability, and performance of our software systems. This role bridges the gap between software development and IT operations, focusing on automation, monitoring, and incident response to maintain high system uptime and user satisfaction.Key...
-
Reliability Engineer
1 week ago
Sydney, New South Wales, Australia KBR, Inc. Full time $120,000 - $180,000 per yearTitle:Reliability EngineerAt KBR – We do things that matter. We deliver science, technology and engineering solutions to governments and companies around the world. KBR employs approximately 38,000 people worldwide with customers in more than 80 countries and operations in over 29 countries.KBR is proud to work with its customers across the globe to...
-
Staff Software Reliability Engineer
1 week ago
Sydney, New South Wales, Australia Canva Full time $120,000 - $180,000 per yearJob DescriptionJoin the team redefining how the world experiences design.Hey, g'day, mabuhay, kia ora, 你好, hallo, vítejteThanks for stopping by. We know job hunting can be a little time consuming and you're probably keen to find out what's on offer, so we'll get straight to the point.Where And How You Can WorkOur flagship campus is in Sydney. We also...
-
Senior Software Engineer
5 days ago
Sydney, New South Wales, Australia Aurec Full timeSoftware Engineer - Python - Platform - Data Who they are A global fintech powerhouse is looking for a Software Engineer to help design, build and scale data platforms that sit at the heart of their world-class trading and research operations. This is a high-impact engineering role, ideal for someone who blends strong software fundamentals with a passion...
-
Site Reliability Engineer
24 hours ago
Sydney, New South Wales, Australia Luminance Full time $120,000 - $180,000 per yearThe Role Luminance's Site Reliability team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a crucial role in incident response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of...
-
Site Reliability Engineer
21 hours ago
Sydney, New South Wales, Australia Luminance Full time $120,000 - $180,000 per yearThe RoleLuminance's Site Reliability team combines strong problem solving, infrastructure tooling and wider DevOps practices to provide a service of Luminance's unique software applications. The team plays a crucial role in incident response and issue resolution, swiftly addressing and resolving service interruptions to maintain the highest level of customer...