Senior Site Reliability Engineer
vor 2 Wochen
At , we are building the world's first platform to create full-stack, on-chain applications through natural language. Our mission is to make building software as simple as a conversation, transforming ideas into live applications instantly. We are a cross-functional team of engineers and researchers building the AI that will power this new paradigm. To do this, we need to ensure the platform that performs this magic is exceptionally reliable, fast, and scalable.
About the Role
As a Senior Site Reliability Engineer, you will be the guardian of the user experience. You are not just keeping servers online; you are ensuring the end-to-end reliability of the core "idea-to-application" journey. Your focus will be on the availability, reliability, and scalability of our user-facing products and the complex AI-driven microservices that power them. You will be deeply embedded with our product and engineering teams, acting as the critical bridge between our ambitious AI vision and a rock-solid production reality.
This is a hands-on role for an engineer who thinks about reliability from the user's perspective and wants to provide the best developer experience for your fellow engineers and wants to solve novel challenges in a rapidly evolving AI/ML environment.
What You'll Do
- Own Product Reliability:
Take ownership of the availability and reliability of the platform. You'll define our Service Level Objectives (SLOs), provide a reliable Continuous Delivery (CD) platform and work across teams to meet and exceed them. - Build Deep Product Insight:
Design, implement, and manage our observability stack (Datadog, Opentelemetry, distributed tracing, logs, metrics) to provide high-fidelity signals into the health of our services and, most importantly, the user experience. - Engineer Scalable Solutions:
Dive deep into our architecture to identify and eliminate performance bottlenecks, single points of failure, and sources of toil. You'll write code—primarily in Rust, Go and Typescript (we use Pulumi)—to automate operations and build robust, self-healing systems. You will setup routing and service mesh configurations (e.g. Istio). - Champion Reliability from Day One:
Partner with software engineers during design and code reviews to proactively bake in reliability, scalability, and operability. You will be the expert voice that helps the team build for production from the start. - Lead and Learn from Incidents:
Coordinate the incident response process for our production services. You'll lead blameless post-mortems that drive meaningful improvements across our systems and processes. - Participate in an On-Call Rotation:
As a key member of the team, you will be part of a compensated on-call rotation focused on coordinating incident response and ensuring platform stability.
Who You Are
- You are a product-minded engineer with proven experience as a Site Reliability Engineer, with a strong focus on user-facing applications and distributed service architectures.
- You have deep expertise in building and running modern observability stacks (e.g., Datadog, Opentelemetry) and believe in data-driven decision-making.
- You are a proficient software developer. You have experience designing and writing production-grade applications and automation,
ideally in a systems
and infra language like
Rust
or
Go
, and are open to use
Python
, Typescript or Bash.
- You are a methodical troubleshooter, capable of systematically diagnosing complex issues across the entire stack, from networking protocols (TCP/IP, DNS, TLS) up to the application layer.
- You understand the complexities of modern CI/CD pipelines and have experience building and maintaining them.
- You thrive in a collaborative environment and possess excellent communication skills, capable of explaining complex technical concepts to a diverse audience.
Bonus
- You have experience with the reliability and performance challenges of AI/ML-powered systems or large-scale data processing pipelines.
*This is a hybrid role based in our Zurich office, with a requirement of 3+ days in the office per week.
-
Senior Site Reliability Engineer
vor 2 Wochen
Zürich, Zürich, Schweiz bd585dbf-1f4e-4b99-9513-b347ddcb2e30 Vollzeit CHF 80'000 - CHF 150'000 pro JahrAt , we are building the world's first platform to create full-stack, on-chain applications through natural language. Our mission is to make building software as simple as a conversation — transforming ideas into live applications instantly. We are a cross-functional team of engineers and researchers building the AI that powers this new paradigm. To do...
-
Site Reliability Engineer
Vor 6 Tagen
Zürich, Zürich, Schweiz Thrive IT Systems Vollzeit CHF 120'000 - CHF 180'000 pro JahrSite Reliability Engineer, Events (Enterprise Messaging)Your roleWe are a group of professionals who enjoy identifying areas of improvement and engineering better solutions. As a crew, we do our best to create a supportive environment where each of us feel appreciated and have a chance to develop professionally.We are looking for candidates to take the role...
-
Site Reliability Engineer
Vor 4 Tagen
Zürich, Zürich, Schweiz Us3 Consulting Vollzeit CHF 80'000 - CHF 120'000 pro JahrSite Reliability Engineer (SRE Engineer)Onsite 100% RoleLocation: ZurichDuration of contract: 6 monthsGerman/ French and EnglishFocus onSRE experiencein Information Technology Enterprise and Infrastructure engineering support/engineering roles.Proficient in Linux/Unix, Full Microsoft Stack (AD, Exchange), Network Infrastructure (Cisco), Application...
-
Senior Site Reliabilty Engineer
vor 1 Woche
Zürich, Zürich, Schweiz Avaloq Vollzeit CHF 120'000 - CHF 180'000 pro JahrJoin our Technology R&D Lab as a Senior Site Reliability Engineer and help shape the operational foundation of a new generation of cloud-native, composable banking platforms. You will design and evolve the systems, automation, and practices that keep our SaaS products reliable, observable, and secure as we scale globally. You will work closely with...
-
Zürich, Zürich, Schweiz EPAM Systems Vollzeit CHF 120'000 - CHF 180'000 pro JahrWe are seeking a skilledSenior Site Reliability Engineerto join our crew in Zürich. In this role, you will provide 3rd level/SRE support for IBM MQ in distributed and mainframe environments, focusing on planning, configuring, building, migrating and administering MQ managers. You will work closely with payments application teams to help drive automation,...
-
Senior Site Reliabilty Engineer
vor 1 Woche
Zürich, Zürich, Schweiz Avaloq Vollzeit CHF 120'000 - CHF 180'000 pro JahrSorry, Internet Explorer 11 is no longer supported by SmartRecruiters Please update to one of the following browsers:Google Chrome Microsoft Edge Apple Safari Mozilla Firefox You can find details about supported web browsershere. Senior Site Reliabilty Engineer Company Description Founded and headquartered in Switzerland, Avaloq is continuously...
-
Site reliability engineer
vor 1 Woche
Zürich, Zürich, Schweiz Rocken AG Vollzeit CHF 100'000 - CHF 150'000 pro JahrSite reliability engineer Rocken AG days ago Role details Contract type Permanent contract Employment type Full-time (> 32 hours Working hours Shift work Languages German Compensation CHF 125K Job location Tech stack Java Agile Methodologies C Sharp (Programming Language Software Documentation Continuous Integration Data Integration DevOps Software...
-
Senior Site Reliabilty Engineer
vor 2 Wochen
Zürich, Zürich, Schweiz Avaloq Vollzeit CHF 120'000 - CHF 180'000 pro JahrCompany DescriptionFounded and headquartered in Switzerland, Avaloq is continuously expanding its global footprint with around 2,500 colleagues in 12 countries, and more than 170 clients in 35 countries. We are an industry-leading provider of wealth management technology and services for financial institutions around the world, including private banks and...
-
Senior Site Reliabilty Engineer
vor 2 Wochen
Zürich, Zürich, Schweiz Avaloq Vollzeit CHF 120'000 - CHF 180'000 pro JahrCompany Description Founded and headquartered in Switzerland, Avaloq is continuously expanding its global footprint with around 2,500 colleagues in 12 countries, and more than 170 clients in 35 countries. We are an industry-leading provider of wealth management technology and services for financial institutions around the world, including private banks and...
-
Database Reliability Engineer
vor 2 Wochen
Zürich, Zürich, Schweiz Thrive IT Systems Vollzeit CHF 68'000 - CHF 120'000 pro JahrDatabase Reliability EngineerLocation:Zurich, SwitzerlandWork Model:Hybrid (3 days/week on-site)Rate:680 – Open to NegotiateExperience:12+ years overall (with strong Oracle & PostgreSQL expertise)About the RoleAre you passionate about cloud transformation and database reliability?We're looking for an experiencedDatabase Reliability Engineer (DRE)to join...