Site Reliability Engineer

vor 2 Wochen

Mendrisio, Schweiz Jobtome Vollzeit

About the companyAt Jobtome - https://weare.jobtome.com/- we are building a modern, cloud-native recruitment and marketing platform used at scale across multiple countries and brands.Our systems power high-traffic job distribution, integrations with external partners, and real-time data pipelines, with a strong focus on reliability, observability, and automation.Engineering is a core function of the company: we value ownership, pragmatic decision-making, and long-term technical excellence over short-term fixes.The roleAs a Senior Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our production systems.You will work closely with Backend, Frontend, and Product teams to:design resilient architecturesdefine reliability standardsimprove observability and incident responsereduce operational toil through automationThis is not a pure ops role: you will contribute to codebases, collaborate on system design, and help evolve our engineering culture toward SRE best practices.What you will doDesign, implement, and maintain reliable and scalable cloud infrastructureDefine and evolve SLIs, SLOs, and error budgetsImprove monitoring, alerting, and observability across servicesLead and participate in incident response, post-mortems, and root-cause analysisAutomate repetitive operational tasks to reduce toilCollaborate with Backend engineers on service design, scalability, and failure modesImprove CI/CD pipelines, deployment strategies, and release safetyContribute to infrastructure as code and platform toolingAct as a reliability advocate across the engineering organizationTech stackCloud: Google Cloud Platform (preferred), AWSContainers & orchestration: Docker, Kubernetes (GKE)Infrastructure as Code: TerraformCI/CD: GitLab CI/CDObservability: Cloud Monitoring, Logging, Prometheus, GrafanaLanguages: Go, Python, BashNetworking & security: IAM, VPCs, service accounts, secrets managementWhat we expect from a senior SREStrong experience running production systems at scaleSolid understanding of distributed systems and failure modesProven experience with SLO-driven reliabilityStrong coding skillsCloud infrastructure automation experienceAbility to debug complex cross-system issuesOwnership mindset and strong communication skillsPragmatic approach to reliability, speed, and cost trade-offsWorking modelFlexible working hoursRemote-friendly setupSmall autonomous teamsDirect collaboration with product and leadership

Site Reliability Engineer

vor 2 Wochen

Mendrisio, Schweiz Jobtome Vollzeit

About the company At Jobtome - we are building a modern, cloud-native recruitment and marketing platform used at scale across multiple countries and brands.Our systems power high-traffic job distribution, integrations with external partners, and real-time data pipelines, with a strong focus on reliability, observability, and automation. Engineering is a core...
Cloud Architect

vor 2 Wochen

Mendrisio, Schweiz Red Hat (Switzerland) SARL Vollzeit

The Red Hat Consulting Services team is looking for an Architect to join us in Zurich, Switzerland. In this role, you will work at the customer site as a subject-matter expert in Red Hat infrastructure and cloud technology. You'll answer questions about best practices around reliability, scalability, maintainability, high availability, and failover setups,...

Amerika

Europa

Asien / Ozeanien

Afrika

Site Reliability Engineer

Site Reliability Engineer

Cloud Architect