Site Reliability Engineer
2 weeks ago
About the company
At Jobtome (https://weare.jobtome.com/), we are building a modern, cloud-native recruitment and marketing platform used at scale across multiple countries and brands.
Our systems power high-traffic job distribution, integrations with external partners, and real-time data pipelines, with a strong focus on reliability, observability, and automation.
Engineering is a core function of the company: we value ownership, pragmatic decision-making, and long-term technical excellence over short-term fixes.

The role
As a Senior Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our production systems.
You will work closely with Backend, Frontend, and Product teams to:
  design resilient architectures
  define reliability standards
  improve observability and incident response
  reduce operational toil through automation
This is not a pure ops role: you will contribute to codebases, collaborate on system design, and help evolve our engineering culture toward SRE best practices.

What you will do
  Design, implement, and maintain reliable and scalable cloud infrastructure
  Define and evolve SLIs, SLOs, and error budgets
  Improve monitoring, alerting, and observability across services
  Lead and participate in incident response, post-mortems, and root-cause analysis
  Automate repetitive operational tasks to reduce toil
  Collaborate with Backend engineers on service design, scalability, and failure modes
  Improve CI/CD pipelines, deployment strategies, and release safety
  Contribute to infrastructure as code and platform tooling
  Act as a reliability advocate across the engineering organization

Tech stack
  Cloud: Google Cloud Platform (preferred), AWS
  Containers & orchestration: Docker, Kubernetes (GKE)
  Infrastructure as Code: Terraform
  CI/CD: GitLab CI/CD
  Observability: Cloud Monitoring, Logging, Prometheus, Grafana
  Languages: Go, Python, Bash
  Networking & security: IAM, VPCs, service accounts, secrets management

What we expect from a senior SRE
  Strong experience running production systems at scale
  Solid understanding of distributed systems and failure modes
  Proven experience with SLO-driven reliability
  Strong coding skills
  Cloud infrastructure automation experience
  Ability to debug complex cross-system issues
  Ownership mindset and strong communication skills
  Pragmatic approach to reliability, speed, and cost trade-offs

Working model
  Flexible working hours
  Remote-friendly setup
  Small autonomous teams
  Direct collaboration with product and leadership
-
Site Reliability Engineer
2 weeks ago
Mendrisio, Switzerland; Jobtome; Full-time
About the company: At Jobtome, we are building a modern, cloud-native recruitment and marketing platform used at scale across multiple countries and brands. Our systems power high-traffic job distribution, integrations with external partners, and real-time data pipelines, with a strong focus on reliability, observability, and automation. Engineering is a core...
-
Cloud Architect
2 weeks ago
Mendrisio, Switzerland; Red Hat (Switzerland) SARL; Full-time
The Red Hat Consulting Services team is looking for an Architect to join us in Zurich, Switzerland. In this role, you will work at the customer site as a subject-matter expert in Red Hat infrastructure and cloud technology. You'll answer questions about best practices around reliability, scalability, maintainability, high availability, and failover setups,...