Site Reliability Engineer

Vor 5 Tagen


Zürich, Zürich, Schweiz DFINITY Vollzeit
About the Role

We are seeking a highly skilled Site Reliability Engineer to join our team at DFINITY. As a key member of our infrastructure team, you will be responsible for designing, building, and maintaining the high availability and reliability of our products and services.

Key Responsibilities
  • Service Management: Develop and implement service management strategies to ensure the stability and performance of our infrastructure.
  • Automation: Identify opportunities to automate processes and implement solutions to enhance efficiency and reduce manual intervention.
  • Reliability and Operability: Collaborate with cross-functional teams to integrate reliability and operability into our products and services.
  • Collaboration: Work closely with engineering and security teams to establish processes that align with our goals and objectives.
  • Service Level Objectives (SLOs): Collaborate with product owners to define SLOs and implement them in code and observability infrastructure.
  • On-Call Duty: Participate in on-call duties for production services, responding to incidents and ensuring timely resolution.
  • Unix Systems: Operate, troubleshoot, and deploy software on Unix systems.
Requirements
  • Observability: Proven experience in monitoring and maintaining large production systems using tools such as Prometheus, Victoria Metrics, Elastic Search, and Grafana.
  • Kubernetes: Proficiency in managing multiple observability stacks across various availability zones, leveraging Kubernetes for deployment orchestration.
  • Rust Coding: Extensive experience in designing and developing moderate-sized applications in Rust, with a focus on reliability and operability.
  • Systemic Thinking: Ability to approach problems methodically and systemically, especially during troubleshooting.
  • Pragmatism: Ability to balance immediate needs with long-term goals, understanding when a solution is "good enough for the next 12 months."
  • Incident Response: Expertise in coordinating incident response across multiple teams, with excellent communication skills.
  • Reliability Engineering: Experience in Site Reliability Engineering within a crypto environment, with a focus on reliability and operability.
  • Security Background: Experience in building security-sensitive tools and managing security risks in crypto environments.
  • Community interaction: Proven experience in engaging with community members of large open-source projects.
About DFINITY

DFINITY is a leading contributor to the Internet Computer Protocol (ICP), with a mission to bring the world's compute onto the secure ICP network. Our technology enables the development and operation of unstoppable, tamper-proof, fully decentralized web applications.

We are a team of over 250 talented individuals, including world-renowned cryptographers, distributed systems engineers, programming language experts, and industry leaders, who are shaping the future of the internet and web3.

We are an equal opportunities employer and welcome applications from all qualified candidates.



  • Zürich, Zürich, Schweiz DFINITY Foundation Vollzeit

    {"h1": "Site Reliability Engineer at DFINITY Foundation", "p": "We are seeking a skilled Site Reliability Engineer to join our team at DFINITY Foundation. As a Site Reliability Engineer, you will play a critical role in ensuring the stability and scalability of our Internet Computer Protocol (ICP).", "ul": [{"li": "Design, build, deploy, and maintain...


  • Zürich, Zürich, Schweiz dfinity Vollzeit

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at DFINITY. As a key member of our engineering team, you will play a critical role in ensuring the stability and reliability of our Internet Computer platform.Key ResponsibilitiesService Management: Design, build, deploy, and maintain services to ensure high availability...


  • Zürich, Zürich, Schweiz Open Systems AG Vollzeit

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Open Systems AG. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our software systems.Key ResponsibilitiesDevelop and maintain automation frameworks to reduce manual intervention and improve system...


  • Zürich, Zürich, Schweiz Inventx AG Vollzeit

    JobbeschreibungAls Site Reliability Engineer bei Inventx AG bist du verantwortlich für den Betrieb und die Optimierung von geschäftskritischen Anwendungen in der Finanz- und Versicherungsindustrie.Deine AufgabenDer Betrieb von Windows-basierten Systemen, einschließlich 2nd-Level-Support und Sicherstellung der Verfügbarkeit.Die Analyse von Störungen und...


  • Zürich, Zürich, Schweiz DFINITY Foundation Vollzeit

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at the DFINITY Foundation. As a key member of our engineering team, you will play a critical role in ensuring the stability and reliability of our Internet Computer platform.Key ResponsibilitiesService Management: Design, build, deploy, and maintain services to ensure...


  • Zürich, Zürich, Schweiz Tamedia Publikationen Deutschschweiz AG Vollzeit

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Tamedia Publikationen Deutschschweiz AG. As a key member of our Platform team, you will play a crucial role in shaping the digital future of our company.About the RoleCollaborate with our development teams to ensure the smooth operation of our publishing platform,...


  • Zürich, Zürich, Schweiz Ergon Informatik AG Vollzeit

    ÜberblickAls Site Reliability Engineer (SRE) bei Ergon Informatik AG bist du für den Aufbau und Betrieb der Airlock SaaS-Plattform verantwortlich. Du arbeitest in interdisziplinären Teams und meisterst agile Vorgehensmethoden wie Scrum. Deine Erfahrung in der Umsetzung anspruchsvoller Projekte und deine Leidenschaft für SRE- und DevOps-Kultur, -Methoden...


  • Zürich, Zürich, Schweiz Nexxiot Vollzeit

    About NexxiotNexxiot is a leading TradeTech company that specializes in hardware-enabled data solutions, aiming to reduce uncertainty in cargo transportation. With a vision to digitize the global supply chain, Nexxiot operates the largest digital fleet of rail cars and intermodal containers, with a growth plan to quadruple its digitized assets by 2027. The...


  • Zürich, Zürich, Schweiz dfinity Vollzeit

    {"h1": "Reliability and Operations Specialist", "p": "At DFINITY, we're building a world computer that's infinitely scalable and secure. As a Reliability and Operations Specialist, you'll play a critical role in ensuring the stability and reliability of our infrastructure. You'll work closely with our engineering and security teams to design, build, and...


  • Zürich, Zürich, Schweiz Ergon Informatik AG Vollzeit

    Beschreibung der RolleAls Site Reliability Engineer (SRE) bei Ergon Informatik AG bist du für den Aufbau und Betrieb der Airlock SaaS-Plattform verantwortlich. Du arbeitest in interdisziplinären Teams und meisterst agile Vorgehensmethoden wie Scrum. Deine Erfahrung in der Umsetzung anspruchsvoller Projekte und deine Leidenschaft für SRE- und...


  • Zürich, Zürich, Schweiz Nexxiot Vollzeit

    About NexxiotNexxiot is a leading TradeTech company that specializes in hardware-enabled data solutions. Our mission is to reduce uncertainty in cargo transportation by providing real-time monitoring and analytics of assets and cargo. We operate the largest digital global fleet of rail cars and intermodal containers, with a vision to quadruple our digitized...


  • Zürich, Zürich, Schweiz Ergon Informatik Vollzeit

    Beschreibung der PositionWir suchen einen erfahrenen Site Reliability Engineer, der sich auf die Entwicklung und den Betrieb von Cloud-basierten Plattformen spezialisiert hat. Der Kandidat wird Teil eines interdisziplinären Teams sein, das sich auf die Entwicklung von Sicherheitslösungen und die Optimierung von Betriebsprozessen...


  • Zürich, Zürich, Schweiz Ergon Informatik AG Vollzeit

    ÜberblickAls erfahrener Site Reliability Engineer (SRE) bei Ergon Informatik AG bist du für den Aufbau und Betrieb der Airlock SaaS-Plattform verantwortlich. Du arbeitest in interdisziplinären Teams und meisterst agile Vorgehensmethoden wie Scrum. Deine Erfahrung in der Umsetzung anspruchsvoller Projekte und deine Leidenschaft für SRE- und DevOps-Kultur,...


  • Zürich, Zürich, Schweiz IO Associates Vollzeit

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our dynamic Publishing Technology Solutions team at IO Associates. As a key member of our team, you will play a pivotal role in developing and maintaining a cutting-edge publishing platform that streamlines development and sparks innovation.Key ResponsibilitiesCollaborate with...


  • Zürich, Zürich, Schweiz UBS Vollzeit

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Data Platform Service team in Zurich, Switzerland. As a key member of our Site Reliability Engineering Team, you will be responsible for administering critical infrastructure for data platforms, implementing migrations, upgrades, and patches, and ensuring 'lights-on' support...


  • Zürich, Zürich, Schweiz Tamedia Vollzeit

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Tamedia. As a key member of our Publishing Technology Solutions group, you will play a crucial role in shaping the digital future of our company.Key ResponsibilitiesDesign, build, and maintain core infrastructure pieces to support our publishing solutions, ensuring...


  • Zürich, Zürich, Schweiz Supertext Vollzeit

    Über SupertextSupertext ist ein international erfolgreicher AI-Sprachdienstleister mit Standorten in Zürich und Berlin. Wir entwickeln innovative, massgeschneiderte und sichere Language-AI-Lösungen – und lassen sie zusammen mit erfahrenen Sprachprofis zur Höchstform auflaufen.Unser TeamUnser Team aus 120 Expert*innen in AI Research, Software,...


  • Zürich, Zürich, Schweiz DFINITY Foundation Vollzeit

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at the DFINITY Foundation. As a key member of our engineering team, you will play a critical role in ensuring the stability and reliability of our Internet Computer platform.Key ResponsibilitiesService Management: Design, build, deploy, and maintain services to ensure...

  • Reliability Engineer

    Vor 6 Tagen


    Zürich, Zürich, Schweiz ETH get hired Vollzeit

    About ETH get hiredETH get hired is a leading video game development company that specializes in creating immersive and engaging simulation and management games. Our team is passionate about pushing the boundaries of innovation and excellence in the gaming industry.About the RoleWe are seeking a highly skilled and experienced Site Reliability Engineer to...


  • Zürich, Zürich, Schweiz UBS Vollzeit

    About the RoleWe are seeking a highly skilled Data Protection Site Reliability Engineer to join our team at UBS. As a key member of our Data Confidentiality Protection operational team, you will be responsible for ensuring the operational reliability, stability, and availability of our Data Protection services.Key ResponsibilitiesRelease management of a vast...