Site Reliability Engineer, Zürich

vor 3 Wochen


Zürich, Schweiz TN Switzerland Vollzeit

The SRE team at DFINITY is charged with creating tools, processes, and frameworks that ensure the stability of the Internet Computer, which is distributed and scalable. As a member of the team you will work with engineering, infrastructure, and security teams to bake reliability and operability into the product from the start, by participating in design and code reviews, identifying risks, problems, and mitigations. This is not a team that exists to be on-call; this is a team that elects to be on-call because it helps do the job better.

Responsibilities

  • Service Management: Design, build, deploy, and maintain services to ensure the high availability and reliability of DFINITY's products and the Internet Computer Protocol (ICP).
  • Automation: Identify and implement opportunities to automate processes through coding, enhancing efficiency and reducing manual intervention.
  • Reliability and Operability: Integrate reliability and operability into the product from the start by participating in design and code reviews, identifying risks, and proposing mitigations.
  • Collaboration: Work with engineering and security teams to establish processes that align with the goals of the Internet Computer while remaining operationally feasible and automatable.
  • Service Level Objectives (SLOs): Collaborate with product owners to define SLOs and implement them in code and observability infrastructure.
  • On-Call Duty: Participate in on-call duties for production services on a 12/7 schedule, split across two sites. On-call duty is approximately 1 week every 6 weeks. Coordinate incident response and ensure resolution, involving engineers from other teams as necessary. On-call work is compensated with a monetary and a time off compensation.
  • On-Call Philosophy: Our team chooses to be on-call because it enhances our ability to identify and address system alerts, ultimately improving performance.
  • Unix Systems: Operate, troubleshoot, and deploy software on Unix systems.

Requirements:

  • Observability: Proven experience in monitoring and maintaining large production systems using tools such as Prometheus, Victoria Metrics, Elastic Search, and Grafana.
  • Kubernetes: Proficiency in managing multiple observability stacks across various availability zones, leveraging Kubernetes for deployment orchestration.
  • Rust Coding: Extensive experience in designing and developing moderate-sized applications (up to ~10K lines of code) in Rust. Skilled in setting up automated testing and CI/CD environments. Ability to identify and implement opportunities for automation and process improvement. Experience in developing reliability engineering tools for large open-source projects is highly desirable.
  • Systemic Thinking: Capable of approaching problems methodically and systemically, especially during troubleshooting.
  • Pragmatism: Ability to balance immediate needs with long-term goals, understanding when a solution is 'good enough for the next 12 months.'
  • Incident Response: Expertise in coordinating incident response across multiple teams, with excellent communication skills to clearly understand the situation, next steps, and team responsibilities.
  • Reliability Engineering: Preferable experience in Site Reliability Engineering (SRE) within a crypto environment where decisions are governed by DAOs.
  • Security Background: Experience in building security-sensitive tools and managing security risks in such environments. A background in DevSecOps is highly desirable.
  • Community Interaction: Proven experience in engaging with community members of large open-source projects. Ideally, the candidate is already active within the ICP community.

Within 1 month, you will:

  • Gain a thorough understanding of DFINITY's infrastructure and production environment.
  • Start working on a suitable starter project.
  • Submit improvements to our documentation and processes based on your onboarding experience.

Within 3 months, you will:

  • Successfully deliver your starter project.
  • Shadow team members on-call, preparing to join the on-call rotation from month 4 onwards.
  • Proactively identify and propose improvements, initiating projects to implement them.

About DFINITY and the Internet Computer:

The Internet Computer is the fastest and only infinitely scalable general-purpose blockchain — incubated and launched by the DFINITY Foundation in May 2021. A team of over 200 world-renowned cryptographers, distributed systems engineers, and programming language experts have taken on the massive technological challenge of building, maintaining, and continuously improving a ‘world computer’ powerful enough to host Web3 dApps, DeFi, games, NFTs, social media, and metaverse projects.

DFINITY was founded in 2016 by entrepreneur and crypto theoretician, Dominic Williams, and attracted interest and financial contributions from early members of the Ethereum community. Later, top-tier institutions such as Andreessen Horowitz, Polychain Capital, and SV Angel backed the Internet Computer in a collective effort to help build out Web3.

All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

#J-18808-Ljbffr

  • Zürich, Schweiz Google Inc. Vollzeit

    Job RequirementsBachelor’s degree in Computer Science, a related field, or equivalent practical experience.1 year of experience with data structures/algorithms and software development in one or more programming languages.Databases Site Reliability EngineerBachelor's degree or equivalent practical experience.2 years of experience with programming in one or...


  • Zürich, Schweiz TN Switzerland Vollzeit

    Site Reliability Engineer (m/w/d), ZürichZürich, SwitzerlandSite Reliability Engineer (m/w/d)Freiberuflich/in temporärer Festanstellung für ein Projekt in ZürichStartdatum: sofortReferenznummer: 772499/1AufgabenSicherstellung der Systemverfügbarkeit / Monitoring / Incident ResponseUnterstützung des Entwicklerteams bei der Automatisierung, Architektur...

  • DevOps Engineer

    vor 3 Wochen


    Zürich, Schweiz TN Switzerland Vollzeit

    DevOps Engineer & Site Reliability Engineer Architect/Consultant, ZürichDigital Architects ZurichZürich, SwitzerlandWer wir sindWir sind spezialisiert darauf, Transformations-Projekte für Cloud-native und AI-driven Continuous Delivery (CI/CD) sowie Observability & AIOps umzusetzen. Dabei wird Site Reliability Engineering (SRE) als neue DevOps-Disziplin...

  • DevOps Engineer

    Vor 3 Tagen


    Zürich, Schweiz TN Switzerland Vollzeit

    DevOps Engineer & Site Reliability Engineer Architect/Consultant, Zürich Digital Architects Zurich Zürich, Switzerland Wer wir sind Wir sind spezialisiert darauf, Transformations-Projekte für Cloud-native und AI-driven Continuous Delivery (CI/CD) sowie Observability & AIOps umzusetzen. Dabei wird Site Reliability Engineering (SRE) als neue...


  • Zürich ZH, Schweiz Inventx AG Vollzeit

    Site Reliability Engineer Linux 80 - 100% Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. "2nd-level Support und Sicherstellung der Verfügbarkeit Ausbildung in Informatik/ Wirtschaftsinformatik oder entsprechende Berufserfahrung Bash, Python, etc.) sowie XML/JSON für Konfigurationen ...


  • Zürich Zh, Schweiz Inventx AG Vollzeit

    Site Reliability Engineer Linux 80 - 100% Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. "2nd-level Support und Sicherstellung der Verfügbarkeit Ausbildung in Informatik/ Wirtschaftsinformatik oder entsprechende Berufserfahrung Bash, Python, etc.) sowie XML/JSON für Konfigurationen ...


  • Zürich, Schweiz Inventx AG Vollzeit

    Site Reliability EngineerDu arbeitest in Chur, The Circle/Zürich, St. Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. «Ich finde es mega, dass die Inventx so familiär ist.» Bei Inventx gestaltest du den digitalen Wandel in der Finanz- und Versicherungsindustrie mit. An der Schnittstelle zwischen...


  • Zürich ZH, Schweiz Inventx AG Vollzeit

    Site Reliability Engineer Linux 80 - 100% Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. "2nd-level Support und Sicherstellung der Verfügbarkeit Ausbildung in Informatik/ Wirtschaftsinformatik oder entsprechende Berufserfahrung Bash, Python, etc.) sowie XML/JSON für Konfigurationen ...


  • Zürich, Schweiz Inventx AG Vollzeit

    Du wählst - arbeite an unseren Standorten in Chur, The Circle/Zürich, St. Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen.„Ich finde es mega, dass die Inventx so familiär ist.“Bei Inventx gestaltest du den digitalen Wandel in der Finanz- und Versicherungsindustrie mit. An der Schnittstelle...


  • Zürich, Schweiz DENTSPLY International Vollzeit

    Dentsply Sirona is the world’s largest manufacturer of professional dental products and technologies, with a 130-year history of innovation and service to the dental industry and patients worldwide. Dentsply Sirona develops, manufactures, and markets a comprehensive solutions offering including dental and oral health products as well as other consumable...


  • Zürich, Schweiz Dentsply Sirona Vollzeit

    Requisition ID: 78751Dentsply Sirona is the world’s largest manufacturer of professional dental products and technologies, with a 130-year history of innovation and service to the dental industry and patients worldwide. Dentsply Sirona develops, manufactures, and markets a comprehensive solutions offering including dental and oral health products as well...


  • Zürich, Schweiz Inventx AG Vollzeit

    Site Reliability Engineer Linux 80 - 100% Du wählst - arbeite an unseren Standorten in Chur, The Circle/Zürich, St. Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. "Nach zehn Jahren in der Branche habe ich bei Inventx Technologien gesehen, die ich mir nicht hätte träumen lassen – jeden Tag...


  • Zürich, Schweiz Inventx AG Vollzeit

    Site Reliability Engineer Linux 80 - 100% Du wählst - arbeite an unseren Standorten in Chur, The Circle/Zürich, St. Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. "Nach zehn Jahren in der Branche habe ich bei Inventx Technologien gesehen, die ich mir nicht hätte träumen lassen – jeden Tag...


  • Zürich, Schweiz Inventx AG Vollzeit

    Site Reliability Engineer Linux 80 - 100% Du wählst - arbeite an unseren Standorten in Chur, The Circle/Zürich, St. Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. "Nach zehn Jahren in der Branche habe ich bei Inventx Technologien gesehen, die ich mir nicht hätte träumen lassen – jeden Tag...


  • Zürich Stadt, Schweiz Inventx AG Vollzeit

    Site Reliability Engineer Linux 80 - 100% Du wählst - arbeite an unseren Standorten in Chur, The Circle/Zürich, St. Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. "Nach zehn Jahren in der Branche habe ich bei Inventx Technologien gesehen, die ich mir nicht hätte träumen lassen – jeden Tag...


  • Zürich Zh, Schweiz Inventx AG Vollzeit

    Site Reliability Engineer Linux 80 - 100% Du wählst - arbeite an unseren Standorten in Chur, The Circle/Zürich, St. Gallen, Bern oder im Home-Office, dabei stehen dir attraktive und flexible Voll- und Teilzeitmodelle offen. "Nach zehn Jahren in der Branche habe ich bei Inventx Technologien gesehen, die ich mir nicht hätte träumen lassen – jeden Tag...


  • Zürich, Zürich, Schweiz Google Inc. Vollzeit

    About UsAt Google Inc., we're committed to building a workforce that reflects the diversity of our users. We strive to create an environment that provides the support and mentorship needed to learn and grow, while promoting self-direction to work on meaningful projects.We're looking for a skilled Site Reliability Engineering Expert to join our team. As a...

  • Software Engineer III

    Vor 4 Tagen


    Zürich, Zürich, Schweiz Google Inc. Vollzeit

    About the JobWe are looking for a highly skilled Software Engineer III to join our Site Reliability Engineering team at Google Inc. In this role, you will be responsible for developing software solutions that meet the high standards of reliability, uptime, and performance.As a member of our team, you will have the opportunity to work on complex technical...


  • Zürich, Zürich, Schweiz Google Inc. Vollzeit

    Job DescriptionAs a Site Reliability Engineer, you will play a critical role in ensuring the reliability, uptime, and performance of our large-scale distributed systems.You will work closely with cross-functional teams to design, develop, test, deploy, maintain, and enhance software solutions that meet the needs of our customers.


  • Zürich, Schweiz TN Switzerland Vollzeit

    Social network you want to login/join with:Nexxiot is a TradeTech leader with hardware-enabled data solutions and a Vision to Reduce Uncertainty in Cargo. Nexxiot operates the most significant digital global fleet of around 300’000 Rail cars and 800’000 Intermodal containers in 2023 and follows an ambitious growth plan to quadruple the number of...