Infrastructure Team Lead (f/m/d) - Remote EMEA

Remote | Berlin, Berlin, Germany | Engineering

Job description

Purpose of your role as an Infrastructure Team Lead

As an Infrastructure Team Lead, you will be working alongside engineers and collaborating with other stakeholders, making sure all work related to scaling and securing our product is planned and delivered incrementally and with quality built-in. You will support and coach other engineers, actively enabling them to grow based on their particular interests.

Our tech stack

  • TypeScript, Node.js, React, Golang, PostgreSQL
  • GraphQL, REST
  • Pulumi, AWS, Vercel, Cloudflare, Fastly, New Relic, GitHub Actions

What you'll do:

  • Collaborate closely with stakeholders to prioritize and plan projects that allow our product to scale, as well as tooling to operate it efficiently
  • Work with the team on strategy and execution, delivering testable, maintainable, and high-quality solutions
  • Help engineers identify and grow in their areas of interest by coaching, mentoring, and giving timely feedback
  • Identify opportunities to improve our architecture, monitoring and observability, cost efficiency, on-call practices, and more, then prioritize and act on them
  • Bring creative ideas and expertise to the table, having a real impact on our product and engineering practices
  • Work in an environment that supports your individual growth

What our Infrastructure team does:

  • Improve and maintain our production and testing environments
  • Focus on automation and improvements to our infrastructure
  • Benchmark, scale, and tune applications and databases
  • Improve the observability of our services
  • Drive cost optimizations or reductions
  • Share on-call responsibilities with the engineering team in rotation
  • Work with customers when required to troubleshoot and solve problems

Expectations timeline

1 Month

You will have gone through onboarding sessions covering our product, current architecture, and the relevant services we run in production. You will have learned about the company's origin and current vision, and met colleagues from different departments through onboarding and weekly virtual social events. You will also have started to get to know your teammates, learned how we work day to day, and contributed to our codebase.

3 Months

You will be familiar with most concepts related to our product and will have worked alongside your teammates to improve our infrastructure and troubleshoot operational issues.

You will have had a few 1:1s with your team members to check in on how things are going, and collaborated with fellow engineers to improve our monitoring and observability as well as our internal developer experience. You will also have started to influence how the team works, iterating on our ways of working in the areas where you see room for improvement.

6 Months

You will have made solid contributions to our product and stack, influenced our ways of working, and shared your knowledge and previous experience, contributing substantially to important decisions. You will be working continuously on improving our processes and enabling our product to scale further.

Job requirements

What we expect from you:

  • Experience leading a team, covering people and delivery management
  • Experience mentoring and coaching other teammates to grow and improve continuously
  • Experience with on-call rotations: responding to incidents, putting processes in place, and measuring their effectiveness
  • Strong collaboration and communication skills, both verbal and written, and the ability to take ownership while also asking for help and advice when needed
  • Openness to feedback and willingness to learn, reflect, and grow within the organization
  • Experience in successfully driving technical, business, and people-related initiatives that improved productivity, performance, and quality
  • 8+ years of engineering experience with Infrastructure/DevOps/SRE exposure
  • Experience with one or more cloud computing environments (AWS, GCP, Azure)
  • Experience building and maintaining CI/CD pipelines
  • Experience writing infrastructure as code (Terraform, Pulumi, CloudFormation, etc.)
  • Comfortable writing code following best practices and design patterns when applicable (ideally using Golang or another strongly typed language)
  • Experience working with a distributed service architecture
  • Knowledge of containerized applications (Docker)
  • Mindful about performance and able to measure it meaningfully

Bonus points:

  • Experience with GraphQL, Golang, JS/TS
  • Experience using Pulumi for infrastructure as code
  • Experience with database administration and optimization (PostgreSQL)
  • Experience working with Kubernetes

The Process

  • Intro call with Talent Acquisition
  • Hiring Manager Interview
  • Technical Interview
  • Team Fit call
  • Reference Check and Offer

About us

At Hygraph, we're building the leading GraphQL Federated Content Platform. Our goal is to enable developers and content operators to create, enrich, unify, and deliver content across platforms seamlessly. We are trusted to manage content for teams from over 50,000 organizations like Telenor, Burrow, Gamescom, and Shure. We are backed by over $10M in funding from OpenOcean, Peak, and Paua Ventures. You will be part of a remote-first, globally distributed team of over 60 colleagues committed to working collaboratively, transparently, and passionately.

Hygraph is an equal opportunity employer committed to hiring people with diverse backgrounds. We believe that diversity, unique experiences and qualities, and different cultures make our workplace more productive and promote innovation and creativity.