View All Jobs 157570

Senior Infrastructure Engineer

Manage and optimize global bare-metal datacenter infrastructure for performance and security
London
Senior
yesterday
Talos

Talos

A cybersecurity firm specializing in threat intelligence and advanced analysis to protect against global cyber threats.

Talos: Institutional Fabric For Digital Asset Markets

Founded in 2018, Talos provides institutional-grade trading technology for the global digital asset market, powering many of the major players in the crypto ecosystem. Our mission is clear: to advance the mass adoption of digital assets by seamlessly connecting institutions to the digital asset ecosystem. We are committed to building the most innovative and trusted platform in the world, supporting the entire trading lifecycle.

At Talos, you'll find an environment that champions kindness and respect, values diverse perspectives, and upholds inclusivity at every turn. We believe every member of our team brings invaluable insights and abilities that drive Talos forward. In our pursuit of excellence, we foster a culture of trust, integrity, collaboration, and mutual growth.

We are a tight-knit yet globally distributed team of highly experienced engineers and business leaders, with hubs in New York, London, Singapore, and Cyprus. You'll be part of a hybrid-friendly environment where your unique talents and insights will play a crucial role in building something extraordinary.

Infrastructure Engineer

At Talos, our Infrastructure Engineers are responsible for the hardware and software systems that underpin the secure, efficient, and reliable operation of our products. We run a bare-metal, global infrastructure footprint with datacenters in Paris, NYC, and Dallas, and partner closely with engineering teams to keep our environments performant, scalable, and resilient.

This is a hands-on role covering datacenter operations, Linux systems, Kubernetes clusters, networking, and CI/CD pipelines. You'll have the opportunity to work across the full stack, from low-level infrastructure to developer tooling, and play a critical role in ensuring our products and engineering teams run at scale.

You'll join a small, highly skilled team with significant responsibility and autonomy, working in an environment where infrastructure excellence directly impacts our clients and product teams.

Responsibilities

  • Infrastructure operations: Build, maintain, and troubleshoot global bare-metal infrastructure (50+ servers per datacenter).
  • Datacenter support: Collaborate with vendors to manage system stability, tune performance, troubleshoot problems and upgrade capacity.
  • Kubernetes administration: Manage cluster upgrades, patching, testing, and security hardening.
  • Monitoring & incident response: Operate and improve monitoring/alerting with Prometheus, Grafana, and Loki; provide rapid incident response, root-cause analysis, and remediation.
  • CI/CD support: Manage GitLab pipelines, build runners, and related services; step in during incidents impacting developer productivity.
  • Collaboration: Partner with engineering teams on capacity planning, scaling projects, and infrastructure tooling.
  • Automation: Use infrastructure as code (Ansible, Terraform, bash, Python, Docker) to standardize and automate system management.
  • Continuous improvement: Identify gaps and opportunities to improve reliability, efficiency, and tooling.
  • Coverage: Provide infrastructure support during EU hours, contributing to 24/7 team capacity.

Qualifications & Attributes

  • Hands-on with bare-metal datacenter design, planning, implementation, and support.
  • Strong background in Ubuntu Linux administration, with automation using IPMI, Ansible, and Terraform.
  • Solid understanding of IPv4/IPv6 networking, VPNs (Wireguard), DNS, and reverse proxies/load balancers (nginx, tinyproxy, APISIX), plus Cloudflare.
  • Skilled in Kubernetes cluster administration, including tools such as Cilium, Helm, ArgoCD, and Kyverno.
  • Experience with at least one of: Postgres HA/scale (80+ DBs per DC, largest >100TB), Kafka, Minio, or blockchain fullnodes.
  • Familiarity with Prometheus, Loki, and Grafana for monitoring and alerting.
  • Exposure to cloud computing environments (AWS, GCP).
  • Practical experience designing and supporting CI/CD pipelines and build environments.
  • Ability to configure and troubleshoot GitLab CI/CD pipelines and build runners.
  • Proficiency in scripting and automation using bash, Python, and Docker.
+ Show Original Job Post
























Senior Infrastructure Engineer
London
Engineering
About Talos
A cybersecurity firm specializing in threat intelligence and advanced analysis to protect against global cyber threats.