At H1, we believe access to the best healthcare information is a basic human right. Our mission is to provide a platform that can optimally inform every doctor interaction globally. This promotes health equity and builds needed trust in healthcare systems. To accomplish this our teams harness the power of data and AI-technology to unlock groundbreaking medical insights and convert those insights into action that result in optimal patient outcomes and accelerates an equitable and inclusive drug development lifecycle.
Data Engineering is responsible for the development and delivery of our most important asset—our data. With thousands of data sources from around the world, the team ensures that data is accurate, normalized, and delivered at a velocity that keeps up with real-world changes. As we expand our markets and the scope of data we provide to our customers, our team must scale to meet that demand.
We're looking for a seasoned Senior Data Engineer who is operating at a high level. You will take ownership of designing and scaling the systems and pipelines that power H1's data platform. You will work cross-functionally with other engineers, product managers, and stakeholders to deliver high-performance, reliable, and maintainable data solutions. This is an opportunity to play a key role in shaping the future of our data infrastructure while mentoring others and driving best practices.
You will:
You are a seasoned data engineer with a track record of building and maintaining large-scale data systems. You're excited by the opportunity to work on complex problems, enjoy collaborative work, and are passionate about building high-quality, performant solutions that impact real-world healthcare outcomes. You have an understanding of Large Language Models (LLMs) and their applications. It's a bonus if you're familiar with model training and fine-tuning, particularly in NLP (Natural Language Processing) contexts. You possess a basic knowledge of network, security, and encryption protocols such as HTTP/HTTPS/TLS. You're able to work collaboratively across teams and communicate effectively with both technical and non-technical stakeholders. You have strong analytical and problem-solving skills with a focus on data quality and performance optimization. You have a passion for writing clean, efficient code and following best practices.
- 5+ years professional experience in data engineering or software engineering, working with large-scale data systems and pipelines.
- Strong proficiency in Python.
- Proficiency in web scraping strategies and technologies: curl, network analysis, proxies and selenium/playwright.
- Strong SQL skills and experience with PostgreSQL.
- Experience with big data tools like Apache Spark, particularly on cloud platforms, with a preference for AWS EMR.
- Experience with Docker or other containerization technologies.
Compensation: This role pays $135,000 to $160,000 per year, based on experience, in addition to stock options.
Anticipated role close date: 4/28/2026
H1 offers:
H1 is proud to be an equal opportunity employer that celebrates diversity and is committed to creating an inclusive workplace with equal opportunity for all applicants and teammates. Our goal is to recruit the most talented people from a diverse candidate pool regardless of race, color, ancestry, national origin, religion, disability, sex (including pregnancy), age, gender, gender identity, sexual orientation, marital status, veteran status, or any other characteristic protected by law. H1 is committed to working with and providing access and reasonable accommodation to applicants with mental and/or physical disabilities. If you require an accommodation, please reach out to your recruiter once you've begun the interview process. All requests for accommodations are treated discreetly and confidentially, as practical and permitted by law.