Are you passionate about building robust Python applications and leveraging big data technologies to solve complex business challenges? Do you thrive in environments where your technical expertise drives meaningful impact? We are seeking an experienced Python Lead Developer to join our dynamic LSH Data Platform Team. You will play a pivotal role in designing, developing, and maintaining Python-based data processing applications and services that support scalable big data solutions. If you're excited about combining deep Python skills with cutting-edge technologies like Apache Spark, PySpark, GraphDB, SQL, and relational databases, we want to hear from you.
You will be building core Python services for the C3 Data Platform, designed to power data delivery and deliver intelligent insights across life sciences and healthcare. These services, built with FastAPI, will expose data from Databricks and graph databases, enabling seamless integration and consumption by downstream applications. Your work will also include designing and optimizing Spark/Databricks ETL pipelines written in PySpark to ensure high performance and scalability. You will contribute to building a knowledge graph that interconnects data across multiple healthcare and life science domains, forming the backbone for advanced data exploration and relationship discovery. Additionally, you will have the opportunity to work on the backend of an AI assistant that leverages LLMs and machine learning algorithms to expose platform data to customers, extract valuable insights, and connect information across different domains.
You'll be joining the C3 Data Platform Team, a passionate and collaborative group that thrives in a startup-like environment within a larger organization. We move fast, embrace new ideas, and value creativity and initiative. Our mission is to build a powerful, scalable data platform that transforms how life science and healthcare data is processed, connected, and delivered. The team works with modern technologies including Spark, Databricks, Elasticsearch, Python, and graph databases, combining them with machine learning and AI to solve real-world challenges. We believe in open communication, continuous learning, and giving engineers the autonomy to shape solutions that have a direct impact on our products and customers.
This is a full-time, hybrid role, requiring presence in the office 2-3 days per week, with flexibility to accommodate collaboration across global teams and time zones.
At Clarivate, we are committed to providing equal employment opportunities for all qualified persons with respect to hiring, compensation, promotion, training, and other terms, conditions, and privileges of employment. We comply with applicable laws and regulations governing non-discrimination in all locations.