✨ About The Role
- As a Data Engineer at H1, you will contribute to the development, optimization, and scaling of data pipelines and infrastructure.
- The role involves designing, developing, and maintaining scalable data extraction frameworks from diverse sources.
- You will continuously improve the efficiency and reliability of data collection, extraction, and normalization processes.
- Working with large datasets, you will transform and process structured and unstructured data for downstream use.
- The position requires building and maintaining efficient, reliable data pipelines and ETL processes using big data tools such as Spark.
- Collaboration with senior engineers to improve data architecture and infrastructure is a key responsibility.
- You will support data integration efforts from multiple sources, ensuring consistency and accuracy.
- Troubleshooting data issues, optimizing queries, and improving data retrieval performance are part of the job.
- Documenting data processes and workflows to ensure transparency and repeatability is expected.
⚡ Requirements
- The ideal candidate has solid technical skills in data engineering and a passion for building efficient, scalable solutions.
- They thrive in a collaborative environment and enjoy learning from experienced team members.
- The candidate should be eager to take on increasing responsibility as they grow in the role.
- A basic understanding of Large Language Models (LLMs) and their applications is essential.
- Familiarity with model training and fine-tuning, particularly in NLP contexts, is a bonus.
- Strong analytical and problem-solving skills with a focus on data quality and performance optimization are crucial.
- The candidate should possess a passion for writing clean, efficient code and following best practices.