Summary of Role: Work back from the business problems to be solved, collect the right data for analysis, select appropriate machine learning and/or large language modeling approaches, and ultimately roll out the models in a production environment to improve business decisions. Meanwhile, lead the project, coach junior data scientists, and maintain a collaborative relationship with business partners throughout.
• Deeply understand business needs and translate them into technical projects.
• Identify appropriate data sources to use. Perform data aggregations and feature engineering as needed.
• Write Python code to create visualizations and to build and implement models.
• Be flexible and open to innovative ideas and alternative ways of solving problems.
• Present results to non-technical partners and influence them to use predictive models.
• Coach junior associates during projects.
• Master's degree (or higher) in Statistics, Data Science, Mathematics, Economics, or a related analytical discipline.
• At least 5 years' experience building end-to-end models in Python, from development through production.
• At least 2 years' experience in leading projects and coaching junior associates.
• Proficiency in SQL and Python.
• In-depth understanding of statistics and machine learning algorithms. Exposure to large language models is nice to have.
• Specifically, expertise with the following techniques is required for daily work: linear regression and GLM, random forest, GBM, XGBoost, and various segmentation techniques.
• Excellent communication skills.
• Ability to work on large-scale cross-functional projects.
• Ability to lead mid-scale projects and coach junior associates.