This role is responsible for data collection procedures, including accurate and relevant data for machine learning models, extracting and analyzing data from the primary and secondary database. The role conceptualizes, designs and develops analytics models in addressing complex business problems, discovering insights and identifying opportunities that are of value to internal and external business stakeholders, typically using a hypothesis-driven approach. The role codes, tests, debugs and documents complex programs, and enhances existing programs to ensure that data processing production systems continue to meet user requirements.
Codes limited enhancements, updates, and programming changes for portions and subsystems of data pipelines, repositories or models for structured/unstructured data.
Analyzes design and determines coding, programming, and integration activities required based on objectives and guidance from senior project team members.
Executes established portions of testing plans, protocols, and documentation for assigned portion of application; identifies and debugs issues with code and suggests changes or improvements.
Participates as a member of a project team of other data science professionals to develop reliable, cost effective and high-quality solutions for assigned data system, model, or component.
Identifies complex areas to solve new technical problems and provides innovative technical solutions within data science and machine learning.
Codes, tests, debugs and documents simple programs, and enhances existing programs to ensure that data processing production systems continue to meet user requirements.
Identifies opportunities and supports the development of automated solutions that will enhance the quality of enterprise data.
Executes data profiling and preventative procedures to improve data quality; uses technology to extract and analyze raw data.
Translates data into information and insights with clear scenario analysis and business impact and execution plan to drive impact.
Models and frames business scenarios that are meaningful and impact critical business processes and/or decisions.
Recommended: Four-year degree in computer science, information technology, software engineering, statistics/mathematics, or any other related discipline or commensurate work experience or demonstrated competence. Typically has 0-2 years of work experience, preferably in data analytics, data engineering, data modeling, or a related field.
Preferred Certifications: Programming Language/s Certification (SQL, Python, or similar)
Knowledge & Skills: Agile Methodology, Amazon Web Services, Apache Hadoop, Apache Kafka, Apache Spark, Big Data, Computer Science, Data Analysis, Data Engineering, Data Modeling, Data Pipelines, Data Warehousing, Extract Transform Load (ETL), Java (Programming Language), Machine Learning, Microsoft Azure, Python (Programming Language), Scala (Programming Language), Scalability, SQL (Programming Language)
Cross-Org Skills: Effective Communication, Results Orientation, Learning Agility, Digital Fluency, Customer Centricity
Impact & Scope: Impacts own work and acts as a team member by providing information, analysis, and recommendations in support of team efforts.
Complexity: Learns to apply basic theories and concepts to work tasks.
Equal Opportunity Employer (EEO) - HP, Inc. provides equal employment opportunity to all employees and prospective employees, without regard to race, color, religion, sex, national origin, ancestry, citizenship, sexual orientation, age, disability, or status as a protected veteran, marital status, familial status, physical or mental disability, medical condition, pregnancy, genetic predisposition or carrier status, uniformed service status, political affiliation or any other characteristic protected by applicable national, federal, state, and local law(s).
For more information, review HP's EEO Policy or read about your rights as an applicant under the law here: Know Your Rights: Workplace Discrimination is Illegal