Agile Defense is seeking a Data Scientist/Engineer to support the design, development, and operational deployment of scalable, AI-enabled data solutions within the Department of Defense's CDAO ADA IR program. This role is part of a multidisciplinary team integrating advanced analytics, machine learning, and engineering practices into mission-critical environments at Combatant Commands and Joint Staff.
You will help shape and deploy data pipelines, pre-processing workflows, feature engineering strategies, and machine learning services within secure, containerized environments. The ideal candidate brings a hybrid of statistical modeling fluency and hands-on software engineering expertise. You will collaborate closely with product managers, full-stack developers, platform engineers, and mission stakeholders to transform raw data into meaningful insights and decision-support tools. This role requires strong technical communication skills, a collaborative mindset, and experience working in agile environments that value reproducibility, testing, and continuous delivery. Familiarity with cloud-based data platforms such as Databricks, Palantir, or AWS-native data services is highly preferred.
Objective 1: Design and Maintain Scalable Data Science Services.
Objective 2: Build and Operationalize AI/ML Solutions
Objective 3: Perform Exploratory Data Analysis and Communicate Insights
Objective 4: Collaborate Across Teams to Deliver Mission Impact
Experience developing and documenting data pipelines for the ingest, transformation, and preparation of data for Artificial Intelligence applications.
Experience designing and implementing scalable technologies such as streaming transformation for joining, analyzing, and processing disparate data sets into features for predictive analytics.
Experience developing API interfaces to facilitate data accessibility.
Experience designing reusable standardized data pipelines.
4+ years of experience in applied data science, machine learning engineering, or data pipeline development.
Proficient in Python, SQL, and distributed data frameworks (e.g., Spark, Databricks, PySpark).
Experience developing ML models from training to deployment using industry-standard tools and libraries (e.g., scikit-learn, TensorFlow, XGBoost, MLflow).
Familiarity with MLOps, API development, and secure cloud-based environments (e.g., AWS, Azure, Palantir Foundry).
Strong understanding of data validation, model testing, and performance evaluation techniques.
Experience with data visualization and storytelling using tools such as Tableau, Plotly, or Matplotlib.
Excellent technical communication skills, with the ability to explain complex concepts to non-technical audiences.
Contractor site with 0%-10% travel possible. Possible off-hours work to support releases and outages. General office environment. Work is generally sedentary in nature, but may require standing and walking for up to 10% of the time. The working environment is generally favorable. Lighting and temperature are adequate, and there are not hazardous or unpleasant conditions caused by noise, dust, etc. Work is generally performed within an office environment, with standard office equipment available.
Sedentary – 10 lbs. Maximum lifting, occasional lift/carry of small articles. Some occasional walking or standing may be required. Jobs are sedentary if walking and standing are required only occasionally, and all other sedentary criteria are met.
Stand or Sit.