Lead Data Scientist, Booz Allen Hamilton
Jul 2018 - Present | Arlington, VA
Clean, visualize, and leverage data to empower clients and strengthen decision-making processes. Develop pipelines and apply machine learning techniques to simplify workflows and expand the power of available data.
- Aggregated and analyzed global flight data monthly for the United States Air Force, representing $8 billion in annual defense resource allocation
- Determined and effectively communicated the viability of a $500 million reallocation of defense funding by performing sensitivity and downstream impact analyses, then organizing the findings into a cogent argument in support of the decision
- Developed a pipeline in Python to automate ingestion, cleaning, and visualization of a data feed, reducing process time by 98% and saving three days per month while also increasing accuracy and reliability
- Ported legacy Excel tools and VBA into Python scripts and dashboards, then deployed them to a cloud solution, expanding access and availability for the analysis team
- Derived features in historical Air Force data, performed feature selection, then trained, tested, and selected machine learning models for predictive and descriptive analyses
- Provided subject matter expertise for the development and delivery of data science courses taught to 1500+ federal employees as a key piece of the agency's modernization effort
- Led development of computer vision models spanning classification, object detection, and semantic segmentation using Fast.ai and PyTorch
- Increased accuracy from 73% to 98% for an aircraft image classification model with 20+ disparate categories by training on varied perspectives, skews, and lighting conditions
- Deployed trained computer vision models and designed a web application that allowed users to upload images and interact with the models
- Delivered findings to clients and presented storyboards that explain the insights and best courses of action gleaned from the results
- Facilitated team collaboration and monitored tasking through Jira, using Agile and Scrum methodologies
- Wrote PowerShell/Bash scripts to automate daily tasks and expedite repeated processes
- Maintained GitHub repositories for tools, code, and scripts used by various teams