Summary

Lead Data Engineer with expertise in Python, SQL, cloud platforms, and machine learning. Experienced in building and optimizing data processing pipelines, operationalizing ML models, and creating customer-facing dashboards. Strong background in mathematics, including a PhD in ergodic theory and dynamical systems.

Contact

Employment

Lead Data Engineer — Capital One

June 2025 – Present

  • Led migration of streaming data pipelines from legacy infrastructure to a new internal ingestion platform, updating producer applications written in Spring Boot and Python to target new ingestion APIs in place of legacy SDKs.
  • Provided DevOps support for an agentic AI application facilitating user onboarding to an internal metadata platform, including containerizing applications and configuring Kubernetes clusters.
  • Built an optimized data profiling workflow on Spark in Databricks and led a team developing a UI for running profiling jobs and evaluating raw data before ingestion into systems of record, supporting the onboarding of Discover Financial data following Capital One’s acquisition of Discover.

Data Engineer — Kollective Technology

April 2022 – June 2025

  • Designed a forecasting and anomaly detection pipeline using the Prophet framework, with GitHub-based CI/CD workflows.
  • Developed and optimized PySpark streaming data processing pipelines, achieving a 24x reduction in processing time in one case.
  • Created customer-facing Looker dashboards to visualize diagnostic data, providing a key differentiator for the sales team.

Data Scientist — Trusted Media Brands

April 2021 – April 2022

  • Built and maintained batch data processing pipelines using BigQuery, Dataform, and Apache Airflow.
  • Designed internal business reporting dashboards using Google Data Studio and Looker.

Database Manager — Immune Deficiency Foundation

May 2019 – April 2021

  • Developed a Python script for geographical partitioning of constituent communications, enabling targeted location-based fundraising.
  • Played a key role in database integration projects, ensuring consistent data flow between systems.

Lecturer — Towson University

August 2016 – May 2019

  • Taught undergraduate and graduate mathematics courses.
  • Administered a pilot peer tutoring program for undergraduate statistics students.
  • Received a Project NExT teaching fellowship.

Education

PhD, Mathematics — University of Victoria

2011 – 2016 · Ergodic Theory and Dynamical Systems

MS, Mathematics — Montana State University

2009 – 2011

BS, Mathematics — Montana State University

2005 – 2009

Skills

Category         Tools
Languages        Python, Java, R, MATLAB, SQL
Frameworks       Spring Boot, React
Data Processing  PySpark, PyFlink, Pandas
ML / AI          Scikit-learn, Prophet, XGBoost, LangChain
Data Stores      Delta Lake, BigQuery, Citus, MySQL, Databricks
BI & Dashboards  Looker, Google Data Studio
Orchestration    Apache Airflow, Dataform, cron
Cloud            Microsoft Azure, Google Cloud Platform
DevOps           Git, GitHub, GitLab, Kubernetes
AI Tools         Claude Code, Windsurf