Summary
Lead Data Engineer with expertise in Python, SQL, cloud platforms, and machine learning. Experienced in building and optimizing data processing pipelines, operationalizing ML models, and creating customer-facing dashboards. Strong background in mathematics, including a PhD in ergodic theory and dynamical systems.
Contact
- Email: [email protected]
- GitHub: sethchart
- LinkedIn: sethchart
- Location: Towson, MD
Employment
Lead Data Engineer — Capital One
June 2025 – Present
- Led migration of streaming data pipelines from legacy infrastructure to a new internal ingestion platform, updating producer applications written in Spring Boot and Python to target new ingestion APIs in place of legacy SDKs.
- Provided DevOps support for an agentic AI application facilitating user onboarding to an internal metadata platform, including containerizing applications and configuring Kubernetes clusters.
- Built an optimized data profiling workflow on Spark in Databricks and led a team developing a UI for running profiling jobs and evaluating raw data ahead of ingestion into systems of record. This work supported onboarding Discover Financial data following Capital One’s acquisition of Discover.
Data Engineer — Kollective Technology
April 2022 – June 2025
- Designed a forecasting and anomaly detection pipeline using the Prophet framework, with GitHub-based CI/CD workflows.
- Developed and optimized PySpark streaming data processing pipelines, achieving a 24x reduction in processing time in one case.
- Created customer-facing Looker dashboards to visualize diagnostic data, providing a key differentiator for the sales team.
Data Scientist — Trusted Media Brands
April 2021 – April 2022
- Built and maintained batch data processing pipelines using BigQuery, Dataform, and Apache Airflow.
- Designed internal business reporting dashboards using Google Data Studio and Looker.
Database Manager — Immune Deficiency Foundation
May 2019 – April 2021
- Developed a Python script for geographical partitioning of constituent communications, enabling targeted location-based fundraising.
- Played a key role in database integration projects, ensuring consistent data flow between systems.
Lecturer — Towson University
August 2016 – May 2019
- Taught undergraduate and graduate mathematics courses.
- Administered a pilot peer tutoring program for undergraduate statistics students.
- Received a Project NExT teaching fellowship.
Education
PhD, Mathematics — University of Victoria
2011 – 2016 · Ergodic Theory and Dynamical Systems
MS, Mathematics — Montana State University
2009 – 2011
BS, Mathematics — Montana State University
2005 – 2009
Skills
| Category | Tools |
|---|---|
| Languages | Python, Java, R, MATLAB, SQL |
| Frameworks | Spring Boot, React |
| Data Processing | PySpark, PyFlink, Pandas |
| ML / AI | Scikit-learn, Prophet, XGBoost, LangChain |
| Data Stores | Delta Lake, BigQuery, Citus, MySQL, Databricks |
| BI & Dashboards | Looker, Google Data Studio |
| Orchestration | Apache Airflow, Dataform, cron |
| Cloud | Microsoft Azure, Google Cloud Platform |
| DevOps | Git, GitHub, GitLab, Kubernetes |
| AI Tools | Claude Code, Windsurf |