About Me

Data Scientist at Deloitte

I'm a full-stack data scientist with 4 years of experience spanning data engineering, model development, statistical inference, and production deployment. At Deloitte, I've built ML systems serving federal agencies—from orchestrator agents that automate workflows to XGBoost models for financial auditing.

I work across the entire stack: PySpark and SQL for data pipelines, Python and R for modeling and statistical analysis, Docker/FastAPI for deployment, MLFlow for logging and experimenting models, and recently RAG and agentic AI architectures. Outside of work I enjoy cooking and trying out new recipes.

Databricks

Experience

Deloitte logo

Data Scientist

Deloitte

February 2024 - Present

  • Co-led a team of 5 engineers to develop a RAG agent and independently designed a statistical agent for Deloitte's Agentic AI service
  • Deployed the first Production Grade ML model across financial workflows within the organization, reduced auditors workload
  • Optimized an autoloader for efficient data ingestion and processing, saved $100+ cost in cluster cost
Censeo Consulting Group logo

Associate Data Scientist

Censeo Consulting Group

May 2022 - January 2024

  • Developed fraud detection model identifying $27M in recoverable funds across 10,000 FCC Emergency Connectivity Fund applicants and 900 service providers using a combination of statistical and ML methods
  • Analyzed 150M rows of program data in PySpark to provide evidence supporting continuation of $14.2B broadband subsidy program serving 20M low-income Americans
  • Built synthetic data pipeline using Mistral 7B and Python to generate privacy-preserving datasets, enabling public release of CFPB consumer complaints without exposing PII
Peace Corps logo

Data Analyst Intern

Peace Corps

November 2021 - April 2022

  • Built Tableau dashboard visualizing hiring pipeline attrition rates and diagnosed barriers for underrepresented groups
  • Analyzed attrition data and presented findings to stakeholders informing Peace Corps' 5-year diversity hiring strategy

Skills & Tech Stack

Generative AI & LLMs

LangchainLangGraphOpenAI AgentsRAGPrompt EngineeringFine-tuning (LoRA/DoRA)Vector Databases (Chroma, FAISS, Qdrant)DsPY

ML & Statistical Modeling

PyTorchHugging FaceNLP (Natural Language Processing)Supervised LearningUnsupervised LearningHypothesis TestingA/B TestingSpark MLlibNumPySampling Methods

Data Engineering & Architecture

DatabricksPySparkOracleBigQueryMongoDBETL PipelineInformaticaErwin Data ModelerDbeaver

Cloud & MLOps

AWSAzureDockerFastAPIMLflowDatabricks WorkflowsGradio