Project Portfolio

A collection of professional projects and technical initiatives demonstrating experience across data engineering, cloud infrastructure, and analytics platform development.

UnitedHealth Group (Optum)

Snowflake Performance Optimization Framework

Developed an automated testing framework in Python and Snowpark to analyze query performance and optimize warehouse sizing. This production implementation reduced Snowflake compute costs by 38% while improving query response times.

📊 38% cost reduction

Python, Snowpark, Snowflake, Performance Tuning

Sub-Zero Group

ERP Migration Data Infrastructure

Served as a key data engineering resource for Sub-Zero's enterprise migration from Infor XA to SAP. Designed and implemented dimensional models, ETL pipelines, and analytics infrastructure to support the new system. Used dbt for transformations, Fivetran for data ingestion, and Terraform for infrastructure as code.

📊 Enterprise-wide migration

dbt, Snowflake, Fivetran, SAP, Terraform

UnitedHealth Group (Optum)

HIPAA-Compliant Security Framework

Designed and implemented comprehensive row-level and column-level security patterns in Snowflake for PHI data handling in support of HITRUST certification. Created reusable security patterns enabling consistent implementation across teams.

📊 HITRUST certification

Snowflake, Data Governance, HIPAA, Security

UnitedHealth Group (Optum)

Teradata to Snowflake Migration

Migrated legacy DataStage/Teradata/Unix pipelines to Snowflake cloud architecture. Rebuilt ETL processes using Python and Azure Data Factory, improving system reliability through reduced batch failures.

📊 21% faster pipelines, 81% less manual work (Snowflake optimization)

Snowflake, Teradata, Azure Data Factory, Python, SQL

Boston University

NBA Data Warehouse

Academic capstone project for a Data Warehousing course. Designed and implemented a complete star schema warehouse for NBA game statistics analysis, with fact tables for scoring, dimension tables for players, teams, and coaches, and custom date dimensions for seasonal analysis. Applied dimensional modeling principles later used in professional work.

📊 Academic capstone project

SQL Server, Star Schema, Dimensional Modeling, Tableau

Personal Project

Job Application Tracking Database

Designed and implemented a normalized relational database to track job applications, communications, assessments, and offers. Used stored procedures and sound database design principles to maintain data integrity, building SQL proficiency along the way.

📊 Personal productivity tool

SQL Server, Database Design, Stored Procedures, Normalization

Boston University

Beijing Air Quality Analysis

Master's capstone project analyzing Beijing air quality data and weather patterns to predict pollution levels. Applied machine learning models in R to identify environmental patterns. Firsthand experience living in Beijing provided unique context for this data analysis.

📊 Academic research project

R, Machine Learning, Data Visualization, Statistical Analysis

Sub-Zero Group

Data Quality Monitoring Platform

Implemented comprehensive data quality monitoring using dbt tests and Monte Carlo. Configured automated alerting, data lineage tracking, and validation rules to enable proactive issue detection and resolution before business impact.

📊 Proactive data reliability

dbt, Monte Carlo, Snowflake, Data Quality