
Data Engineer

Architecting the backbone of data-driven success

About Me

I am a data engineer who builds robust, scalable data infrastructure to support data-driven decision-making. As a Research Assistant at the University of Iowa, I engineered data pipelines that processed and analyzed more than 20 million patient records across 80+ healthcare sites, implementing ETL processes and optimizing data workflows. I also worked with CIVCO Medical Solutions to create a data analytics infrastructure, evaluate sales performance, and build Power BI dashboards that revealed actionable business insights.

I focus on building data infrastructure and ensuring data quality so that solutions not only address immediate challenges but also scale to meet future needs, making data operations more efficient and reliable for everyone. I am also interested in using data to find critical insights and solve real-world problems.

When I'm not analyzing datasets, you'll find me looking for good restaurants, enjoying a cup of coffee while listening to music, or playing songs on guitar or piano.

Austin Lee profile photo

Skills

Programming Languages

Python, MySQL, MS SQL, PostgreSQL, R, Java

Tools & Platforms

Git, Tableau, Power BI, Microsoft Azure, Microsoft Office, Palantir Foundry, TriNetX, SSMS, DBeaver

Libraries & Frameworks

Pandas, NumPy, Matplotlib, Scikit-learn, SciPy, PyTorch, Spark

Technical Skills

Machine Learning, Data Visualization, Data Cleaning, Statistical Analysis, ETL Process, A/B Testing

Languages

Korean (Native), English (Fluent), Japanese (Advanced; JLPT N2)

Soft Skills

Problem Solving, Critical Thinking, Communication, Team Collaboration, Time Management, Adaptability

Professional Experience

Associate Data Engineer

Ruan Transportation Management Systems
July 2025 - Present

Research Assistant

University of Iowa
July 2023 - June 2025

  • Participated in 3+ NIH-funded healthcare studies utilizing Palantir to analyze N3C data
  • Transformed 20M+ patient records using SQL and R, enhancing data analytics capabilities
  • Conducted healthcare disparity analysis using the TriNetX analytics platform to identify key insights
  • Visualized data in Tableau and Excel to present critical findings to a research team spanning 7 universities
  • Contributed to research analyzing patterns across 80+ healthcare sites, enabling identification of key risk factors for adverse COVID-19 outcomes

Data Analysis Consultant

CIVCO Medical Solutions
March 2025 - May 2025

  • Evaluated sales performance for CIVCO Medical Solutions to identify revenue drivers and customer patterns
  • Transformed and cleaned the provided raw CRM and ERP data in PostgreSQL via DBeaver
  • Built Power BI dashboards integrating customer data, revenue metrics, and customer retention analytics
  • Discovered that the company's successful newly launched product had an 86% expansion rate, a significant business insight
  • Presented critical findings to the company's product and sales operations team

Education

Graduate Education

University of Iowa

MS in Data Science — Graduated May 2025

GPA: 3.93/4.00

Undergraduate Education

University of Iowa

BA in Computer Science — Graduated May 2024

BA in Psychology — Graduated May 2024

Minor in Japanese Language and Literature

GPA: 3.81/4.00

Projects

Racial Disparities in Diabetes Care

Analyzed 115M+ patient records to investigate racial disparities in diabetes care outcomes for visually impaired patients, revealing significant differences in CKD risk ratios and care standards across racial groups.

Data Analysis, Healthcare, Statistical Analysis, Research

CSAS 2025: Quantifying MLB Home Run Attempts

Analyzed MLB Statcast data to identify key factors influencing home run attempts using bat speed, swing length, and advanced statistical modeling.

Data Analysis, Sports Analytics, Statistical Modeling, Machine Learning

UK E-commerce Retention Analysis

Analyzed UK e-commerce data to uncover retention trends and actionable strategies for improving new customer retention.

Cohort Analysis, Retention, E-commerce, SQL

Publications

Madlock-Brown, C., Austin Lee, Seltzer, J., Solomonides, A., Mathews, N., Phuong, J., Weiskopf, N., Adams, W. G., Lehmann, H., & Espinoza, J. (2024). Racial Disparities in Diabetes Care and Outcomes for Patients with Visual Impairment: A Descriptive Analysis of the TriNetX Research Network. Research Square, rs.3.rs-3901158. (Under review)

Contact

Iowa City, IA