About
I'm a Data Science major at the University of Michigan, focused on machine learning, statistical analysis, and turning data into clear, actionable insights. I enjoy building projects that bridge theory and practice.
Experience
-
Research Assistant — Computational Lead University of Michigan Department of Epidemiology
- Leading the technical development of a Python pipeline using a large language model to classify narratives of collective violence.
- Performing data cleaning, exploratory data analysis, and descriptive statistical analysis on a large dataset using R and Python.
- Evaluating model performance using performance metrics including accuracy, precision, recall, and F1 score.
-
Intern, Data Management Ally Financial, Inc.
- Created data dictionaries and data lineage documentation leveraging Python, enhancing data governance and compliance.
- Added data quality checks to existing data pipelines using Python and SQL, improving data integrity and reliability.
- Developed a data availability dashboard in Power BI to monitor data refresh rates and availability, ensuring timely access to critical data.
-
Research Assistant University of Michigan Department of Epidemiology
- Awarded 3rd place (top ~1%) in the CDC Youth Mental Health: Novel Variables Data Challenge.
- Developed Python pipelines using a large language model on a high performance computing cluster to extract trends from the narratives of victims of suicide.
- Cleaned, preprocessed, and feature engineered a confidential dataset of over 400,000 narratives, applying natural language processing techniques to identify relevant cases and enhance the annotation process.
-
Certified Grassroots Soccer Referee Michigan Soccer Referee Association
- Officiating on a referee team to create a safe, fair, and fun environment for players, coaches, and spectators.
- Applying knowledge of the rules of soccer to facilitate the sport.
Resume
Education
Relevant coursework: Machine Learning, Artificial Intelligence, Computer Vision, Computational Modelling of Complex Systems, Database Management, Regression Analysis, Probability and Statistics, Bayesian Data Analysis, Data Structures and Algorithms
Activities: Michigan Data Science Team (MDST) — Member 2 years, Heartbeat (A Cappella) — Member 1 year
Activities: Men's Varsity Soccer — Captain 1 year, Member 2 years, Select Choir — Member 3 years
Projects
-
Predicting Survival in the ICU
Developed logistic regression and RNN models to predict patient survival in the ICU using a public dataset. Performed data cleaning, preprocessing, and feature engineering to prepare data for model training. Fine-tuned and evaluated the models using cross-validation and grid search.
Python · PyTorch · Scikit-learn · Cross-validation · Grid Search
Skills
- Programming Languages
- C++, Python, R, SQL, Java
- Python Libraries
- PyTorch, Scikit-learn, NumPy, Pandas, Matplotlib, NLTK, spaCy
- Software & Tools
- Git, VS Code, Microsoft Office Suite, Google Suite, Power BI, Snowflake, Collibra, Alation, Manta