1. Survival of the Fittest; Variable Selection on Agricultural Data from the Galapagos Islands

    Found key variables among 200+ candidates using Elastic Net and Forward Selection.

    PDF Github

  2. Hidden warnings; using topic modeling on Yelp reviews to predict health inspection violations

    A deep dive into the predictive power of Supervised Latent Dirichlet Allocation on text reviews to detect food safety concerns.

    PDF Github

  3. Yelp review text recommender system

    My first foray into text mining to build a modified recommender system with reduced sparsity.

    Github

  4. Knowledge of influenza vaccination recommendation and early vaccination uptake during the 2015–16 season

    Contributed data processing and calculation of estimates for journal article in Vaccine

    Journal

  5. Impact of Survey Device on Response to an Early Season Influenza Vaccination Survey

    Compared responses on Mobile vs Tablet vs Computer devices to National Internet Flu Survey for CDC. Presented findings at American Association for Public Opinion Research (AAPOR) 2016 conference.

    PDF

  6. Fit for Purpose

    Evaluated suitability of web panel survey compared to traditional large-scale surveys for early-season flu vaccination estimation for CDC. Presented findings at AAPOR 2015 conference.

    PDF

  7. College Scorecard

    Programmed data processing scripts for the Department of Education’s inaugural edition of the College Scorecard.

    Website