Big Data Project: Visualization of the Opioid Crisis in the USA

less than 1 minute read

Published:

Visualization of the Opioid Crisis in the USA

1 Introduction

CSE 6242 is a graduate level course at Georgia Tech that focused on using techniques and tools for analyzing and visualizing data at scale. My final project revolved around cleaning, analyzing, and deriving meaning from an 88+ GB opioid dataset with 179 million transaction records. Technologies and tools utilized throughout this project included:

  • Dask
  • Pandas
  • SQLite
  • Python Jupyter notebooks
  • GitHub Pages

Additionally, the following predicitive algorithms were utilized:

  • log-log linear regression
  • log-log LASSO regression
  • random forest
  • linear regression

2 Poster

3 Report