/Data-Projects

A small repository for some personal interest projects regarding data science

Primary LanguagePython

Data-Projects

A small repository for some personal interest projects regarding data science.

This is a collection of some of my work in data science, my primary interests are in modeling and predictive analysis for all sorts of data, but I also enjoy desciptive andalysis for both social and scientific subjects. If there are any questions about any of there projcets please reach out to me at reeder.nick@gmail.com Below is a list of my current projects:

  • IMDB Director Network
    • Scraped data from IMDB to build a bipartite newtwork of directors and crew members. Edges are created when a crew member worked for a director. Only data from feature length films was extracted.
    • Weights for edges were calculated with the below formula. Each edge additionally has a feature for the primary role of that crew-director pair.
      • $\text{weight} = \frac{\text{number of times this crew members has worked for this director in their primary role for this director}}{\text{number of times this director has employed this role}}$
    • Each director node has the following features
      • Name
      • Sex
      • Ethnicity
      • Lables
        • H: top 25 grossing
        • Q: LGBTQ
    • Each crew node has the following features
      • Name
    • After network was generated, made visualizations for the graph and answered key research questions