/batter-pitcher-2vec

A model for learning distributed representations of MLB players.

Primary LanguageJupyter NotebookMIT LicenseMIT

(batter|pitcher)2vec

(batter|pitcher)2vec is a model inspired by word2vec that learns distributed representations of MLB players. The associated paper, "(batter|pitcher)2vec: Statistic-Free Talent Modeling With Neural Player Embeddings", was one of eight (out of "nearly 200" submitted) selected to be presented in the Research Papers Competition at the 2018 MIT Sloan Sports Analytics Conference (a recording of the presentation can be found here and the slides can be found here). I was lucky enough to advance to the final four, where each presenter gave a shorter presentation (recording here) to a panel of judges consisting of Nate Silver (of FiveThirtyEight), Kirk Goldsberry (the guy who invented these hexagonal shot charts), and Will Edmonson (the Director of Analytics for Major League Baseball Advanced Media). I ended up finishing in third place, which came with a $1,000 prize. A Jupyter notebook reproducing the results discussed in the paper can be found here.