Attention is All You Need, for Sports Tracking Data

Introduction

The rapid advancement of spatial tracking technologies in sports has led to an unprecedented surge in high-quality, high-volume tracking data across all levels of play. While this data has catalyzed innovations in sports analytics, current methodologies often struggle with a fundamental challenge: the player-ordering problem. This issue arises from the dynamic nature of team sports, where player roles and formations are fluid and can vary between games, making it difficult to create consistent input structures for machine learning models. This paper introduces a transformative approach to sports analytics by applying Transformer architectures to address these challenges. Our framework operates end-to-end on raw player tracking data, naturally handles unordered collections of player vectors, and is inherently designed to learn pairwise spatial interactions between players. Using the NFL's 2024 Big Data Bowl dataset, we demonstrate our approach's effectiveness in predicting tackle locations, showing significant improvements over commonly used architectures, particularly in generalizing to diverse game situations. The repository contains our implementation and experimental results, providing a foundation for future research in sports analytics using Transformer-based architectures. We believe this approach can be extended beyond American football to other team sports, offering a more robust and generalizable framework for analyzing player tracking data.

Our key contributions include:

A minimal-feature-engineering approach to handling the player-ordering problem
An end-to-end Transformer architecture adapted for sports tracking data
Empirical evidence showing superior generalization compared to existing methods
Open-source implementation for reproducibility and further research

We hope this work catalyzes a shift in sports analytics research methodologies, advancing our ability to derive meaningful insights from tracking data across various sports domains.

Getting Started With The Code

This repo uses uv to manage python version, environment, and dependencies. Start off by installing it:

# On macOS and Linux.
$ curl -LsSf https://astral.sh/uv/install.sh | sh

# On Windows.
$ powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

After cloning the repo and opening a new terminal inside the repo workspace, perform the following steps:

Run uv python install to install the python version pinned in the .python-version file.
Run uv sync to create a .venv and populate it with the locked dependencies.
Run uv venv and source .venv/bin/activate to create and activate the virtual environment.

The notebook results_analysis has some viz and results.

SumerSports/SportsTrackingTransformer

Attention is All You Need, for Sports Tracking Data

Introduction

Getting Started With The Code