/dss_ml_edition_2024

Repository containing the code used for my presentation at Data Science Summit 2024, ML Edition

Primary LanguageJupyter Notebook

Cover image

Data Science Summit 2024, ML Edition

Repository containing the code used for my presentation at Data Science Summit 2024, ML Edition

Classifying Legendary Pokémon

A demo project showcasing experiment tracking capabilities of DVC. The example is built on the case of classifying Pokémon as legendary.

Known limitations

  • Generation is based on Pokédex number. That does not reflect regional forms (Galarian, Alolan, etc.), Mega Evolutions, etc.
  • Until generation 8 (incl.), the legendary classification consisted of the following: Sub-Legendary Pokémon | Legendary Pokémon | Mythical Pokémon. We treat them all as legendary for the sake of this exercise.
  • From generation 9 onwards, the legendary classification is slightly different, that is, there are groups called: Sub-Legendary Pokémon | Ultra Beasts | Paradox Pokémon | Restricted Legendary Pokémon | Mythical Pokémon. For example, Ultra Beasts are now a separate class and are not considered Legendary.

Instructions

  • prepare the raw dataset using the src/getting_data/get_pokedex.py
  • execute the pipeline (or its steps) as described in dvc.yaml

References

Docs:

My articles on setting up experimentation with DVC:

How DVC uses git references for experiment tracking:

Misc:

Icons: