/ibjjf_scraper_to_excel

Data scraping IBJJF's website to get a list of my teammates competing in BJJ. I built this for my own personal use.

Primary LanguagePython

ibjjf_scraper_to_excel

The International Brazilian Jiu-Jitsu Federation (IBJJF) is a for-profit company that hosts several of the biggest Brazilian jiu-jitsu tournaments in the world. I love BJJ, actively compete in it, and have many teammates compete in these IBJJF events often. Some of these tournaments have 1000s of competitor entries. Using BeautifulSoup, pandas, and openpyxl libraries, I webscrape a list of my teammates into an Excel sheet where I will later figure out each individual competitor's mat assignment and time. I added Excel conditional formatting via Python based on their current BJJ belt rank. Then I convert each person's name into a URL of their invdividual bracket in the Excel file. I will be able to use this scraper for future events to get a list of my teammates faster. We attend about 7 of these events per year.

Once bouts are assigned, the time and mat assignment is dumped in, and then my team uses this list to stay organized as one big group at the event. I built this for my own personal use because I sometimes have 30+ teammates competing. I scrape the IBJJF websites in three parts because the IBJJF does not create all three websites at the same time. They typically release them in a certain order. And so I scrape in a way that matches that.

IBJJF Registration List:

These lists are typically over a thousand athletes.
IBJJF Screenshot

IBJJF Brackets with URL:

Hundreds of brackets.
IBJJF Screenshot

Intermediary Result (Part 2):

Excel Screenshot

Intermediary Result (Part 3):

Excel Screenshot

End Result:

I then can use Excel's custom sort function (sort by DateTime) to sort the list of competitors by when they compete.
Each person's name is a link to their actual bracket.
Excel Screenshot

Example of Individual Bracket:

Excel Screenshot