Genius-NoToken-Scraper

A Python program that scrapes an album's Genius and Spotify pages for lyrics, annotation, and URI data without needing a client access token.

Primary language: Python. License: GNU General Public License v3.0 (GPL-3.0).

GeniusNoTokenScraper:

Usage

You are free to fork the repository or copy the source into your own project under the terms of the GNU General Public License v3.0.

Example

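# Links to the same album on Spotify and Genius
album_spotify_link = "https://open.spotify.com/album/41GuZcammIkupMPKH2OJ6I"
album_genius_link = "https://genius.com/albums/Travis-scott/Astroworld"
singer = "Travis Scott"
year = "2018"
# Scrape both album pages into data.csv
get_album_data_csv(album_genius_link, album_spotify_link, singer, year)
# Scrape lyrics and annotation IDs for the album into a JSON file
csv_to_json("data.csv")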

Result: the album is first parsed into a CSV file, data.csv, and the final results are then written to a JSON file, data.JSON.
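As a rough illustration of the CSV-to-JSON step only, here is a minimal sketch using the Python standard library. The function name csv_text_to_json, the column names, and the placeholder URIs are assumptions made for this example; the actual columns of data.csv may differ, and the project's csv_to_json also scrapes each song's lyrics, which is left out here.

```python
import csv
import io
import json


def csv_text_to_json(csv_text):
    """Convert CSV text with a header row into a JSON array of objects."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return json.dumps(list(reader), indent=2)


# Illustrative rows only: the columns and URIs are placeholders,
# not the real layout of data.csv.
sample = (
    "track,artist,year,spotify_uri\n"
    "STARGAZING,Travis Scott,2018,spotify:track:EXAMPLE1\n"
    "CAROUSEL,Travis Scott,2018,spotify:track:EXAMPLE2\n"
)

print(csv_text_to_json(sample))
```

Each CSV row becomes one JSON object keyed by the header names, so every value stays a string, matching the String inputs used in the example above.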

Methods and Variables

| Method | Effect | Input | Output |
| --- | --- | --- | --- |
| get_album_data_csv | Parses album data from the album's Genius page and Spotify page | genius_album_link (String), spotify_album_link (String), artist_name (String), album_year (String) | A CSV file with the scraped album data |
| get_song_data | Scrapes a song's Genius lyrics page for lyrics data and lyric annotation IDs | genius_link (String) | A list of song lyrics with the corresponding annotation ID for each lyric |
| csv_to_json | Uses the CSV file generated by get_album_data_csv to scrape the full album for lyrics and annotation IDs and writes the result to a JSON file | artist_csv (String) | A JSON file with the album data for each lyric |
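To make the output of get_song_data more concrete, the sketch below pairs each lyric line with an annotation ID using only the standard library. It is not the project's implementation: the class name AnnotatedLyricsParser and the sample markup are hypothetical, and it assumes annotated lyrics are wrapped in an anchor tag carrying a data-id attribute, which may not match the current Genius page structure.

```python
from html.parser import HTMLParser


class AnnotatedLyricsParser(HTMLParser):
    """Collect (lyric_text, annotation_id) pairs from simplified markup."""

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._current_id = None  # annotation ID of the enclosing <a>, if any

    def handle_starttag(self, tag, attrs):
        # Assumption: an annotated lyric is an <a> with a data-id attribute.
        if tag == "a":
            self._current_id = dict(attrs).get("data-id")

    def handle_endtag(self, tag):
        if tag == "a":
            self._current_id = None

    def handle_data(self, data):
        # Record every non-empty line of text with the active annotation ID.
        for line in data.splitlines():
            line = line.strip()
            if line:
                self.pairs.append((line, self._current_id))


# Hypothetical markup standing in for a downloaded lyrics page.
sample_html = (
    '<div class="lyrics">'
    '<a data-id="101">First annotated line</a><br>'
    "Plain line without annotation<br>"
    '<a data-id="102">Second annotated line</a>'
    "</div>"
)

parser = AnnotatedLyricsParser()
parser.feed(sample_html)
parser.close()

for text, annotation_id in parser.pairs:
    print(annotation_id, text)
```

Lines that are not inside an annotated anchor get None as their ID, so unannotated lyrics are kept rather than dropped.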