Corporate Drones Drone

This project is a web scraper that runs every morning at 8 AM EST, collects all songs submitted to the Corporate Drones Music League (https://app.musicleague.com/l/4b82d7c3ca0e4d5db2d9807e3f1da0cc/), and saves them to S3.
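
As a rough illustration of the "save to S3" step, the scraped songs could be written to a date-stamped JSON object. This is only a sketch: the `submissions` prefix, the bucket argument, and the function names are hypothetical and not taken from this project. The only AWS call used is boto3's `put_object`.

```python
import json
from datetime import date


def build_s3_key(run_date, prefix="submissions"):
    """Return a date-stamped key, e.g. 'submissions/2024-01-15.json' (hypothetical layout)."""
    return f"{prefix}/{run_date.isoformat()}.json"


def serialize_songs(songs):
    """Encode a list of song dicts as UTF-8 JSON bytes."""
    return json.dumps(songs, ensure_ascii=False, indent=2).encode("utf-8")


def save_to_s3(songs, bucket, run_date):
    """Upload the songs to the given bucket (bucket name is an assumption)."""
    import boto3  # imported here so the helpers above work without AWS dependencies

    key = build_s3_key(run_date)
    boto3.client("s3").put_object(
        Bucket=bucket,
        Key=key,
        Body=serialize_songs(songs),
        ContentType="application/json",
    )
    return key
```

One object per day keeps each morning's run separate, so a failed or repeated run only overwrites that day's file.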

How to run locally

Note: chromedriver_prod is the headless Linux driver used in production. Local development uses whichever chromedriver you have in the project.
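
One way to switch between the two drivers is to check whether the code is running inside Lambda. The sketch below assumes the drivers sit at `./chromedriver_prod` and `./chromedriver` (hypothetical paths) and relies on `AWS_LAMBDA_FUNCTION_NAME`, an environment variable that the Lambda runtime sets. It is not necessarily how main.py does it.

```python
import os


def pick_chromedriver_path(env=None):
    """Choose the driver binary: production build on Lambda, local build otherwise."""
    env = os.environ if env is None else env
    if env.get("AWS_LAMBDA_FUNCTION_NAME"):
        return "./chromedriver_prod"  # hypothetical location of the headless Linux driver
    return "./chromedriver"  # hypothetical location of the local driver


def chrome_arguments(headless):
    """Return Chrome command-line flags; the headless set is typical for Lambda."""
    args = ["--window-size=1280,1024"]
    if headless:
        args += ["--headless", "--no-sandbox", "--disable-gpu", "--single-process"]
    return args
```

The returned flags can be passed one at a time to Selenium's `ChromeOptions.add_argument`.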

Install dependencies

pip install -r requirements.txt

Export environment variables

This project uses Spotify credentials to authenticate with Spotify. You must export them before running:

export SPOTIFY_USERNAME={SPOTIFY_USERNAME}
export SPOTIFY_PASSWORD={SPOTIFY_PASSWORD}
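
If either variable is missing, failing early with a clear message is friendlier than a login error halfway through the scrape. A minimal sketch of such a check follows; the function name is hypothetical and not part of this project.

```python
import os


def load_spotify_credentials(env=None):
    """Return (username, password) from the environment or raise if either is unset."""
    env = os.environ if env is None else env
    required = ("SPOTIFY_USERNAME", "SPOTIFY_PASSWORD")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return env["SPOTIFY_USERNAME"], env["SPOTIFY_PASSWORD"]
```

Calling `load_spotify_credentials()` with no arguments reads from the real process environment. Passing a dict makes the check easy to test.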

Run main file

python3 main.py

Deployment

Deployments are run with AWS SAM.

Lambda layer arn:aws:lambda:us-east-1:373505631200:layer:chromedriver:8 holds the headless Chromium binary and chromedriver. There are many version compatibility issues between the Python version, Selenium, and Chromium/chromedriver. The versions used are serverless-chrome v1.0.0.37 and chromedriver 2.37.

To deploy, run:

sam build
sam deploy