Python scraper for extract data from the page europeanstartups, using python, and a google chrome data with a premium account already logged.
Project type: client
The project extract all results from the page https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union, and save the output data in a csv file.
The project is a python script, that use a google chrome data with a premium account already logged, to extract the data from the page.
The data extract is:
- NAME
- DEALROOM SIGNAL
- MARKET
- TYPE
- LAUNCH DATE
- VALUATION
- FUNDING
- LOCATION
- LAST ROUND
- REVENUE
- STATUS
- GROWTH STAGE
- EMPLOYEES
- OWNERSHIP
- MARKET CAP
- DEBT
- URL WEBSITE
- LINKEDIN PROFILE
- TWITTER PROFILE
- FIRM VALUATION
- TAGS
- Clone the repo
git clone https://github.com/darideveloper/europeanstartups_scraper.git
- Install python packages (opening a terminal in the project folder)
python -m pip install -r requirements.txt
Create a .env
file with the following content
CHROME_PATH = C:/Users/<<your-user-name>>/AppData/Local/Google/Chrome/User Data/ #the chrome path is the folder where chrome data its installed
SHOW_BROWSER = True # Show (True) or Hide (False) the google chrome window
- Go to https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union and create an account (if you have problems with your email, try with a proton email)
- Activate the premium trial or buy a premium account
- be sure to keep the account logged in the browser.
- Open a terminal in the project folder
- Run the project folder with python:
python .
- Wait until the script finish, and check the
output.csv
file in the project folder (note: while the script its running, you can't use google chrome).
- Use chrome data fror avoid login in the page
- Extract all data from the page
- Save output data in csv file