/europeanstartups-scraper

Python scraper that extracts data from the europeanstartups page, using a Google Chrome profile with a premium account already logged in.

Primary language: Python. License: MIT.



Europeanstartups Scraper

Project type: client


Table of Contents
  1. Built With
  2. Media
  3. Details
  4. Install
  5. Settings
  6. Run
  7. Roadmap

Built with

Python, Selenium

Media

web page 1

web page 2

sample terminal

sample csv file

Details

The project extracts all results from the page https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union and saves the output data in a CSV file.

The project is a Python script that uses a Google Chrome user data folder, with a premium account already logged in, to extract the data from the page.
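As a rough illustration of how a Selenium script can reuse an existing Chrome profile, here is a minimal sketch. The function names `chrome_arguments` and `open_browser` are hypothetical and not taken from this repository, and using `--headless=new` to hide the window is an assumption about how `SHOW_BROWSER = False` might be handled.

```python
def chrome_arguments(user_data_dir, show_browser=True):
    """Build Chrome command-line switches that reuse an existing profile.

    Hypothetical helper: the real project may configure Chrome differently.
    """
    # --user-data-dir points Chrome at a folder that already holds the login session.
    args = [f"--user-data-dir={user_data_dir}"]
    if not show_browser:
        # Assumption: hide the window by running Chrome in headless mode.
        args.append("--headless=new")
    return args


def open_browser(user_data_dir, show_browser=True):
    """Start Chrome through Selenium with the switches built above."""
    # Imported lazily so chrome_arguments() works without Selenium installed.
    from selenium import webdriver

    options = webdriver.ChromeOptions()
    for arg in chrome_arguments(user_data_dir, show_browser):
        options.add_argument(arg)
    return webdriver.Chrome(options=options)
```

Chrome locks a user data folder while it is in use, which is one likely reason the browser cannot be used by hand while the script runs.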

The extracted fields are:

  • NAME
  • DEALROOM SIGNAL
  • MARKET
  • TYPE
  • LAUNCH DATE
  • VALUATION
  • FUNDING
  • LOCATION
  • LAST ROUND
  • REVENUE
  • STATUS
  • GROWTH STAGE
  • EMPLOYEES
  • OWNERSHIP
  • MARKET CAP
  • DEBT
  • URL WEBSITE
  • LINKEDIN PROFILE
  • TWITTER PROFILE
  • FIRM VALUATION
  • TAGS
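The field list above maps naturally onto a CSV header. The following is only a sketch of how such rows could be written with the standard `csv` module; `FIELDS` and `write_rows` are illustrative names, and the column order in the real output file is assumed to follow the list.

```python
import csv

# Column names copied from the field list; the order is an assumption.
FIELDS = [
    "NAME", "DEALROOM SIGNAL", "MARKET", "TYPE", "LAUNCH DATE",
    "VALUATION", "FUNDING", "LOCATION", "LAST ROUND", "REVENUE",
    "STATUS", "GROWTH STAGE", "EMPLOYEES", "OWNERSHIP", "MARKET CAP",
    "DEBT", "URL WEBSITE", "LINKEDIN PROFILE", "TWITTER PROFILE",
    "FIRM VALUATION", "TAGS",
]


def write_rows(rows, stream):
    """Write dictionaries keyed by FIELDS to an open text stream as CSV."""
    # restval="" leaves a cell empty when a company lacks that field.
    writer = csv.DictWriter(stream, fieldnames=FIELDS, restval="")
    writer.writeheader()
    writer.writerows(rows)
```

To write to disk, pass a file opened with `open("output.csv", "w", newline="", encoding="utf-8")` as `stream`.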

Install

Prerequisites

Installation

  1. Clone the repo
    git clone https://github.com/darideveloper/europeanstartups_scraper.git
  2. Install the Python packages (open a terminal in the project folder first)
    python -m pip install -r requirements.txt

Settings

Create a .env file with the following content:

 CHROME_PATH = C:/Users/<<your-user-name>>/AppData/Local/Google/Chrome/User Data/ # folder where the Chrome user data is stored
 SHOW_BROWSER = True # Show (True) or hide (False) the Google Chrome window
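To make the format of these two settings concrete, here is a small standard-library sketch that reads `KEY = value # comment` lines. It is not the project's loader, which may well rely on a package such as python-dotenv; `parse_env` and `as_bool` are hypothetical names.

```python
def parse_env(text):
    """Parse simple KEY = value lines, ignoring blank lines and comments."""
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        key, separator, value = line.partition("=")
        if not separator:
            continue  # not a KEY = value line
        # Treat whitespace followed by '#' as the start of an inline comment.
        value = value.split(" #", 1)[0]
        settings[key.strip()] = value.strip()
    return settings


def as_bool(value):
    """Interpret 'True'/'False' style strings such as SHOW_BROWSER."""
    return value.strip().lower() in {"true", "1", "yes"}
```

Values come back as strings, so a flag like `SHOW_BROWSER` needs an explicit conversion such as `as_bool` before it is used in a condition.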

Run

  1. Go to https://app.europeanstartups.co/companies.startups/f/data_type/anyof_Verified/regions/allof_European%20Union and create an account (if you have problems with your email address, try a Proton email address)
  2. Activate the premium trial or buy a premium account
  3. Make sure the account stays logged in in the browser
  4. Open a terminal in the project folder
  5. Run the project folder with Python:
    python .
  6. Wait until the script finishes, then check the output.csv file in the project folder (note: while the script is running, you can't use Google Chrome).
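Once the run has finished, a quick sanity check of the output can look like the sketch below. It assumes that output.csv starts with a header row using the field names listed under Details; `count_by_column` and `summarize_file` are illustrative helpers, not part of the project.

```python
import csv
from collections import Counter


def count_by_column(lines, column="STATUS"):
    """Count rows per value of one column, given an iterable of CSV lines."""
    reader = csv.DictReader(lines)
    # A missing column is counted under the empty string.
    return Counter(row.get(column, "") for row in reader)


def summarize_file(path="output.csv", column="STATUS"):
    """Open the scraper output and count rows per value of the column."""
    with open(path, newline="", encoding="utf-8") as handle:
        return count_by_column(handle, column)
```

Calling `sum(summarize_file().values())` gives the total number of scraped companies, which can be compared with the result count shown on the page.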

Roadmap

  • Use Chrome data to avoid logging in to the page
  • Extract all data from the page
  • Save output data in csv file