/SeekingAlpha-Scraper

A bot for collecting Earnings Announcement Transcripts from SeekingAlpha.com

Primary LanguagePythonBSD 2-Clause "Simplified" LicenseBSD-2-Clause

SeekingAlpha-Scraper

A bot for collecting Earnings Announcement Transcripts from SeekingAlpha.com

Disclaimer

DO NOT ABUSE THIS SCRAPER.

This scraper was developed for educational purposes only and serves no other purposes. SeekingAlpha TOS prohibit scraping and I make no representation that your account won't be banned if you use this program. Furthermore, should I be contacted by SeekingAlpha with a request to remove this repo, I will do so immediately.

About

This scraper collects the earnings announcements of a specific company (identified with ticker or t).

Installation

First, make sure that you're using Python 3.7+.

  1. Clone or download this repository.
  2. Run pip install -r requirements.txt inside this repo. Consider doing this inside of a Python virtual environment.
  3. Create a secret.yaml file containing the keys username and password with your SeekingAlpha login information. You should consider creating a dummy SeekingAlpha account.
  4. It is highly recommended (but not required) that you use a virtual environment for your project.

Usage

usage: Scraper.py [-h] [-t ticker]

optional arguments:
  -h, --help                                  show this help message and exit
  -t TICKER, --ticker TICKER                  Ticker Symbol of the desired company.

Run the script as follows, taking Costco (Ticker Symbol: COST) as an example.

Example 1

python Scraper.py -t COST

The outputs will be saved in Outputs/Transcripts as HTML files. Each file will be saved in the following format:

{ticker}_transcripts_{article_title}.html

where ticker is the ticker symbol and article_title is the title of the article.

Note

  • At this moment, this scraper does not pass any CAPTCH tests. You should probably do it manually in the session that you start and try again.
  • After the first time you sing in, you will not need to sing in again because the chromedriver option user-data-dir is added.
  • This module uses webdriver-manager, which handles chromedriver versions automatically.
  • Lastly and importantly, this scraper is slow by design. As mentioned before, this is for educational purposes only and nothing more.