SMAD_CCCP_WORK

Synthesio Dashboard

https://www.synthesio.com/login/

Goal: automatically download tweets from the Synthesio Dashboard.

Technique: Selenium in Python

Code

Hadoop Server

Goal: Import tweets to the Hive Table in the Hadoop

Use SQL to select and download tweets from the Hadoop server

Possible work: build a new table to import tweets downloaded from the Synthesio

Technique: SQL

Code

Learn how to use SQL to download data from the server using Aman's tutorial

BadegerLink

http://badgerlink.newsmemory.com/wna/badgerlink/old.php

Goal: automatically download tweets from the BadgerLink archive

Technique: Python Web scrape

Code

LexisNexis

https://www.lexisnexis.com/en-us/gateway.page

Goal: automatically download tweets from the LexisNexis archive

Technique: Selenium in Python

Code:

CrowdTangle

https://www.crowdtangle.com/

Goal: extract information from the CrowdTangle API

Technique: Python

Code:

Web Contents Scraping

Goal: Scrape web contents from a given website (e.g., A news from the NewYork Times)

Technique: Python Web scrape

Code: