No authentication. No API. No limits.
Formerly known as Tweep, Twint is an advanced Twitter scraping tool written in Python that allows for scraping Tweets from Twitter profiles without using Twitter's API.
Twint utilizes Twitter's search operators to let you scrape Tweets from specific users, scrape Tweets relating to certain topics, hashtags & trends, or sort out sensitive information from Tweets like e-mail and phone numbers. I find this very useful, and you can get really creative with it too.
Twint also makes special queries to Twitter allowing you to also scrape a Twitter user's followers, Tweets a user has liked, and who they follow without any authentication, API, Selenium, or browser emulation.
Some of the benefits of using Twint vs Twitter API:
- Can fetch almost all Tweets (Twitter API limits to last 3200 Tweets only)
- Fast initial setup
- Can be used anonymously and without Twitter sign up
- No rate limitations
- Python 3.5/3.6
pip3 install -r requirements.txt
- Git:
git clone https://github.com/haccer/twint.git
- Pip:
pip3 install twint
- With MySQL:
git clone -b mysql --single-branch https://github.com/haccer/twint.git twint-mysql
A few simple examples to help you understand the basics:
python3 twint.py -u username
- Scrape all the Tweets from user's timeline.python3 twint.py -u username -s pineapple
- Scrape all Tweets from the user's timeline containing pineapple.python3 twint.py -s pineapple
- Collect every Tweet containing pineapple from everyone's Tweets.python3 twint.py -u username --year 2014
- Collect Tweets that were tweeted before 2014.python3 twint.py -u username --since 2015-12-20
- Collect Tweets that were tweeted since 2015-12-20.python3 twint.py -u username -o file.txt
- Scrape Tweets and save to file.txt.python3 twint.py -u username -o file.csv --csv
- Scrape Tweets and save as a csv file.python3 twint.py -u username --fruit
- Show Tweets with low-hanging fruit.python3 twint.py -s "Donald Trump" --verified
- Display Tweets by verified users that Tweeted about Donald Trump.python3 twint.py -g="48.880048,2.385939,1km" -o file.csv --csv
- Scrape Tweets from a radius of 1km around a place in Paris and export them to a csv file.python3 twint.py -u username -es localhost:9200
- Output Tweets to Elasticsearchpython3 twint.py -u username -o file.json --json
- Scrape Tweets and save as a json file.python3 twint.py -u username --database tweets.db
- Save Tweets to a SQLite database.python3 twint.py -u username --followers
- Scrape a Twitter user's followers.python3 twint.py -u username --following
- Scrape who a Twitter user follows.python3 twint.py -u username --favorites
- Collect all the Tweets a user has favorited.python3 twint.py -u username --following --user-full
- Collect full user information a person followspython3 twint.py -u username --profile-full
- Use a slow, but effective method to gather all the Tweets from a user's profile (Including Retweets).python3 twint.py -u username --retweets
- Use a quick method to gather the last 900 Tweets (that includes retweets) from a user's profile.
More detail about the commands and options are located in the wiki
Twint can now be used as a module and supports custom formatting. More details are located in the wiki
import twint
# Configure
c = twint.Config()
c.Username = "now"
c.Search = "pineapple"
c.Format = "Tweet id: {id} | Tweet: {tweet}"
# Run
twint.run.Search(c)
955511208597184512 2018-01-22 18:43:19 GMT <now> pineapples are the best fruit
- Write to file.
- CSV
- JSON
- SQLite
- Elasticsearch
- MySQL (See MySQL Branch)
Details on setting up Elasticsearch with Twint is located in the wiki.
Graph details are also located in the wiki.
We are testing a (free) graph plugin for Kibana, details located in the Wiki!
Thanks to @hpiedcoq & @pielco11 for contributing several features!
Shout me out on Twitter: @now
If you have problems or have suggestions don't hesitate to open an issue or ask about it directly.