/authortwitter

Grabs the Twitter username of a blog post's author with a URL

Primary LanguagePythonMIT LicenseMIT

authortwitter

Python tool that grabs the Twitter username of a blog post's author with a URL. Different blogs have different ways of displaying author Twitter usernames. Increasingly, blogs are starting to use the <meta name="twitter:creator" content="@username"> tag, but a lot still don't. Usernames could be in a Twitter button at the bottom of the post, or buried in the author's own user archive page. Each blog that's supported as detailed below has the current XPath routes for the username.

Compatible with Python 3.x

Requires: lxml, requests, urllib, re

Examples

https://www.engadget.com/2017/05/19/watch-an-ai-teach-itself-to-drive-in-gta-v/ will output OutOnALumb https://contently.com/strategist/2017/05/18/new-demand-waterfall/ will output JoeLazauskas http://contentmarketinginstitute.com/2017/05/create-high-converting-content/ will output shane_barker

Usage

from authortwitter import AuthorTwitter

get_username = AuthorTwitter()
url = 'https://contently.com/strategist/2017/05/18/new-demand-waterfall/'
return get_username.extract_author(url) # Returns 'JoeLazauskas'

Blogs supported

* (Any website that uses the 'twitter:creator' meta tag)
* bufferapp.com
* citylab.com
* blogs.cisco.com
* coindesk.com
* consumerist.com
* contently.com
* contentmarketinginstitute.com
* conversionxl.com
* convinceandconvert.com
* digitaltrends.com
* engadget.com
* entrepreneur.com
* financemagnates.com
* fivethirtyeight.com
* blog.hootsuite.com
* kinja.com
* lifehacker.com
* marketingland.com
* martech.zone
* mashable.com
* medium.com (Blogs with custom domains planned)
* nakedsecurity.sophos.com
* readwrite.com
* shopify.com
* socialmediaexplorer.com
* socialmediatoday.com
* techcrunch.com
* thehackernews.com
* theregister.co.uk
* theverge.com
* venturebeat.com
* wired.co.uk