/email-scraper

Simple Python library to scrape email addresses from HTML

Primary language: Python. License: MIT.

Python Module for Scraping Email Addresses from HTML

The email_scraper module provides a simple function, scrape_emails, that extracts email addresses from HTML and returns them as a set. It finds addresses in plain text and in links, and also addresses obfuscated with atob() (Base64) or with HTML entities.

Available on PyPI.

Usage

>>> from email_scraper import scrape_emails
>>> scrape_emails('<html><body><a href="mailto:hello@world.com">email me</a></body></html>')
{'hello@world.com'}
>>> scrape_emails('<a href="javascript:window.location.href=atob(\'bWFpbHRvOmVtYWlsQGV4YW1wbGUuY29t\')">E-Mail</a>')
{'email@example.com'}
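
To illustrate the kinds of obfuscation mentioned in the description, here is a minimal sketch that uses only the Python standard library. It is not the implementation used by email_scraper: the helper name find_emails_sketch and both regular expressions are assumptions made for this example, and the address pattern is deliberately simple.

```python
import base64
import binascii
import html
import re

# Deliberately simple address pattern (an assumption for this sketch,
# not the pattern email_scraper uses).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# Matches atob('...') or atob("...") and captures the Base64 argument.
ATOB_RE = re.compile(r"atob\(\s*['\"]([A-Za-z0-9+/=]+)['\"]\s*\)")


def find_emails_sketch(markup):
    """Hypothetical helper: collect addresses from plain text, links,
    HTML entities, and atob() payloads."""
    # Decode entities such as &#64; so entity-encoded addresses become visible.
    text = html.unescape(markup)

    # Plain text and mailto: links are both covered by a search over the text.
    found = set(EMAIL_RE.findall(text))

    # Decode each Base64 argument passed to atob() and search the result too.
    for payload in ATOB_RE.findall(text):
        try:
            decoded = base64.b64decode(payload, validate=True).decode("utf-8")
        except (binascii.Error, UnicodeDecodeError):
            continue  # not valid Base64-encoded text, skip it
        found.update(EMAIL_RE.findall(decoded))

    return found
```

Calling find_emails_sketch on the two inputs from the Usage section returns the same sets shown there. Decoding entities before matching means a single pattern handles both plain and entity-encoded addresses; the atob() pass is kept separate because its payload only becomes readable after Base64 decoding.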