Web-Scraper-SEO

This repository contains a web scraper for SEO marketers. It extracts metadata from websites, whether they are static or JavaScript-based.

Primary language: Python

Web Scraper for SEO Marketers 👨‍💻

Beautiful Soup is a powerful library for web scraping, but it often struggles with JavaScript-enabled websites because it only parses the HTML it is given and does not execute scripts. To tackle this, I use Selenium along with BeautifulSoup: Selenium loads the page in a browser, the rendered source code is stored in a Python variable, and BS4 then scrapes it.
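The Selenium-to-BS4 handoff described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function name `rendered_soup` is hypothetical, and it assumes you pass in a Selenium WebDriver that has already been started.

```python
from bs4 import BeautifulSoup


def rendered_soup(driver, url):
    """Load `url` in a started Selenium driver and parse the rendered page.

    `driver` is assumed to be a Selenium WebDriver (for example Chrome);
    the function name is hypothetical and only used for illustration.
    """
    driver.get(url)              # the browser loads the page and runs its JavaScript
    html = driver.page_source    # rendered source code as a plain Python string
    return BeautifulSoup(html, "html.parser")
```

With a real driver you would call something like `soup = rendered_soup(driver, "https://example.com")` and then query `soup` as usual. Pages that fill in content some time after loading may need an explicit wait before `page_source` is read.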

What Can You Extract?

The meta tags and other on-page elements you can extract using this scraper are:

  1. Page Title (Length of Title)

  2. H1 (Length of H1)

  3. H2

  4. Meta Description (Length of Meta Description)

  5. Meta Keywords

  6. Alt Image Tags

  7. Anchor Text

  8. Internal Links
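As a rough sketch of how the items listed above could be pulled out of a page with BS4, assuming the HTML is already available as a string: the function name `extract_seo_metadata`, the dictionary keys, and the rule that a link is "internal" when it points to the same host as the page are all assumptions made for this example, not necessarily what the scraper itself does.

```python
from urllib.parse import urljoin, urlparse

from bs4 import BeautifulSoup


def extract_seo_metadata(html, page_url):
    """Collect common SEO fields from an HTML string (hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")

    def meta_content(name):
        # Value of <meta name="..." content="...">, or "" if the tag is missing.
        tag = soup.find("meta", attrs={"name": name})
        return tag.get("content", "").strip() if tag else ""

    title = soup.title.get_text(strip=True) if soup.title else ""
    h1 = [h.get_text(strip=True) for h in soup.find_all("h1")]
    h2 = [h.get_text(strip=True) for h in soup.find_all("h2")]
    description = meta_content("description")

    host = urlparse(page_url).netloc
    anchor_text, internal_links = [], []
    for a in soup.find_all("a", href=True):
        anchor_text.append(a.get_text(strip=True))
        absolute = urljoin(page_url, a["href"])  # resolve relative links
        # Assumption: "internal" means same host as the page itself.
        if urlparse(absolute).netloc == host:
            internal_links.append(absolute)

    return {
        "title": title,
        "title_length": len(title),
        "h1": h1,
        "h1_lengths": [len(text) for text in h1],
        "h2": h2,
        "meta_description": description,
        "meta_description_length": len(description),
        "meta_keywords": meta_content("keywords"),
        "image_alts": [img["alt"] for img in soup.find_all("img", alt=True)],
        "anchor_text": anchor_text,
        "internal_links": internal_links,
    }
```

The lengths are plain character counts, which is what SEO tools usually compare against their recommended title and description limits.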

Please try it and let me know if you like it!

I plan to add more features in the future and will share them soon.

Enjoy, and please scrape websites at your own risk!

Note: Please change the path to the Chrome Driver, which I have provided in my GitHub repository under the name "chromedriver.exe", so that it matches its location on your machine.
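One way to keep the driver path easy to change is to hold it in a single place and check it before starting the browser. The sketch below assumes Selenium 4, where the path is passed through a `Service` object (Selenium 3 took it as an `executable_path` argument instead). The helper names `resolve_driver_path` and `build_chrome_driver` are hypothetical.

```python
from pathlib import Path


def resolve_driver_path(path):
    """Return the absolute driver path, or raise if the file is missing."""
    driver_path = Path(path).expanduser().resolve()
    if not driver_path.is_file():
        raise FileNotFoundError(f"Chrome Driver not found at: {driver_path}")
    return driver_path


def build_chrome_driver(path):
    """Start headless Chrome with the driver at `path` (assumes Selenium 4)."""
    # Imported here so the path check above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.chrome.service import Service

    options = Options()
    options.add_argument("--headless")  # run without opening a browser window
    service = Service(executable_path=str(resolve_driver_path(path)))
    return webdriver.Chrome(service=service, options=options)
```

You would then call `build_chrome_driver("chromedriver.exe")` with your own path, and call `driver.quit()` when scraping is done.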