/pure_search

a web application to visualize job board data collected from Indeed in Python, Flask, Pandas, Selenium, and ReactJS.

Primary LanguageShell

PureSearch

👋 Thanks for checking out my project! This is a web application to visualize job board data collected from Indeed.


This is the main repo to bridge together my capstone project from App Academy. When I was a recruiter, I had always hoped for a way to automatically aggregate job postings from various markets and funnel them into an interface, so I built one!

This application and web scraper was able to collect more than 18 thousand job postings from Indeed.com across seven different geographic markets and four different technologies (Python, JavaScript, Ruby, and Java). After filtering the data through Pandas to wrangle it into something more useful, I was able to extract some really interesting insights into the current US tech market (avg salaries per geographic market, highest salaries per technology, technology with most job volume, etc) This is data that I could have only dreamed of having while recruiting in the competitive NYC tech market.

The project is comprised of several repositories linked here:

  • API - Built with Python and Flask
  • Client - Built with React
  • Scraper - Built with Python, Pandas, and Selenium

Takeaways:

  • Combining Selenium, Pandas, and Beautiful Soup to scrape data that powers interesting and useful applications is very exciting! This project is immediately valuable to me as I enter into my first job search as a developer, and I think that's cool! 😎
  • Filtering data in a performant way is tricky.
  • I really enjoyed building a small pipeline of data but have a lot of room for improvement and automation. In the months following the bootcamp, I've created a small project that containerizes Selenium and runs Chrome Driver headlessly on my virtual private server with some cron jobs and would like to use it to automate this part of the process. That would be next on the list.
  • Pandas is an incredible library! My wife is a financial analyst and used to work in excel all day. Now she uses Python and Pandas every day because of this project. 🚀
  • I do love JavaScript, but Python just feels better. ¯\_(ツ)_/¯

Link to live demo -> Click on Demo User
you may have to give Heroku a minute!


PureSearch

PureSearch