cactusdove's Stars
opennukit/Nukit-Open-Air-Purifier
Nukit Open Air Purifiers are Open Hardware devices for improving indoor air quality. They are designed to be used with North American standard HVAC filters and PC fans. They are often an improvement over commercial air purifiers as they are quieter per m3 CADR delivered, have a lower cost of ownership per year, and are easily repairable.
Enraged-Rabbit-Community/ERCF_v2
Community designed ERCF v2
CTreffOS/air-filter
html5lib/html5lib-python
Standards-compliant library for parsing and serializing HTML documents and fragments in Python
scrapy/parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
scrapefulldotcom/crunchbase-scraper
Scrape Crunchbase company data reliably without an account.
PierreMrt/yahoo_finance_scrap
Easily monitor companies you want by scraping their financial statements into excel
AdamGetbags/secAPI
Get SEC Filing Data From The SEC API
Deewens/Company-Scraper
This software is a data scraping tool that can extract company employees data from LinkedIn and from the Crunchbase API, then store those in a MySQL Database.
boringPpl/data-science-roadmap
Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups
alephdata/followthemoney
Data model and processing tools for investigative entity data
alephdata/aleph
Search and browse documents and data; find the people and companies you look for.
computerise/stonks
Scrape stock market data and perform quantitative analysis to value publicly-traded companies.
MontFerret/ferret
Declarative web scraping
alirezamika/autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
BruceDone/awesome-crawler
A collection of awesome web crawler,spider in different languages
iawia002/lux
👾 Fast and simple video download library and CLI tool written in Go
cheeriojs/cheerio
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
vinta/awesome-python
An opinionated list of awesome Python frameworks, libraries, software and resources.
lorien/awesome-web-scraping
List of libraries, tools and APIs for web scraping and data processing.
perwu/MIAS
Codes for the manuscript: Prediction of biomarkers and therapeutic combinations for anti-PD-1 immunotherapy using the global gene network association
sec-edgar/sec-edgar
Download all companies periodic reports, filings and forms from EDGAR database.
psf/requests-html
Pythonic HTML Parsing for Humans™
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
janlukasschroeder/sec-api
sec.gov EDGAR API | search & filter SEC filings | over 150 form types supported | 10-Q, 10-K, 8, 4, 13, S-11, ... | insider trading
public-api-lists/public-api-lists
A collective list of free APIs for use in software and web development 🚀
patrickmineault/ismyblue
Is my blue your blue?
Insiyaa/Music-Tagging
Machine learning algorithms from scratch for genre classification
julianpoy/RecipeSage-selfhost
A collection of configuration files to host your own private instance of RecipeSage for personal use.
plamere/SetListener
Creates Spotify playlist for your favorite artist's most recent show