cactusdove

cactusdove's Stars

opennukit/Nukit-Open-Air-Purifier
Nukit Open Air Purifiers are Open Hardware devices for improving indoor air quality. They are designed to be used with North American standard HVAC filters and PC fans. They are often an improvement over commercial air purifiers as they are quieter per m3 CADR delivered, have a lower cost of ownership per year, and are easily repairable.
30413
Enraged-Rabbit-Community/ERCF_v2
Community designed ERCF v2
1.5k148
CTreffOS/air-filter
171
html5lib/html5lib-python
Standards-compliant library for parsing and serializing HTML documents and fragments in Python
Language:Python1.1k286
scrapy/parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Language:Python1.2k147
scrapefulldotcom/crunchbase-scraper
Scrape Crunchbase company data reliably without an account.
2
PierreMrt/yahoo_finance_scrap
Easily monitor companies you want by scraping their financial statements into excel
Language:Python2
AdamGetbags/secAPI
Get SEC Filing Data From The SEC API
Language:Python5319
Deewens/Company-Scraper
This software is a data scraping tool that can extract company employees data from LinkedIn and from the Crunchbase API, then store those in a MySQL Database.
Language:Python1
boringPpl/data-science-roadmap
Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups
630125
alephdata/followthemoney
Data model and processing tools for investigative entity data
Language:Python22254
alephdata/aleph
Search and browse documents and data; find the people and companies you look for.
Language:JavaScript2.1k278
computerise/stonks
Scrape stock market data and perform quantitative analysis to value publicly-traded companies.
Language:Python2
MontFerret/ferret
Declarative web scraping
Language:Go5.8k303
alirezamika/autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Language:Python6.6k682
BruceDone/awesome-crawler
A collection of awesome web crawler,spider in different languages
6.6k707
iawia002/lux
👾 Fast and simple video download library and CLI tool written in Go
Language:Go28.1k3k
cheeriojs/cheerio
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
Language:TypeScript28.9k1.6k
vinta/awesome-python
An opinionated list of awesome Python frameworks, libraries, software and resources.
Language:Python230k25.1k
lorien/awesome-web-scraping
List of libraries, tools and APIs for web scraping and data processing.
Language:Makefile6.8k794
perwu/MIAS
Codes for the manuscript: Prediction of biomarkers and therapeutic combinations for anti-PD-1 immunotherapy using the global gene network association
62
sec-edgar/sec-edgar
Download all companies periodic reports, filings and forms from EDGAR database.
Language:Python1.1k295
psf/requests-html
Pythonic HTML Parsing for Humans™
Language:Python13.8k981
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Language:TypeScript16.5k727
janlukasschroeder/sec-api
sec.gov EDGAR API | search & filter SEC filings | over 150 form types supported | 10-Q, 10-K, 8, 4, 13, S-11, ... | insider trading
Language:JavaScript23134
public-api-lists/public-api-lists
A collective list of free APIs for use in software and web development 🚀
10.8k1k
patrickmineault/ismyblue
Is my blue your blue?
Language:Jupyter Notebook16428
Insiyaa/Music-Tagging
Machine learning algorithms from scratch for genre classification
Language:Jupyter Notebook64
julianpoy/RecipeSage-selfhost
A collection of configuration files to host your own private instance of RecipeSage for personal use.
Language:Shell12929
plamere/SetListener
Creates Spotify playlist for your favorite artist's most recent show
Language:CSS4212