seejay
TechGeek, Blogger, Podcaster and Software Developer who promotes GNU/Linux and FOSS 24/7/365. Scraping the web like a boss @Scrapinghub
Scrapinghub.com
Sri Lanka
Pinned Repositories
crm114
Windows ports and some blathering about crm114, the statistical classifier suite, a.k.a. the Regex Mutilator / spam filter.
drymail
An email auto-responder that uses crm114 training to pick a response template
feedIO
A Feed Aggregator that Knows What You Want to Read.
pipe2py
A project to compile Yahoo! Pipes into Python (see it hosted on Google App Engine: http://pipes-engine.appspot.com)
plexydesk
Qt based Mobile Desktop
readability-api
xgoogle
Python library to Google services (google search, google sets, google translate, sponsored links)
seejay's Repositories
seejay/feedIO
A Feed Aggregator that Knows What You Want to Read.
seejay/100days
100 days of algorithms
seejay/awesome-mental-health
A curated list of awesome articles, websites and resources about mental health in the software industry.
seejay/curlconverter
Convert curl commands to Python, JavaScript, PHP, R, Go, Rust, Dart, JSON, Ansible, Elixir
seejay/desktop
Simple collaboration from your desktop
seejay/DevToolboxWeb
seejay/fonts
Font files available from Google Fonts
seejay/Giveme5W1H
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
seejay/GPTs
Leaked prompts of GPTs
seejay/jsonlint.com
Source code for jsonlint.com
seejay/leon
🧠 Leon is your open-source personal assistant.
seejay/Memex
Browser Extension to full-text search your browsing history & bookmarks.
seejay/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
seejay/news-please
news-please - an integrated web crawler and information extractor for news that just works.
seejay/news-summarizer
News summarizer with GPT-3 – specifically for TechCrunch articles
seejay/NewsBlur
NewsBlur is a personal news reader that brings people together to talk about the world. A new sound of an old instrument.
seejay/newspaper
News, full-text, and article metadata extraction in Python 3.
seejay/nuklear
A single-header ANSI C gui library
seejay/ollama
Get up and running with Llama 2 and other large language models locally
seejay/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
seejay/polar-bookshelf
Polar is a personal knowledge repository for PDF and web content supporting incremental reading and document annotation.
seejay/privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
seejay/PyCRM114
A Python module for The CRM-114 Discriminator, which handles learning and classification of text streams.
seejay/pyminer
Python miner for bitcoin
seejay/pyNuklear
seejay/roomGPT
Upload a photo of your room to generate your dream room with AI.
seejay/seafaring
Code for "Active Learning from the Web" (WWW 2023)
seejay/Spider-Sense
A browser extension to monitor your spiders deployed on Scrapy Cloud.
seejay/stable-diffusion
A latent text-to-image diffusion model
seejay/twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.