CorentinB
26 y-o developer, digital archivist and open source enthusiast. Feeding AIs at @mistralai 🧠Ex-@internetarchive
@mistralaiPérigord Noir, France
Pinned Repositories
archivetube
💾 Little script based on youtube-dl for archiving YouTube content
DeepSort
🧠AI powered image tagger backed by DeepDetect
go-warcprox
WARC writing MITM HTTP/S proxy in Go
gobbox
:white_square_button: Pure Go bounding boxes generation with labeling
radio_archiving
Tools for (web) radio stations archiving.
sokoban
Sokoban game in C, using ncurses
YouTube-IG
💾 Light and fast YouTube video IDs grabber.
YouTube-MA
💾 YouTube video metadata archiver written in Golang
gowarc
Read and write WARC files in Go
Zeno
State-of-the-art web crawler 🔱
CorentinB's Repositories
CorentinB/YouTube-MA
💾 YouTube video metadata archiver written in Golang
CorentinB/YouTube-IG
💾 Light and fast YouTube video IDs grabber.
CorentinB/go-warcprox
WARC writing MITM HTTP/S proxy in Go
CorentinB/big-list-of-naughty-strings
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
CorentinB/tiktokdl
CorentinB/VimeoCrawler
Crawler for vimeo.com
CorentinB/Architeuthis
MITM HTTP(S) proxy with integrated load-balancing, rate-limiting and error handling. Built for automated web scraping.
CorentinB/aria2
aria2 is a lightweight multi-protocol & multi-source, cross platform download utility operated in command-line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.
CorentinB/brook
Brook is a cross-platform(Linux/MacOS/Windows/Android/iOS) proxy/vpn software
CorentinB/brozzler
brozzler - distributed browser-based web crawler
CorentinB/calibre-comicvine
Comicvine calibre metadata source for comic-books and Graphic Novels
CorentinB/certmagic
Automatic HTTPS for any Go program: fully-managed TLS certificate issuance and renewal
CorentinB/comics-downloader
command-line tool to download comics and manga in pdf/epub/cbr/cbz from a website
CorentinB/comixed
The ComiXed Digital Comic Management System
CorentinB/dacite
Hash-based image image storage and upload.
CorentinB/deepdetect
Deep Learning API and Server in C++11 with Python bindings and support for Caffe, Tensorflow, XGBoost and TSNE
CorentinB/ExportTools.bundle
Export tools for Plex
CorentinB/freebox-stats
Get real time statistics from your Freebox
CorentinB/GAPdecoder
Google Art Project decoder
CorentinB/go-chromecast
cli for Google Chromecast, Home devices and Cast Groups
CorentinB/Instagram-API-python
Unofficial instagram API, give you access to ALL instagram features (like, follow, upload photo and video and etc)! Write on python.
CorentinB/Kadoc
Find directories with resembling names
CorentinB/mitm
CorentinB/netdata-influx
Netdata to Influx exporter + Grafana dashboard template
CorentinB/puppeteer-cluster
Run puppeteer in parallel with a pool of instances
CorentinB/scrapy-warcio
Support for writing WARC files with Scrapy
CorentinB/screenshot
Go library to capture desktop to image
CorentinB/udemy-dl
A cross-platform python based utility to download courses from udemy for personal offline use.
CorentinB/warc-1
Golang WARC (Web ARChive) Library
CorentinB/ytdl
YouTube download library and CLI written in Go