thunderpoot's Stars
the-turing-way/the-turing-way
Host repository for The Turing Way: a how to guide for reproducible data science
jt55401/gzinspector
A utility to do detailed analysis of gzip files.
joeraut/latex2image-web
LaTeX to image converter with web UI using Node.js / Docker
mlcommons/modelbench
Run safety benchmarks against AI models and view detailed reports showing how well they performed.
IBM/data-prep-kit
Open source project for data preparation of LLM application builders
commoncrawl/web-languages
Crowd-sourced lists of urls to help Common Crawl crawl under-resourced languages. See https://github.com/commoncrawl/web-languages-code/ for the code
continuedev/continue
⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
langtech-bsc/salamandra
martinthomson/aasvg
Turn ASCII art into SVG
andrefs/node-x-sampa-ipa
X-SAMPA to IPA and IPA to X-SAMPA converter
thunderpoot/webdelta
A JavaScript utility for web pages that creates dynamic, human-readable dates, times, and relative time descriptions from UNIX timestamps.
ietf-tools/datatracker
The day-to-day front-end to the IETF database for people who work on IETF standards.
mlcommons/dynabench
vaughantype/wumpus-mono
A modern and functional monospaced typeface with a focus on legibility.
madler/pigz
A parallel implementation of gzip for modern multi-processor, multi-core machines.
thunderpoot/audio-censor
Rudimentary program for speech transcription, manipulation, and redaction.
blekko/slashtag-data
Open Data from blekko's web curators
pjox/cc-downloader
A polite and user-friendly downloader for Common Crawl data
commoncrawl/cc-citations
Scientific articles using or citing Common Crawl data
thunderpoot/dotcam
DotCam is a simple web–app for realtime video manipulation using dithering with customisable dot patterns.
Data-Provenance-Initiative/Data-Provenance-Collection
webrecorder/pywb
Core Python Web Archiving Toolkit for replay and recording of web archives
thunderpoot/NovaTeleBASIC
TeleBASIC syntax highlighting for Nova
whitfin/runiq
An efficient way to filter duplicate lines from input, à la uniq.
larsenwork/postcss-easing-gradients
PostCSS plugin to create smooth linear-gradients that approximate easing functions.
aws/aws-cli
Universal Command Line Interface for Amazon Web Services
amrisi/amr-guidelines
thunderpoot/unconventional-commits
Unconventional Commits: Advancing the Art of Code Versioning: An in-depth exploration of commit messaging.
apple/pkl
A configuration as code language with rich validation and tooling.
andrew-d/emoji256
Base256 encoding with emoji