markpbaggett's Stars
pocketbase/pocketbase
Open Source realtime backend in 1 file
unclecode/crawl4ai
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
menloresearch/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
quickwit-oss/quickwit
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
peak/s5cmd
Parallel S3 and local filesystem execution tool.
ianarawjo/ChainForge
An open-source visual programming environment for battle-testing prompts to LLMs.
wabarc/wayback
An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services including Internet Archive, archive.today, Ghostarchive, IPFS, Telegraph, and file systems.
matuzo/HTMHell
A collection of bad practices in HTML found on real websites.
grantmcconnaughey/Flake8Rules
Descriptions and examples for the rules in Flake8 (pyflakes, pycodestyle, and mccabe).
MicheleCotrufo/pdf2doi
A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.
MicheleCotrufo/pdf2bib
A python library/command-line tool to quickly and automatically generate BibTeX data starting from the pdf file of a scientific publication.
edwardanderson/krml
Create knowledge graphs with Markdown
rsimon/immarkus
An image annotation environment for the MARKUS platform
project-lux/data-pipeline
Data pipeline to harvest, transform, reconcile, enrich and export Linked Art data for LUX (or other system)
quackscience/duckdb-googlesheets-engine
DuckDB Engine as Google Sheets Library
iipc/warcaroo
nulib/dc-api-v2
API providing access to the rich collections of the Northwestern University Libraries
gvwilson/rsdx
Research Software Design by Example
jptmoore/awesome-iiif-annotations
A curated list of web annotations
dlebansais/PgSearch-Disclosed
A tool to search through public data in the Project: Gorgon MMORPG
nationalarchives/annosearch
W3C web annotation search using the IIIF content search API
nulib/iiif-signed-uri-auth
A Specification for Signed URI Authorization for IIIF Image Resources
nulib/nuldc
A small set of python helpers consuming the dcapi. Also has a set of CLIs for common tasks.
maps-as-data/maptext_data_model
nulib-ds/AnAmericansAfrica
thisismattmiller/hathi-pd-2025
Hathi Trust PD Book processes for 2025
iiif-test/iiif-test.github.io
GitHubPages site
jameswsullivan/NewspaperBatchAssemblyTool
A newspaper batch assembly tool for digitized newspapers.
uclibs/digitization-workflow
Documentation of digitization workflow at the University of Cincinnati Libraries