Pinned Repositories
binder-notebooks
Notebooks configured to be run with Binder, usually found on my blog.
embuddy
`embuddy` is a package that helps with using text embeddings for local data analysis.
remerge-mwe
REMERGE - Multi-Word Expression discovery algorithm
setfit
spacy-html-tokenizer
spacy-setfit-textcat
syntax-speaker-prediction
The tastiest machine learning project. Can we predict who is speaking for how long during an episode of the syntax.fm podcast?
text-feat-lib
Provide a comprehensive list of tokenizers, features, and general NLP things used for text analysis with examples. The initial focus is on features used for twitter data and sentiment analysis.
rota
Rapid Offense Text Autocoder
pmbaumgartner's Repositories
pmbaumgartner/binder-notebooks
Notebooks configured to be run with Binder, usually found on my blog.
pmbaumgartner/setfit
pmbaumgartner/prodigy-iaa
pmbaumgartner/fasttext-lite
pmbaumgartner/altair-saver-playwright
An easier to install version of `altair_saver`
pmbaumgartner/nav-labeled-data
pmbaumgartner/pdf-sketches
pmbaumgartner/aoc-2023
pmbaumgartner/blaseball-stuff
pmbaumgartner/catalogue
Super lightweight function registries for your library
pmbaumgartner/citi_bike_sample_data
pmbaumgartner/dct
pmbaumgartner/dialnarr-workshop
pmbaumgartner/fasttext-langdetect
80x faster and 95% accurate language identification with Fasttext
pmbaumgartner/gaming-causal-inference-paper
pmbaumgartner/gh-author-inspect
A CLI tool for identifying comments and discussions by an author on a repository.
pmbaumgartner/llama-cpp-python
Python bindings for llama.cpp
pmbaumgartner/observers
A Lightweight Library for AI Observability
pmbaumgartner/personal-hugo-site
personal site built using hugo
pmbaumgartner/ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
pmbaumgartner/raycast-extensions
Everything you need to extend Raycast.
pmbaumgartner/scad-stuff
pmbaumgartner/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
pmbaumgartner/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
pmbaumgartner/sumgram
sumgram is a tool that summarizes a collection of text documents by generating the most frequent sumgrams (conjoined ngrams)
pmbaumgartner/text-reflow-pyscript
Webapp to reflow docstrings to use a certain width and indentation
pmbaumgartner/timesheet-relabeler
pmbaumgartner/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
pmbaumgartner/tortoise-tts-docker
pmbaumgartner/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.