koaning
Solving problems involving data. Mostly NLP these days. AskMeAnything[tm].
@explosionAmsterdam
Pinned Repositories
bulk
A Simple Bulk Labelling Tool
calm-notebooks
notebooks that are used at calmcode.io
cluestar
Gain clues from clustering!
clumper
A small python library that can clump lists of data together.
doubtlab
Doubt your data, find bad labels.
drawdata
Draw datasets from within Jupyter.
embetter
just a bunch of useful embeddings
human-learn
Natural Intelligence is still a pretty good idea.
scikit-lego
Extra blocks for scikit-learn pipelines.
whatlies
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!
koaning's Repositories
koaning/whatlies
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!
koaning/simsity
Super Simple Similarities Service
koaning/mktestdocs
Run pytest against markdown files/docstrings.
koaning/memo
Decorators that logs stats.
koaning/tuilwindcss
Very much like Tailwind, but for TUI frameworks in Textual.
koaning/tokenwiser
Bag of, not words, but tricks!
koaning/justcharts
Just charts. Really.
koaning/scikit-prune
Prune your sklearn models
koaning/prodigy-tui
A textual TUI for Prodigy
koaning/sentence-models
A different, but useful, textcat approach.
koaning/kolektor
Let's give this git-scraping a try.
koaning/boondoc
lightweight Python API docs for markdown
koaning/calm-stats
Some GitScrapers
koaning/featherbed
Very lightweight text vectors via tf/idf + SVD
koaning/gli
my gleeful scripts for the cli
koaning/manyterms
Many terms for whatever purposes (weak labelling)
koaning/scikit-prodigy
Helpers to leverage scikit-learn pipelines in Prodigy.
koaning/skooba
less weak supervision
koaning/spaCy
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
koaning/srsly
🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
koaning/there-are-no-bad-labels
Repo for the PyData 2023 Workshop
koaning/awesome-normconf
List of resources coming out of Normconf Slack
koaning/blog
Public repo for HF blog posts
koaning/bunchmarks
Data for a bunch of benchmarks.
koaning/json-schema-demo
json schemas as a demo
koaning/projects
🪐 End-to-end NLP workflows from prototype to production
koaning/radicli
🕊️ Radically lightweight command-line interfaces
koaning/typerchecks
koaning/weasel
🦦 weasel: A small and easy workflow system
koaning/weave
Weave, developed by the team at Weights and Biases, is a new open-source toolkit designed for performant, interactive data exploration. Our mission is to equip Machine Learning practitioners with the best tools to turn data into insights quickly and easily.