koaning
Solving problems involving data. Mostly NLP these days. AskMeAnything[tm].
@explosion
Amsterdam
Pinned Repositories
bulk
A Simple Bulk Labelling Tool
calm-notebooks
notebooks that are used at calmcode.io
cluestar
Gain clues from clustering!
clumper
A small python library that can clump lists of data together.
doubtlab
Doubt your data, find bad labels.
drawdata
Draw datasets from within Jupyter.
embetter
just a bunch of useful embeddings
human-learn
Natural Intelligence is still a pretty good idea.
scikit-lego
Extra blocks for scikit-learn pipelines.
whatlies
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!
koaning's Repositories
koaning/scikit-lego
Extra blocks for scikit-learn pipelines.
koaning/human-learn
Natural Intelligence is still a pretty good idea.
koaning/drawdata
Draw datasets from within Jupyter.
koaning/bulk
A Simple Bulk Labelling Tool
koaning/embetter
just a bunch of useful embeddings
koaning/cluestar
Gain clues from clustering!
koaning/memo
Decorators that log stats.
koaning/tokenwiser
Bag of, not words, but tricks!
koaning/arxiv-frontpage
My personal frontpage app
koaning/scikit-partial
Pipeline components that support partial_fit.
koaning/koaning
koaning/justcharts
Just charts. Really.
koaning/thismonth.rocks
motivational website to do something special this month
koaning/sentence-models
A different, but useful, textcat approach.
koaning/scikit-churn
Exploring some issues related to churn
koaning/kolektor
Let's give this git-scraping a try.
koaning/lazylines
Pipelines for JSONL files
koaning/playtime
Polarizingly fun tools for timeseries tasks.
koaning/salary-bias
just another dangerous situation
koaning/scikit-bloom
Bloom tricks for text pipelines in scikit-learn.
koaning/trogon
Easily turn your Click CLI into a powerful terminal application
koaning/benchy
Fun datasets for some light benchmarks.
koaning/ibis
the portable Python dataframe library
koaning/lancealot
Utils for LanceDB, because I like LanceDB, a lot.
koaning/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
koaning/spaCy
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
koaning/there-are-no-bad-labels
Repo for the PyData 2023 Workshop
koaning/toolong
A terminal application to view, tail, merge, and search log files (plus JSONL).
koaning/blog
Public repo for HF blog posts
koaning/gh-runner-demo
A demo with self-hosted runners