willeppy's Stars
aryn-ai/sycamore
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
michelle123lam/lloom
Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.
pixegami/rag-tutorial-v2
An Improved Langchain RAG Tutorial (v2) with local LLMs, database updates, and testing.
cmudig/Texture
Visualize your text data with structured attributes
microsoft/autogen
A programming framework for agentic AI 🤖
PAIR-code/auto-histograms
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
uwdata/mosaic
An extensible framework for linking databases and interactive views.
apple/ml-translate-vis
Angler: Machine Translation Visualization (CHI 2023)
neulab/prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
yeounoh/slicefinder
automatic data slicing
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
whylabs/whylogs
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
jupyterlab/jupyterlab-desktop
JupyterLab desktop application, based on Electron.
mito-ds/mito
The mitosheet package, trymito.io, and other public Mito code.
rikky0611/teach-PUI-2023S
manzt/anywidget
jupyter widgets made easy
rikky0611/teach-PUI-2023S-example
yzhao062/pyod
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
interactive-structures/pui-materials
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
poloclub/nova
Simple method to create notebook-ready visual analytics tools!
cmudig/AutoProfiler
Automatically profile dataframes in the Jupyter sidebar
deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
neubig/anlp-code
zeno-ml/zeno
AI Data Management & Evaluation Platform
LineaLabs/lineapy
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
lightdash/lightdash
Self-serve BI to 10x your data team ⚡️
dolthub/dolt
Dolt – Git for Data
jupyterlab/jupyterlab-data-explorer
First class datasets in JupyterLab