1danjordan's Stars
meilisearch/meilisearch
A lightning-fast search engine API bringing AI-powered hybrid search to your sites and applications.
roboflow/supervision
We write your reusable computer vision tools. 💜
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with a standardized output format
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
interpretml/interpret
Fit interpretable models. Explain blackbox machine learning.
evidentlyai/evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
Zipstack/unstract
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
opencv/opencv-python
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
quarto-dev/quarto-cli
Open-source scientific and technical publishing system built on Pandoc.
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
mljar/mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
Filimoa/open-parse
Improved file parsing for LLMs
MAIF/shapash
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
erikbern/git-of-theseus
Analyze how a Git repo grows over time
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
dbt-labs/dbt-utils
Utility functions for dbt projects.
react-querybuilder/react-querybuilder
The Query Builder component for React
explosion/spacy-transformers
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
facebookexperimental/Robyn
Robyn is an experimental, AI/ML-powered and open sourced Marketing Mix Modeling (MMM) package from Meta Marketing Science. Our mission is to democratise modeling knowledge, inspire the industry through innovation, reduce human bias in the modeling process & build a strong open source marketing science community.
data-8/textbook
The textbook Computational and Inferential Thinking: The Foundations of Data Science
guillermo-navas-palencia/optbinning
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
poloclub/unitable
UniTable: Towards a Unified Table Foundation Model
memgraph/orb
Graph visualization library
1rgs/clarity-reader
Layered, depth-first reading—start with summaries, tap to explore details, and gain clarity on complex topics.
furkanbiten/idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
petermckeeverPerform/themepy
An open source theme selector for matplotlib
rfdearborn/dbt-docs-to-notion
A GitHub Action for exporting dbt docs to a Notion database
davhbrown/interactive_classification_metrics
Get an intuitive sense for the ROC curve and other binary classification metrics with interactive visualization.