desaetiis's Stars
openai/openai-cookbook
Examples and guides for using the OpenAI API
CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
karpathy/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
PrefectHQ/marvin
✨ Build AI interfaces that spark joy
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
Arize-ai/phoenix
AI Observability & Evaluation
dedupeio/dedupe
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
naiveHobo/InvoiceNet
Deep neural network to extract intelligent information from invoice documents.
ajndkr/lanarky
The web framework for building LLM microservices
noxone/regex-generator
Generate regular expressions from sample texts.
allanj/pytorch_neural_crf
Pytorch implementation of LSTM/BERT-CRF for named entity recognition
e-johnstonn/SalesCopilot
Intelligent sales assistant built using Deep Lake, Whisper, LangChain, and GPT 3.5/4
elisemercury/AutoClean
Python package for automated data preprocessing & cleaning.
wjbmattingly/ocr_python_textbook
ljvmiranda921/prodigy-pdf-custom-recipe
Custom recipe and utilities for document processing
INK-USC/TriggerNER
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
o19s/hello-ltr
Set of Jupyter notebooks demonstrating Learning to Rank integrated with Solr and Elasticsearch
jiggy-ai/hnsqlite
hnsqlite integrates hnswlib and sqlite for simple text embedding search
alephdata/fingerprints
Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.
wjbmattingly/freecodecamp_spacy
wjbmattingly/topic_modeling_textbook
wjbmattingly/ner_youtube
PatrickKalkman/python-docuvortex
A repository that contains all the examples that go with a Medium article called
wjbmattingly/holocaust_ner_lessons
maikelpenz/dataflow-automation-infra
Repository to maintain infrastructure to automate Data Workflows
wjbmattingly/spacy_tutorials_3x
Lyonk71/pandas-usaddress
The usaddress library made easy with Pandas
dancromartie/doc-similarity-lite
low configuration document similarity with sqlite