cash's Stars
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
benfred/py-spy
Sampling profiler for Python programs
doccano/doccano
Open source annotation tool for machine learning practitioners.
prompt-toolkit/python-prompt-toolkit
Library for building powerful interactive command line applications in Python
HypothesisWorks/hypothesis
Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
stephenmcd/mezzanine
CMS framework for Django
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
fchollet/ARC-AGI
The Abstraction and Reasoning Corpus
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
pydata/xarray
N-D labeled arrays and datasets in Python
libredirect/browser_extension
A browser extension that redirects popular sites to alternative privacy friendly frontends
rougier/matplotlib-cheatsheet
Matplotlib 3.1 cheat sheet.
nlplab/brat
brat rapid annotation tool (brat) - for all your textual annotation needs
omnilib/aiomultiprocess
Take a modern Python codebase to the next level of performance.
linkedin/shiv
shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included.
kislyuk/argcomplete
Python and tab completion, better together.
facebookresearch/cc_net
Tools to download and cleanup Common Crawl data
DataBiosphere/toil
A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.
inception-project/inception
INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.
jacksonllee/pycantonese
Cantonese Linguistics and NLP
Parquery/icontract
Design-by-contract in Python3 with informative violation messages and inheritance
sigma-py/accupy
:dart: Accurate sums and dot products for Python.
hchasestevens/show_ast
An IPython notebook plugin for visualizing ASTs.
capreolus-ir/capreolus
A toolkit for end-to-end neural ad hoc retrieval
ryanzhumich/awesome-clir
A curated list of resources for Cross-lingual Information Retrieval (CLIR).
lyutyuh/gazetteer-NER-acl19
Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers
x-way/cal-heatmap
Cal-Heatmap is a javascript module to create calendar heatmap to visualize time series data