ichalkiad's Stars
karpathy/LLM101n
LLM101n: Let's build a Storyteller
karpathy/llm.c
LLM training in simple, raw C/CUDA
andrewzm/afgwardiary
Replication code for the AWD paper (PNAS)
ragnarlevi/GTST
royal-statistical-society/datavisguide
Introductory guide to the art and science of data visualisation. Insights, advice, and examples (with code) to make data outputs more readable, accessible, and impactful.
SpeechifyInc/Meta-voicebox
Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.
Jingjing-NLP/VOLT
Code for paper "Vocabulary Learning via Optimal Transport for Neural Machine Translation"
booknlp/booknlp
BookNLP, a natural language processing pipeline for books
BaptisteBlouin/EventExtractionPapers
A list of NLP resources focused on event extraction task
geometric-kernels/GeometricKernels
Geometric kernels on manifolds, meshes and graphs
stk-kriging/stk
The STK is a (not so) Small Toolbox for Kriging. Its primary focus is on the interpolation/regression technique known as kriging, which is very closely related to Splines and Radial Basis Functions, and can be interpreted as a non-parametric Bayesian method using a Gaussian Process (GP) prior.
leslie-huang/stylest
R package for estimating speaker style distinctiveness in texts. Install it from CRAN!
ott-jax/ott
Optimal transport tools implemented with the JAX framework, to get differentiable, parallel and jit-able computations.
BorgwardtLab/GraphKernels
A package for computing Graph Kernels
cainesap/syllabify
Automatically convert plain text into phonemes (US English pronunciation) and syllabify
public-apis/public-apis
A collective list of free APIs
gesiscss/awesome-computational-social-science
A list of awesome resources for Computational Social Science
h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/
GAMES-UChile/mogptk
Multi-Output Gaussian Process Toolkit
MartinHeinz/mastodon-local
Setup for local/playground instance for Mastodon
ragnarlevi/MMD_Graph_Diversification
jonathf/chaospy
Chaospy - Toolbox for performing uncertainty quantification.
ArthurSpirling/LargeLanguageArguments
Public repository for Palmer & Spirling work on LLM/human arguments
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
thibsej/unbalanced-ot-functionals
It is a repo which allows to compute all divergences derived from the theory of entropically regularized, unbalanced optimal transport. It relies on a pytorch backend.
jeanfeydy/global-divergences
MMD, Hausdorff and Sinkhorn divergences scaled up to 1,000,000 samples.
jeanfeydy/geomloss
Geometric loss functions between point clouds, images and volumes
JiajingZ/CopSens
JasonKessler/scattertext
Beautiful visualizations of how language differs among document types.
jim-schwoebel/voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).