Ravisutha's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
facebookresearch/personal-timeline
A public release of TimelineBuilder for building personal digital data timelines.
google-research-datasets/tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the training and evaluation of automatic question answering systems. This repository provides evaluation code and a baseline system for the dataset.
RUCAIBox/HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
ServiceNow/picard
PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project that was started at Element AI.
AI-Yash/st-chat
Streamlit Component, for a Chatbot UI
fehiepsi/rethinking-numpyro
Statistical Rethinking (2nd ed.) with NumPyro
facebookresearch/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
enzoampil/fastquant
fastquant — Backtest and optimize your ML trading strategies with only 3 lines of code!
nmslib/nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
rentruewang/koila
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
booknlp/booknlp
BookNLP, a natural language processing pipeline for books
pariajm/awesome-disfluency-detection
A curated list of awesome disfluency detection publications along with the released code and bibliographical information
ranaroussi/quantstats
Portfolio analytics for quants, written in Python
Savvysherpa/slda
Cython implementations of Gibbs sampling for supervised LDA
JoeZJH/Labeled-LDA-Python
Implement of L-LDA Model(Labeled Latent Dirichlet Allocation Model) with python
MaartenGr/KeyBERT
Minimal keyword extraction with BERT
google/ehr-predictions
YangLinyi/FinNLP-Progress
NLP progress in Fintech. A repository to track the progress in Natural Language Processing (NLP) related to the domain of Finance, including the datasets, papers, and current state-of-the-art results for the most popular tasks.
HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
OATML/non-parametric-transformers
Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"
RicardoEPRodrigues/magicmouse-hid
Magic Mouse 2 driver for Linux.
iipc/awesome-web-archiving
An Awesome List for getting started with web archiving
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
gpakosz/.tmux
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
StatProofBook/StatProofBook.github.io
The Book of Statistical Proofs