Ravisutha

Senior Data Scientist at Fidelity Investments

Boston

Ravisutha's Stars

donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Language:Python283k47.3k
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Language:Python19.7k1.4k
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python1.7k302
facebookresearch/personal-timeline
A public release of TimelineBuilder for building personal digital data timelines.
Language:Jupyter Notebook34727
google-research-datasets/tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and without the use of translation, and is designed for the training and evaluation of automatic question answering systems. This repository provides evaluation code and a baseline system for the dataset.
Language:Python29643
RUCAIBox/HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
Language:Python42127
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.7k2.3k
ServiceNow/picard
PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project that was started at Element AI.
Language:Haskell347121
AI-Yash/st-chat
Streamlit Component, for a Chatbot UI
Language:JavaScript969264
fehiepsi/rethinking-numpyro
Statistical Rethinking (2nd ed.) with NumPyro
Language:Jupyter Notebook45074
facebookresearch/ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
Language:Python10.5k2.1k
enzoampil/fastquant
fastquant — Backtest and optimize your ML trading strategies with only 3 lines of code!
Language:Jupyter Notebook1.5k240
nmslib/nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
Language:C++3.4k453
rentruewang/koila
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
Language:Python1.8k63
booknlp/booknlp
BookNLP, a natural language processing pipeline for books
Language:Python810100
pariajm/awesome-disfluency-detection
A curated list of awesome disfluency detection publications along with the released code and bibliographical information
715
ranaroussi/quantstats
Portfolio analytics for quants, written in Python
Language:Python5.1k883
Savvysherpa/slda
Cython implementations of Gibbs sampling for supervised LDA
Language:Python6111
JoeZJH/Labeled-LDA-Python
Implement of L-LDA Model(Labeled Latent Dirichlet Allocation Model) with python
Language:Python12131
MaartenGr/KeyBERT
Minimal keyword extraction with BERT
Language:Python3.6k358
google/ehr-predictions
Language:Python9717
YangLinyi/FinNLP-Progress
NLP progress in Fintech. A repository to track the progress in Natural Language Processing (NLP) related to the domain of Finance, including the datasets, papers, and current state-of-the-art results for the most popular tasks.
39457
HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
Language:Python42958
OATML/non-parametric-transformers
Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"
Language:Python40540
RicardoEPRodrigues/magicmouse-hid
Magic Mouse 2 driver for Linux.
Language:C15017
iipc/awesome-web-archiving
An Awesome List for getting started with web archiving
2.1k157
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python137k27.4k
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Language:Python15.7k3.5k
gpakosz/.tmux
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
Language:Shell22.3k3.4k
StatProofBook/StatProofBook.github.io
The Book of Statistical Proofs
Language:HTML32463

Ravisutha

Ravisutha's Stars

donnemartin/system-design-primer

unslothai/unsloth

facebookresearch/denoiser

facebookresearch/personal-timeline

google-research-datasets/tydiqa

RUCAIBox/HaluEval

google-research/tuning_playbook

ServiceNow/picard

AI-Yash/st-chat

fehiepsi/rethinking-numpyro

facebookresearch/ParlAI

enzoampil/fastquant

nmslib/nmslib

rentruewang/koila

booknlp/booknlp

pariajm/awesome-disfluency-detection

ranaroussi/quantstats

Savvysherpa/slda

JoeZJH/Labeled-LDA-Python

MaartenGr/KeyBERT

google/ehr-predictions

YangLinyi/FinNLP-Progress

HHousen/TransformerSum

OATML/non-parametric-transformers

RicardoEPRodrigues/magicmouse-hid

iipc/awesome-web-archiving

huggingface/transformers

tensorflow/tensor2tensor

gpakosz/.tmux

StatProofBook/StatProofBook.github.io