hmohebbi's Stars
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI ๐ https://microsoft.github.io/generative-ai-for-beginners/
huggingface/datasets
๐ค The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
ahkarami/Deep-Learning-in-Production
In this repository, I will share some useful notes and references about deploying deep learning-based models in production.
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
tomohideshibata/BERT-related-papers
BERT-related papers
marcotcr/checklist
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
konradhalas/dacite
Simple creation of data classes from dictionaries.
timoschick/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
moses-smt/mosesdecoder
Moses, the machine translation system
allenai/natural-instructions
Expanding natural instructions
clab/fast_align
Simple, fast unsupervised word aligner
acl-org/aclpubcheck
Tools for checking ACL paper submissions
understandable-machine-intelligence-lab/Quantus
Quantus is an eXplainable AI toolkit for responsible evaluation of neural network explanations
inseq-team/inseq
Interpretability for sequence generation models ๐ ๐
gabolsgabs/DALI
DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.
facebookresearch/covost
CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)
JetRunner/BERT-of-Theseus
โต๏ธThe official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
EleutherAI/concept-erasure
Erasing concepts from neural representations with provable guarantees
persiannlp/parsinlu
A comprehensive suite of high-level NLP tasks for Persian language
gorokoba560/norm-analysis-of-transformer
WangHelin1997/SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.
interpretingdl/eacl2024_transformer_interpretability_tutorial
Materials for EACL2024 tutorial: Transformer-specific Interpretability
mohsenfayyaz/GlobEnc
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
mohsenfayyaz/DecompX
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition
Achuttarsing/inflecteur
python inflector ๐ for French language : control gender, tense and number
gchrupala/neurospoken
Neural models of spoken language - LOT Winter school 2024
jumelet/fidam-eval
Code for the ACL Findings paper "Feature Interactions Reveal Linguistic Structure in Language Models"