serjtroshin's Stars
apple/ml-planner
justinlovelace/Diffusion-Guided-LM
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
bansky-cl/Diffusion_NLP_Papers
A list of diffusion papers in the NLP domain that I have read, focused mainly on text generation. The table will continue to be updated.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
cindyxinyiwang/multiview-subword-regularization
PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"
lucidrains/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
pytorch/tensordict
TensorDict is a PyTorch-dedicated tensor container.
metauto-ai/GPTSwarm
🐝 GPTSwarm: LLM agents as (Optimizable) Graphs
princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
eth-sri/language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
naver/disco
A Toolkit for Distributional Control of Generative Models
wellecks/naturalprover
NaturalProver: Grounded Mathematical Proof Generation with Language Models
unitaryai/detoxify
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ PyTorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.
Hironsan/HateSonar
Hate Speech Detection Library for Python.
alisawuffles/DExperts
Code associated with the ACL 2021 DExperts paper
salesforce/GeDi
GeDi: Generative Discriminator Guided Sequence Generation
timoschick/self-debiasing
This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".
conversationai/perspectiveapi
Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.
allenai/real-toxicity-prompts
launchnlp/BOLT
Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".
jiacheng-xu/vmf_vae_nlp
Code for EMNLP 2018 paper "Spherical Latent Spaces for Stable Variational Autoencoders"
alexa/Topical-Chat
A dataset containing human-human knowledge-grounded open-domain conversations.
AdityaGolatkar/SelectiveForgetting
vermashresth/awesome-emergent-languages
Paper Reading in Neural Emergent Communication Literature
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
facebookresearch/EGG
EGG: Emergence of lanGuage in Games
probcomp/LLaMPPL
A domain-specific probabilistic programming language for modeling and inference with language models
wouterkool/stochastic-beam-search
Implementation of Stochastic Beam Search using Fairseq
paul-rottger/exaggerated-safety
Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"