sori424's Stars
state-spaces/mamba
Mamba SSM architecture
mistralai/mistral-inference
Official inference library for Mistral models
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
QData/TextAttack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
nyu-mll/jiant
jiant is an nlp toolkit
allenai/comet-atomic-2020
hendrycks/ethics
Aligning AI With Shared Human Values (ICLR 2021)
skywalker023/sodaverse
🥤🧑🏻🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"
ericwtodd/function_vectors
Function Vectors in Large Language Models (ICLR 2024)
evandez/REMEDI
Inspecting and Editing Knowledge Representations in Language Models
medianeuroscience/emfdscore
Fast, flexible extraction of moral information from textual input data.
Mars-tin/awesome-theory-of-mind
Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
gao-g/metaphor-in-context
Code for the paper "Neural Metaphor Detection in Context".
oaraque/moral-foundations
Additional material for the paper "MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations Prediction"
google-research-datasets/GSM-IC
Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant sentences in problem descriptions. GSM-IC is constructed to evaluate the distractibility of language models.
belindal/state-probes
Code for the paper "Implicit Representations of Meaning in Neural Language Models"
skywalker023/fantom
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
allenai/scruples
A corpus and code for understanding norms and subjectivity. 🤖
michaelnny/InstructLLaMA
Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructGPT or ChatGPT, but on a much smaller scale.
john-hewitt/control-tasks
Repository describing example random control tasks for designing and interpreting neural probes
aalok-sathe/surprisal
A unified interface for computing surprisal (log probabilities) from language models! Supports neural, symbolic, and black-box API models.
Walter0807/RepBelief
[ICML 2024] Language Models Represent Beliefs of Self and Others
EhsanAghazadeh/Metaphors_in_PLMs
Probing and Generalization of Metaphorical Knowledge in Pre-Trained Language Modelss[ACL 2022]
nyu-mll/jiant-v1-legacy
The jiant toolkit for general-purpose text understanding models
ninodimontalcino/moralchoice
Evaluating the Moral Beliefs Encoded in LLMs
abdulhaim/moral_foundations_llms
joshnguyen99/moral_dilemma_topics
Topic modeling on 100,000 r/AmItheAsshole threads.
kuribayashi4/llm-cognitive-modeling