Hzfinfdu's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
meta-llama/llama
Inference code for Llama models
nndl/nndl.github.io
"Neural Networks and Deep Learning" (《神经网络与深度学习》) by Xipeng Qiu (邱锡鹏)
plotly/plotly.py
The interactive graphing library for Python ✨ This project now includes Plotly Express!
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
XiangLi1999/Diffusion-LM
Diffusion-LM
txsun1997/MOSS
MOSS is a conversational language model like ChatGPT.
txsun1997/LMaaS-Papers
Awesome papers on Language-Model-as-a-Service (LMaaS)
OpenLMLab/GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
Hzfinfdu/Diffusion-BERT
ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
IndexFziQ/Diffusion4NLP-Papers
A paper list about diffusion models for natural language processing.
anthropics/PySvelte
A library for bridging Python and HTML/JavaScript (via Svelte) for creating interactive visualizations
HoagyC/sparse_coding
Using sparse coding to find distributed representations used by neural networks.
likenneth/othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
OpenMOSS/HalluQA
Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"
anthropics/toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
FranxYao/Distributional-Generalization-in-Natural-Language-Processing
Distributional Generalization in NLP. A roadmap.
saprmarks/geometry-of-truth
fastnlp/ElasticBERT
A pre-trained model with multi-exit transformer architecture.
Spico197/awesome-lm-evaluation
🩺 A collection of ChatGPT evaluation reports on various benchmarks.
OpenMOSS/Language-Model-SAEs
For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
callummcdougall/sae_visualizer
AI21Labs/pmi-masking
This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper
Phylliida/MambaLens
Mamba support for transformer lens
choosewhatulike/case2code
ningyuxu/tip_of_tongue
Code for the paper "On the Tip of the Tongue: Analyzing Conceptual Representation in Large Language Models with Reverse-Dictionary Probe"
RobertHuben/othellogpt_sparse_autoencoders