gm0616's Stars
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
niconi19/LLM-Conversation-Safety
[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
modelscope/modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
meta-llama/llama
Inference code for Llama models
JetRunner/BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch
bytedance/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
hiyoung123/SoftMaskedBert
Soft-Masked Bert 复现论文:https://arxiv.org/pdf/2005.07421.pdf
lixin4ever/Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
jd-aig/JAVE
google-research-datasets/MAVE
The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. It is a large, multi-sourced, diverse dataset for product attribute extraction study.
Alibaba-NLP/KB-NER
Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.
huggingface/evaluate
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
lucidrains/flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
NVlabs/FAN
Official PyTorch implementation of Fully Attentional Networks
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
salesforce/ALPRO
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
synercys/annotated_latex_equations
Examples of how to create colorful, annotated equations in Latex using Tikz.
facebookresearch/metaseq
Repo for external large-scale work