XuexII's Stars
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
mem0ai/mem0
The Memory layer for your AI apps
google-research/text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
andrewyng/translation-agent
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
bigscience-workshop/promptsource
Toolkit for creating, sharing and using natural language prompts.
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
facebookresearch/DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
amazon-science/auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
gururise/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
reata/sqllineage
SQL Lineage Analysis Tool powered by Python
allenai/open-instruct
mlfoundations/dclm
DataComp for Language Models
google-research/deduplicate-text-datasets
facebookresearch/cc_net
Tools to download and cleanup Common Crawl data
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
conversationai/perspectiveapi
Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
arielnlee/Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
xfactlab/orpo
Official repository for ORPO
princeton-nlp/LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
itsnamgyu/reasoning-teacher
Official code for "Large Language Models Are Reasoning Teachers", ACL 2023
Aiden0526/SymbCoT
Codes and Data for ACL 2024 Paper "Faithful Logical Reasoning via Symbolic Chain-of-Thought".
microsoft/simulated-trial-and-error
gpt4life/alpagasus
Unofficial implementation of AlpaGasus
wangpf3/consistent-CoT-distillation
Glareone/GenAI-System-2-Attention-S2A-by-Meta
datasets from the paper "Towards Understanding Sycophancy in Language Models"