sdk-ai's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
microsoft/autogen
A programming framework for agentic AI 🤖
meta-llama/llama3
The official Meta Llama 3 GitHub site
karpathy/llm.c
LLM training in simple, raw C/CUDA
huggingface/trl
Train transformer language models with reinforcement learning.
lavague-ai/LaVague
Large Action Model framework to develop AI Web Agents
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
microsoft/sample-app-aoai-chatGPT
Sample code for a simple web chat experience through Azure OpenAI, including Azure OpenAI On Your Data.
rahulnyk/knowledge_graph
Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph based QnA
OpenLMLab/MOSS-RLHF
MOSS-RLHF
graspologic-org/graspologic
Python package for graph statistics
monarch-initiative/ontogpt
LLM-based ontological extraction tools, including SPIRES
facebookresearch/minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
zjunlp/AutoKG
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities
leobeeson/llm_benchmarks
A collection of benchmarks and datasets for evaluating LLM.
amazon-science/RefChecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
hkust-nlp/AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents
qzed/irl-maxent
Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python
Toloka/crowd-kit
Control the quality of your labeled data with the Python tools you already know.
HumanSignal/RLHF
Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models
rll-research/BPref
Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.
neo4j/apoc
Alab-NII/2wikimultihop
princeton-nlp/calm-textgame
[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games
mikecvet/nl-sh
The Natural Language Shell integrates OpenAI's GPTs, Anthropic's Claude, or local GGUF-formatted LLMs directly into the terminal experience, allowing operators to describe their tasks in either POSIX commands or fluent human language
Stanford-ILIAD/APReL
A Library for Active Preference-based Reward Learning Algorithms
microsoft/promptflow-rag-project-template
An end-to-end sample of RAG showcasing development, evaluation, experimentation, and deployment using Promptflow, search products like CosmosDB, PostgresSQL, and Azure AI Search
facebookresearch/rlfh-gen-div
This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity