sdk-ai

Microsoft · Berkeley

sdk-ai's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python168k 1.5k 2.8k44.4k
microsoft/autogen
A programming framework for agentic AI 🤖
Language:Python34.5k 400 1.9k5k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python27.2k 226 2643.1k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda24.5k 246 1392.8k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python10.1k 77 1.2k1.3k
lavague-ai/LaVague
Large Action Model framework to develop AI Web Agents
Language:Python5.5k 54 291503
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python4.5k 50 290472
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Language:Python4k 31 2.2k377
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.5k 61 3211
microsoft/sample-app-aoai-chatGPT
Sample code for a simple web chat experience through Azure OpenAI, including Azure OpenAI On Your Data.
Language:Python1.7k 37 3922.6k
rahulnyk/knowledge_graph
Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph based QnA
Language:Jupyter Notebook1.5k 24 17289
OpenLMLab/MOSS-RLHF
MOSS-RLHF
Language:Python1.3k 34 53101
graspologic-org/graspologic
Python package for graph statistics
Language:Python820 19 529144
monarch-initiative/ontogpt
LLM-based ontological extraction tools, including SPIRES
Language:Jupyter Notebook611 20 20777
facebookresearch/minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Language:Python480 13 4058
zjunlp/AutoKG
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities
Language:Python365 9 633
leobeeson/llm_benchmarks
A collection of benchmarks and datasets for evaluating LLM.
312 2 121
amazon-science/RefChecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
Language:Python303 10 1531
hkust-nlp/AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents
Language:SAS249 4 1426
qzed/irl-maxent
Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python
Language:Jupyter Notebook227 5 556
Toloka/crowd-kit
Control the quality of your labeled data with the Python tools you already know.
Language:Python213 13 2216
HumanSignal/RLHF
Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models
Language:Jupyter Notebook195 7 341
rll-research/BPref
Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning" contains scripts to reproduce experiments.
Language:Python114 0 928
neo4j/apoc
Language:Java99 41 22029
Alab-NII/2wikimultihop
Language:Python72 7 41
princeton-nlp/calm-textgame
[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games
Language:Python66 4 87
mikecvet/nl-sh
The Natural Language Shell integrates OpenAI's GPTs, Anthropic's Claude, or local GGUF-formatted LLMs directly into the terminal experience, allowing operators to describe their tasks in either POSIX commands or fluent human language
Language:Rust53 1 01
Stanford-ILIAD/APReL
A Library for Active Preference-based Reward Learning Algorithms
Language:Python47 5 111
microsoft/promptflow-rag-project-template
An end-to-end sample of RAG showcasing development, evaluation, experimentation, and deployment using Promptflow, search products like CosmosDB, PostgreSQL, and Azure AI Search
Language:Jupyter Notebook45 6 411
facebookresearch/rlfh-gen-div
This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
Language:Python38 3 35
