alterego238's Stars
public-apis/public-apis
A collective list of free APIs
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
microsoft/autogen
A programming framework for agentic AI 🤖
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
openai/openai-python
The official Python library for the OpenAI API
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
tailscale/tailscale
The easiest, most secure way to use WireGuard and 2FA.
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
openai/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Tongji-KGLLM/RAG-Survey
wangshusen/SearchEngine
搜索引擎原理
ikechan8370/chatgpt-plugin
云崽系机器人的智能聊天插件
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
edbeeching/godot_rl_agents
An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
alex-petrenko/sample-factory
High throughput synchronous and asynchronous reinforcement learning
sjtug/SJTUBeamer
上海交通大学 Beamer 模版 | Beamer template for Shanghai Jiao Tong University
diambra/arena
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation
karthikv792/LLMs-Planning
An extensible benchmark for evaluating large language models on planning
aibasel/downward
The Fast Downward domain-independent classical planning system
RewardReports/reward-reports
Documentation for dynamic machine learning systems.
alterego238/IGE