alterego238

alterego238's Stars

public-apis/public-apis
A collective list of free APIs
Language: Python 319k 4.2k 63733.9k
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
Language: Python 37k 243 5.5k 5.3k
microsoft/autogen
A programming framework for agentic AI 🤖
Language: Python 34.9k 407 2k 5k
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language: Python 34.9k 1.1k 1.8k 8.6k
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language: Jupyter Notebook 33.6k 364 1074.1k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python 30.6k 425 4.2k 6.4k
openai/openai-python
The official Python library for the OpenAI API
Language: Python 23.2k 308 8103.3k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 20.4k 159 1.5k 2.3k
tailscale/tailscale
The easiest, most secure way to use WireGuard and 2FA.
Language: Go 19.5k 162 7.4k 1.5k
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Language: Python 14.3k 76 3.6k 1.7k
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python 9.2k 65 1.5k 1.7k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance
Language: Python 6.1k 52 625478
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python 5.8k 38 185649
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
Language: Python 5.4k 34 569441
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
Language: Python 4.9k 49 297426
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
Language: MDX 3.9k 83 305603
openai/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
Language: Python 2.4k 130 36348
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language: Python 2.4k 176 84787
Tongji-KGLLM/RAG-Survey
1.8k 32 18123
wangshusen/SearchEngine
Principles of search engines
1.5k 20 7124
ikechan8370/chatgpt-plugin
Intelligent chat plugin for Yunzai-family bots
Language: JavaScript 1k 7 551106
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
Language: Python 994 146 21144
edbeeching/godot_rl_agents
An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
Language: Python 974 23 10470
alex-petrenko/sample-factory
High throughput synchronous and asynchronous reinforcement learning
Language: Python 831 17 102112
sjtug/SJTUBeamer
Beamer template for Shanghai Jiao Tong University
Language: TeX 592 8 3965
diambra/arena
DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation
Language: Python 317 9 4222
karthikv792/LLMs-Planning
An extensible benchmark for evaluating large language models on planning
Language: PDDL 294 6 1532
aibasel/downward
The Fast Downward domain-independent classical planning system
Language: C++ 270 11 0 145
RewardReports/reward-reports
Documentation for dynamic machine learning systems.
Language: EJS 27 4 6 6
alterego238/IGE
Language: Python 2 1 0 0