Vince-Lau's Stars
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
AgibotTech/agibot_x1_hardware
The hardware design for AgiBot X1.
answerlink/IntelliQ
Advanced Multi-Turn QA System with LLM and Intent Recognition. 基于LLM大语言模型意图识别、参数抽取结合slot词槽技术实现多轮问答、NL2API. 打造Function Call多轮问答最佳实践
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
ray-project/lightgbm_ray
LightGBM on Ray
paris0120/CS7638_Robotics-AI-Techniques
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
uswds/uswds
The U.S. Web Design System helps the federal government build fast, accessible, mobile-friendly websites.
labring/FastGPT
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
Tele-AI/Telechat
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
HumanCompatibleAI/overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
absadiki/tensorboard_plugin_customizable_plots
A plugin for TensorBoard to customize and export Scalar plots
pramodc08/LunarLanderV2-DQN
This project uses Deep Reinforcement Learning to solve the Lunar Lander environment of the OpenAI-Gym
foyoux/aligo
🔥简单、易用、可扩展的阿里云盘 API 接口库🚀
zcswdt/merge_text_boxs
MrShininnnnn/Temporal-Differences-Learning
"Learning to Predict by the Methods of Temporal Differences" by Sutton, Richard S. (1988)
NeuronDance/DeepRL
Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone
qiwihui/reinforcement-learning-an-introduction-chinese
《Reinforcement Learning: An Introduction》(第二版)中文翻译
SeedV/generative-ai-roadmap
The roadmap of generative AI: use cases and applications | 生成式AI的应用路线图
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
DaertML/context_distillation
Framework to achieve context distillation in LLMs
xunfeng1980/Qwen-GPTQ-vLLM-Docker
opendilab/LLMRiddles
Open-Source Reproduction/Demo of the LLM Riddles Game
LooperXX/CS224n-Reading-Notes
CS224n Reading Notes in Chinese 中文阅读笔记
Classmate-Huang/CV_ImageClassification
Code for ImageClassification based on Pytorch
Halfish/cs231n
斯坦福 cs231n 作业代码实践
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
zergtant/pytorch-handbook
pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行