jqwang2373's Stars
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
meta-llama/llama3
The official Meta Llama 3 GitHub site
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
huggingface/trl
Train transformer language models with reinforcement learning.
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
luban-agi/Awesome-Domain-LLM
收集和梳理垂直领域的开源模型、数据集及评测基准。
km1994/LLMsNineStoryDemonTower
【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。
km1994/LLMs_interview_notes
该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题
ai-vip/stable-diffusion-tutorial
全网最全Stable Diffusion全套教程,从入门到进阶,耗时三个月制作
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
CASIA-IVA-Lab/AnomalyGPT
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
yxuansu/PandaGPT
[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All
SinclairCoder/Instruction-Tuning-Papers
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
QingruZhang/AdaLoRA
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
SupritYoung/RLHF-Label-Tool
用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.
reka-ai/reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
pengzhangzhi/Awesome-Mamba
Awesome list of papers that extend Mamba to various applications.
NVlabs/progprompt-vh
ProgPrompt for Virtualhome
yongchao98/AutoTAMP
Enhancing LLM/VLM capability for robot task and motion planning with extra algorithm based tools.
cagatayyildiz/npode
Learning unknown ODE models with Gaussian processes
yongchao98/multi-agent-framework
LLM multi-agent discussion framework for multi-agent/robot situations.
NL2Code/CodeS
Scientific-Computing-Lab-NRCN/MPI-rigen
MPI Code Generation through Domain-Specific Language Models
NoemieJaquier/sequencing-blending
This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization".
PKU-RL/AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
Scientific-Computing-Lab-NRCN/Tokompiler
Scope is all you need: Transforming LLMs for HPC Code