jqwang2373

University of Wisconsin-Madison

jqwang2373's Stars

gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language:Python32.5k 170 4.8k2.4k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python26.5k 219 2473k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python19.6k 158 1.5k2.2k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.1k 99 533850
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
11k 236 36702
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python10.4k 68 105669
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python9.6k 74 1.1k1.2k
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing continued pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
Language:Python3.2k 38 392492
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Language:Jupyter Notebook2.8k 25 38254
luban-agi/Awesome-Domain-LLM
Collects and organizes open-source models, datasets, and evaluation benchmarks for vertical domains.
2.2k 35 3165
km1994/LLMsNineStoryDemonTower
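[LLMs Nine-Story Demon Tower] Shares hands-on practice and experience with LLMs in natural language processing (ChatGLM, Chinese-LLaMA-Alpaca, Vicuna, LLaMA, GPT4ALL, etc.), information retrieval (langchain), speech synthesis, speech recognition, and multimodal domains (Stable Diffusion, MiniGPT-4, VisualGLM-6B, Ziya-Visual, etc.).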
1.7k 18 0168
km1994/LLMs_interview_notes
This repository mainly collects interview questions for large language model (LLM) algorithm engineers.
1.4k 10 199
ai-vip/stable-diffusion-tutorial
The most complete Stable Diffusion tutorial series on the web, from beginner to advanced, three months in the making.
1.3k 11 2124
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
Language:Python861 8 1946
CASIA-IVA-Lab/AnomalyGPT
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
Language:Python775 7 10895
yxuansu/PandaGPT
[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All
Language:Python754 11 2760
SinclairCoder/Instruction-Tuning-Papers
Reading list on instruction tuning, a trend that started with Natural-Instruction (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
749 16 324
QingruZhang/AdaLoRA
AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).
Language:Python259 1 2628
SupritYoung/RLHF-Label-Tool
A tool for manually annotating and ranking response data in the RLHF stage of large language model training.
Language:Python241 5 218
reka-ai/reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
Language:Python145 14 36
pengzhangzhi/Awesome-Mamba
Awesome list of papers that extend Mamba to various applications.
125 7 412
NVlabs/progprompt-vh
ProgPrompt for Virtualhome
Language:Python109 4 613
yongchao98/AutoTAMP
Enhancing LLM/VLM capability for robot task and motion planning with extra algorithm based tools.
Language:Jupyter Notebook42 1 31
cagatayyildiz/npode
Learning unknown ODE models with Gaussian processes
Language:Matlab26 5 34
yongchao98/multi-agent-framework
LLM multi-agent discussion framework for multi-agent/robot situations.
Language:Python16 1 02
NL2Code/CodeS
Language:Python14 1 00
Scientific-Computing-Lab-NRCN/MPI-rigen
MPI Code Generation through Domain-Specific Language Models
Language:Python12 0 02
NoemieJaquier/sequencing-blending
This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization".
Language:Python11 1 00
PKU-RL/AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
Language:Python90
Scientific-Computing-Lab-NRCN/Tokompiler
Scope is all you need: Transforming LLMs for HPC Code
Language:Python93
