yuanrr
Ph.D. student focusing on image and video understanding, e.g., visual question answering and video question answering.
yuanrr's Stars
f/awesome-chatgpt-prompts
A curated collection of ChatGPT prompts for getting better results from ChatGPT.
chatanywhere/GPT_API_free
Free ChatGPT API keys: a free ChatGPT API with GPT-4 support (free tier), and a free forwarding API usable from within mainland China with a direct connection and no proxy required. Works with software/plugins such as ChatBox, greatly reducing API usage costs and enabling unrestricted chat from within China.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model resources.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in under 1000 lines of Python.
Lightning-AI/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
lyogavin/Anima
A 33B Chinese LLM with DPO and QLoRA fine-tuning, 100K context length, and AirLLM 70B inference on a single 4 GB GPU.
PKU-YuanGroup/Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
frgfm/torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥 The latest papers, code, and datasets on video LLMs (Vid-LLMs).
CircleRadon/Osprey
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
dvlab-research/LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
OpenGVLab/OmniQuant
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
csuhan/OneLLM
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
Yangyi-Chen/Multimodal-AND-Large-Language-Models
A paper list on multimodal and large language models, maintained as a personal record of papers read in the daily arXiv feed.
zhenyingfang/Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
llava-rlhf/LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
taesiri/ArXivQA
Work in progress: automated question answering for arXiv papers with large language models (https://arxiv.taesiri.xyz/).
mlpc-ucsd/BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
nbasyl/LLM-FP4
The official implementation of the EMNLP 2023 paper "LLM-FP4".
OPPOMKLab/u-LLaVA
u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model
AGI-Edgerunners/LLM-Continual-Learning-Papers
Must-read Papers on Large Language Model (LLM) Continual Learning
j-min/HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
minghangz/cpl
CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning
jayleicn/VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
RenShuhuai-Andy/TESTA
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
yl3800/TranSTR