clownrat6's Stars
roboflow/supervision
We write your reusable computer vision tools. 💜
KwaiVGI/LivePortrait
Bring portraits to life!
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
mlfoundations/open_clip
An open source implementation of CLIP.
wdndev/llm_interview_note
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
rom1504/img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
LTH14/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
haoliuhl/ringattention
Large Context Attention
nomic-ai/contrastors
Train Models Contrastively in Pytorch
THUDM/Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
luyug/GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
RLHF-V/RLAIF-V
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
NJU-PCALab/OpenVid-1M
SkyworkAI/MoH
MoH: Multi-Head Attention as Mixture-of-Head Attention
wyhuai/SkillMimic
Official code release for the paper "SkillMimic: Learning Reusable Basketball Skills from Demonstrations"
SkyworkAI/MoE-plus-plus
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts
DAMO-NLP-SG/SeaLLMs
[ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia
wdndev/mllm_interview_note
Mainly records multimodal knowledge relevant to large language model (LLM) algorithm (application) engineers
wwxu21/CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
SihengLi99/LLM-Honesty-Survey
A Survey on the Honesty of Large Language Models
SaFoLab-WISC/AdaShield
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."
DAMO-NLP-SG/CMM
✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
wwxu21/AMR-SG
DAMO-NLP-SG/MT-LLaMA
Multi-Task instruction-tuned LLaMA
wwxu21/CGR
code for "Exploiting Reasoning Chains for Multi-hop Science Question Answering"
lyuwenyu/PP-InsCapTagger
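Instance Capability Tagger (InsCapTagger) is a multimodal data capability tagging model. It can be used for image-text data analysis and processing (e.g., data filtering schemes based on information density, data mixing schemes based on model capabilities). 🔥 🔥 🔥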
wwxu21/ConReader
wwxu21/CER-MT