Beckschen's Stars
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
meta-llama/llama3
The official Meta Llama 3 GitHub site
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
LargeWorldModel/LWM
LLaVA-VL/LLaVA-NeXT
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
ActiveVisionLab/Awesome-LLM-3D
Awesome-LLM-3D: a curated list of resources on Multi-modal Large Language Models in the 3D world
CLAY-3D/OpenCLAY
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
mathiasuy/Soluciones-Klenberg
Algorithm Design (Kleinberg Tardos 2005) - Solutions
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLM-based multimodal generation (image, video, 3D and audio).
MrGiovanni/SyntheticTumors
[CVPR 2023] Label-Free Liver Tumor Segmentation
bytedance/fc-clip
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Beckschen/3D-TransUNet
This is the official repository for the paper "3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers"
kyegomez/awesome-multi-agent-papers
A compilation of the best multi-agent papers
Beckschen/ViTamin
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
bytedance/coconut_cvpr2024
MrGiovanni/DiffTumor
[CVPR 2024] Generalizable Tumor Synthesis - Realistic Synthetic Tumors in Liver, Pancreas, and Kidney
bytedance/OmniScient-Model
This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model
X-PLUG/mPLUG-HalOwl
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation
Beckschen/LLaVolta
Efficient Multi-modal Models via Stage-wise Visual Context Compression
MrGiovanni/Touchstone
[NeurIPS 2024] Touchstone - Benchmarking AI on 5,172 o.o.d. CT volumes and 9 anatomical structures
collinskatie/awesome-inverse-graphics
Curated list of papers and resources related to inverse graphics!
TACJu/Compositor
This repo contains the code for our paper Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation
YihongSun/MOD-UV
[ECCV24] MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos