SeuTao's Stars
apple/ml-aim
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
NVIDIA/NeMo-Curator
Scalable data pre processing and curation toolkit for LLMs
Kohulan/DECIMER-Image_Transformer
DECIMER: Deep Learning for Chemical Image Recognition using Efficient-Net V2 + Transformer
lbnlp/NERRE
Code and data for the publication "Structured information extraction from scientific text with large language models" by Dagdelen & Dunn et al.
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
patrickbryant1/AFProfile
Improved protein complex prediction with AlphaFold-multimer by denoising the MSA profile
openai/transformer-debugger
deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
netease-youdao/QAnything
Question and Answer based on Anything.
baker-laboratory/rf_diffusion_all_atom
Public RFDiffusionAA repo
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
OpenMOSS/AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
csuhan/OneLLM
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
invictus717/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
lujiarui/Str2Str
Codebase of the paper "Str2Str: A Score-based Framework for Zero-shot Protein Conformation Sampling" (ICLR 2024)
baker-laboratory/RoseTTAFold-All-Atom
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
dina-lab3D/CombFold
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
bytedance/MVDream
Multi-view Diffusion for 3D Generation
lichao-sun/SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
evo-design/evo
Biological foundation modeling from molecular to genome scale
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
apple/ml-ferret