dxli94

Salesforce Research

dxli94's Stars

karpathy/LLM101n
LLM101n: Let's build a Storyteller
32.9k 3.1k 01.8k
linexjlin/GPTs
leaked prompts of GPTs
29.5k 321 274k
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
Language:Python23.3k 97 4141.4k
Byaidu/PDFMathTranslate
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/Docker/Zotero
Language:Python19.2k 75 6281.6k
neutraltone/awesome-stock-resources
:city_sunrise: A collection of links for free stock photography, video and Illustration websites
13.3k 291 86777
OpenTalker/SadTalker
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language:Python12.5k 154 8282.3k
xuebinqin/U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Language:Python8.9k 147 3451.5k
mosaicml/composer
Supercharge Your Model Training
Language:Python5.3k 50 555435
huggingface/safetensors
Simple, safe way to store and distribute tensors
Language:Python3.2k 41 206230
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python2.5k 31 164182
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Language:Python1.7k 47 104166
Yuanshi9815/OminiControl
A minimal and universal controller for FLUX.1.
Language:Python1.3k 18 9089
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
1.1k 25 278
rhymes-ai/Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Language:Python1.1k 26 4162
rhymes-ai/Aria
Codebase for Aria - an Open Multimodal Native MoE
Language:Jupyter Notebook1k 20 5286
haofanwang/ControlNet-for-Diffusers
Transfer the ControlNet with any basemodel in diffusers🔥
Language:Python822 15 4948
kakaobrain/karlo
Language:Python694 11 1242
devilismyfriend/StableTuner
Finetuning SD in style.
Language:Python676 17 7652
LAION-AI/dalle2-laion
Pretrained Dalle2 from laion
Language:Python501 23 3965
salesforce/PyRCA
PyRCA: A Python Machine Learning Library for Root Cause Analysis
Language:Python458 15 2747
Coobiw/MPP-LLaVA
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
Language:Jupyter Notebook427 6 3623
Q-Future/Q-Align
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.
Language:Python387 2 4424
longvideobench/LongVideoBench
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
Language:Python90 0 112
PathOnAI/LiteMultiAgent
The Library for LLM-based multi-agent applications
Language:Python76 12 5616
kjerk/instructblip-pipeline
A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.
Language:Python30 2 32
OpenNLPLab/FNAC_AVL
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning
Language:Python24 0 33
OpenNLPLab/Vicinity-Vision-Transformer
[TPAMI 2023] This is an official implementation for "Vicinity Vision Transformer".
Language:Python20 0 02
VideoAutoArena/VideoAutoBench
[CVPR 2025] Official Dataloader and Evaluation Scripts for VideoAutoBench.
Language:Python10 0 10
avalanchesiqi/pyquantifier
A Python package to estimate class prevalence in unlabeled datasets by specifying stability assumptions
Language:Jupyter Notebook1 3 01
yeoedward/vimrc
Language:Vim Script1 1 00

dxli94

dxli94's Stars

karpathy/LLM101n

linexjlin/GPTs

VikParuchuri/marker

Byaidu/PDFMathTranslate

neutraltone/awesome-stock-resources

OpenTalker/SadTalker

xuebinqin/U-2-Net

mosaicml/composer

huggingface/safetensors

baaivision/EVA

huggingface/nanotron

Yuanshi9815/OminiControl

XueFuzhao/awesome-mixture-of-experts

rhymes-ai/Allegro

rhymes-ai/Aria

haofanwang/ControlNet-for-Diffusers

kakaobrain/karlo

devilismyfriend/StableTuner

LAION-AI/dalle2-laion

salesforce/PyRCA

Coobiw/MPP-LLaVA

Q-Future/Q-Align

longvideobench/LongVideoBench

PathOnAI/LiteMultiAgent

kjerk/instructblip-pipeline

OpenNLPLab/FNAC_AVL

OpenNLPLab/Vicinity-Vision-Transformer

VideoAutoArena/VideoAutoBench

avalanchesiqi/pyquantifier

yeoedward/vimrc