liuzhuang1024's Stars
xai-org/grok-1
Grok open release
meta-llama/llama3
The official Meta Llama 3 GitHub site
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
openai/transformer-debugger
layerdiffusion/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
guoqincode/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
TencentARC/BrushNet
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
clovaai/cord
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
sanderwood/bgpt
Beyond Language Models: Byte Models are Digital World Simulators
Royalvice/DocDiff
ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.
csxmli2016/MARCONet
Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023]
johndpope/Emote-hack
using chatgpt (now Claude 3) to reverse engineer code from Emote white paper. (abandoned)
jinyeying/FogRemoval
[ACCV22] Structure Representation Network and Uncertainty Feedback Learning for Dense Non-Uniform Fog Removal, https://arxiv.org/abs/2210.03061
eduardzamfir/NTIRE23-RTSR
CVPR NTIRE 2023 Challenge on Real-Time Super-Resolution
OpenGVLab/ChartAst
ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.
retsuh-bqw/SRFormer-Text-Det
[AAAI'24] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression
mxin262/Bridging-Text-Spotting
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
HCIILAB/LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
XLearning-SCU/2024-TIP-CREAM
PyTorch implementation for Cross-modal Retrieval with Noisy Correspondence via Consistency Refining and Mining (TIP 2024)
pprp/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
BaseMax/MyAutoBuildActions
A python script to automaticly create a clone of your react native application and auto replace based on given regexs. - A Python script to get a list of all open issues in a repository with specific labels, and fetch their corresponding bodies and comments in chronological order (oldest to newest).