ctx-eva's Stars
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
state-spaces/mamba
Mamba SSM architecture
civitai/civitai
A repository of models, textual inversions, and more
kf-liu/The-Art-of-Linear-Algebra-zh-CN
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone", 线性代数的艺术中文版, 欢迎PR.
openvinotoolkit/anomalib
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
chaofengc/IQA-PyTorch
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
botuniverse/onebot
OneBot:统一的聊天机器人应用接口标准
bytedance/piano_transcription
chaofengc/Awesome-Image-Quality-Assessment
A comprehensive collection of IQA papers
bytedance/MVDream
Multi-view Diffusion for 3D Generation
bytedance/ImageDream
The code releasing for https://image-dream.github.io/
breezedeus/CnSTD
CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
Q-Future/Q-Align
③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.
yangjianxin1/LongQLoRA
LongQLoRA: Extent Context Length of LLMs Efficiently
bytedance/FreeSeg
bytedance/SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting
YounggjuuChoi/Deep-Video-Super-Resolution
The state-of-the-art VSR
ViTAE-Transformer/ViTAE-Transformer-Scene-Text-Detection
A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research works related to scene text detection, spotting, etc., including papers, codes.
bytedance/AvatarVerse
code repo for the paper "AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose" (AAAI2024)
bytedance/DecompDiff
The official implementation of DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design (ICML 2023)
bytedance/ColTrack
This repository is an official implementation of Collaborative Tracking Learning for Frame-Rate-Insensitive Multi-Object Tracking
bytedance/libdmet_with_QC
libDMET with quantum computing chemistry solvers of UCCSD
CH4ACKO3/Chack-RL
Chack's RL repository