Jerrisk's Stars
HFAiLab/hai-platform
A high-performance deep learning training platform with task-level time-sharing scheduling of GPU compute
pytorch/torchtitan
A PyTorch native library for large model training
modelcontextprotocol/servers
Model Context Protocol Servers
RooVetGit/Roo-Cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
fferflo/einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
xxlong0/Wonder3D
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
liuff19/ReconX
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
wenqsun/DimensionX
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
rhymes-ai/Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
ml-explore/mlx-examples
Examples in the MLX framework
BBuf/Image-processing-algorithm
Paper implementations
sayakpaul/diffusers-torchao
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
facebookresearch/blt
Code for BLT research paper
magic-research/piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
huggingface/hf_transfer
facebookresearch/imu2clip
Code repository for IMU2CLIP (https://arxiv.org/pdf/2210.14395.pdf)
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
merveenoyan/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
kijai/ComfyUI-HunyuanVideoWrapper
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
ali-vilab/In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
carla-simulator/scenario_runner
Traffic scenario definition and execution engine
fishaudio/fish-speech
SOTA Open Source TTS
WisconsinAIVision/ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
FacePerceiver/facer
Face analysis tools for modern research, equipped with state-of-the-art Face Parsing and Face Alignment
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
foivospar/Arc2Face
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces
Tencent/TFace
A trusty face analysis research platform developed by Tencent Youtu Lab
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.