Jerrisk

Computer vision, Recommendation System, NLP

Tongji UniversityHangzhou, China

Jerrisk's Stars

HFAiLab/hai-platform
一种任务级GPU算力分时调度的高性能深度学习训练平台
Language:Python35145
pytorch/torchtitan
A PyTorch native library for large model training
Language:Python2.9k233
modelcontextprotocol/servers
Model Context Protocol Servers
Language:JavaScript6.2k735
RooVetGit/Roo-Cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Language:TypeScript1.5k125
fferflo/einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
Language:Python3359
xxlong0/Wonder3D
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
Language:Python4.9k395
liuff19/ReconX
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
60620
wenqsun/DimensionX
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Language:Python1.1k66
rhymes-ai/Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Language:Python1k51
ml-explore/mlx-examples
Examples in the MLX framework
Language:Python6.5k922
BBuf/Image-processing-algorithm
paper implement
Language:C++915282
sayakpaul/diffusers-torchao
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
Language:Python2999
facebookresearch/blt
Code for BLT research paper
Language:Python1.2k86
magic-research/piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
Language:Jupyter Notebook47228
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python4.4k468
huggingface/hf_transfer
Language:Rust33827
facebookresearch/imu2clip
Code repository for IMU2CLIP(https//arxiv.org/pdf/2210.14395.pdf)
Language:Python868
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language:Python4.1k325
merveenoyan/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
Language:Jupyter Notebook1.1k97
kijai/ComfyUI-HunyuanVideoWrapper
Language:Python1.4k103
ostris/ai-toolkit
Various AI scripts. Mostly Stable Diffusion stuff.
Language:Python3.7k414
ali-vilab/In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
1.4k74
carla-simulator/scenario_runner
Traffic scenario definition and execution engine
Language:Python545368
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python18.1k1.4k
WisconsinAIVision/ViP-LLaVA
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Language:Python30621
FacePerceiver/facer
Face analysis tools for modern research, equipped with state-of-the-art Face Parsing and Face Alignment
Language:Python35138
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python13.3k1.1k
foivospar/Arc2Face
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces
Language:Python62346
Tencent/TFace
A trusty face analysis research platform developed by Tencent Youtu Lab
Language:Python1.4k229
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Language:Python1.3k110

Jerrisk

Jerrisk's Stars

HFAiLab/hai-platform

pytorch/torchtitan

modelcontextprotocol/servers

RooVetGit/Roo-Cline

fferflo/einx

xxlong0/Wonder3D

liuff19/ReconX

wenqsun/DimensionX

rhymes-ai/Allegro

ml-explore/mlx-examples

BBuf/Image-processing-algorithm

sayakpaul/diffusers-torchao

facebookresearch/blt

magic-research/piecewise-rectified-flow

open-compass/opencompass

huggingface/hf_transfer

facebookresearch/imu2clip

InternLM/xtuner

merveenoyan/smol-vision

kijai/ComfyUI-HunyuanVideoWrapper

ostris/ai-toolkit

ali-vilab/In-Context-LoRA

carla-simulator/scenario_runner

fishaudio/fish-speech

WisconsinAIVision/ViP-LLaVA

FacePerceiver/facer

SYSTRAN/faster-whisper

foivospar/Arc2Face

Tencent/TFace

mbzuai-oryx/Video-ChatGPT