HYLcool's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
karpathy/llm.c
LLM training in simple, raw C/CUDA
KindXiaoming/pykan
Kolmogorov-Arnold Networks
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
trekhleb/state-of-the-art-shitcode
💩State-of-the-art shitcode principles your project should follow to call it a proper shitcode
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ 🍸 🍹 🍷
LLaVA-VL/LLaVA-NeXT
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Unity-Technologies/com.unity.perception
Perception toolkit for sim2real training and validation in Unity
Vchitect/VBench
[CVPR 2024 Highlight] VBench - We Evaluate Video Generation
snap-research/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
mira-space/MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
vis-nlp/ChartQA
evalcrafter/EvalCrafter
[CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
modelscope/lite-sora
An initiative to replicate Sora
OpenGVLab/MMT-Bench
[ICML 2024] MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
sayakpaul/single-video-curation-svd
Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.
locuslab/scaling_laws_data_filtering
zwq2018/Multi-modal-Self-instruct
The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model
wenhanwu95/FreqMixFormer
[ACM MM 2024] Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer