MengHao666

Working in Video Generation in Alibaba now. Previous Computer Graphics Engineer on 3D digital human in Tenent. Master in BeiHang University.

AlibabaHangzhou, China

MengHao666's Stars

krahets/hello-algo
《Hello 算法》：动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新，English version ongoing
Language:Java107k 566 24213.4k
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python63.3k 441 4.3k6.8k
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Language:MDX52.4k 577 2075.1k
marktext/marktext
📝A simple and elegant markdown editor, available for Linux, macOS and Windows.
Language:JavaScript48.2k 424 2.8k3.6k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python36.2k 348 2.9k4.2k
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Language:Python24.5k 420 2964.4k
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Language:Python22.8k 631 2705.6k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
13.5k 262 130857
karpathy/nn-zero-to-hero
Neural Networks: Zero to Hero
Language:Jupyter Notebook12.7k 301 341.7k
NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Language:Python8.5k 100 1.2k1.4k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.8k 57 730523
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
6.2k 94 11340
meta-llama/llama-stack
Composable building blocks to build Llama Apps
Language:Python6k 139 221739
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook5k 33 202659
wdndev/llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML4.7k 23 8539
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python4.2k 30 484252
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
Language:Python4k 20 22385
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.6k 61 4222
LLaVA-VL/LLaVA-NeXT
Language:Python3.3k 37 348288
NVlabs/VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Language:Python2.7k 39 154215
microsoft/Phi-3CookBook
This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks
Language:Jupyter Notebook2.7k 17 88310
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
Language:Python1.7k 12 272237
amazon-science/auto-cot
Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
Language:Jupyter Notebook1.7k 17 7151
mbzuai-oryx/LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
Language:Python822 10 3462
facebookresearch/AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
Language:Python453 30 3521
facebookresearch/MovieGenBench
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
362 31 122
showlab/Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
320 16 013
WisconsinAIVision/YoLLaVA
🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant
Language:Python77 1 86
athn-nik/motionfix
MotionFix: Text-Driven 3D Human Motion Editing [SIGGRAPH ASIA 2024]
Language:Python493
nvlm-project/nvlm-project.github.io
Language:HTML1 2 00

MengHao666

MengHao666's Stars

krahets/hello-algo

comfyanonymous/ComfyUI

dair-ai/Prompt-Engineering-Guide

marktext/marktext

microsoft/DeepSpeed

d2l-ai/d2l-en

openai/gpt-2

BradyFU/Awesome-Multimodal-Large-Language-Models

karpathy/nn-zero-to-hero

NVIDIA/apex

OpenGVLab/InternVL

hijkzzz/Awesome-LLM-Strawberry

meta-llama/llama-stack

salesforce/BLIP

wdndev/llm_interview_note

QwenLM/Qwen2-VL

hojonathanho/diffusion

opendilab/awesome-RLHF

LLaVA-VL/LLaVA-NeXT

NVlabs/VILA

microsoft/Phi-3CookBook

open-compass/VLMEvalKit

amazon-science/auto-cot

mbzuai-oryx/LLaVA-pp

facebookresearch/AudioDec

facebookresearch/MovieGenBench

showlab/Awesome-Unified-Multimodal-Models

WisconsinAIVision/YoLLaVA

athn-nik/motionfix

nvlm-project/nvlm-project.github.io