LandyGuo's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
binary-husky/gpt_academic
Provides a practical interactive interface for GPT/GLM and other large language models, with special optimizations for reading, polishing, and writing academic papers. Modular design with support for custom shortcut buttons and function plugins; supports code analysis and self-interpretation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries across multiple LLMs; and local models such as ChatGLM3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and more.
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
HumanAIGC/AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
LargeWorldModel/LWM
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Breakthrough/PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
tencent-ailab/V-Express
V-Express aims to generate a talking-head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
Zz-ww/SadTalker-Video-Lip-Sync
This project implements Wav2Lip-style video lip-sync based on SadTalker. Audio drives lip-shape generation on an input video file, and a configurable face-region enhancement mode sharpens the synthesized lip (face) region. The DAIN deep-learning frame-interpolation algorithm adds intermediate frames to smooth the transitions between synthesized lip movements, making the results more fluent, realistic, and natural.
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
eric-ai-lab/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
soCzech/TransNetV2
TransNet V2: Shot Boundary Detection Neural Network
llava-rlhf/LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
azad-academy/denoising-diffusion-model
A simple guide to diffusion models. Helpful in understanding the concept and practicing with the method.
feizc/Visual-LLaMA
Open LLaMA Eyes to See the World
alipay/Ant-Multi-Modal-Framework
Research Code for Multimodal-Cognition Team in Ant Group
DmitryRyumin/NewEraAI-Papers
The repository provides links to collections of influential and interesting research papers from top AI conferences, each with open-source code to promote reproducibility and offer implementation insights beyond the scope of the articles. Stay up to date with the latest advances in AI research!
williechai/speedup-plugin-for-stable-diffusions
Cranial-XIX/FAMO
Official PyTorch Implementation for Fast Adaptive Multitask Optimization (FAMO)
allenai/unified-io-2.pytorch