yuangpeng's Stars
pengbo807/ConditionVideo
Training-Free Condition-Guided Text-to-Video Generation
WayneMao/PillarNeSt
The Official Implementation of PillarNeSt
Ucas-HaoranWei/Vary-toy
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
open-mmlab/PowerPaint
[ECCV 2024] PowerPaint, a high-quality, versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting, and shape-guided object inpainting with only a single model.
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
THUDM/CogVLM
A state-of-the-art open visual language model | Multimodal pretrained model
THUDM/CogView2
Official code repo for the paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"
LC044/WeChatMsg
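Extract WeChat chat history and export it to HTML, Word, or Excel documents for permanent storage; analyze the chat history to generate an annual chat report; use the chat data to train a personal AI chat assistant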
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
ytongbai/LVM
Ahnsun/merlin
[ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
linexjlin/GPTs
leaked prompts of GPTs
openai/consistencydecoder
Consistency Distilled Diff VAE
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
langchain-ai/opengpts
ffhibnese/Model-Inversion-Attack-ToolBox
A comprehensive toolbox for model inversion attacks and defenses that is easy to get started with.
yang-song/yang-song.github.io
Personal website
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app with one click.
Megvii-BaseDetection/YOLOX
YOLOX is a high-performance anchor-free YOLO that exceeds YOLOv3 through YOLOv5, with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
persimmon-ai-labs/adept-inference
Inference code for Persimmon-8B
ffhibnese/GIFD_Gradient_Inversion_Attack
[ICCV-2023] Gradient inversion attack, Federated learning, Generative adversarial network.
alshedivat/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
RobustNLP/CipherChat
A framework to evaluate the generalization capability of safety alignment for LLMs
openmedlab/PULSE
PULSE: Pretrained and Unified Language Service Engine
lllyasviel/Fooocus
Focus on prompting and generating
megvii-research/megfile
Megvii FILE Library - Working with files in Python the same way as the standard library
CiaraStrawberry/TemporalKit
An all-in-one solution for adding temporal stability to a Stable Diffusion render via an automatic1111 extension