lxy2017's Stars
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
1c7/chinese-independent-developer
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻**独立开发者项目列表 -- 分享大家都在做什么
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
NanmiCoder/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
cooderl/wewe-rss
🤗更优雅的微信公众号订阅方式,支持私有化部署、微信公众号RSS生成(基于微信读书)v2.x
Stability-AI/StableCascade
Official Code for Stable Cascade
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
VAST-AI-Research/TripoSR
lllyasviel/sd-forge-layerdiffuse
[WIP] Layer Diffusion for WebUI (via Forge)
Codium-ai/AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
IDEA-Research/T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
aixcoder-plugin/aiXcoder-7B
official repository of aiXcoder-7B Code Large Language Model
ZhengPeng7/BiRefNet
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
d8ahazard/sd_dreambooth_extension
microsoft/LLaVA-Med
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
rohitgandikota/sliders
Concept Sliders for Precise Control of Diffusion Models
bclswl0827/ChatGemini
✨ ChatGemini 是一个基于 Google Gemini 的网页客户端,对标 ChatGPT 3.5,操作逻辑同 ChatGPT 3.5 一致,同时支持在聊天中上传图片,应用会自动调用 Gemini-Pro-Vision 模型进行识图。
kongzhecn/OMG
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
maturk/dn-splatter
DN-Splatter + AGS-Mesh: Depth and Normal Priors for Gaussian Splatting
kujason/ip_basic
Image Processing for Basic Depth Completion
lichao-sun/SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
czh-98/REALY
REALY: Rethinking the Evaluation of 3D Face Reconstruction (ECCV 2022)
Wuuu3511/GoMVS
[CVPR'24]🦿GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo
resemin/WeightedDeepSupervision