ytzhangscr's Stars
aigc-apps/sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
Pythagora-io/gpt-pilot
The first real AI developer
roboflow/supervision
We write your reusable computer vision tools. 💜
apple/ml-ferret
microsoft/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
arc53/DocsGPT
GPT-powered chat for documentation, chat with your documents
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
ToolJet/ToolJet
Low-code platform for building business applications. Connect to databases, cloud storages, GraphQL, API endpoints, Airtable, Google sheets, OpenAI, etc and build apps using drag and drop application builder. Built using JavaScript/TypeScript. 🚀
microsoft/autogen
A programming framework for agentic AI 🤖
OpenNLG/OpenBA
jun-long-li/TCOVIS
TCOVIS: Temporally Consistent Online Video Instance Segmentation (ICCV 2023)
Ziyang412/UCoFiA
Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
mengting2023/LG-Track
Localization-Guided Track: A Deep Association Multi-Object Tracking Framework Based on Localization Confidence of Detections
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
WuyangLuo/RefFaceInpainting
phil329/SDFlow
XPixelGroup/HAT
CVPR2023 - Activating More Pixels in Image Super-Resolution Transformer Arxiv - HAT: Hybrid Attention Transformer for Image Restoration
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
gnobitab/InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
lucidrains/voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
THUDM/RelayDiffusion
The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
lucidrains/spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
neulab/prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
spcl/graph-of-thoughts
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
Echo0125/MAT-Memory-and-Anticipation-Transformer
[ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding
apple/ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023