imba-pericia

imba-pericia's Stars

mifi/lossless-cut
The swiss army knife of lossless video/audio editing
Language: TypeScript 26.8k 239 1.4k 1.3k
facefusion/facefusion
Industry leading face manipulation platform
Language: Python 18.6k 178 4382.8k
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Language: Python 11.3k 160 3011k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 9.7k 97 654955
GoogleCloudPlatform/generative-ai
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Language: Jupyter Notebook 7k 150 1981.9k
google-gemini/cookbook
Examples and guides for using the Gemini API
Language: Jupyter Notebook 4.9k 75 75706
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Language: Python 3.7k 56 149584
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Language: Python 3.2k 28 131277
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
Language: Python 2.9k 33 133258
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language: Python 2.5k 32 129197
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language: Python 2.4k 33 116255
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Language: Python 1.4k 43 57143
open-mmlab/PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combining with Dreambooth to achieve stunning videos.
Language: Python 888 22 4170
ammen99/wf-recorder
Language: C++ 863 12 16563
yuval-alaluf/Attend-and-Excite
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
Language: Jupyter Notebook 687 15 3662
Doriandarko/RepoToTextForLLMs
Automate the analysis of GitHub repositories for LLMs with RepoToTextForLLMs. Fetch READMEs, structure, and non-binary files efficiently. Outputs include analysis prompts to aid in comprehensive repo evaluation
Language: Python 652 9 686
G-U-N/AnimateLCM
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
Language: Python 581 29 3442
vtosters/lite
Modified VK client
Language: Smali 545 21 1.5k 30
TIGER-AI-Lab/AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"
Language: Jupyter Notebook 463 17 1034
ttchengab/zest_code
[ECCV-2024] This is the official implementation of ZeST.
Language: Jupyter Notebook 353 10 1022
pkoutoupis/rapiddisk
Advanced Linux RAM drive and caching kernel modules. Dynamically allocate RAM as block devices. Use them as standalone drives or map them as caching nodes to slower local disk drives. Access those volumes locally or export them across an NVMe Target network. Manage it all from a web API.
Language: C 296 23 10549
ali-vilab/Ranni
Language: Python 210 8 2315
JaKooLit/OpenSuse-Hyprland
Automated Hyprland install script for openSUSE Tumbleweed. All GPUs supported.
Language: Shell 151 3 1712
alphacep/vosk-tts
Text To Speech Synthesis with Vosk
Language: Python 119 14 2918
winniesi/tg-gemini-bot
Just a single click and you've got it set up on Vercel.
Language: Python 89 3 842
Haoming02/sd-webui-mosaic-outpaint
An Extension for Automatic1111 Webui that trivializes outpainting
Language: Python 81 3 81
dsavell/docker-grav
Docker Container for GRAV CMS
Language: Shell 40 5 2618
brick2face/seamless-tile-inpainting
An Automatic1111 extension for making seamless tiles using Stable Diffusion inpainting
Language: Python 31 2 01
kaifcoder/Invoice-Query-Tool-using-gemini-ai
This repository contains a Python project that leverages the Gemini Pro Vision API to extract invoice information from images. The primary goal of this project is to allow users to upload images of receipts and query specific details about the invoice. The project utilizes Conda for dependency management.
Language: Python 8 1 05
tillo13/kumori_cli_engine
The Kumori CLI engine automation tool leverages InstantID and HuggingFace/Diffusers to batch-generate personalized, identity-preserving stylized images using sophisticated facial analysis and pose estimation techniques, all through a Python command-line interface.
Language: Python 8 1 01