giorgiop's Stars
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
lllyasviel/Fooocus
Focus on prompting and generating
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
yoheinakajima/babyagi
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
phidatahq/phidata
Build AI Assistants with memory, knowledge and tools.
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
OpenBMB/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
IDEA-Research/GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Skyvern-AI/skyvern
Automate browser-based workflows with LLMs and Computer Vision
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
sensity-ai/dot
The Deepfake Offensive Toolkit
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
TorchSSL/TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
microsoft/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
yeungchenwa/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
NVIDIA/TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
erprogs/CViT
Deepfake Video Detection Using Convolutional Vision Transformer
davide-coccomini/MINTIME-Multi-Identity-size-iNvariant-TIMEsformer-for-Video-Deepfake-Detection
Code for Video Deepfake Detector from "MINTIME: Multi-Identity Size-Invariant Video Deepfake Detection", paper available on IEEE Transactions on Information Forensics and Security.
erprogs/GenConViT
Deepfake Video Detection Using Generative Convolutional Vision Transformer
QingyuLiu/Exposing-the-Deception
This repo is the official implementation of “Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection”. Accepted by AAAI-2024.
BSI-OFIQ/OFIQ-Project
Open Source Facial Image Quality
AmritaBh/ConDA-gen-text-detection
Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
microsoft/xpoc-framework
Cross-Platform Origin of Content framework
jonasricker/aeroblade
[CVPR2024] AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error
Kitware/image_attribution
janbutora/adobe-detector