SIGMIND's Stars
ggerganov/llama.cpp
LLM inference in C/C++
zylon-ai/private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
xai-org/grok-1
Grok open release
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
lllyasviel/ControlNet
Let us control diffusion models!
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
rhasspy/piper
A fast, local neural text to speech system
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
ZhaoJ9014/face.evoLVe
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
NVIDIA/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Djdefrag/QualityScaler
QualityScaler - image/video AI upscaler app
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
ggozad/oterm
a text-based terminal client for Ollama
magic-research/PLLaVA
Official repository for the paper PLLaVA
Vision-CAIR/MiniGPT4-video
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
pjueon/JetsonGPIO
A C++ library that enables the use of Jetson's GPIOs
tjtanaa/awesome-large-action-model
Awesome Large Action Model (LAM): Models that could help get things done.
Quedale/OnvifDeviceManager
Onvif Device Manager for Linux
Rubberazer/JETGPIO
C library to manage the GPIO header of the Nvidia Jetson boards
yonseivnl/vlm-rlaif
ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
NVIDIA-AI-IOT/mmj_genai
A reference example for integrating NanoOwl with Metropolis Microservices for Jetson
noahc1510/trt-llm-rag-linux
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Linux using TensorRT-LLM
Seeed-Projects/Multimodal-RAG-on-Jetson
This project implements RAG functionality on Jetson for video formats.
MubtasimFuad10/Okkhor-Diffusion
Okkhor-Diffusion: Bangla Handwritten Character Generation using DDPM
xhuvom/linux-copilot
A simple co-pilot for Linux to interpret human language queries into useful Linux terminal commands and execute them