ttkrpink

ttkrpink's Stars

linexjlin/GPTs
leaked prompts of GPTs
29k 312 273.9k
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
Language:Python18.1k 102 3291.4k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python13k 137 7341.4k
stackblitz/bolt.new
Prompt, run, edit, and deploy full-stack web applications
Language:TypeScript10.8k 111 4.5k5.1k
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组
Language:Python8.7k 57 266853
Vaibhavs10/insanely-fast-whisper
Language:Jupyter Notebook7.9k 68 199552
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6.6k 73 1k800
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Language:Python6.2k 44 148397
lipku/LiveTalking
Real time interactive streaming digital human
Language:Python4.2k 53 311607
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Language:Jupyter Notebook3.9k 48 212350
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language:Python3.8k 30 437235
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python3.8k 41 160341
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Language:Python3.6k 44 91389
jianchang512/stt
Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具，输出json、srt字幕、纯文字格式
Language:Python2.7k 12 97299
naver/mast3r
Grounding Image Matching in 3D with MASt3R
Language:Python1.5k 33 85115
tomasonjo/blogs
Jupyter notebooks that support my graph data science blog posts at https://bratanic-tomaz.medium.com/
Language:Jupyter Notebook1.4k 44 45364
SamurAIGPT/AI-Youtube-Shorts-Generator
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
Language:Python1.4k 23 13182
harry0703/AudioNotes
快速提取音视频内容，整理成一份结构化的markdown笔记
Language:Python1.1k 10 35132
juanmc2005/diart
A python package to build AI-powered real-time audio applications
Language:Python1.1k 22 15290
Vision-CAIR/MiniGPT4-video
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
Language:Python571 12 4261
zju3dv/GVHMR
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
Language:Jupyter Notebook509 19 3931
supabase-community/babelfish.ai
A realtime live transcription and translation app built with Huggingface Transformer.js and Supabase Realtime.
Language:JavaScript429 9 241
MixedRealityToolkit/MixedRealityToolkit-Unity
This repository holds the third generation of the Mixed Reality Toolkit for Unity. The latest version of the MRTK can be found here.
Language:C#421 20 624114
abetusk/dev
dev log
Language:Roff272 14 0173
jim60105/docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
Language:Dockerfile194 6 1922
steinathan/reelsmaker
ReelsMaker is a Python-based/streamlit application designed to create captivating faceless videos for social media platforms like TikTok and YouTube.
Language:Python188 3 424
nianticlabs/doubletake
[ECCV 2024] DoubleTake: Geometry Guided Depth Estimation
Language:Python167 4 412
Relsoul/whisper-win-gui
基于whisper的实时语音识别网页和桌面客户端
Language:Python153 1 15
facebookresearch/efm3d
This is the official release for the paper "EFM3D A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https//arxiv.org/abs/2406.10224).
Language:Python118 29 28
KeKsBoTer/cinematic-gaussians
Code for our paper "Application of 3D Gaussian Splatting for Cinematic Anatomy on Consumer Class Devices"
Language:Python17 1 22