kevinwck

Pinned Repositories

3D-LLM
Code for 3D-LLM: Injecting the 3D World into Large Language Models
Language: Python
ai-video
A web application that captures media streams from various sources such as a webcam, desktop, or specific applications. It captures frames at intervals and uses AI to analyze and summarize the frames, providing insights using GPT-4.
Language: JavaScript
AI-Vtuber
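AI Vtuber is a virtual streamer [Live2D] driven by [ChatterBot/ChatGPT/claude/langchain (local/llm)/chatglm/text-generation-webui/Wenda/ERNIE Bot/Tongyi Qianwen]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/Douyu] or chat directly on a local machine. It uses natural language processing and text-to-speech technology [edge-tts/VITS/elevenlabs/bark/VALL-E-X] to generate responses, with optional voice conversion through [so-vits-svc/DDSP-SVC]. It can draw images with Stable Diffusion in response to specific commands, and it can play back custom scripted text.
Language: JavaScript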
amblegpt
Video surveillance footage analyst powered by GPT-4o
Language: Python
Applio
Ultimate voice cloning tool, meticulously optimized for unrivaled power, modularity, and user-friendly experience.
Language: Python
ARLocation
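Displays the direction, distance, name, and other information of surrounding buildings over the image captured by the camera; as the camera rotates, the street-view information rotates with it.
Language: Objective-C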
Attendize
Attendize is an open-source ticket selling and event management platform built on Laravel.
Language: PHP
auto-gmail-responder
Automatically respond to Gmail/emails with OpenAI/ChatGPT and Python
Language: Python
Auto-PPT
Auto-generate PPTX files using GPT-3.5; free to use online: http://www.limaoyi.top:4399/#
Language: Python
Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Language: Jupyter Notebook

kevinwck's Repositories

kevinwck/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
kevinwck/ai-video
A web application that captures media streams from various sources such as a webcam, desktop, or specific applications. It captures frames at intervals and uses AI to analyze and summarize the frames, providing insights using GPT-4.
kevinwck/amblegpt
Video surveillance footage analyst powered by GPT-4o
kevinwck/easy-gpt4o
Easy-GPT4O open-source version
kevinwck/thepipe
Multimodal file/web extraction for GPT-4o in one line of code ⚡
kevinwck/Applio
Ultimate voice cloning tool, meticulously optimized for unrivaled power, modularity, and user-friendly experience.
kevinwck/TalkingHead
Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.
kevinwck/Streamline-Analyst
An AI agent powered by LLMs that streamlines the entire process of data analysis. 🚀
kevinwck/webcamGPT
webcamGPT - chat with video stream 💬 + 📸
kevinwck/GPTVoiceAssistant2024
Use microphone input to talk to ChatGPT
kevinwck/3D-LLM
Code for 3D-LLM: Injecting the 3D World into Large Language Models
kevinwck/llama2-faiss-langchain-qa-rag
kevinwck/text-to-motion
Official implementation for "Generating Diverse and Natural 3D Human Motions from Texts (CVPR2022)."
kevinwck/SQL-GPT
Use ChatGPT to generate SQL and execute it. Optimization and error correction of SQL are also possible.
kevinwck/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
kevinwck/WebcamGPT-Vision
Lightweight GPT-4 Vision processing over the Webcam
kevinwck/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
kevinwck/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
kevinwck/AI-Vtuber
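AI Vtuber is a virtual streamer [Live2D] driven by [ChatterBot/ChatGPT/claude/langchain (local/llm)/chatglm/text-generation-webui/Wenda/ERNIE Bot/Tongyi Qianwen]. It can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/Douyu] or chat directly on a local machine. It uses natural language processing and text-to-speech technology [edge-tts/VITS/elevenlabs/bark/VALL-E-X] to generate responses, with optional voice conversion through [so-vits-svc/DDSP-SVC]. It can draw images with Stable Diffusion in response to specific commands, and it can play back custom scripted text.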
kevinwck/faceswap
Deepfakes Software For All
kevinwck/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
kevinwck/OpenAI_Document_Analyzer
Demo application to show how to use Azure AI Document Intelligence and Azure OpenAI Service to increase the efficiency of document analysis
kevinwck/ebsynth_utility
AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth.
kevinwck/MathGLM
Official PyTorch Implementation for MathGLM
kevinwck/poe-api-wrapper
👾 A Python API wrapper for Poe.com, using Httpx. With this, you will have free access to ChatGPT, Claude, Llama, Google-PaLM and more! 🚀
kevinwck/XrayGLM
🩺 The first Chinese medical multimodal model that can read and summarize chest radiographs.
kevinwck/ShortGPT
🚀🎬 ShortGPT - Experimental AI framework for automated short/video content creation.
kevinwck/fastapi_poe
A helper library for writing Poe API bots using FastAPI
kevinwck/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
kevinwck/Auto-PPT
Auto-generate PPTX files using GPT-3.5; free to use online: http://www.limaoyi.top:4399/#