JosenJin

Pinned Repositories

3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python0 0 00
3DDFA_V2
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
Language:Python0 0 00
Advanced-Video
Language:C++0 0 00
AI-text-to-video-model-from-scratch
In this blog, we will build a small scale text-to-video model from scratch. We will input a text prompt, and our trained model will generate a video based on that prompt.
Language:Jupyter Notebook0 0 00
AI-YinMei
AI吟美-人工智能主播-Vtuber
Language:Python0 0 00
alist
🗂️A file list program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表程序，使用 Gin 和 Solidjs。
Language:Go0 0 00
DataVisualization
:smiling_imp: by vue2.x with echarts3.3.2
Language:Vue1 0 00
fomo3d_clone
clone fomo3d contract source
Language:JavaScript1 0 00
THSTrader-1
量化交易。同花顺免费模拟炒股软件客户端的python API。(Python3)
Language:Jupyter Notebook1 0 00
vr-hall
three.js 3D vr hall
Language:JavaScript1 0 00

JosenJin's Repositories

JosenJin/vr-hall
three.js 3D vr hall
Language:JavaScript1 0 00
JosenJin/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Language:Python0 0 00
JosenJin/AI-text-to-video-model-from-scratch
In this blog, we will build a small scale text-to-video model from scratch. We will input a text prompt, and our trained model will generate a video based on that prompt.
Language:Jupyter Notebook0 0 00
JosenJin/AI-YinMei
AI吟美-人工智能主播-Vtuber
Language:Python0 0 00
JosenJin/AutoX
A UiAutomator on android, does not need root access(安卓平台上的JavaScript自动化工具)
Language:JavaScript0 0 00
JosenJin/ChaosFlow
Some personal original ComfyUI workflows
0 0 00
JosenJin/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Language:Python0 0 00
JosenJin/CoDeF
Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Language:Python0 0
JosenJin/ComfyUI-Templates
ComfyUI Templates
0 0
JosenJin/ComfyUI-wiki
Everything about ComfyUI, including workflow sharing, resource sharing, knowledge sharing, tutorial sharing, and more.关于ComfyUI的一切，工作流分享、资源分享、知识分享、教程分享等
0 0
JosenJin/ComfyUI_Prompt_Gallery
ComfyUI custom node that adds a quick and visual UI selector for building prompts to the sidebar.
Language:JavaScript0 0
JosenJin/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Language:TypeScript0 0
JosenJin/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Language:Python0 0
JosenJin/hon
Home Assistant integration for Haier hOn: support for Haier/Candy/Hoover home appliances like washing machines and air conditioners in 19 languages.
Language:Python0 0
JosenJin/IPTV
M3U Playlist for free TV channels
Language:Python0 0
JosenJin/kimi-free-api
🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长：长文本解读整理】，支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话，零配置部署，多路token支持，自动清理会话痕迹。
Language:TypeScript0 0
JosenJin/MemGPT-AutoGEN-LLM
Run MemGPT-AutoGEN-Local LLM Together
Language:Python0 0
JosenJin/MoneyPrinter
Automate Creation of YouTube Shorts using MoviePy.
Language:Python0 0
JosenJin/moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
Language:Python0 0
JosenJin/open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
Language:Python0 0
JosenJin/pyhOn
Control hOn devices with python
Language:Python0 0
JosenJin/SalesGPT
Context-aware AI Sales Agent to automate sales outreach.
Language:Python0 0
JosenJin/spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
JosenJin/ToolGen
Implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
Language:Python0 0
JosenJin/VideoChat
实时语音交互数字人，支持端到端语音方案（GLM-4-Voice - THG）和级联方案（ASR-LLM-TTS-THG）。可自定义形象与音色，支持音色克隆，首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3 seconds.
Language:Python0 0
JosenJin/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
0 0
JosenJin/WuKongIM
8年积累，沉淀出来的高性能通用通讯服务，支持即时通讯（聊天软件）(IM)(Chat)，消息推送，物联网通讯，音视频信令，直播弹幕，客服系统，AI通讯，即时社区等场景。High-performance universal communication service that supports instant messaging, message push, IoT communication, audio and video signaling, live broadcasting with bullet comments, customer service systems
Language:Go0 0
JosenJin/Xiao_Sense_CameraWebServer_Audio
Arduino sketch that allows to use the Xiao ESP32S3 Sense as a webserver for streaming microphone and camera feeds
Language:C0 0
JosenJin/xiaozhi
Build your own AI friend
Language:JavaScript0 0
JosenJin/xiaozhi-esp32
Build your own AI friend
Language:C++0 0