d4ny1's Stars
ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
PeterH0323/Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭建后端🗝️、Docker-compose 打包部署🐋
2noise/ChatTTS
A generative speech model for daily dialogue.
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
HVision-NKU/StoryDiffusion
Create Magic Story!
yoheinakajima/prettygraph
An experimental UI for text-to-knowledge-graph generation
teableio/teable
✨ The Next Gen Airtable Alternative: No-Code Postgres
searxng/searxng
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
numz/sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
harry0703/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
madawei2699/awesome-seo
Google SEO Research and Web Traffic Monetization
1003715231/gptstore-prompts
Here are the Top 100 prompts on GPTStore, which we can use to learn and improve prompt engineering.
t3-oss/create-t3-app
The best way to start a full-stack, typesafe Next.js app
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Librum-Reader/Librum
The Librum client application
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.
mrmps/SMRY
A tool to get summaries and get past paywalls
paul-gauthier/aider
aider is AI pair programming in your terminal
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
PromtEngineer/localGPT
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
FlowiseAI/Flowise
Drag & drop UI to build your customized LLM flow
karpathy/nn-zero-to-hero
Neural Networks: Zero to Hero
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
AndrewZhe/lawyer-llama
中文法律LLaMA (LLaMA for Chinese legel domain)
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.