Pinned Repositories
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
ADENet
AIAlpha
Use unsupervised and supervised learning to predict stocks
alex
Alex Dialogue Systems Framework
AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
ASLP_NWPU_ASR_HW
ASR_Course
avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
zysilence's Repositories
zysilence/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
zysilence/ADENet
zysilence/AppAgent
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
zysilence/ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
zysilence/ASLP_NWPU_ASR_HW
zysilence/ASR_Course
zysilence/botsim
BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots
zysilence/ChatGPT-Next-Web
One-Click to deploy well-designed ChatGPT web UI on Vercel. 一键拥有你自己的 ChatGPT 网页服务。
zysilence/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
zysilence/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
zysilence/dasheng
Official PyTorch code for Deep Audio-Signal Holistic Embeddings
zysilence/DeepSpeedExamples
Example models using DeepSpeed
zysilence/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
zysilence/FACEGOOD-Audio2Face
http://www.facegood.cc
zysilence/FACIAL
FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.
zysilence/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
zysilence/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
zysilence/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
zysilence/llama_docs_bot
Bottoms Up Development with LlamaIndex - Building a Documentation Chatbot
zysilence/MSDWild
zysilence/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
zysilence/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
zysilence/ProAgent
An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation
zysilence/rag-search
RAG Search API
zysilence/searxng
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
zysilence/SymphonyNet
Symphony Generation with Permutation Invariant Language Model
zysilence/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
zysilence/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
zysilence/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
zysilence/Yulan-GARDEN
Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"