magicleo's Stars
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
2noise/ChatTTS
A generative speech model for daily dialogue.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Vaibhavs10/insanely-fast-whisper
shenweichen/DeepCTR
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
PaddlePaddle/PaddleRec
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESMM、ESCMM, MAML、xDeepFM、DeepFEFM、NFM、AFM、RALM、DMR、GateNet、NAML、DIFM、Deep Crossing、PNN、BST、AutoInt、FGCNN、FLEN、Fibinet、ListWise、DeepRec、ENSFM,TiSAS,AutoFIS等,包含经典推荐系统数据集criteo 、movielens等
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
CVI-SZU/Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
Doragd/Algorithm-Practice-in-Industry
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
ArtifexSoftware/pdf2docx
Open source Python library for converting PDF to DOCX.
Kensuke-Hinata/statistic
collecting books, papers and docs.
lyhue1991/torchkeras
Pytorch❤️ Keras 😋😋
alibaba/EasyRec
A framework for large scale recommendation algorithms.
ray-project/llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
MgArcher/Text_select_captcha
实现文字点选、选字、选择、点触验证码识别,基于pytorch训练
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
reczoo/FuxiCTR
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
thu-coai/ConvLab-2
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
RUCAIBox/DenseRetrieval
chenwr727/esmm_mmoe_deepfm
基于ESMM、MMoE和deepFM的多目标模型