VincentWong1
Graduated from School of Management and Engineering, Nanjing University.
Alibaba GroupNanjing, Jiangsu Province, PRC
VincentWong1's Stars
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
zejunwang1/CSTS
中文自然语言推理与语义相似度数据集
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
mimno/Mallet
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
TIGER-AI-Lab/LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
dwzhu-pku/LongEmbed
Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"
facebookresearch/contriever
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
nipunsadvilkar/pySBD
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
TencentGameMate/chinese_speech_pretrain
chinese speech pretrained models
nl8590687/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
itsnamgyu/reasoning-teacher
Official code for "Large Language Models Are Reasoning Teachers", ACL 2023
HarleysZhang/dl_note
深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。
milvus-io/pymilvus
Python SDK for Milvus.
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
NLP-LOVE/Introduction-NLP
HanLP作者的新书《自然语言处理入门》详细笔记!业界良心之作,书中不是枯燥无味的公式罗列,而是用白话阐述的通俗易懂的算法模型。从基本概念出发,逐步介绍中文分词、词性标注、命名实体识别、信息抽取、文本聚类、文本分类、句法分析这几个热门问题的算法原理与工程实现。
baidu/DDParser
百度开源的依存句法分析系统
explodinggradients/ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
JackKuo666/NLP_basis
这是我学习一个NLP教程【2019最新AI 自然语言处理之深度机器学习顶级项目实战课程】做的笔记与代码
HIT-SCIR/ltp
Language Technology Platform
percent4/embedding_rerank_retrieval
本项目是针对RAG中的Retrieve阶段的召回技术及算法效果所做评估实验。使用主体框架为LlamaIndex.
X-PLUG/Multi-LLM-Agent
billxbf/ReWOO
Decoupling Reasoning from Observations for Efficient Augmented Language Models
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
mermaid-js/mermaid
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
TigerResearch/TigerBot
TigerBot: A multi-language multi-task LLM
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs