leonnewton's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
chatchat-space/Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and language models such as ChatGLM, Qwen and Llama
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
microsoft/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
shibing624/text2vec
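text2vec, text to vector. A text representation tool that converts text into vector matrices; it implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.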
schemaspy/schemaspy
Database documentation built easy
alibaba/EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
LIAAD/yake
Single-document unsupervised keyword extraction
capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
fighting41love/cocoNLP
A Chinese information extraction tool.
bytedance/godlp
Sensitive information protection toolkit
saltudelft/ml4se
A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering
BeyonderXX/InstructUIE
Universal information extraction with instruction learning
wenge-research/YAYI-UIE
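YAYI information extraction large model: instruction-tuned on millions of manually constructed, high-quality information extraction examples, developed by the Wenge Research (中科闻歌) algorithm team. (Repo for YAYI Unified Information Extraction Model)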
LC1332/Luotuo-Text-Embedding
Luotuo Embedding (骆驼嵌入) is a text embedding model developed by 李鲁鲁, 冷子昂, 陈启源, 蒟蒻, and others.
AvalZ/WAF-A-MoLE
A guided mutation-based fuzzer for ML-based Web Application Firewalls
nvuillam/github-dependents-info
Collect information about dependencies between a github repo and other repositories. Results available in JSON, markdown and badge
shibing624/nerpy
🌈 NERpy: Implementation of Named Entity Recognition using Python. A named entity recognition tool that supports models such as BertSoftmax and BertSpan and works out of the box.
megagonlabs/sato
Code and data for Sato https://arxiv.org/abs/1911.06311.
percent4/keras_bert_sequence_labeling
This project implements Chinese sequence labeling with Keras and Keras-bert, fine-tuning BERT and testing it on several named entity recognition datasets.
Yuerino/obfuscator-pass
A collection of LLVM passes for obfuscating
hybridtheory/floc-simhash
A fast python implementation of the SimHash algorithm.
data-dev/DataTracer
Data Lineage Tracing Library
hululuzhu/gpt-j
Uses GPT-J (an open-source, simplified version of GPT-3) to test Chinese instruction inputs (zero-shot learning) and evaluate the code or text answers it outputs
a42labs/infer-col-types