Pinned Repositories
bert_multilabel
bert multilabel. classifier
CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
dataset
Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions. Dataset also includes support for freezing data to CSV and JSON flat files.
DeeplearningForTextClassification
deep learning model for text classification
fbthrift
Facebook's branch of Apache Thrift, including a new C++ server.
folly
An open-source C++ library developed and used at Facebook.
OnlineLabel
Online Segmentation ans POS tagger with Average Perceptron
protobuf
Protocol Buffers - Google's data interchange format
SimpleXGBoost
a simple xgboost implement
XgboostAndLR
use xgboost and lr model for text classification. xgboost is used to be a feature transform for LR
zldeng's Repositories
zldeng/bert_multilabel
bert multilabel. classifier
zldeng/CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
zldeng/TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
zldeng/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
zldeng/awesome-knowledge-graph
整理知识图谱相关学习资料
zldeng/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
zldeng/chinese_chatbot_corpus
中文公开聊天语料库
zldeng/CircleLoss
Pytorch implementation of the paper "Circle Loss: A Unified Perspective of Pair Similarity Optimization"
zldeng/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
zldeng/ColossalAI
Making large AI models cheaper, faster and more accessible
zldeng/DeepMatch
A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors for user and item which can be used for ANN search.
zldeng/developer2gwy
公务员从入门到上岸,最佳程序员公考实践教程
zldeng/examples
Examples for Bazel
zldeng/FlagEmbedding
Dense Retrieval and Retrieval-augmented LLMs
zldeng/GNN4NLP-Papers
A list of recent papers about Graph Neural Network methods applied in NLP areas.
zldeng/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
zldeng/In-Context-Learning_PaperList
Paper List for In-context Learning 🌷
zldeng/incubator-brpc
Industrial-grade RPC framework used throughout Baidu, with 1,000,000+ instances and thousands kinds of services. "brpc" means "better RPC".
zldeng/KBQA-BERT
基于知识图谱的问答系统,BERT做命名实体识别和句子相似度,分为online和outline模式
zldeng/Keyword-BERT
zldeng/kkndme_tianya
天涯 kkndme 神贴聊房价
zldeng/learning-spark
Example code from Learning Spark book
zldeng/libhv
Like libevent, libev, and libuv, libhv provides event-loop with non-blocking IO and timer, but simpler apis and richer protocols.
zldeng/MMdnn
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.
zldeng/modern-cpp-tutorial
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly
zldeng/NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
zldeng/numpy-ml
Machine learning, in numpy
zldeng/pytorch-question-answering
Collection of important Question-Answering papers implemented in PyTorch
zldeng/QA-Survey
北航大数据高精尖中心研究团队对问答系统的调研。包括基于知识图谱的问答系统(KBQA)和基于文本的问答系统(TextQA),每类系统分别对学术界和工业界进行调研。
zldeng/radish
C++ model train&inference framework