Pinned Repositories
2018-CCF-BDCI-China-Unicom-Research-Institute-top2
2018-CCF大数据与计算智能大赛-面向电信行业存量用户的智能套餐个性化匹配模型联通赛-复赛第二名
2018-iFLYTEK-Marketing-Algorithms-Competition-Finals-Rank1
2018科大讯飞营销算法大赛(冠军方案)
2019-CCF-BDCI-OCR-MCZJ-OCR-IdentificationIDElement
2019CCF-BDCI大赛 最佳创新探索奖获得者 基于OCR身份证要素提取赛题冠军 天晨破晓团队 赛题源码
2019Baai-zhihu-Cup-findexp-4th
2019年知乎看山杯第四名
Chatbot_CN
基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口
CrossWOZ
A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
DIAC
问题等价性判断数据预处理,包含添加对抗样本(同音字、近义词替换等)、获取样本的pattern(用通配符替换相同词汇,提取相同和不同词汇)
DrQA
Reading Wikipedia to Answer Open-Domain Questions
lmft
Language Model Fine-Tuning, for ChatGLM, BELLE, LLaMA fine-tuning.
MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP)
haojiepan1's Repositories
haojiepan1/deep-ctr
An attempt of training DNN models to predict ad click-through rate, implemented with Theano.
haojiepan1/Di-tech
滴滴出行供需预测大赛--十强
haojiepan1/Dialog_Corpus
用于训练中英文对话系统的语料库 Datasets for Training Chatbot System
haojiepan1/Feature-engineering
feature engineering example for Kaggle bicycle competition
haojiepan1/Feature_Engineering_and_XGBoost_Parameter_Tuning
Examples of how to do feature engineering and Xgboost parameter tuning
haojiepan1/gbdt_lr_in_recsys
Using gbdt+lr in recommend system and comparing the auc of lr, gbdt, gbdt+lr.
haojiepan1/kaggle-2014-criteo
haojiepan1/kaggle-avazu
2nd place solution for Avazu click-through rate prediction competition
haojiepan1/kaggle-avazu-1
haojiepan1/MIXER
Mixed Incremental Cross-Entropy REINFORCE ICLR 2016
haojiepan1/NLP-Extractive-NEWS-summarization-using-MMR
A simple python implementation of the Maximal Marginal Relevance (MMR) baseline system for text summarization.
haojiepan1/Pandas_data_analysis_guide_examples
Pandas data analysis guide examples
haojiepan1/pyLightGBM
Python binding for Microsoft LightGBM
haojiepan1/python_and_data_easy_examples
haojiepan1/SVM-LBP-picture-classifier
使用LBP方法提取特征,再使用svm进行分类
haojiepan1/tensorflow-zh
谷歌全新开源人工智能系统TensorFlow官方文档中文版
haojiepan1/TextInfoExp
自然语言处理相关实验(基于sougou数据集),包含文本特征提取(TF-IDF),文本分类,文本聚类,word2vec训练词向量及同义词词林中文词语相似度计算、文档自动摘要,信息抽取,情感分析与观点挖掘等。