/NERAutoRecognition

A NER project that can label entity automatically based on Hanlp

Primary LanguagePython

Introduction

A NER auto labeling project based on doccano and hanLP

features

  • preprocess the source text data
  • auto labeling based on HanLP
  • convert file format

procedures

  1. 数据预处理
  2. 是否调用hanlp
  3. 导入doccano后标注
  4. 导出转格式
  5. 句子统计、实体统计
  6. 数据集划分