lijuan-su
AI Scientist (ML); Runner Hiker & Climber. Co-founder of JOY GROUP. @Harvard University @ZJU @UofA @Tencent @MIT UV
@Harvard Medical School @ Massachusetts General Hospital @ZJU @UA @TencentUSA, China
lijuan-su's Stars
yixiu00001/LSTM-CRF-medical
构建医疗实体识别的模型,包含词典和语料标注,基于python构建
fighting41love/jieba_fast
Use C Api and Swig to Speed up jieba 高效的中文分词库
lingluodlut/BioNER-Progress
This repository aims to track the progress in BioNER and give a related paper list and an overview of the state-of-the-art (SOTA).
graykode/nlp-roadmap
ROADMAP(Mind Map) and KEYWORD for students those who have interest in learning NLP
kamranahmedse/developer-roadmap
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
qq547276542/Agriculture_KnowledgeGraph
农业知识图谱(AgriKG):农业领域的信息检索,命名实体识别,关系抽取,智能问答,辅助决策
OHDSI/Vocabulary-v5.0
Build process for the OHDSI Standardized Vocabularies. Currently not available as independent release.
liuhuanyong/TextGrapher
Text Content Grapher based on keyinfo extraction by NLP method。输入一篇文档,将文档进行关键信息提取,进行结构化,并最终组织成图谱组织形式,形成对文章语义信息的图谱化展示。
liuhuanyong/QASystemOnMedicalKG
A tutorial and implement of disease centered Medical knowledge graph and qa system based on it。知识图谱构建,自动问答,基于kg的自动问答。以疾病为中心的一定规模医药领域知识图谱,并以该知识图谱完成自动问答与分析服务。
liuhuanyong/MedicalNamedEntityRecognition
Medical Named Entity Recognition implement using bi-directional lstm and crf model with char embedding.CCKS2017中文电子病例命名实体识别项目,主要实现使用了基于字向量的四层双向LSTM与CRF模型的网络.该项目提供了原始训练数据样本(一般醒目,出院情况,病史情况,病史特点,诊疗经过)与转换版本,训练脚本,预训练模型,可用于序列标注研究.把玩和PK使用.
zhanzecheng/Chinese_segment_augment
python3实现互信息和左右熵的新词发现
blmoistawinde/HarvestText
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
SophonPlus/ChineseNlpCorpus
搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。
wainshine/Chinese-Names-Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
codemayq/chinese-chatbot-corpus
中文公开聊天语料库
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
Sanster/text_renderer
Generate text images for training deep learning ocr model
ankush-me/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
google-research/google-research
Google Research
thunlp/OpenKE
An Open-Source Package for Knowledge Embedding (KE)
thunlp/GNNPapers
Must-read papers on graph neural networks (GNN)
JetBrains/awesome-pycharm
A curated list of resources for learning and using PyCharm, a Python IDE from JetBrains
kermitt2/grobid
A machine learning software for extracting information from scholarly documents
hwalsuklee/awesome-deep-text-detection-recognition
A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.
chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
jiesutd/RichWordSegmentor
Neural word segmentation with rich pretraining, code for ACL 2017 paper
lancopku/pkuseg-python
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
NLPatVCU/medaCy
:hospital: Medical Text Mining and Information Extraction with spaCy
crownpku/Information-Extraction-Chinese
Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取
shiyybua/NER
基于tensorflow深度学习的中文的命名实体识别