kyodocn
I am a current student from Shenzhen University, and my interests include natural language processing and data visualization.
Shenzhen UniversityShenZhen
Pinned Repositories
100knocks-preprocess
データサイエンス100本ノック(構造化データ加工編)
a_bccwj
Universal Dependencies online documentation
allennlp-shiba-model
AllenNLP integration for Shiba: Japanese CANINE model
amazon-reviews-scraper
Yet another multi language scraper for Amazon targeting reviews.
Anatext
A tool to extract relations from the transaction comments.
AnnotatedFKCCorpus
Annotated Fuman Kaitori Center Corpus
deplacy
CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis
textprep
Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Translation tasks. It is designed especially for logographic languages such as Chinese and Japanese.
kyodocn's Repositories
kyodocn/100knocks-preprocess
データサイエンス100本ノック(構造化データ加工編)
kyodocn/a_bccwj
Universal Dependencies online documentation
kyodocn/awesome-japanese-nlp-resources
A curated list of resources dedicated to Python libraries, pre-trained models, dictionaries, and corpora of NLP for Japanese
kyodocn/BCCWJ-SPR2
kyodocn/bert-book
「BERTによる自然言語処理入門: Transformersを使った実践プログラミング」サポートページ
kyodocn/bert-classification-tutorial
【2023年版】BERTによるテキスト分類
kyodocn/BERT_Japanese_Google_Colaboratory
Google Colaboratoryで日本語のBERTを動かす方法です。
kyodocn/bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
kyodocn/buzz
linguistics backend
kyodocn/chatgpt-vscode
A VSCode extension that allows you to use ChatGPT inside the IDE
kyodocn/chiVe
Japanese word embedding with Sudachi and NWJC 🌿
kyodocn/d2l-zh
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被60个国家的400所大学用于教学。
kyodocn/data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
kyodocn/esupar
Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models for Japanese and other languages
kyodocn/ja_sentence_segmenter
japanese sentence segmentation library for python
kyodocn/jcms
kyodocn/jd-shopper
京东自动下单 (自动登录,指定时间预约商品,商品补货监控,自动加购物车,自动下单)
kyodocn/kintoki
kyodocn/kwja
A unified language analyzer for Japanese
kyodocn/news-fetch
A Python Package which helps to scrape all news details from any news websites
kyodocn/newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs:
kyodocn/notebooks
Jupyter notebooks for the Natural Language Processing with Transformers book
kyodocn/numpy-100
100 numpy exercises (with solutions)
kyodocn/Python-100-Days
Python - 100天从新手到大师
kyodocn/sentence-transformers
Sentence Embeddings with BERT & XLNet
kyodocn/sentiment_ja
オリジナルのリポジトリがなくなったので、偶然直前にクローンしていたデータをアップロードしています。著作権、ライセンスはオリジナルに準じます。
kyodocn/SudachiTra
Japanese tokenizer for Transformers
kyodocn/SuPar-UniDic
Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models
kyodocn/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
kyodocn/vaporetto
🛥 Vaporetto: a fast and lightweight pointwise prediction based tokenizer