Pinned Repositories
DependencyParser.jl
Dependency parser written in Julia.
fago
Something like fasd or autojump, in Golang (WIP).
gazetter-of-japan
Japanese place name dictionary
hokkaidle
A game where you guess Hokkaido's municipalities on a map. Inspired by Wordle.
ngram
CLI n-gram counter, written in Rust.
people-map-japan
A People Map of Japan, inspired by The Pudding
sudachi.js
The Japanese tokenizer Sudachi in JavaScript (incomplete)
TACL-Membership
Membership Inference Attacks on Sequence-to-Sequence Models (Hisamoto et al., TACL 2020)
tokaido-scrollytelling
53 Stations of the Tōkaidō - Scrollytelling (Scroll + Storytelling)
sudachi.rs
Sudachi in Rust 🦀 and new generation of SudachiPy
sorami's Repositories
sorami/DependencyParser.jl
Dependency parser written in Julia.
sorami/sudachi.js
The Japanese tokenizer Sudachi in JavaScript (incomplete)
sorami/TACL-Membership
Membership Inference Attacks on Sequence-to-Sequence Models (Hisamoto et al., TACL 2020)
sorami/fago
Something like fasd or autojump, in Golang (WIP).
sorami/ngram
CLI n-gram counter, written in Rust.
sorami/gopun
Makes sentences read like the Go language
sorami/sudachi.ts
Unofficial, incomplete
sorami/JuliaTokyo-2
Materials for the introductory session of JuliaTokyo #2.
sorami/elasticsearch-sudachi
The Japanese analysis plugin for elasticsearch
sorami/ginza
A Japanese NLP Library using spaCy as framework based on Universal Dependencies
sorami/gitignore
A collection of useful .gitignore templates
sorami/homebrew-cask
A CLI workflow for the administration of Mac applications distributed as binaries
sorami/Japanese-Company-Lexicon
sorami/julia-doc-ja
A Japanese translation of the Julia documentation
sorami/logo
sorami/myka
Burmese or Georgian?
sorami/open-data-registry
A registry of publicly available datasets on AWS
sorami/primer
Primer is a Jekyll theme for GitHub Pages
sorami/qmk_firmware
Miryoku is an ergonomic, minimal, orthogonal layout for ergo or ortho keyboards. Crkbd Keymap by Manna Harbour includes crkbd-specific hardware feature support. See the forkreadme branch (linked).
sorami/Sudachi
A Japanese Tokenizer for Business
sorami/SudachiDict
A lexicon for Sudachi
sorami/SudachiPy
Python version of Sudachi, a Japanese tokenizer.
sorami/SudachiTra
Japanese tokenizer for Transformers
sorami/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
sorami/w2v-sembei
C++ implementation of word segmentation-free version of word2vec
sorami/xsv
A fast CSV command line toolkit written in Rust.