ewfian's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
junyanz/CycleGAN
Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
dotnet/aspnet-api-versioning
Provides a set of libraries which add service API versioning to ASP.NET Web API, OData with ASP.NET Web API, and ASP.NET Core.
vietanhdev/anylabeling
Effortless data labeling with AI support from YOLO, Segment Anything (SAM and SAM2), and MobileSAM.
kajweb/dict
English dictionary and word lists: CET-4, CET-6, and postgraduate entrance exam vocabulary, plus IELTS, TOEFL, SAT, GMAT, and GRE
kha-white/manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
travisgoodspeed/gbrom-tutorial
Tutorial for extracting the GameBoy ROM from photographs of the die.
annotorious/annotorious
Add image annotation functionality to any web page with a few lines of JavaScript.
rbrahul/Awesome-JSON-Viewer
A Chrome extension that visualises JSON responses and offers a better JSON prettifying experience.
polm/fugashi
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
allenai/pawls
Software that makes labeling PDFs easy.
zzzDavid/ICDAR-2019-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
sparkfish/augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
stephenmk/Jitendex
A free, offline, and openly licensed Japanese-to-English dictionary. Updates weekly!
ikegami-yukino/neologdn
Japanese text normalizer for mecab-neologd
martincostello/xunit-logging
Logging extensions for xunit
chakki-works/chABSA-dataset
chakki's Aspect-Based Sentiment Analysis dataset
stockmarkteam/ner-wikipedia-dataset
A Japanese named entity recognition dataset built from Wikipedia
DocCreator/DocCreator
DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation
trbenning/serilog-sinks-xunit
The xunit test output sink for Serilog
stephenmk/yomichan-jlpt-vocab
JLPT level tags for words in Yomichan
tanreinama/Japanese-BPEEncoder_V2
Japanese-BPEEncoder Version 2
librz/shell-scripts
Bash scripts and dotfiles for a better terminal experience
SimonTart/json-fragment-parser
Parse partial JSON strings
pettenuzzofabio/image-augmentation
Image dataset augmentation for machine learning
DungLe13/bidding-dataset
Bidding documents for the paper "CinBidding: A Dataset for Domain-specific Information Extraction with Limited Data"
retarfi/jptranstokenizer
Japanese Tokenizer for transformers library