Pinned Repositories
bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
ditto
Code for the paper "Deep Entity Matching with Pre-trained Language Models"
ginza
A Japanese NLP Library using spaCy as framework based on Universal Dependencies
HappyDB
A corpus of 100,000 happy moments
jrte-corpus
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
opiniondigest
OpinionDigest: A Simple Framework for Opinion Summarization (ACL 2020)
sato
Code and data for Sato https://arxiv.org/abs/1911.06311.
SubjQA
A question-answering dataset with a focus on subjective information
t5-japanese
Codes to pre-train Japanese T5 models
vecscan
Megagon Labs's Repositories
megagonlabs/ginza
A Japanese NLP Library using spaCy as framework based on Universal Dependencies
megagonlabs/ditto
Code for the paper "Deep Entity Matching with Pre-trained Language Models"
megagonlabs/bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
megagonlabs/sato
Code and data for Sato https://arxiv.org/abs/1911.06311.
megagonlabs/jrte-corpus
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
megagonlabs/vecscan
megagonlabs/SubjQA
A question-answering dataset with a focus on subjective information
megagonlabs/asdc
Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)
megagonlabs/instruction_ja
Japanese instruction data (日本語指示データ)
megagonlabs/cocosum
:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)
megagonlabs/starmie
Resources for PVLDB 2023 submission
megagonlabs/zett
:see_no_evil: Code for Zero-shot Triplet Extraction by Template Infilling (Kim et al; IJCNLP-AACL 2023)
megagonlabs/sudowoodo
The source code of the Sudowoodo paper in ICDE 2023
megagonlabs/xatu
🕊️ Code and Data for XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates (Zhang et al; LREC-COLING 2024)
megagonlabs/llm-longeval
💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu, Iso et al; EACL 2024)
megagonlabs/magneton
Repository of the Magneton framework for authoring interaction-aware and customizable widgets.
megagonlabs/witqa
megagonlabs/meganno-client
megagonlabs/minun
Evaluating Counterfactual Explanations for Entity Matching
megagonlabs/Tyrogue
megagonlabs/MCR
megagonlabs/pilota
✈ SCUD generator (解釈文生成器)
megagonlabs/quasi_japanese_reviews
Quasi Japanese Reviews (擬似レビューデータ)
megagonlabs/hotel_review_scud
宿泊施設口コミ解釈データ
megagonlabs/magneton-examples
Example widgets created using the Magneton framework
megagonlabs/meganno-service
megagonlabs/meganno-ui
megagonlabs/rjdb
megagonlabs/scud2query
Scud2Query dataset
megagonlabs/watchog
The code for SIGMOD 2024 paper titled "Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation"