/jp-punctuation

(日本語)句読点のない文字列を文単位に区切る

Primary LanguageRuby

How to Use

Basic
# show all texts
cat json/001.json | jq -r .paragraphs[].cues[].text

# show only one paragraph
cat json/001.json | jq -r .paragraphs[0].cues[].text

# redirect to file
cat json/001.json | jq -r .paragraphs[0].cues[].text > raw_text/001.txt

# remove line breaks
cat json/001.json | jq -r '.paragraphs[].cues[].text | gsub("[\\n\\t]"; "")'