/IADocWordCount

Bash script which takes arguments and uses them to gather item word count from an Internet Archive object.

Primary LanguageShellMIT LicenseMIT

IADocWordCount

Bash script which takes arguments and uses them to gather item word count from an Internet Archive object.

This script takes 2 arguments from the commandline.

1: Identifier for the Internet Archive Object

2: Free text designating a 'grouping' for the gathered files.

Sample script execution: "bash ia_word_count.sh myid doc"

Data gathered from the Internet Archive:

1: djvu.txt file which hold the raw OCR text for the object.

2: API call which gathered a JSON file holding the publication date for the object.

3: Parse date from JSON file and store as a variable.

Processing:

1: Count the number of words in the djvu.txt file.

2: Store the word count in a file that is named 'grouping_date_identifier_words.txt'

License

This software on this site is provided "as-is," without any express or implied warranty. In no event shall libmanuk be held liable for any damages arising from the use of the software.