Python text manipulation

-- Steps to use the scripts available in this repository are included in the README file in each folder --

Requirements for this repository:

  • Python 2.7
  • Flask Framework
  • Python packages and tools (python-docx, openpyxl, Anaconda, pip, pandas, csv, json, BeautifulSoup)
  • MySQL

Learning

1. Learn Python

  • Solved examples (50 examples) from learnpythonthehardway.com

2. Flask Tutorial

  • Flask Python tutorials

3. Examples

  • Add new word (add_new_word)
  • Capitalize each line (capitalize_line)
  • Add, delete and view records from MySQL (console_to_mysql)
  • Contact manager using Flask REST API and SQLite (contact_manager)
  • Contact manager from console on SQL (contact_manager_SQL)
  • Find count of digits and letters (count_digits_letters)
  • Copy a column in a tab-separated file (csv_copy_column)
  • Find number of vowels in file (find_vowels)
  • Format a sentence (format_sentence)
  • Generate 2D array (generate_array)
  • Copy JSON file to CSV (json_to_csv)
  • Find numbers divisible by 7 and not a multiple of 5 (num_divisible)
  • Print words of specific length (print_words)
  • Sort words and remove duplicates in a string (sort_remove_duplicates)
  • Split and join words in a comma-separated string (split_join)
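
As a rough illustration of the kind of exercise listed above, the sketch below covers two of them: finding numbers divisible by 7 that are not multiples of 5, and sorting the words of a string while removing duplicates. The function names and the range bounds are assumptions made for illustration; they are not taken from the scripts in num_divisible or sort_remove_duplicates.

```python
def divisible_by_7_not_5(start, end):
    """Return the numbers in [start, end] divisible by 7 but not by 5."""
    return [n for n in range(start, end + 1) if n % 7 == 0 and n % 5 != 0]


def sort_remove_duplicates(text):
    """Return the unique whitespace-separated words of text, sorted."""
    # set() drops repeated words; sorted() orders the remainder.
    return " ".join(sorted(set(text.split())))


if __name__ == "__main__":
    # Hypothetical inputs chosen only to exercise the two helpers.
    print(divisible_by_7_not_5(1, 50))
    print(sort_remove_duplicates("hello world and hello again"))
```

Both helpers return values instead of printing, so they can be reused or tested separately from the console output.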

File conversions

1. Demo

  • Basic HTML to CSV (basic_html_to_csv)
  • Single DOCX to multiple MD for QA (docx_to_md_QA_single)
  • Single XLSX to single MD (xlsx_to_md)
  • Single XLSX to multiple MD Notes v1 (xlsx_to_md_file_single)
  • Single XLSX to multiple MD Notes v2 (xlsx_to_md_file_single)
  • Basic XLSX to TXT (xlsx_to_txt)

2. Tasks

  • Single CSV to single USFM file (csv_to_usfm)
  • DOCX to MD QA (docx_to_md_QA)
  • DOCX to MD Notes (docx_to_md_notes)
  • DOCX to USFM (docx_to_usfm)
  • HTML to CSV interlinear (html_to_csv_interlinear)
  • Simple USFM to CSV (usfm_to_csv)
  • XLSX Formatting (xlsx_format)
  • XLSX to MD Format Terms (xlsx_to_md_format_terms)
  • XLSX to MD QA (xlsx_to_md_QA)
  • XLSX to MD Notes (xlsx_to_md_notes)
  • XLSX to MD Terms (xlsx_to_md_terms)
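
To give a sense of what a conversion task such as the simple USFM to CSV one might involve, here is a minimal sketch. It assumes each verse sits on a single line introduced by the \v marker and that chapters are introduced by \c; all other markers are ignored. The function names and the column headers are hypothetical and are not taken from the usfm_to_csv script.

```python
import csv
import re
import sys

# \c <number> starts a chapter; \v <number> <text> starts a verse.
CHAPTER_RE = re.compile(r"^\\c\s+(\d+)")
VERSE_RE = re.compile(r"^\\v\s+(\S+)\s*(.*)$")


def usfm_to_rows(usfm_text):
    """Return (chapter, verse, text) tuples found in a USFM string."""
    rows = []
    chapter = ""
    for line in usfm_text.splitlines():
        line = line.strip()
        match = CHAPTER_RE.match(line)
        if match:
            chapter = match.group(1)
            continue
        match = VERSE_RE.match(line)
        if match:
            rows.append((chapter, match.group(1), match.group(2)))
    return rows


def write_rows(rows, out_file):
    """Write a header and the rows as CSV to an open file-like object."""
    writer = csv.writer(out_file)
    writer.writerow(["chapter", "verse", "text"])
    writer.writerows(rows)


if __name__ == "__main__":
    # Hypothetical sample input, for illustration only.
    sample = "\\c 1\n\\v 1 First verse text\n\\v 2 Second verse text"
    write_rows(usfm_to_rows(sample), sys.stdout)
```

Verse text that continues on following lines, footnotes, and character-level markers are not handled here; a real conversion would need to decide how to treat them.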

Other Tasks

  • Lexicon Search app using Python GUI (LexiconSearchScript)
  • Format translation of QA and copy to target file (formatQAfromXlsx)
  • Open Bible Stories (OBS)
  • Parse Strong's Lexicon (parseStrongsLexicon)
  • Rectify USFM errors (rectify_USFM)
  • Translation Words Project (translation_Words)
  • Translate files using the Google Translate API (md/text files only) (using_Google_Translate)
  • USFM tag validator (validate_usfm_tags)