/clui

Command Line Unicode Information for macOS

Primary LanguageShellMIT LicenseMIT

clui

Get the description, code point(s) and UTF encoding of Unicode characters and sequences, in a variety of formats and encodings with clui (Command Line Unicode Information) for macOS

Check out blog entries on clui at brunerd.com

clui demo video

See clui in action:
clui walkthrough

clui usage

./clui -u for usage/help output in less

clui (1.0) - Command Line Unicode Info (https://github.com/brunerd/clui)
Usage: clui [options] <input> ...

Input can be:
  * Unicode characters, space or comma delimited (use -x to expand non-delimited strings)
  * Hexadecimal codepoint representations (U+hhhhh or 0xhhhhh), double-quoted muti-point sequences
  * Hyphenated ranges (ascending or descending): z-a, U+A1-U+BF or 0x20-0x7E
  * Category or Group names (see Input Options)
  * Descriptive words or phrases (see Input Options)

Output Options

 -D  Discrete info fields for CharacterDB and localized AppleName.strings

 -H  Hide characters lacking info descriptions

 -l <localization>  
     Show localized info (Emoji only). Use -Ll to list available localizations.

 -p  Preserve case of CharacterDB info          
 
 Encoding style for UTF field
  -E <encoding>
     h*   UTF-8 hexadecimal, space delimited and capitalized (NN) (default)
     H    Hex HTML Entity UTF-32 (&#xnnnn;)
     0    Octal UTF-8 with leading 0 (\0nnn)
     o    Octal UTF-8 (\nnn)
     x    Shell style UTF-8 hex(\xnn)
     u    JS style UTF-16 (\unnnn)
     U    zsh style UTF-32 Unicode Code Point (\Unnnnnnnn)
     w    Web/URL UTF-8 encoding (%nn)

 Output format 
  -O <output format>
     C*   CSV (default)
     c    Character-only, space delimited
     j    JSON output (array of objects)
     J    JSON Sequence output (objects delimited by 0x1E and 0x0A)
     p    Plain output (no field descriptions)
     r    RTF output (plain output with large sized characters)
     y    YAML output

  Format dependent output options
   -f <char size, info size>  set font size for RTF output of char and info fields (default: 256,32)
   -h  Hide headers for CSV output
     
Input Options

 -C <Category>[,Subsection]
      Treat input as a Category name with possible a subsection (see -L for listing)

 -F  Remove Fitzpatrick skin tone modifier and process, then process as-is

 -G <Group>[,Category]
      Treat input as a Group name with possible category name (see -L for listing)

 -l <localization>  
     Search localized descriptions (Emoji only), use -Ll to list available localizations

 -S <mode>  
    Treat input as search criteria
    d    Search descriptions in CharacterDB and AppleName.strings (case insensitive)
    c    Search for character in other Unicode sequences
    C    Search for character plus "related characters"

 -x  Expand and describe each individual code point in a sequence
 -X  Expand plus display original sequence prior to expansion

 -V  Verbatim, process input raw/as-is, no additional interpretation or delimitation
  
Other Modes

 List categories and groups
  -L <mode>  List categories or groups in CSV (use -h to suppress header)
     c    Category list (* after a name denotes subsections)
     C    Category list, with subsections expanded
     g    Groups of categories, top level name
     G    Group name with member categories expanded
     l    Locales available to search and display results from (Emoji only) 
     
 -u  Display usage info (aka help) with less (press q to quit)
  
Examples:

  Search for characters a to z plus "related characters" and output as CSV (default)
  clui -SC a-z

  Look up all available Categories
  clui -Lc

  Get every character in Emoji category and output in RTF to a file
  clui -Or -C Emoji > Emoji.rtf

  All characters in Emoji category with discrete info fields in Spanish to a CSV to a file
  clui -D -l es -C Emoji > Emoji-es.csv

  Search descriptions for substring "family" and expand multi-code point ZWJ sequences 
  clui -X -Sd "family"