Columnar
A library for creating columnar output strings using data as input.
Installation
pip install columnar
Examples
from columnar import columnar
from click import style
data = [
['Strade Bianche', 'Saturday 9th March', 'Tuscany, Italy', "Men's - 176km / Women's - 103km. Both men's and women's "],
['Omloop Het Nieuwsblad', 'Saturday 2nd March', 'Gent, Belgium', "Men's - 200km / Women's - 130km"],
['Milan-San Remo', 'Saturday 23rd March', 'Milan to San Remo, Italy', '295km'],
['Tour of Flanders', 'Sunday 7th April', 'Antwerp to Oudenaarde, Flanders region, Belgium', '260km']
]
patterns = [
('Saturday.+', lambda text: style(text, fg='white', bg='blue')),
('\d+km', lambda text: style(text, fg='cyan')),
('Omloop Het Nieuwsblad', lambda text: style(text, fg='green')),
('Strade Bianche', lambda text: style(text, fg='white')),
('Milan-San Remo', lambda text: style(text, fg='red')),
('Tour of Flanders', lambda text: style(text, fg='yellow')),
]
table = columnar(data, headers=['Race', 'Date', 'Location', 'Distance'], patterns=patterns)
print(table)
Or for that fresh Docker look:
from columnar import columnar
headers = ['name', 'id', 'host', 'notes']
data = [
['busybox', 'c3c37d5d-38d2-409f-8d02-600fd9d51239', 'linuxnode-1-292735', 'Test server.'],
['alpine-python', '6bb77855-0fda-45a9-b553-e19e1a795f1e', 'linuxnode-2-249253', 'The one that runs python.'],
['redis', 'afb648ba-ac97-4fb2-8953-9a5b5f39663e', 'linuxnode-3-3416918', 'For queues and stuff.'],
['app-server', 'b866cd0f-bf80-40c7-84e3-c40891ec68f9', 'linuxnode-4-295918', 'A popular destination.'],
['nginx', '76fea0f0-aa53-4911-b7e4-fae28c2e469b', 'linuxnode-5-292735', 'Traffic Cop'],
]
table = columnar(data, headers, no_borders=True)
print(table)
Columnar also supports emojis and characters that take up two display columns (assuming your display knows what to do with them):
from columnar import columnar
headers = ["User", "Message", "Zip"]
data = [
['Yiying Lu', 'Fried Dumplings!!!! Yum! 😍😍😍', 47130],
['Jennifer Lee', 'Facebook banned the 🍑, can you believe it?', 97153],
['Premier12', '本日のヒーロー🦸周東選手✨ #周東佑京 #侍ジャパン #プレミア12 #AUSP12 #Premier12', 549726]
]
table = columnar(data, headers)
print(table)
Patterns
Columnar supports patterns, which are two-item tuples each containing a regular expression and a function. The regular expression is applied to each item in data
using re.search()
and if there is a match the corresponding function is applied to the text of that element. Only the first matching pattern is applied, meaning patterns can be prioritized by their order in the input array. This can be used to perform colorization, casing, or other custom tasks that will affect the display of the text in the table.
Color Support
As noted above, color may be applied to text by adding it to the text through a pattern. However, text may also be pre-colored by applying ANSI color codes to the text before it is passed to columnar
as made easy by libraries like click
and colorama
. Note however, that any color that is applied will be applied to the contents of the whole cell. For example, if the text for a cell is
f"unmodified text {click.style('modified text', fg='blue')} more unmodified text"
the entire cell's text will be turned blue.
Selecting Columns
If your table has a large number of columns, or you wish to highlight a subset of the columns use the select
keyword argument. It takes a list of strings which are compiled to regular expressions using re.compile(arg, re.I)
and used to select columns using pattern.search(column_name)
. For example, given the following columns
['Name', 'BirthDate', 'Zip Code', 'City Code', 'County Code']
using select=['name', '.*code']
will select all columns except the BirthDate
column.
Dropping Columns
It is often the case that one or more columns of the data will not be useful. For example, columns where all the values are "Null" or "-". To filter out these columns use the drop
keyword argument. This argument takes a list of values and drops any column whose contents are a subset of those values. For example, given four columns
a NA 1 -
b NA 2 Null
- NA 3 -
d NA 4 None
using drop=['-', 'Null', 'NA', 'None']
will drop the second and fourth columns, even though the first column contains a dash also.
Column Sizing Algorithm
There are an infinite number of ways to determine column sizing and text wrapping given a dataset. This package allows the user to specify a minimum column width, a maximum column width, and a "wrap max" which partially define wrapping and column sizing. The rest of the logic that goes into determining how to fit data into a table when the data is wider than the terminal employs a pretty simple heuristic. First determine how wide each column wants to be without wrapping. If all the columns are too wide to fit on the screen, shrink as many columns as are needed in order for the table to fit, starting with the widest column and progressing through the columns from largest to smallest. If the size of the columns falls below the minimum column width then raise an exception, specifically a columnar.exceptions.TableOverflowError
. This should only happen if there are so many columns that terminal_width / num_columns
is less than the minimum column width.
Text Wrapping
The contents of a column are wrapped as needed to fit in the column with no effort made to split on spaces. However, new-line characters are preserved and tab characters are replaced with four spaces. The maximum number of times the contents of a column are wrapped before being truncated is given by wrap_max
. Another way to think about wrap_max
is that wrap_max + 1
is the maximum number of rows a single cell can occupy. Any content past the wrap_max + 1
th row is truncated.
API
columnar()
Arguments
data
An iterable of iterables, typically a list of lists of strings where each string will occupy its own cell in the table. However, list elements need not be strings. No matter what is passed, each element in the list is converted to a string using str()
.
headers=None
A list of strings, one for each column in data
, which will be displayed as the table headers. If left as None
this will produce a table that does not have a header row.
patterns=[]
As explained above, patterns accepts a list of two-item tuples which can be used to transform the input data
in order to perform tasks like text coloring, capitalization, or other formatting.
drop=[]
As explained above, drop takes a list of strings and if any column contains only elements in that list the column and its corresponding header will be excluded from the table. Can be used to exclude columns where all the values are "Null", or "-", etc. If an empty list is passed (default) then no columns are dropped.
select=[]
Accepts a list of string that are compiled to regular expressions using the case insensitive, re.I
, flag. Any column that matches any of these regular expressions is kept while all other columns are dropped. If select
is specified drop
is ignored, meaning that it is possible to display columns that may have been dropped by drop
by specifying them in select
. Passing an empty list (default) causes all columns not dropped by drop
to be displayed.
no_borders=False
Accepts a boolean value that specifies whether or not to display the borders between rows and columns. Passing True
will hide all the borders and convert the headers to all caps for a more minimalistic look.
head=0
Similar to the unix bash command, displays only head
number of rows of data. For example
columnar(data, headers, head=4)
will display the first four rows of data. Passing 0
(default) will display all the data.
justify='l'
Specifies how each column should be justified. Justification options are either l
, c
, or r
for left, center, and right justification respectively.
This argument accepts either a single value, or a list with len(list) == num_columns
. If a single value is specified the justification for all columns will be set to that value. Otherwise, if a list is supplied, values will be applied to each column individually. For exmaple
columnar(data, headers=['one', 'two', 'three'], justify='c')
will center all three columns, while
columnar(data, headers=['one', 'two', 'three'], justify=['r', 'c', 'l'])
will right-justify column 'one', center column 'two', and left-justify column 'three'.
wrap_max=5
Sets the maximum number of times a line will wrap inside its cell. Another way to think of this is that wrap_max + 1
is the maximum number of lines that a cell can occupy. New-line characters in the input are preserved, meaning that they count against the value of wrap_max
.
max_column_width=None
Sets the maximum width for a column, causing the contents to wrap if they contain more characters than max_column_width
. Setting this value to None
(default) will only cause text to be wrapped if the whole table is too wide to fit on the screen causing the column-sizing algorithm to kick in.
min_column_width=5
Sets the minimum width of a column, adding whitespace to either the left side, right side, or both sides depending on the value of justify
. Note that if min_column_width
is too high the table may not fit on the screen and a columnar.exceptions.TableOverflowError
will be thrown.
row_sep='-'
Specifies the character, or string, used to draw borders between the rows.
column_sep='|'
Specifies the character, or string, used to draw borders between the columns.
terminal_width=None
Specifies the width of the output display. If left as None
the width will default to shutil.get_terminal_size().columns
. However, for cases where the default does not give a desirable result the display width can be specified here.