A CLI tool written in Python to dump binary files and files in intel hex format. It can generate a report for any file based on report template. This is useful for images which contain specific data always on the same address, e.g. a CRC, signature, etc.
There are a lot of hex viewers already, but I was not able to find one which I could configure in a way to generate something like a report.
- Installation
- Usage
- Overview
- Datatypes
- Examples
- Macros
- macros_compare_values()
- m_read_uint8()
- m_read_uint16le()
- m_read_uint16be()
- m_read_uint32le()
- m_read_uint32be()
- m_read_uint64le()
- m_read_uint64be()
- m_read_int8()
- m_read_int16le()
- m_read_int16be()
- m_read_int32le()
- m_read_int32be()
- m_read_int64le()
- m_read_int64be()
- m_read_float32le()
- m_read_float32be()
- m_read_float64le()
- m_read_float64be()
- m_read_string()
- m_calc_checksum()
- m_swap_bytes_u16()
- m_swap_bytes_u32()
- m_swap_words_u32()
- Used Libraries
- Issues, Ideas And Bugs
- License
- Contribution
$ git clone https://github.com/BlueAndi/pyHexDump.git
$ cd pyHexDump
$ pip install .
Show help information:
$ pyHexDump --help
- "int8": signed 8-bit
- "uint8": unsigned 8-bit
- "int16le": signed 16-bit little endian
- "int16be": signed 16-bit big endian
- "uint16le": unsigned 16-bit little endian
- "uint16be": unsigned 16-bit big endian
- "int32le": signed 32-bit little endian
- "int32be": signed 32-bit big endian
- "uint32le": unsigned 32-bit little endian
- "uint32be": unsigned 32-bit big endian
- "int64le": signed 64-bit little endian
- "int64be": signed 64-bit big endian
- "uint64le": unsigned 64-bit little endian
- "uint64be": unsigned 64-bit big endian
- "float32le": floating point 32-bit little endian
- "float32be": floating point 32-bit big endian
- "float64le": floating point 64-bit little endian
- "float64be": floating point 64-bit big endian
- "utf8": String encoded in UTF-8
Don't miss the examples in the example folder. In the following chapters you can see how to use pyHexDump and its output.
$ pyHexDump dump ./examples/data/aurix_tc397.hex -a 0x80000020
Result:
80000020: 02 58 DA 01 9B 1F 00 F0 0F 4F 10 F0 6C 41 C5 FF
80000030: 00 01 BC F1 82 04 6D 00 2F 04 91 30 00 FF 39 FF
80000040: 30 06 16 0F 4B 0F 41 F1 4B F2 51 F0 3C 01 3C 01
80000050: 4B 0F 31 F1 3B 80 3E 00 4B 0F 01 02 E2 08 3C 01
$ pyHexDump dump ./examples/data/aurix_tc397.hex -a 0x80000020 -dt uint32le
Result:
80000020: 01DA5802 F0001F9B F0104F0F FFC5416C
80000030: F1BC0100 006D0482 3091042F FF39FF00
80000040: 0F160630 F1410F4B F051F24B 013C013C
80000050: F1310F4B 003E803B 02010F4B 013C08E2
80000060: 4800800B 00CE006D 4F409000 80DA4802
80000070: 5008003B F440F5A6 006D8402 01DA028B
80000080: F0108F0F 9000F16C FFFFFFFF 7FFFFFFF
80000090: F0248160 00873802 FFFFFFFF F0248164
800000A0: 0000D066 FFFFFFFF F0248168 00073802
800000B0: FFFFFFFF F024816C 00009826 FFFFFFFF
800000C0: F0248124 000000C9 FFFFFFFF F0248108
800000D0: 30360001 FFFFFFFF F024810C 0B690708
800000E0: FFFFFFFF F0248128 0121048E FFFFFFFF
800000F0: 00000000 FFFFFFFF FFFFFFFF FFFFFFFF
80000100: F8000091 3048FFD9 0200000D 0FDCF402
80000110: 00000000 00000000 00000000 00000000
Calculate a CRC checksum over a specific range.
$ pyHexDump checksum ./examples/data/aurix_tc397.hex -sa 0x80000020 -ea 0x80000040
Result:
219725A2
The following optional arguments are supported:
-bde
The binary data endianess and bit width:- "uint8": unsigned 8-bit
- "uint16le": unsigned 16-bit little endian
- "uint16be": unsigned 16-bit big endian
- "uint32le": unsigned 32-bit little endian
- "uint32be": unsigned 32-bit big endian
-sa
: Start address of the CRC calculation.-ea
: End address of the CRC calculation (not included).-p
: The polynomial for the CRC calculation. Default: 0x04C11DB7-bw
: The bit width, e.g. 8 in case of a CRC-8. Default: 32-s
-: The seed value which to use. Default: 0-ri
: If the input data shall be reflected, set to True. Default: False-ro
: If the output data shall be reflected, set to True. Default: False-fx
: If the output shall be have a final XOR with all bits set, set to True. Default: False
Elements with their name, address, datatype and count can be configured separately.
By using the print
command all of the values in the configuration are printed to the CLI.
The following datatypes are supported:
- "uint8": unsigned 8-bit
- "uint16le": unsigned 16-bit little endian
- "uint16be": unsigned 16-bit big endian
- "uint32le": unsigned 32-bit little endian
- "uint32be": unsigned 32-bit big endian
- "int8": signed 8-bit
- "int16le": signed 16-bit little endian
- "int16be": signed 16-bit big endian
- "int32le": signed 32-bit little endian
- "int32be": signed 32-bit big endian
- "float32le": 32-bit floating point little endian
- "float32be": 32-bit floating point big endian
- "float64le": 64-bit floating point little endian
- "float64be": 64-bit floating point big endian
$ pyHexDump print ./examples/data/aurix_tc397.hex ./examples/config.json --onlyInHex
with config.json
like
{
"elements": [{
"name": "UCB00_BMI_BMHDID",
"addr": "0xAF400000",
"dataType": "uint32le",
"count": 1
}, {
"name": "UCB00_STAD",
"addr": "0xAF400004",
"dataType": "uint32le",
"count": 1
}, {
"name": "UCB00_CRCBMHD",
"addr": "0xAF400008",
"dataType": "uint32le",
"count": 1
}, {
"name": "UCB00_CRCBMHD_N",
"addr": "0xAF40000C",
"dataType": "uint32le",
"count": 1
}, {
"name": "UCB00_PWx",
"addr": "0xAF400104",
"dataType": "uint32le",
"count": 8
}, {
"name": "UCB00_CONFIRMATION",
"addr": "0xAF4001F0",
"dataType": "uint32le",
"count": 1
}]
}
Result:
UCB00_BMI_BMHDID @ AF400000: 0xB35900FE
UCB00_STAD @ AF400004: 0xA0000000
UCB00_CRCBMHD @ AF400008: 0x31795570
UCB00_CRCBMHD_N @ AF40000C: 0xCE86AA8F
UCB00_PWx @ AF400104: [0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000]
UCB00_CONFIRMATION @ AF4001F0: 0x43211234
The Mako template library is used, to provide a lot of functionality. Please have a look to the Mako documentation for details.
A configuration element can be accessed in the template via:
${<config-element-name>}
: Prints the decimal value.${<config-element-name>.hex()}
: Prints the value in hex with "0x" as prefix by default.${<config-element-name>.hex("")}
: Prints the value in hex without a prefix.${<config-element-name>.addr()}
: Prints the address in decimal.
$ pyHexDump print ./examples/data/aurix_tc397.hex ./examples/config.json --templateFile ./examples/markdown.mako
with config.json
like
{
"elements": [{
"name": "UCB00_BMI_BMHDID",
"addr": "0xAF400000",
"dataType": "uint32le",
"count": 1
}, {
"name": "UCB00_STAD",
"addr": "0xAF400004",
"dataType": "uint32le",
"count": 1
}, {
"name": "UCB00_CRCBMHD",
"addr": "0xAF400008",
"dataType": "uint32le",
"count": 1
}, {
"name": "UCB00_CRCBMHD_N",
"addr": "0xAF40000C",
"dataType": "uint32le",
"count": 1
}, {
"name": "UCB00_PWx",
"addr": "0xAF400104",
"dataType": "uint32le",
"count": 8
}, {
"name": "UCB00_CONFIRMATION",
"addr": "0xAF4001F0",
"dataType": "uint32le",
"count": 1
}]
}
with markdown.mako
like
<%text># Aurix TC397 - Blinky Example</%text>
<%text>## User Control Block 00</%text>
|Short Name|Value|
|----------|-----|
| BMI_BMHDID | ${UCB00_BMI_BMHDID.hex()} |
| STAD | ${UCB00_STAD.hex()} |
| CRCBMHD | ${UCB00_CRCBMHD.hex()} |
| CRCBMHD_N | ${UCB00_CRCBMHD_N.hex()} |
| PWx | ${UCB00_PWx.hex()} |
| CONFIRMATION | ${UCB00_CONFIRMATION.hex()} |
<%
bmi_bmhdid = UCB00_BMI_BMHDID
bmi = (bmi_bmhdid >> 0) & 0xFFFF
bmhdid = (bmi_bmhdid >> 16) & 0xFFFF
pindis = (bmi >> 0) & 0x01
hwcfg = (bmi >> 1) & 0x07
mode_by_hwcfg = "disabled"
if pindis == 0:
mode_by_hwcfg = "enabled"
start_up_mode = "invalid"
if hwcfg == 0x07:
start_up_mode = "internal start from flash"
elif hwcfg == 0x06:
start_up_mode = "alternate boot mode"
elif hwcfg == 0x04:
start_up_mode = "generic bootstrap loader mode"
elif hwcfg == 0x03:
start_up_mode = "asc bootstrap loader mode"
is_bmh_valid = "invalid"
if bmhdid == 0xB359:
is_bmh_valid = "OK"
calculated_crc_bmhd = m_calc_checksum("uint32le", UCB00_BMI_BMHDID.addr(), UCB00_CRCBMHD.addr(), 0x04c11db7, 32, 0xffffffff, True, True, True)
calculated_crc_bmhd_n = m_calc_checksum("uint32le", UCB00_BMI_BMHDID.addr(), UCB00_CRCBMHD.addr(), 0x04c11db7, 32, 0xffffffff, True, True, False)
is_bmh_integrity_given = "Not OK"
if calculated_crc_bmhd == UCB00_CRCBMHD:
if calculated_crc_bmhd_n == UCB00_CRCBMHD_N:
is_bmh_integrity_given = "OK"
%>
<%text>### Boot Mode Index (BMI)</%text>
* Mode selection by configuration pins: ${mode_by_hwcfg}
* Start-up mode: ${start_up_mode}
<%text>### Boot Mode Header Identifier (BMHDID)</%text>
Is boot mode header valid: ${is_bmh_valid}
<%text>### Boot Mode Header CRC (CRCBMHD/CRCBMHD_N)</%text>
Is boot mode header integrity given: ${is_bmh_integrity_given}
Result:
# Aurix TC397 - Blinky Example
## User Control Block 00
|Short Name|Value|
|----------|-----|
| BMI_BMHDID | 0xB35900FE |
| STAD | 0xA0000000 |
| CRCBMHD | 0x31795570 |
| CRCBMHD_N | 0xCE86AA8F |
| PWx | [0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000] |
| CONFIRMATION | 0x43211234 |
### Boot Mode Index (BMI)
* Mode selection by configuration pins: enabled
* Start-up mode: internal start from flash
### Boot Mode Header Identifier (BMHDID)
Is boot mode header valid: OK
### Boot Mode Header CRC (CRCBMHD/CRCBMHD_N)
Is boot mode header integrity given: OK
If several elements are right behind each other like in a structure, it can be configured in a similar way by using a list of elements for a datatype. The address of each element in the structure is calculated by the given base address in the datatype of each element.
Note that nested structures are not supported yet!
{
"elements": [{
"name": "UCB00",
"addr": "0xAF400000",
"dataType": [{
"name": "BMI_BMHDID",
"dataType": "uint32le",
"count": 1
}, {
"name": "STAD",
"dataType": "uint32le",
"count": 1
}, {
"name": "CRCBMHD",
"dataType": "uint32le",
"count": 1
}, {
"name": "CRCBMHD_N",
"dataType": "uint32le",
"count": 1
}, {
"name": "PWx",
"dataType": "uint32le",
"offset": "0x0104",
"count": 8
}, {
"name": "CONFIRMATION",
"dataType": "uint32le",
"offset": "0x01F0",
"count": 1
}],
"count": 1
}]
}
To access it in the template, you can use the "." notation or Python dictionary syntax.
with markdown.mako
like
<%text># Aurix TC397 - Blinky Example</%text>
<%text>## User Control Block 00</%text>
|Short Name|Value|
|----------|-----|
| BMI_BMHDID | ${UCB00.BMI_BMHDID.hex()} |
| STAD | ${UCB00.STAD.hex()} |
| CRCBMHD | ${UCB00.CRCBMHD.hex()} |
| CRCBMHD_N | ${UCB00.CRCBMHD_N.hex()} |
| PWx | ${UCB00.PWx.hex()} |
| CONFIRMATION | ${UCB00.CONFIRMATION.hex()} |
<%
bmi_bmhdid = UCB00.BMI_BMHDID
bmi = (bmi_bmhdid >> 0) & 0xFFFF
bmhdid = (bmi_bmhdid >> 16) & 0xFFFF
pindis = (bmi >> 0) & 0x01
hwcfg = (bmi >> 1) & 0x07
mode_by_hwcfg = "disabled"
if pindis == 0:
mode_by_hwcfg = "enabled"
start_up_mode = "invalid"
if hwcfg == 0x07:
start_up_mode = "internal start from flash"
elif hwcfg == 0x06:
start_up_mode = "alternate boot mode"
elif hwcfg == 0x04:
start_up_mode = "generic bootstrap loader mode"
elif hwcfg == 0x03:
start_up_mode = "asc bootstrap loader mode"
is_bmh_valid = "invalid"
if bmhdid == 0xB359:
is_bmh_valid = "OK"
calculated_crc_bmhd = m_calc_checksum("uint32le", UCB00.BMI_BMHDID.addr(), UCB00.CRCBMHD.addr(), 0x04c11db7, 32, 0xffffffff, True, True, True)
calculated_crc_bmhd_n = m_calc_checksum("uint32le", UCB00.BMI_BMHDID.addr(), UCB00.CRCBMHD.addr(), 0x04c11db7, 32, 0xffffffff, True, True, False)
is_bmh_integrity_given = "Not OK"
if calculated_crc_bmhd == UCB00.CRCBMHD:
if calculated_crc_bmhd_n == UCB00.CRCBMHD_N:
is_bmh_integrity_given = "OK"
%>
<%text>### Boot Mode Index (BMI)</%text>
* Mode selection by configuration pins: ${mode_by_hwcfg}
* Start-up mode: ${start_up_mode}
<%text>### Boot Mode Header Identifier (BMHDID)</%text>
Is boot mode header valid: ${is_bmh_valid}
<%text>### Boot Mode Header CRC (CRCBMHD/CRCBMHD_N)</%text>
Is boot mode header integrity given: ${is_bmh_integrity_given}
If a structure shall be used several times, define it as a datatype and use its name.
{
"elements": [{
"name": "UCB00",
"addr": "0xAF400000",
"dataType": "UCB_t",
"count": 1
}],
"structures": [{
"name": "UCB_t",
"elements": [{
"name": "BMI_BMHDID",
"dataType": "uint32le",
"count": 1
}, {
"name": "STAD",
"dataType": "uint32le",
"count": 1
}, {
"name": "CRCBMHD",
"dataType": "uint32le",
"count": 1
}, {
"name": "CRCBMHD_N",
"dataType": "uint32le",
"count": 1
}, {
"name": "PWx",
"dataType": "uint32le",
"offset": "0x0104",
"count": 8
}, {
"name": "CONFIRMATION",
"dataType": "uint32le",
"offset": "0x01F0",
"count": 1
}]
}]
}
The following macros are available in the templates.
Compares the set value with the actual value.
Parameters:
- set_value: Set value
- actual_value: Actual value
- value_format="{:02X}": Value format used to print them in case they are different.
Returns:
- "Ok": If both values are equal.
- "Not Ok (Set: <set_value>, Actual: <actual_value>)": If the values are different.
Read unsigned 8-bit value from binary data at given address and returns it.
Parameters:
- addr: Address
Read unsigned 16-bit little endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read unsigned 16-bit big endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read unsigned 32-bit little endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read unsigned 32-bit big endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read unsigned 64-bit little endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read unsigned 64-bit big endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read signed 8-bit value from binary data at given address and returns it.
Parameters:
- addr: Address
Read signed 16-bit little endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read signed 16-bit big endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read signed 32-bit little endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read signed 32-bit big endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read signed 64-bit little endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read signed 64-bit big endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read floating point 32-bit little endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read floating point 32-bit big endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read floating point 64-bit little endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read floating point 64-bit big endian value from binary data at given address and returns it.
Parameters:
- addr: Address
Read string from binary data at given address and returns it. It will consider the string termination.
Parameters:
- encoding: The character encoding.
- Default: utf-8
Calculate the CRC checksum.
Parameters:
- binary_data_endianess: The binary data endianess and bit width:
- "uint8": unsigned 8-bit
- "uint16le": unsigned 16-bit little endian
- "uint16be": unsigned 16-bit big endian
- "uint32le": unsigned 32-bit little endian
- "uint32be": unsigned 32-bit big endian
- start_address: Start address of the CRC calculation.
- end_address: End address of the CRC calculation (not included).
- polynomial: The polynomial for the CRC calculation.
- bit_width: The bit width, e.g. 8 in case of a CRC-8.
- seed: The seed value which to use.
- reverse_input: If the input data shall be reflected, set to True otherwise to False.
- reverse_output: If the output data shall be reflected, set to True otherwise to False.
- final_xor: If the output shall be have a final XOR with all bits set, set to True otherwise to False.
Swaps the bytes of a unsigned 16-bit value.
Parameters:
- value: Source value
Returns:
- Swapped value
Swaps the bytes of a unsigned 32-bit value.
Parameters:
- value: Source value
Returns:
- Swapped value
Swaps the 16-bit words of a unsigned 32-bit value.
Parameters:
- value: Source value
Returns:
- Swapped value
Used 3rd party libraries which are not part of the standard Python package:
- intelhex - Reading files in IntelHex format - BSD-3 License.
- Mako - Template engine - MIT License
- toml - Parsing TOML - MIT License
If you have further ideas or you found some bugs, great! Create a issue or if you are able and willing to fix it by yourself, clone the repository and create a pull request.
The whole source code is published under the MIT license. Consider the different licenses of the used third party libraries too!
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, shall be licensed as above, without any additional terms or conditions.