insights-client is the Client API Wrapper for the Client API that lives in the Insights Core.
Instructions are for RHSM-subscribed machines only.
- Clone this repo and https://github.com/RedHatInsights/insights-core to the same directory.
$ git clone git@github.com:RedHatInsights/insights-client.git
$ git clone git@github.com:RedHatInsights/insights-core.git
- If you are using a virtual environment for development, make sure that it uses the system-site-packages flag.
If not, set the variable to true in the
pyvenv.cfg
file.
--system-site-packages = true
- Bootstrap the configuration script. This will run
configure
for you with no options.
$ cd insights-client
$ ./autogen.sh
$ make
- To build an insights-core egg from source, run
./build_client_egg.sh
in the insights-core repo.
$ ../insights-core/build_client_egg.sh
- Run the client with the following options to disable GPG since this egg is unsigned.
$ sudo BYPASS_GPG=True EGG=../insights-core/insights.zip ./src/insights-client --no-gpg
- Repeat steps 4 and 5 upon making code changes. The majority of the client code lives in
insights-core/insights/client
.
The Insights Client consists of two pieces: the main RPM-installed executable that ships with RHEL (from here on, referred to as wrapper), and the updatable core module (from here on, referred to as egg).
The wrapper is the main entry point for the Insights Client. The wrapper's job is to initiate the phases, described later. For each phase, the wrapper iterates through the available eggs, and tries each in succession to perform a successful collection & upload. If an egg fails, the client will go on to try the next available egg. If all eggs fail, execution will halt. All possible eggs are described as follows, in the order in which they are tried:
ENV_EGG
- an egg specified by the environment variableEGG
NEW_EGG
- newest available egg, if an update has been performedSTABLE_EGG
- the last egg that performed a successful collection & uploadRPM_EGG
- the default egg that ships with the RPM
The egg is the bundle that contains the Insights Core module, which has all the main functionality in it. All the options/switches/config are passed through to the egg from the wrapper. The egg contains phase information
Summary of the client's run, from start to finish.
The configuration file uses INI format (see configparser).
The main section for configuration variables is [insights-client]
.
Configuration follows a precedence hierarchy of CLI -> /etc/insights-client/insights-client.conf
file -> environment variable.
Environment configuration can be used by setting environment variables with names in the format INSIGHTS_xxxxxx, where xxxxxx is the configuration variable name, in all caps.
Configuration variables available in the configuration file and their explanations:
loglevel
- set the Python logger's default level (DEBUG
,INFO
,WARNING
,ERROR
,CRITICAL
). DefaultDEBUG
auto_config
- attempt to auto-configure the network connection with Satellite or RHSM. DefaultTrue
authmethod
- authentication method for the Portal (BASIC, CERT). Default isBASIC
Note: whenauto_config
is enabled,CERT
will be used if RHSM or Satellite is detectedusername
- username for basic auth. Blank by defaultpassword
- password for basic auth. Blank by defaultbase_url
- base url for the Insights API. Defaultcert-api.access.redhat.com:443/r/insights
cert_verify
- path to CA cert to verify SSL against. Default/etc/insights-client/cert-api.access.redhat.com.pem
proxy
- proxy URL. Blank by defaultauto_update
- whether to update the rule spec file (uploader.json) and the insights-core egg. DefaultTrue
obfuscate
- whether to obfuscate IP addresses in collected data. DefaultFalse
obfuscate_hostname
- whether to obfuscate hostnames in collected data. DefaultFalse
cmd_timeout
- how many seconds to allow a command to run before issuing a termination or kill signal. Default 120http_timeout
- how many seconds to allow an HTTP call to execute before timing out. Default 120core_collect
- ifTrue
, use insights-core to run collection instead of commands/files from uploader.json. DefaultFalse
redaction_file
- location of the redaction file. Default/etc/insights-client/file-redaction.yaml
content_redaction_file
- location of the content redaction file. Default/etc/insights-client/file-content-redaction.yaml
legacy_upload
- Use legacy HTTP configuration to perform the upload. DefaultTrue
Command line switches available and their explanations.
--register
- Register a system with the Insights API. Required for basic collection & upload, except in the case of certain targets or if--offline
is specified--display-name=DISPLAYNAME
- Display name to appear in the insights web UI. This can be used at registration time, or standalone to change the display name any time.--group=GROUP
- Group to add the system to in the insights web UI. Only used on machine registration.--retry=RETRIES
- Number of times to retry the collection archive upload. Default is 1.--quiet
- Run with limited console output. This will only print ERROR level messages.--silent
- Run with no console output at all.--conf
,-c
- Load a custom configuration file other than the default/etc/insights-client/insights-client.conf
--offline
- Run with no network connectivity at all. Implies--no-upload
and makes machine registration unnecessary.--logging-file
- Log to a file other than the default/var/log/insights-client/insights-client.log
--force-reregister
- Force a new registration. This delete's the machine's existing machine-id and registers a new one.--verbose
- Run with all log output. This will print DEBUG level messages.--no-upload
- Collect the archive, but do not upload it.--keep-archive
- Collect the archive, and do not delete it after upload.--net-debug
- Show network debug messages in the console output.--output-dir=DIR
- Write the collected data to a specified directory without compression. Does not upload.--output-file=FILE
- Write the collected data to a specified archive. Does not upload.
These particular switches supersede normal client operation; they skip collection and exit after printing their necessary output.
--version
- Print the versions of both the wrapper and egg, then exit.--unregister
- Unregister this system from the Insights API.--display-name=DISPLAYNAME
- When used without--register
, change the display name and exit.--validate
- Validate the format of the remove.conf file.--enable-schedule
- Enable the Insights systemd job.--disable-schedule
- Disable the Insights systemd job.--test-connection
- Run a test to confirm connectivity from the machine to the Insights API.--support
- Print a log of basic diagnostics such as version, registration status, connectivity, config, etc.--status
- Print the registration status.--payload=PAYLOAD
- Upload a specified filePAYLOAD
. Requires--content-type
--content-type=CONTENTTYPE
- Specify the console.redhat.com platform-specific content type for--payload
option.--diagnosis=ID
- Retrieve remediations for this host, optionally providing a diagnosis ID.--compliance
- Perform a compliance upload.--show-results
- Log the cached system profile to console. See also:--check-results
under Hidden switches
Hidden switches
These switches are undocumented and for developer use only.
--no-gpg
- Run without verifying the signature of the egg or rule collection spec.--debug-phases
- Print info about phase execution and egg fallback--to-json
- Print the collection results to the console as JSON. Deprecated as rule results are no longer returned by the upload service.--check-results
- Fetch the system profile and cache it. Produces no output to console. Not meant to be run as a standalone option but rather as part of a regular systemd job that refreshes the cached data.
For convenience, some sample configs are provided here for developers for connecting to the different environments the client can interface with. These configurations can be defined via config file or via environment variables using the naming described under Environment Variables. The following are in config file notation and can be used as drop-in configuration.
Note: CI requires basic auth.
[insights-client]
auto_config=False
username=<username>
password=<password>
legacy_upload=False
base_url=ci.cloud.redhat.com/api
cert_verify=False
Note: QA requires basic auth.
[insights-client]
auto_config=False
username=<username>
password=<password>
legacy_upload=False
base_url=qa.cloud.redhat.com/api
cert_verify=False
Note: This configuration assumes that the system is registered to Stage RHSM. Insights Client will autoconfigure to interface with stage.
[insights-client]
proxy=http://squid.corp.redhat.com:3128
[insights-client]
auto_config=False
username=<username>
password=<password>
legacy_upload=False
base_url=cert.cloud.stage.redhat.com/api
cert_verify=True
proxy=http://squid.corp.redhat.com:3128
[insights-client]
auto_config=False
authmethod=CERT
legacy_upload=False
base_url=cert.cloud.redhat.com/api
cert_verify=True
[insights-client]
auto_config=False
username=<username>
password=<password>
legacy_upload=False
base_url=cert.cloud.redhat.com/api
cert_verify=True
No additional configuration is required beyond the defaults. Insights Client will autoconfigure to interface with classic prod.
[insights-client]
auto_config=False
username=<username>
password=<password>
The Insights Client runs using phases of execution, modularized so that they if one crashes due to a bad egg, they can be resumed at the current phase using the following egg in the priority list.
Execute any "switches that exit immediately" that were specified (except --status
and --unregister
), and exit if needed
Establish a connection to Insights and update the egg if the egg available upstream is newer than newest.egg (check etags).
Download the newest version of uploader.json
for file collection (legacy collection only).
Process registration options. Check registration. If unregistered and operating in a mode in which registration is required, exit.
Run the collection. Output to the desired format; default is archive uploaded to Insights.
Insights Client utilizes insights-core to perform data collection. Details on the workings of insights-core can be found here: https://github.com/RedHatInsights/insights-core#documentation
The Insights archive has the following structure:
insights.tar.gz
│
├───blacklist_report <-- Usage metrics for file redaction
├───branch_info <-- Satellite metadata
├───collection_stats <-- Collection execution metrics (e.g., elapsed time, command exit codes)
├───data
│ ├───boot
│ │ └───...
│ ├───etc
│ │ ├───insights-client
│ │ │ └───machine-id <-- unique identifier for this system
│ │ └───...
│ ├───insights_commands <-- output dumps of commands
│ │ └───...
│ ├───proc
│ │ └───...
│ ├───sys
│ │ └───...
│ └───var
│ └───...
├───display_name <-- Display name of this host, if specified
├───egg_release <-- Record of the egg release channel
├───metadata <-- Metadata generated by insights-core
│ └───...
├───tags.json <-- Tags for this host
└───version_info <-- Record of the client and core versions
Note: This method of collection is deprecated in favor of core collection and is planned to be removed in a future release.
Insights Client's classic collection has two parts: files and commands. Both are implemented in insights-core/insights/client/insights_spec.py
.
Files are collected using sed to scrape file contents, with the following caveats:
- Files defined in the
files
section offile-redaction.yaml
are skipped completely - Lines containing anything in the
patterns
section offile-content-redaction.yaml
will be omitted from the file content - Keywords defined in the
keywords
section offile-content-redaction.yaml
will be replaced withkeyword#
in the file content. The file contents are copied to a path within the archive defined from theuploader.json
, which in most cases is similar or identical to its path on the filesystem mountpoint, i.e./etc/cluster/cluster.conf
.
Commands are run such that their output is collected by the client, with the following caveats:
- Commands
rm
,kill
,reboot
,shutdown
are never run - Commands defined in the
commands
section offile-redaction.yaml
are skipped completely - Lines containing anything in the
patterns
section offile-content-redaction.yaml
will be omitted from command output - Keywords defined in the
keywords
section offile-content-redaction.yaml
will be replaced withkeyword#
in the command output - If a command does not exist, the client will gracefully report as such, and continue.
Command outputs are copied to a path within the archive
insights_commands/<command>
where<command>
is an escaped format of the full command syntax, including arguments, i.e.,/insights_commands/df_-alP
.
The legacy Insights archive has the following structure:
insights.tar.gz
│
├───blacklist_report <-- Usage metrics for file redaction
├───boot
│ └───...
├───branch_info <-- Satellite metadata
├───collection_stats <-- Collection execution metrics (e.g., elapsed time, command exit codes)
├───display_name <-- Display name of this host, if specified
├───egg_release <-- Record of the egg release channel
├───etc
│ ├───insights-client
│ │ └───machine-id <-- unique identifier for this system
│ └───...
├───insights_commands <-- output dumps of commands
│ └───...
├───proc
│ └───...
├───sys
│ └───...
├───tags.json <-- Tags for this host
├───var
│ └───...
└───version_info <-- Record of the client and core versions