scalyr/scalyr-agent-2

[error="failedCheckpointWrite"] Could not write checkpoint file

msdhillon8989 opened this issue · 1 comments

File path: '/var/lib/scalyr-agent-2/active-checkpoints-worker-default-0.json', type: <class 'str'>.
Temporary file path: '/var/lib/scalyr-agent-2/active-checkpoints-worker-default-0.json~', type: <class 'str'>.
File exists: True.
Temporary file exists: True.
File system encoding: utf-8.
Error: :stack_trace:
OSError: [Errno 28] No space left on device

Seems like scalyr is not deleting the tempopapry checkpoint file in container. Is there any configuration to del temp files ?

@msdhillon8989 Sorry for the delayed response.

The checkpoint files should only occupy a small portion of the disk and the value is overwritten not appended.

ubuntu@ip-172-31-20-128:/var/lib/scalyr-agent-2$ ls -al
total 28
drwxrwxr-x  2 root root 4096 Feb 28 20:24 .
drwxr-xr-x 45 root root 4096 Oct  6  2022 ..
-rw-r--r--  1 root root 1588 Feb 28 20:24 active-checkpoints-worker-default-0.json
-rw-r--r--  1 root root 1880 Feb 28 20:23 checkpoints-worker-default-0.json
-rw-r--r--  1 root root 1870 Feb 28 20:24 checkpoints.json
-rw-r--r--  1 root root 2744 Jan 20 23:47 last_status
-rw-r--r--  1 root root    4 Jan 20 23:47 status_format
ubuntu@ip-172-31-20-128:/var/lib/scalyr-agent-2$ vi checkpoints-worker-default-0.json
ubuntu@ip-172-31-20-128:/var/lib/scalyr-agent-2$ sudo scalyr-agent-2 version
The Scalyr Agent 2 version is 2.2.

I believe that The error message suggests the host's disk is full so the checkpoint file cannot be rewritten. It doesn't necessarily mean that the checkpoint files are the culprit and occupy the entire disk space.