hzeller/rpi-rgb-led-matrix

Another flicker issue

btownshend opened this issue · 68 comments

First, thanks for a great library and all the thought that's gone into the details!

I'm working with a 64x64 matrix and see drops in intensity randomly every 5-15s. I've gone through your notes and tried everything to eliminate these:

  • added 3300 uf low-ESD caps at the leads
  • have a solid 40A power supply with heavy gauge hookup
  • using an Adafruit hat with the PWM modification
  • stopped audio, GUI on the Pi
  • no other processes running
  • tried the iso1cpus=3 setting
  • running with
    pixel-push -i wlan0 -U --led-chain=4 --led-parallel=1 -R 90 -u 65507 --led-show-refresh --led-scan-mode=0 --led-gpio-mapping=adafruit-hat-pwm --led-slowdown-gpio=2 -d
  • uname -a:
    Linux raspberrypi 4.9.35-v7+ #1014 SMP Fri Jun 30 14:47:43 BST 2017 armv7l GNU/Linux
  • running on Pi 3 Model B

I hooked up a photoresistor to a scope to capture what I'm seeing visually -- see attached image. Looks like ~160Hz refresh rate with occasional drops in level.
img_3906

Any other ideas?

Thanks,
Brent

  • Make sure to not run anything else on the Pi. Just having a top running, will reduce brightness every time there is refresh of the display.
  • it is isolcpus not iso1cpus (for cpu isolation)
  • Try setting the DISABLE_RT_THROTTLE setting. This in particular could explain relatively regular brightness changes.

• Nothing else running
• Compiled with DISABLE_RT_THROTTLE -- no difference
• Yes, I put in "isolcpus=3" in /boot/cmdline.txt (with an "el" not "1")
On the last point though, looking at the output of pidstat shows pixel-push on CPU1 along with several other daemons.

`pi@raspberrypi:~/Beatfield/RaspberryPi/extern/rpi-matrix-pixelpusher $ pidstat
Linux 4.9.35-v7+ (raspberrypi)  01/19/2018      _armv7l_        (4 CPU)

10:24:27 PM   UID       PID    %usr %system  %guest    %CPU   CPU  Command
10:24:27 PM     0         1    0.04    0.04    0.00    0.08     2  systemd
10:24:27 PM     0         3    0.00    0.00    0.00    0.00     0  ksoftirqd/0
10:24:27 PM     0         7    0.02    0.00    0.00    0.02     2  rcu_sched
10:24:27 PM     0         9    0.00    0.00    0.00    0.00     0  migration/0
10:24:27 PM     0        13    0.00    0.00    0.00    0.00     1  migration/1
10:24:27 PM     0        14    0.00    0.00    0.00    0.00     1  ksoftirqd/1
10:24:27 PM     0        18    0.00    0.00    0.00    0.00     2  migration/2
10:24:27 PM     0        19    0.00    0.00    0.00    0.00     2  ksoftirqd/2
10:24:27 PM     0        23    0.00    0.00    0.00    0.00     3  migration/3
10:24:27 PM     0        87    0.00    0.07    0.00    0.07     0  irq/92-mmc1
10:24:27 PM     0        90    0.00    0.02    0.00    0.02     0  mmcqd/0
10:24:27 PM     0        92    0.00    0.00    0.00    0.00     3  kworker/3:1
10:24:27 PM     0        93    0.00    0.00    0.00    0.00     0  jbd2/mmcblk0p7-
10:24:27 PM     0       128    0.03    0.02    0.00    0.05     1  systemd-journal
10:24:27 PM     0       136    0.00    0.01    0.00    0.01     0  kworker/0:2
10:24:27 PM     0       147    0.01    0.00    0.00    0.01     0  systemd-udevd
10:24:27 PM     0       256    0.03    0.00    0.00    0.03     0  brcmf_wdog/mmc1
10:24:27 PM     0       433    0.00    0.00    0.00    0.00     2  cron
10:24:27 PM     0       434    0.01    0.01    0.00    0.02     0  rsyslogd
10:24:27 PM   105       435    0.21    0.07    0.00    0.28     0  avahi-daemon
10:24:27 PM   104       440    0.01    0.00    0.00    0.01     1  dbus-daemon
10:24:27 PM 65534       475    0.00    0.00    0.00    0.00     1  thd
10:24:27 PM     0       477    0.00    0.00    0.00    0.00     0  dhcpcd
10:24:27 PM     0       519    0.00    0.00    0.00    0.00     2  systemd-logind
10:24:27 PM     0       536    0.00    0.00    0.00    0.00     0  sshd
10:24:27 PM     0       538    0.00    0.00    0.00    0.00     2  vncserver-x11-s
10:24:27 PM     0       561    0.02    0.04    0.00    0.05     0  vncserver-x11-c
10:24:27 PM     0       562    0.00    0.00    0.00    0.00     1  wpa_supplicant
10:24:27 PM     0       572    0.00    0.00    0.00    0.00     1  agetty
10:24:27 PM   106       574    0.01    0.01    0.00    0.01     1  ntpd
10:24:27 PM     0       709    0.00    0.00    0.00    0.00     2  bluetoothd
10:24:27 PM     0      1082    0.00    0.00    0.00    0.00     2  kworker/2:0
10:24:27 PM     0      1243    0.00    0.00    0.00    0.00     0  sshd
10:24:27 PM  1000      1247    0.00    0.00    0.00    0.00     2  systemd
10:24:27 PM  1000      1254    0.03    0.03    0.00    0.05     2  sshd
10:24:27 PM  1000      1256    0.05    0.01    0.00    0.05     0  bash
10:24:27 PM     0      1257    0.00    0.06    0.00    0.06     2  kworker/u8:2
10:24:27 PM     0      2159    0.00    0.00    0.00    0.00     0  packagekitd
10:24:27 PM     0      2163    0.00    0.00    0.00    0.00     0  polkitd
10:24:27 PM     0      2183    0.00    0.01    0.00    0.01     0  kworker/u8:3
10:24:27 PM     0      2266    0.00    0.00    0.00    0.00     1  kworker/1:0
10:24:27 PM     0      2301    0.00    0.00    0.00    0.00     2  kworker/u8:1
10:24:27 PM     0      2473    0.00    0.00    0.00    0.00     1  kworker/1:1
10:24:27 PM     0      2539    0.00    0.00    0.00    0.00     3  kworker/3:0
10:24:27 PM     1      2550    6.64    0.26    0.00    6.90     1  pixel-push
10:24:27 PM     0      3429    0.00    0.01    0.00    0.01     0  kworker/u8:0
10:24:27 PM     0      3432    0.00    0.00    0.00    0.00     1  kworker/1:2
10:24:27 PM  1000      3537    0.00    0.00    0.00    0.00     1  pidstat
pi@raspberrypi:~/Beatfield/RaspberryPi/extern/rpi-matrix-pixelpusher $ `

Or am I missing something? Do I need to use taskset to assign pixel-push to CPU 3?

Hmm, modified my startup script to use:

taskset -c 3 pixel-push -i wlan0 -U --led-chain=4 --led-parallel=1 -R 90 -u 65507 --led-
show-refresh --led-scan-mode=0 --led-gpio-mapping=adafruit-hat-pwm --led-slowdown-gpio=2 -d

Check with pidstat now shows only pixel-push on CPU 3 (with only kworker/3:0, kworker/3:1, and migration/3), other processes on other CPUs

But, perhaps it glitches a little less, but there are still noticeable dropouts at 5-10s intervals.

Also, note that I'm not sending any data continually to the pixelpush -- just sent a fixed all-on display and then leaving that static.

Unfortunately, the kernel sometimes takes away some CPU time doing just whatever accounting it wants, even though it should leave a RT thread isolated on core3 alone. But yeah, there are kernel workers on that core - I wonder if they can be disabled ?

Only the main update thread should run on core3, all the other threads of pixel-push can (and should) be on other cores.

Interesting if the taskset is not working, I wonder if this also results in the realtime-thread not being a realtime thread ? Can you check if the thread that is using the most CPU (that is the update thread) is actually a RT thread ? Maybe that feature is not compiled into that kernal ?

In the end, we're dealing with a multuser/multitasking operating system that in the stock kernel version is not optimized for realtime stuff. So it might be worthwhile trying to get the most stripped-down kernel with all the realtime-ness goodies enabled compiled. Unfortunately, I don't know the best settings here, but if there is someone here knowing about a good set of kernel settings to enable and disable, that would be very helpful.

If I understand correctly, then pidstat and chrt will not show the process on CPU 3 or the scheduling as FIFO because only the refresh thread is set to be realtime on CPU 3? Because if it is launched normally, the process ends up with SCHED_OTHER and not bound to CPU 3.

I tried launching it with:
chrt -f 99 taskset -c 3
and then I do indeed see that chrt shows the correct scheduler (FIFO) and priority (99) and pidstat does show CPU3.

Yet, the flickering continues.

I noticed that there is a SLEEP_JITTER define in the code -- will that help isolate whether the issue is really CPU scheduling?

the DEBUG_SLEEP_JITTER is just a way to quantify the jitter for sleeping, but if you are using the PWM, sleep is essentially not used.

I suspect that the kernel is taking some CPU time when clocking in the data, which means that the time sending things into the matrix takes longer, so there are longer dark phases.

Best next steps are essentially looking what is happening when you see the brightness change. As described in the readme, some user was seeing that a regular ntp update was creating some faint flicker, which went away when it was disabled.

Sounds like a tricky thing to debug!

I killed ntpd, might've made things a little better, but there are still flickers.

Note that running things in a terminal window creates a lot of flicker (e.g. "service --status-all"
is especially bad). Seems to me that if the panel is running on an isolated CPU, running user-level commands shouldn't affect it much.

Is there any way the library can detect if something was off in timing? Or, failing that, is there a good place (or place(s)) to hook a scope onto the PI to measure precisely what's going wrong?

Thanks for all the help!
Brent

yes, running things in the terminal creates a lot of flicker. I suspect there is something going on with using the memory bus that starves the memory bus. I noticed that myself in some situations, but haven't had a time to find a strategy how to poke inside the kernel to figure out what is going on. So if you want to look into that, that would be highly appreciated.

Regarding measuring, I'd probably hook the scope on the strobe/latch signal. It should be triggered with (number_of_rows/2) * refresh frequency so you can quickly see if there is some huge variation. Now, how to correlate that with something that goes on inside the kernel (between the last and the delayed strobe signal that is), I don't know. I suspect, there are some kernel activity logging tools, so if we can timestamp things properly there, it might lead to the culprit...

Ok, will try that. In the meantime, have been playing with the 'perf' command and looking at:
[http://epickrram.blogspot.com/2015/09/reducing-system-jitter.html]

Using:
sudo perf record -e "sched:sched_switch" -C 3
then
sudo perf script

Clearly shows context switching between pixel_push on CPU 3 and other processes (such as the bash, grep, etc generated by running "service -s-status-all") that shouldn't be on CPU 3, if I understand isolcpu correctly. Yet, pidstat never shows anything but the kworkers and migration on that CPU.

Yes, I also have the strong suspicion that the kernel only takes the isolcpu as, uhm, 'suggestion'.
So would be good if you could figure out what else we have to do to make it work as we expect it to.

Thanks for joining in Andy.

That sounds reasonable, but what I'm seeing is that shell scripts run on the command line end up with their commands (grep, bash, etc) running on the isolated CPU. Or at least that's what perf shows. I'm wondering if maybe they get launched on whatever CPU and then subsequently get moved off of the isolated one.

Henner,
I was able to capture the signals on a scope and I think I see the physical manifestation of this. On the image below, the top trace is clk, then latch, and the bottom is OE. This appears to have captured 3 full updates. The right 2 look similar, but the left one has a 20usec gap in it. There may also be longer gaps, but I haven't figured out how to get the scope to trigger on the failures.
img_3907

OK, so my mistake on one part of this: I had isolcpus=3 on a separate line of /boot/cmdline.txt instead of at the end of the first line, which caused it to be ignored. I noticed this when checking /proc/cmdline to verify everything. Unfortunately, it still doesn't fix the problem, though now I don't see any unexpected commands runnings on CPU 3 when using "perf record" other than kworker/3 and those don't seem to be correlated with the display issues.

So, even running something on another CPU is impacting the display -- it does NOT seem to be a CPU allocation issue. As you suggested, perhaps it is a memory bus issue.

But there's one other thing I don't understand. If I write 255,255,255 to all pixels of the display, then none of the updates of the display should ever turn off any LEDs. Regardless of delays in the PWM, the display should just be constant on, right? So, why are we seeing flicker? Is the data getting misclocked?

The displays can't display a full screen of data, it is multiplexed between the rows, so this is why the CPU constantly has to clock in data and why it is sensitive to CPU or memory being allocated elsewhere.

There are only two rows lit at any time, it is cycling through that quickly...

So I have now installed a pi with a recent raspbian (2017-11-29) and I can confirm that the flicker changes are quite noticeable vs. some 2016 old version of Raspbian I had lying around. So it used to be better.

Something is going on where the kernel is doing something on that third core even though it is asked not to (even if isolcpus=3 is set and echo -1 > /proc/sys/kernel/sched_rt_runtime_us).

So I guess we should start making a minimal Linux distribution, with a kernel with various realtime and nohz options compiled in, and unnecessary cruft removed, so that we don't rely on the volatile nature of whatever Raspian is providing.

Anyone would like to contribute that ? Or maybe knows a minimal Raspberry Pi distribution that is up for the task ?

Yes, there are sleeps in there for areas that are longer wait periods, and there is code you can use to work around sleeping glitches.

First, you set the sleeping jitter debug define in lib/gpio.cc to 1

#define DEBUG_SLEEP_JITTER 1

Then recompile and run a tool, such as the led-image-viewer that shows a static image for a while and then CTRL-C to stop it. Then you can see a histogram of values that were larger than the already built-in jitter allowance.
So I just did that with the new Pi and it is noticeable, that while most of the time the sleeps stay within 25usec overshoot, sometimes, it is some 35usec longer than even that, so 60usec total:

Overshoot histogram >= EMPIRICAL_NANOSLEEP_OVERHEAD_US=25
  usec |   count |   accum
<=  0us:   131275  99.969%
 +  1us:        3  99.971%
 +  2us:        4  99.974%
 +  5us:        5  99.978%
 +  6us:        2  99.979%
 +  7us:        1  99.980%
 + 14us:        1  99.981%
 + 30us:        1  99.982%
 + 32us:        1  99.982%
 + 33us:        4  99.986%
 + 34us:       15  99.997%
 + 35us:        2  99.998%
 + 36us:        1  99.999%
 + 38us:        1 100.000%

So now, you can adjust the allowance by this additional value. So, let's set it to 60usec, by changing the define before that in lib/gpio.cc

#define EMPIRICAL_NANOSLEEP_OVERHEAD_US 60

(which is the original 25usec plus the value of the majority of outlyers).

This should now make things more smooth, as we now spend more time doing busy waiting to exactly reach the time. Testing with my set-up here, it looks like the visible flicker glitches went entirely - the image is dead solid.

It comes with a theoretical downside, which is more CPU usage on that core the update thread is running. However, in practice (with isolcpus=3 set), this won't take away any of the CPU for anything else, because the thread is already locked on one core and nobody else can use it.

If this works for you, maybe I should make this higher setting the default. It is a bit more tricky as it should only apply for Pi2/3 that hare more than one core; old Pi1 (or Raspberry Pi Zero) should have a more moderate setting here, otherwise we use away too much CPU time for other things.

(don't forget to set DEBUG_SLEEP_JITTER to 0 once you're satisified)

I've now updated that jitter allowance by 35usec for Raspberry Pi 2 and 3, so this should hopefully be better now if this was really the source of your flicker.
I've also updated the sub-project link in the rpi-matrix-pixelpusher that you're probably using.

Let me know if this fixes your observation or if we have to dig somewhere else.

Did it get better @btownshend ?

Sorry, I was distracted by other things for the last few days.

I did test it out tonight, and it is still flickering. It is possible that it is a bit better, but can't tell.

I tried checking the SLEEP_JITTER and here's what I got:

While running 'service --status-all' in another window (on another CPU), which causes lots of flickering:

Overshoot histogram >= empirical overhead of 60us
  usec |   count |   accum
<=  0us:   142311  99.982%
 +  2us:        1  99.983%
 +  5us:        6  99.987%
 +  6us:        2  99.989%
 +  7us:        4  99.992%
 +  9us:        1  99.992%
 + 10us:        2  99.994%
 + 11us:        1  99.994%
 + 12us:        3  99.996%
 + 13us:        1  99.997%
 + 19us:        1  99.998%
 + 21us:        1  99.999%
 + 38us:        1  99.999%
 +255us:        1 100.000%

While leaving the system idle, still seeing minor flickers all the time, with more visible ones roughtly every 10s:

Overshoot histogram >= empirical overhead of 60us
  usec |   count |   accum
<=  0us:   256426  99.994%
 +  4us:        3  99.995%
 +  5us:        8  99.998%
 +  6us:        3 100.000%
 + 10us:        1 100.000%

Judging from the second set, it doesn't seem like the sleep jitter is causing the flickers.

Whenver something heavy happens in another terminal, the kernel is very distracted and generates a lot of flicker.
I think I want to compile some realtime kernel and see what is happening...

And I just tried increasing the EXTRA overhead to give a total of 90usec. Now, the overshoot is always <=0, but the flickers are still there.

yeah, so the kernel is either (a) messing with that CPU core too much or (b) something else is going on in the background that is creating too much interference in memory etc.

I hope you're not using the taskset setting, because otherwise all threads are running on that core.

Check out the new compile-time option in FIXED_FRAME_MICROSECONDS in lib/Makefile. It will allow to time-buffer glitches created by the rest of the system.

Just wanted to chime in that FIXED_FRAME_MICROSECONDS after calibrating has totally solved my flicker issues. Refresh is down to 140hz, but it is looking fine to me, and I would rather the slightly more sluggish refresh that's flicker-free.

Cool!

6 64x32 panels on RPi v2 with 2017-09-07-raspbian-stretch-lite with Adafruits RGBMatrix+RTC has significantly less jitter than the same setup but on RPi v3 with 2018-06-27-raspbian-stretch-lite. This includes with or without cpu isolation as suggested in this thread, and includes after accounting for FIXED_FRAME_MICROSECONDS and includes disabling wifi and bluetooth first.

I'm assuming I've reach some theoretical maximum limit. Already the --led-show-refresh suggests setting fixed frame microseconds to 23000+

I may have a chance to further examine this in the future. not sure yet. For now I had to put the project on hold.

Yes, something happened in the old vs. new Raspbian stretch which made it more prone to jitter. If someone with kernel experience can figure out what happened, that would be great...

I only had to add the FIXED_FRAME_MICROSECONDS after complaints came in after that Debian version...

Just out of curiosity has anyone tried Preempt-RT? There have been efforts to provide a more real-time kernel. FIXED_FRAME_MICROSECONDS does the trick with two panels but to drive more pixels I wonder if this would help.

https://lemariva.com/blog/2018/07/raspberry-pi-preempt-rt-patching-tutorial-for-kernel-4-14-y

That is an interesting tutorial. It also suggests when isolating cpu's to look in /proc/interrupts to see what is interrupting which cpu and how often

Even if you set the isolcpus (as described in the rpi-rgb-led-matrix readme), you see that there are still interrupts serviced on the isolated core unfortunately :/ The isolcpus setting seems to only influence user-tasks not kernel tasks.

but does the interrupt information help us understand what in the kernel, and perhaps what change to the kernel from previous versions, is causing the increase in interrupts from older revisions

Sorry to bring a closed topic back from the dead, but I too am having the similar issue. (All leds blink)

The only solution that seems to work is to disable all CPU cores leaving one.
By editing /boot/cmdline.txt and adding maxcpus=1

This slows things down, so If anyone can help with this would be great.

Same problem.
/boot/cmdline.txt maxcpus=1
Not Flickr, but too slow to run video.
isolcpus=3 and all other settings, It doesn't change anything.
Unfortunately my knowledge of Linux is too low to help.

Can I change maxcpus=1 runtime, if i have to show pictures or video ?

The Raspberry Pi memroy bus in general is not very good if there are other tasks going on that have a lot of memory churn. If you run the LED matrix, it should almost be the only thing running there.

If you want to play videos, make sure use tha advise for the video-player: either pre-generate them as 'stream' (best solution, but uses some space on disk) or recode the video-file to the target size beforehand so that no resizing is needed in the replay step. But playing a large video and resizing it on-the fly will not work on a Pi and result in a lot of flicker (if you find a way to do that on the GPU, maybe; the video-viewer is just a demo code to show the simplest possible way).

The mode with the pre-generated stream is bas that as it almost needs zero additional CPU time at playtime.

You see less flicker if you limit the CPU to one, because then there is only a single core accessing the memory subsystem.

In general, try to minimize running other stuff, the Pi is simply too slow.

Maxcpus=1 is a boot-setting, you can't set it at runtime.

In my case I wasn't playing video but was only displaying static text and would have the flickr every 10 seconds or so. I've since abandoned the whole project but my thought to revive it is to perhaps split the matrix into seperate raspberry pi's and multiplext them? (384x32 is perhaps just too much) Do you have any other suggestions for limits and good hacks to make large(er) displays?

I'm trying to use Tiny Core.
There is no flickr
But I can not compile WiringPi and install pyton3-gpiozero

@gioreva you don't need wiring pi for the led matrix, in fact you should not mix the two. The led matrix already provides a way to read the remaining gpio bits.

For start video when GPIO change.

 if [ "$(gpio -g read 2)" = "0" ]; then     ./video-viewer az.avi

On Tiny core there are not build-essential

And for python3-gpiozero library, I'm trying to copy them from Raspbian to tiny core but the folders are different.

rasbian has:
/usr/lib/python3/dist-packages/gpiozero/

and tinycore has:
/usr/lib/micropython/os

I'd advise to avoid using Python on the raspberry pi, it is too resource-intensive, resulting in slowness and flicker (not to mention the nightmares with dependencies, compatibility and install you see in your case).

It should only be a couple of lines of code to modify the led-image-viewer to play a stream on a button-press.

Python and gpiozero, are used for the script, to use the new IC
How long does it take to implement that function in your code ?
I have to deliver the panel in days.

It will take a few lines of codes, and is implemented in a few minutes; definitely shorter than trying to figure out how to install the correct python libraries (leave alone that you won't be able to run it in parallel with the matrix). And a lot more solid to operate, because you only have one compiled binary that runs.

First, check that you can create a stream for the led-image-viewer to play your video (you can create that stream with the video-viewer; don't use the video viewer directly to display things, it is too slow and creates flicker on the Rasbperry Pi).

Then familiarize yourself with the RequestInputs() and AwaitInputChange() functions (API doc in include file or also look at input-example.cc).

Use that to read your input, and modify the led-image-viewer to be running all the time, but is triggered by the input. So in the main do { loop, you add the button check as a gateway to show the next image (or stream).
So your code change around there would roughly look like this:

uint32_t pin_we_are_interested_in = 1 << 2;  // the GPIO we are interested in.
if (matrix->gpio()->RequestInputs(pin_we_are_interested_in) != pin_we_are_interested_in) {
   fprintf(stderr, "Pin is not available\n");
   return 1;
}
 do {
         // Trigger showing the next image whenever a button is pressed.
          uint32_t input = matrix->AwaitInputChange(100);
          if (input & pin_we_are_interested_in == 0)  continue;  // Check that the button went high.
         ....  // the usual stuff in this loop

Video work well, and this code is ok
I can make it, but problem is the python script #746 for new IC.
Now all panels has new FM6126 IC.

I might implement that in the main library soon; I have ordered one of the new Panels.

I have a few days to deliver, you want to try to write it and I try it?
We write by email.

libavcodec libavformat libswscale there are on repo of tiny core

Hello, @gioreva I would like to test tiny core also for my project, do you have any kind of "guide" / step-by-step to build the library? Are you generating a tcz package?

I have already forgotten. I have these notes.
tce-load -wi git compiletc nano libavcodec libavformat libswscale bash
dpkg-deb

I put all the folders in the save, but they scolded me that it's wrong.
I didn't understand how to make a package.

If you do, will you pass me the step by step guide ?

But @hzeller , did you implement the script for the new chip in the sources ?
Python script not work on TinyCore

A couple of years ago I was also playing with picore for a similar project and I already created my own packages (even I prepared a captured small web that allowed to load updates and let the user configure the wifi). I have to remember the steps, and sure, I will share them, It could be a nice addition for the @hzeller already great documentation.
At this moment I'm testing balenaos, and is very very easy, but probably will have problems with the flickering (I have to test more). Maybe if i use the alpine image...

@gioreva Do you remember wich piCore version did you used?
I downloaded the 10 arm7 but it doesn't have compiletc metapackage yet.
The 9 arm6 seems that doesn't load on a raspi 3b+

I don't remember well but I doubt that I compiled the library in armbian, and then copied the executables to tinycore.

Sorry to bring this thread back around again, but I'm curious if anyone has had the chance to test how the RPI 4 affects flickering? I'm currently running an RPI 3 B+, but if the faster clock on the RPI4 reduced or removed flicker altogether, I'd drop the cash on an upgrade in a heartbeat!

It is faster, and if you are running some more heavy lifting thing, it will reduce flickering. For instance, I could decode video and play it without creating flicker while this was not possible with the Pi3 (this is an example of course, as video playback should anyway be done using the streaming).

As always, it depends what you are doing. Usually, you only get flicker if you have some heavy stuff going on, like something with a lot of memory churn (e.g. when programming in Python). Then also, using the most minimal operating system set-up is best; I have heard of people using DietPi to help them get rid of flicker issues they had before with Raspbian (haven't tried that myself).

@hzeller Thanks for the quick response! It sounds like the new Pi is worth the upgrade then. For now, I'm just displaying video, text, and images. The framework I built on top of this library, while not in python, is still a little heavy, so I imagine that could benefit from faster clock speeds. I was hoping to integrate some networked tasks like pulling Spotify data from Last.fm, weather data, controlling the panel through a web app, etc... Not quite sure if I'm just living in a fantasy world thinking the Pi could handle both of these without creating flickering/stuttering though.

make sure to pre-scale video, or better, pre-generate as a stream (see documentation of led-image-viewer and video-viewer; also content-streamer.h api). With that, I can play a video with 250fps (essentially swapping frame with each refresh; watching a Mandelbrot zoom video with that is quite a ride...)

In the past, the Pi's had issues with accessing USB and network as it all was sharing a bus; this might be better with the Pi4.

Interesting - I'll definitely take a look at your documentation on that. I'm working with some specific formats (pinball DMD animations!) that don't store or update with nearly as much data (mono-color, 0-15 intensity values), so I figured I could write something myself to take advantage of that. it sounds like pre-generating as a stream is the way to go. If nothing else, doing it myself has been a good learning experience :)

I see further up different suggestions for flicker reduction, but the main thing that's really fixed flicker on my end is enforcing a sleep at the end of my update loop (~usleep(100000) seems to look the best)... This seems like I'm going against the intended use of the library though, not to mention it locks my framerate to <10fps. Is this effectively what FIXED_FRAME_MICROSECONDS tries to do?

Yes, the more you can pre-generate, the better; typically some pre-allocated FrameCanvas'es, that you SwapOnVSync() are a good way. With the content-streamer.h you can fill two FrameCanvas-es in turn to do that.

FIXED_FRAME_MICROSECONDS is used within the internal update thread to make sure that each screen refresh is taking the same time; otherwise if there are faster and slower updates, this will push more or less photons per time-unit, which is what we perceive as flicker.

the sleep that you do in your update loop is making sure to use less CPU and memory churn to less influence the updating that is going on (you do have the isolcpus=3 setting enabled, to not use that core, right ?)

That's very good to know. I've basically used the Canvas and Canvas::SetPixel() as a wrapper for interfacing with the panel at a simple level, then built everything else myself on top of that. Perhaps I'm not making good enough use of FrameCanvas and SwapOnVSync. Currently I'm just drawing the frame with SetPixel, sleeping, then clearing the canvas before the next frame. Originally this was just to get things running while I build everything out, but now I'm thinking it may be a little naive.

I'm working on better calibrating FIXED_FRAME_MICROSECONDS now - sounds like this is something I'd definitely want. And yes, I did add isolcpus=3 to my /boot/cmdline.txt file.

With FIXED_FRAME_MICROSECONDS commented out and --led-slowdown-gpio left at default, I'm sitting at 7425u. It sounds like this is higher than others in this thread. Perhaps I need to optimize things better first then...

Hello,
A little late to the conversation, but In my system I'm using a raspi 3b+ that just receives the data from an open socket, and with rasbian it had flickering, I tryied openbalena and also, and now I'm using tinycorelinux (piCore) without problems.

@gpulido That's great to know. I'll take a look at that!

Wanted to leave some findings here for what it's worth. I was able to take my refresh from ~120hz to ~400hz and completely remove flicker by setting --led-pwm-lsb-nanoseconds=50 --led-pwm-dither-bits=2. The dither was the game-changer and the low led-pwm-lsb-nanoseconds doesn't seem to result in any ghosting. My panels are already quite bright, so I had my brightness at ~25% anyway. Setting it back to max, then compromising on a little brightness with dithering definitely gets me the best results.

Thanks for the info, however in my case my panels are going to be used on exterior with direct sunlight, so to reduce the brightness is not an option :(

I know this issue is closed, but in my case i removed the flickering completely by removing the "led-show-refresh"-command. I nearly tried everything thats described above, but in the end, this was the point. (for me)
I used this board https://www.electrodragon.com/product/rgb-matrix-panel-drive-board-raspberry-pi/ and a pi three with raspbian lite. I have two panels parallel in two chains.
So my command is: sudo ./text-example -y -1 -x 4 -f ../fonts/myfont.bdf -C 255,255,255 –led-rows=32 –led-cols=32 –led-chain=2 –led-parallel=2 –led-multiplexing=6 –led-brightness=100
Hope this information is useful for somebody