Fig1024/OP_RBF

性能

Opened this issue · 0 comments

Original Recursive Bilateral Filter implementation
Image: lw_cross.jpg, size: 1280 x 720, time ms: 70.2
Image: Thefarmhouse.jpg, size: 1440 x 1080, time ms: 119.3
Image: testGirl.jpg, size: 448 x 626, time ms: 23.6

Optimized SSE2 single threaded, single stage (non-pipelined)
Image: lw_cross.jpg, size: 1280 x 720, time ms: 86.6
Image: Thefarmhouse.jpg, size: 1440 x 1080, time ms: 142.0
Image: testGirl.jpg, size: 448 x 626, time ms: 25.6

Optimized SSE2 2x multithreading, single stage (non-pipelined)
Image: lw_cross.jpg, size: 1280 x 720, time ms: 44.3
Image: Thefarmhouse.jpg, size: 1440 x 1080, time ms: 78.4
Image: testGirl.jpg, size: 448 x 626, time ms: 13.4

Optimized SSE2 4x multithreading, single stage (non-pipelined)
Image: lw_cross.jpg, size: 1280 x 720, time ms: 24.9
Image: Thefarmhouse.jpg, size: 1440 x 1080, time ms: 41.5
Image: testGirl.jpg, size: 448 x 626, time ms: 7.5

Optimized SSE2 4x2 thread pipelined 2 stages
Image: lw_cross.jpg, size: 1280 x 720, time ms: 18.6
Image: Thefarmhouse.jpg, size: 1440 x 1080, time ms: 32.0
Image: testGirl.jpg, size: 448 x 626, time ms: 5.9
Finish

您好,我的cpu是i7-8700 cpu@3.20GHZ的,但是这个复现结果和您的结果差距在十多倍,请问会是什么原因呢?谢谢