Infinite-isp is a collections of camera pipeline modules implemented at the application level for converting an input RAW image from a sensor to an output RGB image. Infinite-isp aims to contain simple to complex algorithms at each modular level.
ISP pipeline for infinite-isp v1.0
Many open-source ISPs are available over the internet. Most of them are developed by individual contributors, each having its own strengths. This project aims to centralize all the open-source ISP development to a single place enabling all the ISP developers to have a single platform to contribute. Infinite-isp will not only contain the conventional algorithms but aims to contain state-of-the-art deep learning algorithms as well enabling a clean comparison between the two. This project has no bounds to ideas and is aimed to contain any algorithm that improves the overall results of the pipeline regardless of their complexity.
A comparison of features with the famous openISP.
Infinite-isp also tries to simulate the 3A-Algorithms.
Modules | infinite-isp | openISP |
---|---|---|
Crop | Bayer pattern safe cropping | ---- |
Dead Pixel Correction | Modified Yongji's et al, Dynamic Defective Pixel Correction for Image Sensor | Yes |
Black Level Correction | Calibration / sensor dependent - Applies BLC from config |
Yes |
Optical Electronic Transfer Function (OECF) | Calibration / sensor dependent - Implements a LUT from config |
---- |
Anti Aliasing Filter | ---- | Yes |
Digital Gain | Gains from config file | Brightness contrast control |
Lens Shading Correction | To Be Implemented | ---- |
Bayer Noise Reduction | Green Channel Guiding Denoising by Tan et al | Chroma noise filtering |
White Balance | WB gains from config file | Yes |
CFA Interpolation | Malwar He Cutler’s demosaicing algo | Yes - Malvar He Cutler |
3A - Algorithms | AE & AWB | ---- |
Auto White Balance | - Gray World - Norm 2 - PCA algorithm |
---- |
Auto Exposure | - Auto Exposure based on skewness | ---- |
Color Correction Matrix | Calibration / sensor dependent - 3x3 CCM from config |
Yes - 4x3 CCM |
Gamma Tone Mapping | Gamma LUT in RGB from config file | Yes - YUV and RGB domain |
Color Space Conversion | YUV analogue and YCbCr digital - BT 601 - Bt 709 |
Yes - YUV analogue |
Contrast Enhancement | Modified contrast limited adaptive histogram equalization | ---- |
Edge Enhancement / Sharpeining | ---- | Yes |
Noise Reduction | Non-local means filter | Yes - NLM filter - Bilateral noise filter |
Hue Saturation Control | ---- | Yes |
Scale | - Integer Scaling - Non-Integer Scaling |
---- |
False Color Suppression | ---- | Yes |
YUV Format | - YUV - 444 - YUV - 422 |
---- |
The project is compatible with Python_3.9.12
The dependencies are listed in the requirements.txt file.
The project assumes pip package manager as a pre-requisite.
Follow the following steps to run the pipeline
- Clone the repo using
git clone https://github.com/xx-isp/infinite-isp
- Install all the requirements from the requirements file by running
pip install -r requirements.txt
There are a few sample images with tuned configurations already added to the project at in_frames/normal folder. In order to run any of these, just replace the config file name with any one of the sample configurations provided. For example to run the pipeline on Indoor1_2592x1536_12bit_RGGB.raw
simply replace the config file name in isp_pipeline.py
config_path = './config/Indoor1-configs.yml'
Here are the results of this pipeline compared with a market competitve ISP. The outputs of our ISP are displayed on the right, with the underlying ground truths on the left.
A comparison of the above results based on PSNR and SSIM image quality metrics
Images | PSNR | SSIM |
---|---|---|
Indoor1 | 21.51 | 0.8624 |
Outdoor1 | 22.87 | 0.9431 |
Outdoor2 | 20.54 | 0.8283 |
Outdoor3 | 19.22 | 0.7867 |
Outdoor4 | 22.25 | 0.8945 |
You can run the project by simply executing the isp_pipeline.py. This is the main file that loads all the algorithic parameters from the configs.yml The config file contains tags for each module implemented in the pipeline. A brief description as well as usage of each module is as follows:
platform | Details |
---|---|
filename | Specifies the file name for running the pipeline. The file should be placed in the in_frames/normal directory |
disable_progress_bar | Enables or disables the progress bar for time taking modules |
leave_pbar_string | Hides or unhides the progress bar upon completion |
sensor Info | Details |
---|---|
bayer_pattern | Specifies the bayer patter of the RAW image in lowercase letters - bggr - rgbg - rggb - grbg |
range | Not used |
bitdep | The bit depth of the raw image |
width | The width of the input raw image |
height | The height of the input raw image |
hdr | Not used |
crop | Details |
---|---|
isEnable | Enables or disables this module. When enabled it ony crops if bayer pattern is not disturbed |
isDebug | Flag to output module debug logs |
new_width | New width of the input RAW image after cropping |
new_height | New height of the input RAW image after cropping |
dead_pixel_correction | Details |
---|---|
isEnable | Enables or disables this module |
isDebug | Flag to output module debug logs |
dp_threshold | The threshold for tuning the dpc module. The lower the threshold more are the chances of pixels being detected as dead and hence corrected |
To be implemented
black_level_correction | Details |
---|---|
isEnable | Enables or disables this module |
r_offset | Red channel offset |
gr_offset | Gr channel offset |
gb_offset | Gb channel offset |
b_offset | Blue channel offset |
isLinear | Enables or disables linearization. When enabled the BLC offset maps to zero and saturation maps to the highest possible bit range given by the user |
r_sat | Red channel saturation level |
gr_sat | Gr channel saturation level |
gb_sat | Gb channel saturation level |
b_sat | Blue channel saturation level |
OECF | Details |
---|---|
isEnable | Enables or disables this module |
r_lut | The look up table for oecf curve. This curve is mostly sensor dependent and is found by calibration using some standard technique |
digital_gain | Details |
---|---|
isEnable | This is a essential module and cannot be disabled |
isDebug | Flag to output module debug logs |
gain_array | Gains array. User can select any one of the gain listed here. This module works together with AE module |
current_gain | Index for the current gain starting from zero |
To be implemented
bayer_noise_reduction | Details |
---|---|
isEnable | When enabled reduces the noise in bayer domain using the user given parameters |
filt_window | Should be an odd window size |
r_stdDevS | Red channel gaussian kernel strength. The more the strength the more the blurring. Cannot be zero |
r_stdDevR | Red channel range kernel strength. The more the strength the more the edges are preserved. Cannot be zero |
g_stdDevS | Gr and Gb gaussian kernel strength |
g_stdDevR | Gr and Gb range kernel strength |
b_stdDevS | Blue channel gaussian kernel strength |
b_stdDevR | Blue channel range kernel strength |
white_balance | Details |
---|---|
isEnable | Applies user given white balance gains when enabled |
isAuto | When true enables the 3A - AWB and does'nt use the user given WB gains |
r_gain | Red channel gain |
b_gain | Blue channel gain |
demosaic | Details |
---|---|
isEnable | This is a essential module and cannot be disabled |
auto_white_balance | Details |
---|---|
algorithm | Can select one of the following algos - grey_world - norm_2 - pca |
percentage | [0 - 100] - Parameter to select dark-light pixels percentage for pca algorithm |
color_correction_matrix | Details |
---|---|
isEnable | When enabled applies the user given 3x3 CCM to the 3D RGB image having rows sum to 1 convention |
corrected_red | Row 1 of CCM |
corrected_green | Row 2 of CCM |
corrected_blue | Row 3 of CCM |
gamma_correction | Details |
---|---|
isEnable | When enabled applies tone mapping gamma using the LUT |
gammaLut | The look up table for gamma curve |
auto_exposure | Details |
---|---|
isEnable | When enabled applies the 3A- Auto Exposure algorithm |
isDebug | Flag to output module debug logs |
center_illuminance | The value of center illuminance for skewness calculation ranges from 0 to 255. Default is 90 |
histogram_skewness | The range of histogram skewness should be between 0 and 1 for correct exposure calculation |
color_space_conversion | Details |
---|---|
isEnable | This is a essential module and cannot be disabled |
conv_standard | The standard to be used for conversion - 1 : Bt.709 HD - 2 : Bt.601/407 |
conv_type | The conversion type - 1 : Analogue YUV - 2 : Digital YCbCr |
ldci | Details |
---|---|
isEnable | When enabled local dynamic contrast enhancement is applied to the Y channel |
clip_limit | The clipping limit that controls amount of detail to be enhanced |
wind | Window size for applying filter |
To be implemented
2d_noise_reduction | Details |
---|---|
isEnable | When enabled applies the non-local mean filtering |
window_size | Search window size for applying the filter |
patch_size | Patch window size for applying filter |
h | Strength of blurring |
scale | Details |
---|---|
isEnable | When enabled down scales the input image |
isDebug | Flag to output module debug logs |
new_width | Down scaled width of the output image |
new_height | Down scaled height of the output image |
isHardware | When true applies the hardware friendly techniques for downscaling. This can only be applied to any one of the input sizes 3 input sizes and can downscale to - 2592x1944 to 1920x1080 or 1280x960 or 1280x720 or 640x480 or 640x360 - 2592x1536 to 1280x720 or 640x480 or 640x360 - 1920x1080 to to 1280x720 or 640x480 or 640x360 |
Algo | Software friendly scaling. Only used when isHardware is disabled - Nearest_Neighbor - Bilinear |
upscale_method | Used only when isHardware enabled. Upscaling method, can be one of the above algos |
downscale_method | Used only when isHardware enabled. Downscaling method, can be one of the above algos |
yuv_conversion_format | Details |
---|---|
isEnable | Enables or disables this module |
conv_type | Can convert the YCbCr to YUV - 444 - 422 |
TBD
TBD
TBD
Why is it named infinite-isp?
ISPs are hardware dependent. In them algorithms are limited to perform to their best because of hardware limitations. Infinite-isp tends to somewhat remove this limitation and let the algorithms perform to the full potential targeting best results.
Will inifnite-isp also contain algorithms that involve machine learning?
Yes definitely this is mainly because it is seen that machine learning models tend to give perform much better results as compared to conventional models. The plan is as follows
-
The release
v0.x
tillv1.0
will involve buildng a basic ISP pipelne at conventional level. -
The release
v1.0
will have all camera pipeline modules implemented at conventional level. This release will mostly contain algorithms that can be easily ported to hardware ISPs -
v1.x.x
releases will have all the necessary improvements of these conventional algorithms till releasev2.0
-
From release
v2.0
infinite-isp will start implementing machine learning models for specific algorithms. -
Release
v3.0
will have infinite-isp having both conventional and deep learning algorithms (not for all pipeline modules but for specific ones)
This project is licensed under Apache 2.0 (see LICENSE file).
- This project started of from the inspiration of cruxopen/openISP