processing pipeline for digital cameras Local, Linear and Learned (L - PowerPoint PPT Presentation

Accelerating a learning – based image processing pipeline for digital cameras Local, Linear and Learned (L 3 ) pipeline Qiyuan Tian and Haomiao Jiang Department of Electrical Engineering Stanford University GPU Technology Conference, San Jose March 17, 2015

Digital camera sub-systems RAW image Display image Focus control Pre-processing Image • dead pixel removal • dark floor subtraction processing • structured noise reduction pipeline • quantization CFA • etc. Exposure control Transform the Lens, aperture and sensor sensor data into a display image

Standard image processing pipeline CFA Sensor Illuminant Noise Tone interpolation conversion correction reduction scale RAW image Display image − Requires multiple algorithms − Each algorithm requires optimization − Optimized only for Bayer (RGB) color filter array (CFA)

Opportunity Extra sensor pixels enable new CFAs that improve sensor functionality and open new applications Bayer RGBW RGBX RGBCMY Medical low-light sensitivity infrared multispectral specialized dynamic range light field application Challenge − Customized image processing pipeline − Speed and low power

L 3 image processing pipeline Classify Retrieve and apply CFA Sensor Illuminant Noise Tone interpolation pixels conversion correction reduction scale transforms RAW image Display image Local, Linear and Learned (L 3 ) − Combines multiple algorithms into one − Rendering is simple, fast and low-power − Uses machine learning to optimize the class transforms for any CFA

Classify pixels RAW image Flat Texture Sensor voltage level Center pixel color Intensity Contrast Class Center pixel color: red Intensity: high Contrast: flat “Local” pixel values (local patch)

Retrieve and apply transforms “Linear” transforms RAW image R G B Weighted summation Rendered R, G, B values Class Center pixel color: red Learned Intensity table of Intensity: high linear Contrast: flat transforms Contrast

Table-based architecture suits GPU Weighted summation Weighted summation GPU − Independent calculation for each pixel − Simple weighted summation Thus well-suited for parallel rendering using GPU

GPU implementations Render one pixel ( i, j ) • Calculate class index • Retrieve transforms • Weight sum Table of transforms Constants, e.g. CFA pattern

GPU acceleration results Results CPU GPU Image 0.062s 12.4s (1280 × 720) (16 fps) Video 163.2s (1280 × 720 × 1800) (11 fps) − GPU: NVidia GTX 770 (1536 kernels, 1.085 GHz) − CPU: Intel Core i7-4770K (3.5 GHz) − CUDA/C programming Tian et al. 2015

Potential speed improvement Use shared memory and registers Specialized image signal processor (ISP) L 3 ISP

L 3 processing “Learn” the transforms Table of Transforms Pre- Local Patch Transform processing Classification Application Novel RAW Image Classification Display Camera Map image GPU

Locally linear transform − Globally nonlinear for an entire image − 480 linear transforms in total Center color Intensity Contrast red 1 V flat green Local patches white texture blue 0 V 20 levels

Learn the locally linear transform for each class ? R G B Linear Local RAW Desired RGB values values transform A 𝐲 = 𝐜

Solve the transform ? ? R G B Linear Local RAW Desired values RGB values transform A 𝐲 = 𝐜 A𝐲 − 𝐜 2 + Γ𝐲 2 minimize 𝐲 ridge regression

Training data from camera simulation Simulated Local Desired RAW image patches RGB Classification … ISET camera simulator Multispectral radiance Registered desired (with calibrated optics and training scenes RGB images Training data sensor parameters) http://imageval.com − Simulate any camera designs − Various training scenes, illuminants and luminances − Registered and desired RGB images

Learned transforms Red-pixel Transforms that centered patch solve for R-channel Dark class Bright class (use more W) (use more RGB) − Accounts for spatial and spectral correlation − Accounts for sensor and photon noise

Advantages of learning − Adapts to any application and scene content Consumer Document Industrial Pathology Endoscopy Photography Digitization Inspection − Adapt to any CFA Bayer RGBW RGBX RGBCMY Medical

Solve RGBW rendering In dark scene − Two f-stops gain In bright scene − Same performance Simulation conditions Exposure: 100 ms F-number: f/4 Tian et al. 2014

Smooth transition from dark to bright .01 .1 1 10 100 200 300 cd/m 2 Scene Luminance Tian et al. 2014

Compare RGBW CFA designs Bayer Parmar & Wandell, 2009 Aptina CLARITY+ Simulation conditions Luminance: 1cd/m 2 Exposure: 100 ms F-number: f/4 Tian et al. 2014 Kodak Wang et al., 2011

Five-band camera prototype RGB Cyan Orange 4 × 4 super-pixel Tian et al. 2015

L 3 solves five-band prototype rendering Tian et al. 2015

GPU acceleration results Results GPU CPU Image 0.062s 12.4s (1280 × 720) (16 fps) Video 163.2s (1280 × 720 × 1800) (11 fps) − GPU: NVidia GTX 770 (1536 kernels, 1.085 GHz) − CPU: Intel Core i7-4770K (3.5 GHz) − CUDA/C programming Tian et al. 2015

Desired L 3 learning Multispectral RGB Images Scenes Camera ISET camera Supervised Calibration Simulation Learning Novel Calibrated Simulated Table of Camera Parameters RAW Image Transforms L 3 processing Table of transforms Pre- Local Patch Transform processing Classification Application Novel RAW Image Classification Display GPU Camera Map image

Local, linear and learned pipeline (L 3 ) summary − Table-based rendering architecture is ideal for GPU acceleration − Machine learning automates image processing for any CFA and scene content Rethink image processing pipeline

Acknowledgement Advisors Brian Wandell, Joyce Farrell Group members Henryk Blasinski, Andy Lin Stanford collaborators Francois Germain, Iretiayo Akinola Olympus collaborators Steven Lansel, Munenori Fukunishi

References Tian, Q., Lansel, S., Farrell, J. E., and Wandell , B. A., “Automating the design of image processing pipelines for novel color filter arrays: Local, Linear, Learned (L 3 ) method,” in [IS&T/SPIE Electronic Imaging], 90230K– 90230K, International Society for Optics and Photonics (2014). Tian, Q., Blasinski, H., Lansel, S., Jiang, H., Fukunishi, M., Farrell, J. E., and Wandell, B. A., “Automatically designing an image processing pipeline for a five -band camera prototype using the local, linear, learned (L 3 ) method,” in [IS&T/SPIE Electronic Imaging], 940403-940403-6, International Society for Optics and Photonics (2015).

End Thanks for your attention! Questions? Contacts qytian@stanford.edu hjiang36@stanford.edu

Potential speed improvement • Local vs Global • L3 is locally linear: can use local memory to speed up • Locality in memory: writing output as RGBRGB is faster than writing as image plane • Device based optimization • CFA pattern and other parameters are fixed: Constant Memory & no need to pass in • Symmetry and other properties • CUDA, GLSL, FPGA, Hardware • L3 rendering is based on linear transforms and can be implemented with shaders or hardware circuits to achieve further acceleration

processing pipeline for digital cameras Local, Linear and Learned (L - PowerPoint PPT Presentation

Accelerating a learning based image processing pipeline for digital cameras Local, Linear and Learned (L 3 ) pipeline Qiyuan Tian and Haomiao Jiang Department of Electrical Engineering Stanford University GPU Technology Conference, San Jose

Cameras Cameras Good cameras need focusing, cheap ones dont Good cameras need focusing,

Range gated cameras technology and its applications Friday, October 10, 14 Range gated cameras

Cameras Cameras photographs, you should: photographs, you should: increase the diameter of the

Body Worn Cameras and Transparency Increase in use of cameras Expectation of

The OpenGL Rendering Pipeline The Rendering Pipeline The process to generate two-dimensional

Natural Language Processing: Traditional Processing Pipeline Roman Kern <rkern@tugraz.at>

DWI The New World of Videos & Blood Tests Cameras Can Be Vicious First they frame you! Then

Photography Photography By: Jason Cox By: Jason Cox Cameras Cameras Pinhole Pinhole

Fredericksburg Police Department Body-worn Camera System The Start of Body-Worn Cameras The

OF WILDLIFE Division of Law Enforcement Trail Cameras June 2013 Trail Cameras SD card

Order One 46 Classrooms Cameras only: $56,108.00 Order Two 8 Classrooms Cameras

Even better cameras? Even better cameras? Are they needed, possible and Are they needed,

Cameras EECS 442 Prof. David Fouhey Winter 2019, University of Michigan

Office of Pipeline Safety Office of Pipeline Safety Presentation on Presentation on Damage

Ma Magic Mountain Pipeline Phase 6 gic Mountain Pipeline Phase 6 Project ject Board Meeting

Internal Pipeline Corrosion Kenneth Lee Pipeline Safety Director, Engineering & Research

Visibility measurement using digital camera Vladimr Brzda Department of Electrical

SOFTWARE MODULES: SERIAL COMUNICATIONS -BASIC FOR DEBUGGING -RECEIVING BUFFER -INTERRUPTION I2C

Research Project Brief A recent article/publication; how it has influenced our practice and

Bader Dashti Bill Dunlap Chris Hansen Mechanical Mechatronic Mechanical Ahmed Kamfar

Years ago, sitting atop a mound of lava on the island of Baltra in the Galapagos, a friend of

Measuring Test Mass Scattering Christian Pluchar Advisor: Keita Kawabe Outline -What Ive

Heritage Grants Program: Evaluation Plan Presenters: Nancy Hewat, Ph.D. Synthesis Evaluation

New Database Metrics to Track and Analyze Multiple Business Stakeholders Transportation Center

processing pipeline for digital cameras Local, Linear and Learned (L - PowerPoint PPT Presentation

Accelerating a learning based image processing pipeline for digital cameras Local, Linear and Learned (L 3 ) pipeline Qiyuan Tian and Haomiao Jiang Department of Electrical Engineering Stanford University GPU Technology Conference, San Jose

Cameras Cameras Good cameras need focusing, cheap ones dont Good cameras need focusing,

Range gated cameras technology and its applications Friday, October 10, 14 Range gated cameras

Cameras Cameras photographs, you should: photographs, you should: increase the diameter of the

Body Worn Cameras and Transparency Increase in use of cameras Expectation of

The OpenGL Rendering Pipeline The Rendering Pipeline The process to generate two-dimensional

Natural Language Processing: Traditional Processing Pipeline Roman Kern &lt;rkern@tugraz.at&gt;

DWI The New World of Videos &amp; Blood Tests Cameras Can Be Vicious First they frame you! Then

Photography Photography By: Jason Cox By: Jason Cox Cameras Cameras Pinhole Pinhole

Fredericksburg Police Department Body-worn Camera System The Start of Body-Worn Cameras The

OF WILDLIFE Division of Law Enforcement Trail Cameras June 2013 Trail Cameras SD card

Order One 46 Classrooms Cameras only: $56,108.00 Order Two 8 Classrooms Cameras

Even better cameras? Even better cameras? Are they needed, possible and Are they needed,

Cameras EECS 442 Prof. David Fouhey Winter 2019, University of Michigan

Office of Pipeline Safety Office of Pipeline Safety Presentation on Presentation on Damage

Ma Magic Mountain Pipeline Phase 6 gic Mountain Pipeline Phase 6 Project ject Board Meeting

Internal Pipeline Corrosion Kenneth Lee Pipeline Safety Director, Engineering &amp; Research

Visibility measurement using digital camera Vladimr Brzda Department of Electrical

SOFTWARE MODULES: SERIAL COMUNICATIONS -BASIC FOR DEBUGGING -RECEIVING BUFFER -INTERRUPTION I2C

Research Project Brief A recent article/publication; how it has influenced our practice and

Bader Dashti Bill Dunlap Chris Hansen Mechanical Mechatronic Mechanical Ahmed Kamfar

Years ago, sitting atop a mound of lava on the island of Baltra in the Galapagos, a friend of

Measuring Test Mass Scattering Christian Pluchar Advisor: Keita Kawabe Outline -What Ive

Heritage Grants Program: Evaluation Plan Presenters: Nancy Hewat, Ph.D. Synthesis Evaluation

New Database Metrics to Track and Analyze Multiple Business Stakeholders Transportation Center

Natural Language Processing: Traditional Processing Pipeline Roman Kern <rkern@tugraz.at>

DWI The New World of Videos & Blood Tests Cameras Can Be Vicious First they frame you! Then

Internal Pipeline Corrosion Kenneth Lee Pipeline Safety Director, Engineering & Research