Small, Medium, and Big Data: Application of Machine Learning Methods - PowerPoint PPT Presentation

Small, Medium, and Big Data: Application of Machine Learning Methods to the Solution of Real-World Imaging and Printing Problems A Personal Journey Jan Allebach Electronic Imaging Systems Laboratory (EISL) Purdue University 1 May 2018 SCV IEEE SPS Chapter – 1 May 2018

What are the Essential Ingredients of Machine Learning? (1/2) l A well-defined task w Choose a decision from a finite set of outcomes, based on observed data. w Estimate or predict the value of a continuous variable, based on observed data. l A well-defined decision or estimation structure w Clustering w Decision tree w Linear regression w Support vector machine w Neural network, including convolutional neural network (CNN) w Or other SCV IEEE SPS Chapter – 1 May 2018

What are the Essential Ingredients of Machine Learning? (2/2) l Features w Computed from observed data. w Serve as input to the decision or estimation structure. w May be handcrafted or determined autonomously as part of the training process. l Training data w Representative of the observed data. w Sufficiently diverse or rich to avoid over-fitting. l A well-defined cost function to penalize errors in classification or estimation. l A procedure for training the free parameters of the decision or estimation structure to minimize the cost function. SCV IEEE SPS Chapter – 1 May 2018

Synopsis w K Nearest Neighbor classification applied to printer forensics w Extension of K Means to Scalar Sequential Quantization w Optimal tree-structured piece-wise linear filter for image scaling w Training-based methods for digital haftoning w Black-box model for print prediction based on training and linear regression w Print macrouniformity prediction (Method 1) w Print macrouniformity prediction (Method 2) w Fashion photograph aesthetic quality predictor based on SVM and CNN w Facial landmark detection using CNN w Logo identification using CNN w Text field category classification via natural language processing SCV IEEE SPS Chapter – 1 May 2018

Printer Forensics l text SCV IEEE SPS Chapter – 1 May 2018

Whodunnit? HP Deskjet 1112 Canon MG2522 Epson XP-340 HP Deskjet 2655 Brother MFC-J485DW HP Envy 5549 Canon PIXMA MG3620 Canon MX922 SCV IEEE SPS Chapter – 1 May 2018

Supervised Clustering K Nearest Neighbors (KNN) Cyan Magenta HP Envy 5549 Yellow “Intrinsic Signatures of Inkjet Devices,” invited presentation, Center for Counterfeit Analysis Symposium (CAC-18), European Central Bank, Frankfurt Am Main, Germany, 6-7 March 2018. SCV IEEE SPS Chapter – 1 May 2018

Example image analysis for HP Envy 5549 Y and G clusters Connected components Y channel Keep clusters that are > 50 G channel pixels in size Each centroid is Centroids represented by 5x5 pixels SCV IEEE SPS Chapter – 1 May 2018

Unsupervised Clustering K-means SCV IEEE SPS Chapter – 1 May 2018

A special case of K-means: Structured Vector Quantization* l text *Research supported by Eastman Kodak Company. R. Balasubramanian, C. A. Bouman, and J. P. Allebach, “Sequential J. Z. Chang, J. P. Allebach, and C. A. Bouman, “Sequential Linear Scalar Quantization of Vectors: An Analysis,” IEEE Trans. on Image Interpolation of Multidimensional Functions,” IEEE Trans. on Image Processing, Vol. 4, pp. 1282-1295, September 1995. Processing , Vol. 6, pp. 1231-1245, September 1997. SCV IEEE SPS Chapter – 1 May 2018

Tree-Structured Classifiers: Resolution Synthesis – An Optimal Piecewise Linear Interpolator* l text C. B. Atkins, C. A. Bouman, and J. P. Allebach, “Tree-Based C. B. Atkins, C. A. Bouman, and J. P. Allebach, “Optimal Resolution Synthesis,” Proceedings of PICS-99: the 1999 Image Scaling Using Pixel Classification,” Proceedings of the IS&T Image Processing, Image Quality, Image Capture 2001 International Conference on Image Processing , Systems Conference , Savannah, GA, 25-28 April 1999. Thessaloniki, Greece, 7 October – 10 October 2001. B. Zhang, J. P. Allebach, J. Gondek, and M. Schramm, “Improved *Research supported Resolution Synthesis Algorithm for Image Interpolation,” Proceedings of NIP22 22nd International Conference on Digital by HP, Inc. Printing Technologies , Denver, CO, 17-22 September 2006. SCV IEEE SPS Chapter – 1 May 2018

Optimal image scaling T X Z Estimate X from W L realization of Z Scaled Source Image Image SCV IEEE SPS Chapter – 1 May 2018

Scaling procedure () { } ⋅ → − Z C T : 0 , … , M 1 z ( ) e 0 , e t − < z 0 ? z A , Classify 0 x ˆ 0 0 j j ( ) = j C z yes no T e 1 , e 2 , = + 1 2 x ˆ A j z j = = = j 0 j 1 j 4 e 3 , 3 A 0 , A 1 , A 4 , 0 1 4 = = j 2 j 3 A 2 , A 3 , 2 3 SCV IEEE SPS Chapter – 1 May 2018

4X scaling results Photoshop Bicubic Interpolation Tree-Based Resolution Synthesis SCV IEEE SPS Chapter – 1 May 2018

Synopsis w K Nearest Neighbor classification applied to printer forensics w Extension of K Means to Scalar Sequential Quantization w Optimal tree-structured piece-wise linear filter for image scaling w Training-based methods for digital haftoning w Black-box model for print prediction based on training and linear regression w Print macrouniformity prediction (Method 1) w Print macrouniformity prediction (Method 2) w Fashion photograph aesthetic quality predictor based on SVM and CNN w Facial landmark detection using CNN w Logo identification using CNN w Text field category classification via natural language processing SCV IEEE SPS Chapter – 1 May 2018

Training-based development of optimal rendering algorithms Human Rendering visual device system model model Training Quality data metric Rendering algorithm Free parameters of algorithm Search Constraints strategy SCV IEEE SPS Chapter – 1 May 2018

Model-Based Halftoning: Direct Binary Search (DBS)* Human visual system filter kernel ~ p( x ) ~ f( x ) f[ n ] + ∫ ( e( x ) ) 2 d x ~ ∑ e( x ) ~ arg min Human visual system filter kernel g[ n ] x - ~ p( x ) ~ g( x ) . . . g 0 [ n ] g[ n ] *Research supported by HP, Inc. Analoui and J. P. Allebach, “Model-based Halftoning by D. J. Lieberman, and J. P. Allebach, “A Dual Interpretation for Direct Binary Search,” Proceedings of the 1992 SPIE/IS&T Direct Binary Search and its Implications for Tone Symposium on Electronic Imaging Science and Technology , Reproduction and Texture Quality,” IEEE Trans. on Image San Jose, CA, February 9-14, 1992, Vol. 1666, pp. 96-108. Processing , Vol. 9, pp. 1950-1963, November 2000. SCV IEEE SPS Chapter – 1 May 2018

The DBS search heuristic Toggle Accept Swap 1 pattern with Swap 2 lowest error Swap 3 SCV IEEE SPS Chapter – 1 May 2018

DBS convergence: 0, 1, 2, 4, 6, and 8 iterations SCV IEEE SPS Chapter – 1 May 2018

Model-Based Training Supervised Halftoning Tone-Dependent Error Diffusion (TDED)* f[m,n] u[m,n] g[m,n] + Q(•) - + - d[m,n] w k,l (f[m,n]) X *Research supported P. Li and J. P. Allebach, “Tone-Dependent Error Diffusion,” IEEE by HP, Inc. Trans. on Image Processing , Vol. 13, pp. 201-215, February 2004. SCV IEEE SPS Chapter – 1 May 2018

Optimization of TDED parameters ∧ G DBS [ u, v; a ] | DFT | 2 DBS + Constant Normalized Patch MSE (absorptance a ) - | DFT | 2 TDED ∧ G TDED [ u, v; a ] Update weights and thresholds ∧ ∧ Cost function l − 2 ( G [ u, v; a ] G [ u, v; a ]) ∑∑ DBS TDED ξ = ( a ) . ∧ 2 G [ u, v; a ] u v DBS SCV IEEE SPS Chapter – 1 May 2018

Optimal weights and thresholds 1 0.8 Weight 0.6 0.4 0.2 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 0.8 Threshold 0.6 0.4 0.2 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Absorptance X SCV IEEE SPS Chapter – 1 May 2018

Floyd-Steinberg vs TDED Floyd-Steinberg TDED SCV IEEE SPS Chapter – 1 May 2018

TDED vs DBS TDED DBS SCV IEEE SPS Chapter – 1 May 2018

Marking engine technologies: laser electrophotographic l text Typical low-end laser electrophotographic printer: HP LaserJet M252dw $249.99 List Architecture of laser electrophotographic printer Instability of electrophotographic process Periodic, clustered- dot halftone textures are generally preferred for electrophotographic printers Student: F. Baqai SCV IEEE SPS Chapter – 1 May 2018

Commercial/industrial scale electrophotographic printing HP Indigo Press 3050 2,000 4-color sheets/hr. HP Indigo Press 30000 4600 3-color sheets/hr. SCV IEEE SPS Chapter – 1 May 2018

Linear Regression Predicting Printed Absorptance From a Digital Halftone: the Black-Box Model* Y. Ju, T. Kashti, T. Frank, D. Kella, D. Shaked, M. Fischer, R. Ulichney, and J. P. Allebach, “Black-Box Models for Laser Electrophotographic Printers – Recent *Research supported Progress,” Proceedings NIP29: IS&T’s 29th International Conference on Digital by HP, Inc. Printing Technologies , Seattle, WA, 29 September – 3 October 2013 SCV IEEE SPS Chapter – 1 May 2018

Structure of the Black-Box Model SCV IEEE SPS Chapter – 1 May 2018

How Do We Train the Model? 5. 1. 2. 3. 4. SCV IEEE SPS Chapter – 1 May 2018

Small, Medium, and Big Data: Application of Machine Learning Methods - PowerPoint PPT Presentation

Small, Medium, and Big Data: Application of Machine Learning Methods to the Solution of Real-World Imaging and Printing Problems A Personal Journey Jan Allebach Electronic Imaging Systems Laboratory (EISL) Purdue University 1 May 2018 SCV

Machine Learning Anders Holst SICS Big Data Analytics Analysis Big Data Big Value Big Data

Big Data Algorithms with Medical Applications Yixin Chen Outline Challenges to big data

Big Data Personal data Personal (Big) Data Personal Information Economy luk vervenne x

x ? Machine Learning 5/4/20 Tim Althoff, UW CS547: Machine Learning for Big Data,

Presentation Outline 1. Medium Term Fiscal projections 1. The 2011/12 and Medium Term Budget

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Differential Privacy Machine Learning Li Xiong Big Data + Machine Learning + Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

CS535 Big Data 1/22/2020 Sangmi Lee Pallickara CS535 Big Data | Computer Science Department

COMP9313: Big Data Management Introduction to Big Data Management What is big data? Tweeted by

The Future of Small and The Future of Small and Medium Sized Nuclear Medium Sized Nuclear

Big Changes for Small Agencies Daniel Schiavone Outline Intro - 2017 Drupal Business Survey How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

High Definition Oscilloscopes Embargo Date - October 22 nd 2012 Oscilloscope Evolution Digital

Efficient Analysis of Multidimensional Linear Systems for Wordlength Optimization Gal Deest

Large-Scale R-CNN with Classifier Adaptive Quantization Redmon et al., ECCV 2016 Mincheul Kang

Autonomous SoC for Fuzzy Robot Path Tracking Kyriakos M. Deliparaschos, George P. Moustris,

Berry Phases with Time Reversal Invariance Characterization of Gapped Quantum Liquids & LSM

North Palos School District 117s Reopening Plan for 2020-2021 Board of Education Meeting July

Containment and Control WHO Pandemic Phases Phase Period Level of Risk 1 Interpandemic period

Security And Facilitation PRESENTED BY: KWADWO ABREFA SARKODIE REGIONAL AIRPORT MANAGER

Small, Medium, and Big Data: Application of Machine Learning Methods - PowerPoint PPT Presentation

Small, Medium, and Big Data: Application of Machine Learning Methods to the Solution of Real-World Imaging and Printing Problems A Personal Journey Jan Allebach Electronic Imaging Systems Laboratory (EISL) Purdue University 1 May 2018 SCV

Machine Learning Anders Holst SICS Big Data Analytics Analysis Big Data Big Value Big Data

Big Data Algorithms with Medical Applications Yixin Chen Outline Challenges to big data

Big Data Personal data Personal (Big) Data Personal Information Economy luk vervenne x

x ? Machine Learning 5/4/20 Tim Althoff, UW CS547: Machine Learning for Big Data,

Presentation Outline 1. Medium Term Fiscal projections 1. The 2011/12 and Medium Term Budget

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

MACHINE LEARNING kernels 1 MACHINE LEARNING 2012 MACHINE LEARNING Kernels: Intuition How

Differential Privacy Machine Learning Li Xiong Big Data + Machine Learning + Machine

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

MICROSOFT AZURE MACHINE LEARNING Oscar Naim Microsoft Microsoft Azure Machine Learning What is

MACHINE LEARNING Overview 1 1 APPLIED MACHINE LEARNING 2011-2012 APPLIED MACHINE LEARNING

CS535 Big Data 1/22/2020 Sangmi Lee Pallickara CS535 Big Data | Computer Science Department

COMP9313: Big Data Management Introduction to Big Data Management What is big data? Tweeted by

The Future of Small and The Future of Small and Medium Sized Nuclear Medium Sized Nuclear

Big Changes for Small Agencies Daniel Schiavone Outline Intro - 2017 Drupal Business Survey How

A Machine Learning Approach A Machine Learning Approach A Machine Learning Approach A Machine

High Definition Oscilloscopes Embargo Date - October 22 nd 2012 Oscilloscope Evolution Digital

Efficient Analysis of Multidimensional Linear Systems for Wordlength Optimization Gal Deest

Large-Scale R-CNN with Classifier Adaptive Quantization Redmon et al., ECCV 2016 Mincheul Kang

Autonomous SoC for Fuzzy Robot Path Tracking Kyriakos M. Deliparaschos, George P. Moustris,

Berry Phases with Time Reversal Invariance Characterization of Gapped Quantum Liquids &amp; LSM

North Palos School District 117s Reopening Plan for 2020-2021 Board of Education Meeting July

Containment and Control WHO Pandemic Phases Phase Period Level of Risk 1 Interpandemic period

Security And Facilitation PRESENTED BY: KWADWO ABREFA SARKODIE REGIONAL AIRPORT MANAGER

Berry Phases with Time Reversal Invariance Characterization of Gapped Quantum Liquids & LSM