Fine-grained Flower and Fungi Classification at CMP

  1. Fine-grained Flower and Fungi Classification at CMP. Milan Šulc¹, Lukáš Picek², Jiří Matas¹. ¹ Visual Recognition Group, Center for Machine Perception, Dept. of Cybernetics, FEE, Czech Technical University in Prague. ² Dept. of Cybernetics, FAS, University of West Bohemia.

  2. Label Distributions. The FGVCx Flower and Fungi classification datasets available for training follow a "long-tail distribution" of classes, which may not correspond to the test-time distribution. [Figure: % of samples per class (sorted) in the FGVCx Flowers training set and in the FGVCx Fungi training and validation sets.]

  3. Label Distributions. We recently observed a similar problem in the LifeCLEF plant identification challenge: the majority of training data comes from the web, while test images come from a different source. Can we compensate for this imbalance? [Figure: PlantCLEF 2017 label distribution in the "trusted" training set; # images per class (sorted by N_train) in the training and test sets.] [1] Milan Šulc and Jiří Matas. Improving CNN classifiers by estimating test-time priors. arXiv:1805.08235 [cs.CV], 2018.

  4. CNN Outputs as Posterior Estimates. Training a neural network f (with parameters θ) by cross-entropy loss minimization means training it to estimate the class posterior probabilities: f_k(x, θ) ≈ P(c_k | x), where, by Bayes' rule, P(c_k | x) = p(x | c_k) P(c_k) / p(x). The estimated posteriors therefore depend on the class priors P(c_k) of the training set.

  5. CNN Outputs as Posterior Estimates. Experiment on selected subsets of CIFAR-100 with different class priors: how well do the posterior estimates marginalize over the dataset samples, i.e. how close is the mean of the predicted posteriors to the class priors? [1] Milan Šulc and Jiří Matas. Improving CNN classifiers by estimating test-time priors. arXiv:1805.08235 [cs.CV], 2018.
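The marginalization property behind this experiment can be checked on synthetic data, where exact posteriors are available in closed form. A minimal sketch (not from the slides; a two-class 1-D Gaussian mixture is assumed): averaging the true posteriors over the dataset recovers the class priors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two classes with unequal priors; class-conditional densities are 1-D Gaussians.
priors = np.array([0.8, 0.2])
means = np.array([-1.0, 1.0])

n = 100_000
labels = rng.choice(2, size=n, p=priors)
x = rng.normal(means[labels], 1.0)

# Exact posteriors P(c_k | x) by Bayes' rule (the density constant cancels out).
lik = np.exp(-0.5 * (x[:, None] - means[None, :]) ** 2)  # unnormalized N(x; mu_k, 1)
post = lik * priors
post /= post.sum(axis=1, keepdims=True)

# Averaging posteriors over the dataset recovers the class priors.
print(post.mean(axis=0))  # ≈ [0.8, 0.2]
```

By the law of total expectation E[P(c_k | x)] = P(c_k), so a well-calibrated classifier's mean output should match the training priors; the CIFAR-100 experiment tests how well CNN outputs satisfy this.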

  6. Adjusting Estimates to New Priors. Assuming that the class-conditional probability density functions remain unchanged between training and test time, p_train(x | c_k) = p_test(x | c_k), the mutual relation of the posteriors is: P_test(c_k | x) ∝ P_train(c_k | x) · P_test(c_k) / P_train(c_k), normalized so that the adjusted posteriors sum to one over the classes.
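This adjustment amounts to re-weighting the predicted posteriors by the prior ratio and re-normalizing. A hedged sketch (the function name is ours, not from the slides):

```python
import numpy as np

def adjust_to_new_priors(posteriors, train_priors, test_priors):
    """Re-weight posterior estimates P(c_k | x) produced by a classifier
    trained under `train_priors` so they are valid under `test_priors`,
    assuming the class-conditionals p(x | c_k) do not change."""
    w = np.asarray(test_priors) / np.asarray(train_priors)
    adjusted = np.asarray(posteriors) * w     # proportional to p(x | c_k) * P_test(c_k)
    return adjusted / adjusted.sum(axis=-1, keepdims=True)

# Example: a prediction made under skewed training priors, corrected to uniform test priors.
p = np.array([[0.6, 0.4]])
print(adjust_to_new_priors(p, train_priors=[0.9, 0.1], test_priors=[0.5, 0.5]))  # ≈ [[1/7, 6/7]]
```

Note how the correction can flip the decision: the raw output favors class 0 only because class 0 dominated the training set.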

  7. When Test Set Priors Are Unknown. How to estimate the test-set priors? Saerens et al. [1] proposed a simple EM procedure to maximize the likelihood L(x_1, x_2, ...) of the observed test samples. This procedure is equivalent [2] to a fixed-point-iteration minimization of the KL divergence between the test-set input distribution and the class-conditional mixture re-weighted by the estimated priors. [1] Marco Saerens, Patrice Latinne, and Christine Decaestecker. Adjusting the outputs of a classifier to new a priori probabilities: a simple procedure. Neural Computation 14.1 (2002): 21-41. [2] Marthinus Christoffel du Plessis and Masashi Sugiyama. Semi-supervised learning of class balance under class-prior change by distribution matching. Neural Networks, 50:110-119, 2014.
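The EM procedure of Saerens et al. alternates between adjusting the posteriors with the current prior estimate (E-step) and re-estimating the priors as the mean of the adjusted posteriors (M-step). A minimal sketch, assuming the classifier outputs for the test set are collected in an (N, K) array:

```python
import numpy as np

def estimate_test_priors(posteriors, train_priors, n_iter=100):
    """EM estimate of the test-set class priors from classifier posteriors
    trained under `train_priors` (after Saerens et al., 2002)."""
    post = np.asarray(posteriors)
    train_priors = np.asarray(train_priors)
    test_priors = train_priors.copy()          # initialize with the training priors
    for _ in range(n_iter):
        # E-step: posteriors adjusted to the current prior estimate.
        adj = post * (test_priors / train_priors)
        adj /= adj.sum(axis=1, keepdims=True)
        # M-step: new prior estimate = mean adjusted posterior per class.
        test_priors = adj.mean(axis=0)
    return test_priors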

  8. Test Set Prior Estimation in LifeCLEF. Preliminary experiments (using the 2017 test set for validation): ● When the whole test set is available: Inception-ResNet-v2: 82.9% → 85.8%; Inception-v4: 82.8% → 86.3%. ● On-line [1], updating the prior estimate after each new test image. [1] Milan Šulc and Jiří Matas. Improving CNN classifiers by estimating test-time priors. arXiv:1805.08235 [cs.CV], 2018.

  9. When New Priors Are Known. [Figure: FGVCx Fungi 2018 class distribution (% of samples per class, sorted) in the training and validation sets.]

  10. When New Priors Are Known. Note: in the iNaturalist 2017 challenge, the winning GMV submission [1] approached the change in priors as follows: "To compensate for the imbalanced training data, the models were further fine-tuned on the 90% subset of the validation data that has a more balanced distribution." We, instead, only use the validation-set statistics – i.e. a uniform class distribution in this case. [1] Grant Van Horn, Oisin Mac Aodha, Yang Song, Yin Cui, Chen Sun, Alex Shepard, Hartwig Adam, Pietro Perona, and Serge Belongie. The iNaturalist Species Classification and Detection Dataset: Supplementary Material. CVPR 2018.

  11. When New Priors Are Known. [Figure: Inception-v4 on FGVCx Fungi 2018; validation accuracy (%) over training steps, comparing the raw CNN output accuracy with predictions re-weighted to the known (flat) test distribution.] [1] Milan Šulc and Jiří Matas. Improving CNN classifiers by estimating test-time priors. arXiv:1805.08235 [cs.CV], 2018.

  12. Tricks Used in Both Challenges. ● Predictions re-weighted, simply assuming uniform class priors. ● Moving average of trained variables (exponential decay). ● Training-time augmentation: random crops, color distortions. ● Test-time data augmentation: 14 views per image (7 crops × 2 mirror flips).
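The 14-view test-time augmentation can be sketched as below. The exact crop layout is our assumption (the slides only state "7 crops × 2 mirror"); center, four corners, the full image, and a tighter center crop are a common choice:

```python
import numpy as np

def fourteen_views(image, crop=0.875):
    """Generate 7 crops x 2 mirror flips of an HWC image.
    Crop layout is an assumption: center, four corners, full image,
    and a tighter center crop."""
    h, w, _ = image.shape
    ch, cw = int(h * crop), int(w * crop)
    offsets = [((h - ch) // 2, (w - cw) // 2),   # center
               (0, 0), (0, w - cw),              # top corners
               (h - ch, 0), (h - ch, w - cw)]    # bottom corners
    crops = [image[y:y + ch, x:x + cw] for y, x in offsets]
    crops.append(image)                          # full image
    th, tw = int(h * crop * crop), int(w * crop * crop)
    ty, tx = (h - th) // 2, (w - tw) // 2
    crops.append(image[ty:ty + th, tx:tx + tw])  # tighter center crop
    return [v for c in crops for v in (c, c[:, ::-1])]  # add mirror flips

views = fourteen_views(np.zeros((299, 299, 3)))
print(len(views))  # 14
```

In practice each view is resized to the network input size, and the 14 softmax outputs are averaged into a single prediction per image.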

  13. Final Ensembles. FGVCx Fungi: 6 nets (averaged): 2× Inception-v4 299×299 initialized from ImageNet and LifeCLEF checkpoints; 2× Inception-v4 598×598 initialized from ImageNet and LifeCLEF checkpoints; 2× Inception-ResNet-v2 299×299 initialized from ImageNet and LifeCLEF checkpoints. FGVCx Flowers: 5 nets (mode): 3× Inception-v4 299×299 initialized from ImageNet, LifeCLEF and iNaturalist checkpoints; 1× Inception-v4 598×598 initialized from a LifeCLEF checkpoint; 1× Inception-ResNet-v2 299×299 initialized from a LifeCLEF checkpoint.
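The two ensembling rules differ: the Fungi ensemble averages the member posteriors before deciding, while the Flowers ensemble takes the mode (majority vote) of the member decisions. A sketch of both, assuming an (M, N, K) array of per-model posteriors for M models, N images, K classes:

```python
import numpy as np

def ensemble_average(probs):
    """Average member posteriors, then decide (rule used for FGVCx Fungi)."""
    return probs.mean(axis=0).argmax(axis=-1)

def ensemble_mode(probs):
    """Majority vote over member decisions (rule used for FGVCx Flowers)."""
    votes = probs.argmax(axis=-1)                    # (M, N) class decisions
    k = probs.shape[-1]
    counts = np.apply_along_axis(np.bincount, 0, votes, minlength=k)  # (K, N)
    return counts.argmax(axis=0)

# Toy example: 3 models, 2 images, 2 classes.
probs = np.array([[[0.9, 0.1], [0.2, 0.8]],
                  [[0.6, 0.4], [0.4, 0.6]],
                  [[0.3, 0.7], [0.1, 0.9]]])
print(ensemble_average(probs), ensemble_mode(probs))
```

Averaging uses the members' confidence, while the mode weighs every member's decision equally; which works better is an empirical question per dataset.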

  14. Leaderboard. [Leaderboard standings for FGVCx Fungi and FGVCx Flowers.]

  15. Discussion. ● Standard CNN classifiers (and their ensembles) achieve the best results in plant and fungi recognition. ● Future work: learning from ensembles? ● It is important to take the change in class prior distribution into account [1]. ● New priors can be estimated on-line, as new test samples appear. ● Q & A: sulcmila@cmp.felk.cvut.cz. [1] Milan Šulc and Jiří Matas. Improving CNN classifiers by estimating test-time priors. arXiv:1805.08235 [cs.CV], 2018.
