Cold Case: The Lost MNIST Digits

Feb 03, 2024


The Sherlocks: Chhavi Yadav (NYU) and Léon Bottou (FAIR, NYU)
What about MNIST?
● MNIST is a subset of NIST [1]
● The original MNIST testing set had 60K digits
● It was cut down to 10K digits before further preprocessing
Fig. 1 [2]
This is all the information we have about how MNIST was created!
How did we reconstruct MNIST?
● Using the description on the previous slide and a resampling algorithm found in an ancient Lush codebase (a)
● Hungarian matching algorithm (training set only)
● Inspection of the worst matches
● Fine-tuning of the algorithms
(a) See https://tinyurl.com/y5z7qtcg
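The matching step can be read as a minimum-cost assignment problem: pair each reconstructed digit with one reference digit so that the total distance is as small as possible. The sketch below is a minimal illustration of that idea and not the authors' code. The function names (`l1_distance`, `min_cost_assignment`), the L1 pixel distance, and the tiny four-pixel "images" are all assumptions made for the example.

```python
def l1_distance(a, b):
    """Sum of absolute pixel differences (an assumed distance, for illustration)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def min_cost_assignment(cost):
    """Hungarian algorithm using row and column potentials.

    cost is a rectangular list of lists with rows <= columns.
    Returns match, where match[i] is the column assigned to row i.
    """
    n, m = len(cost), len(cost[0])
    inf = float("inf")
    u = [0] * (n + 1)    # row potentials (1-based)
    v = [0] * (m + 1)    # column potentials (1-based)
    p = [0] * (m + 1)    # p[j] = row matched to column j, 0 if column j is free
    way = [0] * (m + 1)  # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i         # column 0 is a virtual column holding the new row
        j0 = 0
        minv = [inf] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:   # reached a free column: augmenting path found
                break
        while j0 != 0:       # flip the matching along the path
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    match = [0] * n
    for j in range(1, m + 1):
        if p[j] != 0:
            match[p[j] - 1] = j - 1
    return match


# Hypothetical toy data: three "reconstructed" and three "reference" images.
reconstructed = [[0, 9, 9, 0], [9, 0, 0, 9], [9, 9, 0, 0]]
reference = [[9, 1, 0, 8], [8, 9, 1, 0], [0, 8, 9, 1]]
cost = [[l1_distance(r, t) for t in reference] for r in reconstructed]
match = min_cost_assignment(cost)
# Pairs sorted from worst to best match, as candidates for manual inspection.
worst_first = sorted(range(len(match)), key=lambda i: -cost[i][match[i]])
```

Sorting the matched pairs by their remaining distance gives a natural starting point for the "inspection of the worst matches" step.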
Fig. 2: Side-by-side display of the first sixteen digits in the MNIST and QMNIST training sets.
Why use QMNIST?
● The QMNIST test set is 6x the size of the MNIST test set
● Metadata such as writer ID and partition ID
● Download from https://github.com/facebookresearch/qmnist
Overfitting on MNIST?
● MNIST has been around for a quarter century, and many researchers suspect that the immense experimentation has led to overfitting on it.
● We tested previous classifiers on the 50K new samples in the QMNIST test set.
Drop in accuracy going from MNIST to QMNIST50K
(Figure annotation: Close reconstruction)
Fig. 3: MLP error rates for various hidden layer sizes after training on MNIST and testing on MNIST, QMNIST10K, and QMNIST50K.
Consistent drop in accuracy going from MNIST to QMNIST50K
Fig. 4: Scatter plot comparing the MNIST and QMNIST50K testing performance of all the models trained on MNIST during the course of this study.
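To judge whether a small drop in accuracy is meaningful, it helps to attach an uncertainty to each error rate. The sketch below uses the normal approximation to the binomial to compute a 95% interval. The function name `error_rate_interval` and the error counts are hypothetical and are not taken from the slides. It illustrates why a 50K test set gives a tighter estimate than a 10K one.

```python
import math


def error_rate_interval(num_errors, num_samples, z=1.96):
    """Error rate with a normal-approximation confidence interval.

    z=1.96 corresponds to a two-sided 95% interval.
    Returns (rate, lower, upper), with the bounds clipped to [0, 1].
    """
    rate = num_errors / num_samples
    half_width = z * math.sqrt(rate * (1.0 - rate) / num_samples)
    return rate, max(0.0, rate - half_width), min(1.0, rate + half_width)


# Hypothetical counts: the same 1% error rate measured on 10K and 50K samples.
rate_10k, low_10k, high_10k = error_rate_interval(100, 10_000)
rate_50k, low_50k, high_50k = error_rate_interval(500, 50_000)

print(f"10K test set: {rate_10k:.2%} in [{low_10k:.2%}, {high_10k:.2%}]")
print(f"50K test set: {rate_50k:.2%} in [{low_50k:.2%}, {high_50k:.2%}]")
```

With five times as many samples, the interval shrinks by a factor of about the square root of five, so differences between classifiers become easier to resolve.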
Conclusion
● “Testing set rot” exists but is far less severe than feared
● Confirms the trends observed by Recht et al. [3, 4], on a different dataset and in a substantially more controlled setup
● In practice, this suggests that a shifting data distribution is far more dangerous than overusing an adequately distributed testing set
References
[1] Patrick J. Grother and Kayee K. Hanaoka. NIST Special Database 19: Handprinted Forms and Characters Database. 1990.
[2] Léon Bottou et al. Comparison of classifier methods: a case study in handwritten digit recognition. 1994.
[3] Benjamin Recht et al. Do CIFAR-10 Classifiers Generalize to CIFAR-10? 2018.
[4] Benjamin Recht et al. Do ImageNet Classifiers Generalize to ImageNet? 2019.
Thank you.
