Nameless Feature Selection Challenge Attempt By Ran Gilad-Bachrach and Amir Navot
Overview
• In most cases we used standard “out of the box” algorithms
• Obvious modifications for balanced error were made
• A novel feature selection algorithm was introduced (distBased)
• We probably overfit by trying too many algorithms with too many parameter settings
Classification Methods
• SVM
  – We used the SVM toolbox by Gavin Cawley (University of East Anglia, England)
• Naïve Bayes
  – Good-Turing zero correction
• Perceptron
  – Aggressive version (Crammer et al.)
Feature Selection Methods
• MI1
  – Features are scored by the mutual information between the feature value and the labels
  – Non-binary values were compared to the median
• MI2
  – Same as MI1, except that zero-valued features are assumed to be sleeping
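
The MI1 score can be sketched as follows. This is a minimal illustration, not the authors' code: the function names are hypothetical, the median is assumed to be taken per feature, and "compared to the median" is assumed to mean a strict `value > median` test.

```python
import math
from collections import Counter
from statistics import median


def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def mi1_score(feature_values, labels):
    """Binarize one feature at its median (assumption), then score it by MI."""
    m = median(feature_values)
    binarized = [v > m for v in feature_values]
    return mutual_information(binarized, labels)


labels = [-1, -1, 1, 1]
informative = mi1_score([0.1, 0.2, 0.8, 0.9], labels)    # separates the classes
uninformative = mi1_score([0.1, 0.9, 0.2, 0.8], labels)  # independent of labels
```

Features would then be ranked by this score and the top-scoring ones kept; how many to keep is not stated on the slide.
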
Feature Selection Methods – Cont.
• DistBased
  – CGNT02 defined the proper margin for prototype-based algorithms (Nearest Neighbor, LVQ, SVM-RBF)
  – The margin of an instance is the difference between its distance to the closest negative prototype and its distance to the closest positive prototype
  – We selected the features that maximize this margin
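
The margin definition above can be illustrated with a short sketch. Several things here are assumptions rather than the authors' method: the training instances themselves serve as prototypes (leave-one-out), "positive" is read as "same label as the instance" and "negative" as "different label", distances are Euclidean over the selected feature indices, and the helper names are hypothetical. The search procedure used to maximize the margin is not described on the slide and is not shown.

```python
import math


def distance(a, b, selected):
    """Euclidean distance restricted to the selected feature indices."""
    return math.sqrt(sum((a[i] - b[i]) ** 2 for i in selected))


def margin(x, label, prototypes, selected):
    """Distance to the closest other-class prototype minus the closest same-class one."""
    same = [distance(x, p, selected) for p, l in prototypes if l == label]
    other = [distance(x, p, selected) for p, l in prototypes if l != label]
    return min(other) - min(same)


def total_margin(data, selected):
    """Sum of leave-one-out margins; every other instance acts as a prototype."""
    total = 0.0
    for i, (x, y) in enumerate(data):
        rest = data[:i] + data[i + 1:]
        total += margin(x, y, rest, selected)
    return total


# Toy data: feature 0 separates the classes, feature 1 does not.
data = [((0.0, 5.0), 1), ((1.0, -3.0), 1), ((10.0, 4.0), -1), ((11.0, -4.0), -1)]
score_f0 = total_margin(data, [0])
score_f1 = total_margin(data, [1])
```

A feature subset with a larger total margin is preferred; in this toy example feature 0 scores higher than feature 1.
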
Arcene – Observation
• The data has a clear hierarchical structure, which can be revealed by clustering
• The figure shows the mutual distance between instances
• The instances were reordered by k-means
Arcene – Algorithm
• Normalization: the maximum absolute value of each feature was set to 1
• Representation: PCA
• Feature selection: distBased; 81 principal components were used
• Classification: SVM
  – Kernel: rbf(0.005)
  – C=8
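
The normalization step, which also appears in the Gisette and Madelon pipelines, amounts to dividing each feature by its largest absolute value. A minimal sketch, with a hypothetical function name and the assumption that an all-zero feature is left at zero:

```python
def max_abs_normalize(rows):
    """Scale each column so that its maximum absolute value becomes 1."""
    n_features = len(rows[0])
    scales = [max(abs(r[j]) for r in rows) for j in range(n_features)]
    return [
        # Assumption: a column that is zero everywhere is left as zeros.
        [r[j] / scales[j] if scales[j] else 0.0 for j in range(n_features)]
        for r in rows
    ]


normalized = max_abs_normalize([[2.0, -4.0, 0.0], [1.0, 8.0, 0.0]])
```
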
Gisette – Algorithm
• Normalization: the maximum absolute value of each feature was set to 1
• Feature selection: MI1
• Classification: aggressive perceptron with a limit set to 600 (i.e. we require that y(w · x) > 600 for each (x, y) in the training set)
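
The margin requirement can be sketched as a perceptron that updates whenever y(w · x) does not exceed the limit, rather than only on misclassification. This is an illustration under assumptions: it uses the plain additive perceptron update, which may differ from the Crammer et al. version, and the function name and `max_epochs` cap are hypothetical.

```python
def train_aggressive_perceptron(data, limit, max_epochs=100):
    """Update on every example whose margin y * (w . x) is not above `limit`."""
    w = [0.0] * len(data[0][0])
    for _ in range(max_epochs):
        updated = False
        for x, y in data:
            if y * sum(wi * xi for wi, xi in zip(w, x)) <= limit:
                # Plain perceptron step (assumption): w <- w + y * x
                w = [wi + y * xi for wi, xi in zip(w, x)]
                updated = True
        if not updated:
            break  # every training example now satisfies y * (w . x) > limit
    return w


toy = [((1.0, 0.0), 1), ((-1.0, 0.0), -1)]
w = train_aggressive_perceptron(toy, limit=5)
margins = [y * sum(wi * xi for wi, xi in zip(w, x)) for x, y in toy]
```

On separable data the loop stops once every training margin exceeds the limit; otherwise it stops at the epoch cap.
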
Dexter – Algorithm
• Normalization: none
• Feature selection: MI1
• Classification: Transductive SVM
  – Kernel: linear
  – C=10
  – 3 transduction rounds, adding 15% of the unlabeled sample in each round
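
The round structure can be sketched as a loop that repeatedly labels part of the unlabeled sample with the current model and retrains. The slide does not say how the added points were chosen, so picking the most confidently classified ones is an assumption. The base learner below is a nearest-centroid classifier used purely as a stand-in so the sketch stays self-contained; it is not an SVM, and all names are hypothetical.

```python
import math


def fit_centroids(data):
    """Stand-in base learner (NOT an SVM): one centroid per class."""
    sums, counts = {}, {}
    for x, y in data:
        acc = sums.setdefault(y, [0.0] * len(x))
        for j, v in enumerate(x):
            acc[j] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def score(centroids, x):
    """Signed score: positive means closer to the +1 centroid."""
    return math.dist(x, centroids[-1]) - math.dist(x, centroids[1])


def transductive_rounds(labeled, unlabeled, rounds, fraction):
    """Each round, label a fixed fraction of the unlabeled sample and retrain."""
    labeled = list(labeled)
    pool = list(unlabeled)
    batch = max(1, round(fraction * len(unlabeled)))
    model = fit_centroids(labeled)
    for _ in range(rounds):
        if not pool:
            break
        # Assumption: add the points the current model is most confident about.
        pool.sort(key=lambda x: abs(score(model, x)), reverse=True)
        chosen, pool = pool[:batch], pool[batch:]
        labeled += [(x, 1 if score(model, x) > 0 else -1) for x in chosen]
        model = fit_centroids(labeled)
    return model, labeled


seed = [((0.0,), -1), ((10.0,), 1)]
unlabeled = [(1.0,), (2.0,), (8.0,), (9.0,), (4.0,), (6.0,)]
model, grown = transductive_rounds(seed, unlabeled, rounds=2, fraction=0.5)
```

With the Dexter settings this would be called with `rounds=3` and `fraction=0.15`, so that 45% of the unlabeled sample has been added by the end.
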
Dorothea – Algorithm
• Normalization: none
• Feature selection: MI2
• Classification: Naïve Bayes
  – Good-Turing zero correction
Madelon – Algorithm
• Normalization: the maximum absolute value of each feature was set to 1
• Feature selection: distBased
• Classification: Transductive SVM
  – Kernel: rbf(50)
  – C=5
  – 13 transduction rounds, adding 10% of the unlabeled data in each round
