Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab - PowerPoint PPT Presentation

Nov 01, 2023 •260 likes •365 views

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab Duquesne University mryan, jnoecker @ jgaap.com Tools JGAAP (Java Graphical Authorship Attribution Program) - a modular test bed for authorship attribution methods.

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab Duquesne University mryan, jnoecker @ jgaap.com
Tools  JGAAP (Java Graphical Authorship Attribution Program) - a modular test bed for authorship attribution methods.  All methods used are either available in JGAAP or were extensions of it  Source code for the methods used in this experiment is available at jgaap.com
Mixture of Experts  Combined three Authorship Attribution techniques  Each technique assigns a vote on the author of the document  If there is not majority author assume the author was not in the sample group
Centroid L1  Break documents into feature vectors of character 3- grams using relative frequencies of 3-grams  Build Centroids for the known authors  Take the average of that authors feature vectors  Measure the L1 Distance between the authors’ centroids and the unknown’s feature vector  Assign your vote to the author whose centroid had the smallest L1 Distance
WEKA SMO  Break documents into feature vectors of character 3- grams using relative frequencies of 3-grams  Train WEKA’s Sequential Minimal Optimization Support Vector Machines (SMO) using the known authors’ feature vectors  SMO will rate authors similarity  Assign a vote to the most similar author
Repeated Microdocument Analysis  Break all documents into 3,000 character chunks  Reduce all contiguous whitespace to single spaces and all character to lower case  Break chunks into feature vectors of character 11-grams using relative frequencies of 11-grams  Generate Centroids for the known authors  Take the average of the author’s feature vectors  Measure the Intersection Distance between the author centroids and chunks, assigning the closest centroid’s author to each chunk  Vote on the author who receives a majority of the chunks
Author Diarization Method  Break documents into paragraphs  Extract named entities from paragraphs  Group paragraphs with named entities in common  Assume each group is an author  Use the grouped paragraphs as known chunks with Repeated Microdocument Analysis and ungrouped paragraphs as unknowns  Add the ungrouped paragraph that is closest to a group to that group and re-run the analysis until all paragraphs are grouped
Results Problem Number Correct Total Accuracy A 6 6 100% B 7 10 70% C 7 8 87.5% D 10 17 58.8% E 83 90 92.2% F 77 80 96.3% I 12 14 85.7% J 12 16 75.0% Total 214 241 88.8%
Conclusions  These methods show promise with document accuracy of 88.8% and mean accuracy of 83.2%, respectively first and third in the competition.  The method used preformed poorly on open-class problems because they were developed with only closed class in mind, removing the open-class portions changes our accuracies to 91.6% and 88.5%
Future Work  Refine analysis of open-class problems by examining how different experts preform in identifying them and how many experts it takes to reach a conclusion.

Recommend

City of Saint Paul 3K Saint Paul Councilmember Rebecca Noecker December, 2019 To coordinate and

City of Saint Paul 3K Saint Paul Councilmember Rebecca Noecker December, 2019 To coordinate and expand access to high quality early learning to 3 and 4-year-olds in Saint Paul, so that all children are ready for kindergarten and all families

175 views • 13 slides

Monthly & Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff

Monthly & Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff Variations Setting the Context KEs Request Notification of MYT KE requests NEPRA to Ministry of Energy (Power process and approve the Division)

1.04k views • 89 slides

Project Gestur by Reihlo Kyle Carson, Ryan Kaveh, Jon Young, Ryan Lee, Ryan Tsukomoto

Project Gestur by Reihlo Kyle Carson, Ryan Kaveh, Jon Young, Ryan Lee, Ryan Tsukomoto Introduction Introduction Background Overview Subsystems Hardware Jon Young Ryan Kaveh Kyle Carson Construction 4th Year EE 4th Year CE 4th Year CE

257 views • 22 slides

CPSC 875 CPSC 875 John D McGregor John D. McGregor C 8 More Design 3 tier 3 tier Variations

CPSC 875 CPSC 875 John D McGregor John D. McGregor C 8 More Design 3 tier 3 tier Variations Variations Tier vs Layer Tier vs Layer Abstracting away from user Abstracting away from hardware Modes Modes Modes Modes Each mode can

527 views • 42 slides

Variations in the Quality of Variations in the Quality of TN-VPK Classrooms TN-VPK Classrooms

Variations in the Quality of Variations in the Quality of TN-VPK Classrooms TN-VPK Classrooms Dale C. Farran Kerry Hofer Mark Lipsey Carol Bilbrey The Society for Research on Educational Effectiveness Washington, DC, 3/8/14 Research Team

653 views • 26 slides

Repeat Repeat runs/variations on a theme runs/variations on a theme Model

Repeat Repeat runs/variations on a theme runs/variations on a theme Model Participant RESRAD-BIOTA [basics] Sunita Kamboj (ANL, USA) RESRAD-BIOTA [available Mike Wood (Liverpool, UK) software] EA R&D128

752 views • 8 slides

Variations of Parotidectomy Variations of Parotidectomy Indications and Technique

Variations of Parotidectomy Variations of Parotidectomy Indications and Technique Indications and Technique Kerry D. Olsen, M.D. Kerry D. Olsen, M.D. Professor and Chair Professor and Chair Head and Neck Surgery Head and Neck

667 views • 33 slides

Variations on a Theme by Friedman Ali Enayat, G oteborgs Universitet September 5, 2013

Variations on a Theme by Friedman Ali Enayat, G oteborgs Universitet Variations on a Theme by Friedman Ali Enayat, G oteborgs Universitet September 5, 2013 Honorary Doctorate Harvey Friedman, Universiteit Ghent Variations on a Theme

1.14k views • 65 slides

Brownian Motion Variations and Brownian Motion with drift Today: Various variations of

Brownian Motion Variations and Brownian Motion with drift Today: Various variations of Brownian motion, reflected, absorbed, Bo Friis Nielsen 1 Brownian bridge, with drift, geometric Next week 1 DTU Informatics General course overview

376 views • 3 slides

P P Partial Partial-Scan & Scan ti l ti l S S Scan & Scan & S & S

Testability: Lecture Testability: Lecture 24 24 P P Partial Partial-Scan & Scan ti l ti l S S Scan & Scan & S & S Variations Variations Variations Variations Shaahin Hessabi Department of Computer Engineering

338 views • 17 slides

Variations and Brownian Motion with drift Bo Friis Nielsen 1 1 DTU Informatics 02407 Stochastic

Variations and Brownian Motion with drift Bo Friis Nielsen 1 1 DTU Informatics 02407 Stochastic Processes 12, November 27 2018 Bo Friis Nielsen Variations and Brownian Motion with drift Brownian Motion Today: Various variations of Brownian

577 views • 12 slides

Enteric Fermentation: origin of gases, variations, predictions and mitigation Michael Blmmel

Enteric Fermentation: origin of gases, variations, predictions and mitigation Michael Blmmel Outline of Presentation Origin of ruminal CO 2 and CH 4 from fermentation products Causes and implications of variations in ruminal CO 2 and CH

789 views • 34 slides

Renewable Energy Projects Evaluating Tax Risks, Navigating Structural Variations, Leveraging

Presenting a live 90-minute webinar with interactive Q&A Using Inverted Leases to Finance Renewable Energy Projects Evaluating Tax Risks, Navigating Structural Variations, Leveraging Pass-Through Election WEDNESDAY, MARCH 29, 2017 1pm

660 views • 50 slides

Authorship ID at PAN11 What -- Why -- How Patrick Juola Evaluating Variations in Language

Authorship ID at PAN11 What -- Why -- How Patrick Juola Evaluating Variations in Language Laboratory Duquesne University, Pittsburgh PA, USA juola@mathcs.duq.edu Authorship Identification needs little definition among this group

495 views • 11 slides

Accounting, Capital Requirements, and Financial Stability Stephen Ryan Macro Financial Modeling

Accounting, Capital Requirements, and Financial Stability Stephen Ryan Macro Financial Modeling Conference March 10, 2017 Agenda Background: Ryan (2017) and Dou, Ryan (2017) essays and also Acharya, Ryan (2016) Financial stability,

525 views • 39 slides

DEMOCRATIC CONSENSUS: A COMPARATIVE INTRODUCTION TO THE GRAPHENE BLOCKCHAIN RYAN R. FOX

DEMOCRATIC CONSENSUS: A COMPARATIVE INTRODUCTION TO THE GRAPHENE BLOCKCHAIN RYAN R. FOX RYAN@RYANRFOX.COM https://linkedin.com/in/ryanrfox RYAN R. FOX RYAN@RYANRFOX.COM A COMPARATIVE IN INTRODUCTION TO GRAPHENE USING BITCOIN AS THE

571 views • 21 slides

Max-Margin Markov Networks Ben Taskar Carlos Guestrin Daphne Koller Main Contribution The

Max-Margin Markov Networks Ben Taskar Carlos Guestrin Daphne Koller Main Contribution The authors combine a graphic model and a discriminative model and apply it in a sequential learning setting. Graphic models: better at

222 views • 8 slides

Soft-margin SVM, SMO Algorithm, Decision Trees Milan Straka November 25, 2019 Charles

NPFL129, Lecture 6 Soft-margin SVM, SMO Algorithm, Decision Trees Milan Straka November 25, 2019 Charles University in Prague Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics unless otherwise stated Kernel

1.09k views • 27 slides

SMO Algorithm Milan Straka December 02, 2019 Charles University in Prague Faculty of

NPFL129, Lecture 7 SMO Algorithm Milan Straka December 02, 2019 Charles University in Prague Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics unless otherwise stated Kernel Linear Regression 3 O ( D ) D When

936 views • 22 slides

Generating Wannier Function within OpenMX Hongming Weng ( ) Institute of Physics,

Generating Wannier Function within OpenMX Hongming Weng ( ) Institute of Physics, Chinese Academy of Sciences July. 2-12, 2018@ISSP Wave-function in Solids periodical boundary condition 1, Bloch representation R ] = 0 n k (r)

294 views • 28 slides

Markov processes (Markov chains) Construct a Bayes net from these variables: parents? Markov

Markov processes (Markov chains) Construct a Bayes net from these variables: parents? Markov assumption: X t depends on bounded subset of X 0: t 1 Temporal probability models First-order Markov process: P ( X t | X 0: t 1 ) = P ( X t | X t

361 views • 7 slides

Support Vector Machines Marco Chiarandini Department of Mathematics & Computer Science

DM825 Introduction to Machine Learning Lecture 9 Support Vector Machines Marco Chiarandini Department of Mathematics & Computer Science University of Southern Denmark Kernels Soft margins Overview SMO Algorithm Support Vector

641 views • 23 slides

slides Data February 2015 CITATIONS READS 0 58 1 author: Rajeev Piyare Amber Agriculture

See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/271840969 slides Data February 2015 CITATIONS READS 0 58 1 author: Rajeev Piyare Amber Agriculture 28 PUBLICATIONS 785 CITATIONS

463 views • 20 slides

ABSTRACT CHARACTERIZATIONS OF PSEUDODIFFERENTIAL OPERATORS References [1] H. O. Cordes, On

ABSTRACT CHARACTERIZATIONS OF PSEUDODIFFERENTIAL OPERATORS References [1] H. O. Cordes, On pseuso-differential operators and smoothness of special Lie-group represen- tations , Manuscripta Math. 28 (1979) 51-69. [2] Marc A. Rieffel, Deformation

383 views • 12 slides

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab - PowerPoint PPT Presentation

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab Duquesne University mryan, jnoecker @ jgaap.com Tools JGAAP (Java Graphical Authorship Attribution Program) - a modular test bed for authorship attribution methods.

City of Saint Paul 3K Saint Paul Councilmember Rebecca Noecker December, 2019 To coordinate and

Monthly & Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff

Project Gestur by Reihlo Kyle Carson, Ryan Kaveh, Jon Young, Ryan Lee, Ryan Tsukomoto

CPSC 875 CPSC 875 John D McGregor John D. McGregor C 8 More Design 3 tier 3 tier Variations

Variations in the Quality of Variations in the Quality of TN-VPK Classrooms TN-VPK Classrooms

Repeat Repeat runs/variations on a theme runs/variations on a theme Model

Variations of Parotidectomy Variations of Parotidectomy Indications and Technique

Variations on a Theme by Friedman Ali Enayat, G oteborgs Universitet September 5, 2013

Brownian Motion Variations and Brownian Motion with drift Today: Various variations of

P P Partial Partial-Scan & Scan ti l ti l S S Scan & Scan & S & S

Variations and Brownian Motion with drift Bo Friis Nielsen 1 1 DTU Informatics 02407 Stochastic

Enteric Fermentation: origin of gases, variations, predictions and mitigation Michael Blmmel

Renewable Energy Projects Evaluating Tax Risks, Navigating Structural Variations, Leveraging

Authorship ID at PAN11 What -- Why -- How Patrick Juola Evaluating Variations in Language

Accounting, Capital Requirements, and Financial Stability Stephen Ryan Macro Financial Modeling

DEMOCRATIC CONSENSUS: A COMPARATIVE INTRODUCTION TO THE GRAPHENE BLOCKCHAIN RYAN R. FOX

Max-Margin Markov Networks Ben Taskar Carlos Guestrin Daphne Koller Main Contribution The

Soft-margin SVM, SMO Algorithm, Decision Trees Milan Straka November 25, 2019 Charles

SMO Algorithm Milan Straka December 02, 2019 Charles University in Prague Faculty of

Generating Wannier Function within OpenMX Hongming Weng ( ) Institute of Physics,

Markov processes (Markov chains) Construct a Bayes net from these variables: parents? Markov

Support Vector Machines Marco Chiarandini Department of Mathematics & Computer Science

slides Data February 2015 CITATIONS READS 0 58 1 author: Rajeev Piyare Amber Agriculture

ABSTRACT CHARACTERIZATIONS OF PSEUDODIFFERENTIAL OPERATORS References [1] H. O. Cordes, On

Sambuz

Useful Links

Newsletter

Mail Us

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab - PowerPoint PPT Presentation

Michael Ryan, John Noecker Jr Evaluating Variations in Language Lab Duquesne University mryan, jnoecker @ jgaap.com Tools JGAAP (Java Graphical Authorship Attribution Program) - a modular test bed for authorship attribution methods.

City of Saint Paul 3K Saint Paul Councilmember Rebecca Noecker December, 2019 To coordinate and

Monthly &amp; Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff

Project Gestur by Reihlo Kyle Carson, Ryan Kaveh, Jon Young, Ryan Lee, Ryan Tsukomoto

CPSC 875 CPSC 875 John D McGregor John D. McGregor C 8 More Design 3 tier 3 tier Variations

Variations in the Quality of Variations in the Quality of TN-VPK Classrooms TN-VPK Classrooms

Repeat Repeat runs/variations on a theme runs/variations on a theme Model

Variations of Parotidectomy Variations of Parotidectomy Indications and Technique

Variations on a Theme by Friedman Ali Enayat, G oteborgs Universitet September 5, 2013

Brownian Motion Variations and Brownian Motion with drift Today: Various variations of

P P Partial Partial-Scan &amp; Scan ti l ti l S S Scan &amp; Scan &amp; S &amp; S

Variations and Brownian Motion with drift Bo Friis Nielsen 1 1 DTU Informatics 02407 Stochastic

Enteric Fermentation: origin of gases, variations, predictions and mitigation Michael Blmmel

Renewable Energy Projects Evaluating Tax Risks, Navigating Structural Variations, Leveraging

Authorship ID at PAN11 What -- Why -- How Patrick Juola Evaluating Variations in Language

Accounting, Capital Requirements, and Financial Stability Stephen Ryan Macro Financial Modeling

DEMOCRATIC CONSENSUS: A COMPARATIVE INTRODUCTION TO THE GRAPHENE BLOCKCHAIN RYAN R. FOX

Max-Margin Markov Networks Ben Taskar Carlos Guestrin Daphne Koller Main Contribution The

Soft-margin SVM, SMO Algorithm, Decision Trees Milan Straka November 25, 2019 Charles

SMO Algorithm Milan Straka December 02, 2019 Charles University in Prague Faculty of

Generating Wannier Function within OpenMX Hongming Weng ( ) Institute of Physics,

Markov processes (Markov chains) Construct a Bayes net from these variables: parents? Markov

Support Vector Machines Marco Chiarandini Department of Mathematics &amp; Computer Science

slides Data February 2015 CITATIONS READS 0 58 1 author: Rajeev Piyare Amber Agriculture

ABSTRACT CHARACTERIZATIONS OF PSEUDODIFFERENTIAL OPERATORS References [1] H. O. Cordes, On

Sambuz

Useful Links

Newsletter

Mail Us

Monthly & Quarterly Tariff Variations July 2016 to June 2019 Tariff Variations Tariff

P P Partial Partial-Scan & Scan ti l ti l S S Scan & Scan & S & S

Support Vector Machines Marco Chiarandini Department of Mathematics & Computer Science