The Process: Running Experiments, Writing, Presenting LING575 - PowerPoint PPT Presentation

The Process:   Running Experiments, Writing, Presenting LING575 Analyzing Neural Language Models Shane Steinert-Threlkeld February 6 2020 h/t Bowman, MacCartney, Manning, Potts, … 1

Running Experiments 2

Getting Started ● As soon as possible (all in your shared repo): ● Find/build code to read your data ● Find/build evaluation code ● If you’re using e.g. diagnostic classifiers, use existing libraries’ evaluations ● For some analysis projects, this might be harder to find ● Get simplest version possible of pipeline running (e.g. one pre-tained model) ● Play with very small / toy data, etc., so you can iterate quickly 3

Experiments ● Main point: log everything!! (Think: modern lab notebook.) ● For each experiment, record (e.g. in a spreadsheet): ● Command ran ● Any relevant parameters included here ● Including random seeds! (specify via command-line or, e.g. in AllenNLP config) ● [NB: `allennlp train` writes the conf to the serialization dir] ● Git checkpoint used ● Notes on why you ran / what the outcome was 4

Iterate ● Once the basic infrastructure is setup, research becomes an “anytime algorithm” ● Submit condor jobs, wait, log / analyze results, think about what to do next ● Your future self will also very strongly thank you for keeping detailed records ● Very helpful when writing 5

Writing a Paper 6

Typical Format ● Conference papers: eight (or four) pages, two-column ACL format ● Sections: ● Introduction ● Related Work (possibly later) ● Model/proposal ● Data ● Experimental setup ● Results ● Discussion ● Conclusion (future work / possible follow-ups) 7

Writing Styles (Shieber) ● Continental: “one states the solution with as little introduction or motivation as possible, sometimes not even saying what the problem was. … Readers will have no clue as to whether you are right or not without incredible efforts in close reading of the paper, but at least they'll think you're a genius.” ● Historical: “a whole history of false starts, wrong attempts, near misses, redefinitions of the problem. … a careful reader can probably follow the line of reasoning that the author went through, and use this as motivation. But the reader will probably think you are a bit addle-headed. Why would you even think of trying half the stuff you talked about?” ● Rational Reconstruction: “You don’t present the actual history that you went through, but rather an idealized history that perfectly motivates each step in the solution. … The goal in pursuing the rational reconstruction style is not to convince the reader that you are brilliant (or addle-headed for that matter) but that your solution is trivial . It takes a certain strength of character to take that as one’s goal.” ● It’s a story, but the characters are ideas, not people (not you, not previous researchers). 8

Introduction ● 1-2 paragraphs general setup + motivation ● Somewhat general, but with some citations to prior work ● Culminating in your main research question / hypothesis ● 1 paragraph summary of main contributions and results of your paper ● How you’re advancing the state of knowledge just described ● 1 paragraph “sign-posting” the rest of the paper ● More than just “Next is methods, then results, then discussion.” 9

Related Work ● Brief discussions of prior research that’s related to your paper ● NOT a mere summary of everything that’s come before ● Should be used as part of motivation: ● Limitations in prior work ● Differences between it and yours ● (If this is hard to do without seeing your results first, can be put towards end of paper.) 10

Model / Proposal ● Goal: a researcher in the field should be able to roughly reproduce your experiments from reading this section ● Complete reproducibility details can be in appendices / code repositories ● Describe: datasets, models, evaluations ● Citing existing examples when possible ● Include math only if necessary for understanding, not for its own sake ● Some tips for formatting 11

Results ● Tables, elaborating your evaluations in your different conditions ● Ideally: ● Comparisons to baselines (when applicable) ● Several runs / random seeds (avg plus std) ● Guide the reader through the main take-aways: tables are hard to read! 12

Discussion ● What do we learn from the results? ● Frame in terms of your motivating question / hypothesis ● A great place for some qualitative analysis ● Example outputs ● Suggestions for what might be causing results 13

Conclusion ● One sentence re-iterating the design ● Drive home the take-away message; make sure the reader knows what the main point is ● Finish with future work / next directions ● Not necessarily what you are going to do, but what kinds of questions this work opens up 14

Publishing and Presenting 15

From course to conference ● Course papers are “proto-papers” ● Ask the right question / formulate the right hypothesis ● Preliminary results with suggestive conclusions ● Paper: ● More thorough controls / experiments ● Detailed analysis and discussion ● Think in terms of “audience design”: who’s the intended reader, and how can you convince them to be excited about your project 16

Abstract ● Open with broad overview: glimpse of the main problem ● Middle: elaborate, by connecting with the central results of the paper ● Finish: link the results with broader questions / implications ● So reviewer / reader can easily answer: does it make a substantive / original contribution 17

Venues ● Major conferences: ACL, EMNLP, COLING, CoNLL, CogSci, AAAI, ICML, NeurIPS, ICLR, … ● Upcoming: ● COLING: April 8 ● EMNLP: May 11 ● BlackboxNLP: July 15 ● This is an archival workshop; many are attached to the big conferences 18

Venues (cont) ● While there are obvious time pressures for your CVs, there’s always another conference ● Do the best work you can, find the right home for it ● arXiv: in general, do post there; the CL/NLP communities follow it ● But: don’t rush! It can become authoritative, impact your reputation ● Check: anonymity periods of major conferences ● EG: ACL doesn’t allow posting within one month of deadline, and no major advertising on social media of arXiv papers 19

How the Sausage Gets Made ● You submit your paper, with keywords and abstract ● Magical mixing plus lots of hard work by Area Chairs matches reviewers with papers ● Reviewers provide comments ● Authors respond ● … ● Decisions made ● NB: rejection is the mode!! Many hard decisions have to be made; often feels arbitrary. Nothing to be ashamed of. Try and try again. 20

What Reviewers Do ● ACL form, almost entirely: ● What is the paper about? Main strengths and weaknesses? ● Reasons to accept ● Reasons to reject ● Overall recommendation (numeric) ● Reviewer confidence (numeric) ● Feedback for authors: ● Questions ● Missing references ● Typographic 21

Presentations 22

Basic Structure ● Mirrors paper, but briefer ● Beginning: ● What problem? Why is it interesting? Why have previous solutions failed? ● Middle: ● Data, model, evaluation ● End: ● Results, what techniques contributed the most, examples 23

Pullum’s Five (Six) Rules ● Don’t ever begin with an apology ● Don’t ever underestimate the audience’s intelligence ● Respect time limits ● Don’t survey the whole damn field ● Remember that you’re an advocate, not the defendant ● Expect questions that will floor you 24

My Guiding Principle ● The audience is intelligent, but also tired. And you are the expert on your own work. ● Your talk will be amazingly successful if each audience member can remember one thing from it. ● So: make compelling figures. ● Don’t be afraid to be repetitive: they’re hearing this for the first time and you’re an expert. Tell them the take-home message a few times. 25

Practical points ● Turn off notifications ● Make sure your screen stays awake ● Shut down running applications ● Make sure desktop/browser/anything is free of content you don’t want the world to see ● If using Google Slides/Keynote/Powerpoint, make a PDF backup 26

Q&A ● Mainly: make the audience feel like their question has been addressed. ● Try to view it as joint inquiry, not an interrogation. ● Pause before answering ● Be honest when you don’t know. ● But say more than “I don’t know.” Add “but…” Or “That reminds me of…” “One thing that suggests to me…” ● Questions don’t always make sense. Try to bend it into something that does and that makes the questioner feel valued. Everyone will love you. 27

Next Time ● Special Topics presentations! ● Reminder: everyone is expected to contribute to the discussion. Come to class having done the suggested readings. ● I will post more explicit guidelines about final papers and presentations (at the final presentation fest) soon. 28

The Process: Running Experiments, Writing, Presenting LING575 - PowerPoint PPT Presentation

The Process: Running Experiments, Writing, Presenting LING575 Analyzing Neural Language Models Shane Steinert-Threlkeld February 6 2020 h/t Bowman, MacCartney, Manning, Potts, 1 Running Experiments 2 Getting Started As soon as

Red- -Light Running Light Running Red Red-Light Running 2 Traffic Signals Traffic Signals

Red- -Light Running Light Running Red Red-Light Running 2 Traffic Signals Traffic Signals

Lesson 5 Emphasis WRITING CAN BE WRITING CAN BE BOLD WRITING CAN BE BOLD COLOR WRITING CAN

11-823 Conlanging Writing Writing Systems Different Writing Systems What makes a writing

Writing for Funding Part 1: General Writing and Writing for Specific Review Alicia J. Knoedler,

Writing Your CAREER Proposal Writing Your CAREER Proposal 2016 NSF/CMMI CAREER Proposal Writing

Writing and SPAG Lickey Hills Primary School and Nursery November 2017 Aims: Writing Writing

Leah Soule 27 March 2010 Virtual Writing Center 2.0 A virtual writing center, or online writing

Writing Linguistics 203 Languages of the World Writing and Language Many people associate

Writing Home 8: Formal and Informal Writing We We use formal language for: Writing to

Experiments on deflection of charged Experiments on deflection of charged Experiments on

Running Probabilistic Running Probabilistic Running Probabilistic Programs Backwards Programs

Running Time Why do we need to analyze the running Algorithm/Running Time Analysis time of a

D7: Front-running Race conditions #7: Front ont-running running A form of a race condition

Looking and Writing: Teaching Writing in the Discipline Teaching Writing in the Discipline of

>>>CLICK HERE<<< Powerpoint presentation writing persuasive essay Shawinigan.

Interoperabilty: The SAGA Approach and Experience Shantenu Jha, Andre Merzky, Ole Weidner &

A Transformation Approach for Classifying ALCHI(D) Ontologies with a Consequence-based ALCH

Interactive NanoAOD analysis Nick Amin Aug 19, 2019 Introduction Condor jobs have a lot of

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

CS 6453 LECTURE 6: MESOS PLATFORM REUBEN RAPPAPORT WHAT IS THE PROBLEM? There are many

P ROGRAMMING IN CUDA Tams Budavri / The Johns Hopkins University 7/18/2012 How I got into

WISENT Distributed processing of energy meteorologic data with Condor Jan Ploski September 14th,

FX Options Trading Class 8 Gregory McDermott, MApp Fin OU Chief FX Strategist U.S.

The Process: Running Experiments, Writing, Presenting LING575 - PowerPoint PPT Presentation

The Process: Running Experiments, Writing, Presenting LING575 Analyzing Neural Language Models Shane Steinert-Threlkeld February 6 2020 h/t Bowman, MacCartney, Manning, Potts, 1 Running Experiments 2 Getting Started As soon as

Red- -Light Running Light Running Red Red-Light Running 2 Traffic Signals Traffic Signals

Red- -Light Running Light Running Red Red-Light Running 2 Traffic Signals Traffic Signals

Lesson 5 Emphasis WRITING CAN BE WRITING CAN BE BOLD WRITING CAN BE BOLD COLOR WRITING CAN

11-823 Conlanging Writing Writing Systems Different Writing Systems What makes a writing

Writing for Funding Part 1: General Writing and Writing for Specific Review Alicia J. Knoedler,

Writing Your CAREER Proposal Writing Your CAREER Proposal 2016 NSF/CMMI CAREER Proposal Writing

Writing and SPAG Lickey Hills Primary School and Nursery November 2017 Aims: Writing Writing

Leah Soule 27 March 2010 Virtual Writing Center 2.0 A virtual writing center, or online writing

Writing Linguistics 203 Languages of the World Writing and Language Many people associate

Writing Home 8: Formal and Informal Writing We We use formal language for: Writing to

Experiments on deflection of charged Experiments on deflection of charged Experiments on

Running Probabilistic Running Probabilistic Running Probabilistic Programs Backwards Programs

Running Time Why do we need to analyze the running Algorithm/Running Time Analysis time of a

D7: Front-running Race conditions #7: Front ont-running running A form of a race condition

Looking and Writing: Teaching Writing in the Discipline Teaching Writing in the Discipline of

&gt;&gt;&gt;CLICK HERE&lt;&lt;&lt; Powerpoint presentation writing persuasive essay Shawinigan.

Interoperabilty: The SAGA Approach and Experience Shantenu Jha, Andre Merzky, Ole Weidner &amp;

A Transformation Approach for Classifying ALCHI(D) Ontologies with a Consequence-based ALCH

Interactive NanoAOD analysis Nick Amin Aug 19, 2019 Introduction Condor jobs have a lot of

CS344M Autonomous Multiagent Systems Patrick MacAlpine Department of Computer Science The

CS 6453 LECTURE 6: MESOS PLATFORM REUBEN RAPPAPORT WHAT IS THE PROBLEM? There are many

P ROGRAMMING IN CUDA Tams Budavri / The Johns Hopkins University 7/18/2012 How I got into

WISENT Distributed processing of energy meteorologic data with Condor Jan Ploski September 14th,

FX Options Trading Class 8 Gregory McDermott, MApp Fin OU Chief FX Strategist U.S.

>>>CLICK HERE<<< Powerpoint presentation writing persuasive essay Shawinigan.

Interoperabilty: The SAGA Approach and Experience Shantenu Jha, Andre Merzky, Ole Weidner &