make experiment
play

make experiment WMT 2010 workflow management Goals JHU Submission - PowerPoint PPT Presentation

make experiment WMT 2010 workflow management Goals JHU Submission WMT 2010 Running translation pipeline should be easy Easy to understand Easy to configure Easy to monitor Easy to run All results must be reproducible Original Data


  1. make experiment WMT 2010 workflow management

  2. Goals JHU Submission WMT 2010 Running translation pipeline should be easy Easy to understand Easy to configure Easy to monitor Easy to run All results must be reproducible

  3. Original Data Understand Compressed Decompressed the Pipeline Plain text (XML removed) Tokenize Normalize Workflow is complex Decompress remaining files Subsample Subsample run1 run2 Visualize using GraphViz Word Alignments Word Alignments run1 run2 Simple text format Trained LM Trained Grammar Trained Grammar Recasing Model run1 run1 run2 Parameter Optimization nodeName [label=”text”] run1 Translate Test Set nodeA -> nodeB Truecased Translations Output as graphics file Detokenized Translations Score Translations Joshua Machine Translation Workflow

  4. Configuration Configure each step Don’ t repeat yourself Explicitly mark dependencies Challenge: Should each step define variables for each input, or should can steps assume they know what their input is?

  5. Monitor experiments Run results Result dir gets name from its config file Steps are numbered, named, & labelled Challenge: automatic naming of log files Challenge: visualize run status (via remote web interface?)

  6. Dry run, run, re-run See what will be run: $ make --dry-run -f config/014.MERT.de-en.bleu.run1.mk Kick off the job: $ nohup make -f config/014.MERT.de-en.bleu.run1.mk &> 999.logs/014.MERT.de-en.bleu.run1.log & Verify that everything finished: $ make --dry-run -f config/014.MERT.de-en.bleu.run1.mk make: Nothing to be done for `mert'.

  7. Try it out Make scripts defining each logical step: svn co https:/ /joshua.svn.sourceforge.net/svnroot/ joshua/branches/pipeline/wmt10 000.makefiles Make scripts configuring each actual job: svn co https:/ /joshua.svn.sourceforge.net/svnroot/ joshua/branches/pipeline/wmt10-config configure- experiment Experiments to date: a01, a02, a03, a04, a05 /mnt/data/wmt10.labelled

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend