TRACER TUTORIAL: TEXT REUSE DETECTION SELECTION
Marco B¨ uchler, Emily Franzini and Greta Franzini
TRACER TUTORIAL: TEXT REUSE DETECTION SELECTION Mar co B uchler, - - PowerPoint PPT Presentation
TRACER TUTORIAL: TEXT REUSE DETECTION SELECTION Mar co B uchler, Emily Franzini and Greta Franzini TABLE OF CONTENTS 1. Wha t is Selection? 2. Selection techniques 3. Hacking 4. Conclusion and revision 2/29 REMINDER: CURRENT APPROACH 3/29
Marco B¨ uchler, Emily Franzini and Greta Franzini
2/29
3/29
What do you associate with Selection?
5/29
From biometry:
6/29
7/29
sentence);
8/29
10/29
11/29
12/29
13/29
A C D F G E B H I J K s1 1 1 1 1 1 s2 1 1 1 1 1 s3 1 1 1 1 1 s4 1 1 1 1 1 s5 1 1 1 1 1 = F A C D F G E s1 1 1 1 1 s2 1 1 1 1 1 s3 1 1 1 1 1 s4 1 1 1 1 1 s5 1 = S
14/29
F = n
i=1
m′
j=1 sij
n
i=1
m
j=1 fij 15/29
17/29
Tasks:
techniques ...
LocalMaxFeatureFrequencySelectorImpl
18/29
Questions:
distributions” (you find all the information in the Selection folder in e.g. *.meta).
Which influence does the Selection strategy have?
Microsoft Excel or OpenOffice to open the Selection file; sort by columns B and C).
19/29
Hint: The configuration file can be found in: $TRACER HOME/conf/tracer conf.xml
20/29
Stimulus Response prob. Number of prob’s Co-occurrence Significance Butter Bread 60 Bread 51 Soft 40 Cheese 49 Milk 32 Sugar 29 Margarine 27 Milk 23 Cheese 20 Margarine 22 Fat 16 Farina 18 Yellow 14 Eggs 16 Bread and butter 8 Pound 14 Box/can 6 Meat 13 Eat 6
21/29
22/29
23/29
How do Preprocessing and Featuring influence Selection?
25/29
26/29
27/29
Team Marco B¨ uchler, Greta Franzini and Emily Franzini. Visit us http://www.etrap.eu contact@etrap.eu
28/29
The theme this presentation is based on is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Changes to the theme are the work of eTRAP.
29/29