meta interpretive learning of data transformation programs
play

Meta-interpretive learning of data transformation programs Andrew - PowerPoint PPT Presentation

Meta-interpretive learning of data transformation programs Andrew Cropper, Alireza Tamaddoni-Nezhad, Stephen H. Muggleton Imperial College London Input P_011 67 year Output lung disease: n/a, Diagnosis: Unknown P_011 67 Unknown 80.78%


  1. Meta-interpretive learning of data transformation programs Andrew Cropper, Alireza Tamaddoni-Nezhad, Stephen H. Muggleton Imperial College London

  2. Input P_011 67 year Output lung disease: n/a, Diagnosis: Unknown P_011 67 Unknown 80.78% P_003 56 carcinoma P_003 56 P_013 56 pneumonia Diagnosis: carcinoma, lung disease: unknown 20.78 P_013 70 Diagnosis: pneumonia 55.9 • Semi-structured • Positive only learning • Background knowledge

  3. Input P_011 67 year Output lung disease: n/a, Diagnosis: Unknown P_011 67 Unknown 80.78% P_003 56 carcinoma P_003 56 P_013 56 pneumonia Diagnosis: carcinoma, lung disease: unknown 20.78 P_013 70 Diagnosis: pneumonia 55.9 f(A,B):- f2(A,C), f1(C,B). f2(A,B):- find_patient_id(A,C), find_int(C,B). f1(A,B):- open_interval(A,B,[':',' ‘],['','n']). f1(A,B):- open_interval(A,B,[':',' '],[',',' ']).

  4. MetagolD Implementation of meta-interpretive learning*, a form of inductive logic programming based on a Prolog meta-interpreter, which supports predicate invention and the learning of recursive theories * S.H. Muggleton, D. Lin, and A. Tamaddoni-Nezhad. Meta-interpretive learning of higher-order dyadic datalog: Predicate invention revisited. Machine Learning, 100(1):49-73, 2015.

  5. Transformation language • find_sublist/3 • closed_interval/4 • open_interval/4

  6. open_interval/4 and closed_interval/4 Input = [i,n,d,u,c,t,i,o,n], Start = [n,d], End = [t,i] open_interval(Input,[u,c],Start,End). closed_interval(Input,[n,d,u,c,t,i],Start,End).

  7. Experiment: ecological papers Input Harpalus rufipes eats large prey such as Lepidoptera Bembidion lampros. In cereals the main food was Collembola Output Harpalus rufipes eats Lepidoptera Bembidion food Collembola lampros Learned program f(A,B):- f3(A,C), find_species(C,B). f3(A,B):- find_species(A,C), f2(C,B). f2(A,B):- closed_interval(A,B,[f,o],[o,d]). f3(A,B):- find_species(A,C), f1(C,B). f1(A,B):- closed_interval(A,B,[e,a],[t,s]).

  8. Experiment: ecological papers 80 1 delimiter size 1 delimiter size 2 Mean learning time (seconds) Mean predictive accuracy delimiter size 3 60 0 . 8 40 0 . 6 delimiter size 1 20 delimiter size 2 delimiter size 3 default accuracy 0 0 . 4 1 2 3 4 5 1 2 3 4 5 No. training examples No. training examples

  9. Experiment: medical records Input P_011 67 year Output lung disease: n/a, Diagnosis: Unknown P_011 67 Unknown 80.78% P_003 P_003 56 carcinoma 56 P_013 56 pneumonia Diagnosis: carcinoma, lung disease: unknown 20.78 P_013 70 Diagnosis: pneumonia 55.9 f(A,B):- f2(A,C), f1(C,B). f2(A,B):- find_patient_id(A,C), find_int(C,B). f1(A,B):- open_interval(A,B,[':',' ‘],['','n']). f1(A,B):- open_interval(A,B,[':',' '],[',',' ']).

  10. Experiment: medical records 60 1 delimiter size 1 delimiter size 2 Mean learning time (seconds) Mean predictive accuracy delimiter size 3 40 0 . 8 20 0 . 6 delimiter size 1 delimiter size 2 delimiter size 3 default accuracy 0 0 . 4 1 2 3 4 5 1 2 3 4 5 No. training examples No. training examples

  11. Conclusions • MIL is able to generate accurate data transformation programs from a small number of examples • Delimiter size effects learning performance Future work • Apply to problems which require recursion • Generate hypotheses in a scripting language • Probabilistic approaches / noise handling

  12. Thank you

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend