P. Hamk, I, Kopeek, R. Olejek , J. Plhk LAB OF SOFTWARE - PowerPoint PPT Presentation

DIALOGUE-BASED INFORMATION RETRIEVAL FROM IMAGES P. Hamřík, I, Kopeček, R. Ošlejšek , J. Plhák LAB OF SOFTWARE ARCHITECTURES AND INFORMATION SYSTEMS FACULTY OF INFORMATICS MASARYK UNIVERSITY

2 R. Ošlejšek, ICCHP'14, Paris Motivation – Communicative Images Communicative image ● An image enabling users to explore its content by means – of dialogues. Window to the depicted world fully accessible through – natural language.

3 R. Ošlejšek, ICCHP'14, Paris Key Principles – Annotated Pictures Semantics: System of OWL/RDF ontologies for picture ● annotation and shared multilingual knowledge. Defjnes grammar of the dialogue system. Graphic format: SVG as fmexible XML wrapper enabling us ● to embed the original raster image together with structured semantics

4 R. Ošlejšek, ICCHP'14, Paris Key Principles – Dialogue Subsystem Restricted grammars (only a small fragment of natural ● language): Generic grammar: “Describe picture.” , “What is in the – picture” , etc. What-Where Language: “Where is object?” , “What is in – the upper-left corner?” . Experimental domain-specifjc grammars: Fine-tuned for – concrete picture. Dialogue frames: templates for questions with slots that ● can be fjlled by specifjc entries from ontologies. “How far is it from SLOT1 to SLOT2?” –

5 R. Ošlejšek, ICCHP'14, Paris Workfmow ● Client (e.g. plug-in to web browser) sends an image to our server The image can be communicative or not – JavaEE server providing REST services – ● Server embeds the image in SVG and do additional preprocessing Auto-detection and image-recognition techniques would – help to gather initial semantics [in development] ● Client sends questions (sentences) to the server, dialogue module parses the questions, inspects ontology and composes answer No intelligence on the client side –

6 R. Ošlejšek, ICCHP'14, Paris Demo ● Exact transcription of our experimental system Adjusted for screen readers: ● – http://lsd.fj.muni.cz/~xplhak/gate/ With styles: ● – http://lsd.fj.muni.cz/~xplhak/virtualgate/

7 R. Ošlejšek, ICCHP'14, Paris Goal 1: identify (hidden) image What is in the middle? There are Philip, Jesus, T able, John, Peter, Thomas, Judas and James Major in the central part. What is Jesus doing? In this picture, Jesus is Sitting. Hwo is author? This painting was created by Leonardo da Vinci. When did Leonardo create the painting? Painting has been fjnished in 1498.

8 R. Ošlejšek, ICCHP'14, Paris Goal 2: Find Judas Judas? Where is Judas? Judas is in the middle of the picture. How Judas looks? In this picture, Judas is Withdrawn. Is Judas next to Jesus? From your point of view, Judas is not next to Jesus. Judas is on the left of Jesus.

9 R. Ošlejšek, ICCHP'14, Paris Judas Goal 2: Find Judas (cont.) and Peter What is Peter doing? In this picture, Peter is T alking and Standing. Is Judas in front of Peter? From your point of view, Judas is in front of Peter.

10 R. Ošlejšek, ICCHP'14, Paris User evaluation ● 4 blind users and 4 sighted users ● T esting scenarios – Start the interaction with the picture in any way you like. And end it at any point you like. – If the user haven’t done it in the previous scenario, then: ● Obtain general information about the picture ● Learn who painted the painting in the picture. ● List all people in the picture. ● ... ● Evaluation: quantitative and qualitative questionnaire

11 R. Ošlejšek, ICCHP'14, Paris Current Limits and Future Goals Manual annotation ● Boring and exhausting, prone to errors even when using – supporting tools like Protege. Auto-learning dialogue strategy ● User question “What is the castle behind Jane?” – indicates that there is some castle and some object called Jane in the picture. The communicative picture takes over the initiative to – learn more about these two things, asking the user “Who or what is Jane?” and then extending the ontology with these new facts.

12 R. Ošlejšek, ICCHP'14, Paris Current Limits and Future Goals (cont.) Manually confjgured dialogues ● Carefully prepared and fjne-tuned grammars and – dialogue frames for concrete domain (picture content). Dialogues generated from ontologies ● Frames driven by ontology structure – Object and data properties = frames (utterances). – Classes and datatypes involved in properties = slots. – Individuals = slot values. –

13 R. Ošlejšek, ICCHP'14, Paris Questions? Thank you for your attention

P. Hamk, I, Kopeek, R. Olejek , J. Plhk LAB OF SOFTWARE - PowerPoint PPT Presentation

DIALOGUE-BASED INFORMATION RETRIEVAL FROM IMAGES P. Hamk, I, Kopeek, R. Olejek , J. Plhk LAB OF SOFTWARE ARCHITECTURES AND INFORMATION SYSTEMS FACULTY OF INFORMATICS MASARYK UNIVERSITY 2 R. Olejek, ICCHP'14, Paris Motivation

DNS in OpenStack What is the OpenStack DNS API? https://gra.ham.ie | @grahamhayes 1 Graham

CASE 25: DC 10 JUMBO JET [Real-World] Jake Fekete Tej aswini Gautham Lej ia Peng Daniel S

GERMANY AND AUSTRIA Europe with ith: Germany Austria Werfen Austria Sa Saalfeld Germany

c*,nr+-T)1,.^ ,a,],;.) \ Nsr'pLh,. c-^-a- 5(n) = Aj \anet.({) lo yI}ii,;:", J atwa(r) = 4

! th Lt, 3L! a.y =o l" +o, l'+ Repe.leJ .l toc rlr ch.rid c tt cr: t.G p.rh.'ar..

HamSCI HAM RADIO SCIENCE CITIZEN INVESTIGATION HOW HAM RADIO OPERATORS (CITIZEN SCIENTISTS) ARE

WL2K During Emergencies Ham Radio operators will ill be doin ing More of th their ir

of of Ch Chat atham ham Chatha ham Counci cil l on Ag Aging Needs s As Asses essme

Todays Presenters Kelli Ham Kelli Ham Consumer Health and Technology Coordinator, National

Ham Sandwich Theorem Carola Wenk 3/8/16 1 CMPS 6640/4040 Computational Geometry Ham-Sandwich

ARM Reference Kernel of Tizen 3 MyungJoo Ham, Ph.D. MyungJoo Ham Tizen System Domain &

Linux for the Frugal HAM Linux: A computer operating system like Windows or IOS (Mac/Apple)

Ham Branch Nomination Presentation to Rivers Management Advisory Committee (RMAC) Concord, NH

BROADBAND HAM NETWORK SLSRC Meeting Friday, September 27, 2013 Bill Wallace W0WHW 1

Selection, Gender and the Impact of Schooling Type in the Dhaka Slums John C. Ham 1 Saima Khan 2 1

Meet the Team: Big Ham McChaCho [Yeseul Monica Cho] [Brent Biglin] [Yoomi Cha] [Dixie Hamilton]

ELECTRONIC WD-10 TRAINING http://www.dol.gov/whd/programs/ dbra/wd10/index.htm Electronic WD-10

Accelerating Document Retrieval and Ranking for Cognitive Applications Presenters: Tim Kaldewey

Quantitative Evaluation of Passage Retrieval Algorithms for Question Answering Stefanie Tellex,

Video Retrieval using Speech and Image Information Alexander G. Hauptmann, Rong Jin, and Tobun D.

2 3 4 5 6

with Continuous Relevance Feedback KALICIAK, MYRHAUG, AND GOKER Content Image Retrieval

Cross-Language Information Retrieval Carol Peters ISTI-CNR, Pisa Cross-Language Information

Multiagent Reactive Plan Application Learning in Dynamic Environments H useyin Sevay

Sambuz

Useful Links

Newsletter

Mail Us

P. Hamk, I, Kopeek, R. Olejek , J. Plhk LAB OF SOFTWARE - PowerPoint PPT Presentation

DIALOGUE-BASED INFORMATION RETRIEVAL FROM IMAGES P. Hamk, I, Kopeek, R. Olejek , J. Plhk LAB OF SOFTWARE ARCHITECTURES AND INFORMATION SYSTEMS FACULTY OF INFORMATICS MASARYK UNIVERSITY 2 R. Olejek, ICCHP'14, Paris Motivation

DNS in OpenStack What is the OpenStack DNS API? https://gra.ham.ie | @grahamhayes 1 Graham

CASE 25: DC 10 JUMBO JET [Real-World] Jake Fekete Tej aswini Gautham Lej ia Peng Daniel S

GERMANY AND AUSTRIA Europe with ith: Germany Austria Werfen Austria Sa Saalfeld Germany

c*,nr+-T)1,.^ ,a,],;.) \ Nsr'pLh,. c-^-a- 5(n) = Aj \anet.({) lo yI}ii,;:&quot;, J atwa(r) = 4

! th Lt, 3L! a.y =o l&quot; +o, l'+ Repe.leJ .l toc r*lr ch.rid c tt cr: t.G p.rh.'ar..*

HamSCI HAM RADIO SCIENCE CITIZEN INVESTIGATION HOW HAM RADIO OPERATORS (CITIZEN SCIENTISTS) ARE

WL2K During Emergencies Ham Radio operators will ill be doin ing More of th their ir

of of Ch Chat atham ham Chatha ham Counci cil l on Ag Aging Needs s As Asses essme

Todays Presenters Kelli Ham Kelli Ham Consumer Health and Technology Coordinator, National

Ham Sandwich Theorem Carola Wenk 3/8/16 1 CMPS 6640/4040 Computational Geometry Ham-Sandwich

ARM Reference Kernel of Tizen 3 MyungJoo Ham, Ph.D. MyungJoo Ham Tizen System Domain &amp;

Linux for the Frugal HAM Linux: A computer operating system like Windows or IOS (Mac/Apple)

Ham Branch Nomination Presentation to Rivers Management Advisory Committee (RMAC) Concord, NH

BROADBAND HAM NETWORK SLSRC Meeting Friday, September 27, 2013 Bill Wallace W0WHW 1

Selection, Gender and the Impact of Schooling Type in the Dhaka Slums John C. Ham 1 Saima Khan 2 1

Meet the Team: Big Ham McChaCho [Yeseul Monica Cho] [Brent Biglin] [Yoomi Cha] [Dixie Hamilton]

ELECTRONIC WD-10 TRAINING http://www.dol.gov/whd/programs/ dbra/wd10/index.htm Electronic WD-10

Accelerating Document Retrieval and Ranking for Cognitive Applications Presenters: Tim Kaldewey

Quantitative Evaluation of Passage Retrieval Algorithms for Question Answering Stefanie Tellex,

Video Retrieval using Speech and Image Information Alexander G. Hauptmann, Rong Jin, and Tobun D.

2 3 4 5 6

with Continuous Relevance Feedback KALICIAK, MYRHAUG, AND GOKER Content Image Retrieval

Cross-Language Information Retrieval Carol Peters ISTI-CNR, Pisa Cross-Language Information

Multiagent Reactive Plan Application Learning in Dynamic Environments H useyin Sevay

Sambuz

Useful Links

Newsletter

Mail Us

c*,nr+-T)1,.^ ,a,],;.) \ Nsr'pLh,. c-^-a- 5(n) = Aj \anet.({) lo yI}ii,;:", J atwa(r) = 4

! th Lt, 3L! a.y =o l" +o, l'+ Repe.leJ .l toc rlr ch.rid c tt cr: t.G p.rh.'ar..

ARM Reference Kernel of Tizen 3 MyungJoo Ham, Ph.D. MyungJoo Ham Tizen System Domain &