Identifying and inferring objects from textual descriptions of - - PowerPoint PPT Presentation

identifying and inferring objects from textual
SMART_READER_LITE
LIVE PREVIEW

Identifying and inferring objects from textual descriptions of - - PowerPoint PPT Presentation

Identifying and inferring objects from textual descriptions of scenes from books Andrew Cropper Outline Text-to-scene conversion (TTSC) TTSC from books WordNet Implementation Experiments Conclusions and future


slide-1
SLIDE 1

Identifying and inferring objects from textual descriptions of scenes from books

  • Andrew Cropper
slide-2
SLIDE 2

Outline

  • Text-to-scene conversion (TTSC)
  • TTSC from books
  • WordNet
  • Implementation
  • Experiments
  • Conclusions and future work
slide-3
SLIDE 3

Text-to-scene conversion

“The lawn mower is 5 feet tall. John pushes the lawn mower. The cat is 5 feet behind John. The cat is 10 feet tall.”

slide-4
SLIDE 4

Text-to-scene conversion

“The lawn mower is 5 feet tall. John pushes the lawn mower. The cat is 5 feet behind John. The cat is 10 feet tall.”

slide-5
SLIDE 5

TTSC from books

“I was going to email Van and Jolu to tell them about the hassles with the cops, but as I put my fingers to the keyboard, I stopped again.”

slide-6
SLIDE 6

TTSC from books

“I was going to email Van and Jolu to tell them about the hassles with the cops, but as I put my fingers to the keyboard, I stopped again.”

?

slide-7
SLIDE 7

TTSC from books

words reader scene

slide-8
SLIDE 8

TTSC from books

words ? scene

slide-9
SLIDE 9

TTSC from books

words POS tagging scene

slide-10
SLIDE 10

POS tagging

“She placed the pen on the desk”

slide-11
SLIDE 11

POS tagging

“She placed the pen on the desk”

  • she/PRP placed/VBD the/DT pen/NN on/IN the/DT desk/NN
slide-12
SLIDE 12

POS tagging

“She placed the pen on the desk”

slide-13
SLIDE 13

POS tagging limitations

“Whilst talking about the weather, she placed the pen on the desk”

slide-14
SLIDE 14

POS tagging limitations

“Whilst talking about the weather, she placed the pen on the desk”

  • whilst/IN talking/VBG about/IN the/DT weather/NN ,/,

she/PRP put/VBD the/DT pen/NN on/IN the/DT table/NN

slide-15
SLIDE 15

TTSC from books

words POS tagging + Wordnet scene

slide-16
SLIDE 16

Wordnet

slide-17
SLIDE 17

Wordnet

45 logical categories, including:

  • noun.person: denoting people
  • noun.location: denoting spatial position
  • noun.communication: denoting communicative

processes and contents

  • noun.artifact: denoting man-made objects
slide-18
SLIDE 18

Wordnet

“Whilst talking about the weather, she placed the pen on the desk”

  • <noun.phenomenon>S: (n) weather, weather condition,

conditions, atmospheric condition (the atmospheric conditions that comprise the state of the atmosphere in terms of temperature and wind and clouds and precipitation)

  • <noun.artifact>S: (n) pen (a writing implement with a point

from which ink flows)

  • <noun.artifact>S: (n) table (a piece of furniture having a

smooth flat top that is usually supported by one or more vertical legs)

slide-19
SLIDE 19

WordNet limitations

(why we need POS + WordNet)

“The politician wishes to table an amendment to the proposal”

  • The/DT politician/NN wishes/VBZ to/TO table/VB an/DT

amendment/NN to/TO the/DT proposal/NN noun.artifact in Wordnet

slide-20
SLIDE 20

TTSC from books - what we have

“She placed the pen on the desk”

slide-21
SLIDE 21

TTSC from books - what we want

“She placed the pen on the desk”

slide-22
SLIDE 22

Automatic TTSC from books

words POS tagging + Wordnet + Wikipedia scene

slide-23
SLIDE 23

Wikipedia

slide-24
SLIDE 24

Wikipedia

slide-25
SLIDE 25

Implementation notes

  • Python + Natural Language Toolkit
  • Wikipedia export pages
  • Tokenising, POS tagging, singularise plurals, aggregate

synonyms

  • Identify objects by the noun.artifact category
  • Look at the corresponding Wikipedia page for each

potential object in a scene.

  • Rank objects by tfidf
slide-26
SLIDE 26

Experiments

anachronism, noun

  • a thing belonging or appropriate to a period other

than that in which it exists, especially a thing that is conspicuously old-fashioned: the town is a throwback to medieval times, an anachronism that has survived the passing years.

slide-27
SLIDE 27

Experiments

Corey Doctorow’s Little Brother, manually parsed

slide-28
SLIDE 28

Objects identified

good: bed, computer, picture, telephone, projector, screen, microscope, bag, keyboard

  • bad: room, ceiling, wall
  • ugly: jail, camp, room, filter, radar
slide-29
SLIDE 29

Objects missed

“I hooked up my Xbox as soon as I got to my room” Not in Wordnet

slide-30
SLIDE 30

Objects inferred

slide-31
SLIDE 31

Conclusions

  • Use Wikipedia and WordNet to identify explicit objects

and infer implicit objects from scenes from a book

  • Able to infer implicit objects such as keyboard and

screen by identifying explicit objects such as computer

  • Future work
  • Better weighting scheme
  • Use more sophisticated NLP techniques, such as using

word-sense disambiguation

slide-32
SLIDE 32

References

Terry Winograd. Procedures as a representation for data in a computer program for understanding natural language. Technical report, DTIC Document, 1971.

  • Bob Coyne and Richard Sproat. Wordseye: an automatic text-to-scene

conversion sys- tem. In Proceedings of the 28th annual conference on Computer graphics and interactive techniques, pages 487–496. ACM, 2001.

  • Richard Sproat. Inferring the environment in a text-to-scene conversion
  • system. In Proceedings of the 1st international conference on Knowledge

capture, pages 147–154. ACM, 2001.

  • George A Miller. Wordnet: a lexical database for english. Communications of

the ACM, 38(11):39–41, 1995.

  • Angel X Chang, Manolis Savva, and Christopher D Manning. Semantic

parsing for text to 3d scene generation. ACL 2014, page 17, 2014.

slide-33
SLIDE 33

Thank you