capturing natural interactions
Nick Campbell Trinity College, Dublin Clarin/FLaReNet Workshop@KTH November 26th, 2009
Thursday 26 November 2009
capturing natural interactions Nick Campbell Trinity College, - - PowerPoint PPT Presentation
capturing natural interactions Nick Campbell Trinity College, Dublin Clarin/FLaReNet Workshop@KTH November 26th, 2009 Thursday 26 November 2009 introduction Speech recognition and synthesis technologies can now be considered mature,
Nick Campbell Trinity College, Dublin Clarin/FLaReNet Workshop@KTH November 26th, 2009
Thursday 26 November 2009
mature, but their simple incorporation into speech-based human-computer interfaces reveals shortcomings in their capabilities.
explicitly to convert between text and spoken modalities, without taking into consideration the complexities of human spoken interaction as the joint creation of mutually understood meaning.
interfaces and make them more intelligent and human-like (through a better understanding of human interaction and communication), then we might make a start by designing improved techniques for efficiently capturing, storing, annotating, and distributing large corpora of natural spoken interactions
Thursday 26 November 2009
Viewing & Distributing Data
in the Humanities, a selection of 18th Century poetic thought (but with an engineering bias) !
Thursday 26 November 2009
modelling human conversational interactions
“quit your books; let Nature be your teacher”
Thursday 26 November 2009
Verse > William Wordsworth > Complete Poetical Works THE TABLES TURNED, 1798
UP! up! my Friend, and quit your books; Or surely you'll grow double: Up! up! my Friend, and clear your looks; Why all this toil and trouble? The sun, above the mountain's head, A freshening lustre mellow Through all the long green fields has spread, His first sweet evening yellow. Books! 'tis a dull and endless strife: Come, hear the woodland linnet, How sweet his music! on my life, There's more of wisdom in it. And hark! how blithe the throstle sings! He, too, is no mean preacher: Come forth into the light of things, Let Nature be your teacher. She has a world of ready wealth, Our minds and hearts to bless-- Spontaneous wisdom breathed by health, Truth breathed by cheerfulness. One impulse from a vernal wood May teach you more of man, Of moral evil and of good, Than all the sages can. Sweet is the lore which Nature brings; Our meddling intellect Mis-shapes the beauteous forms of things:-- We murder to dissect. Enough of Science and of Art; Close up those barren leaves; Come forth, and bring with you a heart That watches and receives.
Thursday 26 November 2009
participant behaviour can we gather a corpus that will teach us something new about human conversational interaction
Thursday 26 November 2009
Thursday 26 November 2009
Thursday 26 November 2009
Thursday 26 November 2009
Thursday 26 November 2009
Thursday 26 November 2009
the inherent bias in existing corpora:
differences in “Contact Management” between corpora
ʻoptionalʼ dimension, since this aspect of communication is not reflected in most existing dialogue act annotation schemes (6 out of 18). It was noticed, however, that for some types of dialogues, e.g. phone conversations or tele- conferences (as in the OVIS corpus), this aspect may be important.”
Thursday 26 November 2009
Thursday 26 November 2009
Annotating, Viewing and Distributing New Data
There are presently several tools for manual annotation of data that each store the results in a prescribed format, easy for dissemination, but my experience of working with these and of talking with people who use them regularly is that the task is tedious, and the framework often restrictive. Rather than prescribe a standard at this time, we might benefit more from creating a support group whereby people who annotate data regularly can communicate and share samples, tools, and formats for rapid assisted evolution. My LREC 2010 paper (A Software Toolkit for Viewing Annotated
Multimodal Data Interactively over the Web) may be relevant here.
Thursday 26 November 2009
Thursday 26 November 2009
A Software Toolkit for Viewing Annotated Multimodal Data Interactively over the Web
Thursday 26 November 2009
Thursday 26 November 2009
derived annotations, data streams, and compressed video versions .... flash movie format (xxx.flv) appears to offer the most efficient service and access software ........
Thursday 26 November 2009
By sharing a corpus, we stand to gain added annotation levels. We should also examine crowd-sourcing in this respect. As with our own FreeTalk corpus (www.speech-data.jp), by making the initial data public and co-operating worldwide with interested partners, the annotations can be grown as researchers with different interests contribute their own layers of knowledge. Since the world of multimodal corpora is still young, perhaps the most we might expect from this initial meeting is the opening up of channels whereby the exchange of sources and resources might take place.
Thursday 26 November 2009
standards” but we try to keep a flexible, open- minded approach
each from their own viewpoints, using different software and both ‘top-down’ (theory driven) vs ‘bottom-up’ (data driven) approaches ....
Thursday 26 November 2009
‘balanced’ and ‘representative’ speech corpus
complex multimodal data packages
and share these types of data
and interchange of related annotations & data
Thursday 26 November 2009
Thursday 26 November 2009