Hello! My name is... Buffy – Automatic Naming of Characters in TV Series (PowerPoint PPT Presentation)



SLIDE 1

TEXT, LANGUAGE, AND IMAGERY

Yu-Ting Peng

SLIDE 2

RESOURCE - SCRIPTS

SLIDE 3

RESOURCE - SUBTITLES

SLIDE 4

RESOURCE - NEWS

SLIDE 5

RESOURCE - WIKIPEDIA

SLIDE 6

Paper | Resource | Objective
Names and Faces in the News, by T. Berg, A. Berg, J. Edwards, M. Maire, R. White, Y. Teh, E. Learned-Miller and D. Forsyth, CVPR 2004. | News photos | Name faces
"Hello! My name is... Buffy" – Automatic Naming of Characters in TV Video, by M. Everingham, J. Sivic and A. Zisserman, BMVC 2006. | Movies / TV series (video) | Name faces
Movie/Script: Alignment and Parsing of Video and Text Transcription, by T. Cour, C. Jordan, E. Miltsakaki, and B. Taskar, ECCV 2008. | Movies / TV series (video) | Action retrieval and movie structure recovery
Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers, by A. Gupta and L. Davis, ECCV 2008. | Corel dataset (images) | Learning classifiers for nouns and relationships (prepositions & adjectives)

SLIDE 7

NAMES AND FACES IN THE NEWS

Aim: given an input image and an associated caption, automatically detect faces in the image and possible name strings.

Application: labeling faces in news images, or organizing news pictures by the individuals present.

SLIDE 8

DATASET

Half a million news pictures and captions were collected from Yahoo News over a period of roughly two years, yielding 44,773 face images.

This is more realistic than the usual face recognition datasets: it contains faces captured "in the wild" in a variety of configurations with respect to the camera, with a variety of expressions, and under illumination of widely varying color.

SLIDE 9

PROCEDURE

1. Extract names from the caption.
2. Detect and represent faces.
3. Associate images with names.

SLIDE 10

EXTRACT NAMES FROM THE CAPTION.

Names are proposed by identifying two or more capitalized words followed by a present-tense verb. Words are classified as verbs by first applying a list of morphological rules to present-tense singular forms, and then comparing these to a database of known verbs. This lexicon is matched to each caption, and each face detected in an image is associated with every name extracted from the associated caption.
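The capitalized-words heuristic can be sketched as a small script. This is a toy illustration, not the paper's implementation: the present-tense-verb check against a verb lexicon is omitted, and the regex simply collects runs of two or more capitalized words.

```python
import re

# Toy sketch of the caption name-extraction heuristic: collect runs of
# two to four capitalized words. The paper additionally requires that a
# candidate be followed by a present-tense verb (checked against a verb
# lexicon built via morphological rules); that step is omitted here.
CANDIDATE = re.compile(r"\b(?:[A-Z][a-z]+\s+){1,3}[A-Z][a-z]+\b")

def extract_names(caption):
    """Return candidate name strings from a caption."""
    return CANDIDATE.findall(caption)

print(extract_names("President George Bush waves to Laura Bush in Washington"))
```

Note that lone capitalized words ("Washington") are ignored, which is why the heuristic favors multi-word name strings.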

SLIDE 11

EXAMPLES

SLIDE 12

PROCEDURE

1. Extract names from the caption.
2. Detect and represent faces.
3. Associate images with names.

SLIDE 13

FACE DETECTION & RECTIFICATION

Face detector (K. Mikolajczyk), biased toward frontal faces. Rectify each face to a canonical pose:

  • Geometric blur applied to grayscale patches
  • Five SVMs (trained with 150 hand-clicked faces)
  • Determine the affine transformation which best maps the detected points to canonical positions
  • Remove images with a low rectification score
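The affine-fitting step above can be sketched with a least-squares fit. This is a generic illustration assuming point correspondences are already available; the SVM feature detection itself is not reproduced.

```python
import numpy as np

def fit_affine(src, dst):
    """Least-squares 2x3 affine transform mapping src points onto dst.

    src, dst: (N, 2) arrays of corresponding points, N >= 3. A sketch of
    the rectification step: map detected feature points to canonical
    positions. The residual error can serve as a rectification score.
    """
    X = np.hstack([src, np.ones((len(src), 1))])   # homogeneous coordinates
    M, *_ = np.linalg.lstsq(X, dst, rcond=None)    # (3, 2) solution
    return M.T                                     # (2, 3) affine matrix

def apply_affine(A, pts):
    """Apply a 2x3 affine matrix to (N, 2) points."""
    return np.hstack([pts, np.ones((len(pts), 1))]) @ A.T
```

With more points than the three needed to determine an affine map, the fit averages out localization noise in the detected features.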

SLIDE 14

REPRESENT FACES

Kernel principal component analysis (kPCA): to reduce the dimensionality of the data.

Linear discriminant analysis (LDA): to project the data into a space suited to the discrimination task.
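The two projections can be chained as a pipeline. A minimal scikit-learn sketch on random stand-in data; the component count and RBF kernel are illustrative choices, not the paper's settings.

```python
import numpy as np
from sklearn.decomposition import KernelPCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.pipeline import make_pipeline

# Stand-in data: 60 "face" vectors of dimension 50, three identities.
rng = np.random.default_rng(0)
faces = rng.normal(size=(60, 50))
names = np.repeat(np.arange(3), 20)

# kPCA reduces dimensionality, then LDA projects into a space suited to
# discriminating the identities (n_classes - 1 = 2 dimensions here).
pipeline = make_pipeline(
    KernelPCA(n_components=10, kernel="rbf"),
    LinearDiscriminantAnalysis(),
)
embedded = pipeline.fit(faces, names).transform(faces)
print(embedded.shape)
```

LDA's output dimensionality is bounded by the number of identities minus one, so on real data the projection is far smaller than the kPCA space.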

SLIDE 15

PROCEDURE

1. Extract names from the caption.
2. Detect and represent faces.
3. Associate images with names.

SLIDE 16

MODIFIED K-MEANS CLUSTERING

1. Randomly assign each image to one of its extracted names.
2. For each distinct name (cluster), calculate the mean of the image vectors assigned to that name.
3. Reassign each image to the closest mean among its extracted names.
4. Repeat until convergence.
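The loop can be sketched directly. A toy implementation, assuming each image comes with a descriptor vector and a list of candidate names; the tiny example data are illustrative.

```python
import numpy as np

def constrained_kmeans(vectors, candidates, n_iter=20, seed=0):
    """Modified k-means: each image may only be assigned to one of the
    names extracted from its own caption.

    vectors:    (N, D) array of face descriptors.
    candidates: list of N lists of candidate names.
    Returns the list of N assigned names.
    """
    rng = np.random.default_rng(seed)
    # Step 1: random initial assignment among each image's own candidates.
    assign = [rng.choice(c) for c in candidates]
    all_names = {n for c in candidates for n in c}
    for _ in range(n_iter):
        # Step 2: mean of the vectors currently assigned to each name.
        means = {}
        for name in all_names:
            idx = [i for i, a in enumerate(assign) if a == name]
            if idx:
                means[name] = vectors[idx].mean(axis=0)
        # Step 3: reassign each image to the closest mean among ITS names.
        new = []
        for i, (v, cands) in enumerate(zip(vectors, candidates)):
            scored = [n for n in cands if n in means]
            if scored:
                new.append(min(scored, key=lambda n: np.linalg.norm(v - means[n])))
            else:
                new.append(assign[i])
        if new == assign:  # Step 4: stop at convergence.
            break
        assign = new
    return assign
```

The constraint to caption-extracted names is what distinguishes this from plain k-means: an image can never drift into a cluster whose name its caption never mentioned.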

SLIDE 17

MERGING CLUSTERS

Aim: handle different names that actually correspond to a single person.

Method: merge names that correspond to faces that look the same.
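A greedy union-find version of this merging step, as a sketch: Euclidean distance between cluster means stands in here for the paper's face-appearance similarity, and the threshold is illustrative.

```python
import numpy as np

def merge_close_names(means, threshold):
    """Group names whose cluster means lie within `threshold` of each other.

    means: dict mapping name -> mean descriptor (np.ndarray).
    Returns a list of merged name groups (lists of names).
    """
    names = list(means)
    parent = {n: n for n in names}          # union-find forest

    def find(n):
        while parent[n] != n:
            n = parent[n]
        return n

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if np.linalg.norm(means[a] - means[b]) < threshold:
                parent[find(a)] = find(b)   # union the two name groups

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return list(groups.values())
```

Union-find makes the merge transitive: if "G. Bush" matches "George Bush" and "George Bush" matches "George W. Bush", all three end up in one group.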

SLIDE 18

SLIDE 19

Paper | Resource | Objective
Names and Faces in the News, by T. Berg, A. Berg, J. Edwards, M. Maire, R. White, Y. Teh, E. Learned-Miller and D. Forsyth, CVPR 2004. | News photos | Name faces
"Hello! My name is... Buffy" – Automatic Naming of Characters in TV Video, by M. Everingham, J. Sivic and A. Zisserman, BMVC 2006. | Movies / TV series (video) | Name faces
Movie/Script: Alignment and Parsing of Video and Text Transcription, by T. Cour, C. Jordan, E. Miltsakaki, and B. Taskar, ECCV 2008. | Movies / TV series (video) | Action retrieval and movie structure recovery
Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers, by A. Gupta and L. Davis, ECCV 2008. | Corel dataset (images) | Learning classifiers for nouns and relationships (prepositions & adjectives)

SLIDE 20

THE DIFFICULTY OF FACE RECOGNITION


  • scale, pose
  • lighting
  • partial occlusion
  • expressions

*slides from Andrew Zisserman

SLIDE 21

“HELLO! MY NAME IS... BUFFY”

Aim: automatically label television or movie footage with the identity of the people present in each frame of the video.
SLIDE 22

PROCESS

1. Obtain names.
2. Extract face tracks.
3. Assign labels to faces.

SLIDE 23

OBTAIN NAMES

SLIDE 24

ALIGNMENT BY DYNAMIC TIME WARPING
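Dynamic time warping itself is a small dynamic program. A minimal cost-only sketch, using a generic 1-D distance where the paper uses a word-match cost between script and subtitle text:

```python
def dtw_cost(a, b, dist=lambda x, y: abs(x - y)):
    """Total dynamic-time-warping alignment cost between sequences a, b.

    Classic O(len(a) * len(b)) dynamic program: each cell D[i][j] holds
    the best cost of aligning the prefixes a[:i] and b[:j].
    """
    INF = float("inf")
    n, m = len(a), len(b)
    D = [[INF] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            c = dist(a[i - 1], b[j - 1])
            D[i][j] = c + min(D[i - 1][j],      # stretch: repeat b[j-1]
                              D[i][j - 1],      # stretch: repeat a[i-1]
                              D[i - 1][j - 1])  # advance both sequences
    return D[n][m]
```

Because steps may repeat an element of either sequence, identical content at different speeds (a script line spread over several subtitles, say) still aligns at zero cost.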

SLIDE 25

PROCESS

1. Obtain names.
2. Extract face tracks.
3. Assign labels to faces.

SLIDE 26

DETECT AND TRACK FACES

Face detector by P. Viola and M. Jones, for frontal faces.

KLT tracker: point tracks link face detections over time into tracks.

Tracking reduces the volume of data to be processed and allows stronger appearance models to be built for each character.


*Pictures from Andrew Zisserman

SLIDE 27

FACE TRACKS


*slides from Andrew Zisserman

SLIDE 28

REPRESENTING FACE APPEARANCE


1. Locate facial features (nine features: eyes, nose, mouth).
2. Normalize the face (affine transform).
3. Represent the face (SIFT descriptor or simple pixel-wise descriptor).

SLIDE 29

REPRESENTING CLOTHING APPEARANCE

Matching the appearance of the face alone can be extremely challenging; clothing can provide additional cues.

Clothing appearance is represented by detecting a bounding box containing a person's clothing.

Similar clothing appearance suggests the same character, but different clothing does not necessarily imply a different character.

A straightforward weighting of the clothing appearance relative to the face appearance proved effective.
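The "straightforward weighting" can be read as a convex combination of the two distances; the 0.2 weight below is an illustrative default, not the paper's tuned value.

```python
def track_pair_distance(face_dist, clothing_dist, w_clothing=0.2):
    """Combine face and clothing distances for comparing two face tracks.

    A convex combination keeps the face term dominant, reflecting that
    clothing is only a supporting cue (different clothing does not
    necessarily mean a different character).
    """
    return (1.0 - w_clothing) * face_dist + w_clothing * clothing_dist
```

Keeping the clothing weight small limits the damage when a character changes outfits mid-episode.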

SLIDE 30

SPEAKER AMBIGUITY

SLIDE 31

SPEAKER DETECTION

SLIDE 32

PROCESS

1. Obtain names.
2. Extract face tracks.
3. Assign labels to faces.

SLIDE 33

ASSIGN LABELS TO FACES

*Graphs from Andrew Zisserman


  • Assign names to unlabelled faces by classification based on extracted exemplars
  • Classify tracks by their nearest exemplar
  • Estimate the probability of each class from distance ratios
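The nearest-exemplar step can be sketched as follows. The ratio-based confidence below is a common recipe for turning distances into a score; the paper's exact probability model may differ.

```python
def classify_track(track_desc, exemplars, dist):
    """Nearest-exemplar labelling sketch for a face track.

    exemplars: dict mapping name -> list of exemplar descriptors.
    dist:      distance function between two descriptors.
    Returns (best_name, confidence): the confidence comes from the ratio
    of the two smallest per-name distances.
    """
    per_name = {name: min(dist(track_desc, e) for e in ds)
                for name, ds in exemplars.items()}
    ranked = sorted(per_name, key=per_name.get)
    best, runner_up = ranked[0], ranked[1]
    # Near-equal first and second distances -> confidence near 0.5;
    # a clear winner -> confidence near 1.
    confidence = per_name[runner_up] / (per_name[best] + per_name[runner_up] + 1e-9)
    return best, confidence
```

Thresholding the confidence lets low-certainty tracks be left unlabelled rather than mislabelled.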
SLIDE 34

Paper | Resource | Objective
Names and Faces in the News, by T. Berg, A. Berg, J. Edwards, M. Maire, R. White, Y. Teh, E. Learned-Miller and D. Forsyth, CVPR 2004. | News photos | Name faces
"Hello! My name is... Buffy" – Automatic Naming of Characters in TV Video, by M. Everingham, J. Sivic and A. Zisserman, BMVC 2006. | Movies / TV series (video) | Name faces
Movie/Script: Alignment and Parsing of Video and Text Transcription, by T. Cour, C. Jordan, E. Miltsakaki, and B. Taskar, ECCV 2008. | Movies / TV series (video) | Action retrieval and movie structure recovery
Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers, by A. Gupta and L. Davis, ECCV 2008. | Corel dataset (images) | Learning classifiers for nouns and relationships (prepositions & adjectives)

SLIDE 35

MOVIE/SCRIPT: ALIGNMENT AND PARSING OF VIDEO AND TEXT TRANSCRIPTION

Aim: automatically extract large collections of actions "in the wild".

Method: recover scene structure in movies and TV series.

Applications: semantic retrieval and indexing, browsing by character or object, re-editing, and many more.

SLIDE 36

GENERATIVE MODEL FOR SCENE STRUCTURE

To model the scene structure, the authors propose a unified generative model for joint scene segmentation and shot threading.

The uncovered structure can be used to analyze the content of the video: tracking objects across cuts, action retrieval, and enriching browsing and editing interfaces.

SLIDE 37

VIDEO DECONSTRUCTION

SLIDE 38

ALIGNMENT


Screenplay: dialogues, speaker identity, scene transitions; no time-stamps.

Closed captions: dialogues, time-stamps; nothing else.

SLIDE 39

PRONOUN RESOLUTION AND VERB FRAMES

SLIDE 40

ACTION RETRIEVAL


After pronoun resolution and verb-frame extraction, actions are matched to detected and named characters in the video sequence.

SLIDE 41

Paper | Resource | Objective
Names and Faces in the News, by T. Berg, A. Berg, J. Edwards, M. Maire, R. White, Y. Teh, E. Learned-Miller and D. Forsyth, CVPR 2004. | News photos | Name faces
"Hello! My name is... Buffy" – Automatic Naming of Characters in TV Video, by M. Everingham, J. Sivic and A. Zisserman, BMVC 2006. | Movies / TV series (video) | Name faces
Movie/Script: Alignment and Parsing of Video and Text Transcription, by T. Cour, C. Jordan, E. Miltsakaki, and B. Taskar, ECCV 2008. | Movies / TV series (video) | Action retrieval and movie structure recovery
Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers, by A. Gupta and L. Davis, ECCV 2008. | Corel dataset (images) | Learning classifiers for nouns and relationships (prepositions & adjectives)

SLIDE 42

NOUNS: EXPLOITING PREPOSITIONS AND COMPARATIVE ADJECTIVES FOR LEARNING VISUAL CLASSIFIERS


Aim: to learn classifiers (i.e. models) for nouns and for relationships (prepositions and comparative adjectives).

Example: above(statue, rocks); ontopof(rocks, water); larger(water, statue)

SLIDE 43

LEARNING RELATIONSHIPS

These classifiers are based on differential features extracted from pairs of regions in an image.

SLIDE 44

FEATURES

Each image region is represented by a set of visual features based on appearance and shape (e.g. area, RGB).

The classifiers for nouns are based on these features.

The classifiers for relationships are based on differential features extracted from pairs of regions, such as the difference in area between two regions.
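A sketch of such pair features, with illustrative field names (not the paper's exact feature set):

```python
def differential_features(region_a, region_b):
    """Differential features for a pair of image regions.

    Regions are dicts with 'area', 'rgb' (mean color triple), and
    'centroid' (x, y). Differences of these per-region properties feed
    the relationship classifiers (e.g. 'larger' relates to area_diff).
    """
    ax, ay = region_a["centroid"]
    bx, by = region_b["centroid"]
    return {
        "area_diff": region_a["area"] - region_b["area"],
        "rgb_diff": tuple(a - b for a, b in zip(region_a["rgb"], region_b["rgb"])),
        "dx": ax - bx,   # horizontal offset (cue for left/right)
        "dy": ay - by,   # vertical offset (cue for above/below)
    }
```

Because the features are differences, they are anti-symmetric under swapping the two regions, which matches the directed nature of relationships like above(a, b).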

SLIDE 45

GENERATIVE MODEL


[Graphical-model diagram: image visual features, per-region nouns, parameters of the appearance models, the type of relationship between regions, and parameters of the relationship model.]

SLIDE 46


The r_jk represent the possible words for the relationship between regions (j, k).

SLIDE 47

EM-APPROACH

EM is used to simultaneously solve for the correspondence and learn the parameters of the classifiers.

E-step: evaluate the possible assignments using the parameters obtained in previous iterations.

M-step: using the probability distribution over assignments computed in the E-step, estimate the maximum-likelihood parameters of the classifiers.
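The E/M alternation can be sketched on a toy version of the correspondence problem: regions with bags of candidate nouns, and a spherical-Gaussian appearance model per noun (an illustrative simplification of the paper's classifiers).

```python
import numpy as np

def em_region_word(regions, noun_bags, vocab, n_iter=15):
    """Toy EM for region-noun correspondence.

    regions:   (N, D) array of region feature vectors.
    noun_bags: list of N lists of candidate nouns per region.
    vocab:     iterable of all nouns.
    Learns a mean feature vector per noun (spherical Gaussian).
    """
    # Initialize each noun's mean from every region that mentions it.
    means = {w: regions[[w in bag for bag in noun_bags]].mean(axis=0)
             for w in vocab}
    for _ in range(n_iter):
        # E-step: soft-assign each region among ITS candidate nouns,
        # using the current appearance models.
        resp = []
        for x, bag in zip(regions, noun_bags):
            scores = np.array([np.exp(-np.sum((x - means[w]) ** 2))
                               for w in bag]) + 1e-12
            resp.append(scores / scores.sum())
        # M-step: maximum-likelihood re-estimate of each noun's mean
        # from the soft assignments.
        for w in vocab:
            num = np.zeros(regions.shape[1])
            den = 0.0
            for x, bag, r in zip(regions, noun_bags, resp):
                if w in bag:
                    ri = r[bag.index(w)]
                    num += ri * x
                    den += ri
            if den > 0:
                means[w] = num / den
    return means
```

Even with ambiguous bags, a few unambiguous regions are enough to pull each noun's model toward the right appearance, which then disambiguates the rest.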

SLIDE 48

INFERENCE

A Bayesian network represents the labeling problem, and belief propagation is used for inference.

Previous approaches estimate nouns for regions independently of each other; here, priors on relationships between pairs of nouns constrain the labeling problem.

SLIDE 49

LABELING NEW IMAGES


Example 1: near(birds, sea); below(birds, sun); above(sun, sea); larger(sea, sun); brighter(sun, sea); below(waves, sun)

Example 2: below(coyote, sky); below(bush, sky); left(bush, coyote); greener(grass, coyote); below(grass, sky)

SLIDE 50

LABELING NEW IMAGES


Example 1: below(building, sky); below(tree, building); below(tree, skyline); behind(buildings, tree); blueish(sky, tree)

Example 2: above(statue, rocks); ontopof(rocks, water); larger(water, statue)

Example 3: below(flowers, horses); ontopof(horses, field); below(flowers, foals)

SLIDE 51

CONCLUSION

  • Lots of data out there with both text and images
  • Text provides potential labels for images
  • Scripts give cues about scene structure and the actions performed
  • Understanding the semantics of language can help in disambiguating labels

SLIDE 52

DISCUSSION

Failure cases: extracted names with wrong face associations.

  • What other resources contain both text and images?
  • How can understanding language help with ambiguous labels?