Drivers in Naturalistic Recordings Using Existing Tools Sumit Jha - - PowerPoint PPT Presentation

drivers in naturalistic recordings
SMART_READER_LITE
LIVE PREVIEW

Drivers in Naturalistic Recordings Using Existing Tools Sumit Jha - - PowerPoint PPT Presentation

Challenges in Head Pose Estimation of Drivers in Naturalistic Recordings Using Existing Tools Sumit Jha and Carlos Busso Multimodal Signal Processing (MSP) Laboratory Department of Electrical Engineering, The University of Texas at Dallas,


slide-1
SLIDE 1

msp.utdallas.edu

Challenges in Head Pose Estimation of Drivers in Naturalistic Recordings Using Existing Tools

Sumit Jha and Carlos Busso

Multimodal Signal Processing (MSP) Laboratory Department of Electrical Engineering, The University of Texas at Dallas, Richardson TX-75080, USA

1 Ideal Scenarios Challenging Scenarios

slide-2
SLIDE 2

msp.utdallas.edu

Head Pose Estimation

2

  • The position and orientation of the head find useful

application in multiple interactive environment

  • Human computer interaction
  • Non-verbal communication
  • Visual attention
  • In a vehicle setting
  • Visual attention of driver
slide-3
SLIDE 3

msp.utdallas.edu

Motivations

  • Head Pose estimation in a controlled environment with

limited head motion is a solved (almost) problem

  • Additional challenges in driving environment
  • Wide variation in lighting
  • High head rotations
  • Occlusion
  • Questions
  • What are the factors that affect the performance of Head Pose

Estimation (HPE) algorithms?

  • What are the conditions where,
  • most algorithms work?
  • most algorithms fail?

3

slide-4
SLIDE 4

msp.utdallas.edu

Objective

  • Use a reference head poses in a naturalistic driving dataset

to study factors affecting head pose estimation

  • Glasses
  • Illumination
  • Head rotation
  • Isolate frames which are easiest to process and the ones

that are the most challenging

  • Ideal Scenario – Frames that always give good estimate
  • Challenging Scenario – Frames that always fail estimation

4

slide-5
SLIDE 5

msp.utdallas.edu

Outline

  • Tools and Dataset
  • Factors affecting Head Pose Estimation
  • Ideal Scenarios and Challenging Scenarios
  • Conclusions and Future Work

5

slide-6
SLIDE 6

msp.utdallas.edu

Tools analyzed

  • We analyse 3 state-of-the-art head pose estimation

tools

  • IntraFace [Xiong et al., 2013]

Supervised Gradient Descent (SGD) to track non-linear features associated with each landmarks

  • OpenFace [Baltrusaitis et al., 2016]

Conditional Local Neural Fields(CLNF) which learns the landmark shape and appearance variations

  • Zface [Jeni et al., 2015]

Iteratively build a 3D mesh from the 2D landmarks to register a dense model

6

  • L. A. Jeni, J. F. Cohn, and T. Kanade. Dense 3d face alignment from 2d videos in real-time. In Automatic Face and

Gesture Recognition (FG), 2015 11th IEEE International Conference and Workshops on, volume 1, pages 1–8, Ljubljana, Slovenia, May 2015. IEEE.

  • X. Xiong and F. De la Torre. Supervised descent method and its applications to face alignment. In Proceedings of the

IEEE conference on computer vision and pattern recognition, pages 532–539, Portland, Oregon, June 2013. IEEE.

  • T. Baltrusaitis, P. Robinson, L.-P. Morency, et al. Openface: an open source facial behavior analysis toolkit. In 2016 IEEE

Winter Conference on Applications of Computer Vision (WACV), pages 1–10, Lake Placid, NY, USA, March 2016. IEEE.

slide-7
SLIDE 7

msp.utdallas.edu

Database

  • Collected naturalistic driving data in the UTDrive platform
  • Dash Cameras record the road and

driver’s face

  • Blackvue dr650gw 2 channel
  • Rear camera records the face
  • Front camera records the road
  • Data Collected with 16 subjects

(10 males and 6 females) ~ 6 hours of naturalistic driving

7

slide-8
SLIDE 8

msp.utdallas.edu

AprilTags for Head Pose Estimation

  • AprilTags [Olson, 2011]: 2D barcodes that can be robustly

detected in an image

  • Headband designed with 17 AprilTags
  • Used to establish reference head position and orientation

8

Olson, Edwin. "AprilTag: A robust and flexible visual fiducial system." Robotics and Automation (ICRA), 2011 IEEE International Conference on. IEEE, 2011.

slide-9
SLIDE 9

msp.utdallas.edu

Performance of AprilTag system

  • Accuracy of AprilTag based detection
  • Rendered the band on a head in virtual environment
  • Studied the performance in various quality of

rendering and adding external effects like illumination

9

Data condition Median angle error High Quality render (3840 x 2160) 0.89° Medium Quality render (960 x 540p) 1.26° Data with added illumination 2.69°

slide-10
SLIDE 10

msp.utdallas.edu

Affect of Headband on HPE

  • Head band occludes a part of forehead

which can confuse HPE

  • 7 subject collected without headband for

comparison

  • Frames missed by each algorithm

10

From AprilTag IntraFace OpenFace Zface With AprilTag 5.3 % 27.3 % 24.1 % 21.9 % Without AprilTag

  • 19.0 %

21.8 % 8.9 %

slide-11
SLIDE 11

msp.utdallas.edu

Outline

  • Tools and Dataset
  • Factors affecting Head Pose Estimation
  • Ideal Scenarios and Challenging Scenarios
  • Conclusions and Future Work

11

slide-12
SLIDE 12

msp.utdallas.edu

  • We study factors that effects Head Pose Estimation

in driving environment

  • Glasses
  • Illumination
  • Head Rotation

12

Factors affecting Head Pose Estimation

slide-13
SLIDE 13

msp.utdallas.edu

Occlusion due to glasses

  • Glasses occlude the face affecting performance of HPE
  • Percentage of the total frames that failed detection by each

algorithm

13

Method No Glasses Glasses with thick frame Glasses with Normal frame IntraFace 15.40% 67.70% 18.50% OpenFace 12.10% 67.50% 13.70% Zface 8.80% 64.10% 13.80%

slide-14
SLIDE 14

msp.utdallas.edu

Effect of Illumination

  • Both high and low illumination affects the

quality of image

  • We study the effect of saturation of the face

image

  • Partial or total saturation of face image
  • Performance depends on the third quartile
  • f the face image
  • Third Quartile of the intensity(Q3) is high when

part of the face is saturated

14

slide-15
SLIDE 15

msp.utdallas.edu

Effect of head rotation

  • Face detection and head pose estimation affected by

high head rotation

  • Most tools only work for frontal and semi-frontal faces
  • Naturalistic driving scenario
  • Distribution of head poses
  • Bright – High frequency
  • Dark – Low frequency
  • Most of the time head pose

is frontal

  • The robustness is more crucial

when head pose is non frontal

15

slide-16
SLIDE 16

msp.utdallas.edu

Percentage of Frames Missed by the HPEs at Different Angles

IntraFace OpenFace ZFace

  • Analysis of percentage of face missed by HPE at different

angles

  • Bright pixels – most frames not detected by HPE
  • Dark pixels – few frames not detected by HPE

16

slide-17
SLIDE 17

msp.utdallas.edu

Difference in Angle between estimates from AprilTags and HPEs

17

IntraFace OpenFace ZFace

  • Difference in estimation for the frames detected by

each Algorithm

  • Bright Pixels – Large difference in angular estimation
  • Dark Pixels – Small difference in angular estimation
slide-18
SLIDE 18

msp.utdallas.edu

Outline

  • Tools and Dataset
  • Factors affecting Head Pose Estimation
  • Ideal Scenarios and Challenging Scenarios
  • Conclusions and Future Work

18

slide-19
SLIDE 19

msp.utdallas.edu

Ideal Scenarios(IS) and Challenging Scenarios(CS)

  • We extract two types of frames from the database
  • Ideal Scenarios (IS) : Frames successfully detected and

estimation error less than 10°

  • Challenging Scenarios (CS) : Frames that failed

detection by all the three HPEs

  • Helps in design of more robust algorithms that

work for challenging cases

19

slide-20
SLIDE 20

msp.utdallas.edu

Ideal Scenarios

  • Distribution of Ideal frames at different rotation angles and

illumination

20 IntraFace ✔ OpenFace ✔ ZFace ✔

slide-21
SLIDE 21

msp.utdallas.edu

Challenging Scenarios

  • Distribution of Challenging frames at different rotation

angles and illumination

21 IntraFace X OpenFace X ZFace X

slide-22
SLIDE 22

msp.utdallas.edu

Outline

  • Tools and Dataset
  • Factors affecting Head Pose Estimation
  • Ideal Scenarios and Challenging Scenarios
  • Conclusions and Future Work

22

slide-23
SLIDE 23

msp.utdallas.edu

Conclusions and Future Work

23

  • Open access face processing tools have limited reliability in

naturalistic driving environment

  • Reliable estimation of head pose can be useful in designing

smart systems in car

  • Future Work
  • A more robust reference system with minimal obtrusion
  • Investigate and evaluate other modalities such as depth sensing

cameras

  • Extend the database with more subjects under varying

conditions

slide-24
SLIDE 24

msp.utdallas.edu 24

msp.utdallas.edu