Wearable barcode scanning Advancements in code localization, motion - - PowerPoint PPT Presentation

wearable barcode scanning
SMART_READER_LITE
LIVE PREVIEW

Wearable barcode scanning Advancements in code localization, motion - - PowerPoint PPT Presentation

Wearable barcode scanning Advancements in code localization, motion blur compensation, and gesture control Gbor Srs Doctoral examination ETH Zurich May 3, 2016 Linking the physical and the digital services data 01011101010 objects


slide-1
SLIDE 1

Wearable barcode scanning

Advancements in code localization, motion blur compensation, and gesture control

Doctoral examination ETH Zurich May 3, 2016

Gábor Sörös

slide-2
SLIDE 2

2

Linking the physical and the digital

01011101010 11010110100 01001011010 11001110001

atoms bits data services

  • bjects
slide-3
SLIDE 3

3

Visual codes are everywhere

slide-4
SLIDE 4

Ubiquitous wearable scanners allow us to access information on every physical object

4

Wearable barcode scanning

Smartphones, tablets, watches, glasses

  • are always with us
  • have cameras, sensors, intuitive UI
  • are easily programmable

wearable barcode scanning traditional barcode scanning

Barcode scanners

  • are expensive
  • are used by only few people
  • use proprietary protocols
slide-5
SLIDE 5

5

Challenges

no laser for localization (multiple) small codes defocus and motion blur limited input capabilities

slide-6
SLIDE 6
  • Make wearable barcode scanning an attractive

alternative of traditional laser scanning

  • by compensating the shortcomings,

and adding new features

  • by leveraging the advanced computing and sensing

capabilities of the wearables

6

Research goals

slide-7
SLIDE 7

7

Contributions

Fast and robust localization

  • f visual tags

MUM’13, ICASSP’14

Part I

Fast and robust blur compensation for scanners

WSCG’15, ISWC’15

Part II

Fast and robust gesture control for wearables

BSN’14, UIST’14, CHI’15

Part III

slide-8
SLIDE 8

8

Fast and robust code localization

goals: invariant to size, orientation, blur, symbology

slide-9
SLIDE 9
  • 1D barcodes contain lots of edges

blur deletes many of them

  • 2D barcodes contain lots of corners

blur smears corners but they still remain corners

  • codes are almost always black and white

blur mixes black and white to gray detect areas with edges and/or corners & low saturation in HSV color space

9

Observations

slide-10
SLIDE 10

10

Joint 1D and 2D barcode localization for smartphones

1D 2D

slide-11
SLIDE 11

11

Live localization on the mobile GPU

slide-12
SLIDE 12

smartglasses

12

Results

Our method

  • can localize visual codes of various symbologies
  • with performance like the state of the art
  • without assumptions on code size, code orientation, or code position,

while it is more robust to blur

  • is portable to GPU and a wide range of devices

blurry small big tilted

slide-13
SLIDE 13

13

Multiple codes

sharp

2D works well in both cases 1D sensitive to blur

blurry

slide-14
SLIDE 14

14

Extension to blurry 1D codes

Low S1 and S2 Rectangle detection in the saturation channel

slide-15
SLIDE 15

15

Fast and robust code localization allows:

  • scanning multiple codes simultaneously
  • scanning visual codes from further away
  • scanning blurry codes in the whole image
slide-16
SLIDE 16

16

Contributions

Fast and robust localization

  • f visual tags

MUM’13, ICASSP’14

Part I

Fast and robust blur compensation for scanners

WSCG’15, ISWC’15

Part II

Fast and robust gesture control for wearables

BSN’14, UIST’14, CHI’15

Part III

slide-17
SLIDE 17

17

motion blur makes the codes unreadable

Motion blur compensation

we recover the information from motion-blurred QR codes

gabor.soros@inf.ethz.ch

  • ur input
  • ur output
slide-18
SLIDE 18

18

Basics of blurry image formation

uniform blur model sharp scene

𝑱

blurry scene

𝑱 ∗ 𝒍

  • bserved image

𝑪 = 𝑱 ∗ 𝒍 + 𝒐

convolution with a blur kernel 𝒍 adding camera noise 𝒐

slide-19
SLIDE 19

19

Blur removal problem

=

deconvolution:

𝑪 = ? ∗ 𝒍 + 𝒐

blind deconvolution:

𝑪 = ? ∗ ? + 𝒐

a motion blur kernel

identity (Dirac) kernel

a defocus blur kernel

slide-20
SLIDE 20

20

Blind deconvolution for QR scanning?

Existing blind deconvolution algorithms

  • are slow even on PC
  • are tuned to natural images
  • usually fail on QR codes (structure very different!)
  • utputs of some previous methods

input

slide-21
SLIDE 21
  • blur can be estimated from the many QR edges
  • but we need to suppress the small structures
  • QR codes do not need to look good for decoding
  • in contrast to photographs, where restoration quality counts,
  • ur main concern is speed
  • QR codes include error correction / checksum
  • the algorithm can stop when the checksum is correct
  • false decoding is practically impossible
  • only partially restored codes might be decoded too

21

Observations for deblurring QR codes

slide-22
SLIDE 22

22

Restoration-recognition loop

We follow a common recipe for blind deconvolution:

  • alternate between solving for I and solving for k
  • suppress noise and boost edges: enforce QR properties
  • try to decode at every iteration
  • repeat on several scales

argmin

𝐽,𝑙

𝐶 − 𝑙 ∗ 𝐽 + λ𝐽𝑞𝐽 𝐽 + λ𝑙𝑞𝑙(𝑙) I

QR QR

I’ k

Blind deconvolution via energy minimization

B

slide-23
SLIDE 23

23

experiments (synthetic blur)

quality is on par with the state of the art, and a magnitude faster

[Cho2009] 0.48s [Sun2013] 217.73s [Xu2010] 0.96s [Xu2013] 1.05s (GPU) input [Perrone2014] 171.90s [Pan2014] 12.74s

  • urs

0.61s ground truth [Pan2013] 133.8s

slide-24
SLIDE 24

24

experiments (real blur)

1.69s 2.82s 18.62s 12.52s 14.65s 14.37s

slide-25
SLIDE 25

25

Live deblurring on a smartphone

input estimated image estimated kernel camera view search window

slide-26
SLIDE 26

additional clues:

  • the blur is ’encoded’ in the image of point light sources
  • wearables have inertial sensors
  • rotational motion blur is dominant – use gyroscopes
  • reconstruct the camera motion, render the blur kernel

26

Can we make it even faster?

virtual point light source virtual camera

slide-27
SLIDE 27

27

Rendering blur kernels for initialization

captured frame generated kernels deblurred frame

Rotational blur depends on the position in the image

slide-28
SLIDE 28

28

Patch-wise restoration

We can initialize the restoration loop with the rendered kernels

slide-29
SLIDE 29

Fast and robust blur removal allows:

  • scanning in low lighting
  • scanning moving codes
  • and tiny or distant codes

(super resolution)

29

slide-30
SLIDE 30

30

Contributions

Fast and robust localization

  • f visual tags

MUM’13, ICASSP’14

Part I

Fast and robust blur compensation for scanners

WSCG’15, ISWC’15

Part II

Fast and robust gesture control for wearables

BSN’14, UIST’14, CHI’15

Part III

slide-31
SLIDE 31

31

Codes for interaction with smart objects

[Rohs 2005] [Heun 2013a] [Ballagas 2006] [Heun 2013b] [Mayer 2012] [Chan 2015] [Mayer 2014]

slide-32
SLIDE 32

32

Outsourcing user interfaces

[fitbit activity tracker]

The smartphone is becoming a universal interaction device.

[LIFX light bulb] [Nespresso coffee machine]

volume How about other wearables?

[Wahoo cycling sensor]

slide-33
SLIDE 33

33

Joint work with Simon Mayer

cross-device automatic GUI generation: user interface beaming

Outsourcing user interfaces

slide-34
SLIDE 34

34

Gesture recognition on wearables

Joint work with Jie Song, Fabrizio Pece, Otmar Hilliges

slide-35
SLIDE 35

35

Live gesture recognition on mobile devices

slide-36
SLIDE 36

input segmentation labeled output

Gesture classification as pixel labeling

36

slide-37
SLIDE 37

...

F0

< Г0 > Г0

F1 F2 F3 F4 F5 F6

< Г2 < Г1 > Г1 > Г2

v w v w

... F0(w,v): F2(w,v):

Pixel labeling with a decision tree

37

slide-38
SLIDE 38
  • 1. pooling over trees:

this pixel is ’red’

T1 T2 T3

  • 2. pooling over all pixels:

this gesture is ’red’

Pixel labeling with a decision forest

38

slide-39
SLIDE 39

close middle far Coarse Depth Classification Shape Classification pinch point splayed palm Part Classification

Pixel labeling with multi-stage decision forests

39

slide-40
SLIDE 40

close middle far Coarse Depth Classification Shape Classification pinch point splayed palm

Enabling 3D interaction

40

Depth Regression 145 mm 323 mm 211 mm 282 mm

slide-41
SLIDE 41

41

Gestures + depth for 3D interaction

slide-42
SLIDE 42

42

Fast and robust gesture recognition allows:

  • natural input to wearables
  • easy control for scanners
  • universal interaction with smart objects

(through user interface outsourcing)

slide-43
SLIDE 43

In the world of binary images, generally very difficult computer vision problems like … … can have fast and robust solutions even on resource-constrained wearable devices.

43

Conclusions

  • bject segmentation

image restoration shape classification

slide-44
SLIDE 44

Our solutions

  • are pushing forward the state of the art in terms of

accuracy, robustness, and speed

  • can help to make wearable barcode scanning a

promising alternative to traditional barcode scanning

  • will potentially make wearables the essential tools for

bridging the gap between the physical and the digital world.

44

Conclusions

slide-45
SLIDE 45

45

Thank you!