Justin Meyer Mentor: Nick Larusso Faculty Advisor: Dr. Ambuj Singh - - PowerPoint PPT Presentation

justin meyer mentor nick larusso faculty advisor dr ambuj
SMART_READER_LITE
LIVE PREVIEW

Justin Meyer Mentor: Nick Larusso Faculty Advisor: Dr. Ambuj Singh - - PowerPoint PPT Presentation

Justin Meyer Mentor: Nick Larusso Faculty Advisor: Dr. Ambuj Singh Santa Barbara City College Major: Electrical Engineering Funding: N.S.F. http://www.crustal.ucsb.edu/images/ban http://www.sbcc.edu/marketing/inde


slide-1
SLIDE 1

Justin Meyer Mentor: Nick Larusso Faculty Advisor: Dr. Ambuj Singh Santa Barbara City College Major: Electrical Engineering Funding: N.S.F.

http://www.crustal.ucsb.edu/images/ban ner‐logo‐ucsb.png http://www.sbcc.edu/marketing/inde p?sec=1331

http://www.ccmr.cornell.edu/images/ logos/logo‐NSF‐CMYK.GIF

slide-2
SLIDE 2

Applications of querying uncertain data

Alzheimer's microtubule length measurements Bio‐imaging

Faster access to data

Larger dataset better representation of reality Selection and range queries

slide-3
SLIDE 3

Bio‐imaging

Dyeing Techniques are imperfect

▪ Dyeing multiple images ▪ Hand Dyeing is intensive

3D compression into a 2D image Confocal Microscope

Sensory data (tracking systems)

Update times

slide-4
SLIDE 4

Microtubule In neuron

http://www.bioimage.ucsb.edu/component/content/a rticle/53‐frontpage‐highlights/116‐bisquewebapps

slide-5
SLIDE 5

Photo Courtesy Dr. Ambuj Singh Horizontal Cells

slide-6
SLIDE 6

Efficiently query with use of an index structure

Without indexing, linear scan required Indexing is more scalable as datasets grow

Problem with indexing due to uncertainties

within data

Produce results for range and selection queries faster

slide-7
SLIDE 7

O(n)

slide-8
SLIDE 8

Microtubule Length

t1 25 t2 50 t3 75 t4 100 t5 15 t6 85 t7 90 t8 60

50 50 85 85 15 15 25 25 60 60 75 75 90 90 100 100

Eliminate on the order of half the possibilities with each decision

slide-9
SLIDE 9

Microtubule Length

t1 (25 , 0.8) (50, 0.2) t2 (50, 0.6) (60, 0.4) t3 (75, 0.5) (90, 0.5) t4 (100, 0.7) (85, 0.3) t5 (15, 0.6) (25, 0.4) t6 (85, 0.7) (100, 0.3) t7 (90, 0.9) (75, 0.1) t8 (60, 0.7) (15, 0.3)

50 t 2:0.6 t 1:0.2 50 t 2:0.6 t 1:0.2 85 t 6:0.7 t 4:0.3 85 t 6:0.7 t 4:0.3 15 t 8:0.3 t 5:0.3 15 t 8:0.3 t 5:0.3 25 t 1:0.8 t 5:0.4 25 t 1:0.8 t 5:0.4 60 t 8:0.6 t 2:0.4 60 t 8:0.6 t 2:0.4 75 t 3:0.5 t 7:0.1 75 t 3:0.5 t 7:0.1 90 t 7:0.9 t 3:0.5 90 t 7:0.9 t 3:0.5 100 t 4:0.7 t 6:0.3 100 t 4:0.7 t 6:0.3

Tuple values are ordered Range Queries do not look through all tuples

Range Query: 50 <= lengths <= 75

  • prob. > 0.5

Return: t2 and t8

slide-10
SLIDE 10

The actual results confirming the

hypothesized results

The structures cost is better than the linear scan

▪ Especially for large datasets

Future work

Appling to all areas of uncertain data

▪ Sensory data

Plan to compare with other uncertain indexing

techniques

slide-11
SLIDE 11

?

Justin Meyer Email: jmeyeroct22@gmail.com

Acknowledgements:

INSET CNSI

  • Dr. Ambuj Singh

Nick Larusso NSF

slide-12
SLIDE 12

http://www.ljosa.com/~ljosa/publications/ljosa_icdm_2006.pdf

slide-13
SLIDE 13

Only the values above the line a represented as a “average” value This is how most databases handle uncertainty

Thresholding Line