Next Generation Data Discovery Fusing Structured and Unstructured - - PowerPoint PPT Presentation

next generation data discovery
SMART_READER_LITE
LIVE PREVIEW

Next Generation Data Discovery Fusing Structured and Unstructured - - PowerPoint PPT Presentation

Next Generation Data Discovery Fusing Structured and Unstructured Content from Multiple Repositories Chris Meredith UDOT Dan Quinn PTFS Questions Shared Drives Vision: Enterprise-wide Data Discovery Information Transparency


slide-1
SLIDE 1

Next Generation Data Discovery

Fusing Structured and Unstructured Content from Multiple Repositories Chris Meredith – UDOT Dan Quinn – PTFS

slide-2
SLIDE 2

Questions

Shared Drives

slide-3
SLIDE 3

Vision: Enterprise-wide Data Discovery

  • Information Transparency
  • UDOT information should be discoverable to the entire department and

its partners

  • Sometimes, information is desired but the question is unknown
slide-4
SLIDE 4

What Have We Done?

  • You don’t need to know where the information resides
  • You don’t need to know what information exists

R2 Shared Drive

slide-5
SLIDE 5

What Stays the Same?

The Index... So you can more easily use the information already provided by the department without duplication!

  • References data from the source system. Nothing is copied!
  • Utilizes source system credentials for document access.

○ If you’re required to log into the source system, the index does not provide a way around that.

  • Allows data owners to remain data owners and continue to collect and

maintain their information.

slide-6
SLIDE 6

Basic Knowvation operation

slide-7
SLIDE 7

Basic Search screen has several options

slide-8
SLIDE 8

Knowvation supports a browse hierarchy

slide-9
SLIDE 9

Browse enables navigating a folder structure

slide-10
SLIDE 10

Knowvation can support optimal UDOT hierarchy

slide-11
SLIDE 11

There are many search options from this interface

slide-12
SLIDE 12

The full text search box can start easy searches

slide-13
SLIDE 13

Items presented as dots on base map

slide-14
SLIDE 14

Various layers can be easily turned on/off

slide-15
SLIDE 15

Now mileposts are off

slide-16
SLIDE 16

Base map options can be switched with a click

slide-17
SLIDE 17

Select Open Street Map

slide-18
SLIDE 18

Open Street Map used to start search, display data

slide-19
SLIDE 19

A geospatial search is a common starting point

slide-20
SLIDE 20

The search can now be limited to Route 48

slide-21
SLIDE 21

And further limited with a PIN

slide-22
SLIDE 22

And further limited with a Document Type

slide-23
SLIDE 23

Data can be presented in different views: Grid View

slide-24
SLIDE 24

Data can be presented in different views: List View

slide-25
SLIDE 25

Data can be presented in different views: Thumbnail View

slide-26
SLIDE 26

An Esri widget enables Knowvation searches in ArcGIS

slide-27
SLIDE 27

An Esri widget enables Knowvation searches in ArcGIS

slide-28
SLIDE 28

A drop down makes selecting target route easy

slide-29
SLIDE 29

A Search simply on route returns 4,282,556 files

slide-30
SLIDE 30

Adding full text search on “ramp” narrows to 146,763 filesPIN narrows it down to five files

slide-31
SLIDE 31

Adding PIN 10711 narrows list to five files

slide-32
SLIDE 32

Selecting “all” brings back all records = 685 records

slide-33
SLIDE 33

Pattern Search is a fuzzy text search

slide-34
SLIDE 34

Correct/incorrect spellings are highlighted.

slide-35
SLIDE 35
slide-36
SLIDE 36
slide-37
SLIDE 37
slide-38
SLIDE 38

What Else?

  • Improving data attributes/metadata improves searchability
  • Aligning data standards across the Department
  • The index reflects how well data governance functions within the department.

With data governance improvements, the index improves.

slide-39
SLIDE 39

Moving Forward

  • Findability Study using machine learning to help make documents more

findable

  • Power user testing – Region Designers
  • Training
  • Incorporate additional data sources
slide-40
SLIDE 40

Chris Meredith

Utah Department of Transportation Central Right of Way GIS Administrator cmeredith@utah.gov

Dan Quinn

PTFS VP, Sales & Marketing dquinn@ptfs.gov

slide-41
SLIDE 41
slide-42
SLIDE 42

How Can You Do That?

You can search by… Location on a map

slide-43
SLIDE 43

How Can You Do That?

You can search by… Address Route and Milepost Source system Project information (PIN)

  • PIN
  • Route
  • Name
slide-44
SLIDE 44

How Can You Do That?

You can search by… Metadata categories

The picture can't be displayed.
slide-45
SLIDE 45

How Can You Do That?

You can search by… Full text across metadata and text in files using Boolean, Exact, Concept and Pattern search techniques