Integrating multi-dimensional information spaces Kostas Saidis, - - PowerPoint PPT Presentation

integrating multi dimensional information spaces
SMART_READER_LITE
LIVE PREVIEW

Integrating multi-dimensional information spaces Kostas Saidis, - - PowerPoint PPT Presentation

Integrating multi-dimensional information spaces Kostas Saidis, Alex Delis {saiko,ad}@di.uoa.gr University of Athens 2 Oct. 2009, Corfu, Greece 2 nd Workshop on Very Large Digital Libraries (VLDL 2009) In conjunction with ECDL 2009 Size does


slide-1
SLIDE 1

Integrating multi-dimensional information spaces

Kostas Saidis, Alex Delis

{saiko,ad}@di.uoa.gr

University of Athens

2 Oct. 2009, Corfu, Greece

2nd Workshop on Very Large Digital Libraries (VLDL 2009)

In conjunction with ECDL 2009

slide-2
SLIDE 2

VLDL 2009 2

Size does not matter (1)

 We view Very Large DLs as systems that

manage not only “large” but also “complex” information spaces

 Diverse, multi-faceted content items:

 digitized and/or born digital intellectual works,

institutional and/or personal archives, scholarly information, user-generated content

 Heterogeneous content sources:

 databases, XML repositories

 Plethora of applications, services, use-cases

slide-3
SLIDE 3

VLDL 2009 3

Our discussion

 Users need to share, reuse, refine and extend

information in varying application contexts

 Can we supply VLDLs and related systems with

a unified information space management infrastructure?

 Can this infrastructure add value by simplifying

– and automating as highly as possible – the integration of diversely structured and heterogeneous information spaces?

slide-4
SLIDE 4

VLDL 2009 4

Diverse views of information

 Different systems develop different views of

digital content for different purposes.

Physical/Storage View of Digital Content Conceptual View of Digital Content Servicing View of Digital Content XML, datastreams, databases, etc articles, books, dissertations, etc Web pages, GUIs, etc

slide-5
SLIDE 5

VLDL 2009 5

Multi-dimensional Information Space Management

 Systems manage information in multiple

dimensions, supporting diverse:

 Information identification & discovery options  Information access options  Information conceptualization options  Information utilization options

slide-6
SLIDE 6

VLDL 2009 6

Integration as a process (roughly)

1.Discovery: systems “learn about” the existence

  • f each other

2.Identification: systems unambiguously identify their individual items 3.Access: systems access their items 4.Utilization: systems synthesize their items

slide-7
SLIDE 7

VLDL 2009 7

Integration imposes extensions

 Realizing these steps requires dealing with a

variety of information discovery, access, conceptualization and utilization options supported by involved systems

 Thus, when integrating information spaces, we

practically need to extend involved systems in multiple crosscut and interdependent options

 Hard, cost-consuming, may require source-code

modifications and/or system redesign

slide-8
SLIDE 8

VLDL 2009 8

Integration requires automation

Information integration/interoperation is about “enabling information that originates in one context to be used in another in ways that are as highly automated as possible”

[The DOI Handbook, Edition 4.4.1, The International DOI Foundation]

slide-9
SLIDE 9

VLDL 2009 9

Time-out

zzzzzzzzzzzzz COFFEE BREAK NOW! I CORFU

slide-10
SLIDE 10

VLDL 2009 10

Our point

 If we simplify the process of extending systems'

multi-dimensional information management

  • ptions

 We simplify the process of integrating their

information spaces Simplify ~ automate as highly as possible

slide-11
SLIDE 11

VLDL 2009 11

WWW: the largest interoperable information space

 Automates information identification and

access (HTTP & URIs)

 Yet:

 No built-in information discovery service (google)  A single “document-based” conceptualization  Information utilization follows a limited

“publish/consume” paradigm

 Technologies such as Web Services & Semantic

Web enhance limited information discovery, conceptualization and utilization options

slide-12
SLIDE 12

VLDL 2009 12

Size does not matter (2)

 Information integration/interoperation:

 plays a crucial role in smaller-scale information

spaces, too:

 Digital libraries  Business & Enterprise Environments  Proprietary & Legacy systems  etc

 is dominated by the information management

  • ptions supported by involved systems
slide-13
SLIDE 13

VLDL 2009 13 

slide-14
SLIDE 14

VLDL 2009 14 

Traditional Digital Library System

slide-15
SLIDE 15

VLDL 2009 15 

Metadata Harvesting Application

slide-16
SLIDE 16

VLDL 2009 16 

Application Independent Unified DL Infrastructure

slide-17
SLIDE 17

VLDL 2009 17

Infrastructure design (1)

 Content Source API:

 allow systems to operate atop multiple

heterogeneous sources

 register new sources dynamically  use a driver-based technique

 Content Access/Update API:

 read/modify actions that apply to any underlying

content source

slide-18
SLIDE 18

VLDL 2009 18

Infrastructure design (2)

 Content Conceptualization API:

 support storage-independent, dynamic

conceptualizations

 Employ an inheritance mechanism to enable

refinement / extension of content items

 Content Discovery API:

 Provide 3 indexing/discovery facilities, for sharing:

 Content items,  Content sources,  Content conceptualizations

slide-19
SLIDE 19

VLDL 2009 19

</presentation>

COFFEE BREAK NOW!