Data Integration of Legacy ERP System based on Ontology Learning - - PowerPoint PPT Presentation

data integration of legacy erp system based on ontology
SMART_READER_LITE
LIVE PREVIEW

Data Integration of Legacy ERP System based on Ontology Learning - - PowerPoint PPT Presentation

Data Integration of Legacy ERP System based on Ontology Learning from SQL Scripts Chuangtao Ma machuangtao@caesar.elte.hu Department of Information Systems, Etvs Lornd University This PhD project was guided by Dr. Molnr Blint Outline


slide-1
SLIDE 1

Data Integration of Legacy ERP System based

  • n Ontology Learning from SQL Scripts

Department of Information Systems, Eötvös Loránd University

Chuangtao Ma

machuangtao@caesar.elte.hu

This PhD project was guided by Dr. Molnár Bálint

slide-2
SLIDE 2

Outline

  • Introduction & Motivation
  • Research Question Statement
  • Related Work
  • Proposed Solutions & Plan
  • Conclusion & Future Work
slide-3
SLIDE 3

Introduction & Motivation

  • C++, Java, Php, etc .
  • Delphi, IBM AS/400,

Visual Studio, etc.

Diversity of Code Language & platform

  • Visual FoxPro, Access,

SQL Server, Oracle, MySQL, etc.

Various DBMS

  • Obsolete hardware.
  • Poor modularity.

Outdated Technology

  • Character-based user

interface.

  • Unavailability of the data-

access interface.

Unfriendly user Interface

Legacy ERP system is a kind of enterprise management systems, were developed in several decades ago, that is no longer being enhanced.

slide-4
SLIDE 4

Introduction & Motivation

01

  • Build the centralized data warehousing.
  • Integration of the heterogenous data from

existing system.

Data mining & analysis

02

  • Redesign and improve the business process

and architecture of the ERP.

  • Improve data dissemination and decision

making based on advanced technology.

Business Process Reengineering(BPR)

An increasing number

  • f

the enterprise decide to build the BI system for responding the dynamic business environment.

slide-5
SLIDE 5

Introduction & Motivation

Replace all of the existing legacy ERP system Modernize and Integrate the existing legacy ERP system

Pros

  • Advanced technologies and system.
  • Unified and centralized data center .

Cons

  • Invest more budget and time.
  • Potential risk for upgrading.

Pros

  • Save the costs, effort, and time.
  • Reduce the risk of the project.

Cons

  • Limited performance of the BI system.
  • Put more effort to integrate legacy system.
slide-6
SLIDE 6

Research Question Statement

Q2: Data Integration Q1: Unified description

  • f Business Process

How to achieve the unified description of business processes (BP) and efficient integration among different sub-organizations? How to access and integrate the data from the various DBMS of legacy ERP systems efficiently?

Q3: Result Evaluation

How to check the consistency of the ontologies and evaluate the correctness of the integration results?

  • It is a challengeable task to integrate the legacy ERP system efficiently

and effectively, since the diversity of code language, various DBMS and unavailability of APIs.

slide-7
SLIDE 7

Related Work

 Data access technology of legacy system

  • However, ontology-based data access from distributed data-

source still requires the data access interface to be available.

  • Pérez-Castillo, et al,

(2011) propose KDM to represent the artifacts

  • f

legacy systems as entities, relationships and attributes.

Knowledge Discovery meta-Model (KDM)

  • Millham, R, et

al (2009) employed the COBOL to access the data from the relational database

  • f legacy systems.

Common Business- Oriented Language(COBOL)

  • Calvanese, D,

et al (2017) introduced the OBDA technology to extract the log data from legacy information.

Ontology-based Data Access(OBDA)

slide-8
SLIDE 8

Related Work

 Ontology learning & Knowledge Extraction

Extraction algorithm based on RDF (Resource Description Framework) were designed to extract the knowledge from legacy systems.

Extraction algorithm based on Common RDF Model

2009 A dynamic knowledge extraction approach based on process mining and sequential pattern mining are proposed respectively .

Process Mining & Sequential Pattern Mining

2011 Semi-automated generation ontology approach from existing textual documents based on ontology learning is proposed .

Ontology Learning

2014

  • However, the knowledge extraction based on ontology learning is

still in the early phase to be explored.

slide-9
SLIDE 9

Related Work

 Semantic data integration

A semantic integration approach exploiting linked data are presented to achieve RDF data integration based on query rewriting. Linked data based semantic data integration The heterogeneous data was integrated by the semantic mapping of the ontologies.

Ontology-based semantic integration (OBA-SI)

The traditional data integration approaches, including , rule-based, middleware framework, and so forth. Data integration

  • For the ontology-based semantic integration, the efficiency of the

semantic integration is limited by the efficiency and quality of the constructed ontology.

slide-10
SLIDE 10

Proposed Solutions & Plan

  • This PhD project focus on the integration of legacy ERP system based
  • n ontology learning framework.

Business Process Integration Data Integration

Efficient and effectiveness Integration of the legacy ERP systems

  • Unified description of the business

process.

  • Integration of the business process.
  • Ontology learning model from SQL

scripts.

  • Semantic data integration based on
  • ntology.
slide-11
SLIDE 11

Proposed Solutions & Plan

Legacy ERP System Sub-ogranization 1 Legacy ERP System Sub-ogranizaiton 2 Integrated ERP System Legacy ERP system Sub-ogranizaiton i Entity 1

PK_ID1

PK

FK_BID2

Entity 2

PK_ID2

PK

FK_BID2

Entity i

PK_ID

PK

FK_BID

Business Process 1 Business Process 2 Business Process i Database 2 Database i Database 1 Integrated Legacy ERP System Database Integrated Business Process

i i

Business Process level Data Integration level System Integration level

slide-12
SLIDE 12

Proposed Solutions & Plan

 Unified description & integration of business process

  • The description logic language will be used to achieve the unified

description and representation of the business process.

  • The ontology alignment technology will be adopted to achieve

the integration of the business process.

  • 1. Jung, J. J. (2009) ‘Semantic business process integration based on ontology alignment’, Expert Systems with Applications. Elsevier Ltd, 36(8), pp.

11013–11020. doi: 10.1016/j.eswa.2009.02.086.

slide-13
SLIDE 13

Proposed Solutions & Plan

 Data integration based on ontology learning

  • The input of the ontology learning model is SQL scripts document

and the

  • utput is the corresponding ontologies and knowledge

graph.

Data Access Import Scripts Documents Generate OWL Evaluate Ontology Quality Store in Graph DB Store as RDF Triples Extract Relationship Extract Terms Pre- Processing LS1 Database LS2 Database LS i Database LS n Database SQL Scripts 1 SQL Scripts 2 SQL Scripts i SQL Scripts n Export from DBMS Export from DBMS Knowledge Extraction & Ontology Generation

slide-14
SLIDE 14

Proposed Solutions & Plan

 Data integration based on ontology learning

  • Ontology-based semantic data integration.
  • The heterogenous data will be integrated based on the

interoperability of the ontologies and knowledge graph, the specific demo of data integration is depicted.

Order

Order_id PK Customer_id FK Good_id Tran_Num Tran_state Price

Order

Order_id PK User_id FK Product_id Number Order_state Price Good

Service Product

Is_Sub_of Is_Sub_of

User

Customer Administrator Vendors

Is_Sub_of Is_Sub_of Is_Sub_of Is_Purchased_by

Number

Num

Is_Abbr_of Is_Property_of

Transaction

Order Tran

Is_Abbr_of Is_Synonym_with Is_Property_of Place_of Is_Purchased_by Is_Property_of

slide-15
SLIDE 15

Proposed Solutions & Plan

Legacy ERP System 1 Legacy ERP System2 O11,O12,O13 O1m Entity 1

PK_ID1

PK

FK_BID1

Integrated Entity

PK_ID

PK

FK_BID

Entity 2

PK_ID2

PK

FK_BID2

Integrated database O21,O22,O23 O2n Ontology term extraction Ontology term extraction Ontology matching Integrated ERP system Ontology Learning

  • Check the consistency of

the ontologies generated by ontology learning from SQL scripts.

  • Evaluate

the semantic accuracy

  • f

the data integration.

 Legacy ERP system integration

  • Evaluate the accuracy of integration result and integrate legacy

ERP system for achieving the centralized management and decision.

slide-16
SLIDE 16

Conclusion & Future Work

In this project, the architecture of the ontology learning framework was proposed to integrate heterogenous data from various legacy ERP systems efficiently.

1

The approach for generating ontologies by ontology learning from the relational database SQL scripts is proposed, and the specified steps of the ontology learning are illustrated.

2

slide-17
SLIDE 17

Conclusion & Future Work

  • This project is in its initial exploration phase, there are some works that

should be investigated and conducted in the future.

Knowledge extraction algorithm from SQL scripts

1

  • The knowledge extraction algorithm based on NLP will be designed

to extract the knowledge from SQL scripts.

Ontology generation approaches from RDF schema

2

  • Ontology generation approaches from RDF schema will be studied to

generate the ontology for the integration of heterogeneous data.

Design the tools to support data integration

3

  • The data integration tool based on ontology learning from SQL scripts

will be designed and developed to support the data integration.

slide-18
SLIDE 18

I would welcome any question and suggestion.