Improving Manufacturing Plants Through Big Data Analytics SDS2019 - - PowerPoint PPT Presentation

▶

Nov 22, 2023 232 likes •538 views

Improving Manufacturing Plants Through Big Data Analytics SDS2019 14 June 2019 Prof. Dr. Kurt Stockinger Martin Weber Marc Schni Agenda Use Case Goals Architecture Blueprint Experiment Conclusions Evaluation

SLIDE 1

Improving Manufacturing Plants Through Big Data Analytics

SDS2019 14 June 2019

Prof. Dr. Kurt Stockinger

Martin Weber Marc Schöni

SLIDE 2

Agenda

Use Case
Goals
Architecture Blueprint
Experiment
Conclusions
Evaluation
Q&A / Discussion

SLIDE 3

Agenda

Use Case
Goals
Solution Architecture
Experiment
Conclusions
Evaluation
Q&A / Discussion

SLIDE 4

About Midor

Founded in 1928
Located in Meilen ZH
600 Employees
Part of M-Industry
Produce 250’000 items daily for Migros und others
32 production lines, 940 different products (different biscuits,

ice cream, snacks, dessert powder, etc.)

SLIDE 5

Introduction

Production line 16 produces the Blévita, one of Midors sales hits
Short production stops are reducing the output
What causes these disturbances?
Can they be predicted?

SLIDE 6

Line 16

Batter mixing Oven Slots 1-9 Packaging & Labelling

SLIDE 7

Agenda

Use Case
Goals
Solution Architecture
Experiment
Conclusions
Evaluation
Q&A / Discussion

SLIDE 8

Goals

Qualitative Goals

1. Improve efficiency
2. Flexible and scalable

architecture

3. Allows processing of various

data formats Quantitative Goals

1. Find the most relevant

features causing the disturbances

2. Latency for Inference of

< 5 seconds

3. Cost should be at worst

proportional to amount of processed data

SLIDE 9

Agenda

Use Case
Goals
Solution Architecture
Experiment
Conclusions
Evaluation
Q&A / Discussion

SLIDE 10

System Context

SLIDE 11

Conceptual Architecture

SLIDE 12

Solution Architecture

SLIDE 13

Solution Architecture

SLIDE 14

Solution Architecture

SLIDE 15

Solution Architecture

A X P P

SLIDE 16

Agenda

Use Case
Goals
Solution Architecture
Experiment
Conclusions
Evaluation
Q&A / Discussion

SLIDE 17

Objective

Over a period of ~1 year data about short production stoppages

was collected (→ Label)

Over the same period additional data about orders,

climate conditions etc. were captured (→ Features)

Is it possible to find a pattern in these datasets regarding the
ccurrence of short stops?

SLIDE 18

Big Data Analytics Solution

„System under Design“

FAcility Management System (FAMS)

3rd party system

Climate System

3rd party system

Manufacturing Execution System (MES)

3rd party system

Gateway Production line

3rd party system

BaroHygro.XML FAMS.XLSX Orders.XML Signal-Stream

x1, x2, … xn y

Amperes oven: 32A

Rel. Humidity: 54%

Product: Blévita Gruyère Short stop: Nein

Building the dataset

SLIDE 19

Splitting the dataset

Unbalanced dataset for modelling (90%) unbalanced verification dataset (10%) Balanced dataset unbalanced dataset (100%) 10x Cross-Validation

Training Model

Train Test no stops stops

SLIDE 20

Modelquality

Accuracy Precision Recall F1 Score AUC Random Forest .852 .818 .897 .856 .911 Gradient Boosted Tree .847 .814 .892 .852 .909 Logistic Regression .629 .616 .644 .630 .677 Support Vector Machine .612 .692 .588 .636 .659 Naïve Bayes .572 .680 .552 .610 .602

SLIDE 21

Feature Reduction

Model quality metrics (see legend) Comparison of model quality metrics depending on number of features

number of features

SLIDE 22

Important Features

SLIDE 23

Agenda

Use Case
Goals
Solution Architecture
Experiment
Conclusions
Evaluation
Q&A / Discussion

SLIDE 24

Findings: Machine Learning

We can predict shorts stops with a F1-Score of about 85%
The integration of different data sources took the most time
It is not possible to a priori estimate the importance of

predictors/features per data source → Integrate all data sources

The prediction itself does not provide a business value without

additional steps (operationalization)

SLIDE 25

Conclusion: Data import

Data source 1 Target Dataset (combination) Data source 2 Data source 3 Events

? ? ?

Time

SLIDE 26

Conclusions: Architecture

Findings from modelling provide new boundary conditions for the

big data architecture:

Number and kind of data sources
Amount of data
Requirements for inference service (Compute, Memory)
Principles of Lambda-Architecture have proven their effectiveness
Benefits of Kapa architecture (single code base) using libraries
Tools for ML-Pipeline export & operationalization are in early stages
Monitoring of data quality is a crucial success factor

SLIDE 27

Agenda

Use Case
Goals
Solution Architecture
Experiment
Conclusions
Evaluation
Q&A / Discussion

SLIDE 28

Evaluation

Quality Goals

1. Improve efficiency
2. Flexible and scalable

architecture

3. Allows processing of various

data formats Quantity Goals

1. Find the most relevant

Features causing the disturbances

2. Latency for Inference of

< 5 seconds

3. Cost should be at worst

proportional to amount of processed data

SLIDE 29

Agenda

Use Case
Goals
Solution Architecture
Experiment
Conclusions
Evaluation
Q&A / Discussion