A Modified Fuzzy Min-Max Neural Network and Its Application to Fault - PowerPoint PPT Presentation

A Modified Fuzzy Min-Max Neural Network and Its Application to Fault Classification Anas M. Quteishat and Chee Peng Lim School of Electrical & Electronic Engineering University of Science Malaysia

Abstract The objectives of this paper are: � To improve the Fuzzy Min-Max (FMM) classification performance in situations when large hyperboxes are formed by the network. � The Euclidean distance is computed after network training, and both the membership value of the hyperbox fuzzy sets and the Euclidean distance are used for classification . � To assess the effectiveness of the modified FMM network. � Benchmark pattern classification problems are first used, and the results from different methods are compared. In addition, a fault classification problem with real sensor measurements collected from a power generation plant is used to evaluate the applicability of the modified FMM network.

Introduction � There are two types of learning classifiers: supervised and unsupervised learning, which differ in the way they are trained. � The FMM neural network is a pattern classification system that can be used for tackling clustering (unsupervised) or classification (supervised) problems. � In this paper FMM is used as a supervised classification system.

Introduction (contd.) � FMM is constructed using hyperbox fuzzy sets, each of which is an n -dimensional box defined by a set of minimum and maximum points. � Each input pattern is classified based on the degree of membership to the corresponding boxes. � A smaller hyperbox size means that the hyperbox can contain only a smaller number of patterns, which will increase the network complexity, but yet gives high accuracy. � A larger hyperbox size means that the hyperbox can contain a larger number of patterns, and will decrease the network complexity, but this leads to low classification accuracy.

FMM Neural Network � The FMM classification network is formed using hyperbox fuzzy sets. A hyperbox defines a region of the n -dimensional pattern space that has patterns with full class membership. � The hyperbox is completely defined by its minimum and maximum points. max point min point Fig. 1. A min-max hyperbox

� The definition of each hyperbox fuzzy set B j is { ( ) } = ∀ X ∈ n B X , V , W , f X , V , W (1) I j j j j j Where, X is the input, I n is a unit cube with n dimensions and V j and W j are the min and max points, respectively � The size of the hyperboxes θ can have a value between 0 and 1. A small value of θ will produce small-size hyperboxes, and vice versa.

� The input patterns are classified depending on how much they are “contained” by a hyperbox, this is measured using the membership value of a pattern using: ( ( ( ) ) ) n 1 ∑ = − γ − b ( A ) [max 0 , 1 max 0 , min 1 , a w j h hi ji 2 n ( ( ( ) ) ) ] = i 1 + − γ − (2) max 0 , 1 max 0 , min 1 , v a ji hi ( ) ( ) where, is the h th input pattern, = ∈ n = A a , a ,..., a I V v , v ,..., v ( ) h h 1 h 2 hn j j 1 j 2 jn is the min point for B j , is the max point for B j , and γ is = W w , w ,..., w j j 1 j 2 jn the sensitivity parameter.

Modifications of the FMM Network � The modification done on the FMM neural network was done on the prediction stage. The original learning FMM procedure was not touched. � In this paper we propose two methods for prediction using: 1. Euclidean distance 2. Both Euclidean distance and membership value

1) Prediction using the Euclidean distance � This prediction method is based on the Euclidean distance between the input pattern and the centroid of the hyperbox. � In addition to the min and max points, the centroid of patterns falling in each hyperbox is computed, as follows: − a C hi ji = + (3) C C ji ji N j where C ji is the centroid of the j th hyperbox in the i th dimension, and N j is the number of patterns included in the j th hyperbox.

� The Euclidean distance between the centroid and the input pattern is calculated using: ( ) n ∑ = − (4) 2 E C a jh ji hi = i 1 Where, E jh is the Euclidean distance between j th hyperbox and the h th input pattern. � In the classification process, the hyperbox with the smallest Euclidean distance to the input pattern is selected as the winner, and the pattern is so classified that it belongs to this hyperbox.

� Figure 2 shows the classification process of a two- dimensional input pattern using the described method. Centroid of w 1 w 2 hyperbox 1 C 1 Centroid of E 1 hyperbox 2 C 2 E 2 Input v 2 Class 2 v 1 Class 1 Fig. 2. The classification process of an input pattern using the Euclidean distance

2) Prediction using both the membership function and Euclidean distance � When θ is large, hyperbox sizes are large, as consequent when calculating the membership value more than one hyperbox will have very large membership value sometimes even unity value. � To solve this problem we propose to use both the membership value and Euclidean distance for classification. � The hyperboxes with the highest membership value are selected and then the Euclidean distance between the centroid of these boxes and the input pattern is calculated. The hyperbox with the smallest distance is used to classify the input pattern.

Experiments and results: The proposed methods were tested using 4 data sets: A) Bench mark data sets: 1) PID data set 2) IRIS data set B) Fault diagnosis data sets: 1) The heat transfer conditions. 2) The tube blockage conditions.

A) Bench mark data sets: 1) PID data set � Pima Indian Diabetes (PID) data set consists of 768 samples in a two class problem. The set was divided into two sets, 75% for training and 25% for testing. The experiment was done 5 times and the average of the results is shown in Figure 3. � A comparison between the results obtained by our method and other methods based on the same experimental criteria is as shown in Table 1.

100 90 Testing accuracy (%) A 80 B 70 60 C 50 40 30 1 3 5 7 9 5 5 5 5 5 5 5 5 5 0 0 0 0 0 1 2 3 4 5 6 7 8 9 . . . . . . . . . . . . . . 0 0 0 0 0 0 0 0 0 0 0 0 0 0 theta Fig. 3. The testing accuracy rates of the PID problem. Curve A shows the accuracy rate using the membership value only; curve B shows the accuracy rate using both the membership value and Euclidean distance; curve C shows the accuracy rate using the Euclidean distance only.

Table 1. Classification accuracy from various methods for the PID data set methods Accuracy (%) LDA 77.5 C4.5 73.0 CART 72.8 K-NN 71.9 Curve A (best result) 76.56 Curve B (best result) 72.4 Curve C (best result) 74.9

A) Bench mark data sets: 2) IRIS data set � The IRIS data set consists 150 samples in a 3 class classification problem. The data set was divided into two sets; training set which consisted of 80% of each class, and a test set with the remaining samples. � The experiment was conducted for the data set, and the results of the proposed methods along with the original FMM are shown in Figure 4. � Table 2 shows the maximum accuracy of various methods in comparison with the proposed methods (all the experiments are conducted using the same training and test data sets).

96 94 Testing Accuracy % 92 A 90 B 88 C 86 84 82 1 3 5 7 9 5 5 5 5 5 5 5 5 5 0 0 0 0 0 1 2 3 4 5 6 7 8 9 . . . . . . . . . . . . . . 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Theta Fig. 4. The testing accuracy rates of the IRIS data set. Curve A shows the accuracy rate using the membership value only; curve B shows the accuracy rate using both the membership value and Euclidean distance; curve C shows the accuracy rate using the Euclidean distance only.

Table 2. Percentage error for various methods for the IRIS data set methods Accuracy (%) C4.5 91.60 OC1 93.89 LMDT 95.45 LVQ 92.55 Curve A (best result) 94.00 Curve B (best result) 93.33 Curve C (best result) 94.00

B) Fault Classification � A fault detection and classification system predicts failures, and when a failure occurs, it identifies the reason(s) of failures. � In this study, we investigate the applicability of modified FMM using a set of sensor measurements collected from a power generation plant in Malaysia. � The system under consideration is a circulating water (CW) system, as shown in Figure 5

Low Pressure Turbines Steam Condens er To Sea Strainer Common Primary Bar CW Discharge Screen Pumps Header Condensate Seawater (Reused in steam cycle process) Fig. 5. The circulating water system

� A data set of 2439 samples was collected. � Each data sample consisted of 12 features comprising of temperature and pressure measurements at various inlet and outlet points of the condenser, as well as other important parameters. � Two case studies were conducted: 1. The heat transfer conditions. 2. The tube blockage conditions.

A Modified Fuzzy Min-Max Neural Network and Its Application to Fault - PowerPoint PPT Presentation

A Modified Fuzzy Min-Max Neural Network and Its Application to Fault Classification Anas M. Quteishat and Chee Peng Lim School of Electrical & Electronic Engineering University of Science Malaysia Abstract The objectives of this paper are:

1 min 2 min 3 min www.matsgroup.info 1 min 2 min 3 min www.matsgroup.info 1 min 2 min 3

On Fuzzy Soft Rings Banu Pazar Varol and Halis Ayg un Department of Mathematics, Kocaeli

Applications Three sample applications Fuzzy inferno Nostalgic cow Twilight Eden Fuzzy inferno

7 Transformations of Fuzzy Sets Fuzzy Systems Engineering Toward Human-Centric Computing

11 Fuzzy Rule-Based Models Fuzzy Systems Engineering Toward Human-Centric Computing Contents

Semi-Heuristic Target-Based Fuzzy Target . . . Fuzzy Target . . . Fuzzy Decision Procedures:

M odels for Inexact Reasoning Fuzzy Logic Lesson 8 Fuzzy Controllers M aster in

Class 4 @rwdkent Overview Current Events (10 min) Break (5 min) Explore RWD (25 min) CSS

2 Notions and Concepts of Fuzzy Sets Fuzzy Systems Engineering Toward Human-Centric Computing

A fuzzy clustering method using Genetic Algorithm and Fuzzy Subtractive Clustering Thanh Le, Tom

M odels for Inexact Reasoning Fuzzy Logic Lesson 1 Crisp and Fuzzy Sets M aster in

5 Operations and Aggregations of Fuzzy Sets Fuzzy Systems Engineering Toward Human-Centric

10 Fuzzy Modeling: Principles and Methodology Fuzzy Systems Engineering Toward Human-Centric

Fuzzy Reasoning Outline Introduction Bivalent & Multivalent Logics Fundamental

On using Different Distance Measures for Fuzzy Numbers in Fuzzy Linear Regression Models Duygu

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

Intro to Discrete Probability CS 70, Summer 2019 Lecture 15, 7/18/19 1 / 24 Why Learn

MA162: Finite mathematics . Jack Schmidt University of Kentucky April 11, 2012 Schedule: HW

Mesh Simplification Mesh Simplification 1 Spring 2010 The The Law The The Law Law of Law

Computing Nearby Non-Trivial Smith Forms Joseph Haraldson with Mark Giesbrecht and George Labahn

EE361: SIGNALS AND SYSTEMS II CH2: RANDOM VARIABLES http://www.ee.unlv.edu/~b1morris/ee361 2

Probability of Simple Events Return to Table of Contents Slide 7 / 66 Slide 8 / 66 Sample

The Automated GOLE Pilot Project Update Presented to GLIF Feb 23, 2011 Feb 23, 2011 Hong Kong,

Introduction to Web Design & Computer Principles SVG CSCI-UA 4 Scalable Vector Graphics

A Modified Fuzzy Min-Max Neural Network and Its Application to Fault - PowerPoint PPT Presentation

A Modified Fuzzy Min-Max Neural Network and Its Application to Fault Classification Anas M. Quteishat and Chee Peng Lim School of Electrical & Electronic Engineering University of Science Malaysia Abstract The objectives of this paper are:

1 min 2 min 3 min www.matsgroup.info 1 min 2 min 3 min www.matsgroup.info 1 min 2 min 3

On Fuzzy Soft Rings Banu Pazar Varol and Halis Ayg un Department of Mathematics, Kocaeli

Applications Three sample applications Fuzzy inferno Nostalgic cow Twilight Eden Fuzzy inferno

7 Transformations of Fuzzy Sets Fuzzy Systems Engineering Toward Human-Centric Computing

11 Fuzzy Rule-Based Models Fuzzy Systems Engineering Toward Human-Centric Computing Contents

Semi-Heuristic Target-Based Fuzzy Target . . . Fuzzy Target . . . Fuzzy Decision Procedures:

M odels for Inexact Reasoning Fuzzy Logic Lesson 8 Fuzzy Controllers M aster in

Class 4 @rwdkent Overview Current Events (10 min) Break (5 min) Explore RWD (25 min) CSS

2 Notions and Concepts of Fuzzy Sets Fuzzy Systems Engineering Toward Human-Centric Computing

A fuzzy clustering method using Genetic Algorithm and Fuzzy Subtractive Clustering Thanh Le, Tom

M odels for Inexact Reasoning Fuzzy Logic Lesson 1 Crisp and Fuzzy Sets M aster in

5 Operations and Aggregations of Fuzzy Sets Fuzzy Systems Engineering Toward Human-Centric

10 Fuzzy Modeling: Principles and Methodology Fuzzy Systems Engineering Toward Human-Centric

Fuzzy Reasoning Outline Introduction Bivalent &amp; Multivalent Logics Fundamental

On using Different Distance Measures for Fuzzy Numbers in Fuzzy Linear Regression Models Duygu

Neural Information Retrieval Wassila Lalouani 1 Plan Neural network architectures Neural

Intro to Discrete Probability CS 70, Summer 2019 Lecture 15, 7/18/19 1 / 24 Why Learn

MA162: Finite mathematics . Jack Schmidt University of Kentucky April 11, 2012 Schedule: HW

Mesh Simplification Mesh Simplification 1 Spring 2010 The The Law The The Law Law of Law

Computing Nearby Non-Trivial Smith Forms Joseph Haraldson with Mark Giesbrecht and George Labahn

EE361: SIGNALS AND SYSTEMS II CH2: RANDOM VARIABLES http://www.ee.unlv.edu/~b1morris/ee361 2

Probability of Simple Events Return to Table of Contents Slide 7 / 66 Slide 8 / 66 Sample

The Automated GOLE Pilot Project Update Presented to GLIF Feb 23, 2011 Feb 23, 2011 Hong Kong,

Introduction to Web Design &amp; Computer Principles SVG CSCI-UA 4 Scalable Vector Graphics

Fuzzy Reasoning Outline Introduction Bivalent & Multivalent Logics Fundamental

Introduction to Web Design & Computer Principles SVG CSCI-UA 4 Scalable Vector Graphics