1
Genetic Algorithms for Simultaneous Equation Models
José J. López
Universidad Miguel Hernández (Elche, Spain)
Domingo Giménez
Universidad de Murcia (Murcia, Spain) DCAI 2008 2008
Genetic Algorithms for Simultaneous Equation Models Jos J. Lpez - - PowerPoint PPT Presentation
Genetic Algorithms for Simultaneous Equation Models Jos J. Lpez Universidad Miguel Hernndez (Elche, Spain) Domingo Gimnez Universidad de Murcia (Murcia, Spain) DCAI 2008 2008 1 Contents Introduction Simultaneous equations
1
José J. López
Universidad Miguel Hernández (Elche, Spain)
Domingo Giménez
Universidad de Murcia (Murcia, Spain) DCAI 2008 2008
2
Introduction Simultaneous equations models The problem: Find the best SEM given a set of values of variables Genetic Algorithms for selecting the best SEM
Defining a valid chromosome Initialization and EndConditions Evaluating a chromosome Crossover Mutation
Random Search Experimental results Conclusions and future works References
3
S.E.M. have been used in econometrics for years.
Traditionally, Simultaneous Equation Models (SEM)
The objective is to develop an algorithm which, given the
The space of the possible solutions is very large and
A combination between genetic and random search is
4
where and are dx1
2 21 1 23 3 2 21 1 2 2
N N K K
1 12 2 13 3 1 11 1 1 1
N N K K
1 1 2 2 1 1 1 1
N N N NN N N NK K N
− −
i j
i
1... , 1... i N j K = =
5
One model is considered better than another if it
AIC is one of the most used methods for
N e i i i
=
d is the sample size, ni and ki the number of endogenous
and exogenous variables in equation i, and is the covariance matrix of the errors.
6
columns.
the chromosome is one, and zero if not.
variables and the other K columns represent the exogenous ones. For example, in a problem with N=2 endogenous variables (Y1 and Y2) and K=3 predetermined variables (X1, X2 and X3):
1 1,2 2 1,1 1 1,2 2 1 2 2,1 1 2,3 3 2
7
The model has to have at
If the (i,i) element is zero,
Each equation in the model
The number of comparisons
Rank condition: Equation i
1,1 , 1,2 , 1
N K N N
−
2
8
The algorithm on the
The cost of evaluating
chromosome c and the set of variables Y and X
the variables Y and its estimation
2 3
9
6,23 45,32 7,28 1000 200 150 2,79 20,31 7,29 500 200 150 12,82 18,21 1,42 1000 200 100 5,38 7,70 1,43 500 200 100 8,33 3,50 0,42 1000 100 75 4,51 1,94 0,43 500 100 75 25,29 1,77 0,07 1000 100 50 11,29 0,79 0,07 500 100 50 322,22 0,09 0,00027 500 100 10 sp function chromosome d K N Fitness Valid Size of the problem Times
The cost of
10
the same probability of zeros and ones) {C1 AND C2 CONDITIONS}
3. invert all the elements e(i,i) with i=1,...,N
{C3 CONDITION}
6. IF the element e(i,i) is zero 7. make all the elements zero in column i 8. END IF
{C4 CONDITION}
11. IF equation i fails the range condition 12. generate randomly this equation (row i) and go to 2 13. END IF
generated according to the algorithm on the right.
PopSize) is stated at the beginning.
it reaches a maximum number of iterations, called MaxIter, or the best fitness is repeated over a number of successive iterations, called MaxBest.
at the beginning of the algorithm.
11
32975,04 24 64,41 31262,20 102 294,19 30956,78 111 325,87 100 50 40 22765,68 17 9,47 22120,10 72 87,54 21937,02 50 58,33 100 40 30 4709,50 40 1,94 4540,93 53 6,73 4548,68 62 8,00 50 20 15 2833,41 20 0,66 2732,90 97 5,11 2683,13 48 3,03 50 15 10 best fitness iter t best fitness iter t best fitness iter t d K N IE SPCE SP size crossover crossover crossover problem
Three sorts of crossover are studied:
Single Point (SP) Single Point considering equations (SPCE) Inside an Equation (IE)
11110110 01110110 01110110 11110110 01110110 11110110 11110110 01110110 01110101 11110100 11110101 01110100 01110101 11110100 01110100 11110101 10100100 11110110 10100100 11110110 10100110 11110110 10100100 11110110 child2 child1 child2 child1 child2 child1 parent2 parent1 e = 2, v1 = 2, v2 = 3 e = 1 e = 10 parents IE SPCE SP
12
A small probability of mutation is considered in each
A chromosome of the new subset generated in the
PROBLEM: When a chromosome is mutated and then
13
4.
Generate v between 1 and N+K randomly 5. c1=Mutate(c) {invert the element (e,v) of chromosome c}
6. IF GoodChromosome(c1) AND
Evaluation(c1)<Evaluation(c)
7.
c=c1
8. END IF 9. IF Evaluation(c)<SV 10.
EndConditions=TRUE 11. END IF
random search is used in the mutation, following the algorithm on the right.
enough when its evaluation is lower than a parameter called SV.
N=10, K=20 N=20, K=30 Mode NEG AIC time AIC time
without random search
5.10 4658.06 15.41 1 2143.54 9.79 4710.53 49.14 with random search [N/2] 1491.13 12.62 3072.98 102.23 [N/4]
27.48 811.65 227.35 N
34.17
449.78
14
system with two nodes Intel Itanium, connected by Gigabit Ethernet, where each node is equipped with four dual-core 1.4 GHz Montecito processors, i.e. 8 processors per node.
genetic algorithm and the optimum, when varying the population size (PopSize), N and
the crossover is inside an equation.
seconds) and speed-up of the algorithm in shared memory.
4 4
4 3
3 3 46,18 46,18 46,18 3 2 66,44 66,44 66,44 2 2 Optimum PopSize=500 PopSize=100 K N best fitness size 5,12 699,21 3,33 1076,45 1,86 1927,76 3580,94 200 90 70 500 4,64 229,68 2,90 368,11 1,52 699,20 1065,77 150 65 50 500 4,24 47,47 3,23 62,30 1,74 115,71 201,29 100 40 30 500 4,53 4,70 2,84 7,50 1,85 11,55 21,31 50 15 10 500 3,81 185,88 2,56 277,15 1,70 417,62 709,05 200 90 70 100 3,41 63,81 2,13 102,27 1,43 152,19 217,79 150 65 50 100 3,31 12,31 2,51 16,24 1,55 26,21 40,74 100 40 30 100 4,06 1,04 2,60 1,62 1,68 2,51 4,22 50 15 10 100 sp 8th sp 4th sp 2th 1th d K N PopSize
15
Conclusions
set of variables is studied.
and to speed up the convergence.
processors in the solution of the problem, has been developed.
Future Works
Application to real problems. Develop a hybrid (message-passing plus shared memory) algorithm. Use and comparison other criteria parameters.
16
Petrov, Csaki F. (Ed.), Proc. 2nd Int. Symp. on Information Theory, Akademiai Kiado, Budapest, 267 281, 1973.
50, 226-231, 1994.
Computational Statistics and Data Analysis, 28, 51-76,1998.
707-716, 1997.
36(3), 1-8, 2005.
B, 60 (3), 551-558, 1998.