true pt
Olfa Nasraoui
Department of Computer Engineering
& Computer Science
University of Louisville,
olfa.nasraoui_AT_louisville.edu
A GA [Hol75,Gol89,SJB$^+$93] works on a population of individuals representing possible solutions to a particular problem. Each individual is evaluated using a fitness value which is a measure of how well adapted this individual is to its environment. As in nature, individuals mate and reproduce in a GA with a reproduction rate that is proportional to their fitnesses. The individuals are represented by their genetic material or genotype. This is accomplished by choosing a suitable representation scheme to code the possible solutions into individuals. Commonly, a solution is coded into a genotype consisting of a binary chromosome string with a predetermined number of bits which determines the problem's size. This coding creates a quantization of the solution space. A GA starts with an initial population of candidate solutions or individuals, and tries to modify them until the population converges to a solution. A problem-dependent fitness function must be chosen to measure the goodness of an individual. The modification of the population is done using an iterative application of three genetic operators that provide general exploratory heuristics which evolve the population from one generation to the next. These operators are selection, crossover, and mutation.
Selection decides which individuals shall survive into the next generation. The Simple Genetic Algorithm (SGA) uses fitness-proportionate selection which is implemented using tournament or roulette wheel selection. In this method, members of the population are selected with a probablity that is proportional to their fitness score. This selection mechanism encourages the survival of the fittest individuals to the next generation, and causes them to eventually overtake the population.
Mating between two individuals is implemented using crossover which generally allows the creation of better children or
offspring by combining the genetic materials of two parents with a large crossover probability
. These offspring will replace their parents in the next generation. Typically, one-point crossover is performed, where a single position is randomly selected along the parent's bit strings. This position is referred to as the crossing site. Then the chromosome strings of two children are formed by copying the bits located to the left of the crossing point from one parent's string and the bits located to the right of the crossing point from the other parent's string. Crossover is generally considered to be the primary search mechanism in GA's.
Mutation is a totally random step, where each bit in the chromosome string of the offspring individuals is allowed to change value with a small
mutation probability
. This operator adds diversity to the population, thereby encouraging the exploration of new areas of the solution space to globalize the
search for the optimal solution.
GAs distinguish themeselves from other optimizing tools because of their implicit parallelism, diversity and intensification. Parallelism and diversity are achieved by using a population of solutions instead of a single solution, which makes the GA one of the best global optimization tools. Intensification consists in preserving good solutions and combining their good features to produce better solutions through selection and crossover. This desirable feature makes the GA a more efficient searching tool compared to other merely exploratory and costlier global optimization methods which are based on exhaustive-like and random mutation search.
Sharing methods [Hol75,GR87] attempt to maintain a diverse population with members distributed among all the niches corresponding to the peaks in a multimodal fitness landscape. They achieve this diversity by reducing the fitness of individuals that have highly similar members within the population. This in turn discourages redundent solutions from overtaking the entire population, while rewarding individuals that uniquely exploit areas of the domain. As a result, diversity is enhanced and popultion members are kept at local optima. The shared fitness of the
individual is defined as
, where
is the raw fitness and
is the niche count, given by
, where
is the number of individuals in the population, and
is the distance between the
and
individuals. It is often recommended that
be based on the individuals' phenotypes, i. e., it should be based on domain knowledge as opposed to being based solely on their genotype. The sharing function
reaches a maximum of
at zero, decreases monotonically with distance, and falls to zero for distances that are greater than
. For example, the triangular sharing function is given by
Recently, Miller and Shaw [MS95] proposed a ``dynamic sharing'' scheme which attempts to alleviate some of the drawbacks of standard sharing. In their scheme, the dynamic niche count is estimated more accurately leading to more accurate shared fitness values. This is accomplished by identifying the peaks in the population and assigning each member to the closest peak. However their method still assumes that the number of peaks is known and that all the peaks are a minimum distance of
apart, where the niche radius
is also assumed to be known and equal for all niches. In their scheme, they also used restricted mating to prevent crossover between members of different niches. In their mating restriction method, called dynamic inbreeding, the first parent is selected through tournament selection based upon shared fitness values. Then a second parent is determined by examining a pool of MF (Mating Factor) members selected randomly without replacement from the population, and selecting the fittest individual coming from the same niche as the first parent. If no such individual is found, then the parent is mated with the most similar individual in the mating pool, which can create lethal offspring. Dynamic inbreeding certainly offers an improvement over unrestricted mating. However, it relies on a correct identification of all the niche peaks which is based on two critical assumptions that require prior knowledge about the fitness landscape as discussed above.
Both standard and dynamic sharing methods have the disadvantage of giving preference to higher peaks because they use tournament selection based on fitness which can cause the extinction of relatively low peaks. One of the early attempts at maintaining a diverse population without the use of fitness based tournament selection or shared fitnesses was made by De Jong [Jon75] who proposed Crowding methods which try to form and maintain niches by replacing population members preferrably with the most similar individuals. Unfortunately, stochastic ``replacement errors'' prevented this method from maintaining more than two peaks in a multimodal fitness landscape. Mahfoud [Mah92] proposed an improved crowding mechanism, called ``deterministic crowding'' (DC), which nearly eliminated replacement errors and proved more effective in maintaining multiple niches. DC is presented below:
|
Deterministic Crowding (DC)
Repeat for
}
|
Unlike sharing methods, DC is free of any parameters relying on assumptions about the number of peaks or their widths. Also, unlike sharing, the expected distribution of the population at convergence is independent of fitness. Instead, the cardinality of each niche is expected to be proportional to the fraction of the population from the search space that falls within this niche, or in other words the prior probability of the niche. This is because the distribution of the converged population is expected to be similar to that of the initial population which is selected randomly. Hence, competition and improvement occur only within the niches leading to a diverse population with members that are closer to the actual peaks. Despite its considerable improvements, DC, like sharing methods, suffered from crossover interactions between different niches. This problem occurs when the fitness landscape contains dominated and dominant niches. A dominated niche can, with another niche's assistance, cross to form members from a fitter (dominant) niche. This causes a migration of members from the dominated peaks to the dominant peaks that will only come to a halt when one of the dominated niches is depleted of its members. Obviously, the effect of this migration on diversity can be pernicious because it eventually leads to the extinction of certain dominated peaks. Despite the crossover interaction problem inherent in all the discussed niching schemes, DC appears to be the most efficient scheme for multimodal domain optimization.
This document was generated using the LaTeX2HTML translator Version 2002 (1.62)
Copyright © 1993, 1994, 1995, 1996,
Nikos Drakos,
Computer Based Learning Unit, University of Leeds.
Copyright © 1997, 1998, 1999,
Ross Moore,
Mathematics Department, Macquarie University, Sydney.
The command line arguments were:
latex2html -split 0 -image_type gif GeneticAlgorithms.tex