Differential evolution

Method of mathematical optimization
Part of a series on the
Evolutionary algorithm
  • Artificial development
  • Artificial life
  • Cellular evolutionary algorithm
  • Cultural algorithm
  • Differential evolution
  • Effective fitness
  • Evolutionary computation
  • Evolution strategy
  • Gaussian adaptation
  • Grammar induction
  • Evolutionary multimodal optimization
  • Particle swarm optimization
  • Memetic algorithm
  • Natural evolution strategy
  • Neuroevolution
  • Promoter based genetic algorithm
  • Spiral optimization algorithm
  • Self-modifying code
  • Polymorphic code
Genetic algorithm
Genetic programming
  • v
  • t
  • e
Differential Evolution optimizing the 2D Ackley function.

In evolutionary computation, differential evolution (DE) is a method that optimizes a problem by iteratively trying to improve a candidate solution with regard to a given measure of quality. Such methods are commonly known as metaheuristics as they make few or no assumptions about the optimized problem and can search very large spaces of candidate solutions. However, metaheuristics such as DE do not guarantee an optimal solution is ever found.

DE is used for multidimensional real-valued functions but does not use the gradient of the problem being optimized, which means DE does not require the optimization problem to be differentiable, as is required by classic optimization methods such as gradient descent and quasi-newton methods. DE can therefore also be used on optimization problems that are not even continuous, are noisy, change over time, etc.[1]

DE optimizes a problem by maintaining a population of candidate solutions and creating new candidate solutions by combining existing ones according to its simple formulae, and then keeping whichever candidate solution has the best score or fitness on the optimization problem at hand. In this way, the optimization problem is treated as a black box that merely provides a measure of quality given a candidate solution and the gradient is therefore not needed.

Storn and Price introduced Differential Evolution in 1995.[2][3][4] Books have been published on theoretical and practical aspects of using DE in parallel computing, multiobjective optimization, constrained optimization, and the books also contain surveys of application areas.[5][6][7][8] Surveys on the multi-faceted research aspects of DE can be found in journal articles .[9][10]

Algorithm

A basic variant of the DE algorithm works by having a population of candidate solutions (called agents). These agents are moved around in the search-space by using simple mathematical formulae to combine the positions of existing agents from the population. If the new position of an agent is an improvement then it is accepted and forms part of the population, otherwise the new position is simply discarded. The process is repeated and by doing so it is hoped, but not guaranteed, that a satisfactory solution will eventually be discovered.

Formally, let f : R n R {\displaystyle f:\mathbb {R} ^{n}\to \mathbb {R} } be the fitness function which must be minimized (note that maximization can be performed by considering the function h := f {\displaystyle h:=-f} instead). The function takes a candidate solution as argument in the form of a vector of real numbers. It produces a real number as output which indicates the fitness of the given candidate solution. The gradient of f {\displaystyle f} is not known. The goal is to find a solution m {\displaystyle \mathbf {m} } for which f ( m ) f ( p ) {\displaystyle f(\mathbf {m} )\leq f(\mathbf {p} )} for all p {\displaystyle \mathbf {p} } in the search-space, which means that m {\displaystyle \mathbf {m} } is the global minimum.

Let x R n {\displaystyle \mathbf {x} \in \mathbb {R} ^{n}} designate a candidate solution (agent) in the population. The basic DE algorithm can then be described as follows:

  • Choose the parameters NP 4 {\displaystyle {\text{NP}}\geq 4} , CR [ 0 , 1 ] {\displaystyle {\text{CR}}\in [0,1]} , and F [ 0 , 2 ] {\displaystyle F\in [0,2]} .
    • NP {\displaystyle {\text{NP}}} is the population size, i.e. the number of candidate agents or "parents"; a typical setting is 10 n {\displaystyle n} .
    • The parameter CR [ 0 , 1 ] {\displaystyle {\text{CR}}\in [0,1]} is called the crossover probability and the parameter F [ 0 , 2 ] {\displaystyle F\in [0,2]} is called the differential weight. Typical settings are C R = 0.9 {\displaystyle CR=0.9} and F = 0.8 {\displaystyle F=0.8} .
    • Optimization performance may be greatly impacted by these choices; see below.
  • Initialize all agents x {\displaystyle \mathbf {x} } with random positions in the search-space.
  • Until a termination criterion is met (e.g. number of iterations performed, or adequate fitness reached), repeat the following:
    • For each agent x {\displaystyle \mathbf {x} } in the population do:
      • Pick three agents a , b {\displaystyle \mathbf {a} ,\mathbf {b} } , and c {\displaystyle \mathbf {c} } from the population at random, they must be distinct from each other as well as from agent x {\displaystyle \mathbf {x} } . ( a {\displaystyle \mathbf {a} } is called the "base" vector.)
      • Pick a random index R { 1 , , n } {\displaystyle R\in \{1,\ldots ,n\}} where n {\displaystyle n} is the dimensionality of the problem being optimized.
      • Compute the agent's potentially new position y = [ y 1 , , y n ] {\displaystyle \mathbf {y} =[y_{1},\ldots ,y_{n}]} as follows:
        • For each i { 1 , , n } {\displaystyle i\in \{1,\ldots ,n\}} , pick a uniformly distributed random number r i U ( 0 , 1 ) {\displaystyle r_{i}\sim U(0,1)}
        • If r i < C R {\displaystyle r_{i}<CR} or i = R {\displaystyle i=R} then set y i = a i + F × ( b i c i ) {\displaystyle y_{i}=a_{i}+F\times (b_{i}-c_{i})} otherwise set y i = x i {\displaystyle y_{i}=x_{i}} . (Index position R {\displaystyle R} is replaced for certain.)
      • If f ( y ) f ( x ) {\displaystyle f(\mathbf {y} )\leq f(\mathbf {x} )} then replace the agent x {\displaystyle \mathbf {x} } in the population with the improved or equal candidate solution y {\displaystyle \mathbf {y} } .
  • Pick the agent from the population that has the best fitness and return it as the best found candidate solution.

Parameter selection

Performance landscape showing how the basic DE performs in aggregate on the Sphere and Rosenbrock benchmark problems when varying the two DE parameters NP {\displaystyle {\text{NP}}} and F {\displaystyle {\text{F}}} , and keeping fixed CR {\displaystyle {\text{CR}}} =0.9.

The choice of DE parameters NP {\displaystyle {\text{NP}}} , CR {\displaystyle {\text{CR}}} and F {\displaystyle F} can have a large impact on optimization performance. Selecting the DE parameters that yield good performance has therefore been the subject of much research. Rules of thumb for parameter selection were devised by Storn et al.[4][5] and Liu and Lampinen.[11] Mathematical convergence analysis regarding parameter selection was done by Zaharie.[12]

Constraint handling

Differential evolution can be utilized for constrained optimization as well. A common method involves modifying the target function to include a penalty for any violation of constraints, expressed as: f ~ ( x ) = f ( x ) + ρ × C V ( x ) {\displaystyle f{\tilde {}}(x)=f(x)+\rho \times \mathrm {CV} (x)} . Here, C V ( x ) {\displaystyle \mathrm {CV} (x)} represents either a constraint violation (an L1 penalty) or the square of a constraint violation (an L2 penalty).

This method, however, has certain drawbacks. One significant challenge is the appropriate selection of the penalty coefficient ρ {\displaystyle \rho } . If ρ {\displaystyle \rho } is set too low, it may not effectively enforce constraints. Conversely, if it's too high, it can greatly slow down or even halt the convergence process. Despite these challenges, this approach remains widely used due to its simplicity and because it doesn't require altering the differential evolution algorithm itself.

There are alternative strategies, such as projecting onto a feasible set or reducing dimensionality, which can be used for box-constrained or linearly constrained cases. However, in the context of general nonlinear constraints, the most reliable methods typically involve penalty functions.

Variants

Variants of the DE algorithm are continually being developed in an effort to improve optimization performance. Many different schemes for performing crossover and mutation of agents are possible in the basic algorithm given above, see e.g.[4]

See also

References

  1. ^ Rocca, P.; Oliveri, G.; Massa, A. (2011). "Differential Evolution as Applied to Electromagnetics". IEEE Antennas and Propagation Magazine. 53 (1): 38–49. doi:10.1109/MAP.2011.5773566. S2CID 27555808.
  2. ^ Storn, Rainer; Price, Kenneth (1995). "Differential evolution—a simple and efficient scheme for global optimization over continuous spaces" (PDF). International Computer Science Institute. TR (95). Berkeley: International Computer Science Institute: TR-95-012. Retrieved 3 April 2024.
  3. ^ Storn, R.; Price, K. (1997). "Differential evolution - a simple and efficient heuristic for global optimization over continuous spaces". Journal of Global Optimization. 11 (4): 341–359. doi:10.1023/A:1008202821328. S2CID 5297867.
  4. ^ a b c Storn, R. (1996). "On the usage of differential evolution for function optimization". Biennial Conference of the North American Fuzzy Information Processing Society (NAFIPS). pp. 519–523. doi:10.1109/NAFIPS.1996.534789. S2CID 16576915.
  5. ^ a b Price, K.; Storn, R.M.; Lampinen, J.A. (2005). Differential Evolution: A Practical Approach to Global Optimization. Springer. ISBN 978-3-540-20950-8.
  6. ^ Feoktistov, V. (2006). Differential Evolution: In Search of Solutions. Springer. ISBN 978-0-387-36895-5.
  7. ^ G. C. Onwubolu and B V Babu, "New Optimization Techniques in Engineering". Retrieved 17 September 2016.
  8. ^ Chakraborty, U.K., ed. (2008), Advances in Differential Evolution, Springer, ISBN 978-3-540-68827-3
  9. ^ S. Das and P. N. Suganthan, "Differential Evolution: A Survey of the State-of-the-art", IEEE Trans. on Evolutionary Computation, Vol. 15, No. 1, pp. 4-31, Feb. 2011, DOI: 10.1109/TEVC.2010.2059031.
  10. ^ S. Das, S. S. Mullick, P. N. Suganthan, "Recent Advances in Differential Evolution - An Updated Survey," Swarm and Evolutionary Computation, doi:10.1016/j.swevo.2016.01.004, 2016.
  11. ^ Liu, J.; Lampinen, J. (2002). "On setting the control parameter of the differential evolution method". Proceedings of the 8th International Conference on Soft Computing (MENDEL). Brno, Czech Republic. pp. 11–18.
  12. ^ Zaharie, D. (2002). "Critical values for the control parameters of differential evolution algorithms". Proceedings of the 8th International Conference on Soft Computing (MENDEL). Brno, Czech Republic. pp. 62–67.
  • v
  • t
  • e
Main Topics
Algorithms
Related techniques
Metaheuristic methods
Related topics
Journals
  • Evolutionary Computation (journal)