Particle Swarm Optimization¶
Table of contents
Algorithm Description¶
Particle Swarm Optimization (PSO) is a stochastic swarm intelligence algorithm for global optimization
where \(f\) is potentially ill-behaved in one or more ways, such as non-convexity and/or non-differentiability. The updating rule for PSO is described below.
Let \(X^{(i)}\) denote the \(N \times d\) dimensional array of input values at stage \(i\) of the algorithm, where each row corresponds to a different vector of candidate solutions.
Update the velocity and position matrices. Sample two \(d\)-dimensional IID uniform random vectors, \(R_C, R_S\).
Update each velocity vector as follows:
\[V^{(i+1)}(j.:) = w \times V^{(i)}(j,:) + c_C \times R_C \odot (X_b^{(i)} (j,:) - X^{(i)}(j,:)) + c_S \times R_S \odot (g_b - X^{(i)}(j,:))\]Each position vector is then updated by element-wise addition of the updated velocity vector:
\[X^{(i+1)}(j,:) = X^{(i)}(j,:) + V^{(i+1)}(j,:)\]Update local-best particle.
Set:
\[X_b^{(i+1)}(j,:) = \begin{cases} X^{(i+1)}(j,:) & \text{ if } f(X^{(i+1)}(j,:)) < f(X_b^{(i)}(j,:)) \\ X_b^{(i)}(j,:) & \text{ else } \end{cases}\]Update the global-best particle.
Let:
\[j^{(*)} = \arg \min_{j \in \{1, \ldots, N\}} f(X^{(i+1)} (j,:))\]Then:
\[g_b = \begin{cases} X^{(i+1)}(j^{(*)},:) & \text{ if } f(X^{(i+1)}(j^{(*)},:)) < f(g_b) \\ g_b & \text{ else } \end{cases}\]
The algorithm stops when at least one of the following conditions are met:
the relative improvement in the objective function is less than
rel_objfn_change_tol
betweenpso_settings.check_freq
number of generations;the total number of generations exceeds
pso_settings.n_gen
.
PSO with Differentially-Perturbed Velocity¶
TBW.
Function Declarations¶
-
bool pso(ColVec_t &init_out_vals, std::function<fp_t(const ColVec_t &vals_inp, ColVec_t *grad_out, void *opt_data)> opt_objfn, void *opt_data)¶
Particle Swarm Optimization (PSO) Algorithm.
- Parameters
init_out_vals – a column vector of initial values, which will be replaced by the solution upon successful completion of the optimization algorithm.
opt_objfn – the function to be minimized, taking three arguments:
vals_inp
a vector of inputs;grad_out
a vector to store the gradient; andopt_data
additional data passed to the user-provided function.
opt_data – additional data passed to the user-provided function.
- Returns
a boolean value indicating successful completion of the optimization algorithm.
-
bool pso(ColVec_t &init_out_vals, std::function<fp_t(const ColVec_t &vals_inp, ColVec_t *grad_out, void *opt_data)> opt_objfn, void *opt_data, algo_settings_t &settings)¶
Particle Swarm Optimization (PSO) Algorithm.
- Parameters
init_out_vals – a column vector of initial values, which will be replaced by the solution upon successful completion of the optimization algorithm.
opt_objfn – the function to be minimized, taking three arguments:
vals_inp
a vector of inputs;grad_out
a vector to store the gradient; andopt_data
additional data passed to the user-provided function.
opt_data – additional data passed to the user-provided function.
settings – parameters controlling the optimization routine.
- Returns
a boolean value indicating successful completion of the optimization algorithm.
-
bool pso_dv(ColVec_t &init_out_vals, std::function<fp_t(const ColVec_t &vals_inp, ColVec_t *grad_out, void *opt_data)> opt_objfn, void *opt_data)¶
Particle Swarm Optimization (PSO) with Differentially-Perturbed Velocity (DV)
- Parameters
init_out_vals – a column vector of initial values, which will be replaced by the solution upon successful completion of the optimization algorithm.
opt_objfn – the function to be minimized, taking three arguments:
vals_inp
a vector of inputs;grad_out
a vector to store the gradient; andopt_data
additional data passed to the user-provided function.
opt_data – additional data passed to the user-provided function.
- Returns
a boolean value indicating successful completion of the optimization algorithm.
-
bool pso_dv(ColVec_t &init_out_vals, std::function<fp_t(const ColVec_t &vals_inp, ColVec_t *grad_out, void *opt_data)> opt_objfn, void *opt_data, algo_settings_t &settings)¶
Particle Swarm Optimization (PSO) with Differentially-Perturbed Velocity (DV)
- Parameters
init_out_vals – a column vector of initial values, which will be replaced by the solution upon successful completion of the optimization algorithm.
opt_objfn – the function to be minimized, taking three arguments:
vals_inp
a vector of inputs;grad_out
a vector to store the gradient; andopt_data
additional data passed to the user-provided function.
opt_data – additional data passed to the user-provided function.
settings – parameters controlling the optimization routine.
- Returns
a boolean value indicating successful completion of the optimization algorithm.
Optimization Control Parameters¶
The basic control parameters are:
size_t pso_settings.n_pop
: size of population for each generation.Default value:
200
.
size_t pso_settings.n_gen
: number of generations.Default value:
1000
.
size_t pso_settings.center_particle
: whether to add a particle that averages across the population in each generation.Default value:
true
.
fp_t pso_settings.inertia_method
: set inertia method (1
1 for linear decreasing betweenw_min
andw_max
, or2
for dampening).Default value:
1
.
fp_t pso_settings.par_w
: initial value of the weight parameter \(w\).Default value:
1.0
.
fp_t pso_settings.par_w_min
: lower bound on the weight parameter \(w\).Default value:
0.1
.
fp_t pso_settings.par_w_max
: upper bound on the weight parameter \(w\).Default value:
0.99
.
fp_t pso_settings.par_w_damp
: dampening parameter forinertia_method
equal to2
.Default value:
0.99
.
fp_t pso_settings.velocity_method
: set velocity method (1
for fixed values or2
for linear change from initial to final values).Default value:
1
.
fp_t pso_settings.par_c_cog
: initial value for \(c_C\).Default value:
2.0
.
fp_t pso_settings.par_c_soc
: initial value for \(c_S\).Default value:
2.0
.
fp_t pso_settings.par_final_c_cog
: final value for \(c_C\).Default value:
0.5
.
fp_t pso_settings.par_final_c_soc
: final value for \(c_S\).Default value:
2.5
.
size_t pso_settings.check_freq
: how many generations to skip when evaluating whether the best candidate value has improved between generations (i.e., to check for potential convergence).Default value:
(size_t)-1
.
Upper and lower bounds of the uniform distributions used to generate the initial population:
ColVec_t pso_settings.initial_lb
: defines the lower bounds of the search space.ColVec_t pso_settings.initial_ub
: defines the upper bounds of the search space.
fp_t rel_objfn_change_tol
: the error tolerance value controlling how small the relative change in best candidate solution should be before ‘convergence’ is declared.bool vals_bound
: whether the search space of the algorithm is bounded. Iftrue
, thenColVec_t lower_bounds
: defines the lower bounds of the search space.ColVec_t upper_bounds
: defines the upper bounds of the search space.
In addition to these:
int print_level
: Set print level.Level 1: Print iteration count and the relative improvement in the objective function value between iterations.
Level 2: Level 1 and print best input values, as well as objective function values.
Level 3: Level 2 and print full matrix \(X\).
Examples¶
Ackley Function¶
Code to run this example is given below.
Armadillo (Click to show/hide)
#define OPTIM_ENABLE_ARMA_WRAPPERS
#include "optim.hpp"
#define OPTIM_PI 3.14159265358979
double
ackley_fn(const arma::vec& vals_inp, arma::vec* grad_out, void* opt_data)
{
const double x = vals_inp(0);
const double y = vals_inp(1);
double obj_val = 20 + std::exp(1) - 20*std::exp( -0.2*std::sqrt(0.5*(x*x + y*y)) ) - std::exp( 0.5*(std::cos(2 * OPTIM_PI * x) + std::cos(2 * OPTIM_PI * y)) );
return obj_val;
}
int main()
{
arma::vec x = arma::ones(2,1) + 1.0; // initial values: (2,2)
bool success = optim::pso(x, ackley_fn, nullptr);
if (success) {
std::cout << "pso: Ackley test completed successfully." << std::endl;
} else {
std::cout << "pso: Ackley test completed unsuccessfully." << std::endl;
}
arma::cout << "pso: solution to Ackley test:\n" << x << arma::endl;
return 0;
}
Eigen (Click to show/hide)
#define OPTIM_ENABLE_EIGEN_WRAPPERS
#include "optim.hpp"
#define OPTIM_PI 3.14159265358979
double
ackley_fn(const Eigen::VectorXd& vals_inp, Eigen::VectorXd* grad_out, void* opt_data)
{
const double x = vals_inp(0);
const double y = vals_inp(1);
double obj_val = 20 + std::exp(1) - 20*std::exp( -0.2*std::sqrt(0.5*(x*x + y*y)) ) - std::exp( 0.5*(std::cos(2 * OPTIM_PI * x) + std::cos(2 * OPTIM_PI * y)) );
return obj_val;
}
int main()
{
Eigen::VectorXd x = 2.0 * Eigen::VectorXd::Ones(2); // initial values: (2,2)
bool success = optim::pso(x, ackley_fn, nullptr);
if (success) {
std::cout << "pso: Ackley test completed successfully." << std::endl;
} else {
std::cout << "pso: Ackley test completed unsuccessfully." << std::endl;
}
arma::cout << "pso: solution to Ackley test:\n" << x << arma::endl;
return 0;
}
Rastrigin Function¶
Code to run this example is given below.
Armadillo Code (Click to show/hide)
#define OPTIM_ENABLE_ARMA_WRAPPERS
#include "optim.hpp"
#define OPTIM_PI 3.14159265358979
struct rastrigin_fn_data {
double A;
};
double
rastrigin_fn(const arma::vec& vals_inp, arma::vec* grad_out, void* opt_data)
{
const int n = vals_inp.n_elem;
rastrigin_fn_data* objfn_data = reinterpret_cast<rastrigin_fn_data*>(opt_data);
const double A = objfn_data->A;
double obj_val = A*n + arma::accu( arma::pow(vals_inp,2) - A*arma::cos(2 * OPTIM_PI * vals_inp) );
return obj_val;
}
int main()
{
rastrigin_fn_data test_data;
test_data.A = 10;
arma::vec x = arma::ones(2,1) + 1.0; // initial values: (2,2)
bool success = optim::pso(x, rastrigin_fn, &test_data);
if (success) {
std::cout << "pso: Rastrigin test completed successfully." << std::endl;
} else {
std::cout << "pso: Rastrigin test completed unsuccessfully." << std::endl;
}
arma::cout << "pso: solution to Rastrigin test:\n" << x << arma::endl;
return 0;
}
Eigen Code (Click to show/hide)
#define OPTIM_ENABLE_EIGEN_WRAPPERS
#include "optim.hpp"
#define OPTIM_PI 3.14159265358979
struct rastrigin_fn_data {
double A;
};
double
rastrigin_fn(const Eigen::VectorXd& vals_inp, Eigen::VectorXd* grad_out, void* opt_data)
{
const int n = vals_inp.n_elem;
rastrigin_fn_data* objfn_data = reinterpret_cast<rastrigin_fn_data*>(opt_data);
const double A = objfn_data->A;
double obj_val = A*n + vals_inp.array().pow(2).sum() - A * (2 * OPTIM_PI * vals_inp).array().cos().sum();
return obj_val;
}
int main()
{
rastrigin_fn_data test_data;
test_data.A = 10;
Eigen::VectorXd x = 2.0 * Eigen::VectorXd::Ones(2); // initial values: (2,2)
bool success = optim::pso(x, rastrigin_fn, &test_data);
if (success) {
std::cout << "pso: Rastrigin test completed successfully." << std::endl;
} else {
std::cout << "pso: Rastrigin test completed unsuccessfully." << std::endl;
}
arma::cout << "pso: solution to Rastrigin test:\n" << x << arma::endl;
return 0;
}