#include <elxAdaptiveStochasticGradientDescent.h>
A gradient descent optimizer with an adaptive gain.
This class is a wrap around the AdaptiveStochasticGradientDescentOptimizer class. It takes care of setting parameters and printing progress information. For more information about the optimization method, please read the documentation of the AdaptiveStochasticGradientDescentOptimizer class.
This optimizer is very suitable to be used in combination with the Random image sampler, or with the RandomCoordinate image sampler, with the setting (NewSamplesEveryIteration "true"). Much effort has been spent on providing reasonable default values for all parameters, to simplify usage. In most registration problems, good results should be obtained without specifying any of the parameters described below (except the first of course, which defines the optimizer to use).
This optimization method is described in the following references:
[1] P. Cruz, "Almost sure convergence and asymptotical normality of a generalization of Kesten's stochastic approximation algorithm for multidimensional case." Technical Report, 2005. http://hdl.handle.net/2052/74
[2] S. Klein, J.P.W. Pluim, and M. Staring, M.A. Viergever, "Adaptive stochastic gradient descent optimization for image registration," International Journal of Computer Vision, vol. 81, no. 3, pp. 227-239, 2009. http://dx.doi.org/10.1007/s11263-008-0168-y
Acceleration in case of many transform parameters was proposed in the following paper:
[3] Y. Qiao, B. van Lew, B.P.F. Lelieveldt and M. Staring "Fast Automatic Step Size Estimation for Gradient Descent Optimization of Image Registration," IEEE Transactions on Medical Imaging, vol. 35, no. 2, pp. 391 - 403, February 2016. http://elastix.dev/marius/publications/2016_j_TMIa.php
The parameters used in this class are:
Optimizer: Select this optimizer as follows:
(Optimizer "AdaptiveStochasticGradientDescent")
MaximumNumberOfIterations: The maximum number of iterations in each resolution.
example: (MaximumNumberOfIterations 100 100 50)
Default/recommended value: 500. When you are in a hurry, you may go down to 250 for example. When you have plenty of time, and want to be absolutely sure of the best results, a setting of 2000 is reasonable. In general, 500 gives satisfactory results.
MaximumNumberOfSamplingAttempts: The maximum number of sampling attempts. Sometimes not enough corresponding samples can be drawn, upon which an exception is thrown. With this parameter it is possible to try to draw another set of samples.
example: (MaximumNumberOfSamplingAttempts 10 15 10)
Default value: 0, i.e. just fail immediately, for backward compatibility.
AutomaticParameterEstimation: When this parameter is set to "true", many other parameters are calculated automatically: SP_a, SP_alpha, SigmoidMax, SigmoidMin, and SigmoidScale. In the elastix.log file the actually chosen values for these parameters can be found.
example: (AutomaticParameterEstimation "true")
Default/recommended value: "true". The parameter can be specified for each resolution, or for all resolutions at once.
UseAdaptiveStepSizes: When this parameter is set to "true", the adaptive step size mechanism described in the documentation of itk::AdaptiveStochasticGradientDescentOptimizer is used. The parameter can be specified for each resolution, or for all resolutions at once.
example: (UseAdaptiveStepSizes "true")
Default/recommend value: "true", because it makes the registration more robust. In case of using a RandomCoordinate sampler, with (UseRandomSampleRegion "true"), the adaptive step size mechanism is turned off, no matter the user setting.
MaximumStepLength: Also called . This parameter can be considered as the maximum voxel displacement between two iterations. The larger this parameter, the more aggressive the optimization. The parameter can be specified for each resolution, or for all resolutions at once.
example: (MaximumStepLength 1.0)
Default: mean voxel spacing of fixed and moving image. This seems to work well in general. This parameter only has influence when AutomaticParameterEstimation is used.
SP_a: The gain at each iteration is defined by
.
SP_a can be defined for each resolution.
example: (SP_a 3200.0 3200.0 1600.0)
The default value is 400.0. Tuning this variable for you specific problem is recommended. Alternatively set the AutomaticParameterEstimation to "true". In that case, you do not need to specify SP_a. SP_a has no influence when AutomaticParameterEstimation is used.
SP_A: The gain at each iteration is defined by
.
SP_A can be defined for each resolution.
example: (SP_A 50.0 50.0 100.0)
The default/recommended value for this particular optimizer is 20.0.
SP_alpha: The gain at each iteration is defined by
.
SP_alpha can be defined for each resolution.
example: (SP_alpha 0.602 0.602 0.602)
The default/recommended value for this particular optimizer is 1.0. Alternatively set the AutomaticParameterEstimation to "true". In that case, you do not need to specify SP_alpha. SP_alpha has no influence when AutomaticParameterEstimation is used.
SigmoidMax: The maximum of the sigmoid function ( ). Must be larger than 0. The parameter can be specified for each resolution, or for all resolutions at once.
example: (SigmoidMax 1.0)
Default/recommended value: 1.0. This parameter has no influence when AutomaticParameterEstimation is used. In that case, always a value 1.0 is used.
SigmoidMin: The minimum of the sigmoid function ( ). Must be smaller than 0. The parameter can be specified for each resolution, or for all resolutions at once.
example: (SigmoidMin -0.8)
Default value: -0.8. This parameter has no influence when AutomaticParameterEstimation is used. In that case, the value is automatically determined, depending on the images, metric etc.
SigmoidScale: The scale/width of the sigmoid function ( ). The parameter can be specified for each resolution, or for all resolutions at once.
example: (SigmoidScale 0.00001)
Default value: 1e-8. This parameter has no influence when AutomaticParameterEstimation is used. In that case, the value is automatically determined, depending on the images, metric etc.
SigmoidInitialTime: the initial time input for the sigmoid ( ). Must be larger than 0.0. The parameter can be specified for each resolution, or for all resolutions at once.
example: (SigmoidInitialTime 0.0 5.0 5.0)
Default value: 0.0. When increased, the optimization starts with smaller steps, leaving the possibility to increase the steps when necessary. If set to 0.0, the method starts with with the largest step allowed.
NumberOfGradientMeasurements: Number of gradients N to estimate the average square magnitudes of the exact gradient and the approximation error. The parameter can be specified for each resolution, or for all resolutions at once.
example: (NumberOfGradientMeasurements 10)
Default value: 0, which means that the value is automatically estimated. In principle, the more the better, but the slower. In practice N=10 is usually sufficient. But the automatic estimation achieved by N=0 also works good. The parameter has only influence when AutomaticParameterEstimation is used.
NumberOfJacobianMeasurements: The number of voxels M where the Jacobian is measured, which is used to estimate the covariance matrix. The parameter can be specified for each resolution, or for all resolutions at once.
example: (NumberOfJacobianMeasurements 5000 10000 20000)
Default value: M = max( 1000, nrofparams ), with nrofparams the number of transform parameters. This is a rather crude rule of thumb, which seems to work in practice. In principle, the more the better, but the slower. The parameter has only influence when AutomaticParameterEstimation is used.
NumberOfSamplesForExactGradient: The number of image samples used to compute the 'exact' gradient. The samples are chosen on a uniform grid. The parameter can be specified for each resolution, or for all resolutions at once.
example: (NumberOfSamplesForExactGradient 100000)
Default/recommended: 100000. This works in general. If the image is smaller, the number of samples is automatically reduced. In principle, the more the better, but the slower. The parameter has only influence when AutomaticParameterEstimation is used.
ASGDParameterEstimationMethod: The ASGD parameter estimation method used in this optimizer. The parameter can be specified for each resolution.
example: (ASGDParameterEstimationMethod "Original")
or (ASGDParameterEstimationMethod "DisplacementDistribution")
Default: Original.
MaximumDisplacementEstimationMethod: The suitable position selection method used only for displacement distribution estimation method. The parameter can be specified for each resolution.
example: (MaximumDisplacementEstimationMethod "2sigma")
or (MaximumDisplacementEstimationMethod "95percentile")
Default: 2sigma.
NoiseCompensation: Selects whether or not to use noise compensation. The parameter can be specified for each resolution, or for all resolutions at once.
example: (NoiseCompensation "true")
Default/recommended: true.
Definition at line 193 of file elxAdaptiveStochasticGradientDescent.h.
Static Public Member Functions | |
static Pointer | New () |
Static Public Member Functions inherited from itk::AdaptiveStochasticGradientDescentOptimizer | |
static Pointer | New () |
Static Public Member Functions inherited from itk::StandardGradientDescentOptimizer | |
static Pointer | New () |
Static Public Member Functions inherited from itk::GradientDescentOptimizer2 | |
static Pointer | New () |
Static Public Member Functions inherited from itk::ScaledSingleValuedNonLinearOptimizer | |
static Pointer | New () |
Static Public Member Functions inherited from elastix::BaseComponent | |
template<typename TBaseComponent > | |
static auto | AsITKBaseType (TBaseComponent *const baseComponent) -> decltype(baseComponent->GetAsITKBaseType()) |
static void | InitializeElastixExecutable () |
static bool | IsElastixLibrary () |
Protected Member Functions | |
AdaptiveStochasticGradientDescent () | |
virtual void | AddRandomPerturbation (ParametersType ¶meters, double sigma) |
virtual void | AutomaticParameterEstimation () |
virtual void | AutomaticParameterEstimationOriginal () |
virtual void | AutomaticParameterEstimationUsingDisplacementDistribution () |
virtual void | GetScaledDerivativeWithExceptionHandling (const ParametersType ¶meters, DerivativeType &derivative) |
itkStaticConstMacro (FixedImageDimension, unsigned int, FixedImageType::ImageDimension) | |
itkStaticConstMacro (MovingImageDimension, unsigned int, MovingImageType::ImageDimension) | |
virtual void | SampleGradients (const ParametersType &mu0, double perturbationSigma, double &gg, double &ee) |
~AdaptiveStochasticGradientDescent () override=default | |
Protected Member Functions inherited from itk::AdaptiveStochasticGradientDescentOptimizer | |
AdaptiveStochasticGradientDescentOptimizer () | |
void | UpdateCurrentTime () override |
~AdaptiveStochasticGradientDescentOptimizer () override=default | |
Protected Member Functions inherited from itk::StandardGradientDescentOptimizer | |
virtual double | Compute_a (double k) const |
StandardGradientDescentOptimizer () | |
~StandardGradientDescentOptimizer () override=default | |
Protected Member Functions inherited from itk::GradientDescentOptimizer2 | |
GradientDescentOptimizer2 () | |
void | PrintSelf (std::ostream &os, Indent indent) const override |
~GradientDescentOptimizer2 () override=default | |
Protected Member Functions inherited from itk::ScaledSingleValuedNonLinearOptimizer | |
virtual void | GetScaledDerivative (const ParametersType ¶meters, DerivativeType &derivative) const |
virtual MeasureType | GetScaledValue (const ParametersType ¶meters) const |
virtual void | GetScaledValueAndDerivative (const ParametersType ¶meters, MeasureType &value, DerivativeType &derivative) const |
void | PrintSelf (std::ostream &os, Indent indent) const override |
ScaledSingleValuedNonLinearOptimizer () | |
void | SetCurrentPosition (const ParametersType ¶m) override |
virtual void | SetScaledCurrentPosition (const ParametersType ¶meters) |
~ScaledSingleValuedNonLinearOptimizer () override=default | |
Protected Member Functions inherited from elastix::OptimizerBase< TElastix > | |
virtual bool | GetNewSamplesEveryIteration () const |
OptimizerBase ()=default | |
virtual void | SelectNewSamples () |
~OptimizerBase () override=default | |
Protected Member Functions inherited from elastix::BaseComponentSE< TElastix > | |
BaseComponentSE ()=default | |
~BaseComponentSE () override=default | |
Protected Member Functions inherited from elastix::BaseComponent | |
BaseComponent ()=default | |
virtual | ~BaseComponent ()=default |
Additional Inherited Members | |
Static Protected Member Functions inherited from elastix::OptimizerBase< TElastix > | |
static void | PrintSettingsVector (const SettingsVectorType &settings) |
|
protected |
Definition at line 330 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 328 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 305 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 301 of file elxAdaptiveStochasticGradientDescent.h.
using elastix::AdaptiveStochasticGradientDescent< TElastix >::ConstPointer = itk::SmartPointer<const Self> |
Definition at line 205 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 327 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 296 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 297 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 295 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Protected typedefs
Definition at line 292 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 315 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 314 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 313 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 312 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 311 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 310 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 317 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 316 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 309 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Samplers:
Definition at line 308 of file elxAdaptiveStochasticGradientDescent.h.
using elastix::AdaptiveStochasticGradientDescent< TElastix >::ITKBaseType = typename Superclass2::ITKBaseType |
Definition at line 227 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 298 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 300 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 302 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 293 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 331 of file elxAdaptiveStochasticGradientDescent.h.
using elastix::AdaptiveStochasticGradientDescent< TElastix >::Pointer = itk::SmartPointer<Self> |
Definition at line 204 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 321 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Other protected typedefs
Definition at line 320 of file elxAdaptiveStochasticGradientDescent.h.
using elastix::AdaptiveStochasticGradientDescent< TElastix >::Self = AdaptiveStochasticGradientDescent |
Standard ITK.
Definition at line 201 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 133 of file elxOptimizerBase.h.
using elastix::AdaptiveStochasticGradientDescent< TElastix >::SizeValueType = itk::SizeValueType |
Definition at line 228 of file elxAdaptiveStochasticGradientDescent.h.
using elastix::AdaptiveStochasticGradientDescent< TElastix >::Superclass1 = AdaptiveStochasticGradientDescentOptimizer |
Definition at line 202 of file elxAdaptiveStochasticGradientDescent.h.
using elastix::AdaptiveStochasticGradientDescent< TElastix >::Superclass2 = OptimizerBase<TElastix> |
Definition at line 203 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Typedefs for support of sparse Jacobians and AdvancedTransforms.
Definition at line 324 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 299 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
|
overrideprotecteddefault |
|
protectedvirtual |
Helper function that adds a random perturbation delta to the input parameters, with delta ~ sigma * N(0,I). Used by SampleGradients.
|
overridevirtual |
Reimplemented from elastix::BaseComponent.
|
overridevirtual |
Reimplemented from elastix::BaseComponent.
|
overridevirtual |
Reimplemented from elastix::BaseComponent.
|
protectedvirtual |
Select different method to estimate some reasonable values for the parameters SP_a, SP_alpha (=1), SigmoidMin, SigmoidMax (=1), and SigmoidScale.
|
protectedvirtual |
Original estimation method to get the reasonable values for the parameters SP_a, SP_alpha (=1), SigmoidMin, SigmoidMax (=1), and SigmoidScale.
|
protectedvirtual |
Estimates some reasonable values for the parameters using displacement distribution SP_a, SP_alpha (=1)
|
overridevirtual |
Reimplemented from elastix::BaseComponent.
|
overridevirtual |
Methods invoked by elastix, in which parameters can be set and progress information can be printed.
Reimplemented from elastix::BaseComponent.
elastix::AdaptiveStochasticGradientDescent< TElastix >::elxClassNameMacro | ( | "AdaptiveStochasticGradientDescent< TElastix >" | ) |
Name of this class. Use this name in the parameter file to select this specific optimizer. example: (Optimizer "AdaptiveStochasticGradientDescent")
|
virtual |
|
virtual |
Run-time type information (and related methods).
Reimplemented from itk::AdaptiveStochasticGradientDescentOptimizer.
|
virtual |
Get the MaximumNumberOfSamplingAttempts.
|
virtual |
|
protectedvirtual |
Helper function, which calls GetScaledValueAndDerivative and does some exception handling. Used by SampleGradients.
elastix::AdaptiveStochasticGradientDescent< TElastix >::ITK_DISALLOW_COPY_AND_MOVE | ( | AdaptiveStochasticGradientDescent< TElastix > | ) |
|
protected |
|
protected |
|
override |
Stop optimization and pass on exception.
|
static |
Method for creation through the object factory.
|
overridevirtual |
If automatic gain estimation is desired, then estimate SP_a, SP_alpha SigmoidScale, SigmoidMax, SigmoidMin. After that call Superclass' implementation.
Reimplemented from itk::GradientDescentOptimizer2.
|
protectedvirtual |
Measure some derivatives, exact and approximated. Returns the squared magnitude of the gradient and approximation error. Needed for the automatic parameter estimation. Gradients are measured at position mu_n, which are generated according to: mu_n - mu_0 ~ N(0, perturbationSigma^2 I ); gg = g^T g, etc.
|
virtual |
Set/Get whether automatic parameter estimation is desired. If true, make sure to set the maximum step length.
The following parameters are automatically determined: SP_a, SP_alpha (=1), SigmoidMin, SigmoidMax (=1), SigmoidScale. A usually suitable value for SP_A is 20, which is the default setting, if not specified by the user.
|
virtual |
Set the MaximumNumberOfSamplingAttempts.
|
virtual |
Set/Get maximum step length.
|
override |
Check if any scales are set, and set the UseScales flag on or off; after that call the superclass' implementation.
|
private |
Definition at line 395 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
The transform stored as AdvancedTransform
Definition at line 345 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Definition at line 397 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Definition at line 405 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Definition at line 403 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Private variables for band size estimation of covariance matrix.
Definition at line 408 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Private variables for the sampling attempts.
Definition at line 402 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Definition at line 398 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Definition at line 399 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Definition at line 409 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Some options for automatic parameter estimation.
Definition at line 340 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 341 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 342 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Definition at line 413 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
Definition at line 404 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
RandomGenerator for AddRandomPerturbation.
Definition at line 348 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Variable to store the automatically determined settings for each resolution.
Definition at line 337 of file elxAdaptiveStochasticGradientDescent.h.
|
protected |
Definition at line 350 of file elxAdaptiveStochasticGradientDescent.h.
|
private |
The flag of using noise compensation.
Definition at line 412 of file elxAdaptiveStochasticGradientDescent.h.
Generated on 2024-07-17 for elastix by 1.11.0 (9b424b03c9833626cd435af22a444888fbbb192d) |