alternatives to gradient-based optimization algorithms like lsqnonlin that handle at least O(10) parameters?


SA-W on 24 Feb 2023


Commented: SA-W on 6 Mar 2023

I am fitting parameters of a PDE to experimental data utilizing the function lsqnonlin from the optimization toolbox.

I observed that I can successfully optimize 10,...,20 parameters but not more. My objective is, however, to optimize 50 to 100 parameters.

Based on my experience with lsqnonlin (and fmincon and friends,...), it seems that this class of optimization algorithms can handle a small number of parameters (10,...,20) well, but is no longer appropriate if there are many more parameters.

Fast convergence or computation time are not important to me.

I come from an engineering background and do not have deep knowledge of other classes of optimization algorithms. That said, I would appreciate it if someone could give me some keywords or recommendations regarding alternative optimization algorithms that are designed for handling a larger number of parameters.

Thank you!

18 comments

SA-W on 24 Feb 2023

How have you assessed whether they are "wrong"? What is the final objective function value, and how does it compare to the objective value of the "right" parameters?

The objective function value is indeed of the same order as the objective value corresponding to the right parameters. However, the parameters are clearly wrong, which I observed by plotting the function.

The parameters that I am optimizing (the function values) have to represent a convex function in 1d. However, the 20 parameters found by lsqnonlin do not define a convex function, but rather a function that is even decreasing over certain ranges and that, for whatever reason, is also a minimum of the objective function.

Let me ask two questions:

(i) is there a recommended ratio (just roughly) between number of parameters and equations? Currently, my data vectors have 1500 entries and I optimise 20 parameters.

(ii) I am considering enforcing convexity by having the following constraints on the parameters p(i):

p(i-1) - 2*p(i) + p(i+1) > 0 for all parameters

Is there a way to translate these constraints for lsqnonlin, for instance by adding a penalty term to

f = ||r||^2 + penalty term?

Matt J on 26 Feb 2023

Edited: Matt J on 26 Feb 2023

You could do that, or even have c(i) = (p(i-1)-2p(i)+p(i+1))^200. However, with larger exponents, the gradient of the cost function becomes small over a wider neighborhood of c=0. Smaller gradients mean slower convergence, and also a chance that the OptimalityTolerance triggers prematurely. Conversely, with smaller exponents (but still greater than 1), the gradient becomes less smooth. So, there is a trade-off to be reckoned with.

SA-W on 6 Mar 2023

Edited: SA-W on 6 Mar 2023

@Matt J

I implemented

r=[weight1*r; weight2*constraint_violation];
f = ||r||^2 

with lsqnonlin and found that

weight2 = 1e4 % approximately

is necessary for the inequality constraints to be taken into account at all at intermediate iterations. With weight2 well below 1e4, the constraint_violation is not weighted sufficiently; with weight2 well above 1e4, it dominates the residual r.
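For readers who want to try this, the augmented residual can be sketched roughly as follows. This is only an illustration under stated assumptions: the toy data, modelResidual, and the helper augmentedResidual are hypothetical stand-ins for the expensive PDE residual, the function values are assumed to lie on a uniform grid, and only violated second differences are penalized.

```matlab
% Toy stand-in for the expensive PDE residual (hypothetical): fit the
% function values p to wiggly samples of a convex function.
x = linspace(-1, 1, 20).';
y = x.^2 + 0.05*sin(40*x);          % deterministic perturbation
modelResidual = @(p) p - y;

weight1 = 1;
weight2 = 1e4;                      % value reported in the thread; problem dependent
fun = @(p) augmentedResidual(p, modelResidual, weight1, weight2);

p0 = zeros(size(x));
opts = optimoptions('lsqnonlin', 'Display', 'off');
p = lsqnonlin(fun, p0, [], [], opts);

function r = augmentedResidual(p, modelResidual, weight1, weight2)
    % Second differences p(i-1) - 2*p(i) + p(i+1) at the interior points.
    d2 = diff(p(:), 2);
    % Only negative second differences violate convexity.
    violation = min(d2, 0);
    % Stack data residual and weighted violation; lsqnonlin squares and sums.
    r = [weight1*modelResidual(p); weight2*violation];
end
```

Since lsqnonlin minimizes the sum of squares of r, this corresponds to a quadratic penalty on the violation, so the weight has to be tuned per problem as described above.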

If I implement the above inequality constraints in fmincon and multiply A*x<=b by 1e4 (this multiplication is necessary for fmincon to pay attention to the constraints), fmincon gives a better solution than lsqnonlin. By better, I mean that the solution is almost exactly equal to the exact solution. The lsqnonlin solution fulfills the constraints in A; however, the solution is not as accurate.
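The linear inequality form of the convexity constraints can be assembled from a second-difference matrix, for example as in the sketch below. It assumes uniformly spaced function values; n, D, scale, pConvex, and pBad are illustrative names, and the fmincon call is left as a comment because the objective is problem specific.

```matlab
n = 20;
D = diff(eye(n), 2);        % (n-2)-by-n; row i is [.. 1 -2 1 ..], so D*p = diff(p,2)
scale = 1e4;                % scaling reported in the thread
A = -scale*D;               % convexity D*p >= 0 written as A*p <= b
b = zeros(n-2, 1);

% Quick feasibility check on two test vectors.
pConvex = linspace(-1, 1, n).'.^2;   % samples of x^2: convex
pBad = -pConvex;                     % samples of -x^2: concave
isFeasibleConvex = all(A*pConvex <= b);
isFeasibleBad = all(A*pBad <= b);

% p = fmincon(objective, p0, A, b);  % objective and p0 are problem specific
```

Because b is zero here, multiplying both sides by 1e4 leaves the feasible set unchanged. The scaling presumably matters only because the solver's ConstraintTolerance is an absolute tolerance on the constraint violation.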

However, fmincon requires two to three times as many iterations as lsqnonlin to find the solution. That said, I would like to stick with lsqnonlin because the evaluation of the objective function is quite expensive in my case.

I read the documentation on the interior-point algorithm. The way fmincon incorporates the constraints is based on a merit function, which is, of course, different from just expanding the residual vector as I did.

Admittedly, I do not have enough background in optimization. Can you give me another recommendation (which is closer to the way fmincon incorporates the constraints) to implement my constraints in lsqnonlin?

Thank you!
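One option that stays within lsqnonlin's bound constraints is a reparameterization. It is not proposed in the thread and is not how fmincon's merit function works; it is only a sketch under assumptions. The idea is to optimize q = [p(1); p(2)-p(1); second differences] instead of p, so that convexity on a uniform grid becomes the simple lower bound q(3:end) >= 0 (non-strict convexity). The toy residual and the helper convexBasis are hypothetical, and the change of variables can worsen the conditioning for large n.

```matlab
% Toy stand-in for the expensive PDE residual (hypothetical).
n = 20;
x = linspace(-1, 1, n).';
y = x.^2 + 0.05*sin(40*x);
modelResidual = @(p) p - y;

T = convexBasis(n);                       % p = T*q
lb = [-Inf; -Inf; zeros(n-2, 1)];         % second differences q(3:end) >= 0
q0 = [y(1); y(2) - y(1); zeros(n-2, 1)];  % straight line through the first two samples
opts = optimoptions('lsqnonlin', 'Display', 'off');
q = lsqnonlin(@(q) modelResidual(T*q), q0, lb, [], opts);
p = T*q;                                  % convex by construction

function T = convexBasis(n)
    % Column 1 carries p(1). Column j >= 2 is a ramp that starts at row j:
    % T(k,j) = max(k-j+1, 0), so that diff(T*q, 2) equals q(3:end).
    [k, j] = ndgrid(1:n, 2:n);
    T = [ones(n, 1), max(k - j + 1, 0)];
end
```

The Jacobian with respect to q is the Jacobian with respect to p times T, so the number of expensive model evaluations per finite-difference Jacobian stays the same.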


Accepted Answer

Matt J on 24 Feb 2023


Edited: Matt J on 24 Feb 2023

it seems that this class of optimization algorithms can handle a small number of parameters (10,...,20) well, but is no longer appropriate if there are many more parameters.

I am currently using lsqnonlin in optimization problems with 11796480 variables. It converges fine, though in my case the line search operations really do make it much slower than I would like. In any case, 20 variables is definitely not the upper limit.

I am fitting parameters of a PDE to experimental data utilizing the function lsqnonlin from the optimization toolbox.

Make sure you are following the guidelines here,

https://www.mathworks.com/help/optim/ug/optimizing-a-simulation-or-ordinary-differential-equation.html

9 comments

Matt J on 24 Feb 2023

The documentation is obviously non-uniform!

SA-W on 1 Mar 2023

If someone comes across our discussion here: the algorithms in lsqnonlin can also recover from NaN.


More Answers (1)

Matt J on 26 Feb 2023


Edited: Matt J on 26 Feb 2023

The parameters that I am optimizing (the function values) have to represent a convex function in 1d.

Another strategy that I would suggest is to first fit a constrained curve to your function samples using this FEX download,

https://www.mathworks.com/matlabcentral/fileexchange/24443-slm-shape-language-modeling

This tool will allow you both to constrain the fit to be convex (using the 'ConcaveUp' option) and to compute the derivatives of the fit. Once you have the curve fit and its derivatives, you can substitute them into your ODE to obtain conventional equations in the original set of unknown parameters. You can then solve these equations using lsqnonlin, or even a linear solver, depending on the form of your equations.

7 comments

Matt J on 6 Mar 2023

I don't know enough about finite element analysis to know why you cannot fit a spline to a finite element field.

SA-W on 6 Mar 2023

Would you support me further regarding the issue in my new comment (in the comments under the question)?
