I've been reading in the litterature that quasi newton methods such as DFP and BFGS have poor performance on ill-conditionned problems, but I don't understand the reason of it. I have been trying to use these methods on a quadratic problem that is ill-conditionned and it doesn't converge in p+1 iterations (it is one of quasi newton methods properties for quadratic problems), but a little more. Why is that ? Thank you for your help.
Why do quasi newton methods such as DFP and BFGS have poor performance on ill-conditionned problem, even if quadratic
1k Views Asked by Kathryn Schutte At
1
There are 1 best solutions below
Related Questions in OPTIMIZATION
- Does compiler optimize operation on const variable and literal const number?
- Optimizing for Social Leaderboards
- 3D FFT with data larger than cache
- Optimum directory structure for large number of files to display on a page
- How to make faster queries on my mysql table?
- Xib taking long time (>1s) to load. UIFont cache seems to blame
- How to speed up string comparisons in an array with a for loop?
- How to load all symbols from shared library on start up?
- Cython speed vs numpy
- Improve Speed of Piecewise Function in MATLAB
- How to check that all values are equal in array using recursion?
- PHP split string into known tokens and remaining words add to single-worded array
- Python: why is my O(n) slowing down as it progresses?
- Hint indexes to mysql on Join
- Error When Compiler Optimizations are on
Related Questions in DATA-SCIENCE
- How access a downloaded library that is not showing up?
- Convert groupby.DataFrameGroupBy object to a dictionary in Python
- How can I detect keypresses using accelerometer/gyroscope data?
- Multiple Linear Regression handle NA
- Input/output error while copying from hadoop file system to local
- Removing duplicated values with missing values in a dataframe
- R editing dataframe based on column value
- PredictionIO Universal Recommender
- Pandas : TypeError: float() argument must be a string or a number
- Text classification algorithms which are not Naive?
- adding row generated inside a loop to a new data frame
- How to read multiple line elements in Spark , where each record of log is starting with yyyy-MM-dd format and each record of log is multi-line?
- Pandas merge duplicate DataFrame columns preserving column names
- How to plot multiple graphs in one chart using pygal?
- Removing non-English words from text using Python
Related Questions in QUADRATIC
- Void function (c++)
- What is wrong with my code for quadratic formula
- Program not outputting to file? C++
- Quadratic Primes
- Constraining norms with inequalities
- Building a circle with quadratic curves in canvas
- How to insert random ints into hashing table?
- Finding Variables from a String
- Quadratic form in R
- Efficient way of calculating quadratic forms: avoid for loops?
- cubic and quadratic graphs?
- Java -- QuadCurve2D get Y by X
- Quadratic Solver giving output NaN
- Quadratic Function with Complex Numbers
- Performance issue with setting quadratic objective in CPLEX
Related Questions in HESSIAN
- Determinant of hessian matrix of a grayscale image is too small in matlab
- Spring RPC with hessian over HTTP works fine on maven tomcat plugin, but return http 500 error on Tomcat server
- Is there any HessianKit Sample project?
- Avoiding WARNING: Hessian/Burlap: <class> is an unknown class in WebappClassLoader
- Hessian with JRockit Compatibility
- HessianKit linking failure
- parsing error when sending double from android to php with hessian
- Which spring version is right when work with Hessian(3.1.3)?
- Hessian (Java) deserialization of class which is not in the classpath does not raise an error
- HessianConnectionException: (HTTP) 500 error when using Hessian 4.0.7 & Spring 3.1.1
- Hessian, Adding a header
- How to compute Hessian of the loss w.r.t. the parameters in PyTorch using autograd.grad
- Problem trying to get the Gradient and Hessian in Pyomo
- Diagonal of the Hessian with Tensorflow
- Why do quasi newton methods such as DFP and BFGS have poor performance on ill-conditionned problem, even if quadratic
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Ill-conditioning is a general problem for first-order optimization algorithms. Basically there are two main aspects with ill-conditioning:
Standard Newton method, which is a second-order optimization algorithm, in theory, can tackle both of these problems: first, it converges much faster, hence, there is a much lower roundoff error, and second, it does not have problems with the stretched shape since now it takes into account the gradient scaling to move in a proper direction.
However, solving it involves the operation of the inverse of the Hessian, which, in the case of large condition numbers, may result in the corresponding small eigenvalues blowing up, leading to numerical instability when implemented in a naive way. This issue can still be resolved by utilizing a proper direct solver or iterative solver with pre-conditioning
Finally, quasi-Newton methods such as DFP and BFGS try to approximate the inverse Hessian iteratively. They may be more robust in handling roundoff errors and also may be faster in convergence than first-order methods, but because of this approximation, they may end up performing worse than the standard Newton method.