Abstract
The scaling and squaring method for the matrix exponential is based on the approximation e(A) approximate to (r(m)(2(-s) A))(2s), where r(m)(x) is the [m/m] Pade approximant to e(x) and the integers m and s are to be chosen. Several authors have identified a weakness of existing scaling and squaring algorithms termed overscaling, in which a value of s much larger than necessary is chosen, causing a loss of accuracy in floating point arithmetic. Building on the scaling and squaring algorithm of Higham [SIAM J. Matrix Anal. Appl., 26 (2005), pp. 1179-1193], which is used by the MATLAB function expm, we derive a new algorithm that alleviates the overscaling problem. Two key ideas are employed. The first, specific to triangular matrices, is to compute the diagonal elements in the squaring phase as exponentials instead of from powers of r(m). The second idea is to base the backward error analysis that underlies the algorithm on members of the sequence {parallel to A(k)parallel to(1/k)} instead of parallel to A parallel to, since for nonnormal matrices it is possible that parallel to A(k)parallel to(1/k) is much smaller than parallel to A parallel to, and indeed this is likely when overscaling occurs in existing algorithms. The terms parallel to Ak parallel to(1/k) are estimated without computing powers of A by using a matrix 1-norm estimator in conjunction with a bound of the form parallel to A(k)parallel to(1/k) <= max (parallel to A(p)parallel to(1/p), parallel to A(q)parallel to(1/q)) that holds for certain fixed p and q less than k. The improvements to the truncation error bounds have to be balanced by the potential for a large parallel to A parallel to to cause inaccurate evaluation of r(m) in floating point arithmetic. We employ rigorous error bounds along with some heuristics to ensure that rounding errors are kept under control. Our numerical experiments show that the new algorithm generally provides accuracy at least as good as the existing algorithm of Higham at no higher cost, while for matrices that are triangular or cause overscaling it usually yields significant improvements in accuracy, cost, or both.