28 Pages

Numerical Solution of Scalar Equations

Course: MATH 5485, Fall 2009
School: UCF
Rating:
 
 
 
 
 

Word Count: 9219

Document Preview

Lecture AIMS Notes 2006 Peter J. Olver 2. Numerical Solution of Scalar Equations Most numerical solution methods are based on some form of iteration. The basic idea is that repeated application of the algorithm will produce closer and closer approximations to the desired solution. To analyze such algorithms, our first task is to understand general iterative processes. 2.1. Iteration of Functions. Iteration,...

Register Now

Unformatted Document Excerpt

Coursehero >> Florida >> UCF >> MATH 5485

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Lecture AIMS Notes 2006 Peter J. Olver 2. Numerical Solution of Scalar Equations Most numerical solution methods are based on some form of iteration. The basic idea is that repeated application of the algorithm will produce closer and closer approximations to the desired solution. To analyze such algorithms, our first task is to understand general iterative processes. 2.1. Iteration of Functions. Iteration, meaning repeated application of a function, can be viewed as a discrete dynamical system in which the continuous time variable has been "quantized" to assume integer values. Even iterating a very simple quadratic scalar function can lead to an amazing variety of dynamical phenomena, including multiply-periodic solutions and genuine chaos. Nonlinear iterative systems arise not just in mathematics, but also underlie the growth and decay of biological populations, predator-prey interactions, spread of communicable diseases such as Aids, and host of other natural phenomena. Moreover, many numerical solution methods -- for systems of algebraic equations, ordinary differential equations, partial differential equations, and so on -- rely on iteration, and so the theory underlies the analysis of convergence and efficiency of such numerical approximation schemes. In general, an iterative system has the form u(k+1) = g(u(k) ), (2.1) where g: R n R n is a real vector-valued function. (One can similarly treat iteration of complex-valued functions g: C n C n , but, for simplicity, we only deal with real systems here.) A solution is a discrete collection of points u(k) R n , in which the index k = 0, 1, 2, 3, . . . takes on non-negative integer values. Once we specify the initial iterate, u(0) = c, (2.2) then the resulting solution to the discrete dynamical system (2.1) is easily computed: u(1) = g(u(0) ) = g(c), u(2) = g(u(1) ) = g(g(c)), u(3) = g(u(2) ) = g(g(g(c))), . . . The superscripts on u(k) refer to the iteration number, and do not denote derivatives. 3/15/06 7 c 2006 Peter J. Olver and so on. Thus, unlike continuous dynamical systems, the existence and uniqueness of solutions is not an issue. As long as each successive iterate u(k) lies in the domain of definition of g one merely repeats the process to produce the solution, k times u(k) = g g (c), k = 0, 1, 2, . . . , (2.3) which is obtained by composing the function g with itself a total of k times. In other words, the solution to a discrete dynamical system corresponds to repeatedly pushing the g key on your calculator. For example, entering 0 and then repeatedly hitting the cos key corresponds to solving the iterative system u(k+1) = cos u(k) , u(0) = 0. (2.4) The first 10 iterates are displayed in the following table: k u(k) 0 1 2 3 4 5 6 7 8 9 0 1 .540302 .857553 .65429 .79348 .701369 .76396 .722102 .750418 For simplicity, we shall always assume that the vector-valued function g: R n R n is defined on all of R n ; otherwise, we must always be careful that the successive iterates u(k) never leave its domain of definition, thereby causing the iteration to break down. To avoid technical complications, we will also assume that g is at least continuous; later results rely on additional smoothness requirements, e.g., continuity of its first and second order partial derivatives. While the solution to a discrete dynamical system is essentially trivial, understanding its behavior is definitely not. Sometimes the solution converges to a particular value -- the key requirement for numerical solution methods. Sometimes it goes off to , or, more precisely, the norms of the iterates are unbounded: u(k) as k . Sometimes the solution repeats itself after a while. And sometimes the iterates behave in a seemingly random, chaotic manner -- all depending on the function g and, at times, the initial condition c. Although all of these cases may arise in real-world applications, we shall mostly concentrate upon understanding convergence. Definition 2.1. A fixed point or equilibrium of a discrete dynamical system (2.1) is a vector u R n such that g(u ) = u . (2.5) We easily see that every fixed point provides a constant solution to the discrete dynamical system, namely u(k) = u for all k. Moreover, it is not hard to prove that any convergent solution necessarily converges to a fixed point. Proposition 2.2. If a solution to a discrete dynamical system converges, k lim u(k) = u , then the limit u is a fixed point. 3/15/06 8 c 2006 Peter J. Olver Proof : This is a simple consequence of the continuity of g. We have u = lim u(k+1) = lim g(u(k) ) = g k k k lim u(k) = g(u ), Q.E.D. the last two equalities following from the continuity of g. For example, continuing the cosine iteration (2.4), we find that the iterates gradually converge to the value u .739085, which is the unique solution to the fixed point equation cos u = u. Later we will see how to rigorously prove this observed behavior. Of course, not every solution to a discrete dynamical system will necessarily converge, but Proposition 2.2 says that if it does, then it must converge to a fixed point. Thus, a key goal is to understand when a solution converges, and, if so, to which fixed point -- if there is more than one. Fixed points are divided into three classes: asymptotically stable, with the property that all nearby solutions converge to it, stable, with the property that all nearby solutions stay nearby, and unstable, almost all of whose nearby solutions diverge away from the fixed point. Thus, from a practical standpoint, convergence of the iterates of a discrete dynamical system requires asymptotic stability of the fixed point. Examples will appear in abundance in the following sections. Scalar Functions As always, the first step is to thoroughly understand the scalar case, and so we begin with a discrete dynamical system u(k+1) = g(u(k) ), u(0) = c, (2.6) in which g: R R is a continuous, scalar-valued function. As noted above, we will assume, for simplicity, that g is defined everywhere, and so we do not need to worry about whether the iterates u(0) , u(1) , u(2) , . . . are all well-defined. As usual, to study systems one begins with an in-depth analysis of the scalar version. Consider the iterative equation u(k+1) = a u(k) , The general solution to (2.7) is easily found: u(1) = a u(0) = a c, and, in general, u(k) = ak c. (2.8) If the initial condition is a = 0, then the solution u(k) 0 is constant. Therefore, 0 is a fixed point or equilibrium solution for the iterative system. 3/15/06 9 c 2006 Peter J. Olver u(0) = c. (2.7) u(2) = a u(1) = a2 c, u(3) = a u(2) = a3 c, 1 0.75 0.5 0.25 5 -0.25 -0.5 -0.75 -1 10 15 20 25 30 1 0.75 0.5 0.25 5 -0.25 -0.5 -0.75 -1 10 15 20 25 30 1 0.75 0.5 0.25 5 -0.25 -0.5 -0.75 -1 10 15 20 25 30 0<a<1 1 0.75 0.5 0.25 5 -0.25 -0.5 -0.75 -1 10 15 20 25 30 -2.5 -5 -7.5 -10 10 7.5 5 2.5 5 -1 < a < 0 a=1 10 7.5 5 2.5 10 15 20 25 30 -2.5 -5 -7.5 -10 5 10 15 20 25 30 a = -1 Figure 2.1. 1<a a < -1 One Dimensional Real Linear Iterative Systems. Example 2.3. Banks add interest to a savings account at discrete time intervals. For example, if the bank offers 5% interest compounded yearly, this means that the account balance will increase by 5% each year. Thus, assuming no deposits or withdrawals, the balance u(k) after k years will satisfy the iterative equation (2.7) with a = 1 + r where r = .05 is the interest rate, and the 1 indicates that all the money remains in the account. Thus, after k years, your account balance is u(k) = (1 + r)k c, where c = u(0) (2.9) is your initial deposit. For example, if c = $ 1, 000, after 1 year your account has u (1) = $ 1, 050, after 10 years u(10) = $ 1, 628.89, after 50 years u(50) = $ 11, 467.40, and after 200 years u(200) = $ 17, 292, 580.82. When the interest is compounded monthly, the rate is still quoted on a yearly basis, 1 ^ and so you receive 12 of the interest each month. If u(k) denotes the balance after k 1 months, then, after n years, the account balance is u(12 n) = 1 + 12 r 12 n c. Thus, ^ when the interest rate of 5% is compounded monthly, your account balance is u (12) = ^ (120) (600) $ 1, 051.16 after 1 year, u ^ = $ 1, 647.01 after 10 years, u ^ = $ 12, 119.38 after 50 years, and u(2400) = $ 21, 573, 572.66 dollars after 200 years. So, if you wait sufficiently ^ long, compounding will have a dramatic effect. Similarly, daily compounding replaces 12 by 365.25, the number of days in a year. After 200 years, the balance is $ 22, 011, 396.03. Let us analyze the solutions of scalar iterative equations, starting with the case when a R is a real constant. Aside from the equilibrium solution u(k) 0, the iterates exhibit five qualitatively different behaviors, depending on the size of the coefficient a. (a) If a = 0, the solution immediately becomes zero, and stays there, so u (k) = 0 for all k 1. (b) If 0 < a < 1, then the solution is of one sign, and tends monotonically to zero, so u(k) 0 as k . 3/15/06 10 c 2006 Peter J. Olver (c) If -1 < a < 0, then the solution tends to zero: u(k) 0 as k . Successive iterates have alternating signs. (d) If a = 1, the solution is constant: u(k) = a, for all k 0. (e) If a = -1, the solution switches back and forth between two values; u (k) = (-1)k c. (f ) If 1 < a < , then the iterates u(k) become unbounded. If c > 0, they go monotonically to + ; if c < 0, to - . (g) If - < a < -1, then the iterates u(k) also become unbounded, with alternating signs. In Figure 2.1 we exhibit representative scatter plots for the nontrivial cases (b g). The horizontal axis indicates the index k and the vertical axis the solution value u. Each dot in the scatter plot represents an iterate u(k) . To describe the different scenarios, we adopt a terminology that already appeared in the continuous realm. In the first three cases, the fixed point u = 0 is said to be globally asymptotically stable since all solutions tend to 0 as k . In cases (d) and (e), the zero solution is stable, since solutions with nearby initial data, | c | 1, remain nearby. In the final two cases, the zero solution is unstable; any nonzero initial data a = 0 -- no matter how small -- will give rise to a solution that eventually goes arbitrarily far away from equilibrium. Let us also analyze complex scalar iterative systems. The coefficient a and the initial datum c in (2.7) are allowed to be complex numbers. The solution is the same, (2.8), but now we need to know what happens when we raise a complex number a to a high power. The secret is to write a = r e i in polar form, where r = | a | is its modulus and = ph a its angle or phase. Then ak = rk e i k . Since | e i k | = 1, we have | ak | = | a |k , and so the solutions (2.8) have modulus | u(k) | = | ak a | = | a |k | a |. As a result, u(k) will remain bounded if and only if | a | 1, and will tend to zero as k if and only if | a | < 1. We have thus established the basic stability criteria for scalar, linear systems. u (k+1) Theorem 2.4. The zero solution to a (real or complex) scalar iterative system = a u(k) is (a) asymptotically stable if and only if | a | < 1, (b) stable if and only if | a | 1, (c) unstable if and only if | a | > 1. Nonlinear Scalar Iteration The simplest "nonlinear" case is that of an affine function g(u) = a u + b, leading to an affine discrete dynamical system u(k+1) = a u(k) + b. The only fixed point is the solution to u = g(u ) = a u + b, 3/15/06 11 namely, u = b . 1-a c 2006 (2.10) (2.11) (2.12) Peter J. Olver u3 u2 u1 Figure 2.2. Fixed Points. The formula for u requires that a = 1, and, indeed, the case a = 1 has no fixed point, as the reader can easily confirm. Since we already know the value of u , we can readily analyze the differences e(k) = u(k) - u , (2.13) between successive iterates and the fixed point. Observe that, the smaller e (k) is, the closer u(k) is to the desired fixed point. In many applications, the iterate u(k) is viewed as an approximation to the fixed point u , and so e(k) is interpreted as the error in the k th iterate. Subtracting the fixed point equation (2.12) from the iteration equation (2.11), we find u(k+1) - u = a (u(k) - u ). Therefore the errors e(k) are related by a linear iteration e(k+1) = a e(k) , and hence e(k) = ak e(0) . (2.14) Therefore, the solutions to this scalar linear iteration converge: e(k) - 0 and hence u(k) - u , if and only if | a | < 1. This is the criterion for asymptotic stability of the fixed point, or, equivalently, convergence of the affine iterative system (2.11). The magnitude of a determines the rate of convergence, and the closer it is to 0, the faster the iterates approach the fixed point. Example 2.5. The affine function g(u) = leads to the iterative scheme u(k+1) = 3/15/06 12 1 4 1 4 u+2 u(k) + 2. c 2006 Peter J. Olver Figure 2.3. Tangent Line Approximation. Starting with the initial condition u(0) = 0, the ensuing values are k u (k) 1 2.0 2 2.5 3 2.625 4 2.6562 5 2.6641 6 2.6660 7 2.6665 8 3 8 2.6666 to 4 decimal Thus, after 8 iterations, the iterates have produced the fixed point u = places. The rate of convergence is 1 , and indeed 4 | e(k) | = | u(k) - u | = 1 k 4 | u(0) - u | = 8 3 1 k 4 - 0 as k - . Let us now turn to the fully nonlinear case. First note that the fixed points of g(u) correspond to the intersections of its graph with the graph of the function i(u) = u. For instance Figure 2.2 shows the graph of a function that has 3 fixed points, labeled u 1 , u2 , u3 . In general, near any point in its domain, a (smooth) nonlinear function can be well approximated by its tangent line, which repre4sents the graph of an affine function; see Figure 2.3. Therefore, if we are close to a fixed point u , then we might expect the iterative system based on the nonlinear function g(u) to behave very much like that of its affine tangent line approximation. And, indeed, this intuition turns out to be essentially correct. This result forms our first concrete example of linearization, in which the analysis of a nonlinear system is based on its linear (or, more precisely, affine) approximation. The explicit formula for the tangent line to g(u) near the fixed point u = u = g(u ) is g(u) g(u ) + g (u )(u - u ) a u + b, (2.15) where a = g (u ), b = g(u ) - g (u ) u = 1 - g (u ) u . Note that u = b /(1 - a) remains a fixed point for the affine approximation: a u + b = u . According to the preceding discussion, the convergence of the iterates for the affine approximation is governed by the size of the coefficient a = g (u ). This observation inspires the basic stability criterion for fixed points of scalar iterative systems. 3/15/06 13 c 2006 Peter J. Olver Theorem 2.6. Let g(u) be a continuously differentiable scalar function. Suppose u = g(u ) is a fixed point. If | g (u ) | < 1, then u is an asymptotically stable fixed point, and hence any sequence of iterates u(k) which starts out sufficiently close to u will converge to u . On the other hand, if | g (u ) | > 1, then u is an unstable fixed point, and the only iterates which converge to it are those that land exactly on it, i.e., u (k) = u for some k 0. Proof : The goal is to prove that the errors e(k) = u(k) - u between the iterates and the fixed point tend to 0 as k . To this end, we try to estimate e(k+1) in terms of e(k) . According to (2.6) and the Mean Value Theorem from calculus, e(k+1) = u(k+1) - u = g(u(k) ) - g(u ) = g (v) (u(k) - u ) = g (v) e(k) , (2.16) for some v lying between u(k) and u . By continuity, if | g (u ) | < 1 at the fixed point, then we can choose > 0 and | g (u ) | < < 1 such that the estimate | g (v) | < 1 whenever |v - u | < (2.17) holds in a (perhaps small) interval surrounding the fixed point. Suppose | e(k) | = | u(k) - u | < . Then the point v in (2.16), which is closer to u than u(k) , satisfies (2.17). Therefore, | u(k+1) - u | | u(k) - u |, and hence | e(k+1) | | e(k) |. (2.18) In particular, since < 1, we have | u(k+1) - u | < , and hence the subsequent iterate u(k+1) also lies in the interval where (2.17) holds. Repeating the argument, we conclude that, provided the initial iterate satisfies | e(0) | = | u(0) - u | < , the subsequent errors are bounded by e(k) k e(0) , and hence e(k) = | u(k) - u | - 0 as k , Q.E.D. which completes the proof of the theorem in the stable case. The proof in unstable case is left as an exercise for the reader. Remark : The constant governs the rate of convergence of the iterates to the fixed point. The closer the iterates are to the fixed point, the smaller we can choose in (2.17), and hence the closer we can choose to | g (u ) |. Thus, roughly speaking, | g (u ) | governs the speed of convergence, once the iterates get close to the fixed point. This observation will be developed more fully in the following subsection. Remark : The cases when g (u ) = 1 are not covered by the theorem. For a linear system, such fixed points are stable, but not asymptotically stable. For nonlinear systems, more detailed knowledge of the nonlinear terms is required in order to resolve the status -- stable or unstable -- of the fixed point. Despite their importance in certain applications, we will not try to analyze such borderline cases any further here. 3/15/06 14 c 2006 Peter J. Olver m u Figure 2.4. Planetary Orbit. Example 2.7. Given constants , m, the trigonometric equation u = m + sin u (2.19) is known as Kepler's equation. It arises in the study of planetary motion, in which 0 < < 1 represents the eccentricity of an elliptical planetary orbit, u is the eccentric anomaly, defined as the angle formed at the center of the ellipse by the planet and the major axis, and m = 2 t / T is its mean anomaly, which is the time, measured in units of T /(2 ) where T is the period of the orbit, i.e., the length of the planet's year, since perihelion or point of closest approach to the sun; see Figure 2.4. The solutions to Kepler's equation are the fixed points of the discrete dynamical system based on the function g(u) = m + sin u. Note that | g (u) | = | cos u | = | | < 1, (2.20) which automatically implies that the as yet unknown fixed point is stable. Indeed, condition (2.20) is enough to prove the existence of a unique stable fixed point; see the remark after Theorem 9.7. In the particular case m = = 1 , the result of iterating 2 1 1 u(k+1) = 2 + 2 sin u(k) starting with u(0) = 0 is k u(k) 1 .5 2 .7397 3 .8370 4 .8713 5 .8826 6 .8862 7 .8873 8 .8877 9 .8878 After 13 iterations, we have converged sufficiently close to the solution (fixed point) u = .887862 to have computed its value to 6 decimal places. 3/15/06 15 c 2006 Peter J. Olver L+ (u) g(u) L- (u) u u Figure 2.5. Graph of a Contraction. Inspection of the proof of Theorem 2.6 reveals that we never really used the differentiability of g, except to verify the inequality | g(u) - g(u ) | | u - u | for some fixed < 1. (2.21) A function that satisfies (2.21) for all nearby u is called a contraction at the point u . Any function g(u) whose graph lies between the two lines L (u) = g(u ) (u - u ) for some < 1, for all u sufficiently close to u , i.e., such that | u - u | < for some > 0, defines a contraction, and hence fixed point iteration starting with | u(0) - u | < will converge to u ; see Figure 2.5. In particular, any function that is differentiable at u with | g (u ) | < 1 defines a contraction at u . Example 2.8. The simplest truly nonlinear example is a quadratic polynomial. The most important case is the so-called logistic map g(u) = u(1 - u), (2.22) where = 0 is a fixed non-zero parameter. (The case = 0 is completely trivial. Why?) In fact, an elementary change of variables can make any quadratic iterative system into one involving a logistic map. The fixed points of the logistic map are the solutions to the quadratic equation u = u(1 - u), u1 = 0, 3/15/06 16 or u2 - u + 1 = 0. u2 = 1 - 1 . c 2006 Peter J. Olver Using the quadratic formula, we conclude that g(u) has two fixed points: 1 1 1 0.8 0.8 0.8 0.6 0.6 0.6 0.4 0.4 0.4 0.2 0.2 0.2 20 40 60 80 100 20 40 60 80 100 20 40 60 80 100 = 1.0 1 1 0.8 0.8 = 2.0 1 0.8 = 3.0 0.6 0.6 0.6 0.4 0.4 0.4 0.2 0.2 0.2 20 40 60 80 100 20 40 60 80 100 20 40 60 80 100 = 3.4 1 1 0.8 0.8 = 3.5 1 0.8 = 3.55 0.6 0.6 0.6 0.4 0.4 0.4 0.2 0.2 0.2 20 40 60 80 100 20 40 60 80 100 20 40 60 80 100 = 3.6 Figure 2.6. = 3.7 Logistic Iterates. = 3.8 Let us apply Theorem 2.6 to determine their stability. The derivative is g (u) = - 2 u, and so g (u1 ) = , g (u2 ) = 2 - . Therefore, if | | < 1, the first fixed point is stable, while if 1 < < 3, the second fixed point is stable. For < -1 or > 3 neither fixed point is stable, and we expect the iterates to not converge at all. Numerical experiments with this example show that it is the source of an amazingly diverse range of behavior, depending upon the value of the parameter . In the accompanying Figure 2.6, we display the results of iteration starting with initial point u (0) = .5 for several different values of ; in each plot, the horizontal axis indicates the iterate number k and the vertical axis the iterate valoue u(k) for k = 0, . . . , 100. As expected from Theorem 2.6, the iterates converge to one of the fixed points in the range -1 < < 3, except when = 1. For a little bit larger than 1 = 3, the iterates do not converge to a fixed point. But it does not take long for them to settle down, switching back and forth between two particular values. This behavior indicates the exitence of a (stable) period 2 orbit for the discrete dynamical system, in accordance with the following definition. 3/15/06 17 c 2006 Peter J. Olver Definition 2.9. A period k orbit of a discrete dynamical system is a solution that satisfies u(n+k) = u(n) for all n = 0, 1, 2, . . . . The (minimal ) period is the smallest positive value of k for which this condition holds. Thus, a fixed point is a period 1 orbit. A period 2 orbit satisfies u(0) = u(2) = u(4) = u(0) = u(1) = u(2) = and u(1) = u(3) = u(5) = , but u(0) = u(1) , as otherwise the minimal period would be 1. Similarly, a period 3 orbit has with u(0) , u(1) , u(2) distinct. Stability of a period k orbit implies that nearby iterates converge to this periodic solution. For the logistic map, the period 2 orbit persists until = 2 3.4495, after which the iterates alternate between four values -- a period 4 orbit. This again changes at = 3 3.5441, after which the iterates end up alternating between eight values. In fact, there is an increasing sequence of values where, for any n < n+1 , the iterates eventually follow a period 2n orbit. Thus, as passes through each value n the period of the orbit goes from 2n to 2 2n = 2n+1 , and the discrete dynamical system experiences a bifurcation. The bifurcation values n are packed closer and closer together as n increases, piling up on an eventual limiting value = lim n 3.5699, at which point the orbit's period has, so to speak, become infinitely large. The entire phenomena is known as a period doubling cascade. Interestingly, the ratios of the distances between successive bifurcation points approaches a well-defined limit, n+2 - n+1 n+1 - n - 4.6692 . . . , (2.23) n u(0) = u(3) = u(6) = , u(1) = u(4) = u(7) = , u(2) = u(5) = u(8) = , 3 = 1 < 2 < 3 < 4 < , known as Feigenbaum's constant. In the 1970's, the American physicist Mitchell Feigenbaum, [16], discovered that similar period doubling cascades appear in a broad range of discrete dynamical systems. Even more remarkably, in almost all cases, the corresponding ratios of distances between bifurcation points has the same limiting value. Feigenbaum's experimental observations were rigorously proved by Oscar Lanford in 1982, [32]. After passes the limiting value , all hell breaks loose. The iterates become completely chaotic , moving at random over the interval [ 0, 1 ]. But this is not the end of the The term "chaotic" does have a precise mathematical definition, but the reader can take it more figuratively for the purposes of this elementary exposition. 3/15/06 18 c 2006 Peter J. Olver 1 0.8 0.6 0.4 0.2 2.5 3 3.5 4 Figure 2.7. The Logistic Map. story. Embedded within this chaotic regime are certain small ranges of where the system settles down to a stable orbit, whose period is no longer necessarily a power of 2. In fact, there exist values of for which the iterates settle down to a stable orbit of period k for any positive integer k. For instance, as increases past 3, 3.83, a period 3 orbit appears over a small range of values, after which, as increses slightly further, there is a period doubling cascade where period 6, 12, 24, . . . orbits successively appear, each persisting on a shorter and shorter range of parameter values, until passes yet another critical value where chaos breaks out yet again. There is a well-prescribed order in which the periodic orbits make their successive appearance, and each odd period k orbit is followed by a very closely spaced sequence of period doubling bifurcations, of periods 2 n k for n = 1, 2, 3, . . . , after which the iterates revert to completely chaotic behavior until the next periodic case emerges. The ratios of distances between bifurcation points always have the same Feigenbaum limit (2.23). Finally, these periodic and chaotic windows all pile up on the ultimate parameter value = 4. And then, when > 4, all the iterates go off to , and the system ceases to be interesting. The reader is encouraged to write a simple computer program and perform some numerical experiments. In particular, Figure 2.7 shows the asymptotic behavior of the iterates for values of the parameter in the interesting range 2 < < 4. The horizontal axis is , and the marked points show the ultimate fate of the iteration for the given value of . For instance, each point the single curve lying above the smaller values of represents a stable fixed point; this bifurcates into a pair of curves representing stable period 2 orbits, which then bifurcates into 4 curves representing period 4 orbits, and so on. Chaotic behavior is indicated by a somewhat random pattern of points lying above the value of . To plot this figure, we ran the logistic iteration u(n) for 0 n 100, discarded the first 50 points, and then plotted the next 50 iterates u(51) , . . . , u(100) . Investigation of the fine detailed structure of the logistic map requires yet more iterations with increased numerical accuracy. In addition one should discard more of the initial iterates so as to give 3/15/06 19 c 2006 Peter J. Olver the system enough time to settle down to a stable periodic orbit or, alternatively, continue in a chaotic manner. Remark : So far, we have only looked at real scalar iterative systems. Complex discrete dynamical systems display yet more remarkable and fascinating behavior. The complex version of the logistic iteration equation leads to the justly famous Julia and Mandelbrot sets, [33], with their stunning, psychedelic fractal structure, [42]. The rich range of phenomena in evidence, even in such extremely simple nonlinear iterative systems, is astounding. While intimations first appeared in the late nineteenth century research of the influential French mathematician Henri Poincar, serious investigae tions were delayed until the advent of the computer era, which precipitated an explosion of research activity in the area of dynamical systems. Similar period doubling cascades and chaos are found in a broad range of nonlinear systems, [1], and are often encountered in physical applications, [35]. A modern explanation of fluid turbulence is that it is a (very complicated) form of chaos, [1]. Quadratic Convergence Let us now return to the more mundane case when the iterates converge to a stable fixed point of the discrete dynamical system. In applications, we use the iterates to compute a precise numerical value for the fixed point, and hence the efficiency of the algorithm depends on the speed of convergence of the iterates. According to the remark following the proof Theorem 2.6, the convergence rate of an iterative system is essentially governed by the magnitude of the derivative | g (u ) | at the fixed point. The basic inequality (2.18) for the errors e(k) = u(k) - u , namely | e(k+1) | | e(k) |, known is as a linear convergence estimate. It means that, once the iterates are close to the fixed point, the error decreases by a factor of (at least) | g (u ) | at each step. If the k th iterate u(k) approximates the fixed point u correctly to m decimal places, so its error is bounded by | e(k) | < .5 10- m , | e(k+1) | | e(k) | < .5 10- m = .5 10- m+log10 . More generally, for any j > 0, | e(k+j) | j | e(k) | < .5 10- m j = .5 10- m+j log10 , m - j log10 = m + j log10 -1 then the (k + 1)st iterate satisfies the error bound which means that the (k + j)th iterate u(k+j) has at least The degree of precision is to be specified by the user and the application. Note that since < 1, the logarithm log 10 -1 = - log10 > 0 is positive. 3/15/06 20 c 2006 Peter J. Olver correct decimal places. For instance, if = .1 then each new iterate produces one new decimal place of accuracy (at least), while if = .9 then it typically takes 22 -1/ log 10 .9 iterates to produce just one additional accurate digit! This means that there is a huge advantage -- particularly in the application of iterative methods to the numerical solution of equations -- to arrange that | g (u ) | be as small as possible. The fastest convergence rate of all will occur when g (u ) = 0. In fact, in such a happy situation, the rate of convergence is not just slightly, but dramatically faster than linear. Theorem 2.10. Suppose that g C2 , and u = g(u ) is a fixed point such that g (u ) = 0. Then, for all iterates u(k) sufficiently close to u , the errors e(k) = u(k) - u satisfy the quadratic convergence estimate | e(k+1) | | e(k) |2 for some constant > 0. Proof : Just as that of the linear convergence estimate (2.18), the proof relies on approximating g(u) by a simpler function near the fixed point. For linear convergence, an affine approximation sufficed, but here we require a higher order approximation. Thus, we replace the mean value formula (2.16) by the first order Taylor expansion g(u) = g(u ) + g (u ) (u - u ) + 1 g (w) (u - u )2 , 2 (2.25) (2.24) where the final error term depends on an (unknown) point w that lies between u and u . At a fixed point, the constant term is g(u ) = u . Furthermore, under our hypothesis g (u ) = 0, and so (2.25) reduces to g(u) - u = Therefore, | g(u) - u | | u - u |2 , where is chosen so that 1 2 1 2 g (w) (u - u )2 . (2.26) (2.27) | g (w) | for all w sufficiently close to u . Therefore, the magnitude of is governed by the size of the second derivative of the iterative function g(u) near the fixed point. We use the inequality (2.26) to estimate the error | e(k+1) | = | u(k+1) - u | = | g(u(k) ) - g(u ) | | u(k) - u |2 = | e(k) |2 , which establishes the quadratic convergence estimate (2.24). Q.E.D. Let us see how the quadratic estimate (2.24) speeds up the convergence rate. Following our earlier argument, suppose u(k) is correct to m decimal places, so | e(k) | < .5 10- m . 3/15/06 21 c 2006 Peter J. Olver Then (2.24) implies that | e(k+1) | < .5 (10- m )2 = .5 10- 2 m+log10 , and so u(k+1) has 2 m - log10 accurate decimal places. If | g (u ) | is of moderate size, we have essentially doubled the number of accurate decimal places in just a single iterate! A second iteration will double the number of accurate digits yet again. Thus, the convergence of a quadratic iteration scheme is extremely rapid, and, barring round-off errors, one can produce any desired number of digits of accuracy in a very short time. For example, if we start with an initial guess that is accurate in the first decimal digit, then a linear iteration with = .1 will require 49 iterations to obtain 50 decimal place accuracy, whereas a quadratic iteration (with = 1) will only require 6 iterations to obtain 2 6 = 64 decimal places of accuracy! Example 2.11. Consider the function g(u) = 2 u3 + 3 . 3 u2 + 3 There is a unique (real) fixed point u = g(u ), which is the real solution to the cubic equation 1 3 u3 + u - 1 = 0. Note that g (u) = 6 u 1 u3 + u - 1 2 u4 + 6 u 2 - 6 u 3 = , 3 (u2 + 1)2 3 (u2 + 1)2 and hence g (u ) = 0 vanishes at the fixed point. Theorem 2.10 implies that the iterations will exhibit quadratic convergence to the root. Indeed, we find, starting with u (0) = 0, the following values: k u(k) 1 1.00000000000000 4 .817731680821982 2 .833333333333333 5 .817731673886824 3 .817850637522769 6 .817731673886824 The convergence rate is dramatic: after only 5 iterations, we have produced the first 15 decimal places of the fixed point. In contrast, the linearly convergent scheme based on g(u) = 1 - 1 u3 takes 29 iterations just to produce the first 5 decimal places of the same 3 solution. In practice, the appearance of a quadratically convergent fixed point is a matter of luck. The construction of quadratically convergent iterative methods for solving equations will be the focus of the following Section. 3/15/06 22 c 2006 Peter J. Olver 3 2 1 -1 -0.5 -1 -2 -3 -4 0.5 1 Figure 2.8. Graph of u5 + u + 1. 2.2. Numerical Solution of Equations. Solving nonlinear equations and systems of equations is, of course, a problem of utmost importance in mathematics and its manifold applications. We begin by studying the scalar case. Thus, we are given a real-valued function f : R R, and seek its roots, i.e., the real solution(s) to the scalar equation f (u) = 0. (2.28) Here are some prototypical examples: (a) Find the roots of the quintic polynomial equation u5 + u + 1 = 0. (2.29) Graphing the left hand side of the equation, as in Figure 2.8, convinces us that there is just one real root, lying somewhere between -1 and -.5. While there are explicit algebraic formulas for the roots of quadratic, cubic, and quartic polynomials, a famous theorem due to the Norwegian mathematician Nils Henrik Abel in the early 1800's states that there is no such formula for generic fifth order polynomial equations. (b) Any fixed point equation u = g(u) has the form (2.28) where f (u) = u - g(u). For example, the trigonometric Kepler equation u - sin u = m arises in the study of planetary motion, cf. Example 2.7. Here , m are fixed constants, and we seek a corresponding solution u. (c) Suppose we are given chemical compounds A, B, C that react to produce a fourth compound D according to 2 A + B D, A + 3 C D. Let a, b, c be the initial concentrations of the reagents A, B, C injected into the reaction chamber. If u denotes the concentration of D produced by the first reaction, and v that A modern proof of this fact relies on Galois theory, [ 19 ]. 3/15/06 23 c 2006 Peter J. Olver f (u) a u b Figure 2.9. Intermediate Value Theorem. by the second reaction, then the final equilibrium concentrations a = a - 2 u - v, b = b - u, c = c - 3 v, d = u + v, of the reagents will be determined by solving the nonlinear system (a - 2 u - v)2 (b - u) = (u + v), (a - 2 u - v)(c - 3 v)3 = (u + v), (2.30) where , are the known equilibrium constants of the two reactions. Our immediate goal is to develop numerical algorithms for solving such nonlinear scalar equations. The Bisection Method The most primitive algorithm, and the only one that is guaranteed to work in all cases, is the Bisection Method. While it has an iterative flavor, it cannot be properly classed as a method governed by functional iteration as defined in the preceding section, and so must be studied directly in its own right. The starting point is the Intermediate Value Theorem, which we state in simplified form. See Figure 2.9 for an illustration, and [2] for a proof. Lemma 2.12. Let f (u) be a continuous scalar function. Suppose we can find two points a < b where the values of f (a) and f (b) take opposite signs, so either f (a) < 0 and f (b) > 0, or f (a) > 0 and f (b) < 0. Then there exists at least one point a < u < b where f (u ) = 0. Note that if f (a) = 0 or f (b) = 0, then finding a root is trivial. If f (a) and f (b) have the same sign, then there may or may not be a root in between. Figure 2.10 plots the functions u2 + 1, u2 and u2 - 1, on the interval -2 u 2. The first has two simple roots; the second has a single double root, while the third has no root. We also note that continuity of the function on the entire interval [ a, b ] is an essential hypothesis. For example, the function f (u) = 1/u satisfies f (-1) = -1 and f (1) = 1, but there is no root to the equation 1/u = 0. 3/15/06 24 c 2006 Peter J. Olver 5 4 3 2 1 5 4 3 2 1 5 4 3 2 1 -2 -1 -1 -2 1 2 -2 -1 -1 -2 1 2 -2 -1 -1 -2 1 2 Figure 2.10. Roots of Quadratic Functions. Note carefully that the Lemma 2.12 does not say there is a unique root between a and b. There may be many roots, or even, in pathological examples, infinitely many. All the theorem guarantees is that, under the stated hypotheses, there is at least one root. Once we are assured that a root exists, bisection relies on a "divide and conquer" strategy. The goal is to locate a root a < u < b between the endpoints. Lacking any 1 additional evidence, one tactic would be to try the midpoint c = 2 (a + b) as a first guess for the root. If, by some miracle, f (c) = 0, then we are done, since we have found a solution! Otherwise (and typically) we look at the sign of f (c). There are two possibilities. If f (a) and f (c) are of opposite signs, then the Intermediate Value Theorem tells us that there is a root u lying between a < u < c. Otherwise, f (c) and f (b) must have opposite signs, and so there is a root c < u < b. In either event, we apply the same method to the interval in which we are assured a root lies, and repeat the procedure. Each iteration halves the length of the interval, and chooses the half in which a root is sure to lie. (There may, of course, be a root in the other half interval, but as we cannot be sure, we discard it from further consideration.) The root we home in on lies trapped in intervals of smaller and smaller width, and so convergence of the method is guaranteed. Example 2.13. The roots of the quadratic equation f (u) = u2 + u - 3 = 0 can be computed exactly by the quadratic formula: - 1 - 13 - 1 + 13 1.302775 . . . , u2 = - 2.302775 . . . . u1 = 2 2 Let us see how one might approximate them by applying the Bisection Algorithm. We start the procedure by choosing the points a = u(0) = 1, b = v (0) = 2, noting that f (1) = -1 and f (2) = 3 have opposite signs and hence we are guaranteed that there is at least one root between 1 and 2. In the first step we look at the midpoint of the interval [ 1, 2 ], which is 1.5, and evaluate f (1.5) = .75. Since f (1) = -1 and f (1.5) = .75 have opposite signs, we know that there is a root lying between 1 and 1.5. Thus, we take u (1) = 1 and v (1) = 1.5 as the endpoints of the next interval, and continue. The next midpoint is at 1.25, where f (1.25) = -.1875 has the opposite sign to f (1.5) = .75, and so a root lies between u(2) = 1.25 and v (2) = 1.5. The process is then iterated as long as desired -- or, more practically, as long as your computer's precision does not become an issue. 3/15/06 25 c 2006 Peter J. Olver k 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 u(k) 1 1 1.25 1.25 1.25 1.2813 1.2969 1.2969 1.3008 1.3027 1.3027 1.3027 1.3027 1.3027 1.3027 v (k) 2 1.5 1.5 1.375 1.3125 1.3125 1.3125 1.3047 1.3047 1.3047 1.3037 1.3032 1.3030 1.3029 1.3028 1 w(k) = 2 (u(k) + v (k) ) f (w(k) ) .75 -.1875 .2656 .0352 -.0771 -.0212 .0069 -.0072 -.0002 .0034 .0016 .0007 .0003 .0001 -.0000 1.5 1.25 1.375 1.3125 1.2813 1.2969 1.3047 1.3008 1.3027 1.3037 1.3032 1.3030 1.3029 1.3028 1.3028 The table displays the result of the algorithm, rounded off to four decimal places. After 14 iterations, the Bisection Method has correctly computed the first four decimal digits of the positive root u1 . A similar bisection starting with the interval from u(1) = - 3 to v (1) = - 2 will produce the negative root. A formal implementation of the Bisection Algorithm appears in the accompanying pseudocode program. The endpoints of the k th interval are denoted by u(k) and v (k) . The midpoint is w (k) = 1 u(k) + v (k) , and the key decision is whether w (k) should be the 2 right or left hand endpoint of the next interval. The integer n, governing the number of iterations, is to be prescribed in accordance with how accurately we wish to approximate the root u . The algorithm produces two sequences of approximations u(k) and v (k) that both converge monotonically to u , one from below and the other from above: a = u(0) u(1) u(2) u(k) - u - v (k) v (2) v (1) v (0) = b. and u is trapped between the two. Thus, the root is trapped inside a sequence of intervals [ u(k) , v (k) ] of progressively shorter and shorter length. Indeed, the length of each interval is exactly half that of its predecessor: v (k) - u(k) = Iterating this formula, we conclude that v (n) - u(n) = 3/15/06 1 n 2 1 2 (v (k-1) - u(k-1) ). (b - a) - 0 as n - . c 2006 Peter J. Olver (v (0) - u(0) ) = 1 n 2 26 The Bisection Method start if f (a) f (b) < 0 set u(0) = a, v (0) = b else print "Bisection Method not applicable" for k = 0 to n - 1 1 set w (k) = 2 (u(k) + v (k) ) if f (w(k) ) = 0, stop; print u = w(k) if f (u(k) ) f (w(k) ) < 0, set u(k+1) = u(k) , v (k+1) = w(k) else next k 1 print u = w(n) = 2 (u(n) + v (n) ) set u(k+1) = w(k) , v (k+1) = v (k) end The midpoint w(n) = lies within a distance | w(n) - u | 1 2 1 2 (u(n) + v (n) ) 1 n+1 2 (v (n) - u(n) ) = (b - a) of the root. Consequently, if we desire to approximate the root within a prescribed tolerance , we should choose the number of iterations n so that 1 n+1 2 (b - a) < , or n > log2 b-a - 1. (2.31) Summarizing: Theorem 2.14. If f (u) is a continuous function, with f (a) f (b) < 0, then the Bisection Method starting with u(0) = a, v (0) = b, will converge to a solution u to the equation f (u) = 0 lying between a and b. After n steps, the midpoint w (n) = 1 (u(n) + v (n) ) 2 will be within a distance of = 2- n-1 (b - a) from the solution. For example, in the case of the quadratic equation in Example 2.13, after 14 iterations, we have approximated the positive root to within = 1 15 2 (2 - 1) 3.052 10-5 , reconfirming our observation that we have accurately computed its first four decimal places. If we are in need of 10 decimal places, we set our tolerance to = .5 10 -10 , and so, according to (2.31), must perform n = 34 > 33.22 log 2 2 1010 -1 successive bisections . This assumes we have sufficient precision on the computer to avoid round-off errors. 3/15/06 27 c 2006 Peter J. Olver Example 2.15. As noted at the beginning of this section, the quintic equation f (u) = u5 + u + 1 = 0 has one real root, whose value can be readily computed by bisection. We start the algorithm with the initial points u(0) = - 1, v (0) = 0, noting that f (- 1) = - 1 < 0 while f (0) = 1 > 0 are of opposite signs. In order to compute the root to 6 decimal places, we set = .510 -6 in (2.31), and so need to perform n = 20 > 19.93 log 2 2 106 - 1 bisections. Indeed, the algorithm produces the approximation u - .754878 to the root, and the displayed digits are guaranteed to be accurate. Fixed Point Methods The Bisection Method has an ironclad guarantee to converge to a root of the function -- provided it can be properly started by locating two points where the function takes opposite signs. This may be tricky if the function has two very closely spaced roots and is, say, negative only for a very small interval between them, and may be impossible for multiple roots, e.g., the root u = 0 of the quadratic function f (u) = u2 . When applicable, its convergence rate is completely predictable, but not especially fast. Worse, it has no immediately apparent extension to systems of equations, since there is no obvious counterpart to the Intermediate Value Theorem for vector-valued functions. Most other numerical schemes for solving equations rely on some form of fixed point iteration. Thus, we seek to replace the system of equations f (u) = 0 with a fixed point system u = g(u), that leads to the iterative solution scheme u(k+1) = g(u(k) ). For this to work, there are two key requirements: (a) The solution u to the equation f (u) = 0 is also a fixed point for g(u), and (b) u is, in fact a stable fixed point, meaning that the Jacobian g (u ) is a convergent matrix, or, slightly more restrictively, g (u ) < 1 for a prescribed matrix norm. If both conditions hold, then, provided we choose the initial iterate u (0) = c sufficiently close to u , the iterates u(k) u will converge to the desired solution. Thus, the key to the practical use of functional iteration for solving equations is the proper design of an iterative system -- coupled with a reasonably good initial guess for the solution. Before implementing general procedures, let us discuss a na example. ive Example 2.16. To solve the cubic equation f (u) = u3 - u - 1 = 0 (2.32) we note that f (1) = -1 while f (2) = 5, and so there is a root between 1 and 2. Indeed, the Bisection Method leads to the approximate value u 1.3247 after 17 iterations. Let us try to find the same root by fixed point iteration. As a first, na guess, we ive, rewrite the cubic equation in fixed point form Starting with the initial guess u(0) = 1.5, successive approximations to the solution are found by iterating u(k+1) = g(u(k) ) = (u(k) )3 - 1, 3/15/06 28 k = 0, 1, 2, . . . . c 2006 Peter J. Olver u = u3 - 1 = g(u). However, their values u(0) = 1.5, u(3) = 1904, u(1) = 2.375, u(4) = 6.9024 109 , u(2) = 12.396, u(5) = 3.2886 1029 , ... rapidly become unbounded, and so fail to converge. This could, in fact, have been predicted by the convergence criterion in Theorem 2.6. Indeed, g (u) = - 3 u2 and so | g (u) | > 3 for all u 1, including the root u . This means that u is an unstable fixed point, and the iterates cannot converge to it. On the other hand, we can rewrite the equation (2.32) in the alternative iterative form u = 3 1 + u = g(u). In this case 0 g (u) = 1 1 2/3 3 3(1 + u) for u > 0. Thus, the stability condition (2.17) is satisfied, and we anticipate convergence at a rate of 1 1 at least 3 . (The Bisection Method converges more slowly, at rate 2 .) Indeed, the first few 3 iterates u(k+1) = 1 + u(k) are 1.5, 1.35721, 1.33086, 1.32588, 1.32494, 1.32476, 1.32473, and we have converged to the root, correct to four decimal places, in only 6 iterations. Newton's Method Our immediate goal is to design an efficient iterative scheme u(k+1) = g(u(k) ) whose iterates converge rapidly to the solution of the given scalar equation f (u) = 0. As we learned in Section 2.1, the convergence of the iteration is governed by the magnitude of its derivative at the fixed point. At the very least, we should impose the stability criterion | g (u ) | < 1, and the smaller this quantity can be made, the faster the iterative scheme converges. if we are able to arrange that g (u ) = 0, then the iterates will converge quadratically fast, leading, as noted in the discussion following Theorem 2.10, to a dramatic improvement in speed and efficiency. Now, the first condition requires that g(u) = u whenever f (u) = 0. A little thought will convince you that the iterative function should take the form g(u) = u - h(u) f (u), (2.33) where h(u) is a reasonably nice function. If f (u ) = 0, then clearly u = g(u ), and so u is a fixed point. The converse holds provided h(u) = 0 is never zero. For quadratic convergence, the key requirement is that the derivative of g(u) be zero at the fixed point solutions. We compute g (u) = 1 - h (u) f (u) - h(u) f (u). Thus, g (u ) = 0 at a solution to f (u ) = 0 if and only if 0 = 1 - h (u ) f (u ) - h(u ) f (u ) = 1 - h(u ) f (u ). 3/15/06 29 c 2006 Peter J. Olver Consequently, we should require that h(u ) = 1 f (u ) (2.34) to ensure a quadratically convergent iterative scheme. This assumes that f (u ) = 0, which means that u is a simple root of f . For here on, we leave aside multiple roots, which require a different approach. Of course, there are many functions h(u) that satisfy (2.34), since we only need to specify its value at a single point. The problem is that we do not know u -- after all this is what we are trying to compute -- and so cannot compute the value of the derivative of f there. However, we can circumvent this apparent difficulty by a simple device: we impose equation (2.34) at all points, setting h(u) = 1 , f (u) f (u) , f (u) (2.35) which certainly guarantees that it holds at the solution u . The result is the function g(u) = u - (2.36) and the resulting iteration scheme is known as Newton's Method , which, as the name suggests, dates back to the founder of the calculus. To this day, Newton's Method remains the most important general purpose algorithm for solving equations. It starts with an initial guess u(0) to be supplied by the user, and then successively computes u(k+1) = u(k) - f (u(k) ) . f (u(k) ) (2.37) As long as the initial guess is sufficiently close, the iterates u(k) are guaranteed to converge, quadratically fast, to the (simple) root u of the equation f (u) = 0. Theorem 2.17. Suppose f (u) C2 is twice continuously differentiable. Let u be a solution to the equation f (u ) = 0 such that f (u ) = 0. Given an initial guess u(0) sufficiently close to u , the Newton iteration scheme (2.37) converges at a quadratic rate to the solution u . Proof : By continuity, if f (u ) = 0, then f (u) = 0 for all u sufficiently close to u , and hence the Newton iterative function (2.36) is well defined and continuously differentiable near u . Since g (u) = f (u) f (u)/f (u)2 , we have g (u ) = 0 when f (u ) = 0, as promised by our construction. The quadratic convergence of the resulting iterative scheme is an immediate consequence of Theorem 2.10. Q.E.D. Example 2.18. Consider the cubic equation f (u) = u3 - u - 1 = 0, that we already solved in Example 2.16. The function used in the Newton iteration is g(u) = u - 3/15/06 u3 - u - 1 f (u) =u- , f (u) 3 u2 - 1 30 c 2006 Peter J. Olver 0.1 0.05 0.2 -0.05 -0.1 -0.15 -0.2 -0.25 0.4 0.6 0.8 1 Figure 2.11. The function f (u) = u3 - 3 u2 + 5 u - 2 9 1 3 1 27 . which is well-defined as long as u = iterative procedure . We will try to avoid these singular points. The (u(k) )3 - u(k) - 1 3 (u(k) )2 - 1 u(k+1) = g(u(k) ) = u(k) - with initial guess u(0) = 1.5 produces the following values: 1.5, 1.34783, 1.32520, 1.32472, and we have computed the root to 5 decimal places after only three iterations. The quadratic convergence of Newton's Method implies that, roughly, each new iterate doubles the number of correct decimal places. Thus, to compute the root accurately to 40 decimal places would only require 3 further iterations . This underscores the tremendous advantage that the Newton algorithm offers over competing methods. Example 2.19. Consider the cubic polynomial equation 3 f (u) = u3 - 2 u2 + 5 u - 9 1 27 = 0. Since 1 f (0) = - 27 , f 1 3 = 1 54 , f 2 3 1 = - 27 , f (1) = 1 54 , the Intermediate Value Lemma 2.12 guarantees that there are three roots on the interval [ 0, 1 ]: one between 0 and 1 , the second between 1 and 2 , and the third between 2 and 1. 3 3 3 3 The graph in Figure 2.11 reconfirms this observation. Since we are dealing with a cubic polynomial, there are no other roots. (Why?) This assumes we are working in a sufficiently high precision arithmetic so as to avoid round-off errors. 3/15/06 31 c 2006 Peter J. Olver It takes sixteen iterations of the Bisection Method starting with the three subintervals 2 0 , , 1 , 2 and 3 , 1 , to produce the roots to six decimal places: 3 3 1 3 u1 .085119, u2 .451805, u3 .963076. Incidentally, if we start with the interval [ 0, 1 ] and apply bisection, we converge (perhaps surprisingly) to the largest root u3 in 17 iterations. Fixed point iteration based on the formulation u = g(u) = - u3 + 3 u2 + 4 u + 2 9 1 27 can be used to find the first and third roots, but not the second root. For instance, starting with u(0) = 0 produces u1 to 5 decimal places after 23 iterations, whereas starting with u(0) = 1 produces u3 to 5 decimal places after 14 iterations. The reason we cannot produce u2 is due to the magnitude of the derivative g (u) = - 3 u2 + 3 u + at the roots, which is g (u1 ) 0.678065, g (u2 ) 1.18748, g (u3 ) 0.551126. 4 9 Thus, u1 and u3 are stable fixed points, but u2 is unstable. However, because g (u1 ) and g (u3 ) are both bigger than .5, this iterative algorithm actually converges slower than ordinary bisection! Finally, Newton's Method is based upon iteration of the rational function g(u) = u - u3 - 3 u2 + 5 u - f (u) 2 9 =u- 2 - 3u + 5 f (u) 3u 9 1 27 . Starting with an initial guess of u(0) = 0, the method computes u1 to 6 decimal places after only 4 iterations; starting with u(0) = .5, it produces u2 to similar accuracy after 2 iterations; while starting with u(0) = 1 produces u3 after 3 iterations -- a dramatic speed up over the other two methods. Newton's Method has a very pretty graphical interpretation, that helps us understand what is going on and why it converges so fast. Given the equation f (u) = 0, suppose we know an approximate value u = u(k) for a solution. Nearby u(k) , we can approximate the nonlinear function f (u) by its tangent line y = f (u(k) ) + f (u(k) )(u - u(k) ). (2.38) As long as the tangent line is not horizontal -- which requires f (u(k) ) = 0 -- it crosses the axis at f (u(k) ) u(k+1) = u(k) - , f (u(k) ) 3/15/06 32 c 2006 Peter J. Olver f (u) u(k+1) u(k) Figure 2.12. Newton's Method. which represents a new, and, presumably more accurate, approximation to the desired root. The procedure is illustrated pictorially in Figure 2.12. Note that the passage from u(k) to u(k+1) is exactly the Newton iteration step (2.37). Thus, Newtonian iteration is the same as the approximation of function's root by those of its successive tangent lines. Given a sufficiently accurate initial guess, Newton's Method will rapidly produce highly accurate values for the simple roots to the equation in question. In practice, barring some kind of special exploitable structure, Newton's Method is the root-finding algorithm of choice. The one caveat is that we need to start the process reasonably close to the root we are seeking. Otherwise, there is no guarantee that a particular set of iterates will converge, although if they do, the limiting value is necessarily a root of our equation. The behavior of Newton's Method as we change parameters and vary the initial guess is very similar to the simpler logistic map that we studied in Section 2.1, including period doubling bifurcations and chaotic behavior. The reader is invited to experiment with simple examples; further details can be found in [42]. Example 2.20. For fixed values of the eccentricity , Kepler's equation u - sin u = m (2.39) can be viewed as a implicit equation defining the eccentric anomaly u as a function of the mean anomaly m. To solve Kepler's equation by Newton's Method, we introduce the iterative function u - sin u - m . g(u) = u - 1 - cos u Notice that when | | < 1, the denominator never vanishes and so the iteration remains well-defined everywhere. Starting with a sufficiently close initial guess u (0) , we are assured that the method will quickly converge to the solution. Fixing the eccentricity , we can employ tghe method of continuation to determine how the solution u = h(m) depends upon the mean anomaly m. Namely, we start at 3/15/06 33 c 2006 Peter J. Olver 1.4 1.2 1 0.8 0.6 0.4 0.2 0.2 0.4 0.6 0.8 1 Figure 2.13. The Solution to the Kepler Equation for Eccentricity = .5. m = m0 = 0 with the obvious solution u = h(0) = 0. Then, to compute the solution at successive closely spaced values 0 < m1 < m2 < m3 < , we use the previously computed value as an initial guess u(0) = h(mk ) for the value of the solution at the next mesh point mk+1 , and run the Newton scheme until it converges to a sufficiently accurate approximation to the value u = h(mk+1 ). As long as mk+1 is reasonably close to mk , Newton's Method will converge to the solution quite quickly. The continuation method will quickly produce the values of u at the sample points. Intermediate values can either be determined by an interpolation scheme, e.g., a cubic spline fit of the data, or by running the Newton scheme using the closest known value as an initial condition. A plot for 0 m 1 using the value = .5 appears in Figure 2.13. 3/15/06 34 c 2006 Peter J. Olver
Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

UCF - MATH - 5485
Chapter 20 Nonlinear Ordinary Differential EquationsThis chapter is concerned with initial value problems for systems of ordinary differential equations. We have already dealt with the linear case in Chapter 9, and so here our emphasis will be on nonline
UCF - MATH - 5485
Chapter 18 Partial Differential Equations in ThreeDimensional SpaceAt last we have ascended the dimensional ladder to its ultimate rung (at least for those of us living in a three-dimensional universe): partial differential equations in physical space. A
UCF - MATH - 5485
Orthogonal Bases and the QR Algorithmby Peter J. Olver University of Minnesota1. Orthogonal Bases.Throughout, we work in the Euclidean vector space V = R n , the space of column vectors with n real entries. As inner product, we will only use the dot pr
UCF - MATH - 5485
AIMS Lecture Notes 2006Peter J. Olver14. Finite ElementsIn this part, we introduce the powerful finite element method for finding numerical approximations to the solutions to boundary value problems involving both ordinary and partial differential equa
UCF - MATH - 5485
AIMS Lecture Notes 2006Peter J. Olver12. MinimizationIn this part, we will introduce and solve the most basic mathematical optimization problem: minimize a quadratic function depending on several variables. This will require a short introduction to pos
UCF - MATH - 5587
Remark : On a connected domain R 2 , all harmonic conjugates to a given function u(x, y) only differ by a constant: v(x, y) = v(x, y) + c; see Exercise . Although most harmonic functions have harmonic conjugates, unfortunately this is not always the case.
UCF - MATH - 5587
Chapter 7 Complex Analysis and Conformal MappingThe term &quot;complex analysis&quot; refers to the calculus of complex-valued functions f (z) depending on a single complex variable z. To the novice, it may seem that this subject should merely be a simple reworkin
UCF - MATH - 5587
1 Re z Figure 7.1.1 Im z 1 Real and Imaginary Parts of f (z) = z .Therefore, if f (z) is any complex function, we can write it as a complex combination f (z) = f (x + i y) = u(x, y) + i v(x, y), of two inter-related real harmonic functions: u(x, y) = Re
UCF - MATH - 5587
Figure 7.4.Real and Imaginary Parts ofz.also have complex logarithms! On the other hand, if z = x &lt; 0 is real and negative, then log z = log | x | + (2 k + 1) i is complex no matter which value of ph z is chosen. (This explains why one avoids defining
UCF - MATH - 5587
The proof of the converse - that any function whose real and imaginary components satisfy the CauchyRiemann equations is differentiable - will be omitted, but can be found in any basic text on complex analysis, e.g., [3, 65, 118]. Remark : It is worth poi
UCF - MATH - 5587
is analytic everywhere except for singularities at the points z = 3 and z = -1, where its denominator vanishes. Since f (z) = h1 (z) , z-3 where h1 (z) = ez (z + 1)21 is analytic at z = 3 and h1 (3) = 16 e3 = 0, we conclude that z = 3 is a simple (order
UCF - MATH - 5587
if and only if it has vanishing divergence: v = u v + = 0. x y (7.36)Incompressibility means that the fluid volume does not change as it flows. Most liquids, including water, are, for all practical purposes, incompressible. On the other hand, the flow is
UCF - MATH - 5587
Using formula (7.19) for the complex derivative, d = -i = u - i v, dz x y so = u, x = v. yThus, = v, and hence the real part (x, y) of the complex function (z) defines a velocity potential for the fluid flow. For this reason, the anti-derivative (z) is k
UCF - MATH - 5587
gDFigure 7.14.Mapping to the Unit Disk.Remark : In this section, we have focused on the fluid mechanical roles of a harmonic function and its conjugate. An analogous interpretation applies when (x, y) represents an electromagnetic potential function;
UCF - MATH - 5587
Figure 7.16.The Effect of = z 2 on Various Domains.obtained by cutting the complex plane along the negative real axis. On the other hand, vertical lines Re z = a are mapped to circles | | = ea . Thus, a vertical strip a &lt; Re z &lt; b is mapped to an annulu
UCF - MATH - 5587
zph zFigure 7.18.Complex Curve and Tangent.notation x(t) = ( x(t), y(t) ) to complex notation z(t) = x(t)+ i y(t). All the usual vectorial curve terminology - closed, simple (non-self intersecting), piecewise smooth, etc. - is employed without modific
UCF - MATH - 5587
Center: .1 Radius: .5Center: .2 + i Radius: 1Center: 1 + i Radius: 1Center: -2 + 3 i Radius: 3 2 4.2426Center: .2 + i Radius: 1.2806 Figure 7.21.Center: .1 + .3 i Radius: .9487Center: .1 + .1 i Radius: 1.1045Center: -.2 + .1 i Radius: 1.2042Airfoi
UCF - MATH - 5587
Example 7.35. The goal of this example is to construct an conformal map that takes a half disk D+ = | z | &lt; 1, Im z &gt; 0 (7.73) to the full unit disk D = cfw_ | | &lt; 1 . The answer is not = z 2 because the image of D+ omits the positive real axis, resulting
UCF - MATH - 5587
7.5. Applications of Conformal Mapping.Let us now apply what we have learned about analytic/conformal maps. We begin with boundary value problems for the Laplace equation, and then present some applications in fluid mechanics. We conclude by discussing h
UCF - MATH - 5587
Figure 7.25.A NonCoaxial Cable.Example 7.39. A non-coaxial cable. The goal of this example is to determine the electrostatic potential inside a non-coaxial cylindrical cable, as illustrated in Figure 7.25, with prescribed constant potential values on th
UCF - MATH - 5587
0 Figure 7.29.15 Fluid Flow Past a Tilted Plate.30Note that = ( 1, 0 ), and hence this flow satisfies the Neumann boundary conditions (7.95) on the horizontal segment D = . The corresponding complex potential is (z) = z, with complex velocity f (z) = (
UCF - MATH - 5587
on the unit disk D for an impulse concentrated at the origin; see Section 6.3 for details. How do we obtain the corresponding solution when the unit impulse is concentrated at another point = + i D instead of the origin? According to Example 7.25, the lin
UCF - MATH - 5587
as long as n = -1. Therefore, we can use the Fundamental Theorem of Calculus (which works equally well for real integrals of complex-valued functions), to evaluate n+1 1 -1 = n = 2 k + 1 odd, 0, 2 t + i (t - 1) 2 z n dz = = , n = 2 k even. n+1 P t = -1 n+
UCF - MATH - 5587
Figure 7.32.Orientation of Domain Boundary.Theorem 7.48. If f (z) is analytic on a bounded domain C, then f (z) dz = 0.(7.118)Proof : If we apply Green's Theorem to the two real line integrals in (7.109), we find u dx - v dy = - u v - x y = 0,v dx +
UCF - MATH - 5587
Proof : Note that the integrand f (z) = 1/(z - a) is analytic everywhere except at z = a, where it has a simple pole. If a is outside C, then Cauchy's Theorem 7.48 applies, and the integral is zero. On the other hand, if a is inside C, then Proposition 7.
UCF - MATH - 5587
0 Figure 7.36.15 Kutta Flow Past a Tilted Airfoil.30which remains asymptotically 1 at large distances. By Cauchy's Theorem 7.48 coupled with formula (7.123), if C is a curve going once around the disk in a counter-clockwise direction, then i 1 dz = - 2
UCF - MATH - 5587
is analytic in the disk | z | 2 since its only singularity, at z = 3, lies outside the contour C. Therefore, by Cauchy's formula (7.135), we immediately obtain the integral ez dz = z2 - 2 z - 3 f (z) i dz = 2 i f (-1) = - . z+1 2eCCNote: Path independe
UCF - MATH - 5587
Chapter 12 Dynamics of Planar MediaIn previous chapters we studied the equilibrium configurations of planar media - plates and membranes - governed by the two-dimensional Laplace and Poisson equations. In this chapter, we analyze their dynamics, modeled
UCF - MATH - 5587
In this manner, we arrive at the basic conservation law relating the heat energy density and the heat flux vector w. As in our one-dimensional model, cf. (4.3), the heat energy density (t, x, y) is proportional to the temperature, so (t, x, y) = (x, y) u(
UCF - MATH - 5587
for the diffusion equation. See [35; p. 369] for a precise statement and proof of the general theorem. Qualitative Properties Before tackling examples in which we are able to construct explicit formulae for the eigenfunctions and eigenvalues, let us see w
UCF - MATH - 5587
Theorem 12.1. Suppose u(t, x, y) is a solution to the forced heat equation ut = u + F (t, x, y), for (x, y) , 0 &lt; t &lt; c,where is a bounded domain, and &gt; 0. Suppose F (t, x, y) 0 for all (x, y) and 0 t c. Then the global maximum of u on the set cfw_ (t, x
UCF - MATH - 5587
so there are no non-separable eigenfunctions . As a consequence, the general solution to the initial-boundary value problem can be expressed as a linear combination u(t, x, y) =m,n = 1cm,n um,n (t, x, y) =m,n = 1cm,n e- m,n t vm,n (x, y)(12.41)of
UCF - MATH - 5587
Let us start with the equation for q(). The second boundary condition in (12.50) requires that q() be 2 periodic. Therefore, the required solutions are the elementary trigonometric functions q() = cos m or sin m , where = m2 , (12.53)with m = 0, 1, 2, .
UCF - MATH - 5587
15 10 5 -4 -2 -5 -10 -15 2 4Figure 12.3.The Gamma Function.Thus, at integer values of x, the gamma function agrees with the elementary factorial. A few other values can be computed exactly. One important case is when x = 1 . Using 2 the substitution t
UCF - MATH - 5587
Remark : The definition of a singular point assumes that the other coefficients do not both vanish there, i.e., either q(x0 ) = 0 or r(x0 ) = 0. If all three functions happen to vanish at x0 , we can cancel any common factor (x - x0 )k , and hence, withou
UCF - MATH - 5587
we find that the only non-zero coefficients un are when n = 3 k +1. The recurrence relation u3 k+1 = u3 k-2 (3 k + 1)(3 k) yields u3 k+1 = 1 . (3 k + 1)(3 k)(3 k - 2)(3 k - 3) 7 6 4 3The resulting solution isx3 k+1 . (3 k + 1)(3 k)(3 k - 2)(3 k - 3) 7 6
UCF - MATH - 5587
two different Frobenius expansions. Usually, this expectation is valid, but there is an important exception, which occurs when the indices differ by an integer. The general result is summarized in the following list: (i ) If r2 - r1 is not an integer, the
UCF - MATH - 5587
We have thus found the series solution (-1)k xm+2k . 22k k(k - 1) 3 2 (r + k)(r + k - 1) (r + 2)(r + 1) k=0 k=0 (12.93) So far, we not paid attention to the precise values of the indices r = m. In order to continue the recurrence, we need to ensure that t
UCF - MATH - 5587
where h0 = 0, while = limkhk = 1 +1 1 1 + + + , 2 3 k (12.102)hk - log k .5772156649 . . .is known as Euler's constant. All Bessel functions of the second kind have a singularity at the origin x = 0; indeed, by inspection of (12.101), we find that th
UCF - MATH - 5587
of the Bessel boundary value problem (12.5455) are the squares of the roots of the Bessel function of order m. The corresponding eigenfunctions are wm,n (r) = Jm (m,n r) , n = 1, 2, 3, . . . , m = 0, 1, 2, . . . , (12.112)defined for 0 r 1. Combining (12
UCF - MATH - 5587
t=0t = .04t = .08t = .12 Figure 12.6.t = .16 Heat Diffusion in a Disk.t = .212.5. The Fundamental Solution of the Heat Equation.As we learned in Section 4.1, the fundamental solution to the heat equation measures the temperature distribution result
UCF - MATH - 5587
for the planar heat equation is given by the linear superposition formula u(t, x, y) = 1 4 t f (, ) e- [ (x-)2+(y-)2 ]/(4 t)d d.(12.125)We can interpret the solution formula (12.125) as a two-dimensional convolution u(t, x, y) = F (t, x, y) f (x, y)
UCF - MATH - 5587
Vibration of a Rectangular Drum Let us first consider the vibrations of a membrane in the shape of a rectangle R= 0 &lt; x &lt; a, 0 &lt; y &lt; b ,with side lengths a and b, whose edges are fixed to the (x, y)plane. Thus, we seek to solve the wave equation utt = c2
UCF - MATH - 5587
A table of their values (for the case c = 1) can be found in the preceding section. The Bessel roots do not follow any easily discernible pattern, and are not rational multiples of each other. This result, known as Bourget's hypothesis, [142; p. 484], was
UCF - MATH - 5587
following table, we display a list of all relative vibrational frequencies (12.158) that are &lt; 6. Once the lowest frequency 0,1 has been determined - either theoretically, numerically or experimentally - all the higher overtones m,n = m,n 0,1 are simply o
UCF - MATH - 5587
For example, on a unit square R = 0 &lt; x, y &lt; 1 , an accidental degeneracy occurs whenever m2 + n2 = k 2 + l2 (12.163) for distinct pairs of positive integers (m, n) = (k, l). The simplest possibility arises whenever m = n, in which case we can merely reve
UCF - MATH - 5587
Chapter 9 Linear and Nonlinear Evolution EquationsIn this chapter, we analyze several of the most important evolution equations, both linear and nonlinear, involving a single spatial variable. Our first stop is to revisit the heat equation. We introduce
UCF - MATH - 5587
Chapter 3 Fourier SeriesJust before 1800, the French mathematician/physicist/engineer Jean Baptiste Joseph Fourier made an astonishing discovery. Through his deep analytical investigations into the partial differential equations modeling heat propagation
UCF - MATH - 5587
Chapter 8 Fourier TransformsFourier series and their ilk are designed to solve boundary value problems on bounded intervals. The extension of Fourier methods to the entire real line leads naturally to the Fourier transform, an extremely powerful mathemat
UCF - MATH - 5587
Chapter 6 Generalized Functions and Green's FunctionsBoundary value problems, involving both ordinary and partial differential equations, can be profitably viewed as the infinite-dimensional function space versions of finite dimensional systems of linear
UCF - MATH - 5587
Math 5587 September 8, 2011Homework #1Problems: Chapter 1: 1.1ae, 1.2b,d, 1.5a,e, 1.6, 1.12a, 1.16ad, 1.18, 1.19, 1.20, 1.24. Chapter 2: 2.1 2, 3c,e, 4, 6.Due: Thursday, September 15
UCF - MATH - 5587
Math 5587 September 20, 2011Homework #2Problems: Chapter 2: 2.2 2.3 2a, 3b, 9, 17, 26, 27. 2, 5, 14, 15.Due: Thursday, September 29 First Midterm: Tuesday, October 11 Will cover chapters 1 &amp; 2. You will be allowed to use one 8&quot; 11&quot; sheet of notes. Note
UCF - MATH - 5587
Math 5587 September 29, 2011Homework #3Problems: Chapter 2: 2.4 2, 3, 4c,d, 8, 11, 12.Also, in 2.4.8, determine the domain of influence of the point (0,2) and the domain of dependence of the point (3,-1). Discuss what these tell you about the solution.
UCF - MATH - 5587
Math 5587 October 13, 2011Homework #4Problems: Chapter 3: 3.1 3.2 2b, 5. 1, 2g, 3a, 5, 6a,g, 15a,d, 16a,d, 24, 25, 34, 35, 41b, 52, 53.Due: Thursday, October 20
UCF - MATH - 5587
Math 5587 October 25, 2011Homework #5Problems: Chapter 3: 3.3 1, 2, 8. 3.4 2b, 3c, 7, 9 (just use one of the two methods). 3.5 2b,c,d, 4, 8, 11a,b,c. Due: Tuesday, November 1 Second Midterm: Thursday, November 17 Will cover chapters 3 &amp; 4. You will be a
UCF - MATH - 5587
Math 5587 November 3, 2011Homework #6Problems: Chapter 3: 3.5 13, 21c,e, 22b,c, 27b,d, 30, 31, 35a, 42. Chapter 4: 4.1 2, 4c, 10, 17a,b. Due: Thursday, November 10 Second Midterm: Thursday, November 17 Will cover chapters 3 &amp; 4. You will be allowed to u
UCF - MATH - 5587
Math 5587 November 10, 2011Homework #7Problems: Chapter 4: 4.2 3a, 4b,e, 8, 14a,d,e, 26. 4.3 4, 7, 10c, 11, 12a, 16, 24a, 29, 31. Due: Tuesday, November 22 Second Midterm: Thursday, November 17 Will cover chapters 3 &amp; 4. You will be allowed to use one 8
UCF - MATH - 5587
Math 5587 December 6, 2011Homework #8Problems: Chapter 4: 4.4 2a,e,f, 12a,e,f, 13, 17a,b. Chapter 6: 6.1 1b,d, 2d, 3, 5b, 8, 13, 19, 35. 6.2 2, 6. 6.3 1, 2, 6. Due: Tuesday, December 13 Final Exam: Take Home, to be handed out on Tuesday, December 13 and
UCF - MATH - 5587
Chapter 2 Linear and Nonlinear WavesOur exploration of the vast mathematical continent that is partial differential equations will begin with simple first order equations. In applications, first order partial differential equations are most commonly used
UCF - MATH - 5587
Chapter 5 Numerical Methods: Finite DifferencesAs you know, the differential equations that can be solved by an explicit analytic formula are few and far between. Consequently, the development of accurate numerical approximation schemes is essential for
UCF - MATH - 5587
Chapter 11 Numerical Methods: Finite ElementsIn Chapter 5, we introduced the first, the oldest, and in many ways the simplest class of numerical algorithms for approximating the solutions to partial differential equations: finite differences. In the pres