Sturm's theorem - Everything2.com

Jacques Charles François Sturm's eponymous theorem is chiefly remarkable for its formulation. Proved in 1829, it was considered obscure. It gives an algebraic decision procedure (an algorithm) for determining the number of real roots of a polynomial as well as their locations. As a practical algorithm, it's not too useful (the numbers involved grow very quickly, and high accuracy must be maintained). But as an algorithm for proving roots exist, or for finding bounds on some parameters of a polynomial to get roots, it can be useful.

I assume Sturm intended his theorem as a tool for working on differential equations. Various polynomial characteristics exist for these. knowing whether a real root exists on some interval is important for determining the qualitative behaviour of solutions.

But that such an algorithm even exists is decidedly non-trivial. Indeed, over a century later its importance was recognized by Alfred Tarski, who used it as a chief tool in creating a decision procedure for checking propositions in the language of algebra on the real numbers. (This will (I hope) be covered in a later node...)

Definition. Let P(x)∈R[x] be a separable polynomial in one variable x with real coefficients. The Sturm chain of polynomials is the chain of polynomials defined by

P₀(x) = P(x).

P₁(x) = P'(x).

P_k+1(x) = - (P_k-1(x) % P_k(x)),

where (borrowing from C and C-like languages...) a(x) % b(x) is the remainder when dividing the polynomial a(x) by b(x) (e.g. by synthetic division).

Note that the Sturm chain is in practice finite: If the degree deg P(x) = n, then ∀k≤n: deg P_k(x) = n-k, so P_n is constant, and all subsequent P_ks are zero. The theorem only looks at changes of sign, so only the first n+1 elements of the chain can be of interest.

Theorem. Let P(x)∈R[x] be a separable polynomial in one variable x with real coefficients, and let the Sturm chain P₀(x), P₁(x), ... be as above. For all t∈R, let N(t)∈N₀ be the number of sign changes in the chain P₀(t), P₁(t), ... . Then for any interval I=[a,b] with P(a), P(b) ≠ 0, the number of roots of P(x) in I is N(a)-N(b).

Example

We determine the number and approximate locations of the roots of the polynomial P(x)=x⁵-5x²+3. This polynomial is separable (has no repeated roots), as can be seen e.g. by following the method described in Noether's "separable" writeup, and verifying that P(x) and P'(x) share no common factor.

Calculation (courtesy of Emacs' calc package) yields this Sturm chain:

P₀(x) = x⁵-5x²+3
P₁(x) = 5x⁴-10x
P₂(x) = 3x²-3
P₃(x) = 10x-5
P₄(x) = 9/4.

How many real roots are there? It is easy to see that there can be no roots with absolute value greater than 2. We calculate N(-2) and N(2). The signs of the chain at t=-2 are -++-+, giving N_-2=3; the signs of the chain at t=2 are +++++, giving N₂=0, so there are 3 real roots.

We can also calculate N_-1=3 (signs -+0-+), N₀=2 (signs +0--+) and N₁=1 (signs --0++). So there is 1 root between -1 and 0, 1 root betwen 0 and 1, and 1 root between 1 and 2. If we like, we can continue in this fashion -- but now that we have intervals each containing one root, it's faster to use any iterative method for finding roots, such as the Newton-Raphson method or the "regula falsi" method (or, better yet, one of their accelerated versions -- we're seeking the roots of a polynomial, which is an exceedingly well-behaved function!).

By comparison, we could have directly used the intermediate value theorem. It immediately implies that there is a real root (for any polynomial of odd degree). In fact, with a bit more work we see that any separable polynomial of odd degree has an odd number of real roots. But we'd have to guess the location of the roots. Knowing that there is one root we'd have to guess that there are two more roots. Even knowing that there are 3, we'd be pushed to show that there aren't 5! For higher degree polynomials, the problem becomes progressively higher; Sturm's theorem is the easy way out.

proof of Sturm's theorem	Prenex and Skolem normal forms	Jacques Charles François Sturm	Exact values of sine, cosine and tan
"regula falsi" method	Separable	faster	synthetic division
Alfred Tarski	Proving a function has only one root in a given interval	Infinite Shakespeare theorem	false secant iteration
Newton-Raphson method	Polynomial	Sturm-Liouville Theory	Ikkyu
Sojourn of Arjuna	Iterative methods for finding the roots of a function	Horner's rule	synthetic substitution
Polynomial division	Calc	Parameter	real number