-
-
home
-
-
PDF (letter size)
Notes on Sturm Liouville
April 15, 2019 Compiled on January 30, 2024 at 3:31am
Contents
1 Definitions
Regular Sturm-Liouville ODE is an eigenvalue boundary value ODE. This means the
ODE has an eigenvalue in it , where solutions exists only for specific values of eigenvalues.
The ODE is
Some books use in the above instead of . The above also can be written as
With the restrictions that are real functions that are continuous everywhere over and also
we need . The is called the weight function. But this is not all. The boundary conditions
must be linear homogeneous, of this form
Where are just real constants. Some of them can be zero but not all. For example, is
OK.
Boundary conditions do not have to mixed, but they can be in general. But they must be
homogeneous.
Notice that periodic boundary conditions are not allowed. Well, they are allowed, but then
the problem is no longer called Sturm-Liouville. The above is just the definition of the
equation and its boundary conditions. Below is list of the important properties of this ODE.
Each one of these properties have a proof.
The standard way to write S.L. is in the form (2) and not (1) above. This is because the S.L.
operator is defined Where . So we break into two parts, putting the part with the
eigenvalue to the right.
All eigenvalues are real. No complex .
Each eigenvalue , will have one and only one real eigenfunction associated with it. (in
higher dimensions, the eigenvalue problem can have more than one eigenfunction associated
with one eigenvalue. For example, heat PDE in rectangle). But we can use Gram-schmidt to
make these eigenfunctions orthogonal if we need to). But for exam, just worry about 1D for
now.
There is smallest , called and there are infinite number of eigenvalues. Hence eigenvalues
are ordered
Each eigenfunction will have zeros in . Note that this does not include the end points. This
means, will have two zeros inside the domain. i.e. there are two locations where in
.
Eigenfunctions make a complete set of basis. This means any piecewise continuous function
can be represented as . This is called the generalized Fourier series.
This follows from (5). Each eigenfunction is orthogonal to another eigenfunction, but with
the weight in there. This means if .
Rayleigh quotient relates an eigenvalue to its eigenfunction. Starting with the SL ODE ,
then multiplying by and integrating, we obtain and solving for , gives Carrying integration
by parts on first part of the integral in numerator, it becomes But this becomes much
simpler when we plug-in the boundary conditions that we must use, making the above And
if and , then it becomes Rayleigh quotient is useful to show that can be positive without
solving for and also used to estimate the value of the minimum by replacing by a
trial function and actually solving for using to obtain a numerical estimate of
.
There is symmetry relation. In operator form, let , then we have where are any two
different eigenfunctions.
There is also what is called Lagrange identity, which says . This comes from simply
expanding and simplifying things. Just calculus.
Green formula, follows from Lagrange identity, which just gives the integral form . But form
Sturm-Liouville, we know that (from 8). So this really just says that . But we know this
already from boundary conditions. So I am not sure why this is useful for Sturm-Liouville
now. Since it is just saying the same thing again.
If at the left or right end (boundaries), then the problem is now called singular
Sturm-Liouville. This is actually the important case. In regular S.L., must be
positive everywhere. We only consider at the ends, not in the middle or any other
place. When this happens, then solutions to the ODE generate special functions,
such as Bessel, Legendre, Chebyshev. These special functions are solution to the
Singular S.L. ODE, not the regular S.L. ODE. In addition, at the end where ,
the boundary conditions must be bounded. Not the regular boundary conditions
above. For example, if where is say the left end, the the boundary conditions
at the left end must be . Notice that singular can be at left end, right end or at
both at same time. At the end where , the boundary conditions must be bounded
type.
Symmetry relation where are are two eigenfunction. But remember, this does NOT mean
the integrand is identically zero, or . Only when happened to have same eigenvalue we can
say that . But for unique eigenfunctions only the integral form is zero This is important
difference, used in the proof below, that different eigenfunctions have different eigenvalues
(for the scalar SL case). For higher dimensions, we can have more than one eigenfunction for
same eigenvalue.
Small note added April 18, 2021. The above is the same as showing the operator is
Hermitian. In bra/ket Dirac notation, an operator is Hermitian when
In the proofs below, I did not do the complex conjugate, since it is assumed all
eigenfunctions are real, and I only wrote that the symmetry relation implies
It should be really called the Hermitian relation to be more accurate. But for real
eigenfunctions, it is the same.
2 Proofs
2.1 Proof symmetry of operator
Given regular Sturm-Liouville (RSL) ODE
In operator form And (1) becomes When solving RSL ode, since it is an eigenvalue ODE
with associated boundary conditions, we will get infinite number of non-negative
eigenvalues.
For each eigenvalue, there is associated with it one eigenfunction (for 1D case). Looking at
any two different eigenfunctions, say , then the symmetry relation says the following Now we
will show the above is true. This requires integration by parts two times. We start from the
LHS expression and at the end we should end up with the integral on the RHS. Let ,
then
Where we used in the above. Hence Now we will do integration by parts on the first integral.
Let . Using , and if we let , then . Hence We now apply integration by parts again to the
second integral above. But now let and , hence and , therefore the above becomes
Substituting the above into (2) gives Now comes the part where the boundary conditions are
important. In RSL, the boundary conditions are such that all the terms vanish. This is
because the boundary conditions are
And
So now (3) becomes
Therefore, we showed that . The only thing to watch for here, is which term to make and
which to make when making integration by parts. To remember, always start with and
make the term with in them during integration parts.
2.2 Proof eigenvalues are real
Assume is complex. The corresponding eigenfunctions are complex also. Let be
corresponding eigenfunction Taking complex conjugate of both sides, and since
is real, we obtain But since all the coefficients of the ODE are real. The above
becomes But by symmetry, we know that Substituting (1),(2) into the above
gives
But which is positive. Also the weight is positive by definition. Hence for the above to zero,
it must be that . Which means is real. QED.
So the main tools to use in this proof: Definition of and symmetry relation and that . This
might come up in the exam.
2.3 Proof eigenfunctions are unique
Now will show that there is one eigenfunction associated with each eigenvalue (Again, this is
for 1D, it is possible to get more than one eigenfunction for same eigenvalue for 2D, as
mentioned earlier). By contradiction, assume that has two eigenfunctions associated with
it. Hence
From the first equation, , substituting this into the second equation gives
By Lagrange identity, , hence this means that
Where is some constant. This is the main difference between the above argument, and
between Lagrange identity. This can be confusing. So let me talk more about this. In
Lagrange identity, we write And when also satisfy the SL boundary condition, only then
we say that
But the above is not the same as saying . This is important to keep in mind. Only the
integral form is zero for any two functions with the SL B.C..Now we continue. We showed
that . In SL, this constant is zero due to B.C. Hence But by definition. Hence
or
or . So the eigenfunctions are linearly dependent. One is just scaled version of the other. But
eigenfunction must be linearly independent.
Hence assumption is not valid, and there can not be two linearly independent eigenfunctions
for same eigenvalue. Notice also that is the just the Wronskian. When it is zero, we know
the functions are linearly dependent. The important part in the above proof, is that only
when happened to have same eigenvalue.
2.4 Proof eigenfunctions are real
The idea of this proof is to assume the eigenfunction is complex, then show that its real part
and its complex part both satisfy the ODE and the boundary conditions. But since they are
both use the same eigenvalue, then the real part and the complex part must be
linearly dependent. This implies the eigenfunction must be real. (think of Argand
diagram)
Assume that is complex eigenfunction with real part and complex part . Then since The
above is just writing the Sturm-Liouville ODE in operator form, where is the operator as
above. Now we have By linearity of operator Which implies
So we showed that the real and complex part satisfy S.L. Now we need to show they are
satisfy S.L. boundary conditions. Since
Where are the left and right ends of the domain. Then
Hence
So the above means both and satisfy the boundary conditions of S.L. But since both have
the same eigenvalue, then they must be linearly dependent, since we know with S.L. each
eigenfunction (or one linearly dependent to it) have only one eigenvalue. This means Where
is some constant. In other words,
Where is new constant. (OK it happens to be complex constant, but it is OK to do so, we
always do this trick in other places, if it will make me feel better, I could take the magnitude
of the constant). So all what the above says, is that we assumed to be complex, and found
that it is real. So it can’t be complex.
2.5 Proof eigenfunctions are orthogonal with weight
Given two different eigenfunctions . Hence
From symmetry integral relation, since these eigenfunctions also satisfy S.L. boundary
conditions, we can write Replacing (1,2) into the above
But since there are different eigenvalues for different eigenfunctions. Hence Which means
are orthogonal to each others with weight
3 Special functions generated from solving singular Sturm-Liouville
When S.L. is singular, meaning at one or both ends, we end with important class of ODE’s,
whose solutions are special functions (not or ) as the case with the regular S.L. Recall that
S.L. is
With the regular S.L., we say that over the whole domain, including end points. But
with singular, this is not the case. Here are three important S.L. ODE’s that are
singular.
Bessel equation:
Or in standard form Comparing to SL form, then . At , then , which is what makes it
singular. (also the weight happens to be zero), but we only care about being zero or not, at
one of the ends. So to check if S.L. is singular or not, just need to check if is zero or not at
one of the ends.
As mentioned before, when at one of the ends, we can’t use the standard B.C. for the
regular S.L. instead, at the end where , we must use what is called bounded boundary
conditions, which is in this case . The solution to this ODE will be in terms of Bessel
functions. Notice that this happened to be singular, due to the domain starting at . If the
domain happened to be from to say , then it is no longer singular S.L. but regular
one.
Legendre equation Or it can be written as We see that . And now it happens to be that at .
So it is singular at the other end compared to Bessel. In this case . Again, the boundary
conditions at must now be bounded. i.e. . On the other end, where is not zero, we still use
the standard boundary conditions .
Chebyshev equation Or So we see that . So where is here? At then and at then also. So
this is singular at both ends. So we need to use bounded boundary conditions at both ends
now to solve this.
The solution to this singular S.L. is given in terms of special function Chebyshev.
4 General notes on S-L
Solution to . Set in S-L form, we get , then . For with . This is singular S-L since is zero at
one end. Using Frobenius series, the solution comes out to be , where is Bessel function of
first kind. are roots of . (check if this should be . is constant, which still needs to be
found.
We can find approximate solution to S-L ODE for large eigenvalue using WKB so we do not
have to solve the ode using series method. Given which is standard form S-L. Using
physical optics correction, we obtain the solution as Where are found from boundary
conditions now. The above is valid for large , and is found by first letting and then
assuming . And working though the WKB method. Remember, WKB only works
for linear homogeneous ODE and is used to estimate solution for large (or small
).
in 1D, regular S-L is with , with B.C. given by and and in higher dimensions, this
problem becomes with boundary conditions now written as on boundary . In higher
dimensions, .
Lagrange’s identity in 1D is where above is the S-L operator, as in so that we write .
When it says eigenfunctions are normalized, it should mean that where is the weight
function (comes from SL equation) and is the domain. This is for 1D SL. Remember for
example, that . where . Here, it is not normalized since the result is not one. The weight here
is just one and here
Heat PDE in cylinder is For steady state, it becomes, say cylinder has length
And for axis symmetric (no dependency), it becomes The solution is found by
separation of variables. For the above is in terms of Bessel function order zero
which is found using series method . The eigenvalues are zeros of where is disk
radius.
Heat PDE in 2D polar is and for steady state it becomes . After separating, the ODE
becomes Euler ODE. Use guess for solution. The solution will be To change to S.L. form,
multiply the ODE by
In using method of eigenfunction expansion to solve nonhomogeneous B.C. PDE, the
eigenfunction used are the ones from solving the PDE with homogeneous B.C. For example,
if given with nonhomogeneous B.C. , say , and we want to use on the nonhomogeneous
B.C. PDE, then those are from the corresponding homogeneous B.C. PDE. They should be
something like and so on.
See the table at the top. That is why we write above and not and remember not to do
term-by-term differentiation in when this is the case. We can do term-by-term
differentiation only of the PDE itself also has homogeneous B.C. , even if it had a source
term there also. The point is, always come from solving the homogeneous B.C. PDE. (This
normally means solving a Sturm-Liouville ODE).
I found new relation for eigenvalue , but it is probably not useful as the one the book has.
Here how to derive it
Given S-L Let be the eigenfunction associated with eigenvalue . Since eigenfunctions satisfy
the ODE itself, then we can write, for any arbitrary eigenfunction (subscript removed for
clarity in what follows)
Integrating both sides Looking at . Integrating by part. Let , hence
Substituting (3) into (2) gives
Hence compare to the one in the book .
This is about using eigenfunction expansion to approximate . Given the eigenfunctions
found by solving SL problem, then these eigenfunctions are called complete in the mean
sense, if converges to the mean of at each point in .
This requires only that be square integrable over the domain. We can also say that the
eigenfunctions are complete, in pointwise sense, if converges to at each point in . This
however requires that be continuous in and be piecewise continuous.
So clearly the first case is less strict than the second. This means if is only square integrable
(and not necessarily continuos), then convergence to the mean of at each point can be
obtained. To obtain the more strict point wise convergence, more restrictions are needed on
as mentioned above. Clearly point wise convergence implies convergence in the mean. But
not the other way around.
For Sturm-Liouville, write it as . Now it is easy to compare to. For example, , then we see
that . And for then . Do not divide by first to re-write as and then compare, as this
makes it hard to compare to standard form and will make error. Books write
Sturm-Liouville as but I find this confusing to compare to this way. Better to
expand it to first so now it is in a more familiar form and can read it out more
directly.
5 Methods of solutions to some Sturm Liouville problems
Problem 1 If the problem gives S-L equations, and asks to find estimate on the smallest
eigenvalue, then use Rayleigh quotient for . And write . Now we do not need to solve the SL
to find solution , this is the whole point. Everything in RHS is given, except for, of course
the solution .
Here comes the main idea: Come up with any trial and use it in place of . This trial
function just needs to satisfy the boundary conditions, which is also given. Then all
what we need to do is just evaluate the integral. Pick the simplest function of
which satisfies boundary conditions. All other terms we can read from the given
problem.
At the end, we should get a numerical value for the integral. This is the upper limit of the
lowest eigenvalue
Problem 2 We are given SL problem with boundary conditions, and asked to show that
without solving the ODE to find :
Use Rayleigh quotient and argue that the denominator can’t be zero (else eigenfunction is
zero, which is not possible) and it also can’t be negative. Argue about the numerator not
being negative. Do not solve the ODE! this is the whole point of using Rayleigh
quotient.
Problem We are given an SL problem with boundary conditions and asked to estimate large
and corresponding eigenfunction.
This is different from being asked to estimate the smallest eigenvalue, where we use Rayleigh
quotient and trial function. In this one, we instead use WKB. For 1D, just use what is called
the physical optics correction method, given by . Where in this are all known functions
(from the problem itself). Notice that is not there.
Now use the boundary conditions and solve for one of the constants, should come to be zero.
Use the second boundary condition and (remember, the boundary conditions are
homogenous), and we get one equation in and for non-trivial solution, solve for allowed
values of . This gives the large eigenvalue estimate. i.e. for when is very large. Depending
on problem, does not have be too large to get good accurate estimate compared with exact
solution. See HW7, problem 5.9.2 for example.
Problem We are given 1D heat ODE . If B.C. are homogenous, we are done. We know the
solution. Separation of variables. If the B.C. is not homogenous, then we have a small
problem. We can’t do separation of variables. If you can find equilibrium solution, , where
this solution only needs to satisfy the non-homogenous B.C. then write solution as , and plug
this in the PDE.
This will give but this now satisfies the homogenous B.C. This is the case if the original
non-homogenous B.C. were Dirichlet. If they were Neumann, then we will get where is
extra term that do not vanish because do not vanish in this case after double
differentiating. We solve for now, since it has homogenous B.C., but it has extra
term there, which we treat as new source. We apply initial conditions now also
to final all the eigenfunction expansion coefficients. We are done. Now we know
But if we can find the reference function or we do not want to use this method, then we can
use another method, called eigenfunction expansion. We assume and plug this
in the PDE. But now we can’t do term by term differentiation, since are the
eigenfunctions that satisfies the homogenous B.C., while has non-homogenous
B.C.
So the trick is to use Green formula, to rewrite as plus the contribution from the boundary
terms. (this is like doing integration by parts twice, but much easier). Now rewrite . See page
355-356, Haberman.
6 Examples converting second order linear ODE to Sturm-Liouville
Any linear second order ODE can be converted to S-L form. The S-L form is the following
Some books write the above with a plus sign there (for example, Haberman) and some with
for example Boyce/DiPrima. I really never understood why some put a minus sign and
some do not. May be I’ll find out one day. But for now will use (1) is used as the
S-L form. If your book uses the plus sign, the same process works, just flip the
sign.
The goal now, is given any general form eigenvalue ODE or second order ODE
(with no eigenvalue ), we want to convert it (rewrite it) in the above form. The
second order linear ODE will have this form The parameter in the S-L form, is the
eigenvalue. We really only use S-L form for eigenvalue problems, but if the is
missing, then the form of the input will be And the process will work the same
way.
First, we will show how to convert (2) to (1), and then show few examples. We are given (2),
and want to convert it to (1). The first step is to convert (2) to standard form Then multiply
the above by some unknown which is assumed positive. This is called the integrating factor
Now, rewrite as in the above
Here comes the main trick in all of this. We want to force to be zero. This implies This
comes from just solving the ODE . Therefore, if , then (3) becomes
We are done. Compare (4) to the SL form We see that, given then
Let see how this works on some examples
6.1 Example 1
Convert to S-L. We see the general form here is , with . Hence , therefore the SL form is
Where
Hence (1) becomes This was easy, since the ODE given was already in SL form.
6.2 Example 2
Convert Bessel ODE to SL
Comparing the above to , we see that . Hence Therefore the SL form is Where
Therefore (1) becomes Let check if the above gives back . Expanding the above gives
Multiplying by
This verifies that (2) is the Sturm-Liouville form for Bessel ODE
6.3 Example 3
Convert Legendre ODE to SL The general form here is , with and . Hence
Hence SL is
Let check. Expanding the above gives Multiplying both sides by gives Which is the original
form of Legendre ODE
6.4 Example 4
Convert Rewrite as The general form here is , with . Hence Therefore the SL form is
Where
Hence (1) becomes
6.5 Example 5
Convert We see the general form here is , with . Hence But , therefore . Therefore the
SL form is Where
Hence (1) becomes
6.6 Example 6
Convert The general form here is , with and . Hence Therefore the SL form is Where
Hence (1) becomes We see that the above is the same as because And hence (2) becomes
Canceling gives
Which is the original ODE.
6.7 Example 7
Convert The general form here is , with and . Hence Therefore the SL form is Where
Hence (1) becomes The above is the same as because And hence (2) becomes Canceling
gives
Which is the original ODE.