home

PDF (letter size)

Notes on Sturm Liouville

Nasser M. Abbasi

April 15, 2019   Compiled on January 30, 2024 at 3:31am

Contents

1 Definitions
2 Proofs
2.1 Proof symmetry of operator
2.2 Proof eigenvalues are real
2.3 Proof eigenfunctions are unique
2.4 Proof eigenfunctions are real
2.5 Proof eigenfunctions are orthogonal with weight
3 Special functions generated from solving singular Sturm-Liouville
4 General notes on S-L
5 Methods of solutions to some Sturm Liouville problems
6 Examples converting second order linear ODE to Sturm-Liouville
6.1 Example 1
6.2 Example 2
6.3 Example 3
6.4 Example 4
6.5 Example 5
6.6 Example 6
6.7 Example 7

1 Definitions

Regular Sturm-Liouville ODE is an eigenvalue boundary value ODE. This means the ODE has an eigenvalue in it λ, where solutions exists only for specific values of eigenvalues. The ODE isddx(p(x)dydx)+q(x)y(x)+λσ(x)y(x)=0a<x<b

Some books use q(x)y(x) in the above instead of +q(x)y(x). The above also can be written as

(1)py+py+(q+λσ)y=0(2)(py)+qy=λσy

With the restrictions that p(x),q(x),σ(x) are real functions that are continuous everywhere over axb and also we need p(x)>0,σ(x)>0. The σ(x) is called the weight function. But this is not all. The boundary conditions must be linear homogeneous, of this formβ1y(a)+β2y(a)=0β3y(b)+β4y(b)=0

Where βi are just real constants. Some of them can be zero but not all. For example, β1y(a)=0,β4y(b)=0 is OK.

Boundary conditions do not have to mixed, but they can be in general. But they must be homogeneous.

Notice that periodic boundary conditions are not allowed. Well, they are allowed, but then the problem is no longer called Sturm-Liouville. The above is just the definition of the equation and its boundary conditions. Below is list of the important properties of this ODE. Each one of these properties have a proof.

The standard way to write S.L. is in the form (2) and not (1) above. This is because the S.L. operator L is defined L[y]=λσy Where L[y](py)+qy. So we break (q+λσ)y into two parts, putting the part with the eigenvalue to the right.

All eigenvalues λn are real. No complex λn.

Each eigenvalue λn, will have one and only one real eigenfunction ϕn(x) associated with it. (in higher dimensions, the eigenvalue problem can have more than one eigenfunction associated with one eigenvalue. For example, heat PDE in rectangle). But we can use Gram-schmidt to make these eigenfunctions orthogonal if we need to). But for exam, just worry about 1D for now.

There is smallest λ, called λ1 and there are infinite number of eigenvalues. Hence eigenvalues are ordered λ1<λ2<λ3

Each eigenfunction ϕn(x) will have n1 zeros in a<x<b. Note that this does not include the end points. This means, ϕ3(x) will have two zeros inside the domain. i.e. there are two x locations where ϕ3(x)=0 in a<x<b.

Eigenfunctions ϕn(x) make a complete set of basis. This means any piecewise continuous function f(x) can be represented as f(x)n=1anϕn(x). This is called the generalized Fourier series.

This follows from (5). Each eigenfunction is orthogonal to another eigenfunction, but with the weight in there. This means abϕn(x)ϕm(x)σ(x)dx=0 if nm.

Rayleigh quotient relates an eigenvalue to its eigenfunction. Starting with the SL ODE (pϕ)+qϕ=λσϕ, then multiplying by ϕ and integrating, we obtain abϕ(pϕ)+qϕ2dx= abλσϕ2dx and solving for λ, gives λ=abϕ(pϕ)+qϕ2dxabϕ2σ(x)dx Carrying integration by parts on first part of the integral in numerator, it becomes λ=(pϕϕ)ba+abp(ϕ)2qϕ2dxabϕ2σ(x)dx But this becomes much simpler when we plug-in the boundary conditions that we must use, making the above λ=abp(ϕ)2qϕ2dxabϕ2σ(x)dx And if q=0,p=1 and σ(x)=1, then it becomes λ=ab(ϕ)2dxabϕ2dx. Rayleigh quotient is useful to show that λ can be positive without solving for ϕ and also used to estimate the value of the minimum λ by replacing ϕ by a trial function and actually solving for λ using λ=abϕ(pϕ)+qϕ2dxabϕ2σ(x)dx to obtain a numerical estimate of λ1.

There is symmetry relation. In operator form, let Lddx(p(x)ddx)+q(x), then we have abuL[v]vL[u]dx=0 where u,v are any two different eigenfunctions.

There is also what is called Lagrange identity, which says uL[v]vL[u]=ddx(p(uvvu)). This comes from simply expanding L[v],L[u] and simplifying things. Just calculus.

Green formula, follows from Lagrange identity, which just gives the integral form abuL[v]vL[u]dx=[p(uvvu)]ab. But form Sturm-Liouville, we know that abuL[v]vL[u]dx=0 (from 8). So this really just says that [p(uvvu)]ab=0. But we know this already from boundary conditions. So I am not sure why this is useful for Sturm-Liouville now. Since it is just saying the same thing again.

If p(x)=0 at the left or right end (boundaries), then the problem is now called singular Sturm-Liouville. This is actually the important case. In regular S.L., p(x) must be positive everywhere. We only consider p(x)=0 at the ends, not in the middle or any other place. When this happens, then solutions to the ODE generate special functions, such as Bessel, Legendre, Chebyshev. These special functions are solution to the Singular S.L. ODE, not the regular S.L. ODE. In addition, at the end where p(x)=0, the boundary conditions must be bounded. Not the regular boundary conditions above. For example, if p(a)=0 where x=a is say the left end, the the boundary conditions at the left end must be y(a)<,y(a)<. Notice that singular can be at left end, right end or at both at same time. At the end where p(x)=0, the boundary conditions must be bounded type.

Symmetry relation abuL[v]vL[u]dx=0 where u,v are are two eigenfunction. But remember, this does NOT mean the integrand is identically zero, or uL[v]vL[u]=0. Only when u,v happened to have same eigenvalue we can say that uL[v]vL[u]=0. But for unique eigenfunctions only the integral form is zero abuL[v]vL[u]dx=0. This is important difference, used in the proof below, that different eigenfunctions have different eigenvalues (for the scalar SL case). For higher dimensions, we can have more than one eigenfunction for same eigenvalue.

Small note added April 18, 2021. The above is the same as showing the operator L is Hermitian. In bra/ket Dirac notation, an operator L is Hermitian when

u|L|v=v|L|u(u(x)L[v(x)]dx)=v(x)L[u(x)]dxu(x)L[v(x)]dx=v(x)L[u(x)]dx

In the proofs below, I did not do the complex conjugate, since it is assumed all eigenfunctions u,v are real, and I only wrote that the symmetry relation implies

u(x)L[v(x)]dx=v(x)L[u(x)]dx

It should be really called the Hermitian relation to be more accurate. But for real eigenfunctions, it is the same.

2 Proofs

2.1 Proof symmetry of operator

Given regular Sturm-Liouville (RSL) ODE(1)(py)+qy=λσypy+py+qy=λσy

In operator form Lddx(p(x)ddx)+q(x) And (1) becomesL[y]=λσy When solving RSL ode, since it is an eigenvalue ODE with associated boundary conditions, we will get infinite number of non-negative eigenvalues.

For each eigenvalue, there is associated with it one eigenfunction (for 1D case). Looking at any two different eigenfunctions, say u,v, then the symmetry relation says the followingabuL[v]dx=abvL[u]dx Now we will show the above is true. This requires integration by parts two times. We start from the LHS expression and at the end we should end up with the integral on the RHS. Let I=abuL[v]dx, then (1)I=abuL[v]dx=abu(ddx(pddxv)+qv)dx

Where we used L[v]=(ddx(pddx)+q)v=(ddx(pddxv)+qv) in the above. Hence(2)I=abuddx(pdvdx)dVdx+abqvudx Now we will do integration by parts on the first integral. Let I2=abuddx(pdvdx)dx. Using UdV=UVVdU, and if we let U=u,dV=ddx(pdvdx), then dU=dudx,V=pdvdx. Hence I2=(updvdx)ababdvdxdVpdudxUdx We now apply integration by parts again to the second integral above. But now let U=pdudx and dV=dvdx, hence dU=ddx(pdudx) and V=v, therefore the above becomesI2=(updvdx)ab[(pdudxv)ababvddx(pdudx)dx] Substituting the above into (2) gives(3)I=(updvdx)ab[(pdudxv)ababvddx(pdudx)dx]+abqvudx Now comes the part where the boundary conditions are important. In RSL, the boundary conditions are such that all the terms (updvdx)ab,(pdudxv)ab vanish. This is because the boundary conditions areβ1u(a)+β2u(a)=0β3u(b)+β4u(b)=0

Andβ1v(a)+β2v(a)=0β3v(b)+β4v(b)=0

So now (3) becomesI=[abvddx(pdudx)dx]+abqvudx=abvddx(pdudx)dx+abqvudx=abv(ddx(pdudx)+qu)dx=abvL[u]dx

Therefore, we showed that abuL[v]dx=abvL[u]dx. The only thing to watch for here, is which term to make U and which to make dV when making integration by parts. To remember, always start with uL[v] and make dV the term with dvdx in them during integration parts.

2.2 Proof eigenvalues are real

Assume λ is complex. The corresponding eigenfunctions are complex also. Let ϕ be corresponding eigenfunction(1)L[ϕ]=λσϕ Taking complex conjugate of both sides, and since σ is real, we obtainL[ϕ]=λσϕ But L[ϕ]=L[ϕ] since all the coefficients of the ODE are real. The above becomes(2)L[ϕ]=λσϕ But by symmetry, we know thatabϕL[ϕ]ϕL[ϕ]dx=0 Substituting  (1),(2) into the above givesabϕ(λσϕ)ϕ(λσϕ)dx=0abλσϕϕ+λσϕϕdx=0ab(λλ)(σϕϕ)dx=0(λλ)abσϕϕdx=0

But ϕϕ=|ϕ| which is positive. Also the weight σ(x) is positive by definition. Hence for the above to zero, it must be that λ=λ. Which means λ is real. QED.

So the main tools to use in this proof: Definition of L[ϕ]=λσϕ, and symmetry relation and that L[ϕ]=L[ϕ]. This might come up in the exam.

2.3 Proof eigenfunctions are unique

Now will show that there is one eigenfunction associated with each eigenvalue (Again, this is for 1D, it is possible to get more than one eigenfunction for same eigenvalue for 2D, as mentioned earlier).  By contradiction, assume that λ has two eigenfunctions ϕ1,ϕ2 associated with it. HenceL[ϕ1]=λσϕ1L[ϕ2]=λσϕ2

From the first equation, λ=L[ϕ1]σϕ1, substituting this into the second equation givesL[ϕ2]=L[ϕ1]ϕ1ϕ2ϕ1L[ϕ2]ϕ2L[ϕ1]=0

By Lagrange identity, ϕ1L[ϕ2]ϕ2L[ϕ1]=ddx(p(ϕ1ϕ2ϕ2ϕ1)), hence this means that ddx(p(ϕ1ϕ2ϕ2ϕ1))=0p(ϕ1ϕ2ϕ2ϕ1)=c1

Where c1 is some constant. This is the main difference between the above argument, and between Lagrange identity. This can be confusing. So let me talk more about this. In Lagrange identity, we write ϕ1L[ϕ2]ϕ2L[ϕ1]=ddx(p(ϕ1ϕ2ϕ2ϕ1)) And when ϕ2,ϕ1 also satisfy the SL boundary condition, only then we say thatabϕ1L[ϕ2]ϕ2L[ϕ1]dx=0p(ϕ1ϕ2ϕ2ϕ1)|ab=0

But the above is not the same as saying ϕ1L[ϕ2]ϕ2L[ϕ1]=0. This is important to keep in mind. Only the integral form is zero for any two functions with the SL B.C..Now we continue. We showed that p(ϕ1ϕ2ϕ2ϕ1)=c. In SL, this constant is zero due to B.C. Hencep(ϕ1ϕ2ϕ2ϕ1)=0 But p>0 by definition. Hence ϕ1ϕ2ϕ2ϕ1=0 orddx(ϕ2ϕ1)=0ϕ2ϕ1=c2

or ϕ2=c2ϕ1. So the eigenfunctions are linearly dependent. One is just scaled version of the other. But eigenfunction must be linearly independent.

Hence assumption is not valid, and there can not be two linearly independent eigenfunctions for same eigenvalue. Notice also that ϕ1ϕ2ϕ2ϕ1=0 is the just the Wronskian. When it is zero, we know the functions are linearly dependent. The important part in the above proof, is that ϕ1ϕ2ϕ2ϕ1=0 only when ϕ1,ϕ2 happened to have same eigenvalue.

2.4 Proof eigenfunctions are real

The idea of this proof is to assume the eigenfunction is complex, then show that its real part and its complex part both satisfy the ODE and the boundary conditions. But since they are both use the same eigenvalue, then the real part and the complex part must be linearly dependent. This implies the eigenfunction must be real. (think of Argand diagram)

Assume that ϕ=U+iV is complex eigenfunction with real part U and complex part V. Then sinceL[ϕ]=λσϕ The above is just writing the Sturm-Liouville ODE in operator form, where L is the operator as above.  Now we haveL[U+iV]=λσ(U+iV) By linearity of operator LL[U]+iL[V]=λσUiλσV Which impliesL[U]=λσUL[V]=λσV

So we showed that the real and complex part satisfy S.L. Now we need to show they are satisfy S.L. boundary conditions. Sinceβ1ϕ(a)+β2ϕ(a)=0β3ϕ(b)+β4ϕ(b)=0

Where x=a,x=b are the left and right ends of the domain. Thenβ1(U(a)+iV(a))+β2(U(a)+iV(a))=0β3(U(b)+iV(b))+β4(U(b)+iV(b))=0

Henceβ1U(a)+β2U(a)=0β1V(a)+β2V(a)=0β3U(b)+β4U(b)=0β3V(b)+β4V(b)=0

So the above means both U and V satisfy the boundary conditions of S.L. But since both U,V have the same eigenvalue, then they must be linearly dependent, since we know with S.L. each eigenfunction (or one linearly dependent to it) have only one eigenvalue. This meansV=cU Where c is some constant. In other words, ϕ=U+iV=U+icU=U(1+ic)=c0U

Where c0 is new constant. (OK it happens to be complex constant, but it is OK to do so, we always do this trick in other places, if it will make me feel better, I could take the magnitude of the constant). So all what the above says, is that we assumed ϕ to be complex, and found that it is real. So it can’t be complex.

2.5 Proof eigenfunctions are orthogonal with weight

Given two different eigenfunctions ϕ1,ϕ2. Hence(1)L[ϕ1]=λ1σϕ1(2)L[ϕ2]=λ2σϕ2

From symmetry integral relation, since these eigenfunctions also satisfy S.L. boundary conditions, we can writeabϕ1L[ϕ2]ϕ2L[ϕ1]dx=0 Replacing (1,2) into the aboveabϕ1(λ2σϕ2)ϕ2(λ1σϕ1)dx=0abλ2σϕ1ϕ2+λ1σϕ2ϕ1dx=0ab(λ1λ2)(σϕ1ϕ2)dx=0(λ1λ2)abσϕ1ϕ2dx=0

But (λ1λ2)0 since there are different eigenvalues for different eigenfunctions. Hence abσϕ1(x)ϕ2(x)dx=0 Which means ϕ1,ϕ2 are orthogonal to each others with weight σ(x)

3 Special functions generated from solving singular Sturm-Liouville

When S.L. is singular, meaning p=0 at one or both ends, we end with important class of ODE’s, whose solutions are special functions (not sin or cos) as the case with the regular S.L. Recall that S.L. is(1)ddx(p(x)dydx)+q(x)y(x)+λσ(x)y(x)=0py+py+qy=λσy

With the regular S.L., we say that p(x)>0 over the whole domain, including end points. But with singular, this is not the case.  Here are three important S.L. ODE’s that are singular.

Bessel equation: x2y+xy+λx2y=00<x<1xy+y+λxy=0

Or in standard form(xy)=λxy Comparing to SL form, then p(x)=x,q(x)=0,σ(x)=x. At x=0, then p(x)=0, which is what makes it singular. (also the weight happens to be zero), but we only care about p being zero or not, at one of the ends. So to check if S.L. is singular or not, just need to check if p(x) is zero or not at one of the ends.

As mentioned before, when p=0 at one of the ends, we can’t use the standard B.C. for the regular S.L. instead, at the end where p=0, we must use what is called bounded boundary conditions, which is in this case y(0)<,y(0)<. The solution to this ODE will be in terms of Bessel functions. Notice that this happened to be singular, due to the domain starting at x=0. If the domain happened to be from x=0.5 to say x=1, then it is no longer singular S.L. but regular one.

Legendre equation ((1x2)y)=λy0<x<1 Or it can be written as(1x2)y2xy=λy We see that p=1x2. And now it happens to be that p=0 at x=1. So it is singular at the other end compared to Bessel. In this case q=0,σ=1. Again, the boundary conditions at x=1 must now be bounded. i.e. y(1)<,y(1)<. On the other end, where p is not zero, we still use the standard boundary conditions  β1y(0)+β2y(0)=0.

Chebyshev equation(1x2y)=λ1x2y1<x<1 Or1x2y122x1x2y=λ1x2y So we see that p=1x2,q=0,σ=1x2. So where is p=0 here? At x=1 then p=11=0 and at x=1 then p=0 also. So this is singular at both ends. So we need to use bounded boundary conditions at both ends now to solve this.y(1)<y(1)<y(+1)<y(+1)<

The solution to this singular S.L. is given in terms of special function Chebyshev.

4 General notes on S-L

Solution to xy+y=λxy. Set in S-L form, we get (py)+qy=λσy, then p=x,q=0,σ=x. For 0<x<b with y(b)=0,y(0)<. This is singular S-L since p=x is zero at one end. Using Frobenius series, the solution comes out to be y(x)=n=1cnJ0(λnx), where J0 is Bessel function of first kind. λn are roots of 0=J0(λb). (check if this should be 0=J0(λ(ba)). cn is constant, which still needs to be found.

We can find approximate solution to S-L ODE for large eigenvalue using WKB so we do not have to solve the ode using series method. Given (py)+qy=λσy which is standard form S-L. Using physical optics correction, we obtain the solution asy(x)(σ(x)p(x))14(c1cos(λ0xσ(t)p(t)dt)+c1sin(λ0xσ(t)p(t)dt)) Where c1,c2 are found from boundary conditions now. The above is valid for large λ, and is found by first letting ε2=1λ and then assuming y(x)=exp(1εn=0εnSn(x)). And working though the WKB method. Remember, WKB only works for linear homogeneous ODE and is used to estimate solution for large λ (or small ε).

in 1D, regular S-L is (py)+qy=λσy with 0<x<L, with B.C. given by β1y(0)+β2y(0)=0 and β1y(L)+β2y(L)=0, and in higher dimensions, this problem becomes (pϕ)+qϕ=λσϕ with boundary conditions now written as β1ϕ+β2(ϕn^)=0 on boundary Γ. In higher dimensions, ϕϕ(x¯).

Lagrange’s identity in 1D is uL(v)vL(u)=ddx(p(udvdxvdudx)) where L above is the S-L operator, as in Lddx(pddx)+q so that we write L(u)=λσy.  

When it says eigenfunctions are normalized, it should mean that  abϕn2(x)σ(x) dx=1 where σ(x) is the weight function (comes from SL equation) and a<x<b is the domain. This is for 1D SL. Remember for example, that 0Lcos2(λnx)dx=L2. where λn=nπL. Here, it is not normalized since the result is not one. The weight here is just one and here ϕn(x)=cos(nπLx)

Heat PDE in cylinder is T(r,z,θ,t)t=k(2Tr2+1rTr+2Tz2+1r22Tθ2). For steady state, it becomes, say cylinder has length L 0=2T(r,z,θ)=2Tr2+1rTr+2Tz2+1r22Tθ2|T(0,z,θ)|<|T(0,z,θ)|<T(r,0)=f(r)R(r,L)=g(r)

And for axis symmetric (no θ dependency), it becomes 0=2T(r,z)=2Tr2+1rTr+2Tz2 The solution is found by separation of variables. For the above is in terms of Bessel function order zeroT(r,z)=n=1J0(λ0r)(Aneλnz+Bneλnz) which is found using series method T=n=0anrn+α. The eigenvalues λn are zeros of 0=J0(λr0) where r0 is disk radius.

Heat PDE in 2D polar is T(r,θ,t)t=k(2Tr2+1rTr+1r22Tθ2) and for steady state it becomes 0=2Tr2+1rTr+1r22Tθ2. After separating, the r ODE becomes Euler ODE. Use guess R(r)=rp for solution. The solution will beT(r,z)=A0+n=1Ancos(nθ)rn+n=1Bnsin(nθ)rn To change p(x)y+q(x)y+r(x)y(x)=0 to S.L. form, multiply the ODE by μ=1p(x)exp(q(x)p(x)dx)

In using method of eigenfunction expansion to solve nonhomogeneous B.C. PDE, the eigenfunction used ϕn(x) are the ones from solving the PDE with homogeneous B.C. For example, if given ut=k2ux2 with nonhomogeneous B.C. , say u(0,t)=A,u(L,t)=B, and we want to use u(x,t)n=1bn(t)ϕn(x) on the nonhomogeneous B.C. PDE, then those ϕn(x) are from the corresponding homogeneous B.C. PDE. They should be something like sin(nπLx) and so on.

See the table at the top. That is why we write above and not = and remember not to do term-by-term differentiation in x when this is the case. We can do term-by-term differentiation only of the PDE itself also has homogeneous B.C. , even if it had a source term there also. The point is, ϕn(x) always come from solving the homogeneous B.C. PDE. (This normally means solving a Sturm-Liouville ODE).

I found new relation for eigenvalue λ, but it is probably not useful as the one the book has. Here how to derive it
Given S-L (1)ddx(pdydx)+qy=λσy Let ϕn(x) be the eigenfunction associated with eigenvalue λn. Since eigenfunctions satisfy the ODE itself, then we can write, for any arbitrary eigenfunction (subscript removed for clarity in what follows)ddx(pϕ)+qϕ=λσϕpϕ+pϕ+qϕ=λσϕ

Integrating both sides(2)abpϕdx+abpϕdx+abqϕdx=λabσϕdx Looking at abpϕdx. Integrating by part. Let u=p,dv=ϕdu=p,v=ϕ, henceabpϕdx=[uv]ababvdu(3)=[pϕ]ababpϕdx

Substituting (3) into (2) givesabpϕdx+[pϕ]0Labpϕdx+abqϕdx=λabσϕdx[pϕ]0L+abqϕdx=λabσϕdx

Henceλ=[pϕ]ab+abqϕdxabσϕdx compare to the one in the book λ=(pϕϕ)ba+abp(ϕ)2qϕ2dxabϕ2σ(x)dx.

This is about using eigenfunction expansion to approximate f(x). Given the eigenfunctions Φn(x) found by solving SL L[y]=λry problem, then these eigenfunctions are called complete in the mean sense, if n=1bnΦn(x) converges to the mean of f(x) at each point in axb.

This requires only that f(x) be square integrable over the domain. We can also say that the eigenfunctions are complete, in pointwise sense, if n=1bnΦn(x) converges to f(x) at each point in axb. This however requires that f(x) be continuous in axb and f(x) be piecewise continuous.

So clearly the first case is less strict than the second. This means if f(x) is only square integrable (and not necessarily continuos), then convergence to the mean of f(x) at each point can be obtained. To obtain the more strict point wise convergence, more restrictions are needed on f(x) as mentioned above. Clearly point wise convergence implies convergence in the mean. But not the other way around.

For Sturm-Liouville, write it as py+py+(q+λσ)y=0. Now it is easy to compare to. For example, x+λx=0, then we see that p=1,q=0,σ=1. And for xy+x+λxy=0 then p=1,q=0,σ=x. Do not divide by x first to re-write xy+x+λxy=0 as y+1xx+λy=0 and then compare, as this makes it hard to compare to standard form and will make error.  Books write Sturm-Liouville as ddx(py)+qy+λσy=0 but I find this confusing to compare to this way. Better to expand it to py+py+(q+λσ)y=0 first so now it is in a more familiar form and can read it out more directly.

5 Methods of solutions to some Sturm Liouville problems

Problem 1 If the problem gives S-L equations, and asks to find estimate on the smallest eigenvalue, then use Rayleigh quotient for λ. And write λmin=λ1(pϕϕ)ba+abp(ϕ)2qϕ2dxabϕ2σ(x)dx. Now we do not need to solve the SL to find solution ϕ, this is the whole point. Everything in RHS is given, except for, of course the solution ϕ.

Here comes the main idea: Come up with any trial ϕtrial and use it in place of ϕ. This trial function just needs to satisfy the boundary conditions, which is also given. Then all what we need to do is just evaluate the integral. Pick the simplest ϕtrial function of x which satisfies boundary conditions. All other terms p,q,σ we can read from the given problem.

At the end, we should get a numerical value for the integral. This is the upper limit of the lowest eigenvalue λ1

Problem 2 We are given SL problem with boundary conditions, and asked to show that λ0 without solving the ODE to find ϕ:

Use Rayleigh quotient and argue that the denominator can’t be zero (else eigenfunction is zero, which is not possible) and it also can’t be negative. Argue about the numerator not being negative. Do not solve the ODE! this is the whole point of using Rayleigh quotient.

Problem We are given an SL problem with boundary conditions and asked to estimate large λ and corresponding eigenfunction.

This is different from being asked to estimate the smallest eigenvalue, where we use Rayleigh quotient and trial function. In this one, we instead use WKB. For 1D, just use what is called the physical optics correction method, given by ϕ(x)(σp)14(c1cos(λ0xσ(t)p(t)dt)+c2sin(λ0xσ(t)p(t)dt)). Where σ,p in this are all known functions (from the problem itself). Notice that q is not there.

Now use the boundary conditions and solve for one of the constants, should come to be zero. Use the second boundary condition and (remember, the boundary conditions are homogenous), and we get one equation in λ and for non-trivial solution, solve for allowed values of λn. This gives the large eigenvalue estimate. i.e. for λn when n is very large. Depending on problem, n does not have be too large to get good accurate estimate compared with exact solution. See HW7, problem 5.9.2 for example.

Problem We are given 1D heat ODE ut=k2ux2. If B.C. are homogenous, we are done. We know the solution. Separation of variables. If the B.C. is not homogenous, then we have a small problem. We can’t do separation of variables. If you can find equilibrium solution, ur(x), where this solution only needs to satisfy the non-homogenous B.C. then write solution as u(x,t)=v(x,t)+ur(x), and plug this in the PDE.

This will give vt=k2vx2 but this now satisfies the homogenous B.C. This is the case if the original non-homogenous B.C. were Dirichlet. If they were Neumann, then we will get vt=k2vx2+extra where extra is extra term that do not vanish because ur(x) do not vanish in this case after double differentiating. We solve for v(x,t) now, since it has homogenous B.C., but it has extra term there, which we treat as new source. We apply initial conditions now also to final all the eigenfunction expansion coefficients. We are done. Now we know u(x,t).

But if we can find the reference function ur or we do not want to use this method, then we can use another method, called eigenfunction expansion. We assume u(x,t)=n=1an(t)ϕn(x) and plug this in the PDE.  But now we can’t do term by term differentiation, since ϕn are the eigenfunctions that satisfies the homogenous B.C., while u(x,t) has non-homogenous B.C.

So the trick is to use Green formula, to rewrite 0Lk2ux2ϕn(x)dx as 0Lku2ϕn(x)x2dx plus the contribution from the boundary terms. (this is like doing integration by parts twice, but much easier). Now rewrite 2ϕn(x)x2=λnϕn(x). See page 355-356, Haberman.

6 Examples converting second order linear ODE to Sturm-Liouville

Any linear second order ODE can be converted to S-L form. The S-L form is the following(1)(p(x)dydx)q(x)y(x)+λr(x)y(x)=0 Some books write the above with a plus sign +q(x)y(x) there (for example, Haberman) and some with q(x)y(x) for example Boyce/DiPrima. I really never understood why some put a minus sign and some do not. May be I’ll find out one day. But for now will use (1) is used as the S-L form. If your book uses the plus sign, the same process works, just flip the sign.

The goal now, is given any general form eigenvalue ODE or second order ODE (with no eigenvalue λ), we want to convert it (rewrite it) in the above form. The second order linear ODE will have this form(2)a(x)y+b(x)y+(c(x)+λ)y=0 The parameter λ in the S-L form, is the eigenvalue. We really only use S-L form for eigenvalue problems, but if the λ is missing, then the form of the input will be a(x)y+b(x)y+c(x)y=0 And the process will work the same way.

First, we will show how to convert (2) to (1), and then show few examples. We are given (2), and want to convert it to (1). The first step is to convert (2) to standard formy+b(x)a(x)y+c(x)+λa(x)y=0 Then multiply the above by some unknown p(x) which is assumed positive. This is called the integrating factorp(x)y+p(x)b(x)a(x)y+p(x)c(x)+λa(x)y=0 Now, rewrite p(x)y as (p(x)y)p(x)y in the above((p(x)y)p(x)y)+p(x)b(x)a(x)y+p(x)(c(x)+λ)a(x)y=0(3)(p(x)y)+y(p(x)b(x)a(x)p(x))+p(x)(c(x)+λ)a(x)y=0

Here comes the main trick in all of this. We want to force p(x)b(x)a(x)p(x) to be zero. This implies p(x)=eb(x)a(x)dx This comes from just solving the ODE p(x)b(x)a(x)p(x)=0. Therefore, if p(x)=eb(x)a(x)dx, then (3) becomes(p(x)y)+(p(x)c(x)+λa(x))y=0(p(x)y)+(p(x)c(x)a(x)+p(x)a(x)λ)y=0(4)(p(x)y)+p(x)c(x)a(x)y+p(x)a(x)λy=0

We are done. Compare (4) to the SL form(p(x)dydx)q(x)y(x)+λr(x)y(x)=0  We see that, given a(x)y+b(x)y+(c(x)+λd)y=0 then

p(x)=ebadxq(x)=pcar(x)=pad

Let see how this works on some examples

6.1 Example 1

Convert y+5λy=0 to S-L.  We see the general form here is a(x)y+b(x)y+(c(x)+λd)y=0, with a=1,b=0,c=0,d=5. Hence p(x)=eb(x)a(x)dx=1, therefore the SL form is(1)(p(x)dydx)q(x)y(x)+λr(x)y(x)=0 Where p(x)=1q(x)=p(x)c(x)a(x)=0r(x)=p(x)a(x)d=5

Hence (1) becomes(y)+5λy=0 This was easy, since the ODE given was already in SL form.

6.2 Example 2

Convert Bessel ODE to SL x2y+xy+(x2n2)y=0

Comparing the above to a(x)y+b(x)y+(c(x)+λ)y=0, we see that a=x2,b=x,c=x2,d=1,λ=n2. Hence p(x)=eb(x)a(x)dx=exx2dx=elnx=x Therefore the SL form is(1)(p(x)dydx)q(x)y(x)+λr(x)y(x)=0 Where p(x)=xq(x)=p(x)c(x)a(x)=x(x2x2)=xr(x)=p(x)a(x)d=1x

Therefore (1) becomes(2)(xy)+xyn2xy(x)=0 Let check if the above gives back x2y+xy+(x2n2)y=0. Expanding the above gives xy+y+xyn2xy=0 Multiplying by xx2y+xy+x2yn2y=0x2y+xy+(x2n2)y=0

This verifies that (2) is the Sturm-Liouville form for Bessel ODE

6.3 Example 3

Convert Legendre ODE to SL(1x2)y2xy+n(n+1)y=0 The general form here is a(x)y+b(x)y+(c(x)+λ)y=0, with a=(1x2),b=2x,c=0,d=1 and λ=n(n+1). Hence p(x)=e2x(1x2)dx=eln(x21)=x21q(x)=pca=0r(x)=pad=x211x2=1

Hence SL is(py)qy+λry=0((x21)y)n(n+1)y=0

Let check. Expanding the above gives(x21)y+2xyn(n+1)y=0 Multiplying both sides by 1 gives (1x2)y2xy+n(n+1)y=0 Which is the original form of Legendre ODE

6.4 Example 4

Convert y2x1x2y+λ1x2y=0 Rewrite as(1x2)y2xy+λy=0 The general form here is a(x)y+b(x)y+(c(x)+λ)y=0, with a=(1x2),b=2x,c=0,d=1. Hence μ(x)=ebadx=e2x1x2dx=e(ln(x1)ln(x+1))=eln(x1)eln(x+1)=(x1)(x+1)=x21 Therefore the SL form is(1)(p(x)dydx)q(x)y(x)+λr(x)y(x)=0 Where p(x)=x21q(x)=p(x)c(x)a(x)=0r(x)=p(x)a(x)d=x21x21=1

Hence (1) becomes((x21)dydx)+λy(x)=0

6.5 Example 5

Convert (1x2)yxy+λy=0 We see the general form here is a(x)y+b(x)y+(c(x)+λ)y=0, with a=(1x2),b=x,c=0,d=1. Hence p(x)=ebadx=ex1x2dx But x1x2dx=12ln|1x2|, therefore μ(x)=e12ln|1x2|=1x2. Therefore the SL form is(1)(p(x)dydx)q(x)y(x)+λr(x)y(x)=0 Where p(x)=1x2q(x)=p(x)c(x)a(x)=0r(x)=p(x)a(x)d=1x2(1x2)=11x2

Hence (1) becomes(1x2dydx)+λ1x2y(x)=0

6.6 Example 6

Converta(x)y+b(x)y+c(x)y=0 The general form here is a(x)y+b(x)y+(c(x)+λ)y=0, with a=a(x),b=b(x),c=c(x),d=1 and λ=0. Hence p(x)=ebadx Therefore the SL form is(1)(p(x)dydx)q(x)y(x)+λr(x)y(x)=0 Where p(x)=ebadxq(x)=p(x)c(x)a(x)=ebadxc(x)a(x)r(x)=p(x)a(x)d=ebadx

Hence (1) becomes(2)(ebadxy)+ebadxc(x)a(x)y(x)=0 We see that the above is the same as a(x)y+b(x)y+c(x)y=0 because(ebadxy)=baebadxy+ebadxy And hence (2) becomesbaebadxy+ebadxy+ebadxc(x)a(x)y(x)=0 Canceling ebadx givesbay+y+c(x)a(x)y(x)=0ay+by+cy=0

Which is the original ODE.

6.7 Example 7

Convert3y+2y+5y=0 The general form here is a(x)y+b(x)y+(c(x)+λ)y=0, with a=3,b=2,c=5,d=1 and λ=0. Hence p(x)=ebadx=e23dx=e23x Therefore the SL form is(1)(p(x)dydx)q(x)y(x)+λr(x)y(x)=0 Where p(x)=e23xq(x)=p(x)c(x)a(x)=e23x(52)r(x)=p(x)a(x)d=e23x3

Hence (1) becomes(2)(e23xy)+e23x(52)y(x)=0 The above is the same as 3y+2y+5y=0 because(e23xy)=23e23xy+e23xy And hence (2) becomes23e23xy+e23xy+e23x(52)y(x)=0 Canceling e23x gives23y+y+52y(x)=02y+3y+5y=0

Which is the original ODE.