3.1.2 Terminology used and high level introduction
Sophus Lie was inspired by Galois theory for solving algebraic equations.
The goal of Lie symmetry for solving first order ode is to transform the ode \(\frac {dy}{dx}=\omega \left ( x,y\right ) \)
to canonical coordinates \(\frac {dY}{dX}=F\left ( X\right ) \) where it is easily solved by quadrature. Once the
solution \(Y\left ( X\right ) \) is found, it is converted back to \(y\left ( x\right ) \) in the original \(x,y\) coordinates, thus
obtaining the solution \(y\left ( x\right ) \) to the original ode. This method works regardless of
how complicated the original ode happened to be (linear or not). But this
requires finding what is called the Lie infinitesimals \(\xi \left ( x,y\right ) ,\eta \left ( x,y\right ) \) which requires solving a
PDE using ansatz and this can be difficult. Different algorithms are designed
to help find \(\xi \left ( x,y\right ) ,\eta \left ( x,y\right ) \) using different forms of ansatz.
\(x,y\) are the natural coordinates used in the input ode \(\frac {dy}{dx}=\omega \left ( x,y\right ) \). For example \(y^{\prime }=x^{2}+y^{2}\). Here
\(\omega =x^{2}+y^{2}\).
\(\bar {x},\bar {y}\) are called the Lie group (local) transformation coordinates. The Lie transformation
is one parameter transformation. Meaning it depends only on one parameter. Some
books call this \(\lambda \) and some call it \(\varepsilon \). Here \(\varepsilon \) is used. Hence we write \(\left ( \bar {x},\bar {y}\right ) =T\left ( x,y;\varepsilon \right ) \) to mean
transformation \(T\) is applied on point \(\left ( x,y\right ) \) to obtain new point \(\left ( \bar {x},\bar {y}\right ) \) and this transformation
depends on the value used for \(\varepsilon \). The parameter \(\varepsilon \) is real value. The ode \(\frac {d\bar {y}}{d\bar {x}}=\omega \left ( \bar {x},\bar {y}\right ) \)
must remain invariant (i.e. same shape) but with new letters \(\bar {x},\bar {y}\) that replace
each of the letters \(x,y\) in the original ode. If the new ode does not have same
exact form, then this is not valid Lie symmetry transformation that was
used.
The group transformation is defined as \(T_{\varepsilon }:\left \{ \bar {x}=\varphi \left ( x,y,\varepsilon \right ) ,\bar {y}=\psi \left ( x,y,\varepsilon \right ) \right \} \). It is required that \(\varphi \left ( x,y,\varepsilon \right ) ,\psi \left ( x,y,\varepsilon \right ) \) are independent of each
others, which means the Jacobian do not vanish. Hence \(\begin {vmatrix} \varphi _{x} & \varphi _{y}\\ \psi _{x} & \psi _{y}\end {vmatrix} \neq 0\). Lie group comes from the
above transformation group when expanding \(\varphi \left ( x,y,\varepsilon \right ) ,\psi \left ( x,y,\varepsilon \right ) \) in Taylor series near \(\left ( x,y\right ) \) and keep linear
terms, which results in \(\bar {x}\approx x+\varepsilon \xi \left ( x,y\right ) ,\bar {y}\approx y+\varepsilon \eta \left ( x,y\right ) \).
If given Transformation group \(T_{\varepsilon }\) defined as \(\bar {x}=\varphi \left ( x,y,\varepsilon \right ) ,\bar {y}=\psi \left ( x,y,\varepsilon \right ) \), then \(\xi ,\eta \) are found as follows.
The quantities \(\left ( \xi ,\eta \right ) \) define the tangent direction at \(\left ( x,y\right ) \) of the path that is taken to move \(\left ( x,y\right ) \) to
\(\left ( \bar {x},\bar {y}\right ) \). In other words, starting from any point \(\left ( x,y\right ) \) and calculating \(\left ( \xi ,\eta \right ) \) at \(\left ( x,y\right ) \), then the line going
from \(\left ( x,y\right ) \) to the point \(\left ( x+\varepsilon \xi ,y+\varepsilon \eta \right ) \) for a very small \(\varepsilon \) value, will be tangent line to the path from \(\left ( x,y\right ) \) to
\(\left ( \bar {x},\bar {y}\right ) \).
It is good to look at \(\bar {x}=x+\varepsilon \xi \left ( x,y\right ) \) as in kinematics, where \(x=x\left ( 0\right ) +v_{x}t\), where now \(\varepsilon \) represents the time
and \(x\left ( 0\right ) \) is initial position and \(x\) is final position and \(\xi \left ( x,y\right ) \) is the speed \(v_{x}\) which is a
function of position. Same for the \(y\) coordinate. \(y=y\left ( 0\right ) +v_{y}t\). In this view, as \(t\) increases the
point moves more and all points covered in the path are the orbit of point
\(\left ( x,y\right ) \).
The coordinates \(\left ( X,Y\right ) \) (some books use lower case \(r,s\)) are called the canonical coordinates in
which the input ode becomes a quadrature and therefore easily solved by just
integration. In other words, in canonical coordinates, Lie transformation is given
by
The ode is always solved in canonical coordinates \(\left ( X,Y\right ) \) and not in \(\left ( x,y\right ) \) since it is much
simpler to solve it in those coordinates.
\(\xi \left ( x,y\right ) ,\eta \left ( x,y\right ) \) are called the Lie infinitesimals, also called tangent vectors. They are functions of \(\left ( x,y\right ) \).
These are the core quantities of Lie symmetry method. These can be calculated
knowing \(\bar {x},\bar {y}\). Also \(\bar {x},\bar {y}\) can be calculated given \(\xi ,\eta \). In practice, \(\bar {x},\bar {y}\) are not given, and hence these
have to be found using solving a PDE. It is \(\xi ,\eta \) which are the most important
quantities that need to be determined in order to find the canonical coordinates
\(X,Y\).
The tangent vectors \(\xi ,\eta \) are calculated at \(\epsilon =0\). They are defined as \(\xi =\left . \frac {dx}{d\epsilon }\right \vert _{\epsilon =0},\eta =\left . \frac {dy}{d\epsilon }\right \vert _{\epsilon =0}\). The point \(\left ( \bar {x},\bar {y}\right ) \) (orbit of \(\left ( x,y\right ) \)) is
given by \(\bar {x}=x+\xi \epsilon \) and \(\bar {y}=y+\eta \epsilon \).
The above shows the importance of \(\xi \left ( x,y\right ) ,\eta \left ( x,y\right ) \). These (along with the specific value of \(\epsilon \))
determine the orbit of each point \(\left ( x,y\right ) \).
The orbit of a point \(A\) given by natural coordinates \(\left ( x,y\right ) \) is the set of all possible
points \(\left ( \bar {x},\bar {y}\right ) \) that the point \(A\) transforms to for all possible value of \(\varepsilon \).
The ultimate goal is write \(\frac {dy}{dx}=\omega \left ( x,y\right ) \) in \(X,Y\) coordinates where symmetry have the ideal form \(\left ( \bar {X},\bar {Y}\right ) =\left ( X,Y+\varepsilon \right ) \)
because this leads to ode of form \(\frac {dY}{dX}=f\left ( X\right ) \). The right hand side should always be a function
of \(X\) only in canonical coordinates.
The ideal transformation has the form \(\left ( \bar {X},\bar {Y}\right ) \rightarrow \left ( X,Y+\varepsilon \right ) \) as mentioned above, because with this
transformation the ode becomes quadrature in the transformed coordinates. But
because not all ode’s have this transformation as given, the ode is first transformed
to canonical coordinates \(\left ( X,Y\right ) \) where the transformation is \(\left ( \bar {X},\bar {Y}\right ) \rightarrow \left ( X,Y+\varepsilon \right ) \) is imposed. If the
transformation \(\left ( \bar {x},\bar {y}\right ) \rightarrow \left ( x,y+\epsilon \right ) \) is already present in original coordinates, then there will be no need
for canonical coordinates \(\left ( X,Y\right ) \).
The main goal of Lie symmetry method is to determine \(X,Y\) and solve the ode \(\frac {dY}{dX}=f\left ( X\right ) \) in that
space instead in the natural coordinates \(\left ( x,y\right ) \). To be able to do this, the quantities \(\xi ,\eta \) must
be determined first.
The remarkable thing about Lie symmetry method, is that regardless of how
complicated the original ode \(\frac {dy}{dx}=\omega \left ( x,y\right ) \) is, if the similarity condition PDE can be solved for \(\xi ,\eta \),
then \(X,Y\) can always be found and the ode becomes quadrature \(\frac {dY}{dX}=f\left ( X\right ) \). The ode is
then solved in canonical coordinates and the solution transformed back to
\(x,y\).
The quantity \(\epsilon \) is called the Lie parameter. This is a real quantity which as it
goes to zero, gives the identity transformation. In other words, when \(\epsilon =0\) then
\(\left ( x,y\right ) =\left ( \bar {x},\bar {y}\right ) \).
But there is no free lunch, even in Mathematics. The problem comes down to finding
\(\xi ,\eta \). This requires solving a PDE. This is done using ansatz and trial and
error. This reason possibly explains why the Lie symmetry method have
not become standard in textbooks for solving ODE’s as the algebra and
computation needed to find \(\xi ,\eta \) from the PDE become very complex to do by
hand.
Total derivative operator: Given \(f\left ( x,y\right ) \) then \(\frac {df}{dx}=\frac {\partial f}{\partial x}+\frac {\partial f}{\partial y}\frac {dy}{dx}\) where it is assumed that \(y\left ( x\right ) \) depends on \(x\). Total
derivative operator will be used extensively in all the derivations below, so good to
practice this. It is written as \(D_{x}=\partial _{x}+\partial _{y}y^{\prime }\) for first order ode, and as \(D_{x}=\partial _{x}+\partial _{y}y^{\prime }+\partial _{y^{\prime }}y^{\prime \prime }\) for second order ode and as \(D_{x}=\partial _{x}+\partial _{y}y^{\prime }+\partial _{y^{\prime }}y^{\prime \prime }+\partial _{y^{\prime \prime }}y^{\prime \prime \prime }\)
for third order ode and so on.
The notation \(f_{x}\) means partial derivative. Hence \(\frac {\partial f}{\partial x}\) is written as \(f_{x}\). Total derivative will
always be written as \(\frac {df}{dx}\). It is important to distinguish between these two as the
algebra will get messy with Lie symmetry. Sometimes we write \(f^{\prime }\) to mean \(\frac {df}{dx}\) but
it is better to avoid \(f^{\prime }\) and just write \(\frac {df}{dx}\) when \(f\) is function of more than one
variable.
Given first ode \(\frac {dy}{dx}=\omega \left ( x,y\right ) \), where \(\bar {y}\equiv \bar {y}\left ( x,y\right ) \) and \(\bar {x}\equiv \bar {x}\left ( x,y\right ) \) then then\(\frac {d\bar {y}}{d\bar {x}}\) is given by the following (using the total
derivative operator)
Given second order ode \(\frac {d^{2}y}{dx^{2}}=\omega \left ( x,y,y^{\prime }\right ) \) where \(\bar {y}\equiv \bar {y}\left ( x,y,y^{\prime }\right ) \) and \(\bar {x}\equiv \bar {x}\left ( x,y,y^{\prime }\right ) \) then \(\frac {d^{2}\bar {y}}{d\bar {x}^{2}}\) is given by
Keeping in mind that \(\left ( \circ \right ) _{x}\) or \(\left ( \circ \right ) _{y}\) mean partial derivative.
Given third order ode \(\frac {d^{3}y}{dx^{3}}=\omega \left ( x,y,y^{\prime },y^{\prime \prime }\right ) \) where \(\bar {y}\equiv \bar {y}\left ( x,y,y^{\prime },y^{\prime \prime }\right ) \) and \(\bar {x}\equiv \bar {x}\left ( x,y,y^{\prime },y^{\prime }\right ) \) then \(\frac {d^{3}\bar {y}}{d\bar {x}^{3}}\) is given by