Review of the geometry of screw Axes

: HOME
: PDF (letter size)
: PDF (legal size)

Review of the geometry of screw Axes

Graduate student. Dept. of Mechanical engineering, UCI. 2006 Compiled on July 18, 2025 at 10:56am

1 Deﬁnitions, Important formulas, and terminology
1.1 Planar displacement
1.2 Spatial displacement
1.3 Homogeneous Transformation
1.4 The rotation axis
1.5 A Pole or a Fixed point
1.6 Rotational displacement
1.7 Rodrigues vector \(b\)
1.8 Screw
1.9 Plücker coordinates of a line
1.10 screw deﬁned in terms of the plücker coordinates of its axis
1.11 Important relations between Cayley Matrix \(B\)
2 Introduction to the rotation matrix in 3D
3 Screw displacement
3.1 Derivation for expression for ﬁnding reference point for screw axis
4 The Screw Matrix
5 The spatial displacement of screws
6 The screw axis of a displacement
7 References

Report writing at UCI during work on my MSc in Mechanical Engineering. 2006

1 Deﬁnitions, Important formulas, and terminology

1.1 Planar displacement

General motion of a body in 2D that includes rotation and translation. We use \([A]\) for the rotation matrix, the vector \(\mathbf {d}\) for the translation and the matrix \([T]\) for displacement. Hence we write

\[ \mathbf {X}=[A]\mathbf {x}+\mathbf {d}\]

1.2 Spatial displacement

General motion of a body in 2D that includes rotation and translation. Another name for this is rigid displacement. Distance between points in a body remain unchanged before and after spatial displacement.

1.3 Homogeneous Transformation

Recall that the transformation that deﬁnes spatial displacement, which is given by \(\mathbf {X}=[A]\mathbf {x}+\mathbf {d}\) is non-linear due to the presence of the translation term \(\mathbf {d}\). It is more convenient to be able to work with linear transformation, therefore we add a fourth component to the position vector which is always \(1\) and rewrite the transformation, now calling it as \(T\) which maps \(X\) to \(x\) as

\[ X=T\ x \]

Or in full component form

\[\begin {Bmatrix} X_{1}\\ X_{2}\\ X_{3}\\ 1 \end {Bmatrix} =\overset {T}{\overbrace {\begin {bmatrix} a_{11} & a_{12} & a_{13} & d_{1}\\ a_{21} & a_{22} & a_{23} & d_{2}\\ a_{31} & a_{32} & a_{33} & d_{3}\\ 0 & 0 & 0 & 1 \end {bmatrix} }}\begin {Bmatrix} x_{1}\\ x_{2}\\ x_{3}\\ 1 \end {Bmatrix} \]

Or in short form

\[\begin {Bmatrix} \mathbf {X}\\ 1 \end {Bmatrix} =\overset {T}{\overbrace {\begin {bmatrix} A & \mathbf {d}\\ 000 & 1 \end {bmatrix} }}\begin {Bmatrix} \mathbf {x}\\ 1 \end {Bmatrix} \]

1.4 The rotation axis

The set of points that remain ﬁxed during the rotation deﬁned by \([A]\). We use Rodrigues vector \(\mathbf {b}\) to deﬁne the rotation axis.

1.5 A Pole or a Fixed point

A pole is point that remains ﬁxed during planar or spatial displacement. Planar displacement (2D) have a pole, but 3D spatial displacement do not have a pole in general, since the requirement for a pole is to have an inverse for \([I-A]\) which for 3D is not possible since \(A\) has an eigenvalue of 1.

The following diagram is an example of a pole under planar displacement.

For spatial displacement (3D), one condition, when satisﬁed will result in a ﬁxed point. This condition occurs when the translation vector \(\mathbf {d}\) is perpendicular to the rotation vector \(\mathbf {b}\). The pole in this case is given by

\[ \mathbf {c}=\frac {\mathbf {b}\times \left ( \mathbf {d}-\mathbf {b}\times \mathbf {d}\right ) }{2\mathbf {b}\cdot \mathbf {b}}\]

Not only is this point \(\mathbf {c}\) ﬁxed, by any point on the line \(\mathbf {c}+t\mathbf {S}\) is also ﬁxed. Where \(t\) is a parameter and \(\mathbf {S}\) is a unit vector. Hence we can a ﬁxed line under such a spatial displacement, and this line is called the rotation axis.

1.6 Rotational displacement

This is a special case of spatial displacement when the translation vector \(\mathbf {d}\) is perpendicular to the rotation axis \(\mathbf {b.}\) To make the diﬀerence more clear, the following diagram is an illustration of spatial displacement which is not rotational displacement, and one which is.

1.7 Rodrigues vector \(b\)

Deﬁnes the rotation axis under \([A]\). Using \(\mathbf {S}\) as the unit vector along \(\mathbf {b}\), then \(\mathbf {b}=k\mathbf {S}\) where \(k\) is the length of vector \(\mathbf {b}\) given by \(\tan \left ( \frac {\theta }{2}\right ) \) where \(\theta \) is the rotation angle around the rotation axis. Hence we can write

\[ \mathbf {b}=\tan \left ( \frac {\theta }{2}\right ) \mathbf {S}\]

We see that the

\[ \left \Vert \mathbf {b}\right \Vert =\tan \left ( \frac {\theta }{2}\right ) \]

Hence at \(\theta =\pm 180^{0}\) , \(\left \Vert \mathbf {b}\right \Vert =\tan \left ( \pm \frac {\pi }{2}\right ) \) which goes to inﬁnity. A plot of the function \(\tan \left ( \alpha \right ) \) is below showing the discontinuities at \(\pm \frac {\pi }{2}\)

1.8 Screw

Screw is deﬁned as a pair of vectors \(\left ( \mathbf {W},\mathbf {V}\right ) ^{T}\) such that \(\mathbf {W}\cdot \mathbf {V}\neq 0\) and \(|\mathbf {W}|\neq 1\). The pitch of the screw \(p_{\omega }\) is deﬁned as \(\frac {\mathbf {W}\cdot \mathbf {V}}{\mathbf {W}\cdot \mathbf {W}}\)

1.9 Plücker coordinates of a line

Given a line deﬁned by a parametric equation \(L\left ( k\right ) =\mathbf {C}+k\mathbf {S}\), where \(C\) is a reference point and \(S\) is a unit vector along the line, the Plücker coordinates of this line is given by \(\begin {Bmatrix} \mathbf {S}\\ \mathbf {C\times S}\end {Bmatrix} \) \(\mathbf {C\times S}\) represents the moment of the line around the origin of the reference frame. The following diagram helps to illustrates this.

1.10 screw deﬁned in terms of the plücker coordinates of its axis

We said above that a screw is deﬁned as a pair of vectors \(\left ( \mathbf {W},\mathbf {V}\right ) ^{T}\). Given the plücker coordinates of the screw line \(\left ( \mathbf {W},\mathbf {C}\times \mathbf {W}\right ) ^{T}\), then we can write the pair of vectors that deﬁnes the screw associated with this screw line as follows

\[\begin {Bmatrix} \omega \mathbf {S}\\ \omega \mathbf {C}\times \mathbf {S+}\omega p_{\omega }\mathbf {S}\end {Bmatrix} \]

Where \(p_{\omega }\) is the screw pitch and \(\omega =\left \vert \mathbf {W}\right \vert \)

1.11 Important relations between Cayley Matrix \(B\)

These are some important formulas put here for quick reference.

\[ \left ( \mathbf {x}_{1}+\mathbf {x}_{2}\right ) ^{T}\left [ B\right ] \left ( \mathbf {x}_{1}+\mathbf {x}_{2}\right ) =0 \]

See (4A) below for derivation of the above where

\[ \left [ B\right ] =\begin {bmatrix} 0 & -b_{z} & b_{y}\\ b_{z} & 0 & -b_{x}\\ -b_{y} & b_{x} & 0 \end {bmatrix} \]

and also

\[ \left [ B\right ] =\tan \frac {\theta }{2}[S] \]

Where

\[ \lbrack S]=\begin {bmatrix} 0 & -s_{z} & s_{y}\\ s_{z} & 0 & -s_{x}\\ -s_{y} & s_{x} & 0 \end {bmatrix} \]

From \([B]\) we obtain a row vector and call it

\[ \mathbf {b}=\left ( b_{z},b_{y},b_{z}\right ) ^{T}\]

(This is rodrigues vector) From \(\left [ S\right ] \) we obtain a row vector and call it

\[ \mathbf {s}=\left ( s_{x},s_{y},s_{z}\right ) ^{T}\]

(This is unit vector along the screw axis). We also have

\[ \lbrack B]=\tan \frac {\theta }{2}\left [ S\right ] \]

And in terms of the vectors \(\mathbf {b},\mathbf {s}\,\ \) the above becomes

\[ \mathbf {b}=\tan \frac {\theta }{2}\mathbf {s}\]

2 Introduction to the rotation matrix in 3D

We seek to derive an expression for the rotation matrix in 3D. Consider a point \(x_{1}\) in 3D being acted upon by a rotation matrix \(A\). Let the ﬁnal coordinates of this point be \(x_{2}\). We consider the position vectors of these points, so we will designate these points by their position vectors \(\mathbf {x}_{1}\) and \(\mathbf {x}_{2}\) from now on. The following diagram illustrate this.

Hence we have that

\[ \mathbf {x}_{2}=A\mathbf {x}_{1}\]

Since \(\left \vert \mathbf {x}_{1}\right \vert =\left \vert \mathbf {x}_{2}\right \vert \) we can write

\begin{align} \mathbf {x}_{1}\cdot \mathbf {x}_{1} & =\mathbf {x}_{2}\cdot \mathbf {x}_{2}\nonumber \\ \mathbf {x}_{1}\cdot \mathbf {x}_{1}-\mathbf {x}_{2}\cdot \mathbf {x}_{2} & =0\nonumber \\ \left ( \mathbf {x}_{1}-\mathbf {x}_{2}\right ) \cdot \left ( \mathbf {x}_{1}+\mathbf {x}_{2}\right ) & =0 \tag {1}\end{align}

The geometric meaning of the above last equation is shown in the following diagram

To introduce the rotation matrix \(A\) into the equations, we can write \(\mathbf {x}_{1}-\mathbf {x}_{2}\) as \(\mathbf {x}_{1}-A\mathbf {x}_{1}\). Hence

\begin{equation} \mathbf {x}_{1}-\mathbf {x}_{2}=\left [ I-A\right ] \mathbf {x}_{1} \tag {2}\end{equation}

Similarly, we obtain

\begin{equation} \mathbf {x}_{1}+\mathbf {x}_{2}=\left [ I+A\right ] \mathbf {x}_{1} \tag {3}\end{equation}

From (3) we obtain

\[ \left [ A+I\right ] ^{-1}\left ( \mathbf {x}_{1}+\mathbf {x}_{2}\right ) =\mathbf {x}_{1}\]

Substitute the above expression for \(\mathbf {x}_{1}\) into (2) we obtain

\begin{equation} \mathbf {x}_{1}-\mathbf {x}_{2}=\left [ A-I\right ] \left [ A+I\right ] ^{-1}\left ( \mathbf {x}_{1}+\mathbf {x}_{2}\right ) \tag {3A}\end{equation}

We now call the matrix \(\left [ A-I\right ] \left [ A+I\right ] ^{-1}\) as \(B\)

\begin{equation} B=\left [ A-I\right ] \left [ A+I\right ] ^{-1} \tag {3B}\end{equation}

We can also write from above the following

\begin{equation} A=[I-B]^{-1}[I+B] \tag {3C}\end{equation}

Now rewrite (3A) as

\begin{equation} \mathbf {x}_{1}-\mathbf {x}_{2}=\left [ B\right ] \left ( \mathbf {x}_{1}+\mathbf {x}_{2}\right ) \tag {4}\end{equation}

What is the geometric meaning of \(\left [ B\right ] \)? we see that it is an operator that acts on vector \(\mathbf {x}_{1}+\mathbf {x}_{2}\) to produce the vector \(\mathbf {x}_{1}-\mathbf {x}_{2}\), however from the diagram above we see that the vector \(\mathbf {x}_{1}-\mathbf {x}_{2}\) is perpendicular to \(\mathbf {x}_{1}+\mathbf {x}_{2}\) and scaled down.

Hence

\begin{equation} \left ( \mathbf {x}_{1}+\mathbf {x}_{2}\right ) ^{T}\left [ B\right ] \left ( \mathbf {x}_{1}+\mathbf {x}_{2}\right ) =0 \tag {4A}\end{equation}

This implied that \(B\) is skew-symmetric and have the form given by

\begin{equation} B=\begin {bmatrix} 0 & -b_{z} & b_{y}\\ b_{z} & 0 & -b_{x}\\ -b_{y} & b_{x} & 0 \end {bmatrix} \tag {4A}\end{equation}

Matrix \(B\) can be written as a column vector called \(\mathbf {b}=[b_{x},b_{y},b_{z}]^{T}\) such that for any vector \(y\) we have

\[ \left [ B\right ] \mathbf {y}=\mathbf {b}\times \mathbf {y}\]

The vector \(\mathbf {b}\) is called Rodrigues vector.

Let the vector \(\mathbf {x}_{1}+\mathbf {x}_{2}=\mathbf {y}\) then we write

\[ \left [ B\right ] \mathbf {y}=\mathbf {b}\times \mathbf {y}\]

Where \(\mathbf {b}\) is some vector perpendicular to \(\mathbf {y}\) such that its cross product with \(y\) results in \(\left [ B\right ] \mathbf {y}\)

The vector \(\mathbf {b}\) is the vector that deﬁnes the rotation axis. The length of this vector is \(k\) and a unit vector along \(\mathbf {b}\) is called \(\mathbf {S}\), Hence we can write

\[ \mathbf {b}=k\mathbf {S}\]

Now we solve for \(B.\)

We will use equation (3B) to ﬁnd the form of \(B\) for diﬀerent rotations.

Consider for example the 3D rotation around the \(x\) axis given by

\[ A=\begin {bmatrix} 1 & 0 & 0\\ 0 & \cos \theta & -\sin \theta \\ 0 & \sin \theta & \cos \theta \end {bmatrix} \]

From (3B) we obtain

For example for \(\theta =45^{0}\) the \(B\) matrix is

\[ B=\begin {bmatrix} 0 & 0 & 0\\ 0 & 0 & -0.414214\\ 0 & 0.414214 & 0 \end {bmatrix} \]

Hence we see that \(b_{z}=0\), \(b_{y}=0\), \(b_{x}=-0.41421\) hence

\[ \mathbf {b}=\begin {bmatrix} 0.41421\\ 0\\ 0 \end {bmatrix} \]

Hence geometrically, Rodrigues vector is along the \(x\) axis and in the positive direction as illustrated in the following diagram.

The size of the \(\mathbf {b}\) vector depends on the angle or rotation \(\theta \). The Rodrigues vector will be largest when \(\theta \) is almost \(180^{0}\) and smallest when \(\theta \) is zero.

3 Screw displacement

We now derive an expression for the invariant under spatial displacement, which is the screw axis.

Consider the following general spatial displacement

We start be decomposing the translation vector \(\mathbf {d}\) into 2 components: One parallel (\(\mathbf {d}_{2})\) and one perpendicular \(\left ( \mathbf {d}^{\ast }\right ) \) to the rotation axis of \([A]\) as follows (notice that \(\mathbf {d}^{\ast }\) is perpendicular to the \(x-axis\) which is where the rotation occurs around)

Recall from earlier that the vector \(\mathbf {S}\) is a unit vector along the rotation axis of \([A]\) in the direction of \(\mathbf {b}\). We found earlier that \(\mathbf {b}=\tan \left ( \frac {\theta }{2}\right ) \mathbf {S}\). Hence we can write that \(\mathbf {d}_{2}=-k\mathbf {S}\) where \(k=\tan \left ( \frac {\theta }{2}\right ) \)

Let us redraw the above diagram putting all of these symbols to make the discussion more clear.

Now since

\[ \mathbf {d}=\mathbf {d}^{\ast }-k\mathbf {S}\]

Then the spatial displacement operator \(T\) can be written as follows

\begin{align*} T & =[A,d]\\ & =[A,\mathbf {d}^{\ast }-k\mathbf {S]}\end{align*}

Hence the spatial displacement is

\begin{align*} \mathbf {X} & =T\mathbf {x}\\ & =[A,\mathbf {d}^{\ast }-k\mathbf {S]x}\\ & =[A,\mathbf {d}^{\ast }]\mathbf {x}-[I,k\mathbf {S]x}\end{align*}

Hence we see that the spatial displacement can be viewed as rotational displacement followed by pure translation. Recall from above that rotational displacement is a special type of spatial displacement where the translation part is perpendicular to the rotation axis. We can represent the above equation geometrically as follows

3.1 Derivation for expression for ﬁnding reference point for screw axis

Rotational displacement have a ﬁxed point given by

\[ \mathbf {C}=\frac {\mathbf {b}\times \left ( \mathbf {d}^{\ast }-\mathbf {b}\times \mathbf {d}^{\ast }\right ) }{2\mathbf {b}\cdot \mathbf {b}}\]

The derivation of the above equation is as follows.

Since we seek a ﬁxed point \(\mathbf {C}\), then we write

\begin{align} \mathbf {C} & =\left [ T\right ] \mathbf {C}\nonumber \\ & =\left [ A\right ] \mathbf {C}+\mathbf {d}^{\ast } \tag {4}\end{align}

Using Cayley’s formula, derived above in equation (3C), reproduced below

\begin{equation} A=[I-B]^{-1}[I+B] \tag {3C}\end{equation}

and substitute for \(A\) in (4), we obtain

\[ \mathbf {C=\mathbf {[}}I-B\mathbf {\mathbf {]^{-1}}[}I+B\mathbf {]C}+\mathbf {d}^{\ast }\]

Multiply both sides by \([I-B]\)

\begin{align} \mathbf {C-}\left [ B\right ] \mathbf {C} & \mathbf {=[}I+B\mathbf {]C}+[I-B]\mathbf {d}^{\ast }\nonumber \\ 0 & =2\left [ B\right ] \mathbf {C}+[I-B]\mathbf {d}^{\ast }\nonumber \\ -\frac {1}{2}[I-B]\mathbf {d}^{\ast } & =\left [ B\right ] \mathbf {C} \tag {4B}\end{align}

But by deﬁnition

\[ \left [ B\right ] \mathbf {C}=\mathbf {b}\times \mathbf {C}\]

Hence (4B) becomes

\begin{align*} -\frac {1}{2}\mathbf {d}^{\ast }+\frac {1}{2}B\mathbf {d}^{\ast } & =\mathbf {b}\times \mathbf {C}\\ -\frac {1}{2}\mathbf {d}^{\ast }+\frac {1}{2}\mathbf {b}\times \mathbf {d}^{\ast } & =\mathbf {b}\times \mathbf {C}\\ \frac {1}{2}\left ( \mathbf {b}\times \mathbf {d}^{\ast }-\mathbf {d}^{\ast }\right ) & =\mathbf {b}\times \mathbf {C}\end{align*}

Take the cross product of both sides w.r.t. \(\mathbf {b}\) we obtain

\begin{equation} \mathbf {b}\times \left ( \frac {1}{2}\left ( \mathbf {b}\times \mathbf {d}^{\ast }-\mathbf {d}^{\ast }\right ) \right ) =\mathbf {b}\times \left ( \mathbf {b}\times \mathbf {C}\right ) \tag {4C}\end{equation}

To simplify the above, use the relation

\[ \mathbf {A}\times \left ( \mathbf {B}\times \mathbf {C}\right ) =\mathbf {B}\left ( \mathbf {A}\cdot \mathbf {C}\right ) -\mathbf {C}\left ( \mathbf {A}\cdot \mathbf {B}\right ) \]

Apply the above relation on the RHS of (4C), hence (4C) can be rewritten as

\[ \mathbf {b}\times \left ( \frac {1}{2}\left ( \mathbf {b}\times \mathbf {d}^{\ast }-\mathbf {d}^{\ast }\right ) \right ) =\mathbf {b}\times \left ( \mathbf {b}\cdot \mathbf {C}\right ) -\mathbf {C}\left ( \mathbf {b}\cdot \mathbf {b}\right ) \]

But the vector \(\mathbf {C}\) is perpendicular to \(\mathbf {b}\) hence \(\mathbf {b}\cdot \mathbf {C=0}\) and the above simpliﬁes to

\[ \mathbf {C}=\frac {\mathbf {b}\times \left ( \mathbf {d}^{\ast }-\mathbf {b}\times \mathbf {d}^{\ast }\right ) }{2\mathbf {b}\cdot \mathbf {b}}\]

Now we continue to derive an expression for the screw axis.

We now consider a line \(L\) that passed through this point \(\mathbf {C}\) and is parallel to the rotation axis of \([A]\) (in other words, along the same direction as the vector \(\mathbf {S}\)). Any point along this line remain ﬁxed relative to the rotational displacement \(\mathbf {X}=[A,d^{\ast }]\mathbf {x}\) part of the spatial displacement.

In addition, since the translation part of the spatial displacement, and given by \(\mathbf {X}=[I,k\mathbf {S}]\mathbf {x},\) is a translation in the same direction and slides along the vector \(kS\) as \(k\) changes, then this line will also remain ﬁxed relative to the translation part as well.

Hence we conclude that the line \(L\) will remain ﬁxed relative to the overall spatial displacement \(T\).

This line is called the screw axis. And this type of decomposing the spatial displacement into rotational displacement followed by pure translation is called the screw displacement.

How to geometrically ﬁnd the screw axis? Let us ﬁnd the point \(\mathbf {C}\) ﬁrst. Let take an example similar to the above diagrams, where say \(\theta =30^{0},\) \(\mathbf {d}^{\ast }=0\mathbf {i}+\mathbf {j}+\mathbf {k}\), \(\mathbf {S}=\mathbf {i}\) , hence

\begin{align*} \mathbf {b} & =\overset {k}{\overbrace {\tan \left ( \frac {30^{0}}{2}\right ) }}\mathbf {S}\\ & =0.26795\mathbf {i}\end{align*}

hence

\begin{align*} \mathbf {C} & =\frac {\mathbf {b}\times \left ( \mathbf {d}^{\ast }-\mathbf {b}\times \mathbf {d}^{\ast }\right ) }{2\mathbf {b}\cdot \mathbf {b}}\\ \mathbf {C} & =\frac {0.26795\mathbf {i}\times \left \{ \left ( 0\mathbf {i}+\mathbf {j}+\mathbf {k}\right ) -\left ( 0.26795\mathbf {i}\right ) \times \left ( 0\mathbf {i}+\mathbf {j}+\mathbf {k}\right ) \right \} }{2\left ( 0.26795\mathbf {i}\right ) \cdot \left ( 0.26795\mathbf {i}\right ) }\\ \mathbf {C} & =0\mathbf {i}-1.3660\mathbf {j}+2.3660\mathbf {k}\end{align*}

On the above diagram we now can draw the screw axis using the above coordinates for the point \(\mathbf {C}\)

It is important to note that it is the line given by \(L=\mathbf {C}+k\mathbf {S}\) (the screw axis) which is ﬁxed under the spatial displacement, and not any one single point on this line.

4 The Screw Matrix

We now derive a new expression for spatial displacement using the screw axis line, which we denote as\(\ \mathit {S}\), the angle of rotation \(\theta \) and the amount of slide \(k\) along the screw axis.

The screw matrix is a new mathematical operator that we can use to denote spatial displacement between 2 diﬀerent reference frames. Earlier we showed that we can use the homogeneous transformation operator \(T\left ( A,d\right ) =[A,d]\) to denote spatial displacement, and now we seek to obtain a new expression for a spatial displacement operator which is a function of the following 3 parameters

The screw axis line which we call \(\mathit {S}\) with the plucker coordinates \(\left ( \mathbf {s},\mathbf {C}\times \mathbf {s}\right ) \)
The angle or rotation \(\theta \)
The amount of slide \(k\)

This is in addition to the mathematical object we examined earlier which is

\begin{equation} \mathbf {d}=\mathbf {d}^{\ast }+k\mathbf {S} \tag {5}\end{equation}

Since \(C\) is a ﬁxed point under the translation by \(\mathbf {d}^{\ast }\) hence we write

\begin{align*} \mathbf {C} & =[A,\mathbf {d}^{\ast }]\mathbf {C}\\ & =A\mathbf {C}+\mathbf {d}^{\ast }\end{align*}

Hence

\[ \mathbf {d}^{\ast }=[I-A]\mathbf {C}\]

Substitute the above into (5) we obtain

\begin{equation} \mathbf {d}=[I-A]\mathbf {C}+k\mathbf {S} \tag {5A}\end{equation}

But the spatial displacement \(T\) is deﬁned as

\[ \lbrack T]=[A,\mathbf {d}] \]

Hence using (5A) the above becomes

\begin{equation} \lbrack T]=[A,[I-A]\mathbf {C}+k\mathbf {S}] \tag {5B}\end{equation}

Using the notation of \(\theta \) for angle of rotation and the slide \(k\) and \(\Large S\) to denote the screw axis, we can write (5B) as

\begin{equation} \left [ T\left ( \theta ,k,{\Large S}\right ) \right ] =\left [ A\left ( \theta ,\mathbf {S}\right ) ,[I-A\left ( \theta ,\mathbf {S}\right ) ]\mathbf {C}+k\mathbf {S}\right ] \tag {5C}\end{equation}

So now (5C) is an expression for the spatial operator \(T\) in terms of \(\mathbf {S,}k\ \)and \(\theta \). Recall that

\[ A\left ( \theta ,\mathbf {S}\right ) =\left [ I\right ] +\sin \theta \left [ S\right ] +\left ( 1-\cos \theta \right ) \left [ S^{2}\right ] \]

Where

\[ \left [ S\right ] =\begin {bmatrix} 0 & -s_{x} & s_{y}\\ s_{z} & 0 & -s_{x}\\ -s_{y} & s_{x} & 0 \end {bmatrix} \]

We call \(\left [ T\left ( \theta ,k,{\Large S}\right ) \right ] \) the screw Matrix.

The following diagram helps to illustrate this.

5 The spatial displacement of screws

So far we have discussed spatial displacements applied to points. We showed two Matrices can be used to accomplish this. The homogeneous transformation matrix \(T\left ( A,d\right ) =[A,d]\) and the screw matrix \(T\left ( \theta ,k,{\Large S}\right ) =\) \(\left [ A\left ( \theta ,\mathbf {S}\right ) ,[I-A\left ( \theta ,\mathbf {S}\right ) ]\mathbf {C}+k\mathbf {S}\right ] \) where \(A\left ( \theta ,\mathbf {S}\right ) =\left [ I\right ] +\sin \theta \left [ S\right ] +\left ( 1-\cos \theta \right ) \left [ S^{2}\right ] .\)

We now show a matrix \([\hat {T}]\) which is used for the spatial displacement of a line and not just a point. This is based on using the plücker coordinates of a line to represent the line. Geometrically this is illustrated in the following diagram

Since \(\left [ \hat {T}\right ] \) operates on the Plücker coordinates of a line, then we write

\begin{align} X & =\left [ \hat {T}\right ] x\nonumber \\\begin {Bmatrix} \mathbf {X}\\ \mathbf {P}\times \mathbf {X}\end {Bmatrix} & =\left [ \hat {T}\right ] \begin {Bmatrix} \mathbf {x}\\ \mathbf {p}\times \mathbf {x}\end {Bmatrix} \tag {6A}\end{align}

To processed further, we now assume a point \(\mathbf {q}\) on the line \(x\) such that \(\mathbf {x}=\) \(\mathbf {q}-\mathbf {p}\) as illustrated below

Hence we can now write, in the new coordinates

\begin{equation}\begin {Bmatrix} \mathbf {X}\\ \mathbf {P}\times \mathbf {X}\end {Bmatrix} =\begin {Bmatrix} \mathbf {Q-P}\\ \mathbf {P}\times \left ( \mathbf {Q-P}\right ) \end {Bmatrix} \tag {6B}\end{equation}

But

\begin{align} \mathbf {Q-P} & \mathbf {=}\left [ T\right ] \mathbf {q}-[T]\mathbf {p}\nonumber \\ & =\left [ A\right ] \mathbf {q}+\mathbf {d-}\left ( \left [ A\right ] \mathbf {p}+\mathbf {d}\right ) \nonumber \\ & =\left [ A\right ] \mathbf {q-}\left [ A\right ] \mathbf {p}\nonumber \\ & =\left [ A\right ] \left ( \mathbf {q-p}\right ) \nonumber \\ & =\left [ A\right ] \mathbf {x} \tag {6C}\end{align}

And

\begin{align} \mathbf {P}\times \left ( \mathbf {Q-P}\right ) & =\mathbf {P}\times \mathbf {Q-P\times P}\nonumber \\ & =\mathbf {P}\times \mathbf {Q}\nonumber \\ & =[T]\mathbf {p\times }[T]\mathbf {q}\nonumber \\ & =\left ( \left [ A\right ] \mathbf {p}+\mathbf {d}\right ) \mathbf {\times }\left ( \left [ A\right ] \mathbf {q}+\mathbf {d}\right ) \nonumber \\ & =\left ( \left [ A\right ] \mathbf {p\times }\left [ A\right ] \mathbf {q}\right ) \mathbf {+}\left ( \mathbf {\left [ A\right ] \mathbf {p\times d}}\right ) \mathbf {\mathbf {+}}\left ( \mathbf {\mathbf {d\times }\left [ A\right ] \mathbf {q}}\right ) \mathbf {+}\overset {0}{\overbrace {\mathbf {d\times d}}}\nonumber \\ & =\left ( \left [ A\right ] \mathbf {p\times }\left [ A\right ] \mathbf {q}\right ) \mathbf {+}\left ( \mathbf {\left [ A\right ] \mathbf {p\times d}}\right ) \mathbf {\mathbf {+}}\left ( \mathbf {\mathbf {d\times }\left [ A\right ] \mathbf {q}}\right ) \tag {6D}\end{align}

Since \([A]\) is a rotation matrix, then

\[ \left ( \mathbf {\left [ A\right ] \mathbf {p\times d}}\right ) \mathbf {\mathbf {+}}\left ( \mathbf {\mathbf {d\times }\left [ A\right ] \mathbf {q}}\right ) =\left [ D\right ] \left [ A\right ] \left ( \mathbf {q-p}\right ) \]

Where \([D]\) is a skew-symmetric matrix deﬁned such that \([D]\mathbf {y}=\mathbf {d}\times \mathbf {y}\)

Hence (6D) can be written as

\begin{align} \mathbf {P}\times \left ( \mathbf {Q-P}\right ) & =\left ( \left [ A\right ] \mathbf {p\times }\left [ A\right ] \mathbf {q}\right ) +\left [ D\right ] \left [ A\right ] \left ( \mathbf {q-p}\right ) \nonumber \\ & =\left [ A\right ] \left ( \mathbf {p\times q}\right ) +\left [ D\right ] \left [ A\right ] \left ( \mathbf {q-p}\right ) \tag {6E}\end{align}

By substitution of (6E) and (6C) into RHS of (6B) we obtain

\[\begin {Bmatrix} \mathbf {X}\\ \mathbf {P}\times \mathbf {X}\end {Bmatrix} =\begin {Bmatrix} \mathbf {Q-P}\\ \mathbf {P}\times \left ( \mathbf {Q-P}\right ) \end {Bmatrix} =\begin {Bmatrix} \left [ A\right ] \mathbf {x}\\ \left [ A\right ] \left ( \mathbf {p\times q}\right ) +\left [ D\right ] \left [ A\right ] \left ( \mathbf {q-p}\right ) \end {Bmatrix} \]

But \(\mathbf {q-p=x}\). Hence the above becomes

\[\begin {Bmatrix} \mathbf {X}\\ \mathbf {P}\times \mathbf {X}\end {Bmatrix} =\begin {Bmatrix} \left [ A\right ] \mathbf {x}\\ \left [ A\right ] \left ( \mathbf {p\times q}\right ) +\left [ D\right ] \left [ A\right ] \mathbf {x}\end {Bmatrix} \]

Now substitute \(\mathbf {q=p+x}\) in the above we obtain

\begin{align*}\begin {Bmatrix} \mathbf {X}\\ \mathbf {P}\times \mathbf {X}\end {Bmatrix} & =\begin {Bmatrix} \left [ A\right ] \mathbf {x}\\ \left [ A\right ] \left ( \mathbf {p\times }\left ( \mathbf {p+x}\right ) \right ) +\left [ D\right ] \left [ A\right ] \mathbf {x}\end {Bmatrix} \\ & =\begin {Bmatrix} \left [ A\right ] \mathbf {x}\\ \left [ A\right ] \left ( \mathbf {p\times p+p\times x}\right ) +\left [ D\right ] \left [ A\right ] \mathbf {x}\end {Bmatrix} \\ & =\begin {Bmatrix} \left [ A\right ] \mathbf {x}\\ \left [ A\right ] \left ( \mathbf {p\times x}\right ) +\left [ D\right ] \left [ A\right ] \mathbf {x}\end {Bmatrix} \\ & =\begin {bmatrix} A & 0\\ DA & A \end {bmatrix}\begin {Bmatrix} \mathbf {x}\\ \mathbf {p\times x}\end {Bmatrix} \end{align*}

Hence

\[ \lbrack \hat {T}]=\begin {bmatrix} A & 0\\ DA & A \end {bmatrix} \]

The we can write

\[\begin {Bmatrix} \mathbf {X}\\ \mathbf {P}\times \mathbf {X}\end {Bmatrix} =[\hat {T}]\begin {Bmatrix} \mathbf {x}\\ \mathbf {p}\times \mathbf {x}\end {Bmatrix} \]

We now analyze the spatial displacement of the screw axis under \([\hat {T}]\)

Recall that a screw axis Plücker coordinates are written as

\[\begin {Bmatrix} \mathbf {W}\\ \mathbf {p\times W}\end {Bmatrix} =\begin {Bmatrix} \omega \mathbf {S}\\ \omega \mathbf {p}\times \mathbf {S+}\omega p_{\omega }\mathbf {S}\end {Bmatrix} \]

Where \(\mathbf {S}\) is the unit vector in along the axis, \(\mathbf {p}\) is the ﬁxed reference point on the axis and \(p_{\omega }\) is the screw pitch and \(\omega =\left \Vert \mathbf {W}\right \Vert \) where \(\mathbf {W}\) is the Rodrigues vector. To make things more clear, we illustrate these quantities in the following diagram

We now perform spatial displacement on the screw axis using its Plücker coordinates

\begin{equation}\begin {Bmatrix} \omega \mathbf {S}\\ \omega \mathbf {P}\times \mathbf {S+}\omega p_{\omega }\mathbf {S}\end {Bmatrix} \tag {6C}\end{equation}

Rewrite the above as general plücker coordinates \(\begin {Bmatrix} \mathbf {W}\\ \mathbf {V}\end {Bmatrix} \) then the spatial displacement of this general line is as seen above in (6B) becomes

\[\begin {Bmatrix} \left [ A\right ] \mathbf {w}\\ \lbrack D][A]\mathbf {w}+\left [ A\right ] \mathbf {v}\end {Bmatrix} \]

We need seek to evaluate the above coordinates for the screw axis given in (6C)

In other words, given \(\mathbf {v}=\omega \mathbf {p}\times \mathbf {s+}\omega p_{\omega }\mathbf {s}\) and \(\mathbf {W}=\omega \mathbf {S}\) we need to ﬁnd \(\left [ A\right ] \mathbf {v,}\left [ A\right ] \mathbf {w}\) and \([D][A]\mathbf {w}\)

The ﬁrst plücker coordinate \(\omega \mathbf {S}\) transforms easily as

\begin{align*} \omega \mathbf {S} & \mathbf {=}\omega \left [ A\right ] \mathbf {s}\\ & \mathbf {=}\mathbf {\left [ A\right ] }\omega \mathbf {\mathbf {s}}\end{align*}

But \(\omega \mathbf {\mathbf {s}}\) is just the Rodrigues vector in the new coordinates system which we call lower case \(\mathbf {w}\) hence

\[ \omega \mathbf {S=\left [ A\right ] w}\]

Now we need to transform the second plücker coordinate \(\omega \mathbf {p}\times \mathbf {s+}\omega p_{\omega }\mathbf {s}\)

With the help of the \([D]\) matrix which can be used to rewrite the cross product of 2 vectors as \([D]\) times one the 2 vectors, we can write

\[ \left [ D\right ] \mathbf {S=d\times S}\]

But \(\mathbf {S=}\left [ A\right ] \mathbf {s}\) hence the above becomes

\[ \left [ D\right ] \left [ A\right ] \mathbf {s=d\times S}\]

And since \(\omega \) is a scalar, we can write the above as

\begin{align} \omega \left [ D\right ] \left [ A\right ] \mathbf {s} & \mathbf {=}\omega \mathbf {d\times S}\nonumber \\ \left [ D\right ] \left [ A\right ] \left ( \omega \mathbf {s}\right ) & =\omega \mathbf {d\times S}\nonumber \\ \left [ D\right ] \left [ A\right ] \mathbf {w} & \mathbf {=}\omega \mathbf {d\times S} \tag {6D}\end{align}

Now we need to compute \(\left [ A\right ] \mathbf {v}\) which is \(\left [ A\right ] \left ( \omega \mathbf {p}\times \mathbf {s+}\omega p_{\omega }\mathbf {s}\right ) \)

\begin{align} \left [ A\right ] \left ( \omega \mathbf {p}\times \mathbf {s+}\omega p_{\omega }\mathbf {s}\right ) & =\omega \left ( \left [ A\right ] \left ( \mathbf {p}\times \mathbf {s}\right ) +\omega p_{\omega }\left [ A\right ] \mathbf {s}\right ) \nonumber \\ & =\omega \left ( \left ( \left [ A\right ] \mathbf {p}\times \left [ A\right ] \mathbf {s}\right ) +\omega p_{\omega }\left [ A\right ] \mathbf {s}\right ) \nonumber \\ & =\omega \left ( \left ( \left [ A\right ] \mathbf {p}\times \mathbf {S}\right ) +\omega p_{\omega }\mathbf {S}\right ) \tag {6E}\end{align}

Hence

\[\begin {Bmatrix} \mathbf {W}\\ \mathbf {V}\end {Bmatrix} =\begin {Bmatrix} \left [ A\right ] \mathbf {w}\\ \lbrack D][A]\mathbf {w}+\left [ A\right ] \mathbf {v}\end {Bmatrix} \]

6 The screw axis of a displacement

Here we show that the screw axis is invariant of the \(6\times 6\) transformation matrix \([\hat {T}]\) derived in the last section.

Given the screw axis line \(S\) deﬁned by its plucker coordinates \(\begin {bmatrix} \mathbf {S}\\ \mathbf {V}\end {bmatrix} \) we need to show the following

\begin{equation} S=[\hat {T}]S \tag {1}\end{equation}

(1) can be written as

\begin{align} S-[\hat {T}]S & =0\nonumber \\ \left [ I-\hat {T}\right ] S & =0 \tag {2}\end{align}

Now, if we can ﬁnd solution to the above others than \(S=0\) then we have showed that (1) is valid. Equation (1) can be written as

\[ \left [ I-\hat {T}\right ] \begin {bmatrix} \mathbf {S}\\ \mathbf {V}\end {bmatrix} =0 \]

But

\[ \lbrack \hat {T}]=\begin {bmatrix} A & 0\\ DA & A \end {bmatrix} \]

Hence we obtain

\begin{align*} \left [ \begin {bmatrix} I & 0\\ 0 & I \end {bmatrix} -\begin {bmatrix} A & 0\\ DA & A \end {bmatrix} \right ] \begin {bmatrix} \mathbf {S}\\ \mathbf {V}\end {bmatrix} & =0\\ & \\\begin {bmatrix} I-A & 0\\ -DA & I-A \end {bmatrix}\begin {bmatrix} \mathbf {S}\\ \mathbf {V}\end {bmatrix} & =0\\ & \\\begin {bmatrix} \left ( I-A\right ) \mathbf {S}\\ -\left [ D\right ] \left [ A\right ] \mathbf {S+}\left [ I-A\right ] \mathbf {V}\end {bmatrix} & =0 \end{align*}

Hence we obtain 2 equations

\[ \left \{ \begin {array} [c]{l}\left ( I-\left [ A\right ] \right ) \mathbf {S=0}\\ -\left [ D\right ] \left [ A\right ] \mathbf {S+}\left [ I-A\right ] \mathbf {V=0}\end {array} \right . \]

From the ﬁrst equation we obtain \(\left [ A\right ] \mathbf {S=S}\) substitute into the second equation

\begin{align*} -\left [ D\right ] \mathbf {S+}\left [ I-A\right ] \mathbf {V} & \mathbf {=0}\\ \left [ I-A\right ] \mathbf {V} & \mathbf {=}\left [ D\right ] \mathbf {S}\end{align*}

Introduce \(\left [ D\right ] \mathbf {S=-}\left [ S\right ] \mathbf {d}\) hence the above becomes

\begin{equation} \left [ I-A\right ] \mathbf {V=-}\left [ S\right ] \mathbf {d} \tag {3}\end{equation}

And from the cayley’s formula for \(A\)

\[ A=[I-B]^{-1}[I+B] \]

Then (3) becomes

\begin{align*} \left [ I-[I-B]^{-1}[I+B]\right ] \mathbf {V} & \mathbf {=-}\left [ S\right ] \mathbf {d}\\ \left [ I-\frac {[I+B]}{[I-B]}\right ] \mathbf {V} & \mathbf {=-}\left [ S\right ] \mathbf {d}\end{align*}

Hence

\begin{align} \left [ \mathbf {[}I-B\mathbf {]}-[I+B]\right ] \mathbf {V} & \mathbf {=[}I-B\mathbf {]}\left ( \mathbf {-}\left [ S\right ] \mathbf {d}\right ) \nonumber \\ -2\left [ B\right ] \mathbf {V} & =\mathbf {[}I-B\mathbf {]}\left ( \mathbf {-}\left [ S\right ] \mathbf {d}\right ) \nonumber \\ \left [ B\right ] \mathbf {V} & =\frac {1}{2}\mathbf {[}I-B\mathbf {]}\left [ S\right ] \mathbf {d} \tag {4}\end{align}

But we know that \(\left [ B\right ] =\tan \left ( \frac {\theta }{2}\right ) \left [ S\right ] \) where \(\theta \) is the rotation angle, and \([S]=\begin {bmatrix} 0 & -s_{z} & s_{y}\\ s_{z} & 0 & -s_{x}\\ -s_{y} & s_{x} & 0 \end {bmatrix} \) hence (4) becomes

\begin{align*} \tan \left ( \frac {\theta }{2}\right ) \left [ S\right ] \mathbf {V} & =\frac {1}{2}\mathbf {[}I-B\mathbf {]}\left [ S\right ] \mathbf {d}\\ \left [ S\right ] \mathbf {V} & =\left [ S\right ] \frac {1}{2\tan \left ( \frac {\theta }{2}\right ) }\mathbf {[}I-B\mathbf {]d}\end{align*}

Hence

\begin{equation} \mathbf {V}=\frac {1}{2\tan \left ( \frac {\theta }{2}\right ) }\mathbf {[}I-B\mathbf {]d} \tag {5}\end{equation}

Hence we showed that a non zero solution for (2) exist given by \(S=\begin {bmatrix} \mathbf {S}\\ \mathbf {V}\end {bmatrix} \) where \(V\) is given in (5). This shows that (1) is valid which is what we wanted to show.

Therefore the screw axis \(S\) is invariant of the \(6\times 6\) transformation matrix \([\hat {T}]\).

7 References

Geometric Design Of Linkages. By Professor J.Michael McCarthy. Springer publication.
Introduction to Theoretical Kinematics. By Professor J.Michael McCarthy
Class notes, MAE245. Theoretical Kinematics spring 2004. UCI. Professor J.Michael McCarthy