How to Represent Multi-Dimensional Rotations, Including Boosts 1

Transcription

How to Represent Multi-Dimensional Rotations, Including Boosts 1
1
How to Represent
Multi-Dimensional Rotations,
Including Boosts
John Denker
1
Introduction
In this document, we discuss rotations, including simple rotations in the plane but also
including compound rotations around multiple axes in three or more dimensions. We briefly
survey four ways of pictorially representing rotations: two vectors in the plane of rotation,
triad before and after rotation, axis plus amount of rotation, and yaw/pitch/roll. These
can (respectively) be formalized in terms of (respectively) Clifford algebra i.e. quaternions,
matrices, Rodrigues vectors, and Euler angles. See section 11.
Also, there is a deep relationship between ordinary rotations (in D = 3 space) and boosts1
(in D = 1 + 3 spacetime). Therefore we would like to represent rotations in a way that
is consistent with special relativity. In fact Clifford algebra makes the generalization from
ordinary space to spacetime as simple as it could possibly be: it suffices to change one minus
sign in one equation. See section 5.
We discuss the Clifford algebra representation in some detail, because it is ideal for keeping
track of rotations per se, especially if there are many different rotations to keep track of. It
is elegant, it is efficient, and it is easily converted to any other representation. This has the
pedagogical advantage of requiring only a small step beyond an elementary understanding
of vectors. In particular, matrices are not required. (If you’re not familiar with matrices,
just skip the few places in this document that mention matrices. You will still be able to
represent compound rotations – including boosts – in arbitrarily-many dimensions, using
Clifford algebra alone.) This representation is older than you might think, considerably
older than any notion of vector cross product (reference 1, reference 2, and reference 3).
The matrix representation is particularly efficient if you have one particular rotation and
wish to apply it repeatedly, using it to rotate a large number of vectors. The advantage is
most conspicuous in four or more dimensions. See section 7.
Being able to deal with rotations has many practical applications. For instance, suppose
you want to build an autopilot or a flight simulator. You need to be able to figure out the
overall effect of a long sequence of rotations about multiple axes.
Most people, unless they are unusually well trained or unusually gifted, have a hard time
visualizing rotations in D = 3 or higher. For example, here’s a puzzle: suppose you apply
1
A boost is just a change in velocity.
CONTENTS
2
90 degrees of yaw followed by 90 degrees of roll. What’s the overall effect? Answer: it is
a 120 degree rotation, and the plane of rotation is given by x + y + z = 0. It can also be
seen as a cyclic permutation of the x, y, and z axes. Most people find this puzzle somewhat
discombobulating the first time they see it. See section 4.4.
Contents
1 Introduction
1
2 The Product-of-Vectors Representation
3
2.1
Half-Angles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4
3 Digression: Clifford Algebra
4
4 How To Do Calculations
6
4.1
General Procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6
4.2
A First Example: 180 Degree Rotation . . . . . . . . . . . . . . . . . . . . .
7
4.3
Normalization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7
4.4
Another Simple Example: Compound Rotation in D = 3 . . . . . . . . . . .
8
4.5
Rotations in a Rotating Frame . . . . . . . . . . . . . . . . . . . . . . . . . .
10
4.6
Bootstrapping Small Angles to Large Angles . . . . . . . . . . . . . . . . . .
12
5 Spacetime and Boosts
14
6 Four or More Dimensions
18
7 Representation and Computation
21
7.1
Double Coverage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
21
7.2
Basis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
22
7.3
Dimensions; Number of Components . . . . . . . . . . . . . . . . . . . . . .
23
7.4
Computational Load . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
24
8 Example: Combining Rotations in VRML
25
2 THE PRODUCT-OF-VECTORS REPRESENTATION
3
9 Rotations in Terms of Reflections
27
10 Quaternions and Pauli Matrices in terms of Clifford Algebra
28
11 Survey of Ways to Represent Rotations
29
12 Clifford Algebra Desk Calculator
31
13 References
35
2
The Product-of-Vectors Representation
Rather than talking about the axis of rotation, we choose to emphasize the plane of rotation.
This has many advantages; among other things, it works equally well in D = 2 flatland, in
D = 3 space, in D = 1 + 3 spacetime, et cetera. This permits unification and simplification
of many ideas.
Also, once you get used to it, the plane of rotation just “looks” more natural than an axis
of rotation. If you are sitting in an aircraft, you can see the plane of rotation for yawwise rotations spread out in front of you, running left/right. It is somewhat less natural to
visualize the vertical axis (even though the two representations are technically equivalent in
D = 3). Similarly it is natural to think of the pitching motion as motion in a vertical plane,
rather than as motion around a horizontal axis.
As we shall see below, you can encode both the plane of rotation and the amount of rotation
by specifying two vectors. The plane containing the two vectors is the plane of rotation, and
the angle between the two vectors tells us something about the angle of rotation; specifically,
it tells us the half-angle, as will be discussed in section 2.1.
The exact choice of vectors doesn’t matter; there are many different pairs of vectors that
specify the same rotation. In the language of Clifford Algebra, this means that only the
product of the two vectors matters. For an introduction to Clifford algebra, and its application
to geometry and physics, see reference 4, reference 5, reference 6, and reference 7.
This technique (representing a rotation as a product of vectors) is Lorentz-invariant. That
is, if Alice is using a frame that is rotated relative to Bob’s frame, and also moving relative
to Bob’s frame, everybody agrees as to what rotation is represented by a given product-ofvectors.
There are no singularities in the product-of-vectors representation i.e. Clifford algebra i.e.
quaternions. In contrast, there are nasty singularities in the Euler angle representation, as
discussed in section 11.
3 DIGRESSION: CLIFFORD ALGEBRA
2.1
4
Half-Angles
There is one slight quirk with the product-of-vectors representation. The angle between the
vectors cannot directly represent the whole angle of rotation. To see why not, consider 180
degree rotations. If you start out heading north then apply 180 degrees of yaw, you wind
up heading south, right-side up. In contrast, if you start out heading north and apply 180
degrees of pitch, you wind up facing south upside down.
The problem is that two vectors with a 180 degree angle between them are collinear, so there
are many inequivalent planes that contain both vectors.
There is a simple way to fix this problem: Let the angle between the vectors represent half
the angle of rotation. That is, given two vectors in the plane of rotation, we define the rotor
angle to be the angle between the two vectors. The rotor represents a rotation, where the
rotation angle is twice the rotor angle. Loosely speaking, we say a rotor is half of a rotation.
A 180 degree rotor represents a 360 degree rotation. For this special rotation, you don’t
need to specify the plane of rotation, so the fact that these two vectors are collinear isn’t a
problem. It all works out.
This half-angle business may seem like a kludge, but in fact it has a deep physical significance.
One way to see the significance is in terms of reflections, as discussed in section 9.
The rotor angle is half the rotation angle.
3
Digression: Clifford Algebra
In the previous section we argued on geometric and pictorial grounds that two vectors in the
plane of rotation should provide a nice representation of rotations.
That leaves us with the question of how best to quantify this notion. We shall see that
Clifford algebra is the perfect tool for this. Sometimes the same ideas are discussed in terms
of quaternions, which correspond to a subset of Clifford algebra, as discussed in section 10.
Clifford algebra is not very complicated; it is only a few small steps beyond ordinary vector
algebra. It is amazingly elegant, and is useful for many, many things, not just rotations.
Since this document is focussed on rotations, this is not the place for a general tutorial on
Clifford algebra. Instead we rely on reference 4 and reference 7.
To proceed, you need to appreciate the existence of scalars, vectors, and bivectors. (There
also exist trivectors et cetera, although we have no immediate need of them.) You should
learn to visualize such things, as in figure 1. We will explain how to form the sums and
geometric products of such things.
3 DIGRESSION: CLIFFORD ALGEBRA
Scalar
Vector
P
5
Bivector
P
Q
Trivector
R
P
Q
Figure 1: Scalar, Vector, Bivector, and Trivector
However, to make this document at least formally self-contained, we recite here the most
crucial properties of Clifford algebra. (If you want to understand the concepts the lie behind
this formalism, please see the references, especially reference 4.
Suppose we can find three linearly-independent spacelike vectors. That means we must live
in a space with at least three dimensions. (The generalization to any number of dimensions,
from D = 2 on up, is straightforward.) Applying the Gram-Schmidt algorithm to these
vectors, we can construct three orthonormal vectors, namely γ1 , γ2 , and γ3 . We do not
attribute any special properties to these vectors, beyond being orthonormal and spacelike;
in particular they do not need to be aligned with the cardinal directions up/down, east/west,
or anything like that.
Multiplication of orthogonal vectors is anticommutative, and the product is a bivector:
γi γj = −γj γi
(bivector)
(1)
for all i 6= j.
The product of parallel vectors is a scalar. Spacelike unit vectors are normalized like this:
γi γi = +1 (scalar)
(2)
for all i.
Sometimes we also notice that timelike vectors exist. The timelike unit vector (γ0 ) is normalized differently:
γ0 γ0 = −1 (scalar)
(3)
Clifford Algebra also defines the reverse of a product of vectors, formed by writing all the
vectors in the reverse order. For example:
IfC
= a + b γ1 γ2
(4)
thenC ∼ = a + b γ2 γ1
for any scalars a and b.
4 HOW TO DO CALCULATIONS
6
We can use our set of orthonormal vectors as a basis, expressing any vector in the space as
a linear combination of basis vectors.
This approach – expressing everything in terms of components relative to a given basis – is
definitely not the most elegant approach, and is usually not even the easiest approach, but
it seems expedient in this case, for a couple of reasons: (1) Many of the illustrative examples
(below) revolve around perpendicular vectors, and it’s nice to have a set of such vectors lying
around. (2) Computers are good at manipulating numbers, but not so good at manipulating
real physical objects like vectors and bivectors. So in computer programs, such as the one
mentioned in section 12, at some point we need to project our vectors (etc.) onto a particular
basis, and manipulate the components.
In contrast, if you want to see what the more-physical less-numerical approach looks like,
see reference 4.
4
How To Do Calculations
4.1
General Procedure
Now we are in a position to quantify the idea of using two vectors to represent the plane of
rotation and the amount of rotation. In section 2 we introduced, qualitatively, the idea of
rotor angle. We now define a simple rotor to be the product of two vectors, normalized to
unity (as discussed section 4.3). By product we mean the geometric product, as defined by
Clifford algebra. (Non-simple rotors will be discussed in section 6.)
Note that there is a one-to-one correspondence between quaternions and a subalgebra of Clifford algebra, as discussed in section 10.)
Also note that a rotor is not a bivector; in general it has a scalar piece as well as
a bivector piece.
We will show that equation 5 is the completely general formula for using a rotor r to rotate
a vector v. We have not yet proved or even motivated2 this result; instead we pull the
formula out of thin air and then show, in retrospect, that it has the desired behavior. The
key formula is:
v 0 = r∼ v r
2
(5)
It is possible to motivate equation 5 in terms of reflections, as discussed in section 9. However, this seems
like robbing Peter to pay Paul, since the reflection formula is not particularly more intuitive than equation
5, and is usually just pulled out of thin air and justified a posteriori.
4 HOW TO DO CALCULATIONS
7
where v is the unrotated vector, v 0 is a vector that is rotated relative to v by some angle δ,
and r is is a rotor with rotor angle ≡ δ/2.
That’s all there is to it. Given two vectors in the plane of rotation, you can use them to build
a rotor r. Then you can use the rotor r to perform rotations in accordance with equation 5.
Compound rotations are represented by a product of rotors in the obvious way; see section
8 and section 12 for details and reference 8 for luridly explicit details.
4.2
A First Example: 180 Degree Rotation
The simplest possible rotor is the product γ1 γ2 . Since the vectors γ1 and γ2 are perpendicular,
the rotor angle must be 90 degrees, and the corresponding rotation angle is 180 degrees. Let’s
do the math:
(γ2 γ1 )γ1 (γ1 γ2 ) = (γ2 γ1 )γ2
= −(γ1 γ2 )γ2
(6)
= −γ1
where all we needed were the axiomatic anticommutation relations (equation 1) and the fact
that multiplication is associative. Similarly we have:
(γ2 γ1 )γ2 (γ1 γ2 ) = −(γ2 γ1 )γ2 (γ2 γ1 )
= −(γ2 γ1 )γ1
(7)
= −γ2
whereas in contrast, vectors perpendicular to the γ1 γ2 plane are unaffected by the rotation:
(γ2 γ1 )γ3 (γ1 γ2 ) = +γ3
(8)
and in general, for any arbitrary vector in D = 3 space:
If
v = a γ1 + b γ2 + c γ3
Then v 0 = (γ2 γ1 )(a γ1 + b γ2 + c γ3 )(γ1 γ2 )
= −a γ1 − b γ2 + c γ3
(9)
which is exactly the correct behavior for a 180 degree rotation in the γ1 γ2 plane. (Here a,
b, and c are arbitrary scalars.) This rotation is depicted in figure 2.
It is worth emphasizing that this result is a direct consequence of the axioms of Clifford
algebra, plus our decision to represent a simple rotor as a product of vectors.
4.3
Normalization
If we want equation 5 to represent rotations, we must choose vectors such that their product
is normalized to unity. Without this constraint, we would be inadvertently representing
size-changing transformations as well as rotations. It suffices for the rotor to be a product
of unit vectors, but in all generality only the product must be normalized. That is, if we
4 HOW TO DO CALCULATIONS
8
v
γ
2
Rotor
Angle
γ
Rotation
Angle
1
v’
Figure 2: Rotor Angle and Rotation Angle
have a rotor r which is the geometric product of spacelike vectors P and Q, i.e. r = P Q, it
is sufficient but not necessary for the vectors P and Q to be separately normalized. All we
really require is that:
gorm(r) = 1
(10)
where the gorm of r is, by definition, the scalar part of r∼ r. For example, the gorm of
(a + b γ2 γ3 ) is equal to (a2 + b2 ), for arbitrary scalars a and b. For details, see reference 4.
4.4
Another Simple Example: Compound Rotation in D = 3
Let’s return to the puzzle posed in section 1. That is, suppose you apply 90 degrees of yaw
followed by 90 degrees of roll. What’s the overall effect?
We can easily compute the answer using rotors. We start with the rotor
cos(45◦ ) + sin(45◦ )γ2 γ3
(11)
which we hope will represent a 45 degree rotor angle, and hence a 90 degree rotation in the
γ2 γ3 plane. You can verify this by considering the product of r with itself, namely:
√
√
√
√
( (.5) + (.5)γ2 γ3 )( (.5) + (.5)γ2 γ3 ) = γ2 γ3
(12)
which we recognize as the 90 degree rotor angle (i.e. 180 degree rotation) that we saw in
section 4.2.
Similarly, the rotor
cos(45◦ ) + sin(45◦ )γ3 γ1
represents a 45 degree rotor angle (i.e. 90 degree rotation) in the γ3 γ1 plane.
(13)
4 HOW TO DO CALCULATIONS
9
If we multiply these two rotors together, we get an interesting result:
√
√
√
√
( (.5) + (.5)γ3 γ1 )( (.5) + (.5)γ2 γ3 ) = .5 + .5 γ1 γ2 + .5 γ2 γ3 + .5 γ3 γ1
γ3 +γ3 γ1
= cos(60◦ ) + sin(60◦ ) γ1 γ2 +γ√2(3)
(14)
and we can, on sight, identify the rotor angle as 60 degrees (corresponding to a 120 degree
rotation), and we can see that the plane of rotation is specified by x + y + z = 0. That is
equivalent to saying the axis of rotation is a vector perpendicular to the x + y + z = 0 plane,
√
i.e. a vector pointing in the [1, 1, 1] direction. (The factor of 3 in the denominator is so
that the fraction as a whole is a unit bivector, i.e. the bivector has gorm=1.)
For readers who are familiar with matrices, we mention that the rotation matrix
corresponding to the rotor in equation 14 is


0 0 1


(15)
 1 0 0 
0 1 0
One advantage of the matrix representation is that it shows quite clearly that
the rotation in question produces a cyclic permutation of the γ1 , γ2 , and γ3 axes
... as advertised in section 1.
The disadvantage is that for most people, it is hard to ascertain the plane of
rotation by looking at a typical rotation matrix.
Here is the general rule for combining rotations: If you carry out a rotation described by
rotor r1 and then follow it by another rotation described by rotor r2 , the overall rotation can
be described by the rotor r, where:
r = r1 r2
(16)
That’s all there is to it; you just multiply the rotors, in order.
When we say the rotors appear “in order” in equation 16, that means left-to-right. That’s
not because we read from left to right, but rather because when we rotate a vector we want
r1 to be applied first and r2 to be applied second, in accordance with equation 5. When we
have a compound rotation, we can expand equation 5 as follows:
v 00 = r∼ v r
= (r1 r2 )∼ v r1 r2
(17)
= r2∼ r1∼ v r1 r2
= r2∼ (r1∼ v r1 )r2
The point is that when we write r1 and r2 in the correct order, the first rotor stands next
to the vector v in equation 17 and therefore gets applied first in accordance with the usual
rules of arithmetic, as shown by the parentheses in the last line of equation 17. Note that r1∼
also stands next to v on the left just as r1 stands next to v on the right, which is consistent
with the fact that (r1 r2 )∼ = r2∼ r1∼ .
4 HOW TO DO CALCULATIONS
10
Philosophical remark: In this section, and also in section 4.5, we are treating
rotations as objects unto themselves. That is, we focus on the rotation operators
directly. In this section, paid little attention to using the rotors to rotate this-orthat vector; instead we mainly considered the effect of one rotation on another.
4.5
Rotations in a Rotating Frame
In the previous section, we expressed the rotors in terms of basis vectors (γ1 , γ2 , and γ3 )
that remained fixed in space. That seems so natural and reasonable that non-experts might
imagine that it is the only reasonable way of doing business ... but it is not.
aw
<Y
Pitch >
< Roll
In aircraft (as well as boats and spacecraft) one way to describe rotations is in terms of
yaw, pitch, and roll, as defined in figure 3. Rotations defined in this way require special
treatment, because the axes are attached to the aircraft, not fixed in space. Therefore, if the
aircraft turns, the new yaw-wise direction is different from the old yaw-wise direction ... and
similarly for pitch and roll.
copyright © 2002 jsd
Figure 3: Yaw, Pitch, and Roll
Using axes attached to the aircraft is entirely conventional and is entirely sensible from the
pilot’s point of view.
It is remarkably easy to switch back and forth between axes attached to the aircraft and
axes fixed in space, as we now explain:
Conventionally, the aircraft axes are called X, Y , and Z. That means yaw is a rotation in
the XY plane, pitch is a rotation in the ZX plane, and roll is a rotation in the Y Z plane.
4 HOW TO DO CALCULATIONS
11
Consider the following scenario: Initially, the aircraft axes {X, Y , Z} happen to coincide
with the fixed-in-space axes {γ1 , γ2 , γ3 }.
The pilot begins by performing some amount of roll. This is a rotation in the Y Z plane,
which is also a rotation in the γ2 γ3 plane. Let this rotation be represented by rotor r1 . Next,
the pilot performs some amount of pitch. This is a rotation in the ZX plane ... but it is
not a rotation in the γ3 γ1 plane. We need to take into account that the current ZX plane
is different from the original ZX plane. Let the second rotation be represented by the rotor
r2 = a + b Z X, for suitable scalars a and b ... where Z and X are the current Z and X
vectors.
It is easy to describe the current ZX plane in terms of things we already know. We can
use the rotor r1 to rotate the original Z vector and also to rotate the original X vector.
Specifically,
Z = r∼ γ3 r
(18)
X = r∼ γ1 r
so the compound rotation r1 r2 can be expressed as
r1 r2 = r1 (a + b(r1∼ γ3 r1 r1∼ γ1 r1 ))
= (a + b γ3 γ1 ) r1
(19)
where it should be noted that on the LHS of the equation, r1 is the leftmost factor, while on
the RHS of the equation, r1 is the rightmost factor. Also on the RHS, the factor in front of
r1 is definitely not equal to r2 , but looks hauntingly similar to r2 , since it involves the same
coefficients a and b, and involves vectors that were initially equal to Z and X.
This tells us something very interesting: If you know how to describe a rotation relative
to the axes attached to the aircraft, you can also describe it relative to axes fixed in space
using the same components, provided you multiply the rotors in the reverse order ... reversed
relative to equation 5.
This trick about reversing the order of the operations is not dependent on Clifford algebra
per se; it is a direct result of the basic geometry of rotations, and of how we have defined
the XY Z axes. The basic logic is that when you apply the N th rotation, if it comes to you
described relative to the aircraft axes, you need to undo the previous N − 1 rotations to
understand how it looks relative to the original axes. You perform it, then re-do the other
N −1 rotations. When you apply that logic at each step, the overall result is a complete endfor-end reversal of the order of the steps. You can find this discussed in classical mechanics
books (e.g. Goldstein) under the heading of “passive versus active transformations.”
This trick is valuable because the same routines you use for keeping track of rotations
relative to fixed axes can be used for keeping track of rotations relative to rotating axes,
with essentially zero extra work. In fact it is so easy that people sometimes forget that the
two schemes are conceptually different ... so be careful; remember which is which.
4 HOW TO DO CALCULATIONS
4.6
12
Bootstrapping Small Angles to Large Angles
In section 4.2 and section 4.4 we used the notion that orthogonality corresponds to a 90
degree angle to construct some interesting rotors. In this section, we derive more general
expressions, covering rotors with any angle whatsoever.
v’ = 2
γ
2
v =γ 2
γ
2
−εγ
+ε γ
1
v’ = γ 1
v = γ1
γ
1
Figure 4: Rotation by a Small Angle
Let’s consider the situation shown on the left side of figure 4. We start with a vector v equal
to γ1 and form another v 0 by adding a tiny displacement vector in a perpendicular direction,
so that:
If
v := γ1
(20)
then v 0 := γ1 + γ2
Similar words apply to the right side of figure 4. We start with a vector v equal to γ2 and
form another v 0 by adding a tiny displacement vector in a perpendicular direction, so that:
If
v := γ2
(21)
then v 0 := γ2 − γ1
Note that equation 20 has a plus sign, while equation 21 has a minus sign. This expresses a
very important fact about the geometry of space. The minus sign occurs in the latter and
not in the former because we are rotating in the γ1 γ2 direction, not in the opposite direction
(γ2 γ1 ).
Rotating a sum of vectors is the same as rotating each summand separately, so we can
combine equation 20 and equation 21 as follows:
If
v =
a γ1
+b γ2
(22)
then v 0 =
a γ1 − b γ1
+ a γ2 +
b γ2
We can take equation 22 as the definition of what we mean by rotation in the γ1 γ2 plane,
in the limit of small angles. We shall soon verify that this definition is consistent with
everything we already know about rotations. (Note that the angle is measured in radians.)
4 HOW TO DO CALCULATIONS
We can use the vectors v and v 0 from equation 20 to construct a rotor r, as follows:
r = v v0
= γ1 (γ1 + γ2 )
= 1 + γ1 γ2
13
(23)
where the last line is obtained simply by carrying out the indicated multiplications, and (as
usual) simplifying by use of the normalization condition, equation 2. It is an easy exercise
to show that taking the vectors v and v 0 from equation 21 (instead of equation 20) would
produce exactly the same rotor r in equation 23.
Let’s see what happens when we use this rotor to rotate something. We start with v = γ1
and create a new vector v 00 as follows:
v 00 = r∼ γ1 r
= (1 + γ2 γ1 )γ1 (1 + γ1 γ2 )
(24)
= γ1 + 2 γ2 + O(2 )
where the terms of order 2 can be neglected when is small.
Then, comparing the last line of equation 24 with equation 20, we find that v 00 is rotated
relative to v by the angle 2. That is, once again the rotation angle is twice the rotor angle.
Now we have all the fundamentals in place. We can start reaping the rewards.
The first thing to do is to consider larger rotations. Since we have constructed a rotationallyinvariant representation of rotations, it is easy to represent repeated rotations, just by piling
on additional copies of the rotation operator in accordance with equation 16.
Applying this idea, we can investigate what happens if we apply N copies of an infinitesimal
rotation:
v 0 = (1 + γ2 γ1 )N v (1 + γ1 γ2 )N
(25)
where it is our intention that v 0 be a vector rotated relative to v by the angle N .
Now, the powers on the RHS have a very interesting structure. In the limit of small , we
can write
(1 + γ1 γ2 )N = exp(N γ1 γ2 )
(26)
and we can expand the exponential in the familiar power series. (Equivalently, you can
expand the LHS using the binomial theorem.) The result might look messy at first, but
it can be greatly simplified by using the fact that γ1 γ2 γ1 γ2 = −1 whenever γ1 and γ2 are
orthonormal and spacelike.
In the limit of large N , the first few terms of the series are, after simplification:
1
1
1
exp(θ γ1 γ2 ) = 1 + θ γ1 γ2 − θ2 − θ3 γ1 γ2 + θ4 + · · ·
2
3!
4!
(27)
5 SPACETIME AND BOOSTS
14
where any angle θ can be written as a multiple of , that is, θ := N . Although is small,
we are not assuming that θ is small.
Collecting all the scalar terms, we recognize the series for cos(θ). The remaining terms
remind us of the series for sin(θ). (Indeed, if you don’t recognize the power series for sin
and cos functions, you could perfectly well use the power series to define those functions,
and derive therefrom all the functions’ interesting properties, using the methods described
in reference 9.)
In any case, we discover that:
r(θ) = cos(θ) + γ1 γ2 sin(θ)
(28)
which is a wonderful result. We did not start out assuming that rotations would be periodic.
All we did is turn the crank on the formalism, and it told us that rotations are periodic.
Let’s see what happens if we apply the rotor in equation 28 to a vector, according to the
prescription in equation 5. We get
[cos(θ) + γ2 γ1 sin(θ)] γ1 [cos(θ) + γ1 γ2 sin(θ)]
= (cos2 (θ) − sin2 (θ)) γ1 + 2 cos(θ) sin(θ) γ2
(29)
= cos(2θ) γ1 + sin(2θ) γ2
where we have used the trigonometric double-angle identities. We find that the rotor angle
is half the rotation angle, even for non-infinitesimal angles.
5
Spacetime and Boosts
By definition, spacetime refers to any system where we have one or more timelike dimensions,
in addition to one or more spacelike dimensions. The most familiar example is D = 1 + 3
spacetime, where we have one timelike dimension and three spacelike dimensions, but other
possibilities should not be ruled out.
In spacetime, it is useful to categorize rotations as follows:
• There are rotations where the plane of rotation is purely spacelike. Unsurprisingly,
these are called spacelike rotations. These are the ordinary rotations with which you
are familiar.
• There are rotations where the plane of rotation is spanned by one timelike vector and
one spacelike vector. These are called boosts.
• In the uncommon case where we have multiple timelike directions, there can be rotations where the plane of rotation is spanned by two linearly-independent timelike
vectors. These doubly-timelike rotations are so uncommon – and mathematically so
similar to purely spacelike rotations – that we will have nothing further to say about
them.
5 SPACETIME AND BOOSTS
15
The physical interpretation is that boosting an object changes its velocity, just as rotating
a line changes its slope. For more about the physical meaning of boosts, see reference 10.
The effect of a typical boost is depicted in figure 5. You can see that is analogous to – but
not identical to – figure 4.
+
v’ = γ 1
εγ
0
0 + ε γ1
v =γ 0
0
v’ = γ
γ
v = γ1
γ
1
Figure 5: A Small Boost
The geometry and trigonometry of boosts is very similar to the familiar geometry and
trigonometry of spacelike rotations ... but not quite identical, as we now discuss.
Note: Some authors define “rotation” to include only spacelike rotations, excluding boosts. However, we wish to make a pedagogical and philosophical point,
by treating the spacelike and timelike dimensions on the same footing. We shall
see that they are as similar as they possibly could be, short of being absolutely
identical. We should get used to living in a four-dimensional universe.
Let’s analyze a boost, following the same recipe as in section 4.
If
v =
a γ0
+b γ1
0
then v =
a γ0 + b γ0
+ a γ1 + b γ1
(30)
We can take equation 30 as the definition of what we mean by rotation in the γ0 γ1 plane,
in the limit of small angles. This is analogous to – but not identical to – equation 22. In
particular, the minus sign in equation 22 has been replaced by a plus sign in equation 30.
This is the crucial difference between spacetime and ordinary Euclidean space.
Once again, we can construct rotors by forming the product of vectors:
r = v v0
= γ0 (γ0 + γ1 )
= −1 + γ0 γ1
= −1 − γ1 γ0
∼
= 1 + γ1 γ0
(31)
5 SPACETIME AND BOOSTS
16
where the last line was derived by multiplying the RHS by −1, and the ∼
= sign should be
interpreted as “having the same physical effect” since the rotor −r has the same physical
effect as the rotor r, in accordance with equation 5.
Note that both of the factors (γ0 ) and (γ0 + γ1 ) are timelike vectors. They come from the
RHS of figure 5. Also: It is an easy exercise to show that the exact same value of r could
have been obtained by multiplying two spacelike vectors from the LHS of figure 5, namely
(γ1 ) and (γ1 + γ0 ).
If we multiply together a large number of such rotors, we find an equation analogous to
equation 27, except that all the minus signs are turned into plus signs, because γ0 γ1 γ0 γ1 =
+1 for timelike γ0 and spacelike γ1 . So instead of equation 28, we get
r(θ) = cosh(θ) + γ1 γ0 sinh(θ)
(32)
that is, the rotor in the timelike direction is just the same, except that it uses hyperbolic
trig functions where spacelike rotors use circular trig functions. In this equation θ is called
the rotor angle.
The rotor in equation 32 can be written in various ways as the product of two unit vectors,
either two spacelike unit vectors or two timelike unit vectors. Examples include:
r(θ) = cosh(θ) + γ1 γ0 sinh(θ)
= γ1 [γ1 cosh(θ) + γ0 sinh(θ)]
(33)
∼
= γ0 [γ0 cosh(θ) + γ1 sinh(θ)]
If it’s not obvious, you should verify directly that the factor in square brackets is in fact a
unit vector.
Let’s see what happens if we apply the rotor in equation 32 to a vector, according to the
prescription in equation 5. In analogy to equation 29, we get
[cosh(θ) + γ0 γ1 sinh(θ)] γ1 [cosh(θ) + γ1 γ0 sinh(θ)]
= (cosh2 (θ) + sinh2 (θ)) γ1 + 2 cosh(θ) sinh(θ) γ0
(34)
= cosh(2θ) γ1 + sinh(2θ) γ0
= cosh(ρ) γ1 + sinh(ρ) γ0
where the last line is exactly what we would expect to obtain as the result of boosting γ1 by
a rapidity ρ. (See reference 11 for more about the idea of rapidity.) We see that the rapidity
is twice the rotor angle, even for non-infinitesimal angles: ρ = 2θ. The calculation involves
nothing more than carrying out the indicated multiplications, then using the hyperbolic
trigonometric double-angle identities.
These are stunning results. They are simultaneously elegant, easy to use, and very powerful.
See reference 10 for more about this.
Given two or more spacelike vectors {γ1 , γ2 , · · ·}, Clifford algebra gives us a nice representation of the rotation group. Given a timelike vector γ0 and the aforementioned spacelike
vectors, we get a nice representation of the entire Lorentz group. That is, we can represent
5 SPACETIME AND BOOSTS
17
any combination of boosts and rotations in any direction ... and the formalism treats them
all on the same footing, with a few caveats that will be discussed shortly.
We should have expected an intimate connection between boosts and rotations, because it
has long been known that a sequence of boosts (not all in the same direction) can be used
to produce a pure rotation.
Of course, space and spacetime have many things in common, but they are not exactly the
same. First let’s consider what they have in common:
In Euclidean space, γ1 is perpendicular to
γ2 . The product γ1 γ2 defines the plane
of rotation, and plays a crucial role in the
infinitesimal rotor r = 1 + γ1 γ2 .
In spacetime, γ0 is perpendicular to γ1 .
The product γ1 γ0 defines the plane of rotation, and plays a crucial role in the infinitesimal rotor r = 1 + γ1 γ0 .
Specifically, if we recall the definition of Specifically, if we recall the definition of
r(θ) from equation 28, γ1 γ2 is just the r(θ) from equation 32, γ1 γ0 is just the
derivative of r with respect to θ:
derivative of r with respect to θ:
dr dr = γ1 γ2
(35)
= γ1 γ0
(36)
dθ 0
dθ 0
If you don’t know what a derivative is, you can just ignore equation 35 and equation 36.
The geometry of space can be quantified The geometry of spacetime can be quanusing circular trig functions, such as sin() tified using hyperbolic trig functions, such
and cos().
as sinh() and cosh().
Now let’s consider the ways in which space and spacetime differ:
In the xy plane, you can turn x into y by In the tx plane, you cannot turn t into x by
a 90 degree rotation, and you can turn x any kind of rotation (including boosts) no
into −x by a 180 degree rotation.
matter how large or how small. Similarly
you cannot turn t into −t (reversing the
flow of time) by any kind of rotation. For
more on this, see reference 10.
By itself, the product γ1 γ2 is a large-angle
rotor. It is what we get from equation 28
when the rotor angle is π/2 radians. The
scalar component of the rotor goes to zero
at this point.
By itself, the product γ1 γ0 is not a rotor.
It has gorm equal to −1, whereas all rotors
must have gorm equal to +1. If you want
a large boost involving γ1 γ0 , you need to
√
write something like 2 + γ1 γ0 , which is
what we get from equation 32 when the
√
rotor angle is ln(1 + 2), i.e. about 0.881.
The scalar component of the rotor never
goes to zero.
6 FOUR OR MORE DIMENSIONS
18
Let’s be clear: At some point you may be tempted to think of γ1 γ0 as a large angle boost, in
analogy to γ1 γ2 ... but you must resist the temptation. That is, γ1 γ0 represents the plane
of rotation, and it is the derivative of a rotor, but it is not a rotor unto itself. To understand
why this must be so, consider the following argument: First of all, the set of all rotations is
a continuous family of transformations; that is, for every possible rotation, there are other
rotations nearby. The same goes for rotors: For every rotor, there are other rotors nearby.
Secondly, a rotor with gorm equal to 1 cannot be near a rotor with gorm equal to -1. Thirdly,
the rotor family is connected to the identity. That is, you can always set the rotor angle to
zero in equation 32 and get a trivial rotor that represents the identity transformation. This
trivial rotor has gorm equal to 1. All the rotors near the identity have gorm equal to 1, and
by induction all the rotors in the world have gorm equal to 1.
Aficionados might wish to define the concept of improper rotor, namely elements
of the even-grade subalgebra having gorm equal to −1. These represent improper
rotations, including reflections and the like. Details are beyond the scope of the
present discussion.
Also note that spacelike rotations and timelike rotations (aka boosts) do not cover all the
possibilities. It is possible to have rotors such as q := 1 + θ γ2 (γ0 + γ1 ) which is neither
timelike nor spacelike. The trick is that γ0 + γ1 is a null vector. This q has gorm equal to 1
for all values of θ. This is not as weird as it might at first seem; such a rotor might describe
a rocket that turns as it accelerates. This rotor q sits right on the dividing line, halfway
between boosts and ordinary spacelike rotations. This is another reason why it is unhelpful
to think of boosts as being different from rotations. It is better to lump them all together
under the name “rotation,” no matter whether the plane of rotation is spacelike, timelike, or
null. A boost in the x direction is just a rotation in the xt plane.
Also beware that when we draw a spacetime diagram, such as figure 5, we are using paper,
which has two spacelike dimensions, to represent the γ0 γ1 plane, which in reality has one
spacelike and one timelike dimension. As a result, the geometry of the diagram-on-paper is
not an entirely faithful representation of the geometry of spacetime. In particular, the true
notion of angle in the γ0 γ1 plane is not well represented in the diagram, especially when the
angle is large. This makes it difficult to develop intuition about angles in spacetime. However,
the mathematics is straightforward: just as the geometry of space can be quantified using
circular trig functions, the geometry of spacetime can be quantified using hyperbolic trig
functions, as you can see by comparing equation 29 with equation 34.
6
Four or More Dimensions
You might think that four dimensions is just like three dimensions, except 33% bigger. That
is almost true, but not quite. There are some things that happen in four dimensions that
are categorically different from what happens in three dimensions.
6 FOUR OR MORE DIMENSIONS
19
Unless otherwise stated, everything in this section applies to the case of four spacelike dimensions ... and also applies equally well to spacetime. That is, in this section we assume the
fourth dimension is spacelike (γ4 γ4 = 1), but very similar remarks apply when it is timelike
(γ4 γ4 = −1).
Executive summary: In this section, we will explain what we mean by the
following:
• In any number of dimensions from two on up, any rotation (including
boosts), and any combination of rotations (including boosts) can be represented by the equation V 0 = r∼ V r where r is an element of the even-grade
subalgebra.
• In two or three dimensions, but not four or more, we can represent an
arbitrary rotor as the product of vectors.
That means that the mathematics always works. The mathematics gets more
laborious as we move to higher dimensions, but the axioms remain the same,
the logic remains the same, and the basic pattern of the calculations remains
the same. You can use most of your intuition about two-dimensional and threedimensional rotations as a guide to arbitrary rotations – including boosts – in
four dimensions and higher.
One downside is that our ability to picture the most-general rotor as simply
the product of two vectors is impaired in four dimensions and higher. Another
downside is that four dimensions is considerably worse that 33% more laborious
than three dimensions; it is typically about twice as laborious, as discussed in
section 7.4.
Let’s do an example. Let’s pick two typical rotors (analogous to the ones we already encountered in equation 12) and see what happens when we multiply them. In four or more
dimensions, the following example is reasonably typical:
√
√
r1 :=
.5 + .5 γ1 γ2
√
√
r2 :=
.5 + .5 γ3 γ4
(37)
r := r1 r2
= .5 + .5 γ1 γ2 + .5 γ3 γ4 + .5 γ1 γ2 γ3 γ4
We see that this product contains only even terms: a scalar term, a couple of bivector term,
and a grade=4 term. (Some of these terms may vanish in special cases.)
If we are working in a three-dimensional space (which includes the case where we simply restrict our attention to a three-dimensional subset of a larger space), the situation is markedly
simpler. There is no such thing as γ4 in three dimensions, so if we try to make something
6 FOUR OR MORE DIMENSIONS
20
analogous to equation 37, the most complicated thing we can make is something like this:
√
√
r1 :=
.5 + .5 γ1 γ2
√
√
r2 :=
.5 + .5 γ3 γ2
(38)
r := r1 r2
= .5 + .5 γ1 γ2 + .5 γ3 γ2
= .5 + .5 (γ1 + γ3 ) γ2
where we have just replaced every occurrence of γ4 with γ2 and simplified the results using
the axioms of Clifford algebra.
Equation 38 differs from equation 37 in two ways:
In two or three dimensions, there cannot In four or more dimensions, it is perfectly
be any grade=4 term. If you try to con- OK to have grade=4 terms.
struct a 4-blade of the form a ∧ b ∧ c ∧ d,
the fourth factor (d) is necessarily linearly
dependent on the previous factors, so the
product necessarily vanishes.
In two or three dimensions, any object that In four or more dimensions, it is possible to
is homogeneous of grade 2 is necessarily have grade-2 objects that are not 2-blades,
a 2-blade, i.e. the wedge product of two but rather the sum of 2-blades.
vectors.
If you are good at visualizing things in four dimensions, figure 6 shows how to visualize a
non-blade. On the left, just for reference, is a four-dimensional hypercube. On the right, we
see two blades: one yellow, one green. These blades cannot be added edge-to-edge to form a
single blade representing their sum, as we would do in three dimensions. We can’t do that,
because the two blades have no edge in common. Indeed, no vector in the yellow plane is
parallel to any vector in the green plane, and vice versa. Therefore the sum of these blades
is homogeneous, but is not a blade.
Figure 6: Sum of Blades is Not a Blade
There is a nice formalism for handling rotors in general (including both simple and nonsimple rotors), as we now discuss: Start with all the elements in our Clifford algebra, and
form a subset by throwing away any elements that contain any odd-grade terms, keeping
only the even-grade blades and sums of even-grade blades. If you take any two elements from
this subset and multiply them, you get another element of this subset. Therefore we say this
7 REPRESENTATION AND COMPUTATION
21
subset is closed under multiplication. It is also true that this subset is closed under addition
and subtraction. Therefore we conclude that this is not just a subset, it is a full-blown
subalgebra, namely the even-grade subalgebra of our original Clifford algebra.
In all generality, we define a rotor to be an element of the even-grade subalgebra having gorm
equal to 1. We define a simple rotor to be one containing no terms higher than grade=2.
Some useful facts include:
• Any rotor can be represented as the product of rotors. It will either be the product of
two timelike rotors, or the product of two spacelike rotors (not one of each).
• The product of any two rotors is a rotor. We say the set of rotors is closed under
multiplication. (It is not closed under addition or subtraction.)
• In two or three dimensions, all rotors are simple. (This includes D = 1 + 1 spacetime
and D = 1 + 2 spacetime, as well as D = 2 flatland and ordinary D = 3 space.)
• In four or more dimensions, the product of simple rotors is not necessarily a simple
rotor.
• In four or five dimensions, an arbitrary rotor can be written as the product of two
simple rotors.
• In physics, i.e. in the case of D = 1 + 3 spacetime, any arbitrary combination of
rotations (including boosts) can be expressed as a simple spacelike rotation followed
by a boost.
When looking at equation 37, you may wonder how much trouble is caused by the fact that
the grade=2 terms don’t form a blade. The answer is, surprisingly little trouble. It turns
out that in equation 37, the overall rotor can be written as r = r1 r2 . That is, r is not a
simple rotor, but it can be visualized as the product of two simple rotors.
Note: in equation 37 the two rotors commute, i.e. r1 r2 = r2 r1 . In four dimensions, an arbitrary rotor can always be represented as the product of two simple rotors that commute (hint:
Gram-Schmidt orthogonalization), but this is not always the most natural representation;
you are free to use two rotors that don’t commute, if you find that more convenient.
Also, if you are computing things in terms of components relative to some basis, as discussed
in section 7.2, the same set of basis bivectors that is used to represent an arbitrary 2-blade
is also sufficient to represent an arbitrary sum of 2-blades, so the existence of non-blades
causes no extra work at all.
7
7.1
Representation and Computation
Double Coverage
You should not imagine that there is a one-to-one relationship between rotors and rotations.
Actually it is a two-to-one relationship. Any given rotation can be represented by two
7 REPRESENTATION AND COMPUTATION
22
inequivalent rotors (r and −r). If you rotate something by 2π radians in any plane, you get
back the same attitude, but the rotor picks up a minus sign. You need to rotate 4π radians
to get back the original rotor.
This has practical significance if you have a computer program that needs to check whether
a given attitude (represented by rotor r1 ) is close to the desired attitude (represented by
rotor r2 ). It does not to suffice to see whether r1 is close to r2 in a component-by-component
numerical sense; you have to check r1 against both r2 and −r2 .
Before you decide that this is a defect in the Clifford algebra approach, note that there are
situations in this world where a 4π rotation is equivalent to no rotation, but a 2π rotation is
not. One example is the rotation of the wavefunction of a fermion. Examples can be found in
the classical, macroscopic world: The Dirac string trick and the Philippine wine-glass trick.
Details are beyond the scope of the present discussion.
The product-of-vectors representation may be even cleverer, even more profound than you
initially thought.
7.2
Basis
In theoretical physics, almost anything worth saying can be said without reference to a
particular basis.
However, when it comes time to do numerical calculations, it is often most practical to
express vectors, bivectors, et cetera in terms of some chosen basis.
In a space where γ1 , γ2 , γ3 are the basis vectors, the natural basis for the bivectors is
ux := γ2 γ3 (the YZ plane)
uy := γ3 γ1 (the ZX plane)
(39)
uz := γ1 γ2 (the XY plane)
That is, any plane can be represented as a linear combination of these three basis planes.
(For more on this and its relationship to quaternions, see section 10.)
Using this basis for the bivectors we can represent any rotor in three dimensions by four
numbers [x, y, z; w] where w is the scalar piece and x, y, and z are the coefficients that
describe the bivector piece. Specifically,
r = w + x ux + y uy + z uz
(40)
This expansion will be be put to good use in section 8.
Also: It is almost but not quite possible to represent a rotor in three dimensions using only
three numbers, not four, because it is almost possible to infer the scalar piece using the
normalization condition (equation 10). Even if you could infer the scalar piece, it would be
7 REPRESENTATION AND COMPUTATION
23
more efficient to carry it around explicitly anyway, rather than recomputing it every time it
was needed.
If you ever want to convert from the rotor representation to the matrix representation,
here’s the procedure. Given a rotor of the form of the form w + x γ2 γ3 + y γ3 γ1 + z γ1 γ2 , the
corresponding rotation matrix is

ww + xx − yy − zz

 2 (w z + x y)
−2 (w y − x z)

−2 (w z − x y)
ww + yy − zz − xx
2 (w x + y z)
2 (w y + x z)

−2 (w x − y z)

ww + zz − yy − xx
(41)
The Perl program mentioned in section 12 implements this matrix and uses it to convert
rotors to matrices.
If you are wondering where equation 41 comes from, just apply the most-general rotor to
the most-general vector, as follows:
(w − x γ2 γ3 − y γ3 γ1 − z γ1 γ2 )[a γ1 + b γ2 + c γ3 ](w + x γ2 γ3 + y γ3 γ1 + z γ1 γ2 )
(42)
Then just collect terms, as follows: How does the γ1 term depend on b? Those terms go in
the middle of the top row of the matrix ... and similarly for all the other terms.
7.3
Dimensions; Number of Components
We use the word clif to denote an arbitrary element of the Clifford algebra. A clif could
be a vector, scalar, bivector, etc. – or a sum thereof. For more about the terminology, see
reference 4.
The number of components required to describe a clif depends on the number of dimensions
involved, i.e. the number of basis vectors in some chosen basis set. The first few cases are
shown in this table, borrowed from reference 4:
1s
1s
1s
1s
1v
2v
3v
4v
1b
3b
6b
1t
4t
1q
D
D
D
D
=1
=2
=3
=4
(43)
where s means scalar, v means vector, b means bivector, t means trivector, and q means
quadvector. You can see that it takes the form of Pascal’s triangle. On each row, the total
number of components is 2D .
The even-grade components are shown in boldface. On each row, the number of even-grade
components is 2(D−1)
7 REPRESENTATION AND COMPUTATION
24
For the purpose of representing rotors, you can ignore the odd-grade components in equation
43, but it is nice to have them there, because they help explain the number of even-grade
components.
To make things really explicit, the number of components ordinarily required is as follows:
• In two dimensions, a rotor has two components, namely one scalar component and one
bivector component.
• In three dimensions, a rotor has four components, namely one scalar component and
three bivector components. Any grade=2 contribution can be represented as a linear
combination of the three basis bivectors.
• In four dimensions, a rotor has eight components, namely one scalar component, six
bivector components, and one quadvector component. Any grade=2 contribution, be
it a blade or a sum of blades, can be represented as a linear combination of the six
basis bivectors.
So we see that representing a rotor in four dimensions requires twice as many components as
in three dimensions, which in turn requires twice as many components as in two dimensions.
7.4
Computational Load
In three-dimensional space, when you calculate the geometric product of two vectors, there
will be four numbers you need to keep track of. This makes it significantly more compact
than the rotation-matrix representation, which requires nine numbers in D = 3.
In spacetime, a rotor can be specified using 8 numbers, which is significantly less than the
matrix representation, which requires 16 numbers.
In D dimensions, a rotor is ordinarily represented using 2(D−1) numbers, while a matrix
requires D2 numbers. The situation is summarized in the following table:
Dimensionality # of components
rotor
matrix
3
4
9
4
8
16
D
2(D−1)
D2
We now move from the question of storage space to the question of computational effort
required to calculate a compound rotation. If we use the rotor representation, all we need to
do is multiply rotors, as suggested by equation 16. If we use the matrix representation, all
we need to do is multiply matrices. The level of computational effort required is summarized
in the following table:
8 EXAMPLE: COMBINING ROTATIONS IN VRML
25
Dimensionality m ultiplications required
rotor
matrix
3
16
27
64
64
4
(D−1)
D
4
D3
From this we can see that the rotor representation is computationally advantageous in D = 3.
The advantage vanishes in D = 4, and turns into a disadvantage in very high-dimensional
spaces.
Things get worse (but only slightly worse) when we ask how much computational effort is
required to apply a given rotation to a vector. In the matrix representation, that involves
just one matrix-vector product, while in the rotor representation, we need to perform two
products, because there is a rotor on the left and on the right of the vector.
The situation is summarized in the following table:
Dimensionality
3
4
D
m
ultiplications required
rotor
matrix
28
9
96
16
4(D−1) + O(D ∗ 2(D−1) )
D2
This is unflattering to the rotor representation ... but we should keep things in perspective,
as we now discuss:
To summarize the overall situation:
• If you have a whole bunch of rotations (i.e. rotation operators) and you want to keep
track of them as objects unto themselves, you should use the rotor representation. You
can store rotors efficiently, and you can compute with them efficiently using equation
16 (in D = 4 or less). You can also easily convert the rotor representation to other
representations.
• On the other hand, if you have one particular rotation and you want to apply it to a
whole bunch of vectors, equation 5 is not very efficient. But don’t panic. Just convert
the rotor to a matrix using equation 41 (which is very efficient), and then apply the
matrix to all your vectors (which is also very efficient).
8
Example: Combining Rotations in VRML
Here is a very practical example. In VRML (virtual reality modeling language) a rotation is
represented by specifying the axis of rotation and the amount of rotation. Specifically, the
8 EXAMPLE: COMBINING ROTATIONS IN VRML
26
amount of rotation is specified in radians, and the axis must be specified as a unit vector.
(If you inadvertently use a non-unit vector, weird things will happen.) For example, a 90
degree rotation around the X axis is represented by (1 0 0 1.5708).
It is easy to convert back and forth between this representation and the geometric-algebra
representation. If the VRML representation is (X, Y, Z, θ), the scalar piece of the rotor is
cos(θ/2) and the components of the bivector piece are [X sin(θ/2), Y sin(θ/2), Z sin(θ/2)].
If you want to calculate a compound rotation, the easiest method is to convert to the rotor
representation, multiply the rotors, and then convert back to the VRML representation.
This approach has several advantages, including:
• It is straightforward to combine two rotors by multiplying them according to the axioms
of geometric algebra. This is in contrast to the VRML representation, where it isn’t
at all obvious how to combine things.
• It is straightforward to convert the geometric algebra representation back to the VRML
representation, since the components of the bivector tell you the axis of rotation. This
is in contrast to the rotation-matrix representation, where although multiplication is
easy enough, converting back to the VRML representation would be tricky.
• The process is bulletproof, by which I mean there is no danger of “gimbal lock” such as
might plague you in the Euler-angle representation, due to singularities at the poles.
The perl program mentioned in Reference 8 knows how to perform this calculation. The principle of operation of the program is as follows: Let the first VRML rotation be (X1 , Y1 , Z1 , θ1 ).
Then the corresponding rotor is
r1
:= c1 + x1 ux + y1 uy + z1 uz
where
c1
:= cos(θ1 /2)
(44)
x1
:= X1 sin(θ1 /2)
y1
:= Y1 sin(θ1 /2)
z1
:= Z1 sin(θ1 /2)
where ux etc. are defined by equation 39.
We define the second rotor r2 in the corresponding way, i.e. just change “1” to “2” everywhere
in equation 44.
To calculate the compound rotation, we just multiply the rotors. Each rotor is represented by
four numbers, so (before simplification) there will be sixteen terms in the product, namely:
r1 r2 = c1 c2
+ c1 [x1 ux + y1 uy + z1 uz ]
+ c2 [x2 ux + y2 uy + z2 uz ]
(45)
−x1 x2 −x1 y2 uz +x1 z2 uy
+y1 x2 uz −y1 y2 −y1 z2 ux
−z1 x2 uy +z1 y2 ux −z1 z2
9 ROTATIONS IN TERMS OF REFLECTIONS
27
and then it’s just a matter of simplifying by collecting like terms. The result is a rotor,
represented by four numbers in the usual way.
Here is an amusing tangential thought: The program takes the arccosine at one
point. I have learned through bitter experience to be careful with arccosines.
The problem is that when the input routine takes the cosine of θ, it is insensitive
to the sign of θ. That is, cos(θ) looks a whole lot like cos(−θ). Then when the
output routine takes the arccosine, you might or might not get back the original
θ, depending on whether or not it was in the top half or the bottom half of the
unit circle. The only reason this is not a problem for the code in reference 8 is
that the input routine also calculates sin(θ) and factors it into the bivector piece
of the rotor. So if your rotation angle is in the bottom half of the unit circle, it
will get flipped to the top half, but this is OK because the axis of rotation will
get flipped end-for-end.
9
Rotations in Terms of Reflections
It is not super-important to the current discussion, but there is a deep connection between
rotations and reflections. If you want details, see reference 7, but we include a brief overview
here.
In general, if you apply the same reflection operator twice, you get back where you started.
Reflecting in one mirror then reflecting again in a different mirror undoes most of the effects
of the reflection – in particular it undoes the inversion – but it produces a rotation. The
amount of rotation is twice the angle between the two mirrors.
This means that given two vectors, we can use them to represent a rotation as follows: First,
reflect everything in the mirror perpendicular to the first vector, then reflect everything
again in the mirror perpendicular to the second vector. This is entirely equivalent to the
procedure described in section 4; it is just another way of looking at things.
In D = 2, the mirror is the D − 1 = 1 dimensional line perpendicular to the given vector.
In D = 3, the mirror is the D − 1 = 2 dimensional plane perpendicular to the given vector.
In D = 4, the mirror is the D − 1 = 3 dimensional hyperplane perpendicular to the given
vector.
This interpretation in terms of reflections makes it pretty obvious that this representation
is Lorentz-invariant.
Remember that the rotor angle is half the rotation angle. This can be a source of confusion
if you’re not careful.
10 QUATERNIONS AND PAULI MATRICES IN TERMS OF CLIFFORD ALGEBRA28
10
Quaternions and Pauli Matrices in terms of Clifford
Algebra
There is a one-to-one correspondence between quaternions and a subalgebra of Clifford algebra, namely the subalgebra containing only scalars and bivectors.
The basis bivectors in equation 39 are identical to the I, J, K basis quaternions, except each
is missing a minus sign. Specifically, we define the quaternions I, J, and K according to:
I := −ux ≡γ3 γ2 (the YZ plane)
J := −uy ≡γ2 γ1 (the ZX plane)
(46)
K := −uz ≡γ1 γ3 (the XY plane)
The fourth basis quaternion is the plain old scalar 1.
By direct application of the Clifford Algebra axioms (equation 1 and equation 2), you can
verify Hamilton’s celebrated identities I 2 = J 2 = K 2 = IJK = −1 (reference 1).
The advantage of the Clifford Algebra approach is that you don’t need to spend any effort
learning the algebra of quaternions. Once you know the axioms of Clifford Algebra, you get
quaternions (and a lot of other things) for free.
(The quaternions we have called I, J, and K are more conventionally written as lower-case
i, j, and k, but in this document we capitalize them, for reasons that will become obvious
in a moment.)
Another set of objects that serve as generators of rotations are the Pauli spin matrices,
namely:
"
#
01
σx :=
10 #
"
0−i
σy :=
(47)
i0
"
#
10
σz :=
0−1
These behave like the I, J, K quaternions, except each is missing a factor of i, where i :=
√
−1. Specifically, you can verify that if we redefine I := iσx , J := iσy , and J := iσz then
once again we can write Hamilton’s identities, namely I 2 = J 2 = K 2 = IJK = −1.
The three Pauli matrices of course go along with
" a #fourth matrix, the unit matrix:
10
1 =
01
(48)
11 SURVEY OF WAYS TO REPRESENT ROTATIONS
11
29
Survey of Ways to Represent Rotations
Note: In this section, all rotations live in D = 3 space, unless otherwise specified.
Let’s imagine you are playing charades, and you want to portray a rotation, a very specific
rotation. There are various approaches you could take:
1. You could take some object, such as a book, and show it in the “before” and “after”
states (before rotation and after rotation). It is best to choose an object of no particular symmetry, since rotating a highly-symmetric object such as a sphere is not very
interesting.
You can make this seem more scientific by choosing the “object” to be a triad of
linearly-independent vectors. You need to label the vectors, so you can keep track of
which is which. Then the length of the vectors doesn’t matter, so we can WLoG3 take
them to be unit vectors. As before, you need to depict the triad twice, once before and
once after rotation.
The formal mathematical version of this approach consists of writing down the rotation
matrix. The left column of the matrix shows where the X unit vector winds up after
rotation; the middle column shows where the Y unit vector winds up, and the right
column shows where the Z unit vector winds up.
Using a matrix to rotate a vector is computationally efficient, as discussed in section
7.4.
The downside is that it is inconvenient to convert from the matrix representation to
other representations.
2. In some circles it is traditional to represent rotations in terms of the Euler angles: yaw,
pitch, and roll. But that does not mean that you can just depict the three angles and
quit there, because the Euler angles are only defined with respect to a particular basis.
So you need to depict the basis as well as the three Euler angles.
If you are doing a lot of calculations, you can keep the basis constant, so the three Euler
angles are the only variables. This means you only need to carry around three variables,
which would seem to be an improvement over the rotation-matrix representation, which
requires carrying around nine variables.
Euler angles are semi-reasonable for some applications, especially if the pitch angle
and the bank angle4 always remain small, as they do in ordinary non-aerobatic flying.
But there are many drawbacks. For one thing, there are nasty singularities, such as
the following: suppose you pitch up 89 degrees. Your heading5 is unchanged, and your
3
Without Loss of Generality
Bank angle is synonymous with roll angle. The verb “roll” refers to a change in the bank angle.
5
Heading is synonymous with yaw angle. The verb “yaw” refers to a change in heading.
4
11 SURVEY OF WAYS TO REPRESENT ROTATIONS
30
bank angle is unchanged. So far so good ... but now continue the pitch-wise motion
another two degrees. Your heading is now reversed (180 degrees from where you were
a moment ago) and your bank angle is upside down (also 180 degrees from where it
was a moment ago).
Any scheme for representing rotations (in D = 3 space) using only three variables will
have singularities. There’s no way around it.
Even if you stay away from the singularities, if you want to describe the results of two
consecutive rotations, the mathematics of Euler angles is not very pretty.
Because the Euler angles depend on a particular choice of basis, they represent rotations
in a way that is not rotation-invariant ... which is pathetic. Of course they have no
chance of being relativistically invariant.
3. Especially in D = 3, you may be accustomed to thinking of every rotation as a rotation
about some axis. So all you need to do is specify the direction of the axis, and the
amount of rotation.
This can be formalized in terms of the so-called Rodrigues vector. The direction of the
Rodrigues vector indicates the axis of rotation, and the length represents the amount
of rotation.
This representation is not as elegant or as useful as one might have hoped. In particular, if you compound two rotations, the result is not represented by the sum of the
Rodrigues vectors (nor the product, nor any other simple vector operation).
The Rodrigues vector is not relativistically invariant.
Also, this approach is restricted to D = 3 space only. In D = 2 flatland, it is not
necessary – nor even possible – to specify the direction of rotation as a vector. In
D = 4 or higher, including D = 1 + 3 spacetime, it is again impossible to represent the
direction of rotation as a vector. In D = 4, it takes 6 numbers to specify the direction
of rotation, but a 4-vector has only 4 components. The way out of this difficulty can
be found in the following item.
4. Rather than depicting the axis of rotation, you can depict the plane of rotation. This
has tremendous advantages. For starters, it works equally well in any nontrivial space,
including D = 2 flatland, D = 3 space, and D = 1 + 3 spacetime.
This can be formalized as the product of two vectors in the plane, as discussed in
section 2.
Note: For all the representations discussed here, we have represented only the amount of
rotation and the orientation of the plane of rotation; we have not attempted to represent the
location of the center of the rotation.
12 CLIFFORD ALGEBRA DESK CALCULATOR
31
However, there is a theorem that says that a rotation about one center can be decomposed
into a rotation around another center, plus a pure translation. We assume everybody understands how to represent translations. So for simplicity, we consider only rotations around
the origin.
12
Clifford Algebra Desk Calculator
I wrote a “Clifford algebra desk calculator” program. It knows how to do addition, subtraction, dot product, wedge product, full geometric product, reverse, hodge dual, and so forth.
Most of the features work in arbitrarily many dimensions.
Here is the program’s help message. See also reference 8.
Desk calculator for Clifford algebra in arbitrarily many
Euclidean dimensions. (No Minkowski space yet; sorry.)
Usage:
./cliffer [options]
Command-line options include
-h
print this message (and exit immediately).
-v
increase verbosity.
-i fn
take input from file ’fn’.
-pre fn take preliminary input from file ’fn’.
-take input from STDIN
If no input files are specified with -i or --, the default is an
implicit ’--’. Note that -i and -pre can be used multiple times.
All -pre files are processed before any -i files.
Advanced usage: If you want to make an input file into a
self-executing script, you can use "#! /path/to/cliffer -i" as
the first line. Similarly, if you want to do some
initialization and then read from standard input, you can use
"#! /path/to/cliffer -pre" as the first line.
Ordinary usage example:
# compound rotation: two 90 degree rotations
# makes a 120 degree rotation about the 1,1,1 diagonal:
echo -e "1 0 0 90 vrml 0 0 1 90 vrml mul @v" | cliffer
Result:
0.57735 0.57735 0.57735 2.09440 = 120.0000
12 CLIFFORD ALGEBRA DESK CALCULATOR
32
Explanation:
*) Push a rotation operator onto the stack, by
giving four numbers in VRML format
X Y Z theta
followed by the "vrml" keyword.
*) Push another rotation operator onto the stack,
in the same way.
*) Multiply them together using the "mul" keyword.
*) Pop the result and print it in VRML format using
the "@v" keyword
On input, we expect all angles to be in radians. You can convert
from degrees to radians using the "deg" operator, which can be
abbreviated to "" (the degree symbol). Hint: Alt-0 on some
keyboards.
As a special case, on input, a number with suffix "d" (with no
spaces between the number and the "d") is converted from degrees to
radians.
echo "90 sin @" | cliffer
echo "90 sin @" | cliffer
echo "90d sin @" | cliffer
are each equivalent to
echo "pi 2 div sin @" | cliffer
Input words can be spread across as many lines (or as few) as you
wish. If input is from an interactive terminal, any error causes
the rest of the current line to be thrown away, but the program does
not exit. In the non-interactive case, any error causes the program
to exit.
On input, a comma or tab is equivalent to a space.
are equivalent to a single space.
Multiple spaces
Note on VRML format: X Y Z theta
[X Y Z] is a vector specifying the axis of rotation,
and theta specifies the amount of rotation around that axis.
VRML requires [X Y Z] to be normalized as a unit vector,
but we are more tolerant; we will normalize it for you.
VRML requires theta to be measured in radians.
Also note that on input, the VRML operator accepts either four
12 CLIFFORD ALGEBRA DESK CALCULATOR
numbers, or one 3-component vector plus one scalar, as in the
following example.
Same as previous example, with more output:
echo -e "[1 0 0] 90 vrml dup @v dup @m
[0 0 1] -90 vrml rev mul dup @v @m" | cliffer
Result:
1.00000 0.00000 0.00000 1.57080 = 90.0000
[ 1.00000 0.00000 0.00000 ]
[ 0.00000 0.00000 -1.00000 ]
[ 0.00000 1.00000 0.00000 ]
0.57735 0.57735 0.57735 2.09440 = 120.0000
[ 0.00000 0.00000 1.00000 ]
[ 1.00000 0.00000 0.00000 ]
[ 0.00000 1.00000 0.00000 ]
Even fancier: Multiply two vectors to create a bivector,
then use that to crank a vector:
echo -e "[ 1 0 0 ] [ 1 1 0 ] mul normalize [ 0 1 0 ] crank @" \
| ./cliffer
Result:
[-1, 0, 0]
Another example: Calculate the angle between two vectors:
echo -e "[ -1 0 0 ] [ 1 1 0 ] mul normalize rangle @a" | ./cliffer
Result:
2.35619 = 135.0000
Example: Powers: Exponentiate a quaternion. Find rotor that rotates
only half as much:
echo -e "[ 1 0 0 ] [ 0 1 0 ] mul 2 mul dup rangle @a " \
" .5 pow dup rangle @a @" | ./cliffer
Result:
1.57080 = 90.0000
0.78540 = 45.0000
1 + [0, 0, 1]
Example: Take the 4th root using pow, then take the fourth
power using direct multiplication of quaternions:
echo "[ 1 0 0 ] [ 0 1 0 ] mul dup @v
.25 pow dup @v dup mul dup mul @v" | ./cliffer
Result
33
12 CLIFFORD ALGEBRA DESK CALCULATOR
0.00000 0.00000 1.00000 3.14159 = 180.0000
0.00000 0.00000 1.00000 0.78540 = 45.0000
0.00000 0.00000 1.00000 3.14159 = 180.0000
More systematic testing:
./cliffer.test1
The following operators have been implemented:
help
help message
listops list all operators
=== Unary operators
pop
remove top item from stack
neg
negate: multiply by -1
deg
convert number from radians to degrees
dup
duplicate top item on stack
gorm
gorm i.e. scalar part of V~ V
norm
norm i.e. sqrt(gorm}
normalize divide top item by its norm
rev
clifford ’~’ operator, reverse basis vectors
hodge
hodge dual aka unary ’’ operator; alt-’ on some keyboards
gradesel given C and s, find the grade-s part of C
rangle calculate rotor angle
=== Binary operators
exch
exchange top two items on stack
codot
multiply corresponding components, then sum
add
add top two items on stack
sub
sub top two items on stack
mul
multiply top two items on stack (in subspace if possible)
cmul
promote A and B to clifs, then multiply them
div
divide clif A by scalar B
dot
promote A and B to clifs, then take dot product
wedge
promote A and B to clifs, then take wedge product
cross
the hodge of the wedge (familiar as cross product in 3D)
crank
calculate R~ V R
pow
calculate Nth power of scalar or quat
sqrt
calculate square root of power of scalar or quat
=== Constructors
[
mark the beginning of a vector
]
construct vector by popping to mark
unpack unpack a vector, quat, or clif; push its contents (normal order)
dimset project object onto N-dimensional Clifford algebra
unbave top unit basis vector in N dimensions
34
13 REFERENCES
35
ups
unit pseudo-scalar in N dimensions
pi
push pi onto the stack
vrml
construct a quaternion from VRML representation x,y,z,theta
clif
take a vector in D=2**n, construct a clif in D=n
Note: You can do the opposite via ’[ exch unpack ]’
=== Printout operators
setbasis set basis mode, 0=abcdef 1=xyzabc
dump
show everything on stack, leave it unchanged
@
compactly show item of any type, D=3 (then remove it)
@m
show quaternion, formatted as a rotation matrix (then remove it)
@v
show quaternion, formatted in VRML style (then remove it)
@a
show angle, formatted in radian and degrees (then remove it)
@x
print clif of any grade, row by row
=== Math library functions:
sin
cos
tan
sec
csc
cot
sinh
cosh
tanh
asin
acos
atan
asinh acosh atanh ln
log2
log10
exp
atan2
13
References
1. W. R. Hamilton “On a new Species of Imaginary Quantities connected with a theory
of Quaternions”
http://www.maths.tcd.ie/pub/HistMath/People/Hamilton/Quatern1/Quatern1.html
Proceedings of the Royal Irish Academy, vol. 2, 424-434 (Nov. 13, 1843).
2. H. Grassmann, “Die Lineale Ausdehnungslehre” (1844).
3. W. K. Clifford “Application of Grassmann’s Extensive Algebra” American Journal of
Mathematics 350-358 (1878).
4. John Denker “Introduction to Clifford Algebra”
www.av8n.com/physics/clifford-intro.htm
5. Stephen Gull, Anthony Lasenby, and Chris Doran, “The Geometric Algebra of
Spacetime”
http://www.mrao.cam.ac.uk/˜clifford/introduction/intro/intro.html
6. Richard E. Harke, “An Introduction to the Mathematics of the Space-Time Algebra”
http://www.harke.org/ps/intro.ps.gz
7. David Hestenes,
“Oersted Medal Lecture 2002: Reforming the Mathematical Language of Physics”
13 REFERENCES
36
Abstract: http://geocalc.clas.asu.edu/html/Overview.html Full paper:
http://geocalc.clas.asu.edu/pdf/OerstedMedalLecture.pdf
8. John Denker “cliffer” (program that inputs rotations in VRML format and combines
them, printing out the resulting overall rotation in VRML format)
./cat.cgi/cliffer.pl and ./cat.cgi/clifford.pm
9. Feynman, Leighton, and Sands The Feynman Lectures on Physics volume I
chapter 22 (“Algebral”).
10. John Denker “The Geometry and Trigonometry of Spacetime”
www.av8n.com/physics/spacetime-trig.htm
11. John Denker, “Rapidities, Boosts, Rotations, and the Structure of Spacetime”
www.av8n.com/physics/rapidity.htm