Abstract Algebra and Discrete Mathematics, Cyclotomic Extensions

Angle Addition Formula
Double Angle Formula
Half Angle Formula
Demoivre's Formula
Cyclotomic Extensions
Primitive n^th Root
Automorphisms
Splitting xⁿ-1
ζ Polynomials
ζ Irreducible over Z
ζ Irreducible over Z[i]
Conjugates and Norm
Ratio Units
1-y, lying over n

Let u and v be angles such that 0 ≤ u ≤ v ≤ 90°. Let w = v-u. Let p be the point on the unit circle that marks the angle u, and let q mark the angle v. Draw segments from the origin to p and q, defining a slice of pie with angle w. Then draw a line segment from p to q, the chord that cuts the crust off the slice of pie.

Let the chord pq be the hypotenuse of a right triangle that is aligned with the axes. In other words, the sides of the triangle run parallel to the x and y axes. Thus the corner of the triangle, t, has the x coordinate of q and the y coordinate of p, also known as cos(v),sin(u). The lengths of the sides of the right triangle are cos(u)-cos(v) and sin(v)-sin(u). Compute the square of the length of the hypotenuse using the pythagorean theorem. After some algebra and trig simplification you should get:

2 - 2cos(u)cos(v) - 2sin(u)sin(v)

[ radii from origin to p and q, angles u and v, with angle w between the two radii, chord from p to q, point t at the corner of the right triangle tpq ]

Now build another right triangle. Draw segment from q perpendicular to the radius from 0 to p. Let these perpendicular segments meet at the point s. Now spq forms another right triangle, with pq as hypotenuse. The altitude of this triangle has length sin(w), while the base is 1-cos(w). Apply the pythagorean theorem again to get the square of the hypotenuse.

2 - 2cos(w)

Set this equal to the earlier expression to obtain the angle subtraction formula:

cos(v-u) = cos(u)cos(v) + sin(u)sin(v)

[ radii from origin to p and q, angles u and v, with angle w between the two radii, chord from p to q, point s along the first radius and at the corner of the right triangle spq ]

This is great, but u and v are rather constrained. Let v stray past 90°. Its cosine becomes negative, but cos(u)-cos(v) is still correct for the length of the base of the first triangle, the distance from t to p.

As v-u exceeds 90°, s slides back along its radius and through the origin, and winds up behind the origin. The cosine of w goes negative, yet 1-cos(w) is still the length of the base of the second right triangle, the distance from s to p.

Eventually q is lower than p. The first right triangle points down, rather than up, and t is actually outside the circle. Now sin(v)-sin(u) is the opposite of the length of the altitude, but the length is squared in the pythagorean formula, so this doesn't matter.

When v passes 180°, its sine is negative, yet sin(v)-sin(u) still gives the length of the altitude of the first triangle, at least in absolute value. Our formula holds for any u between 0° and 90°, and any v between u and u+180°.

When v goes beyond u+180°, reflect the picture through the line x = y. This reproduces the earlier case, where our formula holds. The reflection swaps sine and cosine for u and v, which changes the right side of the equation not at all. It also replaces w with 360°-w, which changes its cosine not at all. Thus the formula holds for all v between u and u+360°, which is all v.

If u is an angle in the second quadrant, subtract 90° from u and v, leaving w unchanged. Now u is in the first quadrant and the formula works. This rotation moves sine to cosine and cosine to -sine, for both u and v. This changes the formula not at all. Perform a similar rotation when u is in the third or fourth quadrant. Therefore the angle subtraction formula works for all angles u and v.

Replace v with -v to get the angle addition formula:

cos(u+v) = cos(u)cos(v) - sin(u)sin(v)

In the above formula, hold u fixed and let v be a variable. Take the derivative with respect to v. This gives the angle addition for sines.

sin(u+v) = cos(u)sin(v) + sin(u)cos(v)

Replace v with -v to get the angle subtraction formula for sines.

sin(u-v) = sin(u)cos(v) - cos(u)sin(v)

There is a tangent addition formula. It's easiest to start with the answers, given below, and work backwards. Replace each tangent with sine over cosine and simplify. I'll leave the details to you.

tan(u+v) = (tan(u) + tan(v)) / (1 - tan(u)tan(v))

tan(u-v) = (tan(u) - tan(v)) / (1 + tan(u)tan(v))

Set u = v in the angle addition formulas to get the double angle formulas.

sin(2u) = 2sin(u)cos(u)

cos(2u) = cos²(u) - sin²(u)

The latter is sometimes written:

cos(2u) = 2×cos²(u) - 1

cos(2u) = 1 - 2×sin²(u)

There is of course a triple angle formula. Expand sin(2u+u) using the angle addition formula, then expand cos(2u) and sin(2u) using the double angle formulas. Do this again to get the quadruple angle formula, the quintuple angle formula, and so on.

sin(3u) = 3sin(u)cos²(u) - sin³(u)

cos(3u) = cos³(u) - 3cos(u)sin²(u)

sin(4u) = 4sin(u)cos³(u) - 4sin³(u)cos(u)

cos(4u) = cos⁴(u) - 6cos²(u)sin²(u) + sin⁴(u)

If you are familiar with the binomial theorem, you will recognize a pattern. Let c and s represent cosine and sine respectively, and expand (c+s)ⁿ. Take every other term starting with cⁿ; this is the cosine of nu. Well almost; you have to negate every other term in this series. To find the sine of nu, Take every other term in the expansion, starting with nc^n-1s, and again, negate every other term in this series.

Look at our last example, sine and cosine of 4u. Expand (c+s)⁴ and put the minus signs where they belong. The cosine is every other term starting with c⁴, and the sine is every other term starting with 4c³s.

c⁴ + 4c³s - 6c²s² - 4cs³ + s⁴

Prove the formula, in general, by induction on n. Apply the angle addition formula to u + nu. Imagine cosine and sine of nu written together, as above. It looks like the n^th row of Pascal's triangle, except some of the entries have been negated. Now move on to the next level.

The sine is s times the previous formula for cosine plus c times the previous formula for sine. This adds adjacent terms from the previous row, in pairs, giving every other entry in the next row of Pascal's triangle. Similarly, the cosine is c times the previous formula for cosine minus s times the previous formula for sine. This fills in the rest of the row, and all the minus signs are where they belong. By induction, the formula holds for all n.

Of course there is a much simpler proof based on complex exponentiation. Write e^nθi = e^θi to the n, then expand the right side by the binomial theorem. Since e^θi = c+si, Equate real and imaginary terms, and you're done.

The double angle formula asserts:

cos(2θ) = 2cos²(θ) - 1

cos(2θ) = 1 - 2sin²(θ)

Reverse these to derive the half angle formulas.

cos(½θ) = sqrt(½(1+cos(θ)))

sin(½θ) = sqrt(½(1-cos(θ)))

Notice that sine squared + cosine squared is still 1, as required.

Let's try 15°, which is half of 30°, which has a cosine of sqrt(3)/2. After some algebra,

cos(15°) = sqrt(2+sqrt(3))/2 = 0.9659

sin(15°) = sqrt(2-sqrt(3))/2 = 0.2588

In this case we could have derived the sine and cosine via angle subtraction. That is, cos(45°-30°) = sqrt(1/2) × (1/2+sqrt(3)/2). Oddly enough, this different looking formula produces the exact same number.

As an exercise, use the half angle formula to show the tangent of 22.5° is sqrt(2)-1.

When multiplying two complex numbers, z₁×z₂, it is sometimes convenient to convert to polar coordinates. Multiplication is then implemented by adding angles and multiplying radii. This is Demoivre's formula.

[ Two vectors u and v into the first quadrant, v further around and longer than u, then u×v in the second quadrant, longer than u or v, and angle the sum of the other two angles ]

Assume z₁ = a₁+b₁i = r₁,θ₁, and z₂ = a₂+b₂i = r₂,θ₂.

The new radius is r₁r₂, and the new angle is θ₁+θ₂. Convert the product back to rectangular coordinates as follows.

x ← r₁r₂×cos(θ₁+θ₂)

y ← r₁r₂×sin(θ₁+θ₂)

Expand using the angle addition formulas. Replace r₁cos(θ₁) and r₁sin(θ₁) with a₁ and b₁, and similarly for a₂ and b₂, and get x = a₁a₂ - b₁b₂ and y = a₁b₂ + a₂b₁. This is the formula for multiplication of complex numbers in rectangular coordinates. Therefore Demoivre's formula is valid.

This formula provides efficient procedures for complex exponentiation. If z is 3 units from the origin, at 45°, then z⁴ is -81.

Reverse this process to take roots. The 6 sixth roots of 64 all have radius 2, lying on a circle 2 units from the origin. Every angle that is a multiple of 60°, when multiplied by 6, becomes a multiple of 360°, and lands on the positive x axis, which is what we want. Thus the 6 roots are at angles 0° 60° 120° 180° 240° 300°, radius 2. The root at 60° is 1+sqrt(3)i.

The word cyclotomic comes from the Greek. Its literal meaning is, "cut the circle into pieces", and that's what a cyclotomic extension does.

Draw the unit circle in the complex plane, then let p be the n^th root of 1. By Demoivre's formula, the distance from p to the origin, when raised to the n, is 1, hence p lies on the unit circle. Also, the angle of p, when multiplied by n, equals 360°, i.e. back around to the x axis, hence p makes an angle of 360/n. Actually, any multiple of this angle will do. There are n distinct angles, n points on the unit circle, and n roots of 1. These are the powers of p.

The n roots of 1 cut the unit circle into equal arcs and define a regular n-gon. When n = 6, for instance, the 6 sixth roots of 1 lie on the unit circle at 60° intervals, and define an inscribed hexagon.

If n = 3, you still get the sixth roots of 1, because minus the cube root of 1, when cubed, is -1, hence minus the cube root of 1 is a sixth root of 1. We discovered this earlier with the Eisenstein integers. Adjoin the cube root of 1, or the sixth root of 1, the result is the same. If n is odd, you can extend by n or 2n, at your convenience.

[ second picture from chapter 1 Eisenstein Integers ]

Remember that the roots of 1 come from an integer polynomial, something like xⁿ-1, and all you need to define this polynomial is the integers. So if a ring R contains Z, or Z/m, and you want to adjoin y the n^th root of 1 to R, you can adjoin it to the integers first, then bring in the rest of R. In other words, the rest of R doesn't really affect the cyclotomic extension. It is enough to adjoin Y to the integers first and see what happens, then bring in the rest of R.

If m = a×b, where a and b are relatively prime, then Z/m is the direct product of Z/a and Z/b. This is the chinese remainder theorem. An n^th root y in the direct product implies an n^th root in each component, (reduce everything mod a or mod b), and n^th roots in the two components join together to produce an n^th root in Z/m. Therefore we can restrict attention to prime powers.

Subsequent sections will consider Z/p, because that's a field. It is also the homomorphic image of Z/p^k, so we may be able to work our way back to prime powers later, and then back to Z/m.

In the field Q, or Z/p, y satisfies some polynomial g(y) of minimum degree. Take the gcd of g(x) and xⁿ-1, and the result has y as a root, and has to be g, since g is minimal. With g as gcd, xⁿ-1 is a multiple of g. In other words, g(x) is a factor of xⁿ-1.

By Gauss' lemma, xⁿ-1 factors over Q the same way it factors over Z. Thus g has integer coefficients, and still divides into xⁿ-1. We saw an example of this when adjoining the cube roots of 1. Start with x³-1; but 1 is a root, and not very interesting, because 1 is already part of the integers, so divide by x-1 and find x²+x+1. This is g(x), the minimum polynomial for y the cube root of 1. Any smaller polynomial would be linear, and would put the cube roots back into Z, hence x²+x+1 is minimal for the cube roots of 1 over Z.

In fact the same polynomial is minimal for the cube roots of 1 over z/p, or any other ring, assuming the cube roots of 1 are not already there to start with. Z/7 contains all 3 cube roots of 1, so not much to do there, x³-1 = (x-1) * (x-2) * (x-4), but one can meaningfully adjoin x to Z/5 mod x²+x+1, and find a quadratic cyclotomic extension bringing in the other two cube roots of 1.

In the same way, the minimum polynomial for i, in the gaussian integers, is x²+1. This is a factor of x⁴-1, adjoining the fourth roots of 1.

In the complex plane, the n roots of 1 cut the circle into n pieces. The first one counterclockwise from the x axis generates all the others. This is a primitive n^th root of 1, which I have denoted y. All the others are powers of y. That takes care of Z, is there a primitive root over Z/p?

Let n = p×t. The polynomial xⁿ-1 is now x^t-1 raised to the p power. The roots of x^t-1 are adjoined p times over. This doesn't make much sense, so assume p does not divide n.

Adjoin all the roots of xⁿ-1 to get a finite field F. Let b be primitive in F, so that we can take logs base b. Let m be the order of F, minus 1, so that logs wrap around mod m.

Since p does not divide n, use formal derivatives to show all the roots are distinct. There are n such roots; let y have the smallest log base b. The next root, y², has twice the log of y. If this is reduced mod m then it is smaller than y, which is a contradiction. The same holds for y³, y⁴, and so on up to yⁿ, which is 1, with log 0. This makes y a primitive n^th root of 1. The log of y is m/n, and m is a multiple of n. Remember that some of these roots, i.e. some of the powers of y, could lie in the base field Z/p.

Unfortunately the term "primitive root" is overloaded. Let y be the fourth root of 1 over Z/3. Here -1 is 2, and 2 has no square root, so y is outside the base field. The extension is quadratic and has order 9. Yes indeed, y is primitive relative to our cyclotomic extension x⁴-1, since y generates all four fourth roots, but y is not a primitive root of the entire finite field F. Let b = y+2, so that b² = y. Now b has order 8, and is a primitive root of F. There is a primitive root of 1 and a primitive root of F; sorry for the ambiguity.

By convention, a cyclotomic extension adjoins y - not just any old n^th root of 1, but a primitive root of 1. It's true that -1 is a fourth root of 1, but when we adjoin the fourth root of 1, we're really talking about i, the primitive fourth root of 1 that generates all the others. Of course -i would do just as well.

So how many primitive roots are there? The powers of y^j step through n distinct values, iff j and n are coprime. Thus there are φ(n) primitive roots.

An extension contains all the n^th roots of 1 iff it contains a primitive n^th root of 1. Let F be the smallest such extension, i.e. the extension generated by y. If the base field is Z/p, what is the dimension d of such an extension? As shown earlier, n divides m, hence n divides p^d-1. Reduce mod n, and remember that p is coprime to n, so some power of p is 1. This is the order of p within the units mod n, and it becomes the dimension of the extension. F has order p^d, m = p^d-1, n divides m, and y is the element with log m/n. Recall the earlier example, wherein the fourth root of 1 was adjoined to Z/3. 3 has order 2 mod 4, that is, 3 squared is 1 mod 4, hence a quadratic extension is sufficient to bring in all the fourth roots of 1.

If you are adjoining y to L, some other finite field, and l has dimension e, then the new field has dimension lcm(d,e).

There is a reason I put the chapters on cyclotomics and finite fields back to back. Every finite field is a cyclotomic extension. If b is primitive for F, then b is an n^th root of 1, where n is 1 less than the order of F, and b is a primitive root for a cyclotomic extension of Z/p, or L. The order of p mod n tells us how big this extension has to be. These connections will become important in algebraic number theory.

An automorphism on a cyclotomic extension takes y to something else whose powers yield all the n^throots of 1, that is, another primitive root. The converse is not always true. In an extreme example, look at the cube roots of 1 mod 7. These are 1 2 and 4, with 2 and 4 primitive. If you only care about multiplication, then yes, 2 and 4 can change places. Map 3 to 5, and everything follows from there. However, the word automorphism, taken in context, usually applies to the entire structure, whatever that structure is. If multiplication and addition are both well defined, then both operations should be preserved. A field automorphism has to fix 1, and all the integers, thus there is no field automorphism of Z/7, and the cube roots of 1 must stay put.

Furthermore, an automorphism of an "extension", as opposed to the field itself, has to fix the base, whatever that is. An automorphism on L[y] moves y to another primitive root, but fixes L.

If L = Q, the rationals, there is no trouble. If n is odd, no power of y is -1. If n is even, then yes, y^n/2 = -1, but n/2 is not coprime to n, and -1 is not primitive. All the primitive roots lie in the complex plane, and off the x axis. Move y to anything else primitive and it defines the automorphism, and it leaves Q alone. Of course I'm skating past something very important; y and y^j have to satisfy the same irreducible polynomial. Otherwise it's not an isomorphism. Sure they both satisfy xⁿ-1, but that's not irreducible. What if y and y^j belong to different irreducible factors of xⁿ-1? They don't - they belong to the same irreducible factor, and I'll cover that below. Thus each of the primitive roots defines a unique automorphism. Of course mapping y to y gives the trivial automorphism.

[ Unit circle with the 8 roots of one marked around, y in the first quadrant, arrows from y around the circle to y³, y⁵ and y⁷ ]

Compose two automorphisms, y → yⁱ, and y → y^j, and the result is y → y^ij. Exponents are multiplied, and ij is still coprime to n. Composition of automorphisms is commutative, and it looks just like multiplication mod n.

If F is a finite field L[y], and y is cyclotomic primitive, the automorphisms are drawn from the commutative group described above, i.e. y → y^j, but they have to be automorphisms of F as well, and they have to fix the base field L (usually the integers). The automorphisms of F are all frobenius, raising everything to the p, or to some power of p if L lies above the integers. We can't use any old y^j, it has to be y^p, again and again and again, until you get back to y. This is, as we saw before, the order of p mod n. If p^d brings us back to start, then the dimension of the extension is d, and there are d automorphisms running in a cycle.

If the extension is F/L, where L is dimension e, then the dimension of F/L is d/e. At the same time, the F automorphisms that fix L start with p^e, not p. There are only d/e of these, before you return to start. The cyclotomic extension over L has dimension d/e, with d/e automorphisms running in a circle.

Let's illustrate with the 24^th roots of 1 over Z/5. Let F be the finite field of order 25. Every nonzero element of F is now a 24^th root of 1. If y is a primitive element of F, also a primitive root of the cyclotomic, then the valid F automorphisms raise y to the fifth power. Do this twice and you're done. Yes, 7 is coprime to 24, but you can't map y → y⁷ - only y → y⁵, carried along by the frobenius automorphism on F.

Let y be the root of an irreducible polynomial q(x). The frobenius automorphism carries y to another primitive root, and fixes the coefficients on q. The conjugates of y move around in a cycle, just as the automorphisms form a cycle, and all these conjugates are roots of q. The degree of q is the dimension of the extension, is the length of the cycle of automorphisms. All this holds over a base field L; the cycles are just shorter.

Let L be the rationals or the integers mod p, or some higher finite field. Let F be L(y), adjoining the n^th root of 1. Remember that L(y)[x] is a ufd.

Every linear binomial, such as x-c, is both prime and irreducible. In particular, x-y is one of the prime factors of xⁿ-1. The same holds for x-y², and every x-y^j. Thus xⁿ-1 splits over L(y). Since the powers of y are distinct,the linear factors are distinct, and this is a complete prime factorization of xⁿ-1 over L(y).

xⁿ-1 = (x-y) * (x-y²) * (x-y³) * … * (x-1)

Of course some of these factors clump together to become irreducible polynomials over L. For instance, x⁴-1 splits in the gaussian integers, x-1 times x+1 times x-i times x+i, but over the rationals you get x-1 times x+1 times x²+1. One of the irreducibles over L has root y and defines the extension.

Multiply all the factors x-y^j, to get xⁿ-1, and note that the coefficient on x^n-1 is 0. Therefore the sum of all n^th roots of 1 is 0. The coefficient on x^n-2 is also 0, hence the sum of the pairwise products of the n^th roots of 1 is also 0. This continues all the way down the line, until the product of the roots of 1 yields -1. Verify this for the cube roots of 1 in the complex plane. They sum to 0, their pairwise products sum to 0, and their product is -1.

[ Unit circle with the 3 cube roots of 1 marked ]

Consider the product of the factors s-ty^j, where s and t are arbitrary symbols, and j runs from 1 to n. As shown above, all the coefficients, other than the first and last, drop to 0. Therefore the product is sⁿ-tⁿ.

If n is odd, replace t with -t. Thus the product over s+ty^j yields sⁿ+tⁿ.

As you recall, xⁿ-1 brings in all the n^th roots of 1. Is this polynomial irreducible? If not, what is its factorization?

If d divides n, then xⁿ-1 is divisible by x^d-1. We can always set d = 1, even when n is prime, hence xⁿ-1 is never irreducible. You can adjoin y to create a field extension, but the irreducible polynomial associated with y is a proper factor of xⁿ-1. Let's try to find this irreducible polynomial.

Let ζ_n(x) be the product of x-y^j, for all primitive n^th roots of 1. Recall that y^j is a primitive n^th root iff j is relatively prime to n. Thus there are φ(n) terms in the product, and ζ(x) has degree φ(n).

Don't confuse this function with the Riemann zeta function; they are unrelated.

Here is a recursive procedure to build ζ_n.

Let ζ₁ = x-1. Then let ζ_n equal xⁿ-1 divided by ζ_d for every d < n that divides n.

Instead of a formal proof, I'll illustrate with n = 12. The polynomial xⁿ-1 includes all the roots of 1, primitive and nonprimitive alike. Each root has an order, how long to get to 1. The order is always a factor of 12. Divide by x-1, ζ₁, when d = 1, and take away the root 1, having order 1. Divide by x+1 and remove the root -1, whose order is precisely 2. Divide by ζ₃, x²+x+1, to take away the two roots with order 3. Do the same for the roots of order 4, and 6, and you are left with the four roots having order 12.

This algorithm works over the integers, or Z/p. In fact the former polynomial, with coefficients in Z, can be reduced mod p to produce the latter polynomial. But are the coefficients always integers?

Suppose ζ_n is the first polynomial that does not have integer coefficients. A lesser polynomial ζ_j consists of some of the roots of n, and divides evenly into xⁿ-1. ζ_j has integer coefficients, so by synthetic division, the quotient is well defined in the rationals. By Gauss' lemma, this can be moved back into the integers. Since ζ_j already starts with a lead coefficient of 1, that is, 1 times a power of x, we can't divide this polynomial by c and multiply the other one by c to clear denominators. Therefore there are no denominators to clear. The quotient has integer coefficients, and begins with 1. Therefore ζ_n has integer coefficients and is monic.

Here is another formula for ζ_n that is not recursive, hence it is more efficient. It uses the mobius function, denoted μ(n).

For every d dividing n, raise x^n/d-1 to the μ(d) power. Multiply these together to build ζ_n. Let's see why this works.

First let n be prime, so that d is either 1 or n. This gives xⁿ-1 to the 1 power times x-1 to the -1 power, or (xⁿ-1)/(x-1), which is indeed ζ_n.

Next let n be composite. By induction this formula gives the right answer for all lesser ζ polynomials.

Set d = 1 to get the numerator xⁿ-1. Now we need to divide by ζ_j for every divisor j ≥ 1 and < n.

Focus on a divisor d, and let e = n/d. Which of the lesser ζ polynomials bring in x^e-1? The factor x^e-1 appears in ζ_j when e divides j, properly divides n. Let c = j/e. The exponent on x^e-1, used to build ζ_j, is μ(c). This happens whenever e divides j divides n, but we want to exclude j = n, because we don't want to divide by ζ_n. That's not part of the recursive formula.

Let j run over the multiples of e that are also divisors of n, from e up to n, and take the sum of μ(j/e). This is the sum of μ(c) for all c dividing n/e, which is equal to 0. Since j was not suppose to include n, take that out and get -μ(n/e) = -μ(d). This factor is in the denominator, so the exponent on x^e-1 is μ(d), as it should be.

If n is odd, φ(n) = φ(2n). Hence ζ_n and ζ_2n have the same degree. In fact there is a deeper connection.

Let y be a primitive root of ζ_n. Adjoining y also brings in -y, which is a primitive root of order 2n. You get the 2n roots for free.

Think of ζ_n as the product of x-y^j, over the primitive n^th roots of 1. Replace each y^j with -y^j to find the primitive roots of order 2n. Raise any of these to the n and get -1, then square to get 1, thus each has order 2n. This builds ζ_2n. The lead coefficient of the product is still 1. The next coefficient, the sum of the roots, has been negated. The next coefficient, the sum of the pairwise products of the roots, is unchanged. The next coefficient is negated, and so on down the line.

I'll illustrate with 5 and 10. Since 5 is prime, ζ₅ = x⁴+x³+x²+x+1. Compute ζ₁₀ by the mobius formula.

ζ₁₀ =

(x¹⁰-1) × (x-1) / (x⁵-1) / (x²-1) =

(x⁵+1) / (x+1) =

x⁴ - x³ + x² - x + 1

At first it appears that all the cyclotomic coefficients are 0 or ±1. Derive the first 20 or so and you'll see what I mean. But don't try to prove it, because it's not true. The smallest counterexample is n = 105.

ζ looks somewhat unpredictable, but if n is prime, ζ_n = x^n-1 + x^n-2 + … + x² + x + 1. ζ_2n is the aforementioned sum with alternating signs.

ζ is also well understood when n is a power of 2. Use the mobius formula above. The only divisors that are squarefree are 1 and 2. Thus xⁿ-1 is divided by x^n/2-1, giving x^n/2 + 1. We saw this when adjoining the eighth roots of 1. The adjoined root y satisfies x⁴ + 1 = 0.

Our carefully crafted cyclotomic polynomial ζ need not be irreducible. Consider the 8^th roots of 1 over Z/3. The cyclotomic polynomial has degree φ(8) = 4, but a 2 dimensional extension of Z/3 produces the finite field of order 9, which includes all 8 roots of 1. Here is the factorization of ζ₈.

x⁴+1 = (x²+x+2) × (x²+2x+2)

Either quadratic on the right can be used to bring in the 8 roots of 1. Adjoin y, a root of the first quadratic, and linear combinations of 1 and y are sufficient to span all the roots of 1. Apply the automorphism y → y³ to get another primitive root. Apply this again to get back to y. The other quadratic has y⁵ and y⁷ as roots, also conjugates of each other.

In the last section we built a cyclotomic polynomial over Z/3 that was not irreducible. However, all cyclotomic polynomials are irreducible over the integers. Here is the proof, courtesy of Gauss.

Let y be a primitive n^th root and let p be any prime not dividing n. Suppose ζ factors into g * h, with h irreducible, and h has the root y. Remember that there is a minimum polynomial with root y, which divides every other polynomial having root y. Since h is irreducible, h is the minimum polynomial with root y.

Since p and n are coprime, y^p is another primitive root. Suppose y^p is a root of g(x). This means y is a root of g(x^p).

Since h also has root y, and h is irreducible, h is a factor of g(x^p). Say g(x^p) = h * k.

Reduce all coefficients mod p, giving the following polynomial equation. Both g and h are monic, so the polynomials do not disappear when reduced mod p.

h * k = g(x^p) = g(x)^p

Some irreducible factor of h, possibly h itself, now divides g.

h₁ * h₂ = g ( over Z/p )

h₁ * h₂ * k = h₁^p * h₂^p

Each polynomial above is a factor of xⁿ-1. Split all of these polynomials over Z/p adjoin its cyclotomic roots. Let z be a root of h₁, thus x-z is a factor of h₁. Now x-z appears p times on the right, and by unique factorization, x-z appears p times on the left. However, the left is g(x), a factor of xⁿ-1, and all the roots of g(x) are distinct. This contradiction means y^p is not a root of g(x).

Since y^p is not in g(x), it belongs to h(x), as does y to the p², and p³, and so on.

Let's look at a specific primitive root y^j. If p² divides j, raise y to the p, and then to the p again. This is a primitive root that lies in h. Then, if q divides j, raise this root to the q and find another primitive root in h. Continue through the primes of j, until y^j lies in h. With j coprime to n, all the primes of j are coprime to n, and there is no trouble.

Since h contains all the primitive roots, it is equal to ζ_n, which is built from these same primitive roots, hence ζ_n is irreducible.

By gauss' lemma, ζ_n(x) is irreducible over Q, or any subring of Q, such as Z localized about p.

Adjoining an n^th root of 1 to Q is an extension of order φ(n), as per the degree of ζ_n, which is irreducible.

You can map y to any other primitive root, since they are all conjugates of the same irreducible polynomial. The automorphisms of the extension are the integers coprime to n, and composition of automorphisms is multiplication mod n.

When n is prime, ζ_n, which is the sum of the powers of x up to x^n-1, is irreducible. This result generalizes to xⁿ-dⁿ over x-d, for any nonzero integer d, via the following argument.

Let p q and r be polynomials with coefficients in a field F, such that p = q * r. If v is a nonzero constant taken from F, then substituting x → vx is a ring automorphism. In other words, it commutes with addition and multiplication. Therefore p(x) is reducible iff p(vx) is reducible. Turn this around and p(x) is irreducible iff p(vx) is irreducible.

The cyclotomic polynomial, x^n-1 + x^n-2 + … + x³ + x² + x + 1, is irreducible over the rationals. The same is true when x is replaced with x/d. Multiply this by d^n-1, which is a unit in Q, and the following is irreducible.

x^n-1 + dx^n-2 + d²x^n-3 + … + d^n-2x + d^n-1

This is the quotient xⁿ-dⁿ over x-d, and we're done.

Set d = -1 to show the alternating sum of the powers of x is irreducible.

Let G be the gaussian integers, that is, the integers adjoin i. Is ζ still irreducible over G?

Remember that G is a ufd, so ζ factors over G iff it factors over the fraction field of G. These are the rational points in the complex plane.

As mentioned earlier, a polynomial is irreducible iff the same polynomial, with x scaled by a constant, is irreducible. In particular, ζ_n(x) is irreducible iff ζ_n(-x) is irreducible. With n odd, the second ixpression is ζ_2n(x). Therefore it is enough to prove irreducibility for n even.

Next let n be even, but not divisible by 4. Q(i)(y) is the same field extension as Q(y)(i). Call this extension F. Since i is not in Q(y), F/Q has dimension 2×φ(n). Thus F/Q(i) has dimension φ(n). The polynomial of y has degree φ(n), therefore it is irreducible over Q(i).

Finally, when n is divisible by 4, ζ always factors. Adjoin i first, then y, to create a composite extension of dimension φ(n). The first has dimension 2, so the irreducible polynomial associated with y, relative to G, has degree φ(n)/2. The same thing happens if you select a different primitive root, from the other half of ζ_n. Thus ζ factors precisely into two irreducible pieces, each having half the degree.

Let's illustrate with n = 8. ζ(8) = x⁴+1. This does not split in the integers, but bring in i, and it becomes (x²+i) × (x²-i). The first has roots y³ and y⁷, and the second has roots y and y⁵.

Adjoin y, and build an extension E of dimension d over Q or over Z/p. Here d is φ(n), or perhaps a factor of φ(n) in the finite case. There are d conjugates of y, and they are all primitive. There are d automorphisms, taking y to each of these conjugates.

Let the norm of s be the product of its d conjugates. This is familiar territory.

The norm of st is the product of the conjugates of st, and if c is one of these automorphisms, c(st) = c(s)c(t). Regroup terms, and |st| = |s|*|t|. Thus norm is a multiplicative homomorphism, but what is the range?

Apply c to the norm of s, and you simply rearrange the factors; the outcome is the same. Thus the norm lives in the field that is fixed by all the automorphisms of E. This fixed field is the base field Q, or Z/p, though that is far from obvious.

Take the finite case first. The automorphisms are frobenius, and if |s| is fixed by all of them it is fixed by the first one; raising everything to the p. If x is fixed by this automorphism then x^p = x. There are p solutions to this polynomial, namely the integers from 0 to p-1. Therefore |s| is an integer.

If the base is a larger finite field L, then the first automorphism raises everything to some power of p, commensurate with the order of L. This automorphism fixes L, and no more than L. Once again the norm lies in L.

Now turn to Q. Like the finite case, anything that is fixed by all the automorphisms lives in the base field Q. I offer this without proof for now, but it is a result of galois theory, which is coming soon. For now let's just say that |s| lies in Q.

More than just Q, the norm is an integer. Assuming s is nonzero, the norm is the product of nonzero elements in an integral domain, and |s| is not zero. Remember that s is a linear combination of powers of y. Write |s| as a product of conjugates, and the result is another linear combination of powers of y, where each coefficient is a polynomial in the original coefficients of s. Recall the polynomial that implements norm when n = 8. A similar polynomial can be built for any n, though it becomes quite unwieldy. Assuming the coefficients of s are integers, i.e. s belongs to Z[y], then the coefficients of |s| are also integers. With |s| belonging to Q, the coefficients on y and higher powers of y are 0, and |s| is an integer.

In fact the norm is a positive integer. Group various conjugates of s together in pairs. Remember that each conjugate is y replaced with some power of y. so y → y¹ and y → y^n-1 go together. These two images of s are conjugates in the complex plane, reflections through the x axis. Their product lies on the positive x axis. If n is odd, put y → y² and y → y^n-2 together. Their product lies on the positive x axis. If n is not divisible by 3, put y → y³ and y → y^n-3 together. Do this for all the conjugates of s, and |s| lies on the positive x axis. If s belongs to Z[y], then |s| is a positive integer.

From here the familiar theorems come rolling in: s is a unit iff |s| = 1, and s is irreducible when |s| is prime.

If y is an n^th root of 1, the powers of y are all units in the cyclotomic extension. Can we identify any other units?

Let i and j be integers such that gcd(i,n) = gcd(j,n). Let r be the ratio (1-yⁱ) / (1-y^j). Use synthetic division to expand the quotient, giving the following:

1 + y^j + y^2j + y^3j + … + y^k

The quotient terminates because there is some k such that jk = i mod n. Divide through by the common gcd, whence j/g becomes a unit mod n/g, and some k times j/g gives i/g, another unit mod n/g.

The inverse, (1-y^j) / (1-yⁱ), is also a polynomial in y, hence r is invertible.

Some of these expressions produce old familiar units. When n = 5, (1-y²) / (1-y³) = 1+y³+y+y⁴, or -y². Others are new, such as (1-y²) / (1-y) = 1+y. This is not on the unit circle, not ± a power of y.

If n is prime, what is the norm of 1-y? We already solved this when n = 3. y is a cube root of 1; plot it on the unit circle, 120 degrees around. That puts 1-y in the fourth quadrant, 30 degrees down from the x axis. The conjugate 1-y² is also the complex conjugate, 30 degrees up from the x axis. Their product is 3, hence the norm is 3. Furthermore, 1-y over 1-y² is a ratio unit, thus 1-y and 1-y² are associates. Therefore 3 is a ramified prime, (1-y)² times some unit. Happily, the same thing is going to happen for all n.

Since n is prime, 1-y over 1-y^j is a ratio unit, hence all the conjugates of 1-y are associates of 1-y. The norm of y is a unit away from (1-y)^n-1.

Remember that the product of x-y^j, as j runs from 1 to n, is xⁿ-1. Divide this by x-1 to get ζ_n: x^n-1 + x^n-2 + … + x + 1. Replace x with 1 to get the desired product; the answer is n. Therefore |1-y| = n.

Since the norm is prime, 1-y is irreducible. All its conjugates are also associates, hence (1-y)^n-1 = n, or at least a unit times n, and n is a ramified prime.

The same thing happens when n is a power of 2. The cyclotomic polynomial ζ_n is x^n/2+1, so replace x with 1 and get 2. The norm of 1-y is 2, hence 1-y is irreducible. The conjugates of y are y raised to any odd power. Quotients of these conjugates are ratio units, and these conjugates are all associates of each other. Thus 2 is ramified, equal to (1-y)^n/2.

A complete characterization of primes over primes is coming, once we have more machinery in place. One of my professors remarked, "If you like algebraic number theory, then you can never know enough about the cyclotomics."