10.1. Cartesian form of complex numbers#
10.1.1. Introduction#
Consider the equation
where \(a\neq0\). Previously you probably learned that Equation (10.1.1) only has solutions when \(D=b^2-4ac\) was non-negative and in that case you would have the two solutions
Now consider the case that you really need solutions of Equation (10.1.1) even if \(D\) is negative. This might sound strange, but you will encounter this case in many engineering applications, for example in solving second-order linear differential equations. It turns out that the solution in Equation (10.1.2) is still valid when \(D<0\), which means that you need to take the square root of a negative number.
Luckily, mathematicians have found a way to handle this, namely complex numbers. Before we start treating complex numbers, we first need to introduce the special number \(i\). After that, we will introduce the complex numbers and some other terminologies.
10.1.2. Complex numbers#
Definitions
We start with considering an easier form of Equation (10.1.1), namely
From this equation we find that \(D=-1\) and if we just straight forward apply Equation (10.1.2), we get
The square root of \(-1\) obviously is a problem. Therefore we introduce the special number \(i\):
Definition 10.1.1
The imaginary unit \(i\) is a number defined by the equation
Because \(i\) is defined to be a number (mind you, it is not a real number), we also assume \(i\) behaves just like any normal number.
Using the imaginary unit \(i\) we can prove the following theorem:
Theorem 10.1.1
Let \(a\) be a positive real number. Then the two numbers \(x_-=-ai\) and \(x_+=ai\) are solutions to the equation \(x^2=-a^2\).
Proof of Theorem 10.1.1
First we consider \(x_-=-ai\) and take its square:
This shows that \(x_-=-ai\) is indeed a solution to the equation \(x^2=-a^2\).
We repeat the same for \(x_+=ai\):
We also find that \(x_+=ai\) is a solution to the equation \(x^2=-a^2\).
This means we can rewrite our two solutions as
Doesn’t this already look simpler?
Now we have defined \(i\), we can revisit the general case given in Equation (10.1.1). Let us start with an example:
Example 10.1.1
Consider the second degree polynomial \(p(z) =z^2+2z+5\) and we want to solve the equation \(p(z)=0\).
We are going to do this by first rewriting \(p\) to the form \(p(z) = (z+p)^2+q\) for some numbers \(p\) and \(q\).
Expanding gives that we want \(z^2+2z+5=z^2+2pz + (p^2+q)\), thus \(2=2p\) and \(5=p^2+q\). The first equation gives us \(p=1\). Plugging this into the second equation, we obtain \(5=1+q\), so \(q=4\). Therefore, \(z^2+2z+5=(z+1)^2+4\).
To solve \(z^2+2z+5=0\), we can now write
Note that we used Theorem 10.1.1.
Going from \(z^2+2z+5\) to \((z+1)^2+4\) is called completing the square. You can also immediately see that the minimal value of the parabola \(y=z^2+2z+5\) for real values of \(z\) equals \(4\) (as \((z+1)^2\geq 0\) for all real \(z\)), and the minimum is obtained when \(z=-1\).
In general, you can write any polynomial \(az^2+bz+c\) in the form \(a ((z+p)^2+q)\) by first factoring out the \(a\), subsequently choosing the \(p\) such that the linear term (the term involving \(z\)) is correct, and letting \(q\) be the remainder. Using this form, you can then determine the zeros of the polynomial.
As you can see in Example 10.1.1, we now found two numbers that are of the form \(a+bi\), where \(a\) and \(b\) are real numbers (for short \(a\in\mathbb{R}\) and \(b\in\mathbb{R}\)). A number like this is called a complex number:
Definition 10.1.2
A complex number is a number of the form
where \(a\in\mathbb{R}\) and \(b\in\mathbb{R}\).
The set of all complex numbers is denoted by the symbol \(\mathbb{C}\), and is called the complex plane.
Such complex numbers we usually denote with the letter \(z\) (if we only have one). A complex number also has some special parts, which we define next:
Definition 10.1.3
If \(z=a+bi\) is a complex number (with \(a\in\mathbb{R}\) and \(b\in\mathbb{R}\)), the real part \(\Re{z}\) is defined as
and the imaginary part \(\Im{z}\) is defined as
To make everything concise, we also define the relation between the set of real numbersr \(\mathbb{R}\) and the set of complex numbers \(\mathbb{C}\):
Definition 10.1.4
Assume \(x\in\mathbb{R}\). Then we define that \(x\in\mathbb{C}\) with \(\Re{x}=x\) and \(\Im{x}=0\).
Assume \(z\in\mathbb{C}\) and \(\Im{z}=0\). Then we define that \(z\in\mathbb{R}\).
Just like we can visualise real numbers on a number line, we can visualise complex numbers. For this we need two axes, one indicating the value of the real part of a complex number and one indicating the imaginary part of the same complex number. In Figure 10.1.1 you can see this visualisation. The horizontal axis always indicates the real part, while the vertical axis always represents the imaginary part.
Fig. 10.1.1 Visualisation of the complex plane \(\mathbb{C}\).#
Operations with complex numbers
With complex numbers we can do the same operations as with real numbers:
Theorem 10.1.2
If \(z=a+bi\) and \(w=c+di\) are complex numbers (with \(a,b,c,d\in\mathbb{R}\)), then the following numbers are again complex numbers:
Proof of Theorem 10.1.2
We prove the four results by working each out separately. We start with the addition:
Then we treat subtraction the same:
Now we turn to multiplication:
Finally we look into the division:
Besides these four standard operations we have one more:
Definition 10.1.5
If \(z=a+bi\) is a complex number (with \(a,b\in\mathbb{R}\)), the complex conjugate \(\overline{z}\) is defined as
which is also a complex number.
We can combine the complex conjugate with the first four operations, which gives the following theorem:
Theorem 10.1.3
If \(z\) and \(w\) are a complex numbers, then the following identities hold:
Proof of Theorem 10.1.3
We show each of the identities, one after the other, where we assume \(z=a+bi\) and \(w=c+di\), \(a,b,c,d\in\mathbb{R}\):
Double conjugation:
Addition and conjugation:
Substraction and conjugation:
Multiplication and conjugation:
Division and conjugation:
Even better, we can relate the complex conjugate with the real and imaginary part of a complex number:
Theorem 10.1.4
If \(z\) is a complex number, then the following identities hold:
Proof of Theorem 10.1.4
We show each of the identities, one after the other, where we assume \(z=a+bi\), \(a,b\in\mathbb{R}\):
Conjugation and real part:
Conjugation and imaginary part:
Conjugation and product:
From the second identity above we can even deduce the next theorem:
Theorem 10.1.5
Assume \(z\in\mathbb{C}\). \(z\in\mathbb{R}\) if and only if \(z=\overline{z}\).
Proof of Theorem 10.1.5
Assume \(z\in\mathbb{C}\).
If \(z\in\mathbb{R}\), then \(\Im{z}=0\). The second identity of Theorem 10.1.4 then gives \(\frac{z-\overline{z}}{2i}=0\), which gives in turn \(z=\overline{z}\).
If \(z\notin\mathbb{R}\), then \(\Im{z}\neq0\). The second identity of Theorem 10.1.4 then gives \(\frac{z-\overline{z}}{2i}\neq0\), which gives in turn \(z\neq\overline{z}\).
Geometric interpretation of the complex conjugate
First, we look at the complex conjugation. This is a relatively straightforward case, as it involves only a single number. Recall that the complex conjugate \(\bar z\) changes the sign of the imaginary part of the number \(z\). That is \(\overline{a+bi} = a-bi\). As the imaginary part of a complex number corresponds to the second coordinate of its representation in the complex plane, this implies that the number is reflected in the real axis (the horizontal axis). See Figure 10.1.2.
Fig. 10.1.2 Complex conjugation reflects a number in the real axis.#
Geometric interpretation of addition
The geometric interpretation of adding complex numbers should look familiar to you. Indeed if we add \(z=a+bi\) and \(w=c+di\) the new number is \(z+w=(a+c)+(b+d)i\), so we add the real and imaginary parts. This means we add the coordinates of the corresponding points, just as if we were adding vectors. Thus, geometrically we can add two complex numbers by following the parallelogram rule. That is, the lines from the origin to the two complex numbers form two sides of a parallelogram with vertices \(0\), \(z\), \(z+w\), and \(w\). See Figure 10.1.3.
Fig. 10.1.3 Adding complex numbers follows the parallelogram rule.#
If we want to interpet the other operations such as multiplication easily, we first need another way of writing complex numbers. But before we do that we will turn our attention first to solving equations.
10.1.3. Solving equations#
The reason for introducing complex numbers is to ensure that more equations have solutions, for example \(z^2+1=0\). In this section, we consider equations involving complex numbers. This means that the solutions may be complex, but also that the coefficients in the equations can be complex.
We already solved quadratic equations using a new technique called completing the square and in this section you will learn more ways to solve equations.
We introduced complex numbers to give the equation \(x^2+1 = 0\) a solution. It appears that something much stronger holds, namely, that every polynomial equation with coefficients in \(\mathbb{C}\), for instance \((1+i)x^4 - 2x^2 + x = 10i\), has solutions in \(\mathbb{C}\). This is the content of the following theorem.
Theorem 10.1.6 (Fundamental Theorem of algebra)
Consider a polynomial \(p(z)\) of degree \(n\),
where the coefficients \(a_n, a_{n-1}, \ldots, a_0\) are complex numbers and \(a_n\neq 0\). Then you can factor the polynomial in linear terms, that is
for some complex numbers \(b_1, b_2, \ldots, b_n\).
Observe that this factorisation means that \(b_1, b_2, \ldots, b_n\) are the zeros (= roots) of the polynomial \(p(z)\). It might happen that the \(b\)’s are complex numbers, even if \(a_1, a_2, \ldots, a_n\) are real.
We will not discuss the proof of this theorem, as that requires much more mathematics. However, we will illustrate the theorem using some examples.
Example 10.1.2
Consider \(p(z)=z^2+3z+2\). Then we know that we can factor the polynomial as \(p(z) = (z+2)(z+1)\) and thus find the zeros as \(-2\) and \(-1\).
Example 10.1.3
Consider the equation \(z^2=-1\). We can always rewrite an equation to an equation where one side is equal to zero by moving everything to one side. Thus this equation corresponds to \(z^2+1=0\). We can now factor \(z^2+1\) to \((z+i)(z-i)\) and thus find that \(i\) and \(-i\) are the two solutions to this equation.
Example 10.1.4
While a polynomial of degree \(n\) can be factored in \(n\) linear terms, and we have \(n\) values \(b_i\), this does not mean that there are \(n\) distinct zeros. For example \(p(z) = z^2+2z+4\) can be factored as \(p(z)=(z+2)^2\) and thus only has \(z=-2\) as a solution. However, the term \((z+2)\) occurs twice in the factorisation. We therefore say that the multiplicity of the zero \(-2\) is equal to two.
In particular, we see that any polynomial of degree \(n\) has \(n\) complex zeros counting multiplicity:
Theorem 10.1.7
If \(\{z_1,z_2,\ldots,z_k\}\) is the set of unique zeros of a polynomial \(p\) of degree \(n\) with \(p(z)=a_n z^n + a_{n-1} z^{n-1} + \cdots + a_1 z + a_0\), the polynomial \(p\) can be written as
where \(\alpha_j,j=1,2,\ldots,k\) are positive integers and
Proof of Theorem 10.1.7
Assume \(\{z_1,z_2,\ldots,z_k\}\) is the set of unique zeros of a polynomial \(p\) of degree \(n\). Then following Theorem 10.1.6, we can write
for some complex numbers \(b_1, b_2, \ldots, b_n\).
Because \(p(z_1)=0\) for \(j\in\{1,\ldots,k\}\), we must have that \(\alpha_1\in\{1,\ldots,n\}\) numbers out of the set \(\{b_1, b_2, \ldots, b_n\}\) must equal \(z_1\). Without loss of generality we may assume those are \(b_1,\ldots,b_{\alpha_1}\). This is also means that Equation (10.1.4) transforms to
We can repeat the above argument for \(z_2\): we must have that \(\alpha_2\in\{1,\ldots,n-\alpha_1\}\) numbers out of the set \(\{b_{\alpha_1+1}, \ldots, b_n\}\) must equal \(z_2\). Without loss of generality we may assume those are \(b_{\alpha_1+1},\ldots,b_{\alpha_1+\alpha_2}\). This is also means that Equation (10.1.5) transforms to
Repeating this argument for \(z_3,\ldots,z_k\) leads to desired formulae.
Definition 10.1.6
If \(z_j\) is a zero of a polynomial \(p\) of degree \(n\) with \(p(z)=a_n(z-z_1)^{\alpha_1}(z-z_2)^{\alpha_2}\cdots(z-z_k)^{\alpha_k}\), the (algebraic) multiplicity of \(z_j\) is equal to \(\alpha_j\).
Example 10.1.5
The third degree polynomial \(p(z) = z^3-4z^2\) can be factored as \(p(z) = z^2(z-4) = (z-0)^2(z-4)\). Therefore, it has zeros \(4\) and \(0\), where the multiplicity of \(4\) is equal to one and the multiplicity of \(0\) is equal to two. The degree of the polynomial is \(3\), which is equal to the sum of the multiplcities of its zeros (\(1+2=3\)).
Example 10.1.3 showed that both \(z=i\) and its complex conjugate \(\overline{z}=-i\) where roots of the polynomial \(p(z)=z^2+1\). One might wonder whether it is always the case that both \(z\) and its complex conjugate \(\overline{z}\) are both roots of a given polynomial. It can be shown that this is the case if all coefficients are real valued.
Theorem 10.1.8
Let \(p\) be a polynomial with real coefficients. If \(p(z)=0\), then \(p(\overline{z})=0\) as well, and the algebraic multiplicities of \(z\) and \(\overline{z}\) are the same.
Proof of Theorem 10.1.8
Consider a polynomial \(p\) of degree \(n\), \(\sum_{j=0}^n a_j z^j\), where the coefficients \(a_n, a_{n-1}, \ldots, a_0\) are real-valued numbers and \(a_n\neq 0\).
First we show that \(p(\overline{z})=0\) by considering \(\overline{p(z)}\) twice:
but also
Combining these two results gives the desired \(p(\overline{z})=0\).
Now we focus on the algebraic multiplicity. Theorem 10.1.8 shows that we can write \(p(z)\) as
where \(\alpha_j,j=1,2,\ldots,k\) are positive integers,
and \(z_1,z_2,\ldots,z_k\) are the unique zeros of \(p\).
If \(p(z)=0\), then \(z=z_j\) for some \(j\in\{1,2,\ldots,k\}\) and \(z\neq z_i\) for \(i\in\{1,2,\ldots,k\}\setminus\{j\}\). Without loss of generality we can assume \(j=1\).
This means that also \((z-z_1)^{\alpha_1}=0\) and \((z- z_i)^{\alpha_i}\neq0\) for \(i\in\{2,\ldots,k\}\), and even more that \(\overline{(z-z_1)}^{\alpha_1}=0\) and \(\overline{(z- z_i)}^{\alpha_i}\neq0\) for \(i\in\{2,\ldots,k\}\).
From the last it even follows that \((\overline{z}-\overline{z_1})^{\alpha_1}=0\) and \((\overline{z}- \overline{z_i})^{\alpha_i}\neq0\) for \(i\in\{2,\ldots,k\}\).
Using the previous results, we look at \(p(\overline{z})\):
As all terms except the first term \((\overline{z}-\overline{z_1})^{\alpha_1}\) are non-zero, and the first term is zero, we find that \(\overline{z}=\overline{z_1}\), \(\alpha_1\) times. In other words, the algebraic multiplicity of \(\overline{z}\) is \(\alpha_1\).
Polynomial Division
Next, we consider a method you can use whenever you know one root of a polynomial. The fundamental theorem of Algebra says that if \(p(z)\) is a polynomial such that \(p(b)=0\) for some \(b\), then \(p(z) = a_n(z-b)(z-b_2)\cdots (z-b_n) = (z-b) q(z)\) for another polynomial \(q(z)=a_n(z-b_2)\cdots (z-b_n)\). Thus, we divide the polynomial \(p(z)\) by \((z-b)\) in this case and obtain a new polynomial. To find the zeros of \(p\) we now just have to find the zeros of the quotient \(\nicefrac{p(z)}{z-b}\) and add \(b\) to this list.
To divide a polynomial by another polynomial you can use a long division. Let us recall how this worked for ordinary fractions.
Example 10.1.6
Let us calculate \(\frac{97813}{382}\). In Figure 10.1.4 on the left, you see the American notation for Long division, on the right the corresponding Dutch notation. Everything in red is usually not written down, but included here to clarify what happens. (Note that the calculations in the middle are true regardless of whether the red parts are included or not.)
Fig. 10.1.4 Example of long division for ordinary fractions.#
Here, the following calculation has been written down concisely:
First, subtract as many multiples of \(100\cdot 382\) from \(97813\) as possible (or multiples of \(100\) from \(\frac{97813}{382}\)). Next, you do the same with the remainder, and one digit lower, so you subtract as many multiples of \(10\cdot 382\) from the remainder \(21413\). You continue until you can’t subtract the numerator even once from the remainder (or until you have as many digits as you want).
We can do the same thing for polynomials.
Example 10.1.7
Consider \(p(z) = z^3+3z^2+z-5\). You may notice that \(z=1\) is a root; \(p(1)=0\). Thus \(z-1\) must be a factor. If we calculate this division we obtain (on the left again US notation, on the right Dutch notation):
Fig. 10.1.5 Example of long division for polynomials.#
Note that the remainder is zero, as we should have expected.
Here, we essentially calculate
As a consequence, we see that \(z^3+3z^2+z-5=(z-1)(z^2+4z+5)\), so it equals \(0\) if either \(z-1=0\) or \(z^2+4z+5=0\). Completing the square gives \(z^2+4z+5=(z+2)^2+1\), so the zeros are \(z=-2\pm i\). Thus, the roots of \(z^3+3z^2+z-5\) are \(z=1\) and \(z=-2\pm i\).
10.1.4. Grasple exercises#
Grasple exercise 10.1.1
Real part of a complex number.
Click to show/hide
Grasple exercise 10.1.2
Imaginary part of a complex number.
Click to show/hide
Grasple exercise 10.1.3
Adding two complex numbers.
Click to show/hide
Grasple exercise 10.1.4
Subtracting two complex numbers.
Click to show/hide
Grasple exercise 10.1.5
Multiplying two complex numbers.
Click to show/hide
Grasple exercise 10.1.6
Dividing two complex numbers.
Click to show/hide
Grasple exercise 10.1.7
Complex conjugate of a complex number.
Click to show/hide
Grasple exercise 10.1.8
Multiplying a complex number with it’s complex conjugate.
Click to show/hide
Grasple exercise 10.1.9
Completing the square with real coefficients.
Click to show/hide
Grasple exercise 10.1.10
Completing the square with complex coefficients.
Click to show/hide
Grasple exercise 10.1.11
One more time completing the square.
Click to show/hide
Grasple exercise 10.1.12
Polynomial long division.
Click to show/hide
Grasple exercise 10.1.13
Another long division with polynomials.
Click to show/hide
Grasple exercise 10.1.14
Counting solutions of a polynomial equation.
Click to show/hide
Grasple exercise 10.1.15
On the relation between roots of polynomials.
Click to show/hide
Grasple exercise 10.1.16
Solving a cubic equation given a root.
Click to show/hide
Grasple exercise 10.1.17
Solving a quartic equation given two roots.
Click to show/hide
Grasple exercise 10.1.18
Solving a quartic equation given one root.