April 22, 2017 Leave a comment
Reading David Deutsch’s papers on quantum physics requires knowing some matrix maths. The papers are here
This post gives a brief account of the relevant maths.
First, a brief explanation of complex numbers. Ordinary positive and negative numbers have the property that the square of the number is positive, e.g. . An imaginary number is defined to have the property that its square is negative. The imaginary number i is the number such that and other imaginary numbers are just multiples of i. Also, A complex number is a sum of an ordinary real number and an imaginary number, e.g. 1+2i is a complex number.
For a complex number given by the complex conjugate of is defined as . Now, and is called the magnitude of . For a real number , , so for any complex number, there is a real number such that . It also happens to be true that and so a complex number is sometimes represented as
These papers are about the multiverse as described by quantum mechanics. Each system exists in multiple versions that can interact in interference experiments. For any particular quantity you could measure for which there are multiple possible outcomes, there is one version of the system for each outcome. There is a finite set of possible measurement results for any finite system.
Let’s suppose that we have a system S and a measurement that could be performed on S with two possible outcomes +1,-1. There needs to be something in the theory that represents the transitions between each outcome. There is a complex number that describes each transition : the probability of the transition is . So for S the thing that represents these transitions would need 4 numbers: one for each pairs of outcomes. Now, a version of the system could do the transition and then it could do any of the transitions allowed from -1. The system could also do any -1 transition if it did the transition .
What happens if two transitions happen one after another? The way to work out what happens is you list the set of possible states of the system. You can describe the first set of transitions as a square matrix whose elements are the numbers for each transition. So for the system S the matrix would read:
The second transition would have a different set of numbers and the corresponding matrix would be:
To work out the number for the composition of the transitions, you take the product of the transitions for which the final state of the first transition is the same as the initial state of the next transition and add them together. The matrix that describes the result of both transitions would be:
This is just the equation for the result of multiplying a pair of matrices. More generally, for a set of N possible states a set of transitions is represented by an matrix. If two sets of transitions are represented by matrices A and B, the transition that happens if you do the transition described by A followed by the transition described by B is described by the matrix product of B and A, whose elements are . For more than two transitions you just multiply more matrices, with the earlier transitions on the right and the later ones on the left.
So far I have only described transitions. What describes the system undergoing the transitions? The answer is more matrices. You need a set of matrices that can be multiplied by complex numbers and added up to give any other matrix of the same dimension. The reason is that you need a set of matrices that can be used to represent all of the possible transitions. For a system with N possible states you need matrices. If A is a transition matrix and M is one of the matrices describing the system then the system after the transition is described by , where is the Hermitian conjugate of A: the matrix found by taking the complex conjugate of the entries and interchanging their indices. So the Hermitian conjugate of
is given by
The matrices representing the transitions are unitary, which means that where I is the identity matrix that has 1s on the diagonal and zero on all off-diagonal entries. Some examples of unitary matrices:
Measurable quantities are represented by eigenvalues (the definition will given below, but requires some setup) of Hermitian matrices. A Hermitian matrix M is a matrix for which . Some examples of Hermitian matrices:
These matrices are called the Pauli matrices. Suppose a matrix M and a vector v have the property Mv = av, where a is a number, then a is an eigenvalue of M and v is an eigenvector of M. The first three Pauli matrices all have eigenvalues +1 and -1. The eigenvectors for are $latex[1,0],[0,1]$. The eigenvectors for are . The dot product of two vectors is given by . The dot product of two eigenvectors is always zero: they are said to be orthogonal.
A projector P is an operator such that . The projectors for the three Pauli matrices have the property that or . More generally, for any Hermitian matrix M there is a set of projectors such that where if and 0 otherwise for which and the numbers are the eigenvalues of M.
If you have two different systems , the transition matrices and the matrices representing the system’s state can be represented by tensor products of the matrices representing each system. The tensor product of two matrices A,B is denoted as $latex A\otimes B$ and is represented by
A function f applied to a Hermitian operator is given by.
I think that covers most the matrix stuff you need to know to read those papers. More stuff on matrices can be found in Quantum Computation and Quantum Information by Nielsen and Chuang, which also has exercises.