Files
NoteNextra-origin/pages/Math401/Math401_T4.md
Zheyuan Wu 1e2f670abf update
2025-06-21 17:00:07 -05:00

358 lines
17 KiB
Markdown

# Topic 4: The quantum version of probabilistic concepts
> In mathematics, on often speaks of non-commutative instead of quantum constructions.
**Note, in this section, we will see a lot of mixed used terms used in physics and mathematics. I will use _italic_ to denote the terminology used in physics. It is safe to ignore them if you just care about the mathematics.**
## Section 1: Generalities about classical versus quantum systems
In classical physics, we assume that a system's properties have well-defined values regardless of how we choose to measure them.
### Basic terminology
#### Set of states
The preparation of a system builds a convex set of states as our initial condition for the system.
For a collection of $N$ system. Let procedure $N_1=\lambda P_1$ be a preparation procedure for state $P_1$, and $N_2=(1-\lambda) P_2$ be a preparation procedure for state $P_2$. The state of the collection is $N=\lambda N_1+(1-\lambda) N_2$.
#### Set of effects
The set of effects is the set of all possible outcomes of a measurement. $\Omega=\{\omega_1, \omega_2, \ldots, \omega_n\}$. Where each $\omega_i$ is an associated effect, or some query problems regarding the system. (For example, is outcome $\omega_i$ observed?)
#### Registration of outcomes
A pair of state and effect determines a probability $E_i(P)=p(\omega_i|P)$. By the law of large numbers, this probability shall converge to $N(\omega_i)/N$ as $N$ increases.
**Quantum states, _observables_ (random variables), and effects can be represented mathematically by linear operators on a Hilbert space.**
## Section 2: Examples of physical experiment in language of mathematics
### Sten-Gernach experiment
_**State preparation:**_ Silver tams are emitted from a thermal source and collimated to form a beam.
_**Measurement:**_ Silver atoms interact with the field produced by the magnet and impinges on the class plate.
_**Registration:**_ The impression left on the glass pace by the condensed silver atoms.
## Section 3: Finite probability spaces in the language of Hilbert space and operators
> Superposition is a linear combination of two or more states.
A quantum coin can be represented mathematically by linear combination of $|0\rangle$ and $|1\rangle$.$\alpha|0\rangle+\beta|1\rangle\in\mathscr{H}\cong\mathbb{C}^2$.
> For the rest of the material, we shall take the $\mathscr{H}$ to be vector space over $\mathbb{C}$.
### Definitions in classical probability under generalized probability theory
#### Definition of states (classical probability)
A state in classical probability is a probability distribution on the set of all possible outcomes. We can list as $(p_1,p_2,\cdots,p_n)$.
To each event $A\in \Omega$, we associate the operator on $\mathscr{H}$ of multiplication by the indicator function $P_A\coloneqq M_{\mathbb{I}_A}:f\mapsto \mathbb{I}_A f$ that projects onto the subspace of $\mathscr{H}$ corresponding to the event $A$.
$$
P_A=\sum_{k=1}^n a_k|k\rangle\langle k|
$$
where $a_k\in\{0,1\}$, and $a_k=1$ if and only if $k\in A$. Note that $P_A^*=P_A$ and $P_A^2=P_A$.
#### Definition of density operator (classical probability)
Let $(p_1,p_2,\cdots,p_n)$ be a probability distribution on $X$, where $p_k\geq 0$ and $\sum_{k=1}^n p_k=1$. The density operator $\rho$ is defined by
$$
\rho=\sum_{k=1}^n p_k|k\rangle\langle k|
$$
The probability of event $A$ relative to the probability distribution $(p_1,p_2,\cdots,p_n)$ becomes the trace of the product of $\rho$ and $P_A$.
$$
\operatorname{Prob}_\rho(A)\coloneqq\text{Tr}(\rho P_A)
$$
#### Definition of random variables (classical probability)
A random variable is a function $f:X\to\mathbb{R}$. It can also be written in operator form:
$$
F=\sum_{k=1}^n f(k)P_{\{k\}}
$$
The expectation of $f$ relative to the probability distribution $(p_1,p_2,\cdots,p_n)$ is given by
$$
\mathbb{E}_\rho(f)=\sum_{k=1}^n p_k f(k)=\operatorname{Tr}(\rho F)
$$
Note, by our definition of the operator $F,P_A,\rho$ (all diagonal operators) commute among themselves, which does not hold in general, in non-commutative (_quantum_) theory.
## Section 4: Why we need generalized probability theory to study quantum systems
Story of light polarization and violation of Bell's inequality.
### Classical picture of light polarization and Bell's inequality
> An interesting story will be presented here.
## Section 5: The non-commutative (_quantum_) probability theory
Let $\mathscr{H}$ be a Hilbert space. $\mathscr{H}$ consists of complex-valued functions on a finite set $\Omega=\{1,2,\cdots,n\}$. and that the functions $(e_1,e_2,\cdots,e_n)$ form an orthonormal basis of $\mathscr{H}$. We use Dirac notation $|k\rangle$ to denote the basis vector $e_k$.
In classical settings, multiplication operators is now be the full space of bounded linear operators on $\mathscr{H}$. (Denoted by $\mathscr{B}(\mathscr{H})$)
Let $A,B\in\mathscr{F}$ be the set of all events in the classical probability settings. $X$ denotes the set of all possible outcomes.
> A orthogonal projection on a Hilbert space is a projection operator satisfying $P^*=P$ and $P^2=P$. We denote the set of all orthogonal projections on $\mathscr{H}$ by $\mathscr{P}$.
>
> This can be found in linear algebra. [Orthogonal projection](https://notenextra.trance-0.com/Math429/Math429_L28#definition-655)
Let $P,Q\in\mathscr{P}$ be the event in probability space. $R(\cdot)$ is the range of the operator. $P^\perp$ is the orthogonal complement of $P$.
| Classical | Classical interpretation | Non-commutative (_Quantum_) | Non-commutative (_Quantum_) interpretation |
| --------- | ------- | -------- | -------- |
| $A\subset B$| Event $A$ is a subset of event $B$ | $P\leq Q$| $R(P)\subseteq R(Q)$ Range of event $P$ is a subset of range of event $Q$ |
| $A\cap B$| Both event $A$ and $B$ happened | $P\land Q$| projection to $R(P)\cap R(Q)$ Range of event $P$ and event $Q$ happened |
| $A\cup B$| Any of the event $A$ or $B$ happened | $P\lor Q$| projection to $R(P)\cup R(Q)$ Range of event $P$ or event $Q$ happened |
| $X\subset A$ or $A^c$| Event $A$ did not happen | $P^\perp$| projection$R(P)^\perp$ Range of event $P$ is the orthogonal complement of range of event $P$ |
In such setting, some rules of classical probability theory are not valid in quantum probability theory.
In classical probability theory, $A\cap(B\cup C)=(A\cap B)\cup(A\cap C)$.
In quantum probability theory, $P\land(Q\lor R)\neq(P\land Q)\lor(P\land R)$ in general.
### Definitions of non-commutative (_quantum_) probability theory under generalized probability theory
#### Definition of states (non-commutative (_quantum_) probability theory)
A state on $(\mathscr{H},\mathscr{P})$ is a map $\mu:\mathscr{P}\to[0,1]$ such that:
1. $\mu(O)=0$, where $O$ is the zero projection.
2. If $P_1,P_2,\cdots,P_n$ are pairwise disjoint orthogonal projections, then $\mu(P_1\lor P_2\lor\cdots\lor P_n)=\sum_{i=1}^n\mu(P_i)$.
Where projections are disjoint if $P_iP_j=P_jP_i=O$.
#### Definition of density operator (non-commutative (_quantum_) probability theory)
A density operator $\rho$ on the finite-dimensional Hilbert space $\mathscr{H}$ is:
1. self-adjoint ($A^*=A$, that is $\langle Ax,y\rangle=\langle x,Ay\rangle$ for all $x,y\in\mathscr{H}$)
2. positive semi-definite (all eigenvalues are non-negative)
3. $\operatorname{Tr}(\rho)=1$.
If $(|\psi_1\rangle,|\psi_2\rangle,\cdots,|\psi_n\rangle)$ is an orthonormal basis of $\mathscr{H}$ consisting of eigenvectors of $\rho$, for the eigenvalue $p_1,p_2,\cdots,p_n$, then $p_j\geq 0$ and $\sum_{j=1}^n p_j=1$.
We can write $\rho$ as
$$
\rho=\sum_{j=1}^n p_j|\psi_j\rangle\langle\psi_j|
$$
(under basis $|\psi_j\rangle$, it is a diagonal matrix with $p_j$ on the diagonal)
#### Theorem: Born's rule
Let $\rho$ be a density operator on $\mathscr{H}$. then
$$
\mu(P)\coloneqq\operatorname{Tr}(\rho P)=\sum_{j=1}^n p_j\langle\psi_j|P|\psi_j\rangle
$$
Defines a probability measure on the space $\mathscr{P}$.
[Proof ignored here]
#### Theorem: Gleason's theorem
Let $\mathscr{H}$ be a Hilbert space over $\mathbb{C}$ or $\mathbb{R}$ of dimension $n\geq 3$. Let $\mu$ be a state on the space $\mathscr{P}$ of projections on $\mathscr{H}$. Then there exists a unique density operator $\rho$ such that
$$
\mu(P)=\operatorname{Tr}(\rho P)
$$
for all $P\in\mathscr{P}$. $\mathscr{P}$ is the space of all orthogonal projections on $\mathscr{H}$.
[Proof ignored here]
#### Definition of random variable _or Observables_ (non-commutative (_quantum_) probability theory)
_It is the physical measurement of a system that we are interested in. (kinetic energy, position, momentum, etc.)_
$\mathscr{B}(\mathbb{R})$ is the set of all Borel sets on $\mathbb{R}$.
An random variable on the Hilbert space $\mathscr{H}$ is a projection valued map $P:\mathscr{B}(\mathbb{R})\to\mathscr{P}$.
With the following properties:
1. $P(\emptyset)=O$ (the zero projection)
2. $P(\mathbb{R})=I$ (the identity projection)
3. For any sequence $A_1,A_2,\cdots,A_n\in \mathscr{B}(\mathbb{R})$. the following holds:
(a) $P(\bigcup_{i=1}^n A_i)=\bigvee_{i=1}^n P(A_i)$
(b) $P(\bigcap_{i=1}^n A_i)=\bigwedge_{i=1}^n P(A_i)$
(c) $P(A^c)=I-P(A)$
(d) If $A_j$ are mutually disjoint (that is $P(A_i)P(A_j)=P(A_j)P(A_i)=O$ for $i\neq j$), then $P(\bigcup_{j=1}^n A_j)=\sum_{j=1}^n P(A_j)$
#### Definition of probability of a random variable
For a system prepared in state $\rho$, the probability of the random variable by the projection-valued measure $P$ is in the Borel set $A$ is $\operatorname{Tr}(\rho P(A))$.
### Expectation of an random variable and projective measurement
Notice that if we set $\lambda$ is _observed_ with probability $p_\lambda=\operatorname{Tr}(\rho P_\lambda)$. $\rho'\coloneqq\sum_{\lambda\in sp(T)}P_\lambda \rho P_\lambda$ is a density operator.
#### Definition of expectation of operators
Let $T$ be a self-adjoint operator on $\mathscr{H}$. The expectation of $T$ relative to the density operator $\rho$ is given by
$$
\mathbb{E}_\rho(T)=\operatorname{Tr}(\rho T)=\sum_{\lambda\in sp(T)}\lambda \operatorname{Tr}(\rho P(\lambda))
$$
if we set $T=\sum_{\lambda\in sp(T)}\lambda P_\lambda$, then $\mathbb{E}_\rho(T)=\sum_{\lambda\in sp(T)}\lambda \operatorname{Tr}(\rho P(\lambda))$.
### The uncertainty principle
Let $A,B$ be two self-adjoint operators on $\mathscr{H}$. Then we define the following two self-adjoint operators:
$$
i[A,B]\coloneqq i(AB-BA)
$$
$$
A\circ B\coloneqq \frac{AB+BA}{2}
$$
Note that $A\circ B$ satisfies Jordan's identity.
$$
(A\circ B)\circ (A\circ A)=A\circ (B\circ (A\circ A))
$$
#### Definition of variance
Given a state $\rho$, the variance of $A$ is given by
$$
\operatorname{Var}_\rho(A)\coloneqq\mathbb{E}_\rho(A^2)-\mathbb{E}_\rho(A)^2=\operatorname{Tr}(\rho A^2)-\operatorname{Tr}(\rho A)^2
$$
#### Definition of covariance
Given a state $\rho$, the covariance of $A$ and $B$ is given by the Jordan product of $A$ and $B$.
$$
\operatorname{Cov}_\rho(A,B)\coloneqq\mathbb{E}_\rho(A\circ B)-\mathbb{E}_\rho(A)\mathbb{E}_\rho(B)=\operatorname{Tr}(\rho A\circ B)-\operatorname{Tr}(\rho A)\operatorname{Tr}(\rho B)
$$
#### Robertson-Schrödinger uncertainty relation in finite dimensional Hilbert space
Let $\rho$ be a state on $\mathscr{H}$, $A,B$ be two self-adjoint operators on $\mathscr{H}$. Then
$$
\operatorname{Var}_\rho(A)\operatorname{Var}_\rho(B)\geq\operatorname{Cov}_\rho(A,B)^2+\frac{1}{4}|\mathbb{E}_\rho([A,B])|^2
$$
If $\rho$ is a pure state ($\rho=|\psi\rangle\langle\psi|$ for some unit vector $|\psi\rangle\in\mathscr{H}$), and the equality holds, then $A$ and $B$ are collinear (i.e. $A=c B$ for some constant $c\in\mathbb{R}$).
When $A$ and $B$ commute, the classical inequality is recovered (Cauchy-Schwarz inequality).
$$
\operatorname{Var}_\rho(A)\operatorname{Var}_\rho(B)\geq\operatorname{Cov}_\rho(A,B)^2
$$
[Proof ignored here]
### The uncertainty relation for unbounded symmetric operators
#### Definition of symmetric operator
An operator $A$ is symmetric if for all $\phi,\psi\in\mathscr{H}$, we have
$$
\langle A\phi,\psi\rangle=\langle\phi,A\psi\rangle
$$
An example of symmetric operator is $T(\psi)=i\hbar\frac{d\psi}{dx}$. If we let $\mathscr{H}=\mathscr{D}(T)$, $\hbar$ is the Planck constant.
$\mathscr{D}(T)$ be the space of all square integrable, differentiable, and it's derivative is also square integrable functions on $\mathbb{R}$.
#### Definition of joint domain of two operators
Let $(A,\mathscr{D}(A)),(B,\mathscr{D}(B))$ be two symmetric operators on their corresponding domains. The domain of $AB$ is defined as
$$
\mathscr{D}(AB)\coloneqq\{\psi\in\mathscr{D}(B):B\psi\in\mathscr{D}(A)\}
$$
Since $(AB)\psi=A(B\psi)$, the variance of an operator $A$ relative to a pure state $\rho=|\psi\rangle\langle\psi|$ is given by
$$
\operatorname{Var}_\rho(A)=\operatorname{Tr}(\rho A^2)-\operatorname{Tr}(\rho A)^2=\langle\psi,A^2\psi\rangle-\langle\psi,A\psi\rangle^2
$$
If $A$ is symmetric, then $\operatorname{Var}_\rho(A)=\langle A\psi,A\psi\rangle-\langle \psi, A\psi\rangle^2$.
#### Robertson-Schrödinger uncertainty relation for unbounded symmetric operators
Let $(A,\mathscr{D}(A)),(B,\mathscr{D}(B))$ be two symmetric operators on their corresponding domains. Then
$$
\operatorname{Var}_\rho(A)\operatorname{Var}_\rho(B)\geq\operatorname{Cov}_\rho(A,B)^2+\frac{1}{4}|\mathbb{E}_\rho([A,B])|^2
$$
If $\rho$ is a pure state ($\rho=|\psi\rangle\langle\psi|$ for some unit vector $|\psi\rangle\in\mathscr{H}$), and the equality holds, then $A\psi$ and $B\psi$ are collinear (i.e. $A\psi=c B\psi$ for some constant $c\in\mathbb{R}$).
[Proof ignored here]
### Summary of analog of classical probability theory and non-commutative (_quantum_) probability theory
| Classical probability | Non-commutative (_Quantum_) probability |
| --------- | ------- |
| Sample space $\Omega$, cardinality $\vert\Omega\vert=n$, example: $\Omega=\{0,1\}$ | Complex Hilbert space $\mathscr{H}$, dimension $\dim\mathscr{H}=n$, example: $\mathscr{H}=\mathbb{C}^2$ |
|Common algebra of $\mathbb{C}$ valued functions| Algebra of bounded operators $\mathscr{B}(\mathscr{H})$|
|$f\mapsto \bar{f}$ complex conjugation| $P\mapsto P^*$ adjoint|
|Events: indicator functions of sets| Projections: space of orthogonal projections $\mathscr{P}\subseteq\mathscr{B}(\mathscr{H})$|
|functions $f$ such that $f^2=f=\overline{f}$| orthogonal projections $P$ such that $P^*=P=P^2$|
|$\mathbb{R}$-valued functions $f=\overline{f}$| self-adjoint operators $A=A^*$|
| $\mathbb{I}_{f^{-1}(\{\lambda\})}$ is the indicator function of the set $f^{-1}(\{\lambda\})$| $P(\lambda)$ is the orthogonal projection to eigenspace|
|$f=\sum_{\lambda\in \operatorname{Range}(f)}\lambda \mathbb{I}_{f^{-1}(\{\lambda\})}$|$A=\sum_{\lambda\in \operatorname{sp}(A)}\lambda P(\lambda)$|
|Probability measure $\mu$ on $\Omega$| Density operator $\rho$ on $\mathscr{H}$|
|Delta measure $\delta_\omega$| Pure state $\rho=\vert\psi\rangle\langle\psi\vert$|
|$\mu$ is non-negative measure and $\sum_{i=1}^n\mu(\{i\})=1$| $\rho$ is positive semi-definite and $\operatorname{Tr}(\rho)=1$|
|Expected value of random variable $f$ is $\mathbb{E}_{\mu}(f)=\sum_{i=1}^n f(i)\mu(\{i\})$| Expected value of operator $A$ is $\mathbb{E}_\rho(A)=\operatorname{Tr}(\rho A)$|
|Variance of random variable $f$ is $\operatorname{Var}_\mu(f)=\sum_{i=1}^n (f(i)-\mathbb{E}_\mu(f))^2\mu(\{i\})$| Variance of operator $A$ is $\operatorname{Var}_\rho(A)=\operatorname{Tr}(\rho A^2)-\operatorname{Tr}(\rho A)^2$|
|Covariance of random variables $f$ and $g$ is $\operatorname{Cov}_\mu(f,g)=\sum_{i=1}^n (f(i)-\mathbb{E}_\mu(f))(g(i)-\mathbb{E}_\mu(g))\mu(\{i\})$| Covariance of operators $A$ and $B$ is $\operatorname{Cov}_\rho(A,B)=\operatorname{Tr}(\rho A\circ B)-\operatorname{Tr}(\rho A)\operatorname{Tr}(\rho B)$|
|Composite system is given by Cartesian product of the sample spaces $\Omega_1\times\Omega_2$| Composite system is given by tensor product of the Hilbert spaces $\mathscr{H}_1\otimes\mathscr{H}_2$|
|Product measure $\mu_1\times\mu_2$ on $\Omega_1\times\Omega_2$| Tensor product of space $\rho_1\otimes\rho_2$ on $\mathscr{H}_1\otimes\mathscr{H}_2$|
|Marginal distribution $\pi_*v$|Partial trace $\operatorname{Tr}_2(\rho)$|
### States of two dimensional system and the complex projective space (_Bloch sphere_)
Let $v=e^{i\theta}u$, then the space of pure states ($\rho=|u\rangle\langle u|$) is the complex projective space $\mathbb{C}P^1$.
$\alpha=x_i+iy_i,\beta=x_2+iy_2$ must satisfy $|\alpha|^2+|\beta|^2=1$, that is $x_1^2+x_2^2+y_1^2+y_2^2=1$.
The set of unit vectors in $\mathbb{C}^2$ is the unit sphere in $\mathbb{R}^3$.
So the space of pure states is the unit circle in $\mathbb{R}^2$.
#### Mapping between the space of pure states and the complex projective space
Any two dimensional pure state can be written as $e^{i\theta}u$, where $u$ is a unit vector in $\mathbb{R}^2$. There exists a bijective map $P:S^2\to\mathscr{P}_1\subseteq M_2(\mathbb{C})$ such that $P(u)=|u\rangle\langle u|$.
$$
P(\vec{x})=\frac{1}{2}(I+\vec{a}\cdot\vec{\sigma})=\frac{1}{2}\begin{pmatrix}
1&0\\
0&1
\end{pmatrix}+\frac{a_x}{2}\begin{pmatrix}
0&1\\
1&0
\end{pmatrix}+\frac{a_y}{2}\begin{pmatrix}
0&-i\\
i&0
\end{pmatrix}+\frac{a_z}{2}\begin{pmatrix}
1&0\\
0&-1
\end{pmatrix}