PSet 3 Discussion
Conventions used below: for spins we use Pauli operators \(\sigma_{x,z}\) with eigenvalues \(\pm 1\); the physical spin components are \(S_{x,z}=(\hbar/2)\sigma_{x,z}\). For coupled light–matter states we write \(\ket{n,\pm}\) where \(n\) is the cavity photon number and \(\sigma_z\ket{\pm}=\pm\ket{\pm}\).
Problem 3-1: Generalizing EPR-style arguments to three-particle entangled states¶
The three-spin state is
where \(\ket{\pm}\) denotes an \(S_z\) eigenstate (equivalently, a \(\sigma_z\) eigenstate) for each particle \(A,B,C\) in that order.
We will repeatedly use the standard basis change
(a)¶
If all three observers measure along \(z\), the only possible outcomes are exactly the four computational-basis strings that appear in \(\ket{\psi}\): \(+++\), \(+--\), \(-+-\), and \(--+\). Each has probability \(1/4\) (since each basis ket has amplitude magnitude \(1/2\)).
To diagnose correlations cleanly, work with Pauli outcomes \(s_z^A,s_z^B,s_z^C\in\{\pm 1\}\).
First look at two-body correlations, e.g. between \(A\) and \(B\). The possible \((s_z^A,s_z^B,s_z^C)\) triples are
each occurring with probability \(1/4\). The corresponding products \(s_z^A s_z^B\) across these four outcomes are
so they average to zero:
By the same check (or by symmetry), all pairwise correlations vanish:
workout
However, the three-body product \(s_z^A s_z^B s_z^C\) equals \(+1\) for every allowed outcome above, hence
Equivalently, in terms of physical spins,
So: there are no two-body correlations in \(z\), but there is a perfect three-body correlation.
(b)¶
Here Alice and Bob measure along \(x\) while Carl measures along \(z\).
workout
A compact way to see the claimed parity constraint is to show that \(\ket{\psi}\) is an eigenstate of the product observable \(\sigma_x^A\sigma_x^B\sigma_z^C\). Acting on each \(z\)-basis product ket, \(\sigma_x\) flips \(\ket{+z}\leftrightarrow\ket{-z}\) and \(\sigma_z\) contributes a sign on the third qubit. One finds
Therefore, if \((\sigma_x^A,\sigma_x^B,\sigma_z^C)\) are measured in the same run with outcomes \(s_x^A,s_x^B,s_z^C\in\{\pm 1\}\), their product is deterministic:
Now interpret this in terms of the physical-spin outcomes \(\pm \hbar/2\). A “\(+1/2\) outcome” corresponds to the Pauli eigenvalue \(+1\). For three \(\pm 1\) numbers, having product \(-1\) is equivalent to having an even number of \(+1\)’s (since the number of \(-1\)’s must be odd). Thus, among the three outcomes \((S_x^A,S_x^B,S_z^C)\), there must be an even number of \(+\hbar/2\) results.
We can also expand \(\ket{\psi}\) into the \((xxz)\) basis, and can see that exactly four outcome triples occur (each with probability \(1/4\)):
which gives rise to
and these are precisely the ones satisfying \(s_x^A s_x^B s_z^C=-1\).
(c)¶
Here Alice measures along \(x\) while Bob and Carl measure along \(z\).
Projecting onto definite \((s_z^B,s_z^C)\) outcomes, note that for each of the four allowed \((B,C)\) strings in \(\ket{\psi}\), the corresponding \(A\) spin is actually a definite \(z\) eigenstate:
workout
Thus, conditioned on any \((s_z^B,s_z^C)\), Alice’s state is either \(\ket{+z}\) or \(\ket{-z}\). Measuring \(\sigma_x\) on either \(\ket{+z}\) or \(\ket{-z}\) gives a completely random outcome:
Since the four \((s_z^B,s_z^C)\) outcomes each occur with probability \(1/4\), the full joint distribution is uniform:
So the measurement outcomes in the \((xzz)\) setting are completely random: knowing two outcomes does not constrain the third.
(d)¶
workout
Under cyclic permutation of the particles \(A\to B\to C\to A\), the set of four \(z\)-basis kets appearing in \(\ket{\psi}\) maps into itself with the same coefficients, so the state is invariant under cyclic relabeling.
Therefore:
- The conclusion of part (c) (uniform/random statistics) holds for any measurement setting with exactly one \(x\) and two \(z\) measurements: \((xzz)\), \((zxz)\), and \((zzx)\).
- The conclusion of part (b) (the deterministic parity rule) holds for any measurement setting with exactly one \(z\) and two \(x\) measurements: \((xxz)\), \((xzx)\), and \((zxx)\).
(e)¶
Consider runs where Bob and Carl measure along \(z\). From part (a) we know the allowed outcomes satisfy
Thus, once Bob and Carl later compare notes, they can predict with certainty what Alice would have obtained had she measured along \(z\) in that same run:
workout
EPR-style reasoning then goes as follows: Alice’s particle is spacelike separated from Bob and Carl, so (assuming locality) Bob and Carl’s choices/outcomes cannot influence what would happen at Alice. But Bob and Carl can nonetheless predict Alice’s hypothetical \(S_z^A\) outcome with certainty. Therefore (in the EPR sense) \(S_z^A\) must have had a predetermined value in that run: it is an “element of reality”.
(f)¶
Now consider runs where Bob measures along \(z\) and Carl measures along \(x\). By part (d), the state also satisfies a deterministic constraint for the \((xzx)\) setting:
workout
Hence the outcomes obey
so once Bob and Carl compare their outcomes, they can predict with certainty what Alice would have obtained had she measured along \(x\):
Applying the same locality logic as in part (e), this implies (again in the EPR sense) that \(S_x^A\) also has a predetermined value for that run.
(g)¶
EPR reasoning from (e) and (f) suggests that in each run particle \(A\) has definite predetermined values for both \(\sigma_z^A\) and \(\sigma_x^A\). By symmetry (part (d)), the same should hold for particles \(B\) and \(C\) as well. A local hidden-variable model would then assign, for each run, fixed numbers
independent of which measurements are actually chosen.
workout
However, the deterministic quantum predictions from parts (a) and (d) impose the following constraints on any such assignment:
-
From the \((zzz)\) setting in part (a),
\[ a_z b_z c_z=+1. \] -
From the “two \(x\), one \(z\)” settings in part (d),
\[ a_x b_x c_z=-1,\qquad a_x b_z c_x=-1,\qquad a_z b_x c_x=-1. \]
Now multiply the last three equations. The left-hand side becomes
because \(a_x^2=b_x^2=c_x^2=1\). But the right-hand side gives \((-1)^3=-1\). Thus any hidden-variable assignment would imply
which contradicts the required \(a_z b_z c_z=+1\) from the \((zzz)\) statistics. Therefore it is impossible to construct a deterministic local hidden-variable theory reproducing the correlations of \(\ket{\psi}\).
Problem 3-2: Superdense coding and security¶
In superdense coding, Alice and Bob initially share a maximally entangled Bell state, e.g.
Alice encodes a classical 2-bit message \(m\in\{00,01,10,11\}\) by applying one of four single-qubit unitaries \(U_m\) on her half of the pair, then sends her qubit to Bob. Bob then performs a Bell-basis measurement to decode \(m\) perfectly.
Suppose an eavesdropper Eve intercepts only Alice’s transmitted (single) qubit. We want to show that, regardless of what Eve measures, she learns nothing about \(m\).
workout
Let
be the joint two-qubit state after Alice’s encoding. Eve has access only to the reduced state on Alice’s subsystem:
Taking the partial trace and using cyclicity with respect to operators on \(A\),
But for a maximally entangled Bell state the reduced state is maximally mixed:
Therefore, for every message \(m\),
This already proves security. Indeed, for any measurement Eve might do, described as a POVM \(\{M_y\}\) on the intercepted qubit, the probability of outcome \(y\) conditioned on \(m\) is
which is independent of \(m\). Hence Eve’s measurement results have zero information about Alice’s message.
Problem 3-3: Perturbative treatment of a light–matter interaction¶
The Hamiltonian is
with \(\omega_{\mathrm{cav}}>\omega_{\mathrm{at}}\) and where \(\sigma_-=\ket{-}\!\bra{+}\) and \(\sigma_+=\ket{+}\!\bra{-}\). We treat the interaction term (proportional to \(g\)) as a perturbation.
(a)¶
At \(g=0\), the Hamiltonian is
workout
Its ground state is clearly the no-photon, spin-down state \(\ket{0,-}\), with energy
Moreover, \(\ket{0,-}\) remains an eigenstate for any value of \(g\), because the interaction term annihilates it:
imply
Thus \(\hat H\ket{0,-}=-(\omega_{\mathrm{at}}/2)\ket{0,-}\) for all \(g\).
(b)¶
The candidates for first excited eigenstates of \(\hat H_0\) are the two states in the one-excitation sector:
and
Since \(\omega_{\mathrm{cav}}>\omega_{\mathrm{at}}\), we have \(E_{1,-}^{(0)}-E_{0,+}^{(0)}=\omega_{\mathrm{cav}}-\omega_{\mathrm{at}}>0\), so the first excited state of \(\hat H_0\) is \(\ket{0,+}\).
The first-order energy shift is
workout
Both terms vanish: \(\sigma_-\ket{+}=\ket{-}\) but then \(\bra{+}\ket{-}=0\) for the diagonal matrix element, and \(\hat a\ket{0}=0\). Hence
(c)¶
The first-order correction to the eigenstate \(\ket{0,+}\) is
where \(V=g(\hat a^\dagger\sigma_-+\hat a\,\sigma_+)\). The only state connected to \(\ket{0,+}\) by \(V\) is \(\ket{1,-}\), because
workout
The matrix element is
The energy denominator is
Therefore
So, to first order in \(g\),
(d)¶
The second-order energy shift is
Again only \(\ket{1,-}\) contributes, yielding
(e)¶
Define the total excitation number operator
Here \(\hat a^\dagger\hat a\) counts cavity photons, while \(\sigma_+\sigma_-=\ket{+}\!\bra{+}\) is the projector onto the atomic excited state (so it counts atomic excitations). Using \(\sigma_z\ket{\pm}=\pm\ket{\pm}\), we can rewrite
so equivalently
It is helpful to see explicitly that \(\hat N\) really does “count total excitations” on the product basis \(\ket{n,\pm}\):
Now we show \([\hat N,\hat H]=0\). First recall the basic bosonic commutators \([\hat a,\hat a^\dagger]=1\) and the number-operator identities
For the atom, use the standard spin algebra
Since \(\sigma_+\sigma_-=(1+\sigma_z)/2\), this implies
We will also use two facts that simplify the bookkeeping:
- Cavity operators commute with atomic operators, e.g. \([\hat a,\sigma_\pm]=0\) and \([\hat a^\dagger\hat a,\sigma_z]=0\).
- The Leibniz rule \([A,BC]=[A,B]C+B[A,C]\).
The Hamiltonian is
The commutator with the “free” part is immediate:
because \([\hat a^\dagger\hat a,\hat a^\dagger\hat a]=0\), \([\sigma_+\sigma_-,\sigma_z]=0\), and cavity/atomic operators commute.
The only nontrivial step is the interaction term. Consider the first piece \(\hat a^\dagger\sigma_-\). Using the Leibniz rule and the commuting-subsystem facts,
Similarly, for \(\hat a\,\sigma_+\),
Putting this together, we have
This means \(\hat H\) conserves the total excitation number \(N\) and therefore block-diagonalizes into fixed-\(N\) subspaces.
- \(N=0\) subspace: the only state is \(\ket{0,-}\), so it is an exact eigenstate (the ground state) for all \(g\).
- \(N=1\) subspace: a convenient basis is \(\{\ket{0,+},\ket{1,-}\}\).
We now build the Hamiltonian matrix in the \(N=1\) subspace explicitly. The diagonal terms come from the free Hamiltonian:
For the off-diagonal coupling, note
while the other combinations annihilate these basis states. Hence in the ordered basis \(\{\ket{0,+},\ket{1,-}\}\),
workout
Let \(\Delta\equiv\omega_{\mathrm{cav}}-\omega_{\mathrm{at}}>0\). The eigenvalues of this \(2\times 2\) matrix are
At \(g\to 0\), \(E_-\to \omega_{\mathrm{at}}/2\) and \(E_+\to \omega_{\mathrm{cav}}-\omega_{\mathrm{at}}/2\). Therefore the exact first excited energy for any \(g\) is
It is also nice to write the corresponding exact eigenstate. Solve
which gives the ratio
Equivalently, one can introduce a mixing angle \(\theta\) defined by
so that (up to an overall phase)
Finally, to confirm agreement with perturbation theory to order \(g^2\), expand \(E_-\) for small \(g\):
so
matching parts (b) and (d).
Problem 3-4: Perturbing a harmonic oscillator¶
We take \(\hbar=1\) as in the problem statement. The Hamiltonian is
where
Let \(H_0=\frac{\hat p^2}{2m}+\frac{1}{2}m\omega^2\hat x^2\) so that \(E_n^{(0)}=(n+\tfrac{1}{2})\omega\), and take the perturbation
Using \(\hat x=x_{\mathrm{zpt}}(\hat a+\hat a^\dagger)\), we can write
(a)¶
The first-order shift is
A convenient normal-ordered expansion is
where \(\hat N=\hat a^\dagger\hat a\). Acting on \(\ket{n}\), every term except the last changes the number state and therefore has zero diagonal matrix element. Hence
workout
and
(b)¶
The second-order shift for a non-degenerate level is
Because \((\hat a+\hat a^\dagger)^4\) only changes the number \(n\) by \(\Delta n=\pm 2,\pm 4\), only \(k=n\pm 2\) and \(k=n\pm 4\) contribute.
From the expansion above, the needed matrix elements of \((\hat a+\hat a^\dagger)^4\) are
Also,
Using \(\bra{k}V\ket{n}=\lambda\,\omega\,\bra{k}(\hat a+\hat a^\dagger)^4\ket{n}\), we can write the second-order shift as a sum over the four connected levels:
Squaring the matrix elements listed above gives
(For small \(n\), the “downward” terms automatically vanish because of the factors \(n(n-1)\) and \(n(n-1)(n-2)(n-3)\).)
Plugging in, we get
Now expand each polynomial factor:
workout
Substituting these back in and collecting like powers of \(n\), the \(n^4\) terms cancel and we obtain
(c)¶
workout
Yes, \(E_n^{(2)}\) is negative for all \(n\ge 0\) because the polynomial \(34n^3+51n^2+59n+21\) is strictly positive.
It is also useful to understand why the sign comes out negative, since in general second-order shifts can have either sign.
Start from the second-order formula
For the harmonic oscillator, \(E_k^{(0)}=(k+\tfrac{1}{2})\omega\), so:
- If \(k>n\), then \(E_n^{(0)}-E_k^{(0)}<0\), so these terms contribute negatively.
- If \(k<n\), then \(E_n^{(0)}-E_k^{(0)}>0\), so these terms contribute positively.
In our case, \(V=\lambda\omega(\hat a+\hat a^\dagger)^4\) only connects \(n\) to \(n\pm 2\) and \(n\pm 4\), so we can write schematically
where \(M_{n\to k}\equiv \bra{k}(\hat a+\hat a^\dagger)^4\ket{n}\).
The reason the negative terms win is that “upward” matrix elements (those involving more raising operators) carry slightly larger ladder factors than the corresponding “downward” ones. Concretely, compare the \(n\to n\pm 2\) pair:
Both are \(O(n^4)\) at large \(n\), but for any finite \(n\) the “upward” factor is larger because it contains \((n+1)(n+2)\) rather than \(n(n-1)\) and also \((4n+6)^2\) rather than \((4n-2)^2\). In the same spirit, for the \(n\to n\pm 4\) pair,
and again the upward factor is larger.
We can view these ladder factors as ratios of factorials. For example,
and similarly for the \(\pm 4\) terms. These ratios make it clear that, when comparing contributions with the same number of raising vs lowering operations, the raising (upward) transitions carry more “weight” in magnitude. Since the raising transitions are precisely the ones with negative denominators, the overall second-order sum becomes negative.
Finally, note that \(E_n^{(2)}<0\) does not mean the anharmonic potential lowers the energy overall: for \(\lambda>0\), the first-order shift \(E_n^{(1)}=3\lambda\omega(2n^2+2n+1)\) is positive and dominates at small \(\lambda\), while the negative \(E_n^{(2)}\) is a (smaller) correction arising from level mixing.
(d)¶
We want an order-of-magnitude estimate of the largest \(n\) for which perturbation theory is valid.
A common (and more robust than only looking at \(E_n^{(1)}\)) criterion is:
where \(\Delta E\) is the unperturbed level spacing and \(\Delta V\) is the uncertainty (RMS fluctuation) of the perturbation in the unperturbed state \(\ket{n}\):
For the harmonic oscillator, the spacing is constant:
Now compute \(\Delta V\). Since \(V=\lambda\omega(\hat a+\hat a^\dagger)^4\), we have
We already found
To evaluate \(\bra{n}(\hat a+\hat a^\dagger)^8\ket{n}\) efficiently, one can use the normal-ordered form of \((\hat a+\hat a^\dagger)^4\) from part (a) and square it. When taking \(\bra{n}(\cdot)\ket{n}\), only terms with an equal number of raising and lowering operators survive (because the final state must return to \(\ket{n}\)). Carrying this out gives
Therefore,
so
The validity criterion \(\Delta V\ll\Delta E=\omega\) becomes
workout
For large \(n\), the leading behavior is \(\Delta V\sim \lambda\omega\sqrt{34}\,n^2\), hence the breakdown scale is
This reproduces the same scaling \(n_{\max}\propto 1/\sqrt{\lambda}\) as simpler estimates, but with a more concrete \(O(1)\) prefactor coming from the RMS-fluctuation criterion.
For comparison, a simpler (cruder) criterion is to demand that the mean perturbation (i.e. the first-order shift) be small compared to the spacing. From part (a), at large \(n\)
Requiring \(|E_n^{(1)}|\ll \omega\) again gives \(n_{\max}\sim 1/\sqrt{\lambda}\); the RMS criterion above just refines this with a more systematic measure of “typical perturbation size”.