This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.
Brought to you by:
Paper

Stern–Gerlach experiment with higher spins

Published 22 March 2016 © 2016 IOP Publishing Ltd
, , Citation Bayram Tekin 2016 Eur. J. Phys. 37 035401 DOI 10.1088/0143-0807/37/3/035401

0143-0807/37/3/035401

Abstract

We analyze idealized sequential Stern–Gerlach (SG) experiments with higher spin particles. This analysis serves at least two purposes: the widely discussed spin-1/2 case leads to some misunderstandings since the probabilities are always evenly distributed for the sequential orthogonal magnets, which does not generalize to higher spins. A detailed discussion of the higher spin case, as is done here, is highly useful. Secondly, the Wigner rotation matrices for generic spins become conceptually more transparent with this physical example. We also give compact formulas for the probabilities in terms of the angle between the sequential SG apparatus for generic spins. We work out the spin-1/2, spin-1 cases explicitly. Since there are some confusing issues regarding the actual experiment, we also compile a 'facts and fiction' section on the SG experiments.

Export citation and abstract BibTeX RIS

1. Introduction

We begin with the following concrete problem. Given two idealized sequential Stern–Gerlach (SG) magnets that point in two fixed directions as shown in figure 1, what are the probabilities of possible outcomes from the second magnet for spin-s particles in terms of s and the angle between the magnets?

Figure 1.

Figure 1. Schematic representation of two sequential Stern–Gerlach (SG) magnets. An unpolarized beam of neutral spin-s particles with magnetic dipole moments, coming from the oven located at the far left, enters the first magnet whose magnetic field is pointing in the ${\hat{n}}_{1}$ direction; they are split into $2s+1$ states and only one of these states is allowed to enter the second magnet, the rest are blocked. The second magnet is pointing in the ${\hat{n}}_{2}$ direction. On the far right a detector detects the split atoms. The question is to write the probabilities of all possible outcomes in terms of the angle between the magnets, or, in short, in terms of ${\hat{n}}_{1}\cdot {\hat{n}}_{2}$.

Standard image High-resolution image

In the wording of the problem, the reservation idealized is quite important: as we shall discuss at the end, the actual SG experiment is beautiful but a very complicated one and the final probabilities cannot simply be explained exactly in terms of classical quantities. With this reservation in mind, let us note one of the probabilities here. Suppose we block all but the highest spin (measured in the ${\hat{n}}_{1}$ direction) after the first magnet. Let us call this state the $| {\hat{n}}_{1};s\rangle $ state. Then this state enters the second magnet and the probability of getting the $| {\hat{n}}_{2};s\rangle $ state out of $| {\hat{n}}_{1};s\rangle $ state is

Equation (1)

A priori, one might have guessed the angular dependence but the dependence on the spin cannot be easily guessed. As we shall see below, in general, when one looks for other probabilities of the form ${P}_{{m}_{1}{\hslash }\to {m}_{2}{\hslash }}$, the problem becomes more involved. Before we start the computations properly, let us say a few words on the history and the importance of the experiment, as well as the people involved and also note what is left somewhat unclear by the canonical case of spin-1/2.

1.1. A pinch of history

The SG experiment [1] is a very useful tool in teaching many aspects of quantum mechanics from the notion of spin of particles (or atoms) to superposition states, the problems of weak and strong measurements, decoherence, entanglement and density matrix formalism, etc. Bernstein goes so far as to say '... all of quantum mechanics is summarized in the SternGerlach experiments—at least all of quantum mechanics that is really mysterious' [2]. The idea of the experiment is simple: in an inhomogeneous magnetic field, with a sufficient gradient, a mixed collection of neutral atoms is expected to split into various groups due to their possible magnetic dipole moment, which is quantized in quantum mechanics. The actual execution of the experiment and understanding the outcomes in full detail with quantum mechanics is not a straightforward task. After one year of experimentation, Stern and Gerlach published their results in 1922. Even though what they really measured in the experiment was the magnetic dipole moment of the electron up to 10% accuracy, since the notion of electron spin was not known in those days, the correct interpretation of the experiment came only five years later [3] in a highly interesting paper that established the isotropy of some atoms, namely the existence of s-orbital electrons. These s-orbitals were thought to be non-existent, or unstable, in the Bohr–Sommerfeld theory. As opposed to the actual experiment, the idealized versions of the experiment serve as a fascinating teaching tool with many sequential SG apparatus utilized to expound upon the idea of superposition states etc. Many modern books discuss the experiment in varying detail; some even start teaching quantum mechanics with the experiment. As an example for undergraduate students, McIntyre's book [4], and for graduate students Sakurai and Napolitano [5], can be mentioned.

Many of the books that include the experiment only discuss the case of the silver atom; being a spin-1/2 system, it is the simplest and perhaps most elegant case to discuss, and was used in the original SG experiment that started it all. (Feynman's Lectures on Physics was an exception: Feynman devotes a chapter to the spin-1 case [6].) The spin-1/2 case is elegant in the sense that it shows the quantization of spin (or more properly the magnetic dipole moment) in the best way possible. But this simple case leaves many issues unclear if one is interested in issues beyond spin quantization, such as superposition states, etc, as mentioned above. This is because for the sequential SG experiments where two or more magnets are used to describe various overlaps of states (as shown in figure 1), probabilities are evenly split for the spin-1/2 case when the magnets are perpendicular to each other, while this is not the case for higher spins. Let us explain what we mean with an example: consider the spin up ($+\frac{{\hslash }}{2}$) silver atoms, described by the state $| {S}_{z};+\rangle $, coming out of the SG magnets placed along the $\hat{z}$-axis as shown in figure 2. Take these states and block the spin-down ones. Then, let these spin-up states pass through a second set of magnets in the $\hat{x}$-direction, then one expects to find half the atoms to be in the state $| {S}_{x};+\rangle $ and half in the state $| {S}_{x};-\rangle $. (Of course due to symmetry between the $\hat{x}$- and $\hat{y}$-axes, the probabilities would again be evenly split if one had the second set of magnets lie along the $\hat{y}$-axis.) Most students do not have much trouble in understanding this.

Figure 2.

Figure 2. A highly simplified picture of two SG magnets that deflect spin-1/2 particles with magnetic dipole moments. The probabilities are evenly distributed for this setting. This is the canonically discussed case to reveal the notion of superposition states.

Standard image High-resolution image

But for the (massive) spin-1 case, when they see the following probabilities (some are shown in figure 3), most of the students feel uneasy:

Equation (2)

Figure 3.

Figure 3. The probability outcomes are not evenly distributed for this orthogonal setting of the magnets, unlike the case of spin-1/2. The problem becomes a bit more complicated when the magnets are not orthogonal to each other. Details of the computations of the probabilities are given in the text below for the general orientation case.

Standard image High-resolution image

In the next section we will see how these probabilities are computed, but before that let us note a couple of things about the SG experiment as a teaching tool. Both in my undergraduate and graduate quantum mechanics courses, I spend a fair amount of time discussing various facets of the experiment for the original silver atoms as well as gedanken higher spin systems. I have found that students feel much more comfortable with the discussion of rotations, Wigner matrices, etc, when these issues are discussed in the SG context, or at least when the higher spin SG experiment is given as an example where, for example Wigner matrices appear. I have also found that the history of the original SG experiment with serendipitous events and the fascinating lives of the people involved is so rich that it is one of the best 'baits' to attract students to think about quantum mechanics. Before the lectures on the SG experiment, I assign students to read the dramatic account of the experiment from [7]. It is not our intention to discuss the history of the experiment here, but let us say a few words. Stern and Gerlach's low salaries played an interesting role: they could afford only cheap cigars with a high sulfur content, which helped the split-silver atoms appear in the deposited plates. Born even gave a paid lecture on relativity to help the experiment and solicited funds from a businessman in the United States. Long after the experiment, the lives of Stern and Gerlach diverged in a dramatic way. Gerlach worked for Germany's war effort; in particular, he became the director of the nuclear energy programme (possibly intending to develop an atomic bomb) and got detained with nine other German scientists, the likes of W Heisenberg, O Hahn, M von Lue and C F Von Weizsacker, in a house in Cambridge for six months after they were captured in Operation Epsilon, whose scientific advisor was none other than S Goudsmit, who was one of the originators of the idea of spin. Stern on the other hand was awarded the Nobel prize in physics in 1943 with a citation that did not mention the SG experiment but cited Stern's measurement of the magnetic dipole moment of the proton. Let us now turn to the bulk of the paper where we study idealized SG experiments with higher spins.

2. Idealized SG experiments with all spins

With two sequential SG magnets, the most general probability one needs to compute is the probability of finding the state $| {\hat{n}}_{2};{m}_{2}\rangle $ in the state $| {\hat{n}}_{1};{m}_{1}\rangle $, where of course ${m}_{\mathrm{1,2}}$ are the components of the spin in the ${\hat{n}}_{\mathrm{1,2}}$ directions, taking values from the set $\{-s,-s+1,...,s-1,s\}$. So we need to find the result of

Equation (3)

in terms of the vectors ${\hat{n}}_{i}$ and s as well as ${m}_{\mathrm{1,2}}$. Before we present the general spin case, let us adopt a very naive approach and carry out the computation for several lower spin cases by brute force, namely by finding the states $| \hat{n};m\rangle $, say in vector representation, and directly computing the probability.

2.1. Spin-1/2 case

To compute the probability outcomes from the theory what do we need to know? The answer is: we need to know the angular momentum, in this case, the spin algebra. So, for the rest of the work this is assumed to be the case. Since this is discussed in almost all textbooks, we shall not dwell on this much. It is also sort of self-evident: classical three-dimensional spatial rotations as applied to a quantum system lead to the commutation relations

Equation (4)

which are satisfied by $2s+1$-dimensional matrices with $s=0,1/2,1,3/2,\ldots $ and the corresponding states are labeled by the eigenvalues of the quadratic Casimir operator ${\hat{S}}^{2}$ together with the eigenvalue of our favorite spin component ${\hat{S}}_{z}$. Then without further ado, let us note that the eigenket of ${\hat{S}}_{n}=\hat{S}\cdot \hat{n}$ with eigenvalue ${\hslash }/2$ is easily found as

Equation (5)

where θ is the angle measured from the $\hat{z}$-axis in the clockwise direction and ϕ is the angle measured from the $\hat{x}$-axis in the counterclockwise direction. Then, given the vectors

Equation (6)

it is easy to show that one has the following probability

Equation (7)

Needless to say, one can choose one of the directions , say ${\hat{n}}_{1}$, to be in the $\hat{z}$ direction. Without a surprise, the 'other' probability also follows as (see figure 4)

Equation (8)

happily yielding ${P}_{\frac{{\hslash }}{2}\to \frac{{\hslash }}{2}}+{P}_{\frac{{\hslash }}{2}\to -\frac{{\hslash }}{2}}=1$. Let us note that we can define the angle between the two vectors as

Equation (9)

Figure 4.

Figure 4. Spin-1/2 particles enter two sequential SG magnets that are oriented in generic directions. Probabilities are written on the right. This example serves as a good starting point, but it misses a lot of fine details.

Standard image High-resolution image

This simple discussion was to set the stage, let us now consider the massive spin-1 case (a massless one would have only two helicity modes, which we do not discuss here).

2.2. Spin-1 case

For this case, ${\hat{S}}_{n}$ is a 3 × 3 Hermitian operator with the following eigenkets (up to a phase, as usual)

Equation (10)

The probabilities can be computed as

Equation (11)

The others simply follow from these due to symmetry. Observe that the probabilities for each given initial state ending in a sum of final states add up to 1, ${\sum }_{f}{P}_{i\to f}=1$, as needed. The probabilities noted in figure 3 follow from (11) as a simple example. Leaving the spin-3/2 and spin-2 cases as useful exercises, let us move on to the generic spin-s case and answer the question posed at the beginning of the paper.

2.3. Spin-s case

Our task now is to compute the following probability

Equation (12)

whose meaning is clear when referring to figure 1. Not to clutter the notation, and since the final result is independent of the choice of coordinates anyway, let us choose ${\hat{n}}_{1}=\hat{z}$, which then reduces our task to the following computation

Equation (13)

This is all rather obvious, the crucial step is the next one: we can obtain $| {\hat{n}}_{2};{m}_{2}\rangle $ from $| \hat{z};{m}_{2}\rangle $ by two rotations; first rotate about the $\hat{y}$-axis by an angle θ, then rotate about the $\hat{z}$-axis by an angle ϕ (both in the counter-clockwise direction) as

Equation (14)

which further reduces our computation to

Equation (15)

where the angle ϕ drops out and we have defined

Equation (16)

which are the elements of little-d (Wigner) rotation matrices. Since the computation of these objects is well covered in many books (for example, see [5]), I just quote the final result here

Equation (17)

where the upper case gamma stands for

Equation (18)

and the sum runs over all possible integer k values that do not yield negative arguments of the factorials. Since in this choice of coordinates θ is the angle between the two magnets, we can write the final result in a coordinate-independent way as

Equation (19)

This is the amplitude of measuring the spin component to be ${m}_{2}{\hslash }$ in the second magnet oriented along ${\hat{n}}_{2}$ if we allow only ${m}_{1}{\hslash }$ spins from the first magnet oriented along ${\hat{n}}_{2}$ as depicted in figure 1. From this amplitude one computes the desired probability, such as the expression (1). When ${m}_{1}={m}_{2}=s$, one has ${{\rm{\Gamma }}}_{m0}^{s}=1$ and one obtains the amplitude as

Equation (20)

Furthermore, as this amplitude is the most general one needed, one can use as many sequential SG magnets as one desires to compute the various probability outcomes. Let us now collect some facts and fiction regarding the SG experiments and conclude.

3. Facts and fiction in the SG experiments

  • 1.  
    Stern and Gerlach were trying to prove the quantization of the spin of the electron. No, they knew nothing about the spin of the electron. They were trying to (dis)prove 'space quantization', that is the quantization of the orbital angular momentum as suggested by the Bohr–Sommerfeld theory, which was then the theory of the atom in old quantum mechanics. In fact, when they observed the splitting, they celebrated Bohr with a now famous postcard that shows the photo of the splitting. Of course, from today's vantage point, they had the wrong theory in their hands, because the lowest non-trivial angular momentum is ${\ell }=1$ which would lead to three lines instead of the two that they observed. Why then, did they celebrate Bohr? Because, Bohr as well as others did not think that zero angular momentum was stable, so, they eliminated the ${m}_{{\ell }}=0$ component [2].
  • 2.  
    After the SG experiment, people must surely have realized that there should be another quantum number in the atom besides the three already known at the time? This is also not correct, as partially answered above. The experiment did not actually play much of a role in the introduction of the notion of the spin quantum number. What played role is the spectrum of the atoms in a magnetic field, more specifically, the duplexity of the lines due to the (anomalous) Zeeman effect. Pauli suggested that there must be a fourth quantum number without giving a related motion to the electron, Kronig (as a graduate student) noted that the duplexity could be due to the spinning motion of the electron, but he was scared off the idea by the 'elders', who earlier entertained the idea of a rotating electron about an axis passing through its center only to find that such a motion is inconsistent with special relativity, as some points on the electron must rotate with speeds two orders of magnitude faster than the speed of light. Uhlenbeck and Goudsmit (also graduate students) suggested the same notion of a spinning electron to explain the duplexity of the spectrum. They were also scared off, but their paper was already published by the time they wanted to retract. What were Pauli and Bohr thinking about the fourth quantum number? What kind of physical motion or phenomenon did they attach to it? This is actually an important point: they thought that the electron does not have an independent, intrinsic property, call it spin or some other number, the fourth quantum number arises, just like the other three (quantized energy, quantized angular momentum and quantized component of angular momentum), according to them, when the electron is bound to the atom. Who could blame them? Nobody has attached a physical intrinsic quantity to an electron besides its mass and charge at the time. Dirac's discovery of the relativistic equation for the electron and his prediction of the duplexity and the gyromagnetic factor supported Bohr and Pauli, because, if the notion of spin was intrinsic why should it appear in the relativistic theory which has something to the motion [8]? Hence comes the topic of the next item.
  • 3.  
    There is no way to measure the spin of an electron in a SG type experiment. No this is not correct, even though this has become somewhat folklore because of the obvious dominance of the Lorentz force over the force on the magnetic dipole moment of the electron. But it is a rather interesting issue, with a remarkable history [8]: Bohr always maintained that due to uncertainty there is no way to measure the spin of an isolated electron and Pauli agreed, and in fact wrote a review trying to argue against the published suggestions for experiments, such as the longitudinal SG experiments where electrons are basically expected to split in a race according their dipole moment in a magnetic field which is inhomogeneous along the direction of their motion. In principle this is doable as was suggested in [9] with the methods invented by the Nobel prize winner H Dehmelt. See also the related notion of 'continuous SG experiments' [11] that lead to a very accurate determination of the gyromagnetic factor of the electron.
  • 4.  
    The strength of the gradient of the magnetic field is not important. This is not correct at all. Unfortunately, this wrong view comes from utilizing the idealized version of the experiment as a teaching tool, as we did above. An honest computation of the predictions of probabilities must make use of the multi-component Schrödinger equation and its wave-packet solutions. Consider just the case of the silver atom (that is the spin-1/2 case). The atoms coming out of the oven, which is around 1000 Celsius, have a Maxwell–Boltzmann distribution with velocities centered around 1 km s−1. They form an unpolarized ensemble even after they are collimated. We must then, consider a mixed ensemble of particles, interacting with a SG magnet for about 10−4 seconds (as was the case in the original SG experimental set up) and getting detected at the screen. Of course, this is quantum mechanics and one does not really have trajectories. What happens is that the initial superposition states that form the wave packet evolve in time in such a way that spin-up and spin-down parts of the amplitude are peaked around different points in the $\hat{z}$ direction, due to the magnetic field–dipole interactions. Of course, depending on the gradient of the magnetic field, this splitting may not lead to a measurement. In fact, the notion of weak measurement is essentially realized by this idea of having magnetic dipoles interact with magnetic fields with weak gradients [10]. I suggest that the reader work out how the centers of the wave-packet will change in the lowest order approximation (namely, assuming that the magnetic field can be expanded in a Taylor series and the resulting spinor Schrödinger equation splits into two equations, where the problem reduces to the analogous problem of motion under constant 'gravity'). The reader will also notice that the lip-like structure in the actual experiment can be explained with the help of two things: for the consistency of the no monopole condition $\vec{{\rm{\nabla }}}\cdot \vec{B}=0$, the magnetic field must change in another direction, hence for some atoms, the measurement is actually not in the intended direction but in the perpendicular direction. Also, as noted there are no clean-cut trajectories, so smearing out is expected.
  • 5.  
    The silver atom was a random choice in the experiment. Yes and no. Stern was the first person to measure directly the speed of gas molecules in 1920 using silver vapor [12]. (It is amazing that it took such a long time to realize this mid 18th century idea.) Of course, heavier atoms go slower and so their speeds are easier to measure. So, Stern was already familiar with the silver atom. The fact that silver has 46 electrons forming a closed shell with zero angular momentum and the 47th electron in the $5\;s$ state with zero orbital angular momentum worked well for Stern and Gerlach. But, in retrospect, with their beautiful experiment they lent support to the wrong theory [13]

4. Conclusions

We have studied the Stern–Gerlach (GS) experiment for generic spin-s particles in the idealized setting to hopefully remove some of the possible misunderstandings introduced in the canonically-discussed spin-1/2 case. We have also given compact formulas for the probability outcomes which boil down to the elements of the Wigner rotation matrices. As explicit examples, we have worked out the details of spin-1/2 and spin-1 for generic orientations of two sequential SG magnets. Of course this discussion could be extended to any number of sequential magnets. We have compiled a facts and fiction section that we hope would serve to remove some unclear points in the idealized version of the SG experiments.

Acknowledgments

This work is partially supported by TÜBİTAK grant 113F155. I would like to thank Nur Ünal and Deniz Tekin for their kind help with the figures.

Please wait… references are loading.
10.1088/0143-0807/37/3/035401