The Road to Quantum Mechanics: Derivations & Simulations

01

Classical Mechanics & Newton's Revolution

1. Coordinates and Galilean Reference Frames

To describe physical phenomena, we construct a coordinate system. A point particle's position is represented by a vector $\mathbf{r}(t) = (x(t), y(t), z(t))$ in Cartesian space relative to an origin. Galileo observed that there is no absolute resting reference frame; uniform linear motion is fundamentally relative. If an observer in reference frame $S'$ moves at a constant velocity $\mathbf{v}$ along the x-axis relative to a "stationary" frame $S$, the coordinates transform via the Galilean Transformations:

$x' = x - vt, \quad y' = y, \quad z' = z, \quad t' = t$

Differentiating these positions with respect to time yields the velocity addition rule: $u'_x = u_x - v$. Differentiating once more, because $\mathbf{v}$ is strictly constant, its derivative vanishes ($\frac{d\mathbf{v}}{dt} = 0$), showing that physical accelerations are identical in all Galilean inertial reference frames: $\mathbf{a}' = \mathbf{a}$. This mathematical equivalence is the cornerstone of the Galilean relativity principle: the laws of mechanics are identical for all observers in uniform motion.

2. Defining the Concept of Force: Galileo's Inertia to Newton's Second Law

In Aristotelian physics, objects were believed to naturally come to rest, requiring a continuous push to maintain a velocity ($\mathbf{F} \propto \mathbf{v}$). Galileo dismantled this through rigorous inclined-plane and double-ramp rolling tests: he observed that a polished bronze ball rolled down one ramp would rise to the exact same height on an opposite ramp. When he flattened the second ramp (reducing its angle to 0°), the ball rolled indefinitely, limited only by the roughness of the wood. This meant objects possess a physical property called **inertia**—they naturally preserve their state of motion.

Newton codified this into three laws, translating qualitative observations into rigorous algebraic dynamics:

Newton's First Law (Inertia): A body remains at rest or in uniform linear motion unless compelled to change its state by a net external force. This defines the existence of inertial reference frames.
Newton's Second Law (Dynamics): How do we formulate an equation to predict the exact acceleration caused by a force?
Let us perform an experiment with a rolling cart. If we pull the cart using a spring stretched to a constant length, it experiences a constant force $\mathbf{F}$, and we observe that it accelerates at a constant rate $\mathbf{a}$.
If we double the mass of the cart ($m \to 2m$) by stacking an identical block on it, we observe that the same spring pull produces exactly **half** the acceleration ($\mathbf{a} \to \mathbf{a}/2$). This means acceleration is inversely proportional to mass: $\mathbf{a} \propto 1/m$.
Now, if we double the force ($F \to 2\mathbf{F}$) by pulling with two identical springs side-by-side, we observe that the acceleration of the cart **doubles** ($\mathbf{a} \to 2\mathbf{a}$). This means acceleration is directly proportional to force: $\mathbf{a} \propto \mathbf{F}$.
Combining these experimental observations yields: $$ \mathbf{a} \propto \frac{\mathbf{F}}{m} \implies \mathbf{F} \propto m\mathbf{a} $$ By defining the unit of force (the "Newton") as the exact amount of force required to accelerate a $1\text{ kg}$ mass at $1\text{ m/s}^2$, the constant of proportionality becomes exactly $1$. Generalizing force as the rate of change of linear momentum $\mathbf{p} = m\mathbf{v}$, we write: $$ \mathbf{F} = \frac{d\mathbf{p}}{dt} = \frac{d(m\mathbf{v})}{dt} = m\mathbf{a} \quad (\text{for constant mass } m) $$
Newton's Third Law (Reciprocity): When body $A$ exerts a force $\mathbf{F}_{AB}$ on body $B$, body $B$ simultaneously exerts an equal and opposite force $\mathbf{F}_{BA}$ on body $A$: $$ \mathbf{F}_{AB} = -\mathbf{F}_{BA} $$ This is experimentally demonstrated by two interacting carts with a compressed spring between them; when released, the change in their momenta is equal in magnitude and opposite in direction ($\Delta \mathbf{p}_A = -\Delta \mathbf{p}_B$), indicating that the forces acting during the collision were symmetric.

3. Direct Derivations of Kinematics: Linking Galileo's $s \propto t^2$ to Calculus

Through his inclined plane experiments, Galileo observed a precise quantitative rule: the distance $s$ rolled by a ball starting from rest was strictly proportional to the square of the elapsed time ($s \propto t^2$). If we assume a constant force causing a constant physical acceleration $\mathbf{a}(t) = \mathbf{a}_0$, how do we mathematically formulate this observation?

Since acceleration is defined as the rate of change of velocity ($\mathbf{a}_0 = \frac{d\mathbf{v}}{dt}$), we separate variables and integrate both sides from initial time $t = 0$ to final time $t$: $$ \int_{\mathbf{v}_0}^{\mathbf{v}(t)} d\mathbf{v} = \int_0^t \mathbf{a}_0 dt' \implies \mathbf{v}(t) - \mathbf{v}_0 = \mathbf{a}_0 t \implies \mathbf{v}(t) = \mathbf{v}_0 + \mathbf{a}_0 t $$ This shows that velocity increases linearly with time. Now, since velocity is the rate of change of position ($\mathbf{v}(t) = \frac{d\mathbf{r}}{dt}$), we integrate once more: $$ \int_{\mathbf{r}_0}^{\mathbf{r}(t)} d\mathbf{r} = \int_0^t (\mathbf{v}_0 + \mathbf{a}_0 t') dt' \implies \mathbf{r}(t) - \mathbf{r}_0 = \left[ \mathbf{v}_0 t' + \frac{1}{2}\mathbf{a}_0 t'^2 \right]_0^t \implies \mathbf{r}(t) = \mathbf{r}_0 + \mathbf{v}_0 t + \frac{1}{2}\mathbf{a}_0 t^2 $$ This calculus derivation beautifully reproduces Galileo's experimental discovery: if an object starts from rest ($\mathbf{v}_0 = 0$) at the origin ($\mathbf{r}_0 = 0$), its position is $\mathbf{r}(t) = \frac{1}{2}\mathbf{a}_0 t^2$, mathematically proving that $s \propto t^2$.

To obtain the 1D speed-displacement relationship without explicit time variables, we write acceleration using the chain rule as $a = \frac{dv}{dt} = \frac{dv}{dx}\frac{dx}{dt} = v\frac{dv}{dx}$. Separating variables and integrating: $$ \int_{x_0}^x a \, dx' = \int_{v_0}^v v' \, dv' \implies a(x - x_0) = \left[ \frac{1}{2}v'^2 \right]_{v_0}^v \implies a\Delta x = \frac{1}{2}v^2 - \frac{1}{2}v_0^2 \implies v^2 = v_0^2 + 2a\Delta x $$

4. Energy and Momentum Conservation

We define the mechanical **Work** $W$ done by a force on a particle traveling along a path $C$ as the line integral: $$ W = \int_C \mathbf{F} \cdot d\mathbf{r} = \int_{t_0}^{t_1} \mathbf{F} \cdot \frac{d\mathbf{r}}{dt} dt = \int_{t_0}^{t_1} \mathbf{F} \cdot \mathbf{v} dt $$

Substituting Newton's second law $\mathbf{F} = m\frac{d\mathbf{v}}{dt}$: $$ W = \int_{t_0}^{t_1} m\frac{d\mathbf{v}}{dt} \cdot \mathbf{v} dt = \int_{\mathbf{v}_0}^{\mathbf{v}_1} m \mathbf{v} \cdot d\mathbf{v} = \left[ \frac{1}{2}m(\mathbf{v} \cdot \mathbf{v}) \right]_{\mathbf{v}_0}^{\mathbf{v}_1} = \frac{1}{2}mv_1^2 - \frac{1}{2}mv_0^2 = \Delta K $$ This is the **Work-Energy Theorem**: the work done by a net force equals the change in the particle's **Kinetic Energy** $K$.

If the force is conservative (path-independent), we can define a scalar **Potential Energy** function $U(\mathbf{r})$ such that $\mathbf{F} = -\nabla U$. Thus: $$ W = \int_{\mathbf{r}_0}^{\mathbf{r}_1} \mathbf{F} \cdot d\mathbf{r} = -\int_{\mathbf{r}_0}^{\mathbf{r}_1} \nabla U \cdot d\mathbf{r} = -(U_1 - U_0) = -\Delta U $$ Combining the two definitions: $\Delta K = -\Delta U \implies \Delta(K + U) = 0 \implies E = K + U = \text{Constant}$.

5. Gravitational Fields & Kepler's Laws: Deducing the Inverse-Square Law

Terrestrial gravity causes objects to fall with a constant acceleration of $g \approx 9.8\text{ m/s}^2$. In the celestial realm, Johannes Kepler synthesized decades of meticulous astronomical observations of planetary orbits, identifying a striking empirical harmony (Kepler's Third Law): the square of a planet's orbital period $T$ is directly proportional to the cube of its orbital radius ($T^2 \propto r^3$).

How did Isaac Newton formulate a single gravitational equation to explain both phenomena?
Let us model a planet of mass $m$ in a stable circular orbit of radius $r$ around the sun. The centripetal force holding the planet in its orbit is: $$ F_c = \frac{m v^2}{r} $$ Since the planet's orbital speed is the circumference divided by the period ($v = \frac{2\pi r}{T}$), its square is $v^2 = \frac{4\pi^2 r^2}{T^2}$.
Substituting Kepler's experimental observation that $T^2 = k r^3$ (where $k$ is a constant of nature): $$ v^2 = \frac{4\pi^2 r^2}{k r^3} = \left(\frac{4\pi^2}{k}\right) \frac{1}{r} $$ This indicates that planetary speed squared drops off inversely with distance. Plugging this speed back into our centripetal force equation: $$ F_c = \frac{m v^2}{r} \propto \frac{m \left(\frac{1}{r}\right)}{r} = \frac{m}{r^2} $$ To explain Kepler's astronomical observations, the gravitational force **must** decay with the inverse square of the distance: $F \propto 1/r^2$.
Furthermore, by Newton's Third Law (Reciprocity), gravity must be mutual: if the Sun pulls the planet, the planet pulls the Sun. The force must therefore be proportional to both the Sun's mass $M$ and the planet's mass $m$. Combining these experimental and logical constraints yields: $$ \mathbf{F} = -G \frac{M m}{r^2} \hat{\mathbf{r}} $$ where $G$ is the universal gravitational constant. Applying this central force model to Newton's Second Law in general elliptical coordinates, we mathematically derive Kepler's three empirical laws:

Kepler's First Law (Law of Ellipses): The orbit of a planet is an ellipse with the Sun at one of the two foci.
To derive this, we set up the equation of motion in plane polar coordinates. The radial component of acceleration is $a_r = \ddot{r} - r\dot{\theta}^2$. Under gravitational attraction: $$ m(\ddot{r} - r\dot{\theta}^2) = -G\frac{Mm}{r^2} \implies \ddot{r} - r\dot{\theta}^2 = -\frac{GM}{r^2} $$ Since the central force is radial, angular momentum $L = mr^2\dot{\theta}$ is conserved, which means $\dot{\theta} = \frac{L}{mr^2}$. Making the standard orbital substitution $u = 1/r$ and writing $r$ derivatives in terms of angle $\theta$: $$ \frac{dr}{dt} = \frac{d}{dt}\left(\frac{1}{u}\right) = -\frac{1}{u^2}\frac{du}{d\theta}\frac{d\theta}{dt} = -\frac{1}{u^2}\frac{du}{d\theta}\left(\frac{Lu^2}{m}\right) = -\frac{L}{m}\frac{du}{d\theta} $$ Differentiating again with respect to time: $$ \ddot{r} = -\frac{L}{m}\frac{d^2u}{d\theta^2}\dot{\theta} = -\frac{L^2u^2}{m^2}\frac{d^2u}{d\theta^2} $$ Substituting $\ddot{r}$ and $\dot{\theta}$ back into the radial equation of motion yields Binet's orbital equation: $$ -\frac{L^2u^2}{m^2}\frac{d^2u}{d\theta^2} - \frac{1}{u}\left(\frac{Lu^2}{m}\right)^2 = -GMu^2 \implies \frac{d^2u}{d\theta^2} + u = \frac{GMm^2}{L^2} $$ This is a second-order linear differential equation. Its general physical solution is: $$ u(\theta) = \frac{GMm^2}{L^2} \left[ 1 + e \cos(\theta - \theta_0) \right] $$ Inverting back to $r(\theta)$ and setting $\theta_0 = 0$: $$ r(\theta) = \frac{\alpha}{1 + e\cos\theta} \quad \text{where} \quad \alpha = \frac{L^2}{GMm^2} $$ This represents the polar coordinate equation of a conic section with eccentricity $e$. For bounded planetary systems where $0 \le e < 1$, this equation dictates an **ellipse**, proving Kepler's First Law.
Kepler's Second Law (Law of Equal Areas): A line segment joining a planet and the Sun sweeps out equal areas during equal intervals of time.
The differential area $dA$ swept out by the position vector $\mathbf{r}$ as it rotates through a small angle $d\theta$ is: $$ dA = \frac{1}{2} r (r d\theta) = \frac{1}{2} r^2 d\theta $$ Dividing by $dt$ to find the rate of area swept over time (areal velocity): $$ \frac{dA}{dt} = \frac{1}{2} r^2 \frac{d\theta}{dt} $$ Since angular momentum is $L = m r^2 \dot{\theta}$, we substitute $\dot{\theta} = \frac{L}{mr^2}$: $$ \frac{dA}{dt} = \frac{1}{2} r^2 \left( \frac{L}{mr^2} \right) = \frac{L}{2m} = \text{Constant} $$ Because angular momentum $L$ is conserved in a central field, $\frac{dA}{dt}$ is strictly constant, demonstrating that planets sweep out equal orbital areas in equal times.
Kepler's Third Law (Law of Harmonies): The square of the orbital period $T$ of a planet is directly proportional to the cube of the semi-major axis $a$ of its orbit.
Integrating the constant areal velocity $\frac{dA}{dt} = \frac{L}{2m}$ over one full orbital period $T$ must yield the total area of the ellipse $A = \pi a b$, where $b$ is the semi-minor axis: $$ A = \int_0^T \frac{dA}{dt} dt = \frac{L}{2m} T \implies \pi a b = \frac{L}{2m} T \implies T = \frac{2m\pi ab}{L} $$ Squaring both sides gives: $$ T^2 = \frac{4\pi^2 m^2 a^2 b^2}{L^2} $$ The geometry of an ellipse relates the axes by $b^2 = a^2(1 - e^2)$. Recall from the First Law derivation that the semi-latus rectum is $\alpha = \frac{L^2}{GMm^2} = a(1-e^2)$. Rearranging terms: $$ 1 - e^2 = \frac{L^2}{GMm^2 a} \implies b^2 = a^2 \left( \frac{L^2}{GMm^2 a} \right) = \frac{a L^2}{GMm^2} \implies \frac{m^2 b^2}{L^2} = \frac{a}{GM} $$ Substituting this relationship into our squared period expression: $$ T^2 = 4\pi^2 a^2 \left( \frac{m^2 b^2}{L^2} \right) = 4\pi^2 a^2 \left( \frac{a}{GM} \right) = \left( \frac{4\pi^2}{GM} \right) a^3 $$ Thus, we derive that the square of the orbital period is proportional to the cube of the semi-major axis ($T^2 \propto a^3$), completing the mechanical synthesis.

Historical Experiment: Galileo's Inclined Plane (1638)

Free fall was too fast for 17th-century water clocks to measure accurately. Galileo solved this by rolling bronze balls down polished, inclined wooden planes, effectively "diluting" gravity. He demonstrated that the distance traveled was proportional to the square of elapsed time ($x \propto t^2$), establishing acceleration as a constant and laying the foundation for Newton's laws.

Simulation 1: The Gravity and Energy Lab

Newtonian Gravity

Position ($s$): 0.00 m

Velocity ($v$): 0.00 m/s

Accel. ($a$): 0.00 m/s²

Friction ($F_f$): 0.00 N

Control Deck

Mass ($m$): 2.0 kg

Ramp Angle ($\theta$): 30°

Kinetic Friction ($\mu_k$): 0.15

Gravity ($g$): 9.8 m/s²

Real-time Energy Split

Potential ($U$): 100%

Kinetic ($K$): 0%

Thermal ($E_{th}$): 0%

02

Electromagnetism & Maxwell's Synthesis

1. Coulomb's Law and the Electrostatic Field: The Torsion Balance to Gauss's Law

How do we formulate an equation to describe the invisible force between stationary electric charges? Charles-Augustin de Coulomb resolved this in 1785 using a highly sensitive torsion balance (a silver thread suspending a horizontal insulating needle with small conducting spheres). By charging the spheres and bringing them close, the electrostatic force caused the thread to twist. The angle of twist was directly proportional to the force exerted.

Through these trials, Coulomb compiled two crucial experimental facts:
1. **Distance Dependence:** Doubling the distance between two charged spheres reduced the twisting force by a factor of 4. Tripling the distance reduced the force by a factor of 9. This established that the force obeys an inverse-square law: $F \propto 1/r^2$.
2. **Charge Dependence:** To vary the charge, Coulomb touched a charged metal sphere to an identical, uncharged, isolated metal sphere, exactly halving its charge ($q \to q/2$). He observed that halving either charge halved the force, and halving both charges reduced the force to exactly one-quarter. Thus, force is proportional to the product of the charges: $F \propto q_1 q_2$.
Combining these experimental deductions yields: $$ \mathbf{F} = \frac{1}{4\pi\epsilon_0} \frac{q_1 q_2}{r^2} \hat{\mathbf{r}} $$ where $\epsilon_0 \approx 8.854 \times 10^{-12}\text{ C}^2/\text{N}\cdot\text{m}^2$ is the vacuum permittivity. To resolve the philosophical paradox of "action-at-a-distance," we define the **Electric Field** $\mathbf{E}$ as the electrostatic force experienced per unit test charge: $$ \mathbf{E} = \lim_{q_0 \to 0} \frac{\mathbf{F}}{q_0} = \frac{1}{4\pi\epsilon_0} \frac{q}{r^2} \hat{\mathbf{r}} $$

Michael Faraday subsequently performed his famous **ice-pail experiment**, lowering a charged metal ball into an uncharged, hollow metal container. He observed that a charge of equal magnitude and opposite sign was induced on the inner walls, and an equal charge of the same sign was induced on the outer wall, completely independent of the ball's position inside the container.
This meant that the total electric field lines ("flux") emerging from a closed volume depend only on the net enclosed charge, not on its position. If we integrate the electric field $\mathbf{E}$ over a closed sphere of radius $r$ centered on a charge $q$, the total flux is: $$ \oint \mathbf{E} \cdot d\mathbf{A} = E (4\pi r^2) = \left( \frac{1}{4\pi\epsilon_0} \frac{q}{r^2} \right) (4\pi r^2) = \frac{q}{\epsilon_0} $$ Generalizing this to any arbitrary closed surface enclosing charge $Q_{\text{enc}}$ yields the integral form of **Gauss's Law**: $\oint \mathbf{E} \cdot d\mathbf{A} = \frac{Q_{\text{enc}}}{\epsilon_0}$. Applying the Divergence Theorem, we convert the surface integral to a volume integral: $\oint \mathbf{E} \cdot d\mathbf{A} = \int_V (\nabla \cdot \mathbf{E}) dV$. Since the enclosed charge is the integral of the charge density $\int_V \rho \, dV$, we write: $$ \int_V (\nabla \cdot \mathbf{E}) dV = \int_V \frac{\rho}{\epsilon_0} dV $$ Because this equivalence must hold for any arbitrary volume $V$, the integrands must be identical, yielding the local differential form of Gauss's Law: $$ \nabla \cdot \mathbf{E} = \frac{\rho}{\epsilon_0} $$

2. Magnetostatics and Faraday's Induction

Magnetic forces arise from moving charges. Hans Christian Oersted experimentally discovered that an electric current flowing through a wire deflected a nearby compass needle, indicating that moving electric charges generate magnetic fields. André-Marie Ampère formulated this mathematically as a loop integral: $\oint \mathbf{B} \cdot d\mathbf{l} = \mu_0 I_{\text{enc}}$. In differential vector terms, this is $\nabla \times \mathbf{B} = \mu_0 \mathbf{J}$.

However, if you take a bar magnet and break it in half, you never isolate a single magnetic pole (monopole). You always obtain two smaller, complete magnets, each with its own North and South poles. No matter how many times you break it down, you cannot isolate a magnetic charge.
This means that every magnetic field line that leaves a region must eventually loop back and re-enter, so the net magnetic flux through any closed surface is strictly zero: $\oint \mathbf{B} \cdot d\mathbf{A} = 0$. Applying the Divergence Theorem, we convert this to the differential **Gauss's Law for Magnetism**: $$ \nabla \cdot \mathbf{B} = 0 $$

Michael Faraday set out to reverse Oersted's discovery, demonstrating that magnetic fields could generate electrical current. He connected a coil of wire to a galvanometer (ammeter) and moved a permanent bar magnet in and out of the coil. He observed that:
1. Current only flowed *while the magnet was in motion* relative to the coil.
2. Moving the magnet faster produced a larger current (and thus a larger induced electromotive force $\mathcal{E}$).
3. Reversing the magnet's direction reversed the direction of the current.
This showed that a changing magnetic flux induces an electromotive force, which is the closed loop integral of the induced electric field: $\mathcal{E} = \oint \mathbf{E} \cdot d\mathbf{l}$. Heinrich Lenz observed that the induced current always opposes the change in magnetic flux (conservation of energy), adding a negative sign. Thus, we formulate: $$ \oint \mathbf{E} \cdot d\mathbf{l} = -\frac{d\Phi_B}{dt} = -\frac{\partial}{\partial t} \int_A \mathbf{B} \cdot d\mathbf{A} $$ Applying Stokes' Theorem to convert the line integral into a surface integral: $\oint \mathbf{E} \cdot d\mathbf{l} = \int_A (\nabla \times \mathbf{E}) \cdot d\mathbf{A}$. Equating the surface integrals: $$ \int_A (\nabla \times \mathbf{E}) \cdot d\mathbf{A} = \int_A -\frac{\partial \mathbf{B}}{\partial t} \cdot d\mathbf{A} $$ Since this must hold for any loop area $A$, the local differential form of **Faraday's Law** is: $$ \nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t} $$

3. Maxwell's Correction: The Displacement Current

In the mid-19th century, James Clerk Maxwell identified a severe mathematical inconsistency in the classical equations of Ampere ($\nabla \times \mathbf{B} = \mu_0 \mathbf{J}$). Taking the divergence of Ampere's curl equation: $$ \nabla \cdot (\nabla \times \mathbf{B}) = 0 \quad \text{always, by vector identity} $$ However, from charge conservation (the continuity equation), we know that $\nabla \cdot \mathbf{J} = -\frac{\partial \rho}{\partial t}$. Under time-varying conditions, charge density changes, so $\nabla \cdot \mathbf{J} \neq 0$, creating a glaring mathematical contradiction!

Maxwell resolved this by examining a charging capacitor. During charging, a physical current $I$ flows through the connecting wires, but *no* physical charges cross the empty gap between the capacitor plates. Under Ampere's classical law, a loop placed in the gap should enclose zero current and thus measure zero magnetic field. Yet, placing a compass in the gap reveals a magnetic field is indeed present!
This meant the changing electric field in the gap must behave exactly like a physical current. To formulate this, Maxwell calculated the changing electric flux between capacitor plates of area $A$: $E = \frac{Q}{\epsilon_0 A} \implies \Phi_E = E A = \frac{Q}{\epsilon_0}$. Differentiating with respect to time: $$ \frac{d\Phi_E}{dt} = \frac{1}{\epsilon_0} \frac{dQ}{dt} = \frac{I}{\epsilon_0} \implies I = \epsilon_0 \frac{d\Phi_E}{dt} $$ This rate of change of electric flux carries the dimensions of current. Maxwell defined this as the **Displacement Current** $I_D = \epsilon_0 \frac{d\Phi_E}{dt}$, with an associated displacement current density $\mathbf{J}_D = \epsilon_0 \frac{\partial \mathbf{E}}{\partial t}$. Adding this term to Ampere's Law resolves the mathematical contradiction and charge conservation, yielding the complete Ampere-Maxwell Law:

1. Gauss's Law (Electrostatics): $$ \nabla \cdot \mathbf{E} = \frac{\rho}{\epsilon_0} $$

2. Gauss's Law for Magnetism: $$ \nabla \cdot \mathbf{B} = 0 $$

3. Faraday's Law of Induction: $$ \nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t} $$

4. Ampere-Maxwell Law: $$ \nabla \times \mathbf{B} = \mu_0 \left( \mathbf{J} + \epsilon_0 \frac{\partial \mathbf{E}}{\partial t} \right) $$

4. Derivation of the Electromagnetic Wave Equation

In a complete vacuum containing no free charges ($\rho = 0$) and no electric currents ($\mathbf{J} = 0$), Maxwell's equations simplify to: $$ (a) \, \nabla \cdot \mathbf{E} = 0, \quad (b) \, \nabla \cdot \mathbf{B} = 0, \quad (c) \, \nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t}, \quad (d) \, \nabla \times \mathbf{B} = \mu_0 \epsilon_0 \frac{\partial \mathbf{E}}{\partial t} $$

To decouple these, we take the curl ($\nabla \times$) of Faraday's Law $(c)$: $$ \nabla \times (\nabla \times \mathbf{E}) = \nabla \times \left( -\frac{\partial \mathbf{B}}{\partial t} \right) = -\frac{\partial}{\partial t} (\nabla \times \mathbf{B}) $$

Applying the vector identity $\nabla \times (\nabla \times \mathbf{E}) = \nabla(\nabla \cdot \mathbf{E}) - \nabla^2\mathbf{E}$: $$ \nabla(\nabla \cdot \mathbf{E}) - \nabla^2\mathbf{E} = -\frac{\partial}{\partial t} (\nabla \times \mathbf{B}) $$

Substituting equations $(a)$ and $(d)$ into this expression: $$ 0 - \nabla^2\mathbf{E} = -\frac{\partial}{\partial t} \left( \mu_0 \epsilon_0 \frac{\partial \mathbf{E}}{\partial t} \right) \implies \nabla^2\mathbf{E} = \mu_0\epsilon_0 \frac{\partial^2\mathbf{E}}{\partial t^2} $$

This is the classical 3D **Wave Equation** which dictates wave propagation. Comparing this directly to the standard mechanical wave equation $\nabla^2 \psi = \frac{1}{v^2} \frac{\partial^2 \psi}{\partial t^2}$, we prove that electromagnetic fields propagate as waves with velocity: $$ c = \frac{1}{\sqrt{\mu_0 \epsilon_0}} \approx 2.998 \times 10^8 \text{ m/s} $$ This unified optics with electromagnetism, demonstrating that light is an electromagnetic wave.

5. Electromagnetic Energy Density, Wave Intensity, and the Poynting Vector

Electromagnetic waves carry physical energy stored within the propagating fields themselves. To calculate this, we define the electromagnetic energy density $u$ (energy per unit volume) as the sum of the electric energy density $u_E$ and magnetic energy density $u_B$: $$ u = u_E + u_B = \frac{1}{2}\epsilon_0 E^2 + \frac{1}{2\mu_0} B^2 $$

Recall that for a plane wave, the magnetic and electric field amplitudes are linked by $B = \frac{E}{c} = E\sqrt{\mu_0\epsilon_0}$. Substituting this relationship into the magnetic density: $$ u_B = \frac{1}{2\mu_0} \left(E^2 \mu_0\epsilon_0\right) = \frac{1}{2}\epsilon_0 E^2 = u_E $$ This proves an important physical symmetry: the energy carried by an electromagnetic wave is equally shared between the electric and magnetic fields. The total instantaneous energy density is therefore: $$ u = \epsilon_0 E^2 = \frac{1}{\mu_0} B^2 $$

To describe the rate of energy transfer per unit area per unit time (energy flux), we formulate the Poynting Vector $\mathbf{S}$. Since the fields are orthogonal and transport energy in the direction of wave propagation (the vector cross product of $\mathbf{E}$ and $\mathbf{B}$): $$ \mathbf{S} = \frac{1}{\mu_0} (\mathbf{E} \times \mathbf{B}) $$ The units of $\mathbf{S}$ are Watts per square meter ($\text{W/m}^2$). The direction of $\mathbf{S}$ is strictly aligned with the wave propagation vector $\mathbf{k}$, showing that energy flows orthogonally to the oscillating fields.

Because physical detectors (such as our eyes or photo-sensors) measure fields averaged over many optical cycles, we define the Wave Intensity $I$ as the time-average of the Poynting vector magnitude: $$ I = \langle S \rangle = \frac{1}{\mu_0} \langle E B \rangle = \frac{1}{\mu_0} \langle E_0 \sin(\mathbf{k}\cdot\mathbf{r}-\omega t) B_0 \sin(\mathbf{k}\cdot\mathbf{r}-\omega t) \rangle $$ Since the time-average of $\sin^2(\theta)$ over a full cycle is exactly $\frac{1}{2}$, this yields the classical intensity formulation: $$ I = \frac{E_0 B_0}{2\mu_0} = \frac{E_0^2}{2\mu_0 c} = \frac{1}{2} c \epsilon_0 E_0^2 $$ This shows that wave intensity is directly proportional to the square of the electric field amplitude ($E_0^2$).

Simulation 2: Electromagnetic Wave Propagation

Field Electrodynamics

Interactive 3D Window: Drag / swipe to rotate view

Wave Telemetry & Controls

Frequency ($f$): 4.0 Hz

Amplitude ($E_0$): 50 V/m

Polarization: Horizontal

Vector Color Code

Electric Field ($\mathbf{E}$ Vector)

Magnetic Field ($\mathbf{B}$ Vector)

Wave Propagation ($\mathbf{k}$ Vector)

03

The Relativity Crisis & Lorentz transformations

1. The Speed of Light Paradox

Maxwell's wave equation derived the velocity of light $c = 1/\sqrt{\mu_0\epsilon_0}$ strictly from universal parameters. However, in Galilean reference frames, speeds are relative ($c' = c - v$). Under Galilean relativity, there could only be *one* privileged frame where the speed of light is exactly $c$—the frame of the medium carrying light waves, dubbed the **Luminiferous Aether**.

2. Michelson-Morley Interferometer (1887)

Albert A. Michelson and Edward W. Morley attempted to measure the velocity of Earth through this Aether wind by splitting a light beam into two perpendicular arms, bouncing them off mirrors, and recombining them to form interference fringes.

Let one arm (length $L_1$) lie parallel to Earth's velocity vector $\mathbf{v}$ through the Aether, and the other arm (length $L_2$) lie perpendicular. The classical time of round-trip travel along the parallel arm is calculated via velocity addition: $$ t_{\text{para}} = \frac{L_1}{c - v} + \frac{L_1}{c + v} = \frac{2 L_1 c}{c^2 - v^2} = \frac{2 L_1}{c} \frac{1}{1 - v^2/c^2} $$ While the cross-wind travel along the perpendicular arm requires diagonal aiming, yielding: $$ t_{\text{perp}} = \frac{2 L_2}{\sqrt{c^2 - v^2}} = \frac{2 L_2}{c} \frac{1}{\sqrt{1 - v^2/c^2}} $$

As the Earth rotated, the phase difference between the arms should shift, shifting the interference pattern. However, despite extreme sensitivity, **no fringe shift was detected**. The velocity of Earth relative to the Aether was zero at all seasons, showing that the speed of light is isotropic (constant in all directions).

3. Postulates of Special Relativity

Albert Einstein resolved this crisis by discarding the Aether and reforming the concepts of space and time. He established two postulates:

The Principle of Relativity: The laws of physics are identical in all inertial reference frames.
The Constancy of the Speed of Light: Light propagates in empty space with a constant velocity $c$ independent of the motion of the emitting body.

4. Derivation of Time Dilation via Light-Clock

Consider a light clock consisting of two parallel mirrors separated by a distance $D$ in a stationary frame $S_0$. A light pulse bounces between them. The proper time interval $\Delta t_0$ for a round-trip bounce is: $$ \Delta t_0 = \frac{2D}{c} $$

Now, let this clock move horizontally at constant velocity $v$ relative to an observer in frame $S$. To this observer, the light pulse must travel a diagonal path of length $2L$ to trace the moving mirrors. In the time interval $\Delta t$ of the round trip, the clock travels a horizontal distance $v \Delta t$.

Applying the Pythagorean theorem to the triangle formed by the light path: $$ L^2 = D^2 + \left( \frac{v \Delta t}{2} \right)^2 $$

Since the speed of light is constant ($c$) in frame $S$, the distance $L$ traveled by the pulse is $L = c \frac{\Delta t}{2}$. Substituting: $$ \left( \frac{c \Delta t}{2} \right)^2 = D^2 + \left( \frac{v \Delta t}{2} \right)^2 $$

Multiplying by 4 and rearranging terms to isolate $\Delta t$: $$ c^2 \Delta t^2 - v^2 \Delta t^2 = 4D^2 \implies \Delta t^2(c^2 - v^2) = 4D^2 \implies \Delta t^2 = \frac{4D^2}{c^2 - v^2} = \frac{4D^2}{c^2 \left(1 - \frac{v^2}{c^2}\right)} $$ Taking the square root and substituting proper time $\Delta t_0 = \frac{2D}{c}$: $$ \Delta t = \frac{2D}{c} \frac{1}{\sqrt{1 - v^2/c^2}} = \gamma \Delta t_0 $$ where $\gamma = 1/\sqrt{1 - v^2/c^2}$ is the relativistic **Lorentz Factor**.

5. The Lorentz Transformations: Deriving Spacetime Transformations from Constant $c$

To ensure that the speed of light $c$ is strictly invariant for all inertial observers, the Galilean transformations must be discarded. How can we formulate an equation to transform coordinates while preserving the isotropic speed of light?
Let us assume a linear coordinate transformation between frame $S$ and frame $S'$ moving at relative velocity $v$ along the x-axis. To ensure uniform motion transforms to uniform motion, the transformation must be linear: $$ x' = \gamma (x - vt) $$ By the principle of relativity, the inverse transformation must carry the exact same mathematical form, differing only by the sign of the relative velocity: $$ x = \gamma (x' + vt') $$ Now, let us perform a thought experiment. A light pulse is emitted from the origin of both frames at the instant they coincide ($t = t' = 0$). Since the speed of light is observed as exactly $c$ in **both** frames, the position of the wave front must satisfy: $$ x = ct \quad \text{and} \quad x' = ct' $$ Let us substitute these isotropic wave equations into our transformation equations: $$ ct' = \gamma (c - v) t \quad \text{and} \quad ct = \gamma (c + v) t' $$ To solve for the scaling factor $\gamma$, we multiply these two expressions together: $$ c^2 t t' = \gamma^2 (c^2 - v^2) t t' $$ Dividing both sides by $t t'$ and isolating $\gamma$: $$ c^2 = \gamma^2 (c^2 - v^2) \implies \gamma^2 = \frac{c^2}{c^2 - v^2} = \frac{1}{1 - v^2/c^2} \implies \gamma = \frac{1}{\sqrt{1 - v^2/c^2}} $$ This is the exact same Lorentz factor derived from our geometric light-clock, proving that the constant speed of light mathematically necessitates this spatial stretch.
To find how time transforms ($t'$), we substitute the spatial equation $x' = \gamma(x-vt)$ into our reciprocal equation $x = \gamma(x' + vt')$: $$ x = \gamma \left[ \gamma(x - vt) + vt' \right] = \gamma^2 x - \gamma^2 vt + \gamma vt' $$ Isolating the $t'$ term: $$ \gamma vt' = x - \gamma^2 x + \gamma^2 vt \implies vt' = \frac{x}{\gamma} - \gamma x + \gamma vt $$ Dividing by $v$ and substituting $\frac{1}{\gamma} = \gamma(1 - v^2/c^2)$: $$ t' = \gamma t + \frac{x}{v} \left( \frac{1}{\gamma} - \gamma \right) = \gamma t + \frac{x}{v} \left( \gamma\left(1 - \frac{v^2}{c^2}\right) - \gamma \right) = \gamma t + \frac{x}{v} \left( \gamma - \gamma\frac{v^2}{c^2} - \gamma \right) $$ Simplifying the expression: $$ t' = \gamma\left(t - \frac{vx}{c^2}\right) $$ This yields the complete and elegant **Lorentz Transformations**:

$x' = \gamma(x - vt), \quad y' = y, \quad z' = z, \quad t' = \gamma\left(t - \frac{vx}{c^2}\right)$

These equations demonstrate that space and time are not independent absolutes, but rather orthogonal coordinates in a unified 4D spacetime continuum. A consequence of this is the relativity of simultaneity: events that occur simultaneously at different spatial locations in frame $S$ will not occur at the same time in frame $S'$.

6. 4-Vectors: Unifying Spacetime and Energy-Momentum

In special relativity, the separate three-dimensional vectors of space and scalar quantities of time are unified into four-dimensional vectors, or **4-vectors**, which transform under Lorentz boosts. The position 4-vector $X^\mu$ is defined as: $$ X^\mu = (X^0, X^1, X^2, X^3) = (ct, x, y, z) $$ where $\mu = 0,1,2,3$ denotes the spacetime components ($0$ is temporal, $1,2,3$ are spatial). Under a Lorentz boost at velocity $v$ along the x-axis, this vector transforms as $X'^\mu = \Lambda^\mu_{\ \nu} X^\nu$, which is exactly the matrix representation of the Lorentz transformations: $$ \begin{pmatrix} ct' \\ x' \\ y' \\ z' \end{pmatrix} = \begin{pmatrix} \gamma & -\beta\gamma & 0 & 0 \\ -\beta\gamma & \gamma & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} ct \\ x \\ y \\ z \end{pmatrix} $$ where $\beta = v/c$. To compute physical quantities that are identical for all observers, we define the scalar product of two 4-vectors using the flat **Minkowski metric** $\eta_{\mu\nu} = \text{diag}(-1, 1, 1, 1)$: $$ A \cdot B = \eta_{\mu\nu} A^\mu B^\nu = -A^0 B^0 + A^1 B^1 + A^2 B^2 + A^3 B^3 $$ The scalar product of the position 4-vector with itself yields the **invariant spacetime interval** $ds^2$: $$ ds^2 = X \cdot X = -c^2 dt^2 + dx^2 + dy^2 + dz^2 $$ Because $\eta_{\mu\nu}$ is invariant under Lorentz boosts, $ds^2$ has the exact same value in all inertial reference frames ($ds^2 = ds'^2$), representing the absolute geometry of spacetime.

Similarly, the three-dimensional momentum vector $\mathbf{p}$ and scalar relativistic energy $E$ are unified into the **energy-momentum 4-vector** $P^\mu$: $$ P^\mu = \left(\frac{E}{c}, p_x, p_y, p_z\right) $$ Taking the Lorentz-invariant scalar product of $P^\mu$ with itself: $$ P \cdot P = -\frac{E^2}{c^2} + |\mathbf{p}|^2 = -m^2 c^2 $$ where $m$ is the particle's invariant rest mass. Rearranging this yields Einstein's celebrated energy-momentum relation: $$ E^2 = (pc)^2 + (mc^2)^2 $$ For a particle at rest ($\mathbf{p} = 0$), this simplifies directly to the most famous equation in physics: $E = mc^2$. Thus, 4-vectors provide the ultimate coordinate-free representation of relativistic dynamics.

Simulation 3: Lorentz Transformations & Length Contraction

Special Relativity

Boost Control Room

Relative Speed ($\beta = v/c$): 0.00

Relativistic Telemetry

Lorentz Factor ($\gamma$): 1.00

Length Contraction ($L/L_0$): 100.0%

Time Dilation ($\Delta t / \Delta t_0$): 1.00x

Relativistic Momentum ($p / p_{\text{class}}$): 1.00x

Interactive Legend

Lorentz factor curve

Rocket Frame (Moving)

Earth Frame (Proper)

Drag the speed slider to see length contraction physically compress the rocket ship in the direction of motion, and see the clocks tick at dilating rates.

04

Thermodynamics & The Ultraviolet Catastrophe

1. Cavity Radiators and the Classical Failure: Deducing the Density of States

At the close of the 19th century, physicists sought to compute the exact spectrum of electromagnetic radiation emitted by a thermal cavity (an ideal blackbody). Lord Rayleigh and Sir James Jeans applied classical statistical mechanics to the radiation field inside a conducting cavity of volume $V = L^3$.
They treated the radiation field as a collection of electromagnetic standing waves with nodes at the metallic walls. By solving Maxwell's equations under these conducting boundary conditions, they deduced the number of electromagnetic standing wave modes per unit frequency interval $d\nu$ (the **Density of States**): $$ g(\nu)d\nu = \frac{8\pi V \nu^2}{c^3} d\nu $$

According to the classical statistical **Equipartition Theorem**, every degree of freedom in thermal equilibrium at temperature $T$ must carry an average energy of: $$ \langle E \rangle = k_B T $$ where $k_B$ is the Boltzmann constant. This continuous average energy is derived by integrating the Boltzmann probability distribution over all possible continuous energy states from $0$ to $\infty$: $$ \langle E \rangle = \frac{\int_0^\infty E e^{-\frac{E}{k_B T}} dE}{\int_0^\infty e^{-\frac{E}{k_B T}} dE} = k_B T $$

Multiplying the density of states $g(\nu)$ by the continuous average energy $k_B T$ yields the classical **Rayleigh-Jeans Spectral Radiance** (expressed in wavelength $\lambda = c/\nu$): $$ I(\lambda, T) = \frac{8\pi k_B T}{\lambda^4} $$

The Ultraviolet Catastrophe: Because the denominator scales as $\lambda^4$, as we head toward short wavelengths ($\lambda \to 0$, high frequencies in the ultraviolet and X-ray spectrum), the predicted spectral radiance diverges to infinity: $\lim_{\lambda \to 0} I(\lambda, T) = \infty$. Continuous classical physics predicted that every warm object (even a human body) must instantly radiate away all its thermal energy as infinite ultraviolet and X-ray radiation, leaving the universe in immediate darkness.

2. Planck's Quanta Hypothesis and the Resolution: Formulating Discrete Averages

In 1900, Max Planck resolved this crisis by introducing a radical physical assumption. He realized that to suppress the infinite divergence at high frequencies, there must be a mechanism that prevents high-frequency electromagnetic oscillators from absorbing thermal energy at a given temperature $T$.
If energy is continuous, any oscillator—no matter how high its frequency—can absorb an infinitesimally small amount of energy to participate in thermal equilibrium. But what if energy is strictly **discrete**?
Planck postulated that the microscopic oscillators in the cavity walls exchange energy with the radiation field *only* in discrete packets called **quanta**: $$ E = n h \nu = n \frac{h c}{\lambda}, \quad n \in \{0, 1, 2, 3, \dots\} $$ where $h$ is a new constant of nature (Planck's constant).
Under this discrete assumption, the average energy of an oscillator mode must be computed using a discrete sum over Maxwell-Boltzmann probabilities rather than a continuous integration: $$ \langle E \rangle = \frac{\sum_{n=0}^\infty (nh\nu) e^{-\frac{nh\nu}{k_B T}}}{\sum_{n=0}^\infty e^{-\frac{nh\nu}{k_B T}}} $$ Let us define a dimensionless parameter $x = \frac{h\nu}{k_B T}$. The denominator of this expression is a standard infinite geometric series: $$ S = \sum_{n=0}^\infty e^{-nx} = 1 + e^{-x} + e^{-2x} + \dots = \frac{1}{1 - e^{-x}} $$ The numerator can be rewritten as a derivative of this geometric series: $$ \sum_{n=0}^\infty (nh\nu) e^{-nx} = -h\nu \frac{d}{dx} \left( \sum_{n=0}^\infty e^{-nx} \right) = -h\nu \frac{d}{dx} \left( \frac{1}{1 - e^{-x}} \right) = -h\nu \left( \frac{-e^{-x}}{(1 - e^{-x})^2} \right) = h\nu \frac{e^{-x}}{(1 - e^{-x})^2} $$ Dividing our formulated numerator by the denominator yields the average energy of a quantized mode: $$ \langle E \rangle = \frac{h\nu \frac{e^{-x}}{(1 - e^{-x})^2}}{\frac{1}{1 - e^{-x}}} = h\nu \frac{e^{-x}}{1 - e^{-x}} = \frac{h\nu}{e^x - 1} = \frac{h\nu}{e^{\frac{h\nu}{k_B T}} - 1} $$ Let us analyze what this equation says:
* At low frequencies ($h\nu \ll k_B T$), we can Taylor-expand the denominator: $e^{\frac{h\nu}{k_B T}} - 1 \approx 1 + \frac{h\nu}{k_B T} - 1 = \frac{h\nu}{k_B T}$. The average energy becomes $\langle E \rangle \approx \frac{h\nu}{h\nu/k_B T} = k_B T$, which is exactly the continuous classical Equipartition result!
* At high frequencies ($h\nu \gg k_B T$), the denominator grows exponentially: $e^{\frac{h\nu}{k_B T}} \to \infty$. This forces the average energy to plummet exponentially to zero: $\langle E \rangle \to 0$. Because high-frequency quantum states require a large packet of energy ($h\nu$) to be excited, the available thermal energy ($k_B T$) is simply too small to excite them, mathematically freezing them out!
Multiplying our density of states by this discrete average energy yields the celebrated **Planck's Radiation Law**: $$ I(\lambda, T) = \frac{8\pi hc}{\lambda^5} \frac{1}{e^{\frac{hc}{\lambda k_B T}} - 1} $$ This beautiful formula perfectly matched the experimental curves, successfully predicting the spectral radiance peak and resolving the Ultraviolet Catastrophe by dropping to zero at short wavelengths: $\lim_{\lambda \to 0} I(\lambda, T) = 0$.

Simulation 4: Blackbody Radiation Curve

Thermodynamic Quanta

Blackbody Control Room

Temperature ($T$): 5500 K

Calculated Metrics

Peak $\lambda$: 527 nm

Spectral Band: Visible (Green)

Radiated Power: 5.2E7 W/m²

Interactive Legend

Planck Radiance Curve

Rayleigh-Jeans (Classical)

Visible Spectrum Area

Note how the classical Rayleigh-Jeans curve diverges upward toward the left (the Ultraviolet Catastrophe), while the correct Planck curve remains bounded and peaks.

05

The Birth of Quantum Mechanics

1. Einstein and the Photoelectric Effect (1905): The Collision of Light Quanta

Classical wave electrodynamics predicted that light, as a continuous wave, deposits energy over time. Therefore, the kinetic energy of ejected photoelectric electrons should scale strictly with the **intensity** of the light (which represents the field amplitude squared, $|\mathbf{E}|^2$). Under classical physics, a very dim light should require a long delay (minutes to hours) for an electron to accumulate enough wave energy to break free, whereas extremely bright light should eject incredibly fast electrons.

However, experiments conducted by Philipp Lenard and Robert Millikan revealed three shocking observations that completely shattered classical wave expectations:
1. **Frequency Dependency:** The kinetic energy of the ejected electrons depended strictly on the **frequency** $\nu$ (color) of the incident light, completely independent of its intensity!
2. **Zero Time Delay:** Ejection was strictly instantaneous—even for extremely dim light, electrons were emitted immediately with zero delay.
3. **Threshold Frequency:** Below a specific cutoff frequency $\nu_0$ (the work function threshold), **no** electrons were emitted, regardless of how bright or intense the light source was.
To explain these point-like, all-or-nothing collisions, Albert Einstein extended Planck's quanta hypothesis. He proposed that light propagates not as continuous waves, but as localized packets of energy called **photons**, each carrying a discrete energy: $$ E = h\nu $$ When a photon strikes an electron in a metal, it transfers its entire energy packet in a single, instantaneous collision. A fixed amount of energy, called the **Work Function** $\Phi$, is absorbed to overcome the electrostatic binding forces of the metal lattice. The remaining energy is converted into the electron's maximum kinetic energy.
By applying the principle of conservation of energy to this single-photon collision, we formulate Einstein's celebrated photoelectric equation: $$ K_{\text{max}} = h\nu - \Phi $$ This simple algebraic balance perfectly explained all experimental observations: the threshold frequency is where $h\nu_0 = \Phi$, the speed scales with frequency because energy scales with $\nu$, and the process is instant because it is a point-like collision.

2. Bohr's Quantized Hydrogen Atom (1913): Spectral Lines and Stable Orbits

Ernest Rutherford's alpha-particle scattering experiments proved that atoms consist of lightweight, negative electrons orbiting a dense, positively charged nucleus. However, classical electrodynamics dictated that accelerating charges must continuously emit electromagnetic radiation. An electron orbiting classically would continuously bleed away its kinetic energy, spiraling into the positive nucleus within $10^{-11}$ seconds. Classical physics predicted that all matter in the universe should instantly collapse!
Furthermore, when an electric discharge is passed through hydrogen gas, it does not emit a continuous thermal spectrum. Instead, it emits light at **discrete, sharp spectral lines** (the Balmer series), described by the empirical Rydberg formula: $$ \frac{1}{\lambda} = R \left( \frac{1}{n_{\text{lower}}^2} - \frac{1}{n_{\text{upper}}^2} \right) $$

Niels Bohr resolved both the collapse paradox and the discrete spectrum by postulating that electrons orbit only in non-radiating, stationary states, and that their **angular momentum** $L$ is restricted to integer multiples of $\hbar = h/2\pi$: $$ L = m_e v r = n \hbar, \quad n \in \{1, 2, 3, \dots\} $$ Let us derive how this quantization condition dictates orbital mechanics. Under circular motion, we balance the electrostatic attractive force with the centripetal force: $$ \frac{1}{4\pi\epsilon_0} \frac{e^2}{r^2} = \frac{m_e v^2}{r} \implies m_e v^2 = \frac{e^2}{4\pi\epsilon_0 r} $$ From the angular momentum quantization condition, we solve for velocity: $v = \frac{n\hbar}{m_e r}$. Substituting this into our force balance: $$ m_e \left( \frac{n\hbar}{m_e r} \right)^2 = \frac{e^2}{4\pi\epsilon_0 r} \implies \frac{n^2 \hbar^2}{m_e r^2} = \frac{e^2}{4\pi\epsilon_0 r} $$ Solving for the quantized orbit radius $r_n$: $$ r_n = \frac{4\pi\epsilon_0 \hbar^2}{m_e e^2} n^2 = a_0 n^2 $$ where $a_0 \approx 0.529 \text{ Å}$ is the **Bohr Radius** ($n=1$). This shows that the electron's orbit cannot decay continuously; it can only occupy discrete, stable shells.
The total mechanical energy is the sum of kinetic and electrostatic potential energy: $$ E = K + U = \frac{1}{2}m_e v^2 - \frac{e^2}{4\pi\epsilon_0 r} $$ Substituting our force balance expression $\frac{1}{2}m_e v^2 = \frac{e^2}{8\pi\epsilon_0 r}$: $$ E = \frac{e^2}{8\pi\epsilon_0 r} - \frac{e^2}{4\pi\epsilon_0 r} = -\frac{e^2}{8\pi\epsilon_0 r} $$ Substituting our quantized radius $r_n$ yields the quantized **Bohr Energy Levels**: $$ E_n = -\left(\frac{m_e e^4}{32 \pi^2 \epsilon_0^2 \hbar^2}\right) \frac{1}{n^2} \approx -\frac{13.606 \text{ eV}}{n^2} $$ When an electron transitions from a higher energy orbit $n_{\text{upper}}$ to a lower orbit $n_{\text{lower}}$, it emits a single photon carrying the exact energy difference: $\Delta E = E_{\text{upper}} - E_{\text{lower}} = h\nu = \frac{hc}{\lambda}$.
Substituting our quantized energy states: $$ \frac{1}{\lambda} = \frac{E_{\text{upper}} - E_{\text{lower}}}{hc} = \left( \frac{m_e e^4}{64\pi^3 \epsilon_0^2 \hbar^3 c} \right) \left( \frac{1}{n_{\text{lower}}^2} - \frac{1}{n_{\text{upper}}^2} \right) $$ This derived coefficient is exactly the Rydberg constant $R \approx 1.097 \times 10^7 \text{ m}^{-1}$, mathematically proving Bohr's quantization condition.

3. Wave-Particle Duality and the de Broglie Waves

In 1924, Louis de Broglie proposed a beautiful symmetry: if light waves behave as particles (photons), then matter particles must also possess a wave character. He defined the **matter wavelength** as: $$ \lambda = \frac{h}{p} $$ This wave-like behavior was subsequently confirmed experimentally by Clinton Davisson and Lester Germer, who fired a beam of electrons at a nickel crystal and observed sharp diffraction rings on a detector—a phenomenon unique to wave interference.

4. Postulating the Schrödinger Wave Equation: Deducing Operators from a Plane Wave

If particles behave as waves, what wave equation governs them? Erwin Schrödinger formulated his monumental wave equation using classical energy conservation and a plane wave ansatz.
For a non-relativistic particle, the total energy is the sum of kinetic and potential energy: $$ E = \frac{p^2}{2m} + U(x) $$ To describe this matter wave, Schrödinger assumed a free particle propagating along the x-axis is represented by a plane wave function: $$ \Psi(x, t) = A e^{i(kx - \omega t)} $$ Applying de Broglie's relation $p = \hbar k$ and Planck's relation $E = \hbar\omega$, we rewrite the wave in terms of physical observables: $$ \Psi(x, t) = A e^{\frac{i}{\hbar}(px - Et)} $$ To convert our classical energy equation into a differential wave equation, we find mathematical operators that can extract $E$ and $p^2$ from this wave function:
1. **Energy Operator:** Taking a first derivative of the wave function with respect to time $t$: $$ \frac{\partial\Psi}{\partial t} = -\frac{i E}{\hbar} \Psi \implies i\hbar \frac{\partial\Psi}{\partial t} = E\Psi $$ This maps the physical energy $E$ to the temporal differential operator $\hat{E} = i\hbar\frac{\partial}{\partial t}$.
2. **Momentum Operator:** Taking a first spatial derivative with respect to $x$: $$ \frac{\partial\Psi}{\partial x} = \frac{i p}{\hbar} \Psi \implies -i\hbar \frac{\partial\Psi}{\partial x} = p\Psi $$ This maps momentum $p$ to the spatial operator $\hat{p} = -i\hbar\frac{\partial}{\partial x}$. Differentiating twice with respect to $x$: $$ \frac{\partial^2\Psi}{\partial x^2} = -\frac{p^2}{\hbar^2} \Psi \implies -\hbar^2 \frac{\partial^2\Psi}{\partial x^2} = p^2 \Psi \implies -\frac{\hbar^2}{2m} \frac{\partial^2\Psi}{\partial x^2} = \frac{p^2}{2m} \Psi $$ This maps the kinetic energy term $\frac{p^2}{2m}$ to the spatial operator $-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}$.
Now, let us multiply our classical energy conservation equation $E = \frac{p^2}{2m} + U(x)$ by the wave function $\Psi$: $$ E\Psi = \frac{p^2}{2m}\Psi + U(x)\Psi $$ Substituting our temporal and spatial operators directly into this relation yields the monumental **Time-Dependent Schrödinger Equation**: $$ i\hbar \frac{\partial}{\partial t}\Psi(x,t) = \left[ -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + U(x) \right]\Psi(x,t) $$ This equation replaced classical deterministic trajectories with a **wave function** $\Psi$, whose magnitude squared $|\Psi(\mathbf{r}, t)|^2$ determines the probability density of finding a particle in space.

Simulation 5: Interactive Bohr Hydrogen Orbit Transitions

Quantum Mechanics

Orbital Transition Controller

Initial State ($n_i$)

Target State ($n_f$)

Transition Telemetry

Transition Energy: 1.89 eV

Photon Wavelength: 656 nm

Spectral Color: Balmer Red

Orbit Radius ($r_n$): 4.76 Å

Visualizer Instructions: Press "Trigger Transition". If moving to a lower shell, you'll see a photon wave emitted. If moving to a higher shell, you must hit the electron with an incoming photon.

06

Quantum Spin, Measurement & Spin-Orbit Coupling

1. Classical vs. Quantum Treatment of Spin (Moments and Currents)

In classical electrodynamics, a circulating electrical current $I$ around a flat closed loop of area $\mathbf{A}$ generates a magnetic dipole moment $\boldsymbol{\mu}$: $$ \boldsymbol{\mu}_{\text{class}} = I \mathbf{A} $$ If we model an electron of charge $q$ and mass $m$ executing a circular orbit of radius $r$ at velocity $v$, the current is $I = \frac{q}{T} = \frac{q v}{2\pi r}$, and the loop area is $A = \pi r^2$. Substituting these variables yields the classical magnetic moment: $$ \boldsymbol{\mu}_{\text{class}} = \left(\frac{q v}{2\pi r}\right) \left(\pi r^2 \hat{\mathbf{z}}\right) = \frac{q}{2} v r \hat{\mathbf{z}} $$ Because the classical orbital angular momentum of this electron is $\mathbf{L} = m \mathbf{r} \times \mathbf{v} = m v r \hat{\mathbf{z}}$, we can rewrite this relation in terms of the angular momentum vector: $$ \boldsymbol{\mu}_{\text{class}} = \frac{q}{2m} \mathbf{L} = \gamma_{\text{class}} \mathbf{L} $$ where $\gamma_{\text{class}} = \frac{q}{2m}$ is the **classical gyromagnetic ratio**. If we extend this derivation to a continuous rotating sphere of charge and mass, this proportionality constant remains unchanged, stating that classical magnetic moments scale strictly as a simple current loop.

However, when physicists measured the intrinsic magnetic moment associated with the electron's **spin** angular momentum $\mathbf{S}$, they discovered a profound quantum discrepancy. The spin magnetic moment $\boldsymbol{\mu}_s$ was formulated as: $$ \boldsymbol{\mu}_s = g_s \left(\frac{q}{2m}\right) \mathbf{S} = g_s \gamma_{\text{class}} \mathbf{S} $$ where $g_s$ is the dimensionless Landé g-factor. According to classical mechanics, $g_s$ should equal exactly $1$. Yet, relativistic quantum mechanics—specifically Paul Dirac's formulation of the wave equation (the **Dirac Equation**, 1928)—derived that for an intrinsic spin-1/2 particle: $$ g_s = 2 $$ Quantum Electrodynamics (QED) field corrections further modify this value slightly due to vacuum polarization: $g_s \approx 2.0023$. This anomalous doubling of the gyromagnetic ratio ($\gamma_s \approx \frac{q}{m}$) mathematically proves that **spin is not a classical charge rotation or localized current loop**. Spin is an intrinsic, non-classical relativistic angular momentum arising from the fundamental symmetries of the Poincaré group in spacetime.

2. Pauli Spinor Algebra & The Stern-Gerlach Experiment (1922)

To quantify this two-state spin-1/2 quantum system, Wolfgang Pauli formulated the spin operators in terms of the $2 \times 2$ **Pauli Spin Matrices** $\boldsymbol{\sigma}$: $$ \mathbf{S} = \frac{\hbar}{2} \boldsymbol{\sigma} \implies \sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} $$ which satisfy the anticommutation relation $\{\sigma_i, \sigma_j\} = 2\delta_{ij} I$ and commutation algebra $[\sigma_i, \sigma_j] = 2i\epsilon_{ijk}\sigma_k$. A quantum electron state is represented by a two-component complex column vector, or **spinor**: $$ |\chi\rangle = c_+ \begin{pmatrix} 1 \\ 0 \end{pmatrix} + c_- \begin{pmatrix} 0 \\ 1 \end{pmatrix} = c_+ |{\uparrow}\rangle + c_- |{\downarrow}\rangle $$ Otto Stern and Walther Gerlach experimentally verified this discrete quantization of spin in 1922. They fired a beam of neutral silver atoms through an inhomogeneous magnetic field pointing along the z-axis: $\mathbf{B} \approx (B_0 + \frac{\partial B_z}{\partial z}z)\hat{\mathbf{z}}$. The magnetic gradient exerts a force $\mathbf{F} = \nabla(\boldsymbol{\mu}_s \cdot \mathbf{B}) \approx \mu_{sz} \frac{\partial B_z}{\partial z} \hat{\mathbf{z}}$.
Classical physics predicted that since the atomic magnetic dipoles should be randomly oriented in space, the force would deflect the atoms continuously, leaving a smooth smeared vertical line on the detector. Instead, **the beam split into two discrete spots**, representing exactly spin-up ($s_z = +\hbar/2$) and spin-down ($s_z = -\hbar/2$). This proved that quantum angular momentum is spatially quantized.

3. Relativistic Spin-Orbit Coupling

When an electron orbits a heavy nucleus of charge $Ze$, it experiences a strong electrostatic potential $V(r)$. From the moving electron's frame of reference, this relative nuclear motion creates a massive circulating electric field $\mathbf{E} = -\boldsymbol{\nabla}V(r)$, which by Lorentz transformation is felt as a relativistic magnetic field $\mathbf{B}_{\text{rel}} = -\frac{1}{c^2}(\mathbf{v} \times \mathbf{E})$.
The interaction of the electron's spin magnetic moment $\boldsymbol{\mu}_s$ with this relativistic magnetic field, corrected by a factor of $1/2$ due to relativistic **Thomas Precession** in curved spacetime, is the **Spin-Orbit Coupling (SOC)** Hamiltonian $H_{\text{SO}}$: $$ H_{\text{SO}} = \frac{1}{2} \boldsymbol{\mu}_s \cdot \mathbf{B}_{\text{rel}} = \left( \frac{1}{2 m^2 c^2} \frac{1}{r} \frac{dV}{dr} \right) \mathbf{L} \cdot \mathbf{S} = \xi(r) \mathbf{L} \cdot \mathbf{S} $$ To calculate the energy splitting, we express the spin-orbit product in terms of the total angular momentum operator $\mathbf{J} = \mathbf{L} + \mathbf{S}$: $$ \mathbf{L} \cdot \mathbf{S} = \frac{1}{2} \left( \mathbf{J}^2 - \mathbf{L}^2 - \mathbf{S}^2 \right) $$ Evaluating this on states of quantum numbers $|l, s, j, m_j\rangle$, the expectation value is: $$ \langle \mathbf{L} \cdot \mathbf{S} \rangle = \frac{\hbar^2}{2} \left[ j(j+1) - l(l+1) - s(s+1) \right] $$ For a $p$-shell core level ($l=1$ and spin $s=1/2$), the total angular momentum $j$ can take values of $l+s = 3/2$ or $l-s = 1/2$. This splits the degenerate core states into two distinct energy levels: - **$2p_{3/2}$** ($j=3/2$, 4-fold degenerate, higher energy) $\implies$ **$L_3$ edge**. - **$2p_{1/2}$** ($j=1/2$, 2-fold degenerate, lower energy) $\implies$ **$L_2$ edge**. This spin-orbit core-level splitting is the fundamental origin of the double-peak X-ray Absorption Spectroscopy (XAS) profiles.

4. Quantum Measurement & The Elitzur-Vaidman "Quantum Bomb" Tester

The Stern-Gerlach experiment illustrates the quantum measurement postulate: when a quantum state in superposition is measured, its wavefunction collapses into a single eigenstate. In 1993, Avshalom Elitzur and Lev Vaidman proposed an extraordinary thought experiment showing that **quantum measurement can occur without any physical interaction at all**. This is the **Elitzur-Vaidman Bomb Tester**.
Consider a Mach-Zehnder interferometer. A single-photon source fires a photon at a 50/50 beamsplitter (BS1), splitting the photon wavefunction into a superposition of path $A$ (reflection) and path $B$ (transmission): $$ |\Psi\rangle = \frac{1}{\sqrt{2}} \left( i|A\rangle + |B\rangle \right) $$ These paths bounce off mirrors and recombine at a second beamsplitter (BS2) before hitting two detectors, $C$ and $D$.
- **Case 1: The Dud Bomb**. If we place a "dud" bomb on path $B$, the photon undergoes coherent superposition. The path lengths are adjusted so that the waves interfere constructively at Detector $C$ and destructively at Detector $D$. Thus, **the photon lands at Detector C 100% of the time**, and never at Detector D. - **Case 2: The Live Bomb**. If we place a "live" bomb on path $B$, the bomb trigger acts as a highly sensitive quantum measurement device. - With **50% probability**, the photon takes path $B$, hits the trigger, **and the bomb explodes**. - With **50% probability**, the photon takes path $A$. The wavefunction collapses into state $|A\rangle$. When this collapsed photon reaches BS2, it has no wave partner to interfere with. Therefore, it has a 50/50 chance of going to Detector $C$ or Detector $D$ (each representing **25% overall probability**).
If **Detector D clicks (25% probability)**, we have successfully detected that a **live bomb is blocking path B, even though the photon never touched it** and the bomb did not explode! This is the magic of quantum self-interaction and non-demolition measurement.

Simulation 6: The Elitzur-Vaidman Quantum Bomb Tester

Quantum Measurement

Interferometer Controller

Measurement Statistics

Total Trials: 0

Explosions: 0 (0.0%)

Detector C clicks: 0 (0.0%)

Detector D clicks: 0 (0.0%)

Visualizer Legend: Dud Bomb: Constructive interference forces the photon to hit Detector C 100% of the time.
Live Bomb: Detector D clicks 25% of the time, proving a live bomb is present without exploding it!

07

Multi-Electron Symmetries & Exchange Polarization

1. The Microscopic Origin of Magnetism: From Atomic Dipoles to Bulk Domains

To understand macroscopic magnets, we must trace their physical origin to the atomic scale. Every electron in an atom possesses a magnetic dipole moment originating from two quantum sources: its orbital angular momentum $\mathbf{L}$ (acting as a microscopic circulating current loop) and its intrinsic spin angular momentum $\mathbf{S}$ (an intrinsic quantum dipole). The total magnetic moment of a single atom is the vector sum of its electrons' orbital and spin moments.
How do these atomic dipoles behave collectively? Depending on their electronic shell structures and mutual interactions, materials exhibit five primary classes of magnetic behavior:

Diamagnetism (Universal & Weak): When an external magnetic field $\mathbf{H}$ is applied, the orbital motion of electrons adjusts to set up a microscopic circulating current that opposes the applied field (a direct quantum consequence of Lenz's Law). This produces a weak, negative magnetic susceptibility ($\chi \lt 0$). Diamagnetism is present in all matter, but it is extremely weak and easily masked by other magnetic responses. Water, bismuth, and grapes are classic examples of diamagnets.
Paramagnetism (Randomized Unpaired Spins): Atoms in paramagnetic materials possess unpaired electrons, creating permanent atomic dipoles. However, without an external field, thermal fluctuations completely randomize their orientations, resulting in zero net macroscopic magnetization. When a field $\mathbf{H}$ is applied, the dipoles weakly align along the field, yielding a positive susceptibility ($\chi \gt 0$). Once the field is removed, thermal energy instantly randomizes the spins again (e.g., aluminum, platinum).
Ferromagnetism (Spontaneous Parallel Alignment): In certain transition metals (Fe, Co, Ni) and rare-earth elements, the quantum exchange interaction ($J \gt 0$, arising from Pauli anti-symmetry and Coulomb repulsion) overcomes thermal randomization. It forces neighboring spins to spontaneously align parallel to one another. This yields a massive net macroscopic magnetic moment even in the absence of an external field, creating a permanent magnet.
Antiferromagnetism (Spontaneous Antiparallel Alignment): When the exchange coupling is negative ($J \lt 0$), neighboring spins align anti-parallel to one another. The microscopic magnetic moments of the sublattices perfectly cancel out, leaving a macroscopic magnetization of zero, though the material possesses highly ordered microscopic spin arrangements (e.g., chromium, manganese oxide).
Ferrimagnetism & Altermagnetism (Unbalanced Sublattices and Anisotropic Spin-Splitting): Ferrimagnets contain unequal anti-parallel sublattices, resulting in a net macroscopic moment (e.g., magnetite $\text{Fe}_3\text{O}_4$). Altermagnets represent a newly discovered third branch: collinear antiferromagnets where the crystal symmetry causes the electronic energy bands to split into spin-up and spin-down channels in momentum space, behaving like ferromagnets in transport while possessing zero net magnetization!

Why isn't every piece of iron a permanent magnet? Even though iron is ferromagnetic, a block of unmagnetized iron has a net magnetization of zero. This is due to **Magnetic Domains**.
To minimize the massive magnetostatic field energy (the energy stored in the surrounding magnetic field), a bulk ferromagnet spontaneously splits into microscopic regions called domains (typically $10\text{ }\mu\text{m}$ to $100\text{ }\mu\text{m}$ wide). Within each domain, all atomic spins are aligned. However, the magnetization vectors of neighboring domains point in different directions, canceling each other out globally.
When an external magnetic field $\mathbf{H}$ is applied: 1. **Domain Wall Motion**: The boundaries between domains (Bloch walls) sweep, expanding domains aligned with the field and shrinking misaligned ones. 2. **Domain Rotation**: At high fields, the magnetization vectors of the remaining domains rotate to align perfectly with the field, reaching **Saturation Magnetization** ($M_s$).
When the external field is removed, the domain walls do not return to their original configurations because they get pinned by crystal lattice defects and impurities. This results in **Remanence** ($B_r$)—the permanent magnetization that makes the iron a magnet. To return the magnetization to zero, we must apply an opposing magnetic field called the **Coercive Field** or **Coercivity** ($H_c$). This path-dependent lagging behavior is traced out on a **Magnetic Hysteresis Loop ($B$-$H$ curve)**, which represents the work done in sweeping domain walls back and forth, dissipating energy as heat.

2. Many-Body Symmetries & Slater Determinants

For many-electron atomic systems, the **Pauli Exclusion Principle** states that the total quantum wavefunction must be anti-symmetric under the exchange of any two identical fermions. To satisfy this requirement, many-body electronic wavefunctions are constructed as **Slater Determinants**: $$ \Psi(\mathbf{r}_1, \mathbf{s}_1, \dots, \mathbf{r}_N, \mathbf{s}_N) = \frac{1}{\sqrt{N!}} \begin{vmatrix} \psi_1(\mathbf{r}_1, \mathbf{s}_1) & \psi_2(\mathbf{r}_1, \mathbf{s}_1) & \dots & \psi_N(\mathbf{r}_1, \mathbf{s}_1) \\ \psi_1(\mathbf{r}_2, \mathbf{s}_2) & \psi_2(\mathbf{r}_2, \mathbf{s}_2) & \dots & \psi_N(\mathbf{r}_2, \mathbf{s}_2) \\ \vdots & \vdots & \ddots & \vdots \\ \psi_1(\mathbf{r}_N, \mathbf{s}_N) & \psi_2(\mathbf{r}_N, \mathbf{s}_N) & \dots & \psi_N(\mathbf{r}_N, \mathbf{s}_N) \end{vmatrix} $$ Exchanging any two rows corresponds to exchanging two electrons, which mathematically flips the determinant sign ($\Psi \to -\Psi$), enforcing perfect anti-symmetry. If two single-particle states $\psi_a$ and $\psi_b$ are identical, two columns are identical, rendering the determinant zero—proving that no two electrons can occupy the exact same quantum state.

3. Hund's Rules & Crystal Fields

To determine the ground-state angular momentum configuration ($L, S, J$) of many-electron $d$ or $f$ shells, we apply **Hund's Rules**: 1. **Maximize Spin ($S$)**: Electrostatically minimizes Coulomb repulsion because parallel spin electrons must occupy different spatial orbitals due to Pauli anti-symmetrization. 2. **Maximize Orbital Momentum ($L$)**: Minimizes Coulomb repulsion by ensuring electrons orbit in the same direction, reducing overlapping paths. 3. **Spin-Orbit Coupling Alignment ($J$)**: If the shell is less than half-full, $J = |L - S|$; if more than half-full, $J = L + S$.

4. The Heisenberg Exchange Interaction & Band Splitting

In solids, the Coulomb electrostatic repulsion combined with the Pauli exclusion principle creates an effective spin-spin coupling, the **Heisenberg Exchange Coupling**: $$ H_{\text{ex}} = -2 \sum_{i \lt j} J_{ij} \mathbf{S}_i \cdot \mathbf{S}_j $$ When $J_{ij} > 0$, the system minimizes energy when neighboring spins align parallel, yielding **Ferromagnetism**. In transition metals (Fe, Co, Ni), the $3d$ electrons hybridize into energy bands. The exchange interaction shifts the energy bands of spin-up (majority) and spin-down (minority) electrons relative to each other by an exchange energy $\Delta E_{\text{ex}}$, splitting the valence $3d$ Density of States (DOS).
Because the majority band is pushed below the Fermi level and filled, the unoccupied Density of States is heavily spin-asymmetric (polarized). This spin-polarized empty band acts as a **spin detector** for excited photoelectrons.

The Climax: Synchrotron Magnetic Spectroscopy

Fermi's Golden Rule & XMCD / XMLD

The spin-orbit split core levels ($2p_{3/2}$ and $2p_{1/2}$) and exchange-split empty $3d$ valence states form the exact physical architecture needed for synchrotron spectroscopy. Proceed to our advanced write-ups and 3D altermagnetic Fermi surface simulations!

Fermi's Golden Rule Derivation Materials: XMCD & XMLD Lab →