Quantum Mechanics Fundamentals
This page covers the quantum mechanics that informs quantum computing engineering decisions. It is not a full quantum mechanics course — those exist, they take a semester, and they assume calculus through linear algebra and complex analysis. The goal here is specifically the physics that matters when working with quantum computers as engineering artifacts: what a quantum state actually is mathematically, how the operations transform it, what measurement does, where noise comes from, and which intuitions from classical physics work versus break.
The intended reader is a senior engineer with comfort in linear algebra and probability who has not taken a quantum mechanics course. The exposition uses bra-ket notation and matrix mechanics but stays as close to the operationally relevant subset as possible.
This page is the deep-dive companion to the Quantum Computing umbrella overview. Quantum gates and algorithms are covered in Quantum Algorithms; the physical systems that realize qubits in hardware are covered in Qubit Architectures; error correction sits on top in its own subpage.
The state vector
The state of a quantum system is described by a state vector — a unit-length complex vector in a Hilbert space. For a single qubit, the Hilbert space is two-dimensional, and the state vector can be written:
|ψ⟩ = α|0⟩ + β|1⟩
where α and β are complex numbers (called amplitudes), |0⟩ and |1⟩ are the basis states (the two distinguishable classical outcomes), and the normalization condition |α|² + |β|² = 1 must hold.
A few things worth being explicit about:
The amplitudes are complex, not real. This is what distinguishes quantum mechanics from a classical probability theory over the same two states. A classical bit has probabilities p_0 and p_1 summing to 1. A qubit has amplitudes α and β whose squared magnitudes sum to 1. The complex phase of the amplitudes carries information that is operationally invisible at measurement but matters enormously when the qubit interacts with other qubits.
Global phase is unphysical. The state |ψ⟩ and the state e^(iφ)|ψ⟩ for any real φ describe the same physical situation. Measurement outcomes, expectation values, and any operationally meaningful quantity are unchanged by global phase. Relative phase between amplitudes — the phase of β relative to α — is physical.
The superposition principle. Any complex linear combination of valid states is itself a valid state, after normalization. This is the structural feature of quantum mechanics that makes quantum computing possible: the state space is the full continuum of unit-length vectors, not just the basis states.
The Hilbert space is the natural mathematical object. Hilbert spaces are complex vector spaces with an inner product that satisfies certain technical conditions (completeness in the induced norm). For computational purposes, the inner product is the operation that produces probabilities — specifically, ⟨φ|ψ⟩ gives the overlap between two states, and |⟨φ|ψ⟩|² gives the probability of measuring the state |ψ⟩ as |φ⟩ when |φ⟩ is one of the measurement outcomes.
The Bloch sphere
Any single-qubit state can be written in the form:
|ψ⟩ = cos(θ/2)|0⟩ + e^(iφ) sin(θ/2)|1⟩
where θ ∈ [0, π] and φ ∈ [0, 2π). The two angles map to a point on the surface of a unit sphere — the Bloch sphere — with |0⟩ at the north pole, |1⟩ at the south pole, and superpositions at intermediate latitudes.
The Bloch sphere is the standard visualization for single-qubit states because it captures the entire single-qubit state space in a familiar geometric object. Single-qubit operations (gates) become rotations of the sphere. The state |+⟩ = (|0⟩ + |1⟩)/√2 sits on the equator at φ = 0; |−⟩ = (|0⟩ − |1⟩)/√2 sits on the equator at φ = π. The states |0⟩ and |1⟩ are antipodal — which is a useful reminder that “orthogonal” in Hilbert space means “diametrically opposite” on the Bloch sphere, not “at 90 degrees” in any geometric sense.
The Bloch sphere extends naturally to mixed states by allowing points inside the sphere as well as on the surface. Pure states are on the surface; mixed states (covered below) are strictly inside; the center represents the maximally mixed state, equivalent to a classical fair coin flip with no quantum information.
For multi-qubit systems, the Bloch sphere visualization breaks down — there is no equivalent geometric picture for two or more qubits that captures entanglement. The Hilbert space is the right mathematical object; the Bloch sphere is a useful pedagogical aid for single qubits and a misleading metaphor for many-qubit systems.
Unitary evolution
The time evolution of a closed quantum system is unitary: the state vector evolves according to a unitary operator U applied to the initial state. A unitary operator satisfies U†U = UU† = I, where U† is the conjugate transpose (adjoint) of U.
The unitarity condition has several immediate consequences worth knowing:
Probability is preserved. A unitary operator maps unit vectors to unit vectors, so a properly-normalized initial state evolves to a properly-normalized final state. The probability that the system is in some state remains 1, which it must.
The inner product is preserved. For any two states |ψ⟩ and |φ⟩, the inner product ⟨φ|ψ⟩ is the same as ⟨φ|U†U|ψ⟩ — that is, the overlap between two states is invariant under unitary evolution. The geometric picture is that unitary operations rotate the state space rigidly; they do not stretch or squash it.
Unitary operations are reversible. Every unitary U has an inverse U† that undoes its action. This is the precise sense in which quantum computation is reversible: every gate operation can be undone by applying its adjoint, before any measurement happens.
Quantum gates are unitary operators. A quantum gate acting on n qubits is a unitary operator on a 2^n-dimensional Hilbert space. The set of allowed quantum operations is exactly the set of unitary operators (plus measurement, which is non-unitary and covered separately). Any operation that cannot be expressed as a unitary is not a legal quantum operation on a closed system.
The Schrödinger equation governs how the unitary U is generated from a Hamiltonian H: the evolution over time t is U = e^(-iHt/ℏ), where ℏ is Planck’s reduced constant. For most quantum computing purposes, the Hamiltonian formulation is hidden behind the “apply gate U” abstraction, but it matters in two contexts: quantum simulation algorithms that explicitly evolve states under physical Hamiltonians, and the hardware-level question of how a given gate is physically realized by driving the system with appropriate Hamiltonian terms.
Measurement and the Born rule
Measurement is the operation that converts quantum information into classical information. It is also the operation that breaks all the convenient mathematical properties of unitary evolution — measurement is non-unitary, irreversible, and probabilistic.
The standard formalism is projective measurement. An observable A is represented by a Hermitian operator (an operator equal to its own adjoint), which can be decomposed into its eigenvalues a_i and eigenstates |a_i⟩:
A = Σ_i a_i |a_i⟩⟨a_i|
When the observable A is measured on a state |ψ⟩:
- The outcome is one of the eigenvalues a_i.
- The probability of getting outcome a_i is |⟨a_i|ψ⟩|² — this is the Born rule, the fundamental rule that connects the abstract state vector to observable probabilities.
- After measurement, the state collapses to the corresponding eigenstate |a_i⟩.
For a single qubit measured in the computational basis (the eigenbasis of the Pauli Z operator):
- A state α|0⟩ + β|1⟩ yields outcome 0 with probability |α|², outcome 1 with probability |β|².
- After measurement, the state collapses to |0⟩ or |1⟩ accordingly.
Two important points:
Measurement destroys the quantum state. After a projective measurement, the state is no longer in the original superposition — it has collapsed to whichever eigenstate corresponds to the observed outcome. Information about the relative amplitudes and phases is lost. This is the structural reason quantum algorithms must be carefully designed: you cannot examine the state during the computation and then continue, because examining destroys the very superposition that the computation depends on.
Different measurements give different information. A qubit in the state |+⟩ = (|0⟩ + |1⟩)/√2 measured in the computational basis gives 0 or 1 with equal probability, providing no information about the state’s identity. The same qubit measured in the X basis (eigenstates |+⟩ and |−⟩) gives outcome “+” with probability 1. The choice of measurement basis is itself a meaningful decision; an experimenter who measures in the wrong basis gets random noise.
Beyond projective measurements, the more general POVM (Positive Operator-Valued Measure) formalism describes measurements that can be implemented by coupling the system to an auxiliary system and then projecting on the auxiliary. POVMs are the right framework for handling real hardware measurements that are noisy, partial, or otherwise imperfect.
Entanglement
Entanglement is the structural feature of quantum mechanics that makes quantum computing more powerful than probabilistic classical computing, and it is the property that distinguishes quantum information theory from any classical information theory.
A multi-qubit state is separable if it can be written as a tensor product of single-qubit states:
|ψ⟩ = |ψ_1⟩ ⊗ |ψ_2⟩ ⊗ … ⊗ |ψ_n⟩
A state that cannot be written this way is entangled. The two-qubit state (|00⟩ + |11⟩)/√2 — the Bell state — is the canonical example. Try to write it as |ψ_1⟩ ⊗ |ψ_2⟩ = (a|0⟩ + b|1⟩) ⊗ (c|0⟩ + d|1⟩) = ac|00⟩ + ad|01⟩ + bc|10⟩ + bd|11⟩. For this to equal (|00⟩ + |11⟩)/√2, you would need ac = bd = 1/√2 and ad = bc = 0, which has no solution. The state is not separable.
The four standard Bell states form an orthonormal basis for the two-qubit Hilbert space:
- |Φ⁺⟩ = (|00⟩ + |11⟩)/√2
- |Φ⁻⟩ = (|00⟩ − |11⟩)/√2
- |Ψ⁺⟩ = (|01⟩ + |10⟩)/√2
- |Ψ⁻⟩ = (|01⟩ − |10⟩)/√2
The operational signature of entanglement is non-classical correlation. For the Bell state |Φ⁺⟩, measurement of either qubit in the computational basis gives 0 or 1 with equal probability — but the two outcomes are perfectly correlated. The qubits always agree, even though neither qubit individually has a definite value before measurement.
Bell inequalities
The question of whether the correlation in entangled states could be explained by some “hidden variable” theory — some classical underlying reality that determines the outcomes — was answered by John Bell in 1964. Bell derived an inequality that any local hidden variable theory must satisfy and showed that quantum mechanics predicts violations of the inequality.
The most-cited version is the CHSH inequality (Clauser-Horne-Shimony-Holt, 1969). For any local hidden variable theory, a specific combination of correlation measurements is bounded:
|⟨E(a,b) − E(a,b′) + E(a′,b) + E(a′,b′)⟩| ≤ 2
Quantum mechanics predicts that the same combination can reach 2√2 ≈ 2.828 — the Tsirelson bound. Experimental measurements of the CHSH parameter consistently match the quantum prediction and violate the classical bound by many standard deviations.
The experimental program to test Bell inequalities runs from Alain Aspect’s experiments in 1981-1982 through a series of progressively more stringent tests. The 2015 loophole-free Bell tests by independent groups in Delft, Vienna, and the NIST/Boulder groups closed the last remaining experimental loopholes (the detection loophole and the locality loophole simultaneously) and confirmed quantum mechanics decisively. The 2022 Nobel Prize in Physics was awarded to Alain Aspect, John Clauser, and Anton Zeilinger specifically for this work.
The implication: entangled quantum systems exhibit correlations that have no classical explanation. Any “hidden variable” theory that respects locality (no faster-than-light influence) cannot reproduce quantum predictions. Quantum mechanics is not “classical mechanics with extra randomness”; it is structurally different.
Density matrices and mixed states
The state vector formalism above describes pure states — states of complete quantum information. Real quantum systems are usually in mixed states, which arise from statistical uncertainty about which pure state the system is actually in, or from the system being entangled with an inaccessible environment.
The density matrix ρ generalizes the state vector to describe both pure and mixed states:
ρ = Σ_i p_i |ψ_i⟩⟨ψ_i|
where the p_i are classical probabilities summing to 1, and the |ψ_i⟩ are pure states. For a pure state, the density matrix simplifies to ρ = |ψ⟩⟨ψ|, which satisfies ρ² = ρ. For mixed states, ρ² ≠ ρ in general.
Properties of any valid density matrix:
- Hermitian: ρ = ρ†
- Positive semidefinite: ⟨φ|ρ|φ⟩ ≥ 0 for any |φ⟩
- Trace one: Tr(ρ) = 1
For computations with mixed states, the Born rule generalizes: the probability of measurement outcome corresponding to projector P is Tr(Pρ). Unitary evolution becomes ρ → UρU†.
Why density matrices matter
The density matrix formalism is not optional once real hardware is in the picture. Three reasons:
Noise produces mixed states. A qubit subjected to noise — environmental interactions, control errors, decoherence — evolves from a pure state to a mixed state. The off-diagonal elements of the density matrix (the coherences) decay over time, and a state that started as a pure superposition becomes a statistical mixture indistinguishable from classical randomness. This is decoherence, covered in the next section.
Subsystems of entangled states are mixed. If two qubits are in an entangled pure state (the joint system is a pure state), each qubit individually is in a mixed state. The mathematical operation that extracts the state of one subsystem is the partial trace over the other subsystem. The reduced density matrix of either qubit in a Bell state, after tracing out the other, is the maximally mixed state I/2 — a coin flip with no information. The information is in the correlations, not in either qubit alone.
Measurement readout requires it. Real quantum hardware does not produce pure-state measurement outcomes. The output of a quantum circuit is a probability distribution over classical bitstrings, and the right framework for analyzing the relationship between intended circuit and observed measurement statistics is the density matrix formalism applied to noisy circuits.
Decoherence
Decoherence is the process by which a quantum system loses its quantum properties through interaction with its environment. It is the dominant practical obstacle to quantum computing, and the entire enterprise of quantum error correction exists to combat its effects.
The standard phenomenological model decomposes decoherence into two characteristic times:
T1 (relaxation time / amplitude damping) describes how excited states decay to the ground state. A qubit prepared in |1⟩ has probability e^(-t/T1) of still being in |1⟩ after time t, with the complement having relaxed to |0⟩. This is energy exchange with the environment — the qubit loses energy by emitting it into the surrounding electromagnetic field, lattice vibrations, or other environmental modes.
T2 (coherence time / dephasing) describes how the phase coherence between superposition components decays. A qubit prepared in (|0⟩ + |1⟩)/√2 has its relative phase scrambled over time, eventually becoming the mixed state (|0⟩⟨0| + |1⟩⟨1|)/2 — diagonal in the computational basis with no quantum coherence remaining. This is information leakage to the environment without energy exchange; the environment learns something about the qubit’s state, and the qubit’s coherence dies as a result.
The mathematical relationship is T2 ≤ 2·T1: dephasing happens at least twice as fast as relaxation, because relaxation also destroys coherence. In good qubits, T2 approaches 2·T1; in qubits with significant pure dephasing, T2 is substantially shorter.
For real hardware, the operational metrics that follow are:
- Coherence time is min(T1, T2), or specifically T2 for most practical purposes.
- Gate fidelity is bounded by the ratio of gate time to coherence time. A gate taking t_gate to execute on a qubit with coherence time T2 has error bounded by roughly t_gate/T2 even in the absence of any other error sources.
- Algorithm depth is bounded by the total circuit time relative to the coherence time. A 1000-gate circuit on hardware with 100 μs coherence and 100 ns gates uses 100 μs of total time, which is the full coherence budget. Algorithms substantially longer than the coherence time produce essentially noise.
Decoherence is the fundamental reason quantum error correction is needed. Even with perfect gates, the qubits themselves cannot maintain their state long enough to run a useful algorithm. Error correction encodes one logical qubit in many physical qubits and continuously corrects the errors that decoherence produces, at the cost of substantial overhead in physical qubit count.
The no-cloning theorem
The no-cloning theorem, published by William Wootters and Wojciech Zurek (and independently by Dennis Dieks) in 1982, states that it is impossible to construct a quantum operation that produces an exact copy of an arbitrary unknown quantum state. There is no quantum equivalent of the classical “copy a bit” operation.
The proof is brief. Suppose there existed a unitary U such that for any state |ψ⟩, U(|ψ⟩|0⟩) = |ψ⟩|ψ⟩. Apply U to two different states |ψ⟩ and |φ⟩:
U(|ψ⟩|0⟩) = |ψ⟩|ψ⟩
U(|φ⟩|0⟩) = |φ⟩|φ⟩
Take the inner product of both sides:
⟨ψ|φ⟩⟨0|0⟩ = ⟨ψ|φ⟩²
This implies ⟨ψ|φ⟩ = ⟨ψ|φ⟩², which can only hold when ⟨ψ|φ⟩ is 0 or 1 — that is, the cloning device can only work when the input states are orthogonal or identical. Universal cloning is impossible.
The no-cloning theorem has substantial implications:
- Quantum information cannot be amplified. Classical signal amplification depends on being able to copy a signal; quantum amplifiers exist only in restricted forms that distort the state.
- Quantum cryptography is possible. The BB84 quantum key distribution protocol, and many that followed, rely fundamentally on no-cloning — an eavesdropper cannot copy the quantum states being transmitted without detection.
- Quantum error correction is non-obvious. Classical error correction uses redundant copies; quantum error correction cannot copy the state directly. The Shor code, Steane code, surface code, and other quantum error correction schemes work around this by encoding the logical state into the entangled structure of many qubits rather than by replication.
- Some seemingly natural operations are forbidden. Examples include “measure without disturbing,” “copy a subsystem,” and “transmit quantum information faster than light.” All are blocked by no-cloning or its consequences.
Tensor products and many-qubit systems
The Hilbert space for a multi-qubit system is the tensor product of the Hilbert spaces for the individual qubits. For n qubits, the joint Hilbert space has dimension 2^n.
The tensor product satisfies the natural distributive properties:
(α|0⟩ + β|1⟩) ⊗ (γ|0⟩ + δ|1⟩) = αγ|00⟩ + αδ|01⟩ + βγ|10⟩ + βδ|11⟩
A general n-qubit state has 2^n complex amplitudes, one for each computational basis state |b_1 b_2 … b_n⟩. The number of parameters needed to describe the state grows exponentially with n. This is the entire source of quantum computing’s power and the entire reason quantum systems are hard to simulate classically.
For n = 50, the state space has 2^50 ≈ 10^15 complex amplitudes — petabyte-scale memory just to represent a single state. For n = 70, the state space exceeds the storage of the entire world’s data centers. For n = 100, the state space exceeds the number of atoms in a sand grain. The exponential is the point.
A multi-qubit unitary operator acts on the full 2^n-dimensional state space. Two-qubit gates like CNOT, CZ, or iSWAP are 4×4 unitary matrices acting on the two-qubit subspace; they extend to the full n-qubit space through tensor products with the identity on the other qubits. Building up complex multi-qubit unitaries from single-qubit and two-qubit gates is the universality theorem of quantum computing — any unitary on n qubits can be approximated to arbitrary precision by a circuit of one- and two-qubit gates.
The Heisenberg uncertainty principle
The Heisenberg uncertainty principle is one of the most-famous and most-commonly-misstated results in physics. The accurate statement: for any two observables A and B, the product of their standard deviations in any quantum state satisfies:
ΔA · ΔB ≥ |⟨[A,B]⟩| / 2
where [A,B] = AB − BA is the commutator. When A and B commute (their operators commute, [A,B] = 0), the uncertainty principle imposes no constraint. When they don’t commute, the principle gives a state-dependent lower bound on how precisely both can be known simultaneously.
The most-cited specific case is position and momentum, which satisfy [x,p] = iℏ, giving the famous Δx · Δp ≥ ℏ/2. For qubit observables, the X and Z Pauli operators don’t commute, and the uncertainty principle says that a state cannot simultaneously be a sharp eigenstate of both — which is consistent with the Bloch sphere picture: the states |0⟩ and |1⟩ (Z eigenstates) are antipodal, and the states |+⟩ and |−⟩ (X eigenstates) are antipodal in a different direction. A state on the surface of the Bloch sphere cannot be on both axes simultaneously.
Two things worth being explicit about:
The uncertainty principle is not about measurement disturbance. A common misconception is that uncertainty arises because measurement of A disturbs the system in a way that prevents measurement of B. There are situations where measurement disturbance is involved, but the uncertainty principle is a fundamental property of quantum states themselves — even before any measurement, a state cannot have simultaneously sharp values of non-commuting observables. The uncertainty is built into the structure of the quantum state, not introduced by the act of measurement.
Different uncertainty relations exist for different setups. The Heisenberg-Robertson form above is the standard textbook version. The entropic uncertainty relations of Maassen-Uffink give bounds in terms of entropy rather than standard deviations and are often more useful in quantum information contexts. The Heisenberg-Robertson-Schrödinger inequality is the tighter version that includes the covariance of the two observables.
Open quantum systems
A quantum system is closed if it interacts only with itself (and any unitary control operations applied to it). Real quantum systems are open: they interact with the environment, lose energy to it, decohere from it, and can be modeled rigorously only by including the environment in the description.
The standard mathematical framework for open quantum systems is the Lindblad equation (or Lindblad master equation):
dρ/dt = −i[H, ρ]/ℏ + Σ_k (L_k ρ L_k† − {L_k† L_k, ρ}/2)
The first term is the standard Schrödinger-equation evolution under the Hamiltonian H. The remaining terms — the Lindblad operators L_k — capture the non-unitary processes by which the environment couples to the system. Each L_k describes a specific noise channel: amplitude damping, dephasing, depolarization, leakage to non-computational states, and so on.
For qubit modeling purposes, the Lindblad equation is more general than needed for most engineering purposes. The phenomenological T1/T2 model captures the operationally relevant noise behavior for most quantum computing applications. The Lindblad formalism becomes important when modeling more complex noise patterns, when designing error correction codes that target specific error types, or when reasoning about the fundamental physics of decoherence in a particular qubit platform.
The general lesson is that real quantum systems do not follow pure unitary evolution. Models that assume they do — that treat noise as an afterthought to be added at the end — produce predictions that don’t match experimental data. Modern quantum hardware design and quantum algorithm design must incorporate noise from the start.
The interpretation question
A note on a topic that is more philosophical than operational, but that comes up often enough to address briefly.
Quantum mechanics has been the subject of competing interpretations for the entire history of the field. The Copenhagen interpretation treats measurement as fundamental and the wavefunction collapse as a real physical process. The many-worlds interpretation treats the wavefunction as the complete description of reality and treats apparent “collapse” as an observer becoming entangled with the system. The pilot wave or Bohmian mechanics interpretation posits hidden variables that determine outcomes deterministically. QBism and other epistemic interpretations treat the wavefunction as a representation of the observer’s information rather than as a physical object. Several other interpretations exist.
The interpretations all agree on the predictions of the theory — they give identical answers for every experimentally testable question. They differ in the philosophical framing of what the math means.
For quantum computing engineering purposes, the choice of interpretation does not matter. The math is what it is; the operational predictions are what they are. Practitioners can hold whatever interpretation they prefer (or none at all) and produce the same results. The interpretations are worth knowing as a cultural matter — they come up in conversation, they appear in popular accounts of the field, and the more thoughtful interpretive work has produced useful technical tools (decoherence theory, decoherent histories, consistent histories). But the engineering depends on the mathematics, not on the philosophical framing.
What this means for quantum computing
A summary of the operational takeaways from the physics covered above:
The state space is exponential. The 2^n-dimensional Hilbert space is the entire reason quantum computers are interesting. Classical simulation requires that exponential-dimensional state representation, which is why even modest quantum systems become intractable to simulate classically.
Operations must be unitary. Quantum gates are restricted to unitary operators on the state space. Any operation that cannot be expressed as a unitary is not a valid quantum operation, which constrains what algorithms can do.
Measurement is destructive and probabilistic. Quantum algorithms produce probability distributions over classical outcomes, not direct readouts of internal state. Algorithms must be designed so that the desired answer has high probability of being observed.
Entanglement is the resource. A quantum algorithm without entanglement provides no advantage over classical algorithms. The entangling two-qubit gates are where the computational power lives; single-qubit gates by themselves are insufficient.
Decoherence is the enemy. Real qubits do not maintain pure quantum states for long. The ratio of gate time to coherence time bounds the depth of useful circuits, and error correction is the bridge between noisy physical qubits and reliable logical qubits.
The no-cloning theorem shapes everything. Quantum error correction cannot duplicate state; quantum communication cannot copy messages; quantum cryptography exploits exactly the impossibility of undetected eavesdropping.
Open quantum systems formalism describes real hardware. Closed-system unitary evolution is a useful idealization but not what running hardware actually does. Real noise modeling, real error correction design, and real performance assessment all use the open quantum systems framework.
Where to go next on this site
Adjacent material on this site:
- Quantum Computing — the umbrella overview covering the field at the survey level.
- Qubit Architectures — how the physics described here is realized in actual hardware platforms.
- Quantum Algorithms — the procedures that exploit the physics to solve specific problems.
- Quantum Error Correction — the engineering layer that bridges noisy physical qubits to reliable logical qubits.
- Quantum Hardware: State of the Art — what current systems can actually do.
- Post-Quantum Cryptography — the cryptographic response to the algorithms enabled by the physics above.
The physics described here has been stable for nearly a century. The interpretations have not been settled and probably will not be. The operational use of the formalism — what it actually predicts about experiments and computations — has been verified repeatedly, including by the 2015 loophole-free Bell tests and the 2022 Nobel work. The math works; the engineering builds on it; the philosophy is its own conversation.