In short

The previous chapter built you a transmon — a superconducting island whose two lowest energy levels are your qubit |0\rangle and |1\rangle. This chapter is the toolkit that lets you do things to it. Single-qubit gates are shaped microwave pulses, 20–50 ns long, resonant with the qubit frequency \omega_{10}/2\pi \approx 5 GHz. The pulse envelope is a Gaussian with a derivative-correction (DRAG) to kill leakage to |2\rangle. Amplitude sets the rotation angle; the phase of the drive selects the rotation axis (drive cosine → R_x, drive sine → R_y). R_z is free — you just rename the frame (a virtual-Z gate). Two-qubit gates come in three competing flavours: cross-resonance (IBM), where you drive qubit A at qubit B's frequency and a ZX interaction falls out; iSWAP (early Google), where two transmons are tuned briefly onto resonance; and parametric CZ (Google Willow, Rigetti, modern Fluxonium), where a tunable coupler is modulated to synthesize a fast controlled-Z. Readout is dispersive: a resonator coupled far-off-resonance to the qubit gets its frequency shifted by an amount \pm \chi depending on whether the qubit is |0\rangle or |1\rangle. You probe the resonator, measure the phase of the reflected microwave, and infer the state — without destroying it more than necessary. State of the art in 2024-2026: 1-qubit fidelity 99.95% (IBM Heron, Google Willow), 2-qubit 99.6–99.7%, readout 98–99% in 100 ns to 1 μs. Trapped ions beat these numbers on fidelity; superconducting wins on gate speed (~100× faster).

A transmon sitting at 15 millikelvin inside a dilution refrigerator has two energy levels, |0\rangle and |1\rangle, separated by roughly 5 GHz. It can live there, undisturbed, for about 300 microseconds before decoherence eats the information. That is pretty long. But if nobody ever does anything to the transmon, you do not have a computer — you have a very cold, very expensive trinket.

You need three things. You need to rotate the qubit — flip |0\rangle to |1\rangle, or put it into a superposition. You need to entangle pairs of qubits. And you need to read the qubit out when you are done. All three happen over coaxial cables running into the fridge, all three use microwaves, and all three have to be done fast enough that you can fit thousands of gates into one coherence time.

This chapter is the manual. It covers single-qubit microwave drives, the three leading two-qubit-gate technologies, and dispersive readout. By the end you will know why a CNOT on IBM's hardware is actually a cross-resonance pulse, why Google's Willow chip uses tunable couplers instead, and why "we measured the qubit" actually means "we probed a resonator for 800 nanoseconds."

Single-qubit gates — a microwave pulse is a Bloch-sphere rotation

Start with the simplest thing. You have one transmon. You want to flip it from |0\rangle to |1\rangle.

The trick is resonance. The transmon has a specific transition frequency \omega_{10} — the energy gap between |0\rangle and |1\rangle, divided by \hbar. If you shine microwaves at the transmon at exactly that frequency, the electromagnetic field pushes the qubit to absorb a photon and climb to |1\rangle. Shine them longer, and the qubit oscillates between |0\rangle and |1\rangle — these are Rabi oscillations, the workhorse of every atomic and superconducting qubit experiment.

A shaped microwave pulse rotating a transmonLeft panel shows a Gaussian-enveloped microwave pulse of 30 nanoseconds with a derivative DRAG correction. Right panel shows a Bloch sphere with the state starting at the north pole and arcing down to the equator under a 90-degree rotation about the x axis.A 30-ns X/2 gate — shaped microwave pulse, Bloch-sphere rotationtime (ns) 0 ............ 30ampGaussian envelopeDRAG (derivative) correctionΩ(t) = A exp(−(t−t₀)²/2σ²) cos(ω₁₀ t + φ)|0⟩|1⟩|+⟩ (+x)R_x(π/2)
Left: a 30-nanosecond DRAG-shaped Gaussian microwave pulse, resonant with the qubit transition at $\omega_{10}/2\pi \approx 5$ GHz. The dashed red curve is the DRAG (derivative) correction applied to the out-of-phase quadrature to cancel unwanted coupling to the leakage state $|2\rangle$. Right: the same pulse, seen on the Bloch sphere. A pulse of area $\pi/2$ rotates $|0\rangle$ into $|+\rangle$ — an $R_x(\pi/2)$ gate.

The drive Hamiltonian and the rotating-wave trick

A microwave drive on a transmon adds a time-dependent term to the Hamiltonian. In the qubit's two-level approximation, you can write the total Hamiltonian as

H(t) = \tfrac{\hbar \omega_{10}}{2} Z + \hbar \Omega(t) \cos(\omega_d t + \varphi)\, X

where Z and X are the Pauli operators on the qubit, \omega_d is the drive frequency, \varphi is the drive phase, and \Omega(t) is the pulse envelope (amplitude in rad/s).

Why this form: the first term is the qubit's own energy — its resonance at \omega_{10}. The second is the microwave pushing the charge on the transmon's superconducting island back and forth, which in the two-level truncation looks like an X operator multiplied by the instantaneous field amplitude. The phase \varphi controls which axis in the equatorial plane the pulse pushes along.

Go into a frame rotating at \omega_d. After the rotating-wave approximation (drop the terms oscillating at 2\omega_d — they average to nothing), the effective Hamiltonian becomes

H_{\text{eff}} = \tfrac{\hbar \Delta}{2} Z + \tfrac{\hbar \Omega(t)}{2}\big(\cos\varphi\, X + \sin\varphi\, Y\big)

with \Delta = \omega_{10} - \omega_d the detuning between qubit and drive.

Why the detuning matters: if \Delta = 0 (drive exactly on resonance with the qubit), the Z term vanishes and the drive implements a pure rotation in the equatorial plane — around +X if \varphi = 0, around +Y if \varphi = \pi/2. Any detuning twists the axis off the equator and reduces gate fidelity. This is why calibration obsesses over qubit-frequency drift.

On resonance, a pulse of integrated amplitude \theta = \int \Omega(t)\, dt applied at phase \varphi rotates the qubit by angle \theta around the axis \hat{n} = (\cos\varphi, \sin\varphi, 0). This gives you any rotation in the X-Y plane. Combined with virtual-Z (see below), you get every single-qubit unitary you want.

Pulse shapes — why Gaussian, why DRAG

A square pulse (on, constant amplitude, off) would work in principle but spreads energy across many frequencies — some of which would excite the transmon out of the qubit subspace into |2\rangle. That failure mode is called leakage, and it is the biggest enemy of fast superconducting gates. Transmons have low anharmonicity: the |1\rangle \to |2\rangle transition is only \sim 300 MHz below \omega_{10}, so a square pulse with significant spectral content at that distance will populate |2\rangle.

The fix is a smooth envelope. A Gaussian pulse of width \sigma has spectral width \sim 1/\sigma, so if \sigma \gtrsim 5 ns the spectral content at the |1\rangle \to |2\rangle transition is exponentially suppressed. But Gaussians alone leave a small phase error. DRAGDerivative Removal by Adiabatic Gate (Motzoi et al. 2009) — adds a small amount of the pulse's time derivative on the other quadrature (the Y channel for an X drive, or vice versa). The DRAG coefficient is calibrated to cancel the leakage almost exactly. Modern hardware uses DRAG as a matter of course; the residual leakage is below 10^{-4} per gate.

The free R_z — virtual-Z gates

You might think R_z(\theta) rotations require a physical pulse too. They do not. Because R_z(\theta) = e^{-i\theta Z/2} commutes with the frame of measurement, you can implement it for free by redefining the phase of every subsequent pulse. If the next gate is R_x(\phi) = e^{-i\phi X/2}, you instead apply R_x at drive phase \varphi - \theta — which acts as if an R_z(\theta) happened just before.

This is virtual-Z (McKay et al. 2017). It is literally zero duration, zero error, and zero cost. It is the reason decompositions like H = R_z(\pi/2)\, R_x(\pi/2)\, R_z(\pi/2) are cheap on superconducting hardware — only the two R_x are real pulses; the two R_z are bookkeeping in the control software.

Net result: a transmon's native gate set is typically \{R_x(\pi/2), R_z(\varphi\,\text{any})\} for single qubits, plus one two-qubit entangler. Compilers translate arbitrary unitaries into this set.

Example 1: Calibrating a 30-ns X gate

The goal is to implement R_x(\pi) — a complete flip from |0\rangle to |1\rangle — with a single DRAG-Gaussian pulse.

Step 1. Pick the envelope. Use a Gaussian of total width T = 30 ns, parameterised as \Omega(t) = A \exp\big(-\tfrac{(t - 15)^2}{2\sigma^2}\big) with \sigma = 7.5 ns (so the pulse is essentially zero outside the 30 ns window). Why: 30 ns is a typical gate time that comfortably suppresses leakage (1/\sigma \approx 20 MHz spectral width, tiny at 300 MHz away from the |2\rangle transition) while being far shorter than T_1 \approx 300\,\mus.

Step 2. Set the rotation angle. You need the pulse area \theta = \int_0^T \Omega(t)\, dt = A \sigma \sqrt{2\pi} = \pi. Solving: A = \pi / (\sigma \sqrt{2\pi}) = \pi / (7.5 \times 10^{-9} \times 2.507) \approx 1.67 \times 10^{8}\,\text{rad/s} \approx 2\pi \times 26.6\,\text{MHz}. Why: this is the peak Rabi frequency. Gates with larger amplitudes go faster but excite more leakage; this sits in the sweet spot.

Step 3. Choose the phase. For R_x, set \varphi = 0 (cosine drive) — the resulting axis is +x. Why: the rotation axis is (\cos\varphi, \sin\varphi, 0). Setting \varphi = \pi/2 would give R_y instead, at identical cost.

Step 4. Add DRAG. On the orthogonal quadrature, add -\beta \dot\Omega(t)/\alpha where \alpha is the anharmonicity (\approx -300 MHz) and \beta is calibrated by minimising leakage (\beta \approx 0.5 on typical IBM hardware). Why: DRAG cancels the lowest-order leakage to |2\rangle caused by the finite bandwidth of the envelope.

Step 5. Send it. The arbitrary waveform generator (AWG) at room temperature produces I (cosine) and Q (sine) baseband samples at 1–4 GS/s, mixes them up to \omega_d, sends down a coaxial cable through attenuators, arrives at the qubit.

Result. A 30-ns pulse with peak amplitude \approx 2\pi \times 26.6 MHz and \beta \approx 0.5 DRAG implements R_x(\pi) with fidelity \approx 99.95\% on IBM Heron (2024).

Rabi oscillations as pulse area is variedExcited state population versus pulse area, showing cosine oscillations between 0 and 1 with the pi rotation marked at pulse area pi.pulse area θ (radians)P(|1⟩)01π/2π3π/2X gateX/2
Rabi oscillation: excited-state probability versus pulse area. Peak at $\theta = \pi$ is the X gate ($|0\rangle \to |1\rangle$). The X/2 gate sits at $\theta = \pi/2$, the amplitude most circuits decompose into.

What this shows: gate calibration is a handful of numbers — amplitude, width, DRAG coefficient, phase — set by the pulse integral you want and by pushing the leakage error below 10^{-4}. Once calibrated, the same pulse runs identically on every shot.

Two-qubit gates — three ways, three companies

Single-qubit gates are straightforward. The hard part of a superconducting quantum computer is getting two qubits to interact in a controlled way. You need an entangling gate — something like a CNOT, CZ, or iSWAP — and you need it fast and accurate. Three architectures dominate the field in 2026.

Three two-qubit gate architecturesThree side-by-side schematics. Left shows two fixed-frequency transmons connected by a fixed bus resonator with a cross-resonance drive. Middle shows two tunable transmons briefly brought to resonance by flux tuning. Right shows two transmons separated by a tunable coupler that is parametrically modulated.Two-qubit gates: cross-resonance, tunable flux, parametric couplerIBM — cross-resonanceQ_Aω_A = 5.0 GHzQ_Bω_B = 5.2 GHzbus res.drive @ ω_B→ effective ZXcompile to CNOT~300 ns, 99.5%Google 2019 — tunable fluxQ_AQ_Bflux bias lineω_A(Φ)ω_B(Φ)tune briefly on-resonance→ iSWAP (or √iSWAP)~20 ns, 99.4%Google Willow / Rigettiparametric CZQ_AQ_BtcAC flux@ |ω_A − ω_B|→ CZ via adiabaticavoided crossing~50 ns, 99.7%
The three dominant two-qubit-gate architectures on superconducting hardware. IBM's fixed-frequency transmons use an always-on bus plus a microwave drive applied to qubit A at qubit B's frequency (cross-resonance). Google's 2019-era sycamore-family used flux-tunable transmons brought briefly onto resonance to produce an iSWAP. Modern machines (Google Willow, Rigetti Ankaa, many new Fluxonium designs) insert a tunable coupler between the qubits and modulate its frequency parametrically to synthesize a CZ gate.

Cross-resonance (IBM)

IBM builds fixed-frequency transmons, each detuned from its neighbours by a few hundred MHz. To entangle qubit A with qubit B, you apply a microwave drive to qubit A, at qubit B's frequency. The drive is off-resonant with A (so it barely excites A) and resonant with B (but B is not directly wired to the drive — A is). Through the shared bus resonator, the drive couples to B with an effective strength that depends on whether A is in |0\rangle or |1\rangle.

In the rotating frame, the effective interaction takes the form

H_{\text{CR}} = \tfrac{\hbar}{2} J_{ZX}\, Z_A \otimes X_B + \text{(smaller terms)}

where J_{ZX} is the cross-resonance coupling, proportional to the drive amplitude and to the inverse detuning between the qubits. Turning this on for time t = \pi / (2 J_{ZX}) implements \exp(-i\tfrac{\pi}{4} Z_A X_B) — a ZX rotation. With single-qubit wrappers, this is equivalent to a CNOT.

Why this architecture: no tunable elements means fewer wires, better manufacturability, and more stable frequencies. Cost: gate times are longer (200–400 ns) because J_{ZX} is weak, and spectator qubits can leak in crosstalk. IBM's engineering over 2017–2024 brought cross-resonance fidelity from 95% to 99.7% through echoed-cross-resonance sequences, careful calibration of static ZZ coupling, and custom DRAG-like pulse shaping.

IBM Heron (2024): cross-resonance CNOT fidelity 99.7% average, gate time ~300 ns.

iSWAP and tunable flux (Google's early path)

Google's Sycamore (2019, the chip that ran the famous 53-qubit random-circuit-sampling experiment) used a different recipe. Each transmon had a flux-biased SQUID — a DC-flux-tunable element — so its frequency could be shifted by several hundred MHz on a few-nanosecond timescale. To do a two-qubit gate, you tune two neighbouring qubits briefly into resonance. Their |01\rangle and |10\rangle states hybridize, swap populations, and pick up a phase — this is the iSWAP family.

Gate time: as short as 12 ns. Fidelity: 99.4% in 2019, improved since. Cost: extra flux-bias wires per qubit, sensitivity to flux noise (limits T_2 at the tuning sweet spot).

Parametric CZ via tunable couplers (Google Willow, Rigetti)

Modern Google hardware (Willow, 2024) and Rigetti's Ankaa keep the two data transmons fixed-frequency but insert a tunable coupler — a third transmon whose frequency can be modulated by AC flux. Modulating the coupler at the detuning frequency |\omega_A - \omega_B| drives a parametric exchange interaction. With the right pulse shape, the interaction implements a controlled-Z in 40–80 ns, and the coupler can be tuned to zero effective coupling when idle — killing spectator crosstalk.

Why this wins: fast (tens of ns), cleanly turnable on and off, lower crosstalk. Current state of the art: Google Willow 99.7% CZ, 40 ns; Rigetti 99.5%, 72 ns; IBM's newer Egret/Flamingo architectures are also adopting tunable couplers.

Readout — dispersive measurement

A transmon's state cannot be read directly by scoping the qubit wire — the microwave signal you would need to send would smash the state. Instead, you pair every qubit with a readout resonator (a small superconducting LC oscillator at a different frequency, say 7 GHz) and use a beautiful bit of physics called dispersive coupling.

The physics

The qubit and resonator interact via the Jaynes–Cummings Hamiltonian. In the dispersive regime — where the detuning \Delta = \omega_q - \omega_r is large compared to the bare coupling g — the interaction reduces to

H_{\text{disp}} = \hbar \chi\, Z \otimes a^\dagger a

where \chi = g^2 / \Delta is the dispersive shift (typically 0.5–5 MHz) and a^\dagger a is the photon-number operator of the resonator. The effect: the resonator's frequency depends on the qubit state. If the qubit is |0\rangle, the resonator sits at \omega_r + \chi. If |1\rangle, at \omega_r - \chi.

Why this is wonderful: sending a weak microwave probe tone at \omega_r into the resonator and measuring the phase of what comes back tells you the qubit state without, in principle, absorbing any energy from the qubit itself. The interaction is a pure Z-type coupling — it commutes with the qubit Hamiltonian — so the measurement is quantum non-demolition (QND) up to higher-order corrections. You can measure repeatedly and get the same answer.

Dispersive readout schematicLeft shows a transmon coupled capacitively to a readout resonator which in turn connects to a transmission line. Right shows two Lorentzian resonator peaks shifted by plus and minus chi, one for qubit in zero and one for qubit in one.Dispersive readout — the resonator frequency depends on the qubit stateQtransmonω_q ≈ 5 GHzC_c (coupling)resonatorω_r ≈ 7 GHzprobe inreflected outω_rω_r − χω_r + χprobe frequency|1⟩|0⟩shift 2χ
Qubit and resonator are coupled by a small capacitance $C_c$. In the dispersive regime (qubit-resonator detuning large), the resonator's frequency depends on whether the qubit is $|0\rangle$ (right peak) or $|1\rangle$ (left peak). A probe tone sent in at $\omega_r$ reflects back with a phase that encodes the qubit state. Separation between the two peaks is $2\chi$, typically a few MHz.

The measurement protocol

  1. Send a probe tone at frequency \omega_r (midway between the two qubit-conditioned peaks) for a readout window of t_m.
  2. The reflected microwave is routed through a cold circulator and amplified by a Josephson parametric amplifier (JPA) or traveling-wave parametric amplifier (TWPA) at the 20 mK stage — these add quantum-limited noise. Then further amplification at 4 K (HEMT) and room temperature.
  3. At room temperature, demodulate the signal into in-phase (I) and quadrature (Q) components.
  4. Repeat the circuit many times. Plot each shot as a point on the (I, Q) plane. You see two blobs — one for each qubit state — separated by a distance set by the integrated signal-to-noise ratio (SNR).
  5. Fit a threshold line. Each new shot is classified as |0\rangle or |1\rangle based on which side of the threshold its (I, Q) falls.

The readout fidelity is F = 1 - \tfrac{1}{2}(P(1|0) + P(0|1)) — the average probability of correct assignment.

Numbers

Example 2: Signal-to-noise ratio of a dispersive readout

Suppose you have a readout resonator with \omega_r / 2\pi = 7 GHz, decay rate \kappa / 2\pi = 5 MHz, and dispersive shift \chi / 2\pi = 1 MHz. You drive the resonator with \bar{n} = 4 photons (a typical value — more photons is more signal but more back-action). Amplifier noise adds n_{\text{noise}} = 0.5 photons per mode per readout quadrature (the quantum-limited floor is 1/2).

Step 1. Find the phase shift. In steady state the reflected phase as a function of qubit state is \phi_\pm = \arctan(\pm 2\chi / \kappa). Plug in: \phi_\pm = \arctan(\pm 2 \cdot 1 / 5) = \arctan(\pm 0.4) \approx \pm 21.8°. Why: the qubit-state-dependent resonator frequency shift \pm \chi causes a phase shift of the reflected probe; the maximum phase difference 2\phi_+ is the raw readout signal.

Step 2. The signal difference between the two qubit states is |e^{i\phi_+} - e^{i\phi_-}| = 2\sin(\phi_+) \approx 0.743. In photon-amplitude units, scale by \sqrt{\bar n} = 2, giving signal amplitude \approx 1.49 per readout time. Why: the two qubit states produce complex-number readout outputs on opposite sides of the origin in the IQ plane; their separation is what distinguishes them.

Step 3. SNR per readout time. With a 500-ns integration and a quantum-limited amplifier, the number of noise photons is n_{\text{noise}} \cdot \kappa \cdot t_m = 0.5 \cdot 2\pi \cdot 5 \times 10^6 \cdot 500 \times 10^{-9} \approx 7.9. SNR = (signal)² / (noise variance) \approx (1.49 \cdot \sqrt{\kappa t_m})^2 / (2 n_{\text{noise}} \kappa t_m) \approx 6.3. Why: SNR grows linearly with integration time — longer reads give better fidelity, up to the limit set by T_1 decay during the window.

Step 4. Convert SNR to fidelity. For Gaussian blobs, F = \tfrac{1}{2}(1 + \text{erf}(\sqrt{\text{SNR}/2})). \sqrt{6.3/2} \approx 1.77; \text{erf}(1.77) \approx 0.988; so F \approx 0.994. Why: if the two IQ-plane blobs have Gaussian noise, the optimal threshold misclassifies only the tail fraction.

Result. A 500-ns dispersive readout with these parameters achieves ~99.4% assignment fidelity — consistent with IBM Heron and Google Willow 2024 numbers.

IQ plane showing readout blobsAn IQ plane with two Gaussian blobs, one labelled zero on the right side and one labelled one on the left side, with a vertical dashed line between them showing the decision threshold.IQ|1⟩|0⟩thresholdseparation ∝ √SNR
Repeated readout shots plotted in the IQ plane. Each shot is a demodulated point; preparing $|0\rangle$ many times produces one blob, $|1\rangle$ produces the other. The Gaussian width is set by the amplifier noise; the separation is set by the dispersive shift times the integration time. The vertical dashed line is the classifier threshold.

What this shows: readout fidelity is an explicit calculation, not a magic number. You pick \chi, \kappa, and integration time to balance speed against accuracy. More photons in the resonator gives a cleaner signal but increases back-action on the qubit.

Error sources and fidelity budgets

Every gate has a fidelity budget. Understanding where the error comes from tells you what to optimise.

A useful rule of thumb: on 2024 hardware, fidelity is limited roughly equally by the four listed sources. When one is improved, another becomes the bottleneck.

State of the art — 2024 to 2026

The field moves quickly. Benchmark numbers as of April 2026:

Platform Chip 1Q fidelity 2Q fidelity Readout fidelity Coherence T_1
Superconducting (IBM) Heron r2 (2024) 99.95% 99.7% (CR) 99.0% 250 μs
Superconducting (Google) Willow (2024) 99.97% 99.7% (CZ) 98.5% 100 μs
Superconducting (Rigetti) Ankaa-3 (2024) 99.9% 99.5% 97.5% 50 μs
Fluxonium (various) research (2024) 99.99% 99.5% 98% 1–10 ms
Trapped ions (Quantinuum) H2 (2024) 99.998% 99.9% 99.8% > 60 s

The trapped-ion column beats superconducting on every fidelity metric (by a factor of 3–10 on 2-qubit errors) and wins by orders of magnitude on coherence. The cost is speed: ion gates are 100–500 μs versus 30 ns single-qubit and 50 ns two-qubit for superconducting. A fair fidelity-per-unit-time comparison is more complicated; for now, both platforms are viable and invest in different bets on the road to fault tolerance.

Indian context

India's National Quantum Mission (2023, ₹6000 crore over 8 years) funds a superconducting-qubit effort led by TIFR Mumbai in collaboration with IISc Bangalore and IIT Bombay. The target is a 16-qubit processor by 2026, 50 qubits by 2028, 100 qubits by 2030. TIFR's group has published characterisation of two-qubit gates at the 99% level as of 2024 — the entry-level benchmark that IBM passed around 2017. The learning curve is steep but the direction is clear, and the hardware supply chain (dilution refrigerators from Bluefors in Finland, microwave electronics from Zurich Instruments) is established. For a student in India reading this chapter: the jobs will exist.

Common confusions

Going deeper

If you understand that single-qubit superconducting gates are 20–50 ns DRAG-shaped microwave pulses whose amplitude sets the rotation angle and phase sets the axis, that virtual-Z is free, that two-qubit gates come in three competing flavours (cross-resonance at IBM, iSWAP/CZ on tunable-flux or tunable-coupler platforms elsewhere), that readout is a dispersive coupling to a resonator whose state-dependent frequency shift is detected by probing and phase-measuring a reflected microwave, and that 2024-era fidelity numbers are 99.95% / 99.7% / 99% for 1Q / 2Q / readout — you have chapter 165. The rest is for readers who want the derivation of the ZX interaction, the surface-code fidelity requirements, the two-level-system (TLS) decoherence story, and the IBM-versus-Quantinuum benchmark debate.

DRAG — the derivation

The leakage problem is real. A Gaussian pulse of width \sigma has spectral density \propto e^{-\sigma^2 (\omega - \omega_{10})^2/2}. At the |1\rangle \to |2\rangle frequency \omega_{10} + \alpha (where \alpha \approx -300 MHz is the anharmonicity), the spectral weight is e^{-\sigma^2 \alpha^2 /2}. For \sigma = 5 ns and |\alpha| = 2\pi \times 300 MHz, this evaluates to e^{-0.5 \times 88} \approx 10^{-19} — absurdly small. So why does leakage happen at all?

Because the transmon is not a true two-level system. At leading order in perturbation theory, the drive on |0\rangle \to |1\rangle is accompanied by a next-order-in-\alpha off-resonant drive on |1\rangle \to |2\rangle, whose effect does not scale as the spectral density — it scales as 1/\alpha times the pulse amplitude. DRAG cancels this term exactly by adding a quadrature-phase pulse whose derivative matches the leakage-excitation amplitude. The original Motzoi et al. (2009) derivation gives

\Omega_{\text{DRAG}}^{(Y)}(t) = -\frac{\dot\Omega^{(X)}(t)}{\alpha}

for a first-order cancellation. Higher-order DRAG schemes (Chen et al. 2016) cancel further leakage and phase errors and are standard on modern hardware.

Cross-resonance — the ZX derivation

Consider two transmons A (drive qubit) and B (target qubit) coupled via a shared bus resonator with exchange coupling J. The bare Hamiltonian (after RWA and adiabatic elimination of the resonator) is

H_0 = \tfrac{\omega_A}{2} Z_A + \tfrac{\omega_B}{2} Z_B + J (X_A X_B + Y_A Y_B)/2.

Apply a drive \Omega \cos(\omega_B t)\, X_A. Go into a frame rotating at \omega_B for both qubits. Second-order perturbation theory in J/(\omega_A - \omega_B) produces, in the qubit-qubit effective Hamiltonian, the term

H_{ZX} = \frac{J \Omega}{2(\omega_A - \omega_B)} Z_A X_B + \text{(ZY, IX, IY corrections)}.

Over time t = \pi / (2 |H_{ZX}|) this implements \exp(-i\tfrac{\pi}{4} Z_A X_B). Single-qubit R_z(\pi/2) and R_y(\pi/2) wrappers convert this to CNOT. Modern IBM hardware uses an echoed cross-resonance sequence (drive, X_A, drive, X_A) to cancel the unwanted IX and ZI terms exactly — the ZX and IX split into even and odd sectors.

Dispersive readout — the shift formula

Start from the Jaynes–Cummings Hamiltonian for a qubit (two-level atom with frequency \omega_q) coupled to a single resonator mode (frequency \omega_r, bosonic operator a):

H_{JC} = \tfrac{\omega_q}{2} Z + \omega_r a^\dagger a + g\, (a^\dagger \sigma_- + a \sigma_+).

Apply the Schrieffer–Wolff transformation at small g/|\Delta| where \Delta = \omega_q - \omega_r:

H_{\text{disp}} = \tfrac{\omega_q + \chi}{2} Z + (\omega_r + \chi Z) a^\dagger a + O(g^4/\Delta^3)

with \chi = g^2 / \Delta. The \chi Z a^\dagger a term is the readout signal. For a transmon (which is weakly anharmonic, not a true two-level system) there is a correction \chi = g^2/\Delta \cdot \alpha / (\alpha + \Delta) that accounts for the |2\rangle level — this is the formula used in real calibration.

Two-level systems (TLS) — the coherence limit

Transmons do not sit in empty space. They live on an amorphous dielectric (usually aluminium oxide) and interact with defects in that dielectric that themselves behave as two-level systems — TLS defects. Each TLS couples to the qubit via the dipole-dipole interaction, providing a channel for T_1 decay. TLS defects are the dominant source of T_1 variation across qubits on the same chip. They appear and disappear as the chip is thermally cycled. The field has spent two decades trying to reduce TLS density by improving fabrication (tantalum-on-sapphire substrates give T_1 \sim 1 ms, aluminium-on-silicon gives \sim 100300\,\mus); progress is incremental and active.

The fault-tolerance threshold

For a surface code, the per-gate error rate must be below the threshold of about 1\% for the code to improve logical error with increasing distance. Current hardware is at 0.30.5\% per two-qubit gate — a factor of 2–3 below threshold. This is enough for small logical qubits (Google Willow demonstrated below-threshold performance at distance 7 in 2024), but not enough for truly useful encoded computation, which requires thresholds 100–1000× below. That factor is the engineering gap between NISQ and fault-tolerant computing.

NVIDIA cuQuantum and classical simulators

Superconducting hardware is benchmarked against classical simulators running the same circuit. NVIDIA's cuQuantum library lets a single GPU simulate dense state vectors of up to ~32 qubits and tensor-network approximations of much larger circuits. Benchmarking a 99.7%-fidelity 2-qubit gate on hardware means nothing without a classical baseline of what the gate should have produced. Every gate-fidelity claim you read is validated against such a simulator.

The IBM-versus-Quantinuum debate

IBM argues superconducting wins in the long run because it scales faster (1000+ qubits already) and gate speed lets you fit deeper circuits in a coherence time. Quantinuum argues trapped ions win because 99.9% gate fidelity and second-long coherence mean logical qubits today, and QCCD architectures (ion shuttling across junctions) will scale. The academic consensus: both will succeed for different applications. NISQ-scale chemistry simulations fit comfortably on either. Large-scale factoring probably needs millions of physical qubits regardless of platform, which means the winning platform is whichever first demonstrates reliable fabrication at that scale.

Where this leads next

References

  1. Philip Krantz et al., A Quantum Engineer's Guide to Superconducting Qubits (2019), Applied Physics ReviewsarXiv:1904.06560.
  2. Alexandre Blais et al., Circuit Quantum Electrodynamics (2021), Reviews of Modern PhysicsarXiv:2005.12667.
  3. Rami Barends et al., Superconducting quantum circuits at the surface code threshold for fault tolerance (2014), NaturearXiv:1402.4848.
  4. IBM Quantum, IBM Quantum Hardware Documentationibm.com/quantum/technology.
  5. Wikipedia, Superconducting quantum computing.
  6. John Preskill, Lecture Notes on Quantum Computation, Chapter 7 — theory.caltech.edu/~preskill/ph229.