Mechanism · a hard thing done with only shift, add, and a sign

The Sine That Never Multiplied

A pocket calculator finds the sine of an angle without ever multiplying two numbers together. It walks a single vector through a fixed ladder of ever-halving angles — arctan 1, arctan ½, arctan ¼ — and at each rung it does only three things a chip finds trivial: shift some bits, add, and choose a sign. That is CORDIC. Watch it work, watch why its magic constant never changes, then flip one parameter and turn the same machine into a circle, a straight line, or a hyperbola.

In 1959 Jack Volder needed to compute trigonometry aboard a B-58 bomber, in real time, with 1950s hardware that had no multiplier worth the name — just shifters and adders. His answer, the COordinate Rotation DIgital Computer, is still how a great deal of silicon computes sines today. The idea is almost impudent: to rotate a vector by some target angle θ, don't rotate it by θ. Rotate it by arctan 1 = 45°. Overshot? Rotate back by arctan ½ ≈ 26.57°. Still not there? Forward by arctan ¼ ≈ 14.04°. Each rung is half-ish the last, so the running total zeroes in on θ like a scale balancing — a binary search across the arc.

The reason those particular angles were chosen — the arctangents of 2⁻ⁱ — is the whole trick. Rotating by an angle whose tangent is exactly 2⁻ⁱ means the rotation's arithmetic collapses into a bit-shift. A true rotation multiplies by a 2×2 matrix full of sines and cosines. CORDIC's rotation multiplies by 2⁻ⁱ — which, in binary, is just "move the point i places right." No multiplier required.

§1 · The ladder

A fixed table of angles, computed once

Every CORDIC machine carries one small read-only table: the arctangents of the negative powers of two. Here are the first rungs (recomputed live, below), beside the length each rung stretches the vector — hold onto that second column, it is the secret of the whole method.

i	2⁻ⁱ (the shift)	αᵢ = arctan 2⁻ⁱ	αᵢ in degrees	scale sᵢ = √(1+2⁻²ⁱ)

the whole angle table, computed once and frozen Σ αᵢ = the reach ↓

Add every rung and you get the farthest angle a bare CORDIC can reach: Σ arctan 2⁻ⁱ = 1.743 rad ≈ 99.88°. Ask for more than that and you must first fold the angle into a quadrant (the calculator does; our instrument stays inside the honest range). Now let it run.

§2 · The instrument

Watch one vector zig-zag onto the circle

Set a target angle. The vector starts pointing along the x-axis, at a peculiar length K = 0.6073… we will explain in a moment. Then it takes N pseudo-rotations, each time turning toward θ if it is short of it, away if it has overshot — the + and − choices below. After enough steps the tip lands, to full precision, on the unit circle at exactly (cos θ, sin θ). No sine was ever looked up; it fell out of the rotations.

start length K = 0.6073tip → (cos θ, sin θ)

Target angle θ — 40.0°

Native CORDIC reaches ±99.88°. Beyond that a real chip folds by quadrant first.

Iterations N — 14

Each step wins about one more bit of accuracy — until it can't. Drag to feel the floor.

CORDIC cos θ / sin θ

0.7660 / 0.6428

from shifts & adds only

Math.cos / Math.sin

0.7660 / 0.6428

the library's answer

|error| (max of the two)

1.2e-4

≈ 13 correct bits

the sign choices dᵢ — turn toward θ (+) or back (−)

That is the entire algorithm. Three integer operations per step — a shift, an add or subtract, and reading one sign bit — repeated a couple of dozen times. Nothing that deserves to be called a multiply. But two things should nag: why does the vector start at that strange length K, and why does K never depend on the angle you asked for? That constant is doing something, and it is the hinge the whole method turns on.

§3 · The payload

Why the gain K never changes

Here is the sleight of hand. A true rotation by angle α keeps the vector's length exactly the same. CORDIC's step does not. When it "rotates" by using tan α = 2⁻ⁱ, it quietly skips the cos α that a real rotation carries — so each step is a rotation and a stretch. This is why it is called a pseudo-rotation. The step is:

x_i+1 = x_i − d·y_i·2⁻ⁱ # d = ±1, the sign choice
y_i+1 = y_i + d·x_i·2⁻ⁱ

Ask how much longer the vector got. Square the new length and expand — this is the one calculation that matters on this whole page:

|v_i+1|² = (x − d·y·2⁻ⁱ)² + (y + d·x·2⁻ⁱ)²
= x² − 2·d·x·y·2⁻ⁱ + y²·2⁻²ⁱ + y² + 2·d·x·y·2⁻ⁱ + x²·2⁻²ⁱ
= (x² + y²)·(1 + 2⁻²ⁱ) # the two red terms cancel — for either sign of d

Look at what the sign d does and does not touch. It appears only in the two red cross-terms — and those are equal and opposite, so they cancel for d = +1 and for d = −1 alike (all that matters is d² = 1). What survives, the stretch factor √(1+2⁻²ⁱ), is set by the shift magnitude i alone. The sign steers the angle; it has no vote on the length.

So over N steps the vector's length is multiplied by the same product no matter what angle you steered toward — a fixed number. Its reciprocal is K. Start the vector at length K and the accumulated stretch lands it back at length exactly 1, on the unit circle, where its coordinates are the cosine and sine. Below: pick a rung i and a vector, and see the same stretch land the + step and the − step on the identical dashed circle — different directions, one radius.

Rung i (shift magnitude) — 1

The stretch is √(1+2⁻²ⁱ). Bigger i → tinier stretch, closer to a true rotation.

Vector direction — 55°

Any direction: the two tips always share one dashed circle. The sign never moves the radius.

stretch this rung sᵢ

1.1180

= √(1+2⁻²ⁱ)

length after + step

1.1180

|v| · sᵢ

length after − step

1.1180

identical → K is constant

And here is K assembling itself. The running product of the stretch factors converges to the total gain A = 1/K; its reciprocal settles onto the constant every CORDIC chip stores. Tied to the same iteration count N as the instrument above:

A(N) = ∏ √(1+2⁻²ⁱ) = 1.6467… → K = 1/A = 0.6073…

Converged, to the last bit a double can hold: K = 0.60725293500888, 1/K = 1.64676025812107. This single number is the price of admission, paid once, for every trig call the chip will ever make.

§4 · The honest floor

One bit per step — until it isn't

The folklore says CORDIC gains "about one bit of accuracy per iteration," and the sloped part of the curve below shows exactly that: each rung roughly halves the error. But the folklore stops too soon. The angle table and the running arithmetic are themselves only stored to finite precision, so past a certain point new iterations stop buying anything — the error flattens, and in double precision it even bounces around a floor near 10⁻⁹. More steps is not more truth forever. Drag N and watch the marker walk down the slope and then wander along the floor.

|error| vs iterations, at θ = 0.7 rad · log scaleN = 14

N	\|sin error\|	bits	vs. slope

the sloped rows gain ~1 bit each; the last rows sit on the floorrecomputed live

This flattening is not a flaw to hide — it is the honest shape of every iterative method, and naming it is the difference between a demo and the truth. A hardware designer reads this curve backwards: to guarantee b good bits, run about b iterations and store the table to a couple of guard bits beyond that. The cost is linear in the number of result bits — which, in a moment, is exactly why CORDIC lost the general-purpose CPU.

§5 · The flagship

One machine, three geometries

Everything so far was Volder's 1959 circular CORDIC. Twelve years later, John Walther at Hewlett-Packard noticed that the identical shift-add-sign iteration — change nothing but one sign in the update and swap the little angle table — computes an entire library of functions, not just sine. He added a single parameter m:

x_i+1 = x_i − m·d·y_i·2⁻ⁱ
y_i+1 = y_i + d·x_i·2⁻ⁱ
z_i+1 = z_i − d·α_i # α from the table chosen by m

The quantity the iteration secretly conserves is x² + m·y². Set m = +1 and that is a circle — sine, cosine, arctangent. Set m = 0 and the conserved shape degenerates to a straight line — and the machine multiplies and divides. Set m = −1 and it is a hyperbola — giving exp, ln, √, and the hyperbolic functions. Same rotations. Same silicon. Flip m and watch the point walk a different curve.

conserved: x² + y² = 1 (a circle)point → (cos θ, sin θ)

CORDIC result

—

same shift-add machine

Math library

—

the answer to beat

|error|

—

i	d	x	y	z (angle left)

circular · gain 1/K = 1.6468 · reach ±99.88°the iteration, live

The hyperbolic mode hides one genuine wrinkle, and the instrument honours it rather than fudging it: not every rung converges on the first pass. Iterations 4, 13, 40, 121, … (each 3k+1 after the last) must be repeated, or exp and ln silently drift. You can see the doubled rows in the trace table above when m = −1. That is the sort of detail a one-paragraph "CORDIC rotates a vector" tutorial never mentions — and the reason the verifier for this page tests convergence with and without the repeats.

This unification is the answer to the sophisticated dismissal — "CORDIC is a museum piece; real processors just evaluate polynomials." The depth of CORDIC was never the sine. It was that one tiny state machine, by swapping a table and a sign, spans trigonometry, exponentials, logarithms, multiplication, division and square roots. Which is why it never left the places where multipliers are expensive or absent: FPGAs and ASICs, DSP blocks, and dedicated units like the CORDIC coprocessor STMicroelectronics still ships inside its STM32 chips.

§6 · The relative

The other way to make a transcendental cheap

There is a rival trick, and setting the two side by side sharpens what CORDIC is. In 1962 — three years after Volder — John Mitchell showed you can multiply and divide by faking a logarithm: because a binary number's position already is roughly its log₂, you can approximate the log by counting and shifting bits, add instead of multiply, then shift back. The most famous descendant of that idea is the fast inverse square root from Quake III: reinterpret a float's raw bits as an integer and you have log₂ for free, already sitting in the exponent field.

Both families make transcendentals cheap, and both, at bottom, exploit the logarithm — but by opposite mechanisms. The bit-log trick is a single, one-shot reinterpretation: read the bits differently and the log is just there, no loop, at the cost of a fixed few percent of error. CORDIC is iterative accumulation: it never reinterprets anything; it grinds out one more correct bit per rung by tabulated rotation, as accurate as you are patient. One reads a logarithm the hardware already wrote down; the other builds an answer up from rotations. Kin in spirit, opposite in method — which is why they win in different places.

The check — every number on this page, recomputed from the recurrences

verifier: 33 / 33 green

Nothing here is a stored answer. The browser runs the real CORDIC iterations live for the instruments above, and research/cordic/verify-cordic.mjs recomputes the same quantities independently — the gain, the ladders, the ranges, the error curve with its floor, and the linear and hyperbolic modes (including a control that shows the hyperbolic repeats are load-bearing). A sampling, computed just now in your browser:

K = 0.60725293500888 · 1/K = 1.64676025812107

circular reach = Σ arctan 2⁻ⁱ = 1.74328662 rad = 99.883°

hyperbolic gain 1/Ah = 1.20749707 · reach = 1.11817 rad · repeats at i = 4, 13, 40, 121

at θ = 40°, N = 14 → CORDIC sin = … , |error| = …

Run it yourself: node research/cordic/verify-cordic.mjs → 33/33 PASS. Sources and the full check list live in research/cordic/.

What is exactly true here, what is idealised, and three things people get wrong

Exactly true. The gain K = 0.6072529350088813 is constant, and the reason is the cancellation shown in §3 — the per-step stretch √(1+2⁻²ⁱ) depends on the shift i only, never on the sign. Circular CORDIC converges for |θ| ≤ Σ arctan 2⁻ⁱ = 1.7433 rad ≈ 99.88°; the linear range is Σ 2⁻ⁱ = 2; the hyperbolic range is ≈ 1.1182 rad and needs the repeated iterations. All of this the verifier reproduces.

Idealised. Our instrument computes in the browser's float64, not in fixed-point integers, so you see the algorithm's convergence and its precision floor — not the exact rounding of any particular chip's word length. A real fixed-point CORDIC also carries quantisation error in the stored angle table; the shape of the curve (slope, then floor) is the same, the floor just sits where that word length puts it.

"HP calculators used CORDIC." Nearly, not exactly. The HP-35 (1972) ran a CORDIC-family method — David Cochran's implementation used the Meggitt pseudo-multiplication/pseudo-division variant, and Jacques Laporte's reverse-engineering shows its exp/ln path differs from textbook circular CORDIC. So: CORDIC-family, yes; the precise rotation this instrument shows, not verbatim.

"Modern CPUs compute sine with CORDIC." Generally false. Where a fast hardware multiplier exists, argument reduction plus a minimax polynomial (or accurate-table methods) beats CORDIC, because CORDIC costs about one add-shift per result bit. In particular, do not believe the common claim that Intel's x87 FSIN "uses CORDIC" — Intel's transcendentals are table/polynomial. CORDIC stays optimal specifically where multipliers are scarce: FPGA/ASIC/DSP and dedicated blocks.

"More iterations, more accuracy." Only up to the floor. Past roughly N ≈ 24 in double precision the error stops falling and bounces around ~10⁻⁹; §4 shows this rather than pretending the line drops forever.