CPython JIT Compiler: Python's Experimental JIT Explained

I spent a decade telling people that Python's speed doesn't matter. "Developer productivity," I'd say, waving my hand like a Jedi. "C extensions. NumPy. Just drop into Rust for the hot path." And I believed it -- until I watched a tight numerical loop in CPython 3.12 run 47x slower than the equivalent C, and had to explain to a product manager why our data pipeline needed six r5.xlarge instances. The cpython jit compiler landing in Python 3.13 and 3.14 is the first serious attempt to close that gap from inside CPython itself, and it's worth understanding what's actually happening under the hood.

This isn't a press release. The JIT doesn't make Python fast yet. But the architecture is genuinely clever, the roadmap is credible, and the implications for the next three years of Python performance are bigger than most people realize.

Python's Performance Problem (and Why a JIT Is the Answer)

Python is slow for a specific, well-understood reason: CPython's main execution loop is an interpreter that dispatches bytecode instructions one at a time. Every , every , every goes through the same giant switch statement, and the CPU's branch predictor has a terrible time with it. The indirect dispatch overhead alone accounts for a shocking percentage of total execution time in tight loops.

Benchmark	Without JIT	With JIT	Change
Richards	44.5ms	37.8ms	~15% faster
Nbody	91.8ms	104ms	~13% slower
Spectral Norm	90.6ms	96.0ms	~6% slower
Fibonacci	6.59s	6.59s	No change
Bubble Sort (Linux)	2.18s	2.03s	~7% faster

CPython JIT Compiler: Python's Experimental JIT Explained

Python's Performance Problem (and Why a JIT Is the Answer)

Related Posts

Copy-and-Patch: The Technique Behind CPython's JIT

What the JIT Actually Compiles

Benchmarks: Where the JIT Helps (and Where It Doesn't)

JIT vs PyPy vs Cython: The Performance Landscape

The Road to a Production JIT

Related Posts