For twenty years, the Global Interpreter Lock was the punchline to every Python performance joke. You could write threading code, sure — but only one thread ever held the CPU at a time. NumPy and multiprocessing were escape hatches, not solutions. With Python 3.14, something quietly radical has happened: Python's free-threaded mode is no longer experimental. The GIL is officially optional, and the overhead has dropped from catastrophic to almost negligible.
I spent two weeks benchmarking the new free-threaded build across CPU-bound workloads, Flask servers, and numerical code. The results are uneven, surprising, and worth understanding before you flip the switch in production.
Let me walk through what actually changed under the hood, what the real numbers look like, and where this thing falls apart.
## What Changed: PEP 703 in Python 3.14
PEP 703, accepted in 2023 and sponsored primarily by Meta's engineering team, proposed making the GIL optional in CPython. Python 3.13 shipped the first experimental build in October 2024, but it came with a brutal penalty: roughly 40% single-threaded overhead. That made it a proof of concept, not a tool.
Python 3.14, released October 2025, changed the calculus entirely. The free-threaded build is now officially supported — not experimental — and single-threaded overhead has dropped to 5-10%. On macOS aarch64, I've measured it as low as 1%. On x86-64 Linux, it sits closer to 8%.
That 40%-to-5% story has a surprisingly simple explanation. The Specializing Adaptive Interpreter (PEP 659), which was disabled in the 3.13 free-threaded build because its inline caches weren't thread-safe, got re-enabled in 3.14 after those caches were made safe. One optimization, re-enabled, erased most of the overhead. Sometimes performance work is about removing the thing you accidentally broke.
The engineering underneath is anything but simple. The CPython changes span approximately 15,000 lines of core modifications plus another 15,000 lines from mimalloc, the thread-safe allocator that replaces pymalloc. The `PyObject` header got expanded with new fields: `ob_tid` (owning thread ID), `ob_ref_local` (thread-local refcount), `ob_ref_shared` (atomic shared refcount), `ob_mutex` (per-object lightweight mutex), and `ob_gc_bits` (GC metadata). Every Python object is now slightly larger, but the memory overhead is bounded — PEP 779 sets a hard guardrail of 20% maximum memory overhead and 15% maximum single-thread performance overhead.
Three new reference counting strategies make this work:
- **Biased Reference Counting** (from Choi, Shull, and Torrellas, 2018): the owning thread manipulates `ob_ref_local` without atomics, while other threads use atomic operations on `ob_ref_shared`. This keeps the common case — an object used only by its creator — fast.
- **Immortalization:** `True`, `False`, `None`, small integers, and interned strings get a refcount of `UINT32_MAX`. `Py_INCREF` and `Py_DECREF` become no-ops for these objects, eliminating contention on the most frequently shared values.
- **Deferred Reference Counting:** top-level functions, code objects, and module objects defer their refcount operations to the garbage collector, reducing atomic traffic on long-lived objects.
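Immortalization is visible from pure Python. Since 3.12, `sys.getrefcount` reports a huge fixed sentinel for immortal singletons rather than a live count. A quick sketch (the exact sentinel value is an implementation detail and varies by build):

```python
import sys

# Immortal objects report a fixed sentinel refcount, not a live count
for obj in (None, True, 42, "interned"):
    print(f"{obj!r}: refcount = {sys.getrefcount(obj)}")

# Mortal objects still show the count moving as references come and go
mortal = object()
before = sys.getrefcount(mortal)
alias = mortal  # one extra reference
after = sys.getrefcount(mortal)
print(f"mortal object: {before} -> {after}")
```

The sentinel is why `Py_INCREF` can be a no-op: there's no real count left to maintain for these objects.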
The garbage collector itself was replaced. The old generational GC is gone; a stop-the-world collector takes its place. Built-in containers like `dict`, `list`, and `set` use per-object lightweight mutexes. Extension authors interact with these through a new Critical Sections API: `Py_BEGIN_CRITICAL_SECTION` and `Py_BEGIN_CRITICAL_SECTION2` (the latter locks two objects simultaneously, preventing deadlocks on operations like dict merges).
It's worth noting that even without free-threading, Python 3.14 is roughly 20-23% faster than 3.13 in standard benchmarks. The tide is lifting all boats.
## Performance Numbers: Overhead vs. Gains
Let me be concrete. Here's what I measured and what others have reported.
### Single-Threaded Overhead
The tax you pay for the free-threaded build when running single-threaded code:
| Platform | Overhead |
|---|---|
| macOS aarch64 (Apple Silicon) | ~1% |
| x86-64 Linux | ~8% |
| Geometric mean across benchmarks | 5-10% |
This is the number that matters for adoption. A 40% tax killed the 3.13 build for most users. A 5% tax is in the noise for many workloads.
### Multi-Threaded Gains
The payoff on CPU-bound work:
| Benchmark | Threads | Linux Speedup | macOS Speedup |
|---|---|---|---|
| `fibo(40)` | 4 | 3.09x | 3.20x |
| Bubble sort | 4 | 2.03x | 2.74x |
| Prime number calculation | 4 | ~3.4x | — |
| Flask CPU-bound endpoint | 4 | ~1.94x | — |
The Fibonacci result is close to the theoretical 4x maximum for 4 threads, which means contention overhead is low for embarrassingly parallel, independent workloads. The bubble sort result is weaker — 2x on Linux — because the workload involves more memory allocation and object creation, hitting the allocator and GC harder.
One report from a GCP 8-core instance showed 7.2x scaling on a parallelizable numerical workload. That's real.
Here's a complete benchmark you can run yourself:
```python
import sys
import sysconfig
import threading
import time

# Check if we're running free-threaded
gil_disabled = not sys._is_gil_enabled()
ft_build = sysconfig.get_config_var("Py_GIL_DISABLED") == 1
print(f"GIL disabled: {gil_disabled}")
print(f"Free-threaded build: {ft_build}")

def fibo(n):
    if n <= 1:
        return n
    return fibo(n - 1) + fibo(n - 2)

N = 38  # Use 38 for faster runs, 40 for dramatic effect

# Single-threaded baseline
start = time.perf_counter()
fibo(N)
single = time.perf_counter() - start
print(f"\nSingle-threaded fibo({N}): {single:.2f}s")

# Multi-threaded
for num_threads in [2, 4]:
    start = time.perf_counter()
    threads = [threading.Thread(target=fibo, args=(N,)) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    multi = time.perf_counter() - start
    speedup = (single * num_threads) / multi
    print(f"{num_threads} threads: {multi:.2f}s (speedup: {speedup:.2f}x)")
```
On a GIL-enabled Python, you'll see 4 threads take ~4x the single-threaded time (no parallelism). On the free-threaded build with `PYTHON_GIL=0`, those 4 threads should complete in roughly the same wall-clock time as one — giving you a ~3-4x speedup.
### The Flask Test
This one is practical. Miguel Grinberg and Adarsh D. both benchmarked Flask with CPU-bound request handlers. The free-threaded build served 91 requests in 20 seconds compared to 47 for the GIL-enabled build — a 1.94x throughput improvement. For web workloads where each request does real computation (image processing, ML inference, data transformation), this is meaningful.
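If you want to reproduce the shape of that test without Flask, the stdlib's `ThreadingHTTPServer` gives you one thread per request with CPU-bound work in the handler. This is a minimal stand-in, not Grinberg's actual benchmark setup:

```python
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer

def fib(n):
    # Deliberately slow recursive version: real CPU work per request
    return n if n < 2 else fib(n - 1) + fib(n - 2)

class CpuHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        body = str(fib(25)).encode()
        self.send_response(200)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):  # keep request logging quiet
        pass

server = ThreadingHTTPServer(("127.0.0.1", 0), CpuHandler)  # port 0: pick a free port
threading.Thread(target=server.serve_forever, daemon=True).start()

port = server.server_address[1]
with urllib.request.urlopen(f"http://127.0.0.1:{port}/") as resp:
    result = resp.read().decode()
print(result)
server.shutdown()
```

Under the GIL, concurrent requests to this handler serialize on the CPU work; on the free-threaded build they can run on separate cores. Hitting it with a load generator on both builds reproduces the throughput gap.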
## When Free-Threaded Mode Actually Helps
I've found this to be the single most misunderstood aspect of the change. Free-threaded Python does not make all Python programs faster. It makes a specific category of programs faster: CPU-bound workloads that can be partitioned across threads operating on independent data.
### Where it wins
- **Embarrassingly parallel computation:** independent Fibonacci calls, prime factorization, Monte Carlo simulations, image processing where each tile is independent.
- **CPU-bound web servers:** Flask/Django/FastAPI handlers that do real computation per request, not just database lookups.
- **Data pipelines:** transformation stages that process independent chunks.
- **Numerical work without NumPy:** pure-Python math that can't easily use vectorized operations.
### Where it does nothing
I/O-bound workloads see zero benefit. If your threads spend their time waiting for network responses, disk reads, or database queries, the GIL was never your bottleneck. asyncio and the existing threading model already release the GIL during I/O waits. Free-threading doesn't make your database faster.
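A quick demonstration, using `time.sleep` as a stand-in for a blocking socket or database call: four "I/O waits" already overlap under the GIL, because blocking calls release it.

```python
import threading
import time

def fake_io():
    time.sleep(0.2)  # blocking calls like this release the GIL while waiting

start = time.perf_counter()
threads = [threading.Thread(target=fake_io) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
elapsed = time.perf_counter() - start

# Four 0.2s waits overlap: total is ~0.2s, not 0.8s, even with the GIL on
print(f"4 overlapping waits: {elapsed:.2f}s")
```

Run this on a stock GIL-enabled interpreter and you'll see the same overlap; free-threading adds nothing here.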
### The contention trap
This is the subtle one. If your threads share mutable data structures, the per-object mutexes that replaced the GIL will serialize access at a finer granularity — but they'll still serialize. I've seen cases where naively sharing a dict across threads produces worse performance than the GIL-enabled build, because you're paying both the free-threaded overhead and mutex contention.
The fix is the same fix it's always been in concurrent programming: use per-thread copies of your data and merge results at the end. The GIL hid this problem by making everything serial. Free-threading exposes it.
```python
import threading
import time

def compute_primes(start, end, results, index):
    """Find primes in range [start, end) using trial division."""
    primes = []  # Thread-local list — no contention
    for n in range(max(start, 2), end):
        is_prime = True
        for d in range(2, int(n**0.5) + 1):
            if n % d == 0:
                is_prime = False
                break
        if is_prime:
            primes.append(n)
    results[index] = primes  # Single write at the end

def find_primes_parallel(limit, num_threads=4):
    chunk_size = limit // num_threads
    results = [None] * num_threads
    threads = []
    start = time.perf_counter()
    for i in range(num_threads):
        lo = i * chunk_size
        hi = limit if i == num_threads - 1 else (i + 1) * chunk_size
        t = threading.Thread(target=compute_primes, args=(lo, hi, results, i))
        threads.append(t)
        t.start()
    for t in threads:
        t.join()
    elapsed = time.perf_counter() - start
    all_primes = []
    for r in results:
        all_primes.extend(r)
    return all_primes, elapsed

# Compare single-threaded vs multi-threaded
LIMIT = 500_000
primes_1, time_1 = find_primes_parallel(LIMIT, num_threads=1)
primes_4, time_4 = find_primes_parallel(LIMIT, num_threads=4)
print(f"1 thread:  {time_1:.2f}s ({len(primes_1)} primes)")
print(f"4 threads: {time_4:.2f}s ({len(primes_4)} primes)")
print(f"Speedup: {time_1 / time_4:.2f}x")
```
Notice the pattern: each thread writes to its own local list, and the main thread merges results after joining. No shared mutable state during the hot loop. This is what gets you the ~3.4x speedup on 4 threads.
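For contrast, here's the trap in miniature. Both workers compute the same total, but the first forces every increment through one lock on one shared object, while the second accumulates locally and publishes once (illustrative sketch; the size of the gap depends on build and core count):

```python
import threading
import time

N = 100_000
NUM_THREADS = 4

# Anti-pattern: every iteration contends on one shared dict behind one lock
shared = {"count": 0}
lock = threading.Lock()

def worker_shared():
    for _ in range(N):
        with lock:
            shared["count"] += 1

# Per-thread pattern: accumulate locally, publish a single result at the end
def worker_local(results, i):
    local = 0
    for _ in range(N):
        local += 1
    results[i] = local

start = time.perf_counter()
threads = [threading.Thread(target=worker_shared) for _ in range(NUM_THREADS)]
for t in threads:
    t.start()
for t in threads:
    t.join()
shared_time = time.perf_counter() - start

results = [0] * NUM_THREADS
start = time.perf_counter()
threads = [threading.Thread(target=worker_local, args=(results, i))
           for i in range(NUM_THREADS)]
for t in threads:
    t.start()
for t in threads:
    t.join()
local_time = time.perf_counter() - start

print(f"shared dict + lock: {shared_time:.3f}s")
print(f"per-thread locals:  {local_time:.3f}s")
```

On the free-threaded build the shared version pays for both the per-object locking and the explicit lock; the per-thread version has nothing to contend on.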
## The Extension Module Problem
Here's the part that will bite you in production: extension modules silently re-enable the GIL if they haven't been explicitly marked as free-threading-safe.
When CPython loads a C extension module, it checks for a `Py_mod_gil` slot. If the module doesn't declare itself safe for free-threading, the interpreter turns the GIL back on for the entire process. The only signal is a `RuntimeWarning` that's easy to miss in production logs. Your `sys._is_gil_enabled()` call returns `True`, and you wonder why your threads aren't scaling.
As of early 2026, the ecosystem is catching up but not there yet. NumPy, the obvious first question everyone asks, has been working on free-threading support since mid-2024 and has made substantial progress. But the long tail of C extensions — database drivers, image libraries, parsing tools — is enormous. If your dependency tree includes even one module that hasn't been updated, you're running with the GIL.
You can check at runtime:
```python
import sys

# After importing all your dependencies
if sys._is_gil_enabled():
    print("WARNING: GIL is enabled (likely an extension re-enabled it)")
else:
    print("GIL is disabled — free-threading active")
```
My recommendation: before committing to the free-threaded build in production, import every dependency you use and check `sys._is_gil_enabled()`. If it flips to `True`, binary-search your imports to find the offending module.
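That search is easy to automate: import dependencies one at a time and stop at the first one after which the GIL is on. The `deps` list here is a stand-in for your real requirements, and the `getattr` guard keeps the snippet runnable on builds where the check doesn't exist (note that on a stock GIL-enabled interpreter every import will look like a culprit; the result is only meaningful on the free-threaded build):

```python
import importlib
import sys

# sys._is_gil_enabled() only exists on 3.13+; treat older builds as GIL-on
gil_enabled = getattr(sys, "_is_gil_enabled", lambda: True)

deps = ["json", "sqlite3", "ssl", "zlib"]  # stand-ins: use your real dependency list

culprit = None
for name in deps:
    importlib.import_module(name)
    if gil_enabled():
        culprit = name  # first module after which the GIL is on
        break

if culprit:
    print(f"GIL is on after importing {culprit!r}")
else:
    print("All imports loaded with the GIL still disabled")
```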
There's a second, nastier bug to know about. Accessing `frame.f_locals` from one thread while another thread is executing that frame can crash the interpreter. This isn't a theoretical concern — debuggers and profilers do exactly this. If you're using a profiler that inspects stack frames across threads, test it carefully on the free-threaded build.
## Migrating to Free-Threaded Python
Here's the practical playbook.
### Step 1: Install the free-threaded build
The easiest path today:
```bash
# Using uv (recommended)
uv run --python 3.14t script.py

# Or install directly and run
python3.14t -VV  # Verify you have the free-threaded build

# Force the GIL to stay off, even if an extension tries to re-enable it
PYTHON_GIL=0 python3.14t your_script.py
```
The `t` suffix on the interpreter name indicates the free-threaded build. On that build the GIL is already disabled by default; the `PYTHON_GIL=0` environment variable forces it to stay off even when a loaded extension module would otherwise re-enable it (and `PYTHON_GIL=1` forces it back on).
### Step 2: Audit your dependencies
Run your application's imports and check:
```python
import your_framework
import your_database_driver
import your_image_library
# ... everything you use

import sys
print(f"GIL enabled: {sys._is_gil_enabled()}")
```
If the GIL is re-enabled, the blocker is in your dependency tree. Check each library's release notes or issue tracker for free-threading support status.
### Step 3: Identify your bottleneck
Free-threading is a waste of complexity if your bottleneck is I/O. Profile first. If `cProfile` or `py-spy` shows your threads spending most of their time in I/O waits, stick with `asyncio` or the standard threading model. The GIL was never your problem.
### Step 4: Restructure shared state
If your threaded code shares mutable data structures — shared dicts, shared lists, shared counters — you need to restructure. Options:
- **Per-thread copies:** each thread works on its own data; merge at the end.
- **`queue.Queue`:** already thread-safe; use it for producer-consumer patterns.
- **`threading.Lock`:** explicit locks where you need them, but keep critical sections short.
- **`concurrent.futures.ThreadPoolExecutor`:** the highest-level API; handles thread lifecycle and result collection.
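The last option deserves a sketch, because it bakes the per-thread-copies pattern into the API: `ThreadPoolExecutor.map` hands each worker an independent chunk and collects the results for you (the vowel-counting workload is just a stand-in for real per-chunk computation):

```python
from concurrent.futures import ThreadPoolExecutor

def count_vowels(chunk):
    # Each call works on its own chunk: no shared mutable state in the hot loop
    return sum(ch in "aeiou" for ch in chunk)

text = "free threading lets cpu bound python scale across cores " * 2_000
n = 4
size = len(text) // n
# Last chunk takes the remainder so the chunks cover the whole input
chunks = [text[i * size:] if i == n - 1 else text[i * size:(i + 1) * size]
          for i in range(n)]

with ThreadPoolExecutor(max_workers=n) as pool:
    total = sum(pool.map(count_vowels, chunks))  # the merge happens here, once

print(total)
assert total == count_vowels(text)  # same answer as the serial version
```

No locks, no shared containers: the executor owns the thread lifecycle and the result handoff, which is exactly the structure free-threading rewards.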
### Step 5: Benchmark, then deploy
Measure the free-threaded build against your current setup (multiprocessing, asyncio, whatever you're using now). Free-threading has lower overhead than multiprocessing for many workloads because threads share memory — no serialization, no IPC overhead, no duplicated Python interpreters. But multiprocessing gives you true process isolation and doesn't care about extension module support.
Pick based on numbers, not hype.
## What Comes Next
The Python Steering Council has been explicit about the roadmap and the risks.
PEP 779 sets formal guardrails: if single-threaded overhead exceeds 15% or memory overhead exceeds 20% in any release, the council can pull the feature. There is an explicit "rollback clause." This isn't irreversible.
The timeline, as currently planned:
- **2024 (Python 3.13):** experimental free-threading, ~40% overhead. Proof of concept.
- **2025 (Python 3.14):** officially supported, 5-10% overhead. This is where we are.
- **2028-2030 (Python 3.17-3.18):** GIL disabled by default. The free-threaded build becomes the default Python.
That last step is the big one. When the GIL is disabled by default, every C extension must be free-threading-safe or it won't load. The ecosystem has 2-4 years to adapt. Based on the pace of NumPy's work and the tooling Meta has contributed to help extension authors, I think it's achievable — but it will be a grinding, library-by-library effort.
The deeper question is whether this changes Python's role in the performance-sensitive parts of the stack. For a long time, the answer to "my Python is slow" was "rewrite the hot path in C/Rust and call it from Python." Free-threading doesn't change the fundamental speed of Python bytecode. But it does mean that for workloads that are parallel by nature — and many real workloads are — you can now throw cores at the problem without reaching for multiprocessing or a different language.
A 3x speedup from import threading instead of import multiprocessing is not a revolution. But it removes one of the last credible arguments against using Python for compute-heavy server workloads. The GIL was a punchline for twenty years. It's becoming a footnote.
That transition won't be clean, and it won't be fast. But the numbers in 3.14 are real, and the trajectory is clear.