If you're on mypy in 2026, Pyrefly is the obvious upgrade

May 13, 2026·Tim Hopper · Markdown

type-checking

The case for staying on mypy has often been migration cost. Pyrefly 1.0, released May 12, 2026, makes the mechanical half close to free:

uv add --dev pyrefly
uv run pyrefly init
uv run pyrefly check --baseline=pyrefly-baseline.json --update-baseline
uv run pyrefly check

pyrefly init reads an existing mypy.ini or [tool.mypy] section, writes the equivalent Pyrefly configuration, and turns on a preset called legacy that disables the checks mypy does not have. Pyrefly’s inference is more aggressive than mypy’s on the checks both tools run, so a real codebase will surface new diagnostics on the first check pass. The count depends on the codebase and how much was typed before. The baseline-snapshot step writes those errors to JSON so the final check returns clean and CI stays green while the team triages on its own schedule.

That is the news. The rest of this post explains what each command is doing and what the new diagnostics actually look like.

The legacy preset plus a baseline file is the workflow

One common failure mode for mypy migrations is the wall-of-errors problem: a fresh checker surfaces every diagnostic mypy had silently ignored, and the resulting issue list is too big to triage without freezing the merge queue. Pyrefly 1.0 splits that problem in half.

When pyrefly init detects a mypy config, it writes preset = "legacy" into the generated file. The preset is narrower than the name suggests: it turns off three specific Pyrefly check kinds that mypy lacks (bad-override-mutable-attribute, bad-override-param-name, unbound-name), and leaves everything else in place. Pyrefly’s other rules are still stricter than mypy’s. Take Protocol conformance:

from typing import Protocol


class Lang(Protocol):
    def check_version(self, version: str) -> bool: ...


class Conda:
    def check_version(self, language_version: str) -> bool:
        return True


def run(lang: Lang) -> bool:
    return lang.check_version("1.0")


run(Conda())

mypy returns “Success: no issues found.” Pyrefly flags the call: Conda.check_version declares its parameter as language_version, but Lang.check_version declares it as version. mypy treats classes as structurally conformant to a Protocol when the method types match; Pyrefly additionally requires matching parameter names. That extra strictness is one source of the new diagnostics, and it is independent of the legacy preset.

Running pre-commit through the migration produced 47 missing-stub diagnostics (the same ones mypy reports) and 69 new ones from Pyrefly’s stricter checks. The Protocol-conformance pattern shown above appears verbatim in pre_commit/all_languages.py:27. Other categories include super().method() calls that reference methods the parent class does not define.

Baseline files close the loop. pyrefly check --baseline=pyrefly-baseline.json --update-baseline snapshots the current set of errors to JSON. Subsequent checks pick up the baseline and report only diagnostics that are new since the snapshot, so CI stays green while the team works through the 69 on its own schedule. For a worked example on pre-commit and the setup.cfg gotcha that bites projects keeping mypy config in setup.cfg, see how to migrate from mypy to Pyrefly.

The other 1.0 change that matters here is quieter but the impact is just as large. Pyrefly now reads [tool.pyrefly] from pyproject.toml. Older articles, including earlier versions of the handbook’s comparison page, flagged the lack of pyproject.toml support as a coexistence problem. That friction is gone.

Track adoption with `pyrefly report`

Once the codebase passes, the next question is “what fraction of this is actually annotated?” pyrefly report answers it. The command emits JSON with annotation-completeness and type-completeness statistics at every level from function to module, plus an aggregate summary.

The output slots cleanly into CI: pin the current floor, fail the build if a PR regresses it. Combined with experimental baseline files (which snapshot the current set of errors and report only new ones on subsequent runs), the workflow described in how to gradually adopt type checking in an existing Python project is now a Pyrefly-native flow rather than a pattern teams have to assemble themselves.

What 1.0 means beyond the migration story

The migration tooling is the lead, but 1.0 also raised the floor on the metrics teams cite when they pick between checkers.

Conformance to the Python typing spec test suite climbed from 70% at beta to 92.2%. That puts Pyrefly ahead of mypy (59.6%) and ty (still in beta, at 67.4%), behind pyright (95.7%) and Zuban (99.3%). Conformance scores measure how closely each checker implements the typing spec, which is the contract library type stubs are written against. It is not a measure of bugs caught in application code, as Rob Hand’s analysis of the new type checkers argues at length.

Performance shipped against the February baseline: 2-125x faster diagnostics after save, 40-60% less memory, 20-36% faster full project checks on PyTorch and Pandas. The numbers come from open-source type_coverage_py and ty_benchmark suites. Meta also dropped the release cadence from weekly to monthly.

Cursor, VSCodium, and Windsurf users already get Pyrefly via the meta.pyrefly extension on Open VSX, because Pylance is closed-source and licensed to run only on Microsoft-branded VS Code. Positron ships Pyrefly bundled as of 2025.12, having dropped Pyright in the same release. The remaining integration question is Microsoft VS Code itself, where Pylance is the default. Meta announced Pylance compatibility through the Type Server Protocol (TSP) as upcoming work; if it lands, VS Code users could keep Pylance’s hover and rename UX while swapping the analysis backend to Pyrefly.

What about ty?

ty makes a different bet than Pyrefly. Its gradual guarantee promises that adding annotations to working code never introduces new errors. For a codebase not yet under any type checker, that promise is genuinely useful and Pyrefly’s aggressive inference will surface more diagnostics on first run. The FastAPI migration to ty is the most thorough write-up of how that plays out in practice.

For a codebase already on mypy in 2026, the legacy preset plus baseline files lets the migration be one command followed by incremental triage rather than a multi-quarter project. Astral’s acquisition by OpenAI earlier in 2026 is a separate variable to weigh: some teams will read it as a positive signal for ty’s resourcing, others as a strategic uncertainty.

mypy 2.0, released a week earlier, is the answer for teams that want speed without leaving the mypy ecosystem at all. Parallel checking via --num-workers closes part of the speed gap. The conformance gap stays open.

Learn more

Last updated on May 15, 2026

Python 3.14.5 rolls back the incremental garbage collector Bernát Gábor's Recap of the 2026 Python Packaging Summit

Please submit corrections and feedback...