OPEN SOURCE

Pseudonymize sensitive data before any LLM gets to see it.

noirdoc is an open-source Python library with a CLI plus a plugin for Claude Code — MIT-licensed and running entirely on your machine.


noirdoc is an open-source Python library and CLI for local, reversible pseudonymization of personal data in documents. The engine runs on your machine, is MIT-licensed, and is maintained by Nextaim.

Here's what it looks like before anything leaves your machine.

Original

Anna Müller, born March 12, 1985, granted her tax advisor Markus Schmidt in Munich a comprehensive power of attorney on April 3, 2024.

Pseudonymized

<<PERSON_1>>, born <<DATE_1>>, granted her tax advisor <<PERSON_2>> in <<CITY_1>> a comprehensive power of attorney on <<DATE_2>>.


What the CLI does for you.

Four building blocks that together cover what local pseudonymization needs: detection, reversibility, file-format support, and local execution.

DETECTORS

Reliably detects German PII.

The detectors are trained on German contracts, letters, and HR documents and reliably find names, addresses, IBANs, tax IDs, and phone numbers — even when the formatting isn't quite clean.
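To give a feel for what detecting one of these PII types involves, here is a small rule-based sketch for German IBANs. It is not how noirdoc's detectors work (those are described above as trained models); the regex, the function names, and the restriction to `DE` IBANs are assumptions made purely for illustration. The checksum step is the standard mod-97 IBAN check.

```python
import re

# Assumption: German IBANs only (DE + 2 check digits + 18 digits),
# written either compactly or in space-separated groups of four.
IBAN_DE = re.compile(r"\bDE\d{2}(?: ?\d{4}){4} ?\d{2}\b")


def iban_checksum_ok(iban: str) -> bool:
    """Standard mod-97 check: a valid IBAN leaves remainder 1."""
    compact = iban.replace(" ", "").upper()
    # Move country code and check digits to the end.
    rearranged = compact[4:] + compact[:4]
    # Letters become two-digit numbers: A=10, B=11, ..., Z=35.
    digits = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(digits) % 97 == 1


def find_ibans(text: str) -> list[str]:
    """Return IBAN-shaped substrings that also pass the checksum."""
    return [
        m.group(0)
        for m in IBAN_DE.finditer(text)
        if iban_checksum_ok(m.group(0))
    ]


sample = "Please transfer the fee to DE89 3704 0044 0532 0130 00 by Friday."
found = find_ibans(sample)
```

The checksum filters out number runs that merely look like an IBAN, which is one reason a plain regex alone tends to produce false positives on messy documents.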

REVERSIBLE

Pseudonymization with a mapping.

Every token points back to the original value. You use the redacted document with your LLM workflow and restore the real data in the response afterwards — all locally.
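The idea of a reversible mapping can be sketched in a few lines. This is a toy illustration, not noirdoc's implementation: the `PlaceholderMap` class, its methods, and the hand-written entity list are all hypothetical, and only the `<<TYPE_N>>` token shape is taken from the example above.

```python
import re


class PlaceholderMap:
    """Toy two-way mapping between original values and <<TYPE_N>> tokens."""

    def __init__(self):
        self.token_for = {}  # original value -> token
        self.value_for = {}  # token -> original value
        self.counters = {}   # entity type -> last index used

    def token(self, entity_type, value):
        # Reuse the token if this value was seen before.
        if value not in self.token_for:
            n = self.counters.get(entity_type, 0) + 1
            self.counters[entity_type] = n
            tok = f"<<{entity_type}_{n}>>"
            self.token_for[value] = tok
            self.value_for[tok] = value
        return self.token_for[value]

    def redact(self, text, entities):
        # entities: (entity_type, value) pairs, assumed to come from a detector.
        for entity_type, value in entities:
            text = text.replace(value, self.token(entity_type, value))
        return text

    def reveal(self, text):
        # Unknown tokens are left untouched.
        return re.sub(
            r"<<[A-Z]+_\d+>>",
            lambda m: self.value_for.get(m.group(0), m.group(0)),
            text,
        )


mapping = PlaceholderMap()
entities = [("PERSON", "Anna Müller"), ("PERSON", "Markus Schmidt"), ("CITY", "Munich")]
original = "Anna Müller granted her tax advisor Markus Schmidt in Munich a power of attorney."
redacted = mapping.redact(original, entities)

# A model's answer reuses the tokens; revealing it restores the real names.
llm_response = "Summary: <<PERSON_1>> authorized <<PERSON_2>>."
revealed = mapping.reveal(llm_response)
```

Because the model only ever sees the tokens, anything it writes about `<<PERSON_1>>` can be mapped back afterwards without the real name having left your machine.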

FORMATS

PDF, DOCX, TXT, and Markdown.

Pseudonymized documents keep their original format. The LLM sees a cleaned copy while you keep working with the original file.

LOCAL

Runs entirely on your machine.

The models run locally and the mapping stays on your disk. No API call ever leaves your machine — not even to us.

INSTALL

Here's how you get noirdoc onto your machine.

Install via pip, then call it from Python or straight from the shell.

$ pip install noirdoc
$ pip install noirdoc[full]    # with all optional detectors
$ noirdoc models pull

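from noirdoc import Redactor

# the namespace keeps the mapping for later reveals
r = Redactor(namespace="mandant-mueller")

# write pseudonymized copies next to the originals
r.redact_file("vertrag.pdf", output="vertrag-clean.pdf")
r.redact_file("brief.docx", output="brief-clean.docx")

# translate responses back (llm_response is the text your LLM returned)
original = r.reveal_text(llm_response)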

# one-shot: mapping is discarded
$ noirdoc redact vertrag.pdf -o vertrag-clean.pdf

# persistent: mapping is preserved
$ noirdoc redact --namespace mandant-mueller brief.docx -o brief-clean.docx
$ noirdoc reveal --namespace mandant-mueller brief-clean.docx -o brief-revealed.docx
$ noirdoc lookup --namespace mandant-mueller "<>" 

MIT License · github.com/nextaim-de/noirdoc

The same engine powers our chat for sensitive data.

If you'd rather not deal with pip, Python, and models yourself, Noirdoc Chat is the managed version: the same pseudonymization code, multiple models, and a GDPR-grade DPA, with no setup on your end.


CLAUDE CODE PLUGIN

Claude Code, without your data ever reaching Claude.

The plugin pseudonymizes your inputs locally before they reach Claude — and restores the responses automatically afterwards.

INSTALL

# add the marketplace once
$ /plugin marketplace add nextaim-de/noirdoc-claude-plugin

# install the plugin inside Claude Code
$ /plugin install noirdoc@nextaim

AUTO REDACT

Redacts without you lifting a finger.

As soon as you open or read a protected file in Claude Code, the plugin replaces names, IBANs, and IDs with placeholders locally — before Claude gets to see anything.

REVEAL

Real values stay in your own terminal.

Run `noirdoc reveal` to see the original — but only in your own shell, never inside the Claude Code transcript. The conversation stays clean.

PATH RULES

You decide what's protected.

Glob rules like `./incoming/**` or `*.contract.*` decide which files are pseudonymized automatically. Everything else stays untouched.
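How such rules might be evaluated can be sketched with Python's standard `fnmatch` module. The matching semantics here are an assumption, not the plugin's documented behavior: patterns containing a slash are matched against the whole relative path, patterns without one against the file name only. The `is_protected` helper and the rule list are hypothetical; the two patterns are the ones named above.

```python
from fnmatch import fnmatchcase

# Hypothetical rule list using the two example patterns from the text.
PROTECTED_PATTERNS = ["./incoming/**", "*.contract.*"]


def _strip_dot_slash(s: str) -> str:
    """Treat './incoming/x' and 'incoming/x' as the same relative path."""
    return s[2:] if s.startswith("./") else s


def is_protected(path: str, patterns=PROTECTED_PATTERNS) -> bool:
    """Return True if the path matches at least one glob rule."""
    rel = _strip_dot_slash(path)
    name = rel.rsplit("/", 1)[-1]
    for pattern in patterns:
        pat = _strip_dot_slash(pattern)
        # Assumed rule: directory patterns see the full path,
        # bare patterns see only the file name.
        target = rel if "/" in pat else name
        if fnmatchcase(target, pat):
            return True
    return False


checks = {
    p: is_protected(p)
    for p in ["./incoming/2024/scan.pdf", "docs/nda.contract.pdf", "src/main.py"]
}
```

With these rules, the first two paths count as protected and `src/main.py` stays untouched.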

LOCAL

Mapping stays on your machine.

Pseudonymized copies live in `.noirdoc/cache/` and the reversible mapping stays local. No API call ever leaves your machine — not even to us.

MIT License · github.com/nextaim-de/noirdoc-claude-plugin

Pick the path that fits you.

Locally with the OSS CLI, inside your editor with the plugin, or managed through Noirdoc Chat — the pseudonymization underneath is always the same.
