BIP Draft: Multilingual mnemonic display and input conventions #2200

osem23 commented at 11:25 AM on June 23, 2026: none

This adds a Specification BIP draft, "Multilingual mnemonic display and input rules" (resubmission of the previously-closed #2192, updated).

A display wordlist is a 2048-entry list in a target language, index-parallel to the canonical English BIP-39 wordlist. PBKDF2 runs only on the canonical English mnemonic; native-language renderings are a display and input layer with no new cryptographic surface, and every seed produced under the convention is restorable in any BIP-39 wallet via its English form.

The preamble follows the BIP 3 format (Authors, Assigned, Discussion; no Discussions-To/Comments-*). I have not self-assigned a BIP number.

Discussion

bitcoin-dev (2026-06-13): https://groups.google.com/g/bitcoindev/c/Rwo7P5pTA0c
Delving Bitcoin (2026-06-23): https://delvingbitcoin.org/t/bip39-native-language-display-wordlists-mapped-to-canonical-english/2637

Reference implementation (MIT): https://github.com/osem23/bip39-wordlists-tzur — 30 index-paired display wordlists with bidirectional mappings, the 10 canonical BIP-39 wordlists preserved byte-for-byte for spec comparison, a reference validator enforcing every MUST clause, reference decoders in Python, JavaScript, and Swift producing byte-identical seeds, and per-language conformance test vectors across the five BIP-39 entropy lengths.

Shipped in production by the TZUR Wallet suite (iPhone and Windows).

License: BSD-2-Clause (document), MIT (reference implementation).

Add Informational BIP: Multilingual mnemonic display and input conventions

A display wordlist is a 2048-entry list in a target language, index-parallel
to the canonical English BIP-39 wordlist. PBKDF2 runs only on the canonical
English mnemonic; native-language renderings are a display and input layer with
no new cryptographic surface, and every seed produced under the convention is
restorable in any BIP-39 wallet via its English form.

Preamble follows the BIP 3 format. No BIP number self-assigned.

c0ec345f2f

jonatack added the label New BIP on Jun 23, 2026

jonatack renamed this:
~~Add Informational BIP: Multilingual mnemonic display and input conventions~~
BIP Draft: Multilingual mnemonic display and input conventions
on Jun 23, 2026

in bip-multilingual-mnemonic.md:12 in c0ec345f2f outdated

   7 | +  Type: Informational
   8 | +  Assigned: ?
   9 | +  License: BSD-2-Clause
  10 | +  Discussion: 2026-06-13: https://groups.google.com/g/bitcoindev/c/Rwo7P5pTA0c
  11 | +              2026-06-23: https://delvingbitcoin.org/t/bip39-native-language-display-wordlists-mapped-to-canonical-english/2637
  12 | +```

danielabrozzoni commented at 2:27 PM on June 24, 2026:

The preamble should contain

Requires: 39

osem23 commented at 4:51 AM on June 25, 2026:

Done, added Requires: 39 to the preamble.

in bip-multilingual-mnemonic.md:4 in c0ec345f2f

   0 | @@ -0,0 +1,282 @@
   1 | +```
   2 | +  BIP: ?
   3 | +  Layer: Applications
   4 | +  Title: Multilingual mnemonic display and input conventions

danielabrozzoni commented at 3:07 PM on June 24, 2026:

Unfortunately title should be at most 50 characters, and this is 51 😅

osem23 commented at 4:51 AM on June 25, 2026:

Fixed. It's now "Multilingual mnemonic display and input rules" (45 chars).

in bip-multilingual-mnemonic.md:7 in c0ec345f2f

   0 | @@ -0,0 +1,282 @@
   1 | +```
   2 | +  BIP: ?
   3 | +  Layer: Applications
   4 | +  Title: Multilingual mnemonic display and input conventions
   5 | +  Authors: Daniel Osemberg <ceo@blocksight.live>
   6 | +  Status: Draft
   7 | +  Type: Informational

danielabrozzoni commented at 3:09 PM on June 24, 2026:

I think this is a specification BIP. From BIP3:

https://github.com/bitcoin/bips/blob/861e235e93b40e84e86652ef6e80c2f2dbfc1e17/bip-0003.md?plain=1#L175-L185

osem23 commented at 4:51 AM on June 25, 2026:

Agreed, set to Type: Specification.

danielabrozzoni commented at 3:19 PM on June 24, 2026: member

Only gave a first very quick pass, will do another one soon :)

jonatack commented at 6:00 PM on June 24, 2026: member

This draft appears to be mostly AI generated?

Edit: am looking at the document history in https://github.com/osem23/bip39-wordlists-tzur/commits/main/docs/BIP-multilingual-mnemonics.md

osem23 commented at 8:34 PM on June 24, 2026: none

Yes, I used AI as a writing tool, and I'm not going to pretend otherwise. I'm proud of it. But "AI generated" isn't really the question for a spec. The question is whether it's correct and implementable. I stand behind every line and I understand every line. If any specific clause reads as wrong, vague, or unnecessary, point at it and I'll fix or defend it. That's the feedback I can actually use, and this spec is built to be checked rather than trusted.

osem23 commented at 8:35 PM on June 24, 2026: none

Only gave a first very quick pass, will do another one soon :)

Thanks for the pass, all three are good catches.

Address review: title length, Specification type, Requires 39

Per danielabrozzoni's review on PR #2200:
- Title trimmed to 50-char limit (now 45): "...display and input rules"
- Type changed Informational -> Specification (BIP-3: implementable
  with compliant implementations; has validator, decoders, vectors)
- Add Requires: 39, placed after Discussion per BIP-3 field order

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

4bf395ea9f

in bip-multilingual-mnemonic.md:28 in 4bf395ea9f outdated

  23 | +This document does **not** replace BIP-39, does not deprecate any existing BIP-39 wordlist, and does not change the canonical seed-derivation flow. It defines only a display and backup layer that sits above an unchanged BIP-39 core. The following points hold throughout this specification:
  24 | +
  25 | +- **English BIP-39 remains canonical.** The English BIP-39 mnemonic is the only mnemonic fed to PBKDF2-HMAC-SHA512, and the only artifact that determines the derived seed and cross-wallet compatibility. This document does not alter BIP-39 entropy, checksum, Unicode normalization, or PBKDF2 rules.
  26 | +- **Localized wordlists are a display and backup layer only.** A display wordlist is never the password input to PBKDF2. It exists so a user can read and write their backup in their own language.
  27 | +- **The mapping is by word index.** The display token at index `i` corresponds to the English BIP-39 word at index `i`, and to nothing else. There is no per-language entropy, checksum, or key derivation.
  28 | +- **The localized mnemonic is always reversible to the canonical English mnemonic.** The bidirectional mapping is bijective across all 2048 entries (§Display wordlist requirements), so a conformant display mnemonic resolves back to exactly one English BIP-39 mnemonic, deterministically.

murchandamus commented at 9:38 PM on June 24, 2026:

Was that supposed to be a link to the Display wordlist section?

osem23 commented at 4:51 AM on June 25, 2026:

Yes, it points to the Display wordlist requirements section. I use the §section style throughout instead of anchor links. Happy to switch these to real anchors if the editors prefer.

murchandamus commented at 7:04 PM on June 26, 2026:

Yeah, I think links would be preferable.

osem23 commented at 12:51 PM on June 27, 2026:

Done. Converted every § reference to a real anchor link (49aafcc).

in bip-multilingual-mnemonic.md:84 in 4bf395ea9f outdated

  79 | +
  80 | +A wallet that accepts a display mnemonic on restore tokenizes it on whitespace before lookup:
  81 | +
  82 | +1. Tokenize on Unicode whitespace (characters with the Unicode `White_Space` property) plus the ideographic space (`U+3000`) used by the official Japanese BIP-39 mnemonic.
  83 | +2. Normalize every token and the display wordlist to the same Unicode form (NFC) before comparison. Mismatched normalization between input and wordlist causes silent lookup failures on precomposed/decomposed accent pairs. NFC, and the NFKD that BIP-39 applies before PBKDF2, are both safe: they never merge two distinct entries in a conformant wordlist (there are zero NFKD collisions across the reference wordlists).
  84 | +3. If a wallet applies any *lossy* fold to input as a convenience — stripping diacritics, case-folding, or similar — and that fold maps a token to more than one wordlist entry, the wallet MUST reject the token and ask the user to disambiguate. It MUST NOT silently pick one entry. Distinct entries can collapse under accent stripping (for example Vietnamese `được` and `đuốc`, or Swedish `läger` and `lager`), and an arbitrary pick selects the wrong index and derives the wrong seed. Lossy folds are not required by this convention; a wallet that performs none is always conformant. Per-language collision counts are reported by the reference validator and documented in `validation/encoding-notes.md`.

murchandamus commented at 9:49 PM on June 24, 2026:

Maybe you are implying that already, but would it be possible to enforce at word list creation time that no word matches another per list if diacritics were stripped, case was folded or similar? Has that been done for the proposed lists?

E.g., this was done for the French wordlist, where "special French characters "é-è" are considered equal to "e", for example "museau" and "musée" can not be together".

osem23 commented at 4:51 AM on June 25, 2026:

Right now I enforce this at input time, not at construction time the way the French list did. §Input parsing MUST 3: if a lossy fold (diacritic strip, case fold) maps a token to more than one entry, the wallet must reject and ask the user, never auto-pick. The validator already reports per-language collision counts under those folds.

I didn't make it a construction-time MUST because a mechanically-seeded list can't always satisfy it without curation, which is the quality tension you raise in your top comment. I can add it as a construction-time SHOULD with the per-list collision report surfaced, and make it a MUST for any list that claims a curated tier. The input-time disambiguation MUST keeps wallets safe in the meantime.

in bip-multilingual-mnemonic.md:112 in 4bf395ea9f outdated

 107 | +
 108 | +Every wordlist MUST clause above is mechanically enforceable. A reference validator at `validation/validate_all.py` in the reference registry checks each: exactly 2048 entries per file, UTF-8 encoding without BOM, absence of duplicates, absence of leading or trailing whitespace, absence of embedded whitespace under the full Unicode `White_Space` property, absence of hyphen or dash codepoints inside any entry, NFC form for TZUR Original wordlists and for the native-side fields of mappings, test vectors, and compound-entry datasets, and round-trip consistency of the bidirectional mapping against the canonical English wordlist. SHOULD-clause metrics (4-character prefix uniqueness, native-speaker review status, wordlist identifier triple) are not enforced as errors by the validator and are tracked separately in the registry's construction notes and the per-mapping JSON metadata.
 109 | +
 110 | +### Multi-word native concepts
 111 | +
 112 | +Some languages express a single BIP-39 concept only as a multi-word native term: Hebrew `רופא שיניים` (dentist), Turkish `hindistan cevizi` (coconut), Indonesian `kebun binatang` (zoo), Vietnamese multi-syllable words that use native word-spacing. Requirement 4 forbids embedded whitespace, so a conformant wordlist stores such entries as a single glued orthographic token (e.g., `רופאשיניים`, `hindistancevizi`, `kebunbinatang`). This is a structural consequence of the tokenization rule, not an independent requirement.

murchandamus commented at 9:53 PM on June 24, 2026:

I was wondering how so many languages had been created at inception. So the wordlists were created by translating the English words to the target languages?

osem23 commented at 4:51 AM on June 25, 2026:

Yes. Generated by translation, then validated rather than trusted: structural checks, back-translation and forward-translation each with an LLM verdict, multilingual sentence-embedding similarity, Wiktionary cross-reference, and a blind LLM top-8 pass. Process and per-language results are in docs/CONSTRUCTION.md and docs/V2_VALIDATION.md. It isn't a substitute for native-speaker review, which is why the lists are explicitly supersedable.

in bip-multilingual-mnemonic.md:255 in 4bf395ea9f

 250 | +
 251 | +The specific MUST clauses each address a concrete failure mode. Embedded whitespace inside an entry breaks the paper-backup round trip because mnemonic tokenization is whitespace-based; a multi-word entry fragments into two tokens that the wallet cannot resolve, and the seed becomes unrecoverable from text backup. The bijective mapping requirement ensures that translation in either direction is unambiguous. The NFC storage requirement prevents precomposed/decomposed accent mismatches from causing silent lookup failures on restore.
 252 | +
 253 | +The 4-character prefix uniqueness recommendation from the original BIP-39 specification is achievable for English and most Latin-script languages but structurally infeasible for several scripts where word stems and limited short-prefix variety dominate. Requiring it would exclude those languages or force authorship of artificial vocabulary. Treating it as a SHOULD with informational reporting per language preserves the autocomplete benefit where feasible without excluding scripts where it is not.
 254 | +
 255 | +Native-speaker review is recommended (SHOULD) rather than required (MUST) because its absence is a UX risk, not a cryptographic risk. The worst case is a poorly-chosen native word that a future PR can correct; no funds are at stake.

murchandamus commented at 10:05 PM on June 24, 2026:

I don’t follow here. If people had started using the original native words to record their backup, changing the poorly-chosen word would invalidate their backup.

osem23 commented at 4:51 AM on June 25, 2026:

You're right, that line was wrong and I removed it (6608dcb). Published lists are frozen. A correction is a new versioned list, never a mutation of a published one, so an existing backup is never invalidated: it resolves against the exact version that produced it, pinned by SHA-256, with the canonical English mnemonic as the safety net.

in bip-multilingual-mnemonic.md:259 in 4bf395ea9f outdated

 254 | +
 255 | +Native-speaker review is recommended (SHOULD) rather than required (MUST) because its absence is a UX risk, not a cryptographic risk. The worst case is a poorly-chosen native word that a future PR can correct; no funds are at stake.
 256 | +
 257 | +The 9 non-English canonical BIP-39 wordlists are alphabetized independent word selections, not translations of the English list, so they cannot serve as a display layer over an English mnemonic without the user facing semantically unrelated tokens at each index. This convention does not replace those wordlists; it sits parallel to them and fills the role they do not fill.
 258 | +
 259 | +This convention does not eliminate the cross-wallet restore problem for display-only backups; it bounds the problem and defines wallet-level obligations (§Backup and portability policy) that mitigate it. The user-facing safety net is the canonical English mnemonic, which every conformant wallet exposes in any flow that shows a display mnemonic. A backup that includes the canonical English mnemonic is restorable in any BIP-39 wallet without depending on the receiving wallet's wordlist support.

murchandamus commented at 10:06 PM on June 24, 2026:

If the users have to end up recording both the display-words and the English words, how does this solve the issues that non-English speakers are significantly more likely to make mistakes recording the English words?

osem23 commented at 4:51 AM on June 25, 2026:

MUST 1 is an availability obligation on the wallet, not a requirement to record a second English copy. A user can back up in the display language only, and then there is no English transcription step and therefore no English transcription error, which is exactly the failure this removes. English stays viewable and exportable as the portability guarantee and safety net, surfaced and labeled. I clarified this in the text (6608dcb).

in bip-multilingual-mnemonic.md:267 in 4bf395ea9f outdated

 262 | +
 263 | +## Security Considerations
 264 | +
 265 | +- **PBKDF2 input is invariant under this convention.** Only the canonical English mnemonic reaches PBKDF2-HMAC-SHA512. An implementation that feeds the display mnemonic directly to PBKDF2 is non-conformant and produces incompatible seeds. The conformance test vectors in the reference registry exercise the resolve-to-English path for every supported language.
 266 | +- **Strict single-wordlist tokenization.** On restore, every token in the display mnemonic MUST resolve within a single display wordlist. Wallets MUST NOT silently accept mnemonics whose tokens span multiple wordlists, partial-match across wordlists, or fall through to the canonical English wordlist when a display token is unrecognized. Mixed-wordlist input is malformed and is rejected.
 267 | +- **Only the canonical English mnemonic guarantees cross-wallet recovery.** A user whose wallet supports a display wordlist can always recover the seed in any BIP-39 wallet by entering the canonical English mnemonic. A user who backs up only the display mnemonic and then needs to restore in a wallet that does not support the same display wordlist cannot recover without the mapping. The normative wallet-level obligations that follow from this property are defined in §Backup and portability policy above.

murchandamus commented at 10:12 PM on June 24, 2026:

I was somewhat excited by your idea at first, but this approach seems to undermine a big portion of the potential utility of this BIP. If the wordlists are not intended to be stable, I am not sure I see the point.

osem23 commented at 4:51 AM on June 25, 2026:

Agreed, and they are stable. The registry pins v1.0 with the SHA-256 as the load-bearing identifier; lists are frozen per version and never mutated in place. The Rationale line that implied otherwise was the bug, and I fixed it (6608dcb). Stability is the point, the same way it is for BIP-39 itself.

murchandamus commented at 10:38 PM on June 24, 2026: member

I gave this a quick first read. I like the idea of normalizing to the English wordlist under the hood as it directly mitigates one of the worst issues with BIP39’s portability.

That said, the approach to the additional languages feels unappealing to me: producing initial lists by mechanically translating the English word list is bound to cause a number of issues such as the described concerns with terms composed of multiple words and diacritics, which would persist especially for wordlists that don’t get review before publication. As such wordlists would have room for improvement, it implies that there would soon be multiple wordlists for some languages which would cause even more confusion on top of BIP39 language lists vs display language lists. It seems worthwhile to try and pursue more stable higher quality lists from the get-go, so that more languages would only ever have a single wordlist to converge on.

Given the numerous pull requests we’ve had to the BIPs repository where people tried to add more wordlists to BIP39, I would like to suggest only shipping a framework for more languages to be added instead of shipping with placeholder language lists, and to leave the creation of wordlists to the respective language communities.

Since you are creating a new mnemonic scheme that is essentially a breaking change to BIP39 for every language but English, I would alternatively propose that you go further and create a new scheme that is not backwards compatible with BIP39 but instead addresses all issues with BIP39:

use the indices of the words to generate the seed instead of hashing text
encode a version
use a better checksum
if possible encode information about the output script pattern used
maybe create a generic encoding of data with words that then is used to encode a seed in a second BIP

Preferably such a scheme would also use a different number of words so that it cannot be mixed up with BIP39.

jonatack commented at 11:27 PM on June 24, 2026: member

Yes, I used AI as a writing tool, and I'm not going to pretend otherwise. I'm proud of it. But "AI generated" isn't really the question for a spec. The question is whether it's correct and implementable. I stand behind every line and I understand every line. If any specific clause reads as wrong, vague, or unnecessary, point at it and I'll fix or defend it. That's the feedback I can actually use, and this spec is built to be checked rather than trusted.

Thank you for clarifying. My goal isn't to stigmatize and I'm still trying to figure out the best way to handle LLM-generated submissions. I think it's mildly preferable to state upfront to readers and reviewers when the content is mostly LLM output, and to what extent, out of respect for their time. Some may indeed not see any issue. Others may not wish to spend scarce review cycles doing human review of LLM output, or may prefer to delegate review of LLM output out to LLMs, because human review is a scarce and expensive resource. The idea is to respect the community's time and help them allocate it well.

Address review: list stability, English-availability, framework framing

- Remove the incorrect "future PR can correct, no funds at stake" line.
  Corrections are new versioned lists; published lists are frozen; backups
  resolve against the pinned version (SHA-256).
- Clarify Backup MUST 1 is an availability obligation on the wallet, not a
  requirement that the user record a second English copy.
- State explicitly that the BIP specifies a framework and blesses no
  individual wordlist as canonical; list creation belongs to language
  communities.

6608dcb931

osem23 commented at 4:51 AM on June 25, 2026: none

Thanks for the careful read. We agree on the core: normalizing to English under the hood is the win.

On mechanical translation and "multiple lists per language", I think we're closer than it reads. The BIP ships no wordlists into this repo and blesses none as canonical. It specifies the framework: construction, mapping, and input rules, plus a conformance profile where every wordlist-level MUST maps to an executable check. The 30 lists live in a separate registry as a bootstrap corpus, supersedable by native-speaker review. I made that explicit in 6608dcb. So "ship a framework, leave creation to the communities" is the intended end state, not a conflict with it.

On why I shipped a starting corpus and not just an empty framework: it expands practical BIP-39 coverage from 10 languages to ~30 today. The 10 canonical lists cover roughly a third of people by native language. The other two thirds, about 5 billion native speakers, have no list at all. A working corpus, even one communities later refine, lets wallets onboard those users now instead of waiting for 20 separate community list efforts to each reach completion. That reach, opening Bitcoin self-custody to people in their own language, is the whole point of the proposal.

On "multiple lists cause confusion": that's what the (language, version, SHA-256) triple is for. Two lists for one language are two versions, and each backup names the one that produced it. BIP-39 today carries no version identifier at all, so this is strictly more robust, not less.

On stability: I'm committing to immutability-by-version. A published list is frozen, corrections are new versions, no existing backup is invalidated. (You caught a Rationale line that said the opposite; fixed in 6608dcb.)

On going further to a new, non-backwards-compatible scheme (indices to seed, version byte, stronger checksum, script-type encoding, distinct word count): I think that's worth doing, but it's a BIP-39 successor and a different document. This proposal's entire value is zero new cryptographic surface and universal restore in the installed base today, including English-only wallets. Folding a successor in forfeits exactly that, and helps none of those ~5 billion speakers now. I'd support a successor effort on its own track and would contribute, but I'd keep the two separate so this one stays deployable.

osem23 commented at 9:38 AM on June 25, 2026: none

This is not something I originally set out to work on.

My main work is BlockSight.Live, a free Bitcoin explorer. I have worked in the Bitcoin ATM industry in Israel for the last 3+ years and have seen thousands of regular users interact with Bitcoin.

My main goal has always been to build useful tools for Bitcoiners.

While building a Bitcoin wallet with the native explorer integrated into it, I encountered the BIP39 language issue directly, and it bothered me.

Users in Israel still generally have to write down their seed words in English. For many people, that is not natural, and I think it creates a real backup and recovery risk.

I am not trying to change BIP39 itself.

I am trying to explore whether this can be made better while keeping English BIP39 as the canonical base.

murchandamus commented at 7:12 PM on June 26, 2026: member

Thanks for your additional explanations. Your approach makes more sense to me now. I will wait for a bit to let other reviewers have a go and give this another read in a couple weeks or so. I’ll be sure to pay more attention to the versioning of the backup then.

Convert §section references to Markdown anchor links (review feedback) 49aafcc041

in bip-multilingual-mnemonic.md:73 in 49aafcc041

  68 | +5. Be paired with a bidirectional mapping (`english_to_native` and `native_to_english`) that is bijective across all 2048 entries. This is the property that makes display-mnemonic to canonical-English-mnemonic resolution unambiguous in either direction.
  69 | +6. Be stored in Unicode Normalization Form C (NFC). NFKD normalization is applied only to the canonical English mnemonic and the salt before PBKDF2, as BIP-39 already requires. The display wordlist itself never reaches PBKDF2.
  70 | +
  71 | +A display wordlist SHOULD:
  72 | +
  73 | +1. Maximize 4-character prefix uniqueness within the constraints of the target script. Realized uniqueness varies widely across scripts; wallets relying on prefix-based autocomplete fall back to full-word matching whenever prefix uniqueness is below 2048/2048.

danielabrozzoni commented at 4:28 PM on June 29, 2026:

nit: I think the last sentence can be made much easier to read: "wallets relying on prefix-based autocomplete should require full words unless all 2048 words have unique prefixes."

osem23 commented at 6:28 PM on June 30, 2026:

Thanks, that reads better. Took your wording for SHOULD 1. (2b2c516)

in bip-multilingual-mnemonic.md:75 in 49aafcc041

  70 | +
  71 | +A display wordlist SHOULD:
  72 | +
  73 | +1. Maximize 4-character prefix uniqueness within the constraints of the target script. Realized uniqueness varies widely across scripts; wallets relying on prefix-based autocomplete fall back to full-word matching whenever prefix uniqueness is below 2048/2048.
  74 | +2. Be reviewed by a fluent native speaker of the target language before publication. Native-speaker review catches register, idiom, and cultural-neutrality issues that mechanical validation cannot.
  75 | +3. Carry a stable identifier triple of (language code, version string, SHA-256 of the wordlist file) so that a display backup can be matched on restore to the exact wordlist that produced it. The reference registry publishes this triple in each mapping JSON under the keys `language`, `version`, and `sha256`, with a `normalization_form` field set to `"NFC"` for TZUR Original wordlists. Wallets that bundle wordlists SHOULD persist this triple alongside wallet metadata. In registries that use a single pinned version tag (the reference registry's model, documented at `docs/GOVERNANCE.md`), the version string anchors the shipped corpus and is stable; integrators pin the SHA-256 of the wordlist file as the load-bearing change-detection identifier alongside it.

danielabrozzoni commented at 4:36 PM on June 29, 2026:

I think you should define somewhere what's the language code. I think there are a lot of standards, for example ISO-639 has a few different sets to decide from. Or is this up to the creator of the display wordlist to decide?

osem23 commented at 6:28 PM on June 30, 2026:

Good question, it was underspecified. A language identifier is now defined as a BCP 47 tag: ISO 639-1 where it exists (he, ar, ja), ISO 639-3 where it doesn't (fil for Filipino), and a script subtag where needed (zh-Hans / zh-Hant). The load-bearing part of the triple stays the SHA-256 of the wordlist file, the identifier is just the human-facing label.

The registry was using English names, which contradicted that, so I moved the mapping and test-vector language fields to codes and kept the English name under a new name field. The wordlist files are untouched, so no hashes change and no backup is affected. (BIP 2b2c516, registry osem23/bip39-wordlists-tzur@f4871bb)

in bip-multilingual-mnemonic.md:30 in 49aafcc041

25 | +- **English BIP-39 remains canonical.** The English BIP-39 mnemonic is the only mnemonic fed to PBKDF2-HMAC-SHA512, and the only artifact that determines the derived seed and cross-wallet compatibility. This document does not alter BIP-39 entropy, checksum, Unicode normalization, or PBKDF2 rules.
26 | +- **Localized wordlists are a display and backup layer only.** A display wordlist is never the password input to PBKDF2. It exists so a user can read and write their backup in their own language.
27 | +- **The mapping is by word index.** The display token at index `i` corresponds to the English BIP-39 word at index `i`, and to nothing else. There is no per-language entropy, checksum, or key derivation.
28 | +- **The localized mnemonic is always reversible to the canonical English mnemonic.** The bidirectional mapping is bijective across all 2048 entries ([Display wordlist requirements](#display-wordlist-requirements)), so a conformant display mnemonic resolves back to exactly one English BIP-39 mnemonic, deterministically.
29 | +- **Wallets must give users access to the canonical English mnemonic.** In any flow that exposes a display mnemonic, a standard wallet MUST let the user view, copy, or export the canonical English BIP-39 mnemonic, so the backup is recoverable in any BIP-39 implementation ([Backup and portability policy](#backup-and-portability-policy)).
30 | +- **This document specifies a framework, not a blessed set of wordlists.** It defines what makes a display wordlist conformant (the construction, mapping, and input rules, and the conformance profile in which every wordlist-level MUST maps to an executable check). It ships no wordlists into this repository and blesses no individual list as canonical. The reference registry's lists are a bootstrap corpus, explicitly supersedable by native-speaker review; per-language list creation and curation belong to the respective language communities.

danielabrozzoni commented at 1:47 PM on June 30, 2026:

Throughout the document you are talking about the "reference registry", but you have never defined it properly. I would do so here:

It ships no wordlists into this repository and blesses no individual list as canonical. For purposes of this document, the reference registry is the external repository that publishes the example display wordlists, mappings, validator, test vectors, and construction notes used by the reference implementation. The reference registry's lists are a bootstrap corpus, explicitly supersedable by native-speaker review...

I'm also okay with defining it somewhere else, as long as it's clearly defined before mentioning it here

osem23 commented at 6:28 PM on June 30, 2026:

Thanks, good call. Added your definition where the term first appears. (2b2c516)

in bip-multilingual-mnemonic.md:162 in 49aafcc041

 157 | +For *new* wallets specifically, a wallet that implements this convention SHOULD prefer the display-layer path over generating a fresh backup whose seed of record is one of the nine legacy non-English canonical wordlists, when both are available for the same language. The reason is interoperability, not correctness: a display-layer wallet always exposes the universally portable canonical English mnemonic ([Backup and portability policy](#backup-and-portability-policy) MUST 1), whereas a newly minted legacy non-English seed is only restorable in wallets that implement that specific non-English wordlist, which is a smaller and less predictable set. This is a recommendation about which backup to *create* going forward. It does not deprecate the legacy wordlists, does not invalidate any existing backup, and imposes no obligation to migrate funds: an existing legacy non-English seed remains a valid BIP-39 mnemonic and MUST continue to be importable and derivable exactly as before ([Wallet implementation guidance](#wallet-implementation-guidance) MUST 2).
 158 | +
 159 | +## Reference Implementation
 160 | +
 161 | +- **Wordlist registry.** <https://github.com/osem23/bip39-wordlists-tzur>, `main` branch. Ships 30 index-paired display wordlists with bidirectional mappings at `wordlists/tzur-original/`, the 10 canonical BIP-39 wordlists preserved at `wordlists/reference-canonical/` for spec comparison, and a reference validator at `validation/validate_all.py`. Tag `v1.0` pins a stable snapshot for citation continuity.
 162 | +- **Construction notes.** `docs/CONSTRUCTION.md` documents structural rules, disambiguation rules, multi-word-concept handling, per-language notes, and the three-layer validation methodology (structural, back-translation via Google Translate with LLM verdict, forward-translation via Microsoft Azure Translator with LLM verdict).

danielabrozzoni commented at 5:16 PM on June 30, 2026:

I think all the file names here should be links to the files in the wordlist repository.

osem23 commented at 6:28 PM on June 30, 2026:

Done, every registry path in Reference Implementation now links into the wordlist repo. (2b2c516)

osem23 referenced this in commit f4871bbef8 on Jun 30, 2026

Address review: define language identifier, reference registry, links

- Define "language identifier" as a BCP 47 tag (ISO 639-1 where it exists,
  ISO 639-3 e.g. `fil` where it does not, script subtag e.g. `zh-Hans`),
  and replace "language code" throughout. The reference registry's mapping
  `language` field now carries the BCP 47 code (English name under `name`).
- Define "reference registry" where the term first appears.
- Reword the prefix-autocomplete SHOULD for readability.
- Link every registry path in Reference Implementation to the wordlist repo.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

2b2c516594

osem23 referenced this in commit d4b354e49a on Jun 30, 2026