BIP 181, 182, 183: BIPs for Utreexo

kcalvinalvin commented at 6:56 am on August 10, 2025: contributor

These are the 3 BIPs that describe Utreexo, a consensus-compatible (non-soft fork) way to send and verify transactions without storing the full UTXO set.

The 3 BIPs are for:

The specification of the Utreexo accumulator.
The specification of Bitcoin block and tx validation using the Utreexo accumulator.
The peer to peer networking changes required to enable Utreexo nodes.

Mailing list post: https://groups.google.com/g/bitcoindev/c/W1lxBraKG_E

kcalvinalvin force-pushed on Aug 10, 2025

jonatack added the label New BIP on Aug 10, 2025

in utreexo-p2p-bip.md:26 in a94f6434c8 outdated

21+This document **does not** describe how to validate blocks and transactions using the provided data, see [Utreexo - Validation Layer](./utreexo-validation-bip.md) for more details.
22+
23+## Motivation
24+
25+Utreexo nodes require the inclusion proof to fully validate blocks and transactions.
26+Each block has an corresponding inclusion proof with it and this inclusion proof for blocks up to height 906,937 requires an additional 631.85GB, which is roughly 40GB less than the size of the block data.

jmoik commented at 2:08 pm on August 11, 2025:

an -> a

jonatack commented at 3:39 pm on August 11, 2025:

there are two, would be this one

0Each block has a corresponding inclusion proof with it and this inclusion proof for blocks up to height 906,937 requires an additional 631.85GB, which is roughly 40GB less than the size of the block data.

kcalvinalvin commented at 6:56 am on August 12, 2025:

Addressed in the latest push

in utreexo-p2p-bip.md:27 in a94f6434c8 outdated

22+
23+## Motivation
24+
25+Utreexo nodes require the inclusion proof to fully validate blocks and transactions.
26+Each block has an corresponding inclusion proof with it and this inclusion proof for blocks up to height 906,937 requires an additional 631.85GB, which is roughly 40GB less than the size of the block data.
27+Each transaction also has an corresponding inclusion proof with it and for normal transaction relay, the proof is roughly 3 times the size of the transaction.

jmoik commented at 2:08 pm on August 11, 2025:

an -> a

kcalvinalvin commented at 6:56 am on August 12, 2025:

Addressed in the latest push

in utreexo-p2p-bip.md:50 in a94f6434c8 outdated

45+3. Archive nodes
46+
47+CSNs have the goal of minimizing data storage and download while performing block validation.
48+Archive and bridge nodes store more data and provide this data to CSNs.
49+
50+Bridge nodes are nodes that can add inclusion proofs to mempool transactions, support the same set of messages as CSNs, and are in fact should be indistinguishable from CSNs on the network.

jmoik commented at 2:10 pm on August 11, 2025:

are in fact should -> should in fact

kcalvinalvin commented at 6:57 am on August 12, 2025:

Addressed in the latest push

in utreexo-p2p-bip.md:98 in a94f6434c8 outdated

93+### Transaction relay
94+
95+![Current TX relay](bip-utreexo-p2p/current-tx-relay.png)
96+
97+Current transaction relay is done by sending an inv message with the hash of the transaction and a type field that denotes that this hash represents a transaction.
98+If the node receiving the inv is does not have a tx matching that hash, it then requests for it using a getdata message.

jmoik commented at 2:15 pm on August 11, 2025:

is

kcalvinalvin commented at 6:57 am on August 12, 2025:

Removed in the latest push

in utreexo-p2p-bip.md:105 in a94f6434c8 outdated

100+![Utreexo TX relay](bip-utreexo-p2p/utreexo-tx-relay.png)
101+
102+The transaction relay for Utreexo nodes doesn't add any extra round trips.
103+However, it does include extra inventory vectors in the inv message.
104+
105+We introduce a new inventory vector type called `utreexoproofhash` which make up the extra information that a Utreexo node will receive.

jmoik commented at 2:17 pm on August 11, 2025:

make -> includes

jonatack commented at 3:42 pm on August 11, 2025:

s/ which make/, which makes/

kcalvinalvin commented at 3:49 am on August 12, 2025:

I’ll go with , which makes since includes sounds like the utreexoproofhash invvect has other information as well

EDIT: Replaced with , which makes in the latest push

in utreexo-p2p-bip.md:302 in a94f6434c8 outdated

297+| Field                      | Type                                | Description                                   |
298+|----------------------------|-------------------------------------|-----------------------------------------------|
299+| length of the Utreexo TTLs | varint                              | The length of the Utreexo summaries           |
300+| Utreexo TTLs               | vector of Utreexo summaries         | The vector of the requested Utreexo summaries |
301+| length of the proof hashes | varint                              | The length of the proof hashes                |
302+| proof hashes               | vector of 32 byte hashes            | The vector of the requested Utreexo summaries |

jmoik commented at 2:19 pm on August 11, 2025:

requested proof hashes*?

kcalvinalvin commented at 7:09 am on August 12, 2025:

Addressed in the latest push

in utreexo-p2p-bip.md:242 in a94f6434c8 outdated

237+
238+| Field        | Type                | Description                                                                                                                                                          |
239+|--------------|---------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|
240+| block height | uint32              | The time-to-live value of a leaf in the Utreexo merkle forest. The value is determined by the amount of leaves that were added to the accumulator since its creation |
241+| length       | varint              | The length of the TTLs                                                                                                                                               |
242+| TTLs         | vector of TTL infos | position in the Utreexo merkle forest when the leaf was removed                                                                                                      |

jmoik commented at 2:21 pm on August 11, 2025:

description?

kcalvinalvin commented at 7:09 am on August 12, 2025:

Added the correct description in the latest push

in utreexo-p2p-bip.md:194 in a94f6434c8 outdated

189+A compact leaf data is defined as:
190+
191+| Field        | type                         | Description     |
192+|--------------|------------------------------|-----------------|
193+| header code  | uint32                       | This is a value obtained by left shifting the block height that confirmed this transaction, and then OR-ing it with 1, only if this transaction is a coinbase. |
194+| amount       | int64                        | The amount in sats locked on this output |

jmoik commented at 2:23 pm on August 11, 2025:

should probably be unsigned

kcalvinalvin commented at 4:38 am on August 12, 2025:

It makes sense to have it as int64 as CAmount is represented as int64 in code https://github.com/bitcoin/bitcoin/blob/273e600e65c2e31a6e9a0bd72b40672aaa503b08/src/consensus/amount.h#L12

Other implementations follow this as well:https://github.com/btcsuite/btcd/blob/baebb836c2d4692da3de3b0d437f4da6ce915546/wire/msgtx.go#L337

jmoik commented at 2:34 pm on August 11, 2025: none

some typos

in utreexo-p2p-bip.md:13 in a94f6434c8 outdated

 8+Comments-URI: TBD
 9+Status: Draft
10+Type: Specification
11+Created: 2024-08-08
12+License: BSD-3-Clause
13+Depends: BIP-???? (Utreexo - Peer Services)

jonatack commented at 7:41 pm on August 11, 2025:

Per BIPs 2 and 3, this would be “Requires” (and currently refers to the same BIP)

0Requires: BIP-???? (Utreexo - Peer Services)

kcalvinalvin commented at 6:53 am on August 12, 2025:

Addressed in the latest push

in utreexo-validation-bip.md:13 in a94f6434c8 outdated

 8+Comments-URI: TBD
 9+Status: Draft
10+Type: Specification
11+Created: 2023-10-01
12+License: BSD-3-Clause
13+Depends: BIP-???? (Utreexo Accumulator Specification)

jonatack commented at 7:41 pm on August 11, 2025:

Per BIPs 2 and 3, this would be “Requires”

0Requires: BIP-???? (Utreexo Accumulator Specification)

kcalvinalvin commented at 6:53 am on August 12, 2025:

Addressed in the latest push

in utreexo-accumulator-bip.md:13 in a94f6434c8 outdated

 8+Comments-URI: TBD
 9+Status: Draft
10+Type: Specification
11+Created: 2025-06-18
12+License: BSD-3-Clause
13+Depends: BIP-???? (Utreexo Accumulator Specification)

jonatack commented at 7:42 pm on August 11, 2025:

Refers to the same document. If correct, this line should be dropped.

kcalvinalvin commented at 6:54 am on August 12, 2025:

Dropped in the latest push

in utreexo-accumulator-bip.md:56 in a94f6434c8 outdated

51+the accumulator tracks the current set of unspent transaction outputs (UTXOs).
52+
53+The Utreexo accumulator is based on an append-only Merkle tree design introduced in [^1],
54+which provides logarithmic-sized inclusion proofs. Utreexo extends this design to support dynamic updates,
55+specifically enabling deletions from the set—a requirement for tracking UTXO spends in Bitcoin.
56+To accommodate this, Utreexo increases the storage requirement for the accumulator state to O(log₂(N)),

jonatack commented at 7:47 pm on August 11, 2025:

“increases the requirement” – perhaps mention here “compared to the UTXO set”

luisschwab commented at 10:05 pm on August 11, 2025:

0To accommodate this, Utreexo increases the storage requirement for the accumulator state to $O(log_2(N))$,

LaTeX renderers don’t play nice with this unicode symbol.

kcalvinalvin commented at 5:03 am on August 12, 2025:

Ah the paragraph could be worded better.

It’s referring to how the merkle forest is expanded to support more leaves. Like sparse merkle trees, you pre-allocate the Utreexo accumulator to hold 2^n leaves. If you want to hold (2^n)+1 leaves, you need to resize the accumulator to hold 2^n+1 leaves.

kcalvinalvin commented at 6:05 am on August 12, 2025:

~~Oh I read it wrong too. It increases the requirements vs the paper referenced in [^1].~~

~~Fixing this…~~

Changed the sentence to improve legibility

kcalvinalvin commented at 6:52 am on August 12, 2025:

Addressed in the latest push

in utreexo-validation-bip.md:39 in a94f6434c8 outdated

34+long-term scalability concern.
35+
36+Utreexo is a dynamic accumulator that enables the UTXO set to be represented in just a few kilobytes,
37+by requiring peers to provide additional proof data to verify the inclusion of a UTXO in the
38+accumulator. This allows for the construction of extremely lightweight nodes capable of performing
39+the same validation as a full node, without the need to store the entire UTXO set.

jonatack commented at 9:38 pm on August 11, 2025:

The preceding 3 paragraphs seem to be duplicates of the accumulator BIP that this BIP requires. Can perhaps remove them or refer to the accumulator BIP motivation.

kcalvinalvin commented at 6:56 am on August 12, 2025:

Removed the preceding 3 paragraphs in the latest push

in utreexo-accumulator-bip.md:554 in a94f6434c8 outdated

550+While RSA accumulators and similar constructions offer significant advantages in proof size—often allowing a
551+single proof to cover an entire block's worth of UTXOs—the trade-offs in proof generation cost and latency are
552+substantial. In RSA-based designs, creating a proof for any given UTXO at arbitrary times can be computationally
553+intensive, especially as the number of UTXOs grows.
554+
555+Utreexo's design is driven by the need for Bridge Nodes: nodes that maintain backward compatibility with existing

jonatack commented at 9:48 pm on August 11, 2025:

This BIP appears to be missing a required backwards compatibility section.

kcalvinalvin commented at 6:56 am on August 12, 2025:

Added a backwards compatibility section

jonatack commented at 9:52 pm on August 11, 2025: member

Thank you for proposing these drafts. They already look quite complete with respect to the editorial requirements (BIPs 2 and 3). I’ve done a cursory first pass. No immediate conceptual feedback. A few editorial comments follow; feel free to ignore them during conceptual review until they are applicable.

in utreexo-accumulator-bip.md:66 in a94f6434c8 outdated

61+The Utreexo accumulator consists of a set of Merkle trees: specifically, perfect binary trees with $2^n$ elements,
62+where each node in the tree contains a 32-byte hash. The elements being stored appear at the leaves—the bottom layer of the tree.
63+The topmost node is referred to as the "root," while nodes located between the leaves and the root are called "intermediate nodes."
64+
65+Any integer number of elements ($N$) can be represented as a forest of such trees. On average, a set of N elements will require
66+approximately $\frac{log₂(N)}{2}$ trees. The number and sizes of trees are determined by the binary representation of $N$:

luisschwab commented at 10:05 pm on August 11, 2025:

0approximately $\frac{log_2(N)}{2}$ trees. The number and sizes of trees are determined by the binary representation of $N$:

LaTeX renderers don’t play nice with this unicode symbol.

kcalvinalvin commented at 6:52 am on August 12, 2025:

Addressed in the latest push

kcalvinalvin force-pushed on Aug 12, 2025

petertodd commented at 3:52 pm on August 12, 2025: contributor

You need to justify why you’re using SHA-512/256 rather than SHA-256, like the rest of the Bitcoin protocol. Right now you just link to a paper from 2011. But that paper is out of date now that hardware support for SHA-256 has become common.

1BitcoinBoWP1FZ4xwTNkq6XksKidmgYYw commented at 6:29 pm on August 12, 2025: contributor

I strongly recommend replacing SHA-256 with SHAKE256 (from the SHA-3 standard) for the following reasons:

1. Security Advantages

🔒 Provides built-in protection against length-extension attacks
📏 Offers flexible output lengths (supports 128-bit and 256-bit security levels)
⚙️ Based on Keccak sponge construction (NIST FIPS 202 standard)
🌐 Aligns with post-quantum cryptography standards

2. Comparative Analysis: SHA-256 vs SHAKE256

Characteristic	SHA-256	SHAKE256
Algorithm Family	SHA-2	SHA-3 (Keccak)
Output Flexibility	Fixed 256-bit	Arbitrary length
Security Properties	Vulnerable to length-extension	Resistant to length-extension
Internal Structure	Merkle-Damgård	Sponge function
Standardization	NIST FIPS 180-4	NIST FIPS 202

3. Functional Example

Input: Bitcoin

SHAKE256 (512-bit output):
6beb0661ba1fa7289bf359fbb81550bd9641cf5abc62a14d466c421c8a86e528e027632ec0e7ceb994650566f3c8258af2240333b6d0e9186766fd2c1ebb763a

SHAKE256 (256-bit output):
6beb0661ba1fa7289bf359fbb81550bd9641cf5abc62a14d466c421c8a86e528

4. Implementation Benefits

✅ Maintains 256-bit output compatibility where needed
✅ Future-proofs against emerging cryptographic vulnerabilities
✅ Reduces potential attack vectors through improved design
✅ Supports Bitcoin’s security evolution while maintaining performance

5. Technical Reference

For detailed cryptographic differences:
Cryptographic Comparison: SHA-2 vs SHA-3

kcalvinalvin commented at 11:06 am on August 18, 2025: contributor

You need to justify why you’re using SHA-512/256 rather than SHA-256, like the rest of the Bitcoin protocol. Right now you just link to a paper from 2011. But that paper is out of date now that hardware support for SHA-256 has become common.

Sure we can update the accumulator BIP with benchmarks for SHA512/256 vs SHA256.

But could you link to the aforementioned justifications for the other parts of the Bitcoin protocol that use SHA512?

kcalvinalvin commented at 11:10 am on August 18, 2025: contributor

I strongly recommend replacing SHA-256 with SHAKE256 (from the SHA-3 standard) for the following reasons:

SHAKE256 is not used in Bitcoin and introduces a new hash which increases the trust-assumption. We do not want to do this.

bitcoin deleted a comment on Aug 18, 2025

1BitcoinBoWP1FZ4xwTNkq6XksKidmgYYw commented at 2:32 pm on August 18, 2025: contributor

The reliance of Bitcoin on SHA-2—a legacy hash function designed by the National Security Agency (NSA)—introduces non-trivial security risks, particularly when considering the often-dismissed threat posed by quantum adversaries.

Migrating to SHAKE256 (a variant of SHA-3) would represent a meaningful improvement, though such a change merely delays the inevitable: Bitcoin must eventually transition to a quantum-resistant cryptographic framework. When this occurs—and it will, regardless of opposition—SHA-2, along with ECDSA private keys, public keys, and signatures, will become obsolete.

See: Lenght extension attack (Bitcoin is vulnerable because it’s using SHA-256)

bitcoin deleted a comment on Aug 18, 2025

jonatack commented at 2:35 pm on August 18, 2025: member

Some friendly moderation to keep the discussion focused on technical review – thanks.

kcalvinalvin commented at 2:46 pm on August 18, 2025: contributor

The reliance of Bitcoin on SHA-2—a legacy hash function designed by the National Security Agency (NSA)—introduces non-trivial security risks, particularly when considering the often-dismissed threat posed by quantum adversaries.

SHA256 and SHA512 are quantum resistent.

Migrating to SHAKE256 (a variant of SHA-3) would represent a meaningful improvement, though such a change merely delays the inevitable: Bitcoin must eventually transition to a quantum-resistant cryptographic framework. When this occurs—and it will, regardless of opposition—SHA-2, along with ECDSA private keys, public keys, and signatures, will become obsolete. See: Lenght extension attack (Bitcoin is vulnerable because it’s using SHA-256)

Ok but this has nothing to do with this BIP.

murchandamus commented at 10:15 pm on August 18, 2025: contributor

@1BitcoinBoWP1FZ4xwTNkq6XksKidmgYYw, please cut out the LLM generated comments. If any of us were interested in seeing an LLM’s prediction of what might be said about a topic, we could prompt one ourselves.

petertodd commented at 10:18 pm on August 18, 2025: contributor

On Mon, Aug 18, 2025 at 04:06:51AM -0700, Calvin Kim wrote:

kcalvinalvin left a comment (bitcoin/bips#1923)

You need to justify why you’re using SHA-512/256 rather than SHA-256, like the rest of the Bitcoin protocol. Right now you just link to a paper from 2011. But that paper is out of date now that hardware support for SHA-256 has become common.

Sure we can update the accumulator BIP with benchmarks for SHA512/256 vs SHA256.

But could you link to the aforementioned justifications for the other parts of the Bitcoin protocol that use SHA512?

No part of the Bitcoin consensus protocol uses SHA512.

kcalvinalvin commented at 6:17 am on August 19, 2025: contributor

On Mon, Aug 18, 2025 at 04:06:51AM -0700, Calvin Kim wrote: kcalvinalvin left a comment (bitcoin/bips#1923) > You need to justify why you’re using SHA-512/256 rather than SHA-256, like the rest of the Bitcoin protocol. Right now you just link to a paper from 2011. But that paper is out of date now that hardware support for SHA-256 has become common. Sure we can update the accumulator BIP with benchmarks for SHA512/256 vs SHA256. But could you link to the aforementioned justifications for the other parts of the Bitcoin protocol that use SHA512? No part of the Bitcoin consensus protocol uses SHA512.

Ok but you’ve stated in your previous comment “You need to justify why you’re using SHA-512/256 rather than SHA-256, like the rest of the Bitcoin protocol”. Would be very helpful to see what type of justifications the other protocols have made.

Second, I don’t think it matters if SHA512 wasn’t used in the Bitcoin consensus protocol. SHA512 is used in BIP32 and the argument that SHA512 is safe for generating private keys but not safe for Bitcoin consensus isn’t sound.

I think our original justification (better performance with SHA512/256) mentioned in the BIP is sound. Happy to provide the benchmarks, they’re being worked on at the moment.

1BitcoinBoWP1FZ4xwTNkq6XksKidmgYYw commented at 6:22 am on August 20, 2025: contributor

SHA512 is safe for generating private keys

Lol, what did you say?

kcalvinalvin commented at 6:35 am on August 20, 2025: contributor

SHA512 is safe for generating private keys

Lol, what did you say?

Dude, go look up on chatgpt how SHA512/256 works. Length extension attacks that you mentioned DOES NOT work on it because the outputs are truncated. BIP32 uses HMAC-SHA512 which is just a keyed SHA512.

Why do I even have to deal with this guy. It’s clear he doesn’t know anything. His comments are worthless and this is wasting my time and energy.

kcalvinalvin commented at 7:16 am on August 20, 2025: contributor

This is the type of email he sends me after I block him. I’m sorry for posting unrelated comments here but imho he should be blocked from this repo.

in utreexo-validation-bip.md:66 in d1d03420ac outdated

61+hash, you must compute the SHA-512/256 hash of the following data:
62+
63+| Name              | Type                     | Description                               |
64+| ----------------- | ------------------------ | ----------------------------------------- |
65+| Utreexo_Tag_V1    | 64 byte array            | The version tag to be prepended to the leafhash. |
66+| Utreexo_Tag_V1    | 64 byte array            | The version tag to be prepended to the leafhash. |

lucad70 commented at 7:13 pm on August 21, 2025:

For clarification, is the Utreexo_Tag_V1 really used twice in preimage to the hash?

murchandamus commented at 4:38 pm on August 27, 2025:

My guess would be that this duplication is unintended.

0| Name              | Type                     | Description                               |
1| ----------------- | ------------------------ | ----------------------------------------- |
2| Utreexo_Tag_V1    | 64 byte array            | The version tag to be prepended to the leafhash. |

kcalvinalvin commented at 8:58 am on August 29, 2025:

Oh no the duplication is intended.

Since we use SHA512/256 as the hash function, each chunk is 128 bytes. Since the version tag is only 64 bytes, we need two of them.

petertodd commented at 1:48 pm on August 24, 2025: contributor

On Mon, Aug 18, 2025 at 04:06:51AM -0700, Calvin Kim wrote: kcalvinalvin left a comment (bitcoin/bips#1923) > You need to justify why you’re using SHA-512/256 rather than SHA-256, like the rest of the Bitcoin protocol. Right now you just link to a paper from 2011. But that paper is out of date now that hardware support for SHA-256 has become common. Sure we can update the accumulator BIP with benchmarks for SHA512/256 vs SHA256. But could you link to the aforementioned justifications for the other parts of the Bitcoin protocol that use SHA512? No part of the Bitcoin consensus protocol uses SHA512.

Ok but you’ve stated in your previous comment “You need to justify why you’re using SHA-512/256 rather than SHA-256, like the rest of the Bitcoin protocol”. Would be very helpful to see what type of justifications the other protocols have made.

Second, I don’t think it matters if SHA512 wasn’t used in the Bitcoin consensus protocol. SHA512 is used in BIP32 and the argument that SHA512 is safe for generating private keys but not safe for Bitcoin consensus isn’t sound.

I think our original justification (better performance with SHA512/256) mentioned in the BIP is sound. Happy to provide the benchmarks, they’re being worked on at the moment.

The question is 1) why are we added one new dependency to consensus implementations, and 2) is this actually a performance increase, given that dedicated SHA256 hardware is becoming common?

Length-extension attacks are not relevant for this use-case as we are only committing to public data.

in utreexo-accumulator-bip.md:10 in 8444a28331 outdated

 5+Authors: Tadge Dryja <rx@awsomnet.org>
 6+         Calvin Kim <bip@calvinkim.info>
 7+         Davidson Souza <bip@dlsouza.dev>
 8+Comments-URI: TBD
 9+Status: Draft
10+Type: Specification

murchandamus commented at 7:02 pm on August 25, 2025:

Nit: BIP 2 is still active, so this should be “Standard Track” for the time being.

in utreexo-accumulator-bip.md:17 in d1d03420ac outdated

12+License: BSD-3-Clause
13+```
14+
15+## Abstract
16+
17+This BIP describes the Utreexo accumulator and it's operations. It lays down how to update the

murchandamus commented at 7:04 pm on August 25, 2025:

0This BIP describes the Utreexo accumulator and its operations. It lays down how to update the

in utreexo-accumulator-bip.md:56 in d1d03420ac outdated

51+
52+The Utreexo accumulator is based on an append-only Merkle tree design introduced in [^1],
53+which provides logarithmic-sized inclusion proofs. Utreexo extends this design to support dynamic updates,
54+specifically enabling deletions from the set—a requirement for tracking UTXO spends in Bitcoin.
55+To accommodate this, Utreexo changes the storage requirement from the accumulator design in [^1] to $O(log_2(N))$,
56+where N is the number of elements ever added to the set, while still keeping proof sizes small and verification efficient.

murchandamus commented at 8:36 pm on August 25, 2025:

In case this doesn’t get discussed later, it might be interesting to compare how O(log2(N)) for all transaction outputs ever created compare to the current UTXO set size.

kcalvinalvin commented at 8:53 am on August 29, 2025:

Technically the current Utreexo design is O(log2(N)) of all txos since the forest doesn’t shrink on a deletion. We just move the leaf up so it has the same affect as shrinking the forest.

in utreexo-accumulator-bip.md:176 in d1d03420ac outdated

171+
172+## Utility Functions
173+
174+The following utility functions are required for performing accumulator operations:
175+
176+**parent_hash(left, right):** Returns the hash of the concatenation of two child hashes (`left` and `right`).

murchandamus commented at 8:44 pm on August 25, 2025:

Does this ambiguity regarding the depth of the leaf in the tree not introduce similar weaknesses as the original Merkle tree construction? Why would we float up leaf-hashes rather than create a tagged hash at each level?

Is this fully mitigated due to the number of leaves being known?

kcalvinalvin commented at 9:07 am on August 29, 2025:

Does this ambiguity regarding the depth of the leaf in the tree not introduce similar weaknesses as the original Merkle tree construction?

Not quite sure which weakness you’re referring to here. Is it CVE-2012-2459 (one from calculating the Bitcoin block header commitment)? Since we don’t duplicate hashes, it’s not vulnerable to that particular attack.

Why would we float up leaf-hashes rather than create a tagged hash at each level?

Since we float up the leaf hashes, we can save on the proofs being sent over for the sibling later on.

On a tree like so, proof for 01 is 00, 09, 13.

014
1|---------------\
212              13
3|-------\       |-------\
408      09      10      11
5|---\   |---\   |---\   |---\
600  01  02  03  04  05  06  07

If we delete 00, then 01 moves up to 08. The proof for 01 is now 09 and 13. The proof got shorter.

014
1|---------------\
212              13
3|-------\       |-------\
401      09      10      11
5|---\   |---\   |---\   |---\
6        02  03  04  05  06  07

adiabat commented at 2:52 am on September 2, 2025:

That’s a good point, and we can add a bit about this potential issue in the BIP.

Leaves can move up, and it’s no longer leaves getting hashed with leaves, but leaf/internal node pairs happen often. An attack would be to grind through transactions to get two leaf hashes that together could look like a leaf data preimage for a rogue UTXO.

The reason this isn’t a problem is that in all cases when a node is verifying a UTXO proof, the full UTXO data is known and a keyed hash is used (see “UTXO Hash Preimages” section of the validation BIP) to get from the UTXO data to the leaf. The first 128 bytes input to the hash function are the tags (the hash of “UtreexoV1”). Since this tag is only used for UTXO data, and not in internal accumulator hashes, this should prevent any internal hashes from being interpreted as UTXO data.

in utreexo-accumulator-bip.md:191 in d1d03420ac outdated

186+    if right is None: return left
187+
188+    return sha512_256(left + right)
189+```
190+
191+**treerows(numleaves):** Returns the minimum number of bits required to represent `numleaves - 1`. This corresponds to the height of the largest tree in the forest. Returns `0` if `numleaves` is `0`.

murchandamus commented at 8:49 pm on August 25, 2025:

The numleaves - 1 throws me off here. It’s not obvious to me, why the function would be defined that way rather than the “minimum number of bits required to represent numleaves”? Perhaps a bit more context would help?

kcalvinalvin commented at 9:26 am on August 29, 2025:

Ah it’s because we wanted treerows to return the index of the largest tree not the length. For the below tree, numleaves = 4 but we want treerows to return 2 not 3.

0row 2: 06
1       |-------\
2row 1: 04      05
3       |---\   |---\
4row 0: 00  01  02  03

If we just took the minimum number of bits to represent numleaves = 4, we’d get 3. So to account for this, we take the minimum number of bits needed to represent numleaves-1. This off-by-one happens when numleaves is a power of two. @adiabat did talk about wanting to make treerows return the length and not the index a while back so last chance to speak up? :)

I’ve added the explanation in the bip as well.

murchandamus commented at 8:36 pm on September 16, 2025:

I think I understand the calculation that you are doing, but what’s a bit unclear is the reasoning behind it. What’s the relationship between the thing you are calculating vs the input?

in utreexo-accumulator-bip.md:241 in d1d03420ac outdated

236+Implementation:
237+
238+```python
239+def parent(position: int, total_rows: int) -> int:
240+    return (position >> 1) | (1 << total_rows)
241+```

murchandamus commented at 9:05 pm on August 25, 2025:

I could have used a little more explanation why this returns the parent, but staring at it for a bit, it seems to me that a fully filled tree with 2n leaves would have 2n-1 inner nodes, meaning that all leaves start with a zero in the first position and all inner nodes starting with a one.

E.g. for four leaves, the leaves are 000, 001, 010, and 011, and the inner nodes would be 100, 101, 110.

For 000 and 001, shifting to the right gives 00 and setting the top bit makes the parent 100. For 010 and 011, it works out to be 101. For 100 and 101, it works out to 110.

Gotcha, cool.

in utreexo-accumulator-bip.md:554 in d1d03420ac outdated

549+While RSA accumulators and similar constructions offer significant advantages in proof size—often allowing a
550+single proof to cover an entire block's worth of UTXOs—the trade-offs in proof generation cost and latency are
551+substantial. In RSA-based designs, creating a proof for any given UTXO at arbitrary times can be computationally
552+intensive, especially as the number of UTXOs grows.
553+
554+Utreexo's design is driven by the need for Bridge Nodes: nodes that maintain backward compatibility with existing

murchandamus commented at 9:16 pm on August 25, 2025:

New jargon is usually italicized on introduction, perhaps consider:

0Utreexo's design is driven by the need for *bridge nodes*: nodes that maintain backward compatibility with existing

murchandamus commented at 9:22 pm on August 25, 2025: contributor

I had a look at most of the Accumulator Specification for the first helping. Looks very good already. I only reviewed the function definitions up to root_position, then skimmed the rest, before reading on from Rationale.

in utreexo-validation-bip.md:4 in ca511ff1de outdated

0@@ -0,0 +1,328 @@
1+```
2+BIP: TBD
3+Layer: Peer Services
4+Title: Utreexo - Validation Layer

murchandamus commented at 4:14 pm on August 27, 2025:

The title feels a bit odd to me. It could be a bit more descriptive, I was thinking “Utreexo - Transaction and block validation” or smth?

in utreexo-validation-bip.md:10 in ca511ff1de outdated

 5+Authors: Tadge Dryja <rx@awsomnet.org>
 6+         Calvin Kim <bip@calvinkim.info>
 7+         Davidson Souza <bip@dlsouza.dev>
 8+Comments-URI: TBD
 9+Status: Draft
10+Type: Specification

murchandamus commented at 4:16 pm on August 27, 2025:

Nit: Until BIP 3 activates, this should be Standards Track.

in utreexo-validation-bip.md:20 in d1d03420ac outdated

15+
16+## Abstract
17+
18+This BIP defines the rules for validating blocks and transactions using the
19+Utreexo accumulator. It is important to note that this BIP does not define the
20+Utreexo accumulator itself, for that see BIP-????. This document is only concerned with

murchandamus commented at 4:32 pm on August 27, 2025:

Maybe for the time being:

0Utreexo accumulator itself, for that see [‎BIP Utreexo Accumulator](‎utreexo-accumulator-bip.md). This document is only concerned with

in utreexo-validation-bip.md:52 in d1d03420ac outdated

47+## Specification
48+
49+### Node Hashes
50+
51+During a node's normal operation, it will need to compute the leaf hash for UTXOs
52+being added or removed from the accumulator. The leaf hash is a 32 byte hash that

murchandamus commented at 4:35 pm on August 27, 2025:

0being added or removed from the accumulator. The leaf hash is a 32-byte hash that

in utreexo-validation-bip.md:60 in d1d03420ac outdated

55+
56+Unless otherwise specified, all fields are in little-endian format.
57+
58+#### UTXO Hash Preimages
59+
60+Individual UTXOs are represented as 32 byte hashes in the Utreexo accumulator. To obtain this

murchandamus commented at 4:36 pm on August 27, 2025:

0Individual UTXOs are represented as 32-byte hashes in the Utreexo accumulator. To obtain this

in utreexo-validation-bip.md:79 in d1d03420ac outdated

74+
75+Each field being defined as follows:
76+
77+##### Version tag
78+
79+We use tagged hashes for the hashes committed in the accumulator for versioning

murchandamus commented at 4:45 pm on August 27, 2025:

Maybe link to the section introducing tagged hashes in BIP 340: https://github.com/bitcoin/bips/blob/master/bip-0340.mediawiki#user-content-Design

in utreexo-validation-bip.md:84 in d1d03420ac outdated

79+We use tagged hashes for the hashes committed in the accumulator for versioning
80+purposes. This is added so that if there are changes in the preimage of the
81+hash, the version tag helps to avoid misinterpretation.
82+
83+The Utreexo version tag is the SHA512 hash of the string `UtreexoV1`, which is represented as the vector
84+`[85 116 114 101 101 120 111 86 49]` and hex `0x5574726565786f5631`.  (The resulting 64 byte output is

murchandamus commented at 4:46 pm on August 27, 2025:

0`[85 116 114 101 101 120 111 86 49]` and hex `0x5574726565786f5631`.  (The resulting 64-byte output is

in utreexo-validation-bip.md:67 in d1d03420ac outdated

62+
63+| Name              | Type                     | Description                               |
64+| ----------------- | ------------------------ | ----------------------------------------- |
65+| Utreexo_Tag_V1    | 64 byte array            | The version tag to be prepended to the leafhash. |
66+| Utreexo_Tag_V1    | 64 byte array            | The version tag to be prepended to the leafhash. |
67+| BlockHash         | 32 byte array            | The hash of the block in which this tx was confirmed. |

murchandamus commented at 4:49 pm on August 27, 2025:

On all lines in this table, when the byte count is used as an adjective:

0-32 byte array
1+32-byte array

and

0-4 bytes unsigned integer
1+4-byte unsigned integer

in utreexo-validation-bip.md:70 in d1d03420ac outdated

65+| Utreexo_Tag_V1    | 64 byte array            | The version tag to be prepended to the leafhash. |
66+| Utreexo_Tag_V1    | 64 byte array            | The version tag to be prepended to the leafhash. |
67+| BlockHash         | 32 byte array            | The hash of the block in which this tx was confirmed. |
68+| TXID              | 32 byte array            | The transaction's TXID                    |
69+| Vout              | 4 bytes unsigned integer | The output index of this UTXO             |
70+| Header code       | 4 bytes unsigned integer | The block height and iscoinbase. This is a value obtained by left shifting the block height that confirmed this transaction, and then OR-ing it with 1, only if this transaction is a coinbase. |

murchandamus commented at 4:57 pm on August 27, 2025:

0| Header code       | 4 bytes unsigned integer | The block height and iscoinbase. This is a value obtained by left shifting the block height that confirmed this transaction by one bit, and then OR-ing it with 1, only if this transaction is a coinbase. |

in utreexo-validation-bip.md:110 in d1d03420ac outdated

105+The output index of the UTXO in the transaction.
106+
107+##### Header code
108+
109+This field stores the block height and a boolean for marking that the UTXO is
110+part of a coinbase transaction. Mostly serves to save space as the coinbase

murchandamus commented at 5:02 pm on August 27, 2025:

0This field stores the block height and a boolean for marking that the UTXO was
1created by a coinbase transaction. Mostly serves to save space as the coinbase

in utreexo-validation-bip.md:114 in d1d03420ac outdated

109+This field stores the block height and a boolean for marking that the UTXO is
110+part of a coinbase transaction. Mostly serves to save space as the coinbase
111+boolean can be stored in a single bit.
112+
113+This field is a value obtained by left shifting the block height that
114+confirmed this transaction, and then setting the least significant bit to 1 only

murchandamus commented at 5:03 pm on August 27, 2025:

0confirmed this transaction by one bit, and then setting the least significant bit to 1 only

in utreexo-validation-bip.md:130 in d1d03420ac outdated

125+The block height is needed as during transaction validation, it is used during
126+the check of BIP-0065 CLTV. In current nodes, the block height is stored locally
127+as a part of the UTXO set. Since Utreexo nodes get this data from peers, we need
128+to commit to the block height to avoid security vulnerabilities.
129+
130+The boolean for coinbase is needed as they may not be spent before having 100 confirmations.

murchandamus commented at 5:03 pm on August 27, 2025:

0The boolean for coinbase outputs is needed as they may not be spent before having 100 confirmations.

in utreexo-validation-bip.md:150 in d1d03420ac outdated

145+##### script pubkey
146+
147+This field is added to commit to the locking script of the UTXO. With current
148+nodes, this is stored in the UTXO set but since we receive this in the proof
149+from our peers, we need to commit to this value to avoid malicious peers that
150+may send over the wrong locking script.

murchandamus commented at 5:08 pm on August 27, 2025:

Perhaps consider “output script” as “scriptPubKey” is just Bitcoin Core’s variable name for that field.

 0##### Output script size
 1
 2As the output script ("scriptPubKey" in Bitcoin Core) is a variable length byte array, we prepend it with the
 3length.
 4
 5##### Output Script
 6
 7This field is added to commit to the output script of the UTXO. With current
 8nodes, this is stored in the UTXO set but since we receive this in the proof
 9from our peers, we need to commit to this value to avoid malicious peers that
10may send over the wrong output script.

in utreexo-validation-bip.md:173 in d1d03420ac outdated

168+##### Provably unspendable transaction outputs
169+
170+There are outputs in the Bitcoin network that we can guarantee that they cannot
171+be spent without a hard-fork of the network. The following output types are not
172+added to the accumulator:
173+- Outputs that start with an OP_RETURN (0x6a)

murchandamus commented at 5:13 pm on August 27, 2025:

0- Outputs whose output script starts with an OP_RETURN (0x6a)

in utreexo-validation-bip.md:174 in d1d03420ac outdated

169+
170+There are outputs in the Bitcoin network that we can guarantee that they cannot
171+be spent without a hard-fork of the network. The following output types are not
172+added to the accumulator:
173+- Outputs that start with an OP_RETURN (0x6a)
174+- Outputs with a scriptPubkey larger than 10,000 bytes

murchandamus commented at 6:04 pm on August 27, 2025:

0- Outputs with an output script larger than 10,000 bytes

in utreexo-validation-bip.md:186 in d1d03420ac outdated

181+In Utreexo, nodes inspect blocks and identify which outputs are being created
182+and destroyed in the same block, and exclude them from the accumulator and proofs.
183+
184+There's no need to provide proofs for outputs which have been created in the same
185+block. Adding and then immediately removing the output from the accumulator would be
186+possible but doesn't serve any purpose - once outputs are spent their past existence

murchandamus commented at 6:05 pm on August 27, 2025:

0possible but doesn't serve any purpose - once outputs are spent, their past existence

in utreexo-validation-bip.md:199 in d1d03420ac outdated

194+The Utreexo accumulator lacks associative properties during addition and the
195+ordering of which UTXO hash gets added first is consensus critical. For
196+the modification of the accumulator the steps are as follows:
197+
198+1. Batch remove the UTXOs that were spent in the block based on the algorithm
199+   defined in BIP-????. Deletions itself are order-independent.

murchandamus commented at 6:08 pm on August 27, 2025:

For the time being, maybe just use [‎BIP Utreexo Accumulator](‎utreexo-accumulator-bip.md) to clarify whether you are referring to one or the other.

in utreexo-validation-bip.md:204 in d1d03420ac outdated

199+   defined in BIP-????. Deletions itself are order-independent.
200+2. Batch add all non-excluded outputs in the order they're included in the
201+   Bitcoin block. Additions are order-dependent.
202+
203+The removal and the addition of the hashes follow the algorithms defined in
204+BIP-????.

murchandamus commented at 6:10 pm on August 27, 2025:

Ditto

in utreexo-validation-bip.md:213 in d1d03420ac outdated

208+The UTXO proof has 2 elements: the accumulator proof and the leaf data. The
209+leaf data provides the necessary UTXO data for block validation that would be
210+stored locally for non-Utreexo nodes. The accumulator proof proves that the
211+given UTXO hash preimages are committed in the accumulator.
212+
213+Accumulator proof is defined in BIP-????, and contains two elements:

murchandamus commented at 6:10 pm on August 27, 2025:

Ditto

in utreexo-validation-bip.md:218 in d1d03420ac outdated

213+Accumulator proof is defined in BIP-????, and contains two elements:
214+
215+1. A vector of positions of the UTXO hashes in the accumulator.
216+2. A vector of hashes required to hash up to the roots.
217+
218+For (1), positions are in the order of the leaves that are being proved in

murchandamus commented at 6:11 pm on August 27, 2025:

0For (1), positions are in the order of the leaves that are being proven in

in utreexo-validation-bip.md:224 in d1d03420ac outdated

219+the accumulator. These are all the inputs in the natural blockchain order that
220+excludes the same block spends.
221+
222+The UTXO hash preimages follow the same ordering as (1) in the accumulator
223+proofs. Each of the positions in (1) refer to the UTXO hash preimage in the same
224+index.

murchandamus commented at 6:16 pm on August 27, 2025:

For some reason I had thought that the accumulator proof was a Merkle branch, but now reading this, it makes me think that the proofs are built-up from the leaf preimages. Which of the two is correct, and could you perhaps check whether some more clarification should be added here to make it unambiguous? This might also just be me mixing up something as I’m trying to puzzle together everything that is going on.

kcalvinalvin commented at 10:08 am on August 29, 2025:

For some reason I had thought that the accumulator proof was a Merkle branch, but now reading this, it makes me think that the proofs are built-up from the leaf preimages. Which of the two is correct, and could you perhaps check whether some more clarification should be added here to make it unambiguous?

You are right, there’s the merkle branches themselves and the leaf preimages are an entirely separate data apart from that.

I’ll read it over again and make clarifications where needed.

adiabat commented at 2:56 am on September 2, 2025:

Yeah it’s both. First the merkle branch is needed to verify that the UTXO exists at all, then the UTXO data / leaf preimage is needed to feed in to the normal script & transaction validation. We can make it clear that there’s really 2 things stuck together here.

in utreexo-validation-bip.md:237 in d1d03420ac outdated

232+
233+For each block, the UTXO proof must be provided with the bitcoin block for
234+validation to be possible. Without the UTXO proof, it's not possible to
235+validate that the inputs being referenced exists in the UTXO set.
236+
237+The end result of the UTXO proof validation results us in the vector of UTXO

murchandamus commented at 6:18 pm on August 27, 2025:

0The end result of the UTXO proof validation results in the vector of UTXO

in utreexo-validation-bip.md:258 in d1d03420ac outdated

253+of this check affect the consensus validation for Utreexo nodes.
254+
255+### BIP-0030 and BIP-0034 consensus check
256+
257+Before `BIP-0030`, the Bitcoin consensus rules allowed for duplicate TXIDs. If two
258+transactions shared a same TXID, the transaction outputs of the preceding

murchandamus commented at 6:20 pm on August 27, 2025:

0transactions shared a same TXID, the transaction outputs of the succeeding

in utreexo-validation-bip.md:261 in d1d03420ac outdated

256+
257+Before `BIP-0030`, the Bitcoin consensus rules allowed for duplicate TXIDs. If two
258+transactions shared a same TXID, the transaction outputs of the preceding
259+transaction would overwrite the previously created UTXOs. It was assumed that
260+TXIDs were unique but it's trivially easy to create a transaction that share
261+the same `TXID` for coinbase transactions by re-using the same bitcoin address.

murchandamus commented at 6:30 pm on August 27, 2025:

Nit: It’s not just that the TXID is the same, the entire transaction is the same.

in utreexo-validation-bip.md:264 in d1d03420ac outdated

259+transaction would overwrite the previously created UTXOs. It was assumed that
260+TXIDs were unique but it's trivially easy to create a transaction that share
261+the same `TXID` for coinbase transactions by re-using the same bitcoin address.
262+
263+`BIP-0030` check is a consensus check that enforces that newly created transactions
264+do not have outputs that overwrites an existing UTXO.

murchandamus commented at 6:30 pm on August 27, 2025:

0do not have outputs that overwrite an existing UTXO.

in utreexo-validation-bip.md:266 in d1d03420ac outdated

261+the same `TXID` for coinbase transactions by re-using the same bitcoin address.
262+
263+`BIP-0030` check is a consensus check that enforces that newly created transactions
264+do not have outputs that overwrites an existing UTXO.
265+
266+`BIP-0034` was a rule where the block height was included in the script signature

murchandamus commented at 6:35 pm on August 27, 2025:

What was called originally “generator transaction” is now more familiarly referred to as a “coinbase transaction” after the “scriptSig” equivalent being called “coinbase field” in that context.

0`BIP-0034` introduces a rule that requires the block height to be included in the coinbase field

in utreexo-validation-bip.md:267 in d1d03420ac outdated

262+
263+`BIP-0030` check is a consensus check that enforces that newly created transactions
264+do not have outputs that overwrites an existing UTXO.
265+
266+`BIP-0034` was a rule where the block height was included in the script signature
267+of the coinbase transaction. One of the reason for the change was to make

murchandamus commented at 6:39 pm on August 27, 2025:

As far as I can tell, the rest of BIP 34 explains the activation mechanism of BIP 34, so I would claim that this is the main reason.

0of the coinbase transaction. The main reason for the change was to make

in utreexo-validation-bip.md:271 in d1d03420ac outdated

266+`BIP-0034` was a rule where the block height was included in the script signature
267+of the coinbase transaction. One of the reason for the change was to make
268+coinbase transactions unique so that the expensive check of going through the
269+UTXO set wouldn't be needed. However, there were blocks in the past that had
270+random bytes that could be interpreted as block heights. The lowest block
271+heights are: 209,921, 490,897, and 1,983,702.

murchandamus commented at 6:41 pm on August 27, 2025:

0random bytes that could be interpreted as block heights. The lowest implicated block
1heights are: 209,921, 490,897, and 1,983,702.

in utreexo-validation-bip.md:285 in d1d03420ac outdated

280+will remove the checkpoints[^1], as they are not needed anymore to prevent attacks
281+against nodes during Initial Block Download. This is effectively a hard-fork,
282+that will probably never actually happen, however.
283+
284+Block 1,983,702 is the first block that Utreexo nodes would be in danger of a
285+consensus failure due to the inability to perform the BIP-0030 checks. However,

murchandamus commented at 6:47 pm on August 27, 2025:

0consensus failure due to the inability to perform the BIP-0030 checks, if someone were to reuse coinbase transaction from block 164,384 . However,

in utreexo-validation-bip.md:291 in d1d03420ac outdated

286+this block will happen in roughly 21 years from now, and some mitigations have been
287+proposed [^2].
288+
289+### Historical BIP-0030 violations
290+
291+There were two UTXOs that were overwritten due to this consensus rule are:

murchandamus commented at 6:48 pm on August 27, 2025:

Not due to this rule, but rather before it was introduced:

0There were two UTXOs that were overwritten by repeated transactions:

in utreexo-validation-bip.md:300 in d1d03420ac outdated

295+Since the leaf hashes that are committed to the Utreexo accumulator commit to
296+the block hash as well, all the leaf hashes are unique and the two historical
297+violations do not happen with how the UTXO set is represented with the Utreexo
298+accumulator. To be consensus compatible with clients that do have the historical
299+violations, the leaves representing these two UTXOs in the Utreexo accumulator
300+are hardcoded as unspendable.

murchandamus commented at 6:51 pm on August 27, 2025:

If I’m understanding this right:

0accumulator. To be consensus compatible with clients that retain only the second
1occurrences of these outputs, the leaves representing the corresponding first UTXOs in the Utreexo accumulator
2are hardcoded as unspendable.

murchandamus commented at 6:56 pm on August 27, 2025: contributor

This time I took a look at the “Validation Layer” BIP. Also looks very good already. I noticed that there is no Rationale section, and the title seemed a little less informative than it could be.

in utreexo-p2p-bip.md:28 in d1d03420ac outdated

23+## Motivation
24+
25+Utreexo nodes require the inclusion proof to fully validate blocks and transactions.
26+Each block has a corresponding inclusion proof with it and this inclusion proof for blocks up to height 906,937 requires an additional 631.85GB, which is roughly 40GB less than the size of the block data.
27+Each transaction also has a corresponding inclusion proof with it and for normal transaction relay, the proof is roughly 3 times the size of the transaction.
28+It's still reasonable for a single node to download this extra data but little caching goes a long way in reducing the amount of data that one has to download.

murchandamus commented at 9:10 pm on August 28, 2025:

little caching ↦ almost no caching a little caching ↦ some caching

I think you mean the latter:

0It's still reasonable for a single node to download this extra data but a little caching goes a long way in reducing the amount of data that one has to download.

in utreexo-p2p-bip.md:50 in d1d03420ac outdated

45+3. Archive nodes
46+
47+CSNs have the goal of minimizing data storage and download while performing block validation.
48+Archive and bridge nodes store more data and provide this data to CSNs.
49+
50+Bridge nodes are nodes that can add inclusion proofs to mempool transactions, support the same set of messages as CSNs, and should in fact be indistinguishable from CSNs on the network.

murchandamus commented at 9:16 pm on August 28, 2025:

It’s not clear to me how “bridge nodes should in fact be indistinguishable from CSNs on the network”. By whom are they indistinguishable. In what regard are they indistinguishable? Shouldn’t they, e.g., be frequently the first peer to notify about new transactions appearing in the mempool and blocks having been found as they act as the translation layer and therefore the initial source of data for the Utreexo-portion of the node network?

kcalvinalvin commented at 11:31 am on August 29, 2025:

It’s not clear to me how “bridge nodes should in fact be indistinguishable from CSNs on the network”. By whom are they indistinguishable. In what regard are they indistinguishable?

They’re indistinguishable as we don’t explicitly specify which nodes are bridges. The sentence was an attempt at clarifying a common misconception that a CSN must connect to bridge nodes.

Shouldn’t they, e.g., be frequently the first peer to notify about new transactions appearing in the mempool and blocks having been found as they act as the translation layer and therefore the initial source of data for the Utreexo-portion of the node network?

Yes this is true. They usually should be the first to notify utreexo peers about new txs and blocks

adiabat commented at 3:12 am on September 2, 2025:

Maybe “indistinguishable” is too strong – it would be great if nobody could tell, but if there are a small number of bridge nodes and a large number of CSNs it might be traceable.

The main thing is bridge nodes don’t announce themselves as such; they just pass proofs and transactions around just like CSNs. If you’re a CSN connected directly to a bridge node, you might see a lot of INVs and proofs originate from that node, and they might be a bridge, but they might just be a well connected CSN.

It’s similar to trying to prevent people from tracing new transactions to originating nodes, though probably in one sense harder (bridge nodes keep being bridge nodes all the time vs only getting one shot with a wallet broadcasting) but also lower stakes (determining that a node is a bridge doesn’t hurt privacy or network strength that much).

in utreexo-p2p-bip.md:51 in d1d03420ac outdated

46+
47+CSNs have the goal of minimizing data storage and download while performing block validation.
48+Archive and bridge nodes store more data and provide this data to CSNs.
49+
50+Bridge nodes are nodes that can add inclusion proofs to mempool transactions, support the same set of messages as CSNs, and should in fact be indistinguishable from CSNs on the network.
51+Archive nodes are able to serve the blocks and the inclusion proofs. However, they are not able to generate the inclusion proofs as they do not keep the full UTXO set.

murchandamus commented at 9:19 pm on August 28, 2025:

Does “Bridge node” refer to the aspect of whether the node has the UTXO set, and does “archive node” refer to having the full set of data? I.e., are these different dimensions? Would you run an “archive bridge node” if you want to offer all services?

Edit: Oh, never mind, you answer that right below.

in utreexo-p2p-bip.md:60 in d1d03420ac outdated

55+The one exception to this flexibility is that archive nodes must provide both the blocks and the inclusion proofs.
56+While theoretically possible to split these two resources, the blocks are quite small relative to the block proofs, and it simplifies clients to be able to rely on being able to request both over the same connection.
57+
58+### Pre-P2P: Bridge Building
59+
60+When introducing Utreexo into an existing network, there are 2 thing needed before CSNs can operate.

murchandamus commented at 9:23 pm on August 28, 2025:

0When introducing Utreexo into an existing network, there are two things needed before CSNs can operate.

in utreexo-p2p-bip.md:61 in d1d03420ac outdated

56+While theoretically possible to split these two resources, the blocks are quite small relative to the block proofs, and it simplifies clients to be able to rely on being able to request both over the same connection.
57+
58+### Pre-P2P: Bridge Building
59+
60+When introducing Utreexo into an existing network, there are 2 thing needed before CSNs can operate.
61+First, archive nodes need to build proofs for old blocks to serve during the initial-block download (IBD).

murchandamus commented at 9:23 pm on August 28, 2025:

0First, archive nodes need to build proofs for old blocks to serve during the initial block download (IBD).

in utreexo-p2p-bip.md:71 in d1d03420ac outdated

66+
67+### Initial Block Download
68+
69+![Current IBD](bip-utreexo-p2p/current-ibd.png)
70+
71+Current IBD is done by a headers-first block download, in which the node downloads all the Bitcoin block headers, verifies that they connect, and start downloading the actual block data for validation.

murchandamus commented at 9:25 pm on August 28, 2025:

“Verifies that they connect” feels a bit overly reductive.

0Conventionally, IBD is done by a headers-first block download, in which the node downloads all the Bitcoin block headers, verifies that they connect, and follows up by by downloading the block data for validation.

in utreexo-p2p-bip.md:78 in d1d03420ac outdated

73+![Utreexo node IBD](bip-utreexo-p2p/utreexo-node-ibd.png)
74+
75+Utreexo nodes still perform the headers-first phase.
76+However, in addition to blocks, they also require the inclusion proof for UTXOs spent in that block.
77+Hence a Utreexo node will send a `getutreexoproof` message along with the `getdata` message for a given block.
78+This flow is the simplest change and allows a Utreexo node to validate and perform IBD but this method does require downloading about 2 times compared to the current nodes as the inclusion proof for a block is roughly the same size as the block itself.

murchandamus commented at 9:31 pm on August 28, 2025:

0This flow is the simplest change and allows a Utreexo node to validate and perform IBD but this method does require downloading about two times as much data as a conventional node due to the inclusion proof for a block being roughly the same size as the block itself.

in utreexo-p2p-bip.md:89 in d1d03420ac outdated

84+With these TTL values, a node receiving the `TTL` message will be able to determine which output to cache with the Clairvoyant algorithm[^1] which allows the IBD-ing node to reduce the bandwidth required in syncing the node in the most efficient way possible.
85+
86+The node will have the block and the TTLs for the outputs of the given block which it can then use to cache parts of the inclusion proof and only request the needed parts of an inclusion proof for future blocks.
87+
88+We note that it is feasible for a node to receive incorrect TTL values from malicious nodes and this can negatively impact the bandwidth savings.
89+Nodes can mitigate this by not downloading TTL values too far into the future or by checking if the `TTL` message received was included in the accumulator hard-coded into the binary.

murchandamus commented at 9:35 pm on August 28, 2025:

I don’t understand what you mean with “hard-coded in the binary”.

kcalvinalvin commented at 11:40 am on August 29, 2025:

Oh I should clarify this.

Since nothing is being committed to the TTL messages, a node can just lie about the values in the message. To prevent this, the node should either:

1: don’t download too far into the future since the damage done will be greater. 2: rely on the pre-committed (aka “hard coded into the binary”) ttl accumulator in the node software. The ttl accumulator has ttls for each of the blocks accumulated. With this accumulator, the node can check if the received ttl is valid or invalid by checking for its existence in the ttl accumulator.

in utreexo-p2p-bip.md:91 in d1d03420ac outdated

86+The node will have the block and the TTLs for the outputs of the given block which it can then use to cache parts of the inclusion proof and only request the needed parts of an inclusion proof for future blocks.
87+
88+We note that it is feasible for a node to receive incorrect TTL values from malicious nodes and this can negatively impact the bandwidth savings.
89+Nodes can mitigate this by not downloading TTL values too far into the future or by checking if the `TTL` message received was included in the accumulator hard-coded into the binary.
90+
91+This TTL commitment scheme is described in detail [here](#Commitment scheme for TTL messages).

murchandamus commented at 9:36 pm on August 28, 2025:

0The TTL commitment scheme is described in detail in the section [Commitment scheme for TTL messages](#commitment-scheme-for-ttl-messages) below.

in utreexo-p2p-bip.md:97 in d1d03420ac outdated

92+
93+### Transaction relay
94+
95+![Current TX relay](bip-utreexo-p2p/current-tx-relay.png)
96+
97+Current transaction relay is done by sending an inv message with the hash of the transaction and a type field that denotes that this hash represents a transaction.

murchandamus commented at 9:41 pm on August 28, 2025:

Throughout this BIP: I don’t think “current” is a future-proof term to distinguish the established behavior of nodes from Utreexo nodes. Perhaps “conventional” or “non-Utreexo nodes” would be a better fit?

in utreexo-p2p-bip.md:98 in d1d03420ac outdated

93+### Transaction relay
94+
95+![Current TX relay](bip-utreexo-p2p/current-tx-relay.png)
96+
97+Current transaction relay is done by sending an inv message with the hash of the transaction and a type field that denotes that this hash represents a transaction.
98+If the node receiving the inv does not have a tx matching that hash, it then requests for it using a getdata message.

murchandamus commented at 9:54 pm on August 28, 2025:

0If the node receiving the inv does not have a transaction matching that hash, the node then requests the transaction using a getdata message.

in utreexo-p2p-bip.md:107 in d1d03420ac outdated

102+The transaction relay for Utreexo nodes doesn't add any extra round trips.
103+However, it does include extra inventory vectors in the inv message.
104+
105+We introduce a new inventory vector type called `utreexoproofhash`, which makes up the extra information that a Utreexo node will receive.
106+
107+A hash with the type `utreexoproofhash` represents 4 Utreexo merkle tree positions, each of them little endian serialized and taking up 8 bytes in the 32 byte hash.

murchandamus commented at 9:56 pm on August 28, 2025:

0
1A hash with the type `utreexoproofhash` represents four Utreexo merkle tree positions, each of them little-endian serialized and taking up 8 bytes in the 32-byte hash.

in utreexo-p2p-bip.md:108 in d1d03420ac outdated

103+However, it does include extra inventory vectors in the inv message.
104+
105+We introduce a new inventory vector type called `utreexoproofhash`, which makes up the extra information that a Utreexo node will receive.
106+
107+A hash with the type `utreexoproofhash` represents 4 Utreexo merkle tree positions, each of them little endian serialized and taking up 8 bytes in the 32 byte hash.
108+When sending an inv message to a Utreexo node for a tx, we append `utreexoproofhash` inventory vectors to represent the merkle tree positions for each of the UTXOs being referenced in the inputs of the tx.

murchandamus commented at 9:58 pm on August 28, 2025:

Please don’t use abbreviations like “tx” in the running text.

0When sending an inv message to a Utreexo node for a transaction, we append `utreexoproofhash` inventory vectors to represent the merkle tree positions for each of the UTXOs being referenced in the inputs of the transaction.

murchandamus commented at 10:14 pm on August 28, 2025:

When you write “send an inv message” do you actually mean “announce a new transaction” rather than literally a inv message being sent?

murchandamus commented at 10:17 pm on August 28, 2025:

The use of inv message here seems confusing. It’s not clear to me whether it is meant literal or as a stand in for “transaction announcement”. First I even thought you were describing the message that transfers the entire transaction because you already were sending the utreexoproofhash along. If this is actually describing how Utreexo nodes announce transactions to each other, instead of saying “to send an inv message”, it could better be introduces e.g., as

“Where conventional nodes us a inv message to announce a new transaction, Utreexo nodes use the invvect message to announce new transactions to Utreexo peers.”

It would perhaps also help if you explain why the utreexoproofhash would need to be sent with announcement of the transaction—I would have expected them to only be necessary when the transaction data is sent.

Either way, this section is confusing to me.

in utreexo-p2p-bip.md:109 in d1d03420ac outdated

104+
105+We introduce a new inventory vector type called `utreexoproofhash`, which makes up the extra information that a Utreexo node will receive.
106+
107+A hash with the type `utreexoproofhash` represents 4 Utreexo merkle tree positions, each of them little endian serialized and taking up 8 bytes in the 32 byte hash.
108+When sending an inv message to a Utreexo node for a tx, we append `utreexoproofhash` inventory vectors to represent the merkle tree positions for each of the UTXOs being referenced in the inputs of the tx.
109+The Utreexo merkle tree positions are explained in detail in the bip "Utreexo Accumulator Specification".

murchandamus commented at 9:58 pm on August 28, 2025:

0The Utreexo merkle tree positions are explained in detail in "BIP Utreexo Accumulator Specification".

in utreexo-p2p-bip.md:105 in d1d03420ac outdated

100+![Utreexo TX relay](bip-utreexo-p2p/utreexo-tx-relay.png)
101+
102+The transaction relay for Utreexo nodes doesn't add any extra round trips.
103+However, it does include extra inventory vectors in the inv message.
104+
105+We introduce a new inventory vector type called `utreexoproofhash`, which makes up the extra information that a Utreexo node will receive.

murchandamus commented at 10:12 pm on August 28, 2025:

This section feels ambiguous to me.

With the graphic labeling the first box that contains “type:transaction, hash: tx hash (a)” as invvect, I am now wondering whether Utreexo nodes use the inv message to announce messages to each other extended by invvect data, or whether they send invvect messages instead of inv messages.

in utreexo-p2p-bip.md:115 in d1d03420ac outdated

110+Since the hash in an inventory vector is always 32 bytes, any unused space will be padded with the max uint64 value of 18446744073709551615.
111+
112+With these merkle tree positions for the UTXOs referenced in the inputs, we can calculate the needed positions of the merkle hashes to them.
113+These positions are then sent over in the `getdata` message as an another inventory vector.
114+
115+![Utreexo TX relay multiple Utreexo proof hash vectors](bip-utreexo-p2p/utreexo-tx-relay-multiple-proofhash-vectors.png)

murchandamus commented at 10:22 pm on August 28, 2025:

Scrolling up and down through this document, it’s sometimes difficult to tell whether a paragraph belongs to the image before or after the paragraph. Since Markdown does not allow captions on images, it could for example help if either the images included the caption, or if the text were structured in some way that makes it clearer.

in utreexo-p2p-bip.md:131 in d1d03420ac outdated

126+
127+### Block Propagation
128+
129+![Legacy Block Propagation](bip-utreexo-p2p/legacy-block-propagation.png)
130+
131+Legacy block propagation without Compact Blocks comprises of three steps:

murchandamus commented at 10:27 pm on August 28, 2025:

Consistency: Previously you were referring to non-Utreexo nodes as “current nodes”, now it’s “legacy”. Please use one term to refer to the same concept across the entire document.

in utreexo-p2p-bip.md:202 in d1d03420ac outdated

197+#### Reconstructable Script
198+
199+For some script types (e.g. `ScriptHash`, `PubkeyHash`, `WitnessScriptHash`, `WitnessPubkeyHash`) the actual locking condition is not in the scriptPubkey, but a hash of it.
200+The script which is evaluated is provided as an element of the scriptSig or witness data.
201+
202+Therefore, we can safely just omit the locking script hash from the UTXO data and reconstruct it from the witness or scriptSig.

murchandamus commented at 10:37 pm on August 28, 2025:

0Therefore, we can safely omit the locking script hash from the UTXO data and reconstruct it from the witness or scriptSig.

in utreexo-p2p-bip.md:242 in d1d03420ac outdated

237+
238+| Field        | Type                | Description                                                                                                                                                                                                                                                    |
239+|--------------|---------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
240+| block height | uint32              | The time-to-live value of a leaf in the Utreexo merkle forest. The value is determined by the amount of leaves that were added to the accumulator since its creation                                                                                           |
241+| length       | varint              | The length of the TTLs                                                                                                                                                                                                                                         |
242+| TTLs         | vector of TTL infos | The TTL Info for the UTXOs that are added to the Utreexo merkle forest in blockchain ordering. See [Utreexo - Validation Layer](./utreexo-validation-bip.md#Excluded UTXOs from the accumulator) for the UTXOs that are not added to the Utreexo merkle forest |

murchandamus commented at 10:40 pm on August 28, 2025:

0| TTLs         | vector of TTL infos | The TTL Info for the UTXOs that are added to the Utreexo merkle forest in blockchain ordering. See [Utreexo - Validation Layer](./utreexo-validation-bip.md#excluded-utxos-from-the-accumulator) for the UTXOs that are not added to the Utreexo merkle forest |

in utreexo-p2p-bip.md:287 in d1d03420ac outdated

282+
283+The bitmaps here are formatted as big-endian and padded to the nearest byte, with 1 meaning a request for the proof hash or the leaf data, and 0 meaning omit the proof hash or the leaf data.
284+
285+Since there's one corresponding leaf data per target location, it's trivial to generate a bitmap for the leafdatas.
286+
287+Using the [proof_positions](./utreexo-accumulator-bip.md#Utility Functions) function, it's possible to generate the positions of the needed proof hashes for a given set of targets.

murchandamus commented at 10:42 pm on August 28, 2025:

0Using the [proof_positions](./utreexo-accumulator-bip.md#utility-functions) function, it's possible to generate the positions of the needed proof hashes for a given set of targets.

murchandamus commented at 10:50 pm on August 28, 2025: contributor

I read the whole P2P BIP, although I went over the new messages section a bit more quickly. There are some sections that felt a bit confusing to me, perhaps you could try to take a look at whether you can clarify those for the less initiated. Overall, this seems close to complete, although I noticed that it is missing a Rationale section.

murchandamus added the label Needs number assignment on Aug 28, 2025

jonatack commented at 8:00 pm on August 29, 2025: member

After discussion amongst the editors, we’ve assigned 181-183 for these 3 BIP drafts. @murchandamus suggested 181 Accumulator / 182 Validation / 183 P2P (I agree) while leaving it up to you.

jonatack removed the label Needs number assignment on Aug 29, 2025

murchandamus commented at 11:33 pm on August 29, 2025: contributor

Whenever you get around to it, please add the numbers to the Preambles, set the “Created” header to 2025-08-29 (it holds the date a BIP got numbered), and add the table entries to the README.mediawiki.

kcalvinalvin commented at 3:53 am on August 30, 2025: contributor

After discussion amongst the editors, we’ve assigned 181-183 for these 3 BIP drafts.

@murchandamus suggested 181 Accumulator / 182 Validation / 183 P2P (I agree) while leaving it up to you.

Whenever you get around to it, please add the numbers to the Preambles, set the “Created” header to 2025-08-29 (it holds the date a BIP got numbered), and add the table entries to the README.mediawiki.

Currently going through all the reviews and writing up the rationale for validation and p2p. Will address these as well.

in utreexo-p2p-bip.md:186 in d1d03420ac outdated

181+### New data structures
182+
183+#### Compact leaf data
184+
185+For a CSN to learn the data associated with a UTXO, it must ask for a peer that has it.
186+To authenticate this data, it is committed into the accumulator, and therefore cannot be changed by peer.

luisschwab commented at 4:11 pm on September 2, 2025:

0To authenticate this data, it is committed into the accumulator, and therefore cannot be changed by the peer.

in utreexo-p2p-bip.md:18 in d1d03420ac outdated

13+Requires: <BIP-???? (Utreexo Accumulator Specification), BIP-???? (Utreexo - Validation Layer)>
14+```
15+
16+## Abstract
17+
18+Utreexo creates a compact representation of the UTXO set that only takes a couple of kilobytes.

luisschwab commented at 4:14 pm on September 2, 2025:

At one extreme of this gradient, nodes minimize storage and memory requirements, keeping only the roots of the hash trees, which never exceed a kilobyte.

The Utreexo paper mentions that the upper limit of the accumulator size is a single KB. What changed?

kcalvinalvin commented at 12:55 pm on September 7, 2025:

It’s essentially still a kilobyte but since we can support leaves up to the maximum of uint64, we can have 64 roots which is 64*32 = 2048. So 2KB max.

kcalvinalvin force-pushed on Sep 7, 2025

BIP181: Add the Utreexo accumulator BIP d89952d09f

kcalvinalvin force-pushed on Sep 7, 2025

BIP182: Add the Utreexo validation BIP 4aa26f3ca7

kcalvinalvin force-pushed on Sep 7, 2025

BIP183: Add the Utreexo P2P BIP 68da366a98

Update README table to include BIPs: 181, 182, 183 bd1e242587

kcalvinalvin force-pushed on Sep 7, 2025

kcalvinalvin renamed this:
~~BIP draft: BIPs for Utreexo~~
BIP 181, 182, 183: BIPs for Utreexo
on Sep 7, 2025

kcalvinalvin commented at 12:58 pm on September 7, 2025: contributor

All of the review comments are addressed and the rationale for BIPs 182 and 183 were added.

BIP-0183 was also edited in the following ways:

1: Images updated with caption 2: Images now updated with transparent backgrounds and changed the colors so they can be read in dark mod 3: Changed the layout of the images and the paragraphs to be more legible.

in bip-0182.md:211 in bd1e242587

206+#### Format of the UTXO proof
207+
208+The UTXO proof has 2 elements: the accumulator proof and the leaf data. The
209+leaf data provides the necessary UTXO data for block validation that would be
210+stored locally for non-Utreexo nodes. Non-Utreexo nodes store this data (under "chainstate/" for Bitcoin Core)
211+but since utreexo nodes don't this data, it must be provided.

murchandamus commented at 8:19 pm on September 16, 2025:

Missing a word.

0but since utreexo nodes don't <missing word> this data, it must be provided.

vostrnad commented at 0:09 am on September 17, 2025:

0but since Utreexo nodes don't this data, it must be provided.

in bip-0182.md:321 in bd1e242587

316+
317+## Rationale
318+
319+**Why use the Utreexo accumulator to keep track of UTXOs instead of a key-value database like leveldb?**
320+
321+There's two main advantages to using the Utreexo accumulator instead of a key-value database like leveldb:

murchandamus commented at 8:24 pm on September 16, 2025:

0There are two main advantages to using the Utreexo accumulator instead of a key-value database like leveldb:

in bip-0182.md:324 in bd1e242587

319+**Why use the Utreexo accumulator to keep track of UTXOs instead of a key-value database like leveldb?**
320+
321+There's two main advantages to using the Utreexo accumulator instead of a key-value database like leveldb:
322+
323+ 1. Puts a cap on the UTXO set growth.
324+ 3. Performance gains with the elimination of random reads/writes.

murchandamus commented at 8:26 pm on September 16, 2025:

Renders right of course, but still:

0 2. Performance gains with the elimination of random reads/writes.

in bip-0182.md:335 in bd1e242587

330+
331+The UTXO set is currently around 10GB in 2025 and with pruning that's all it takes to maintain a full node.
332+However, as the UTXO set grows, the disk storage requirement will grow along with it and increase the barrier to running a full node.
333+
334+Currently, the UTXO set size is $O(log(N))$ where $N$ is the number of UTXOs.
335+By utilizing the Utreexo accumulator, we're able to cap the UTXO set growth at $O(log_2(N))$.

murchandamus commented at 8:28 pm on September 16, 2025:

Given that you don’t store the UTXO set, but an accumulator that commits to the UTXO set, perhaps these two sentences should be amended?

murchandamus commented at 8:32 pm on September 16, 2025: contributor

Thanks for the update.

I gave the diff a quick skim:

murchandamus commented at 8:51 pm on September 16, 2025: contributor

It would perhaps be good if one or two other people gave it also a read, but either way, it seems pretty complete to me. What’s the status on your end? Do you still have planned work, or are waiting for people to finish reviews?

in bip-0181.md:56 in bd1e242587

51+the accumulator tracks the current set of unspent transaction outputs (UTXOs).
52+
53+The Utreexo accumulator is based on an append-only Merkle tree design introduced in [^1],
54+which provides logarithmic-sized inclusion proofs. Utreexo extends this design to support dynamic updates,
55+specifically enabling deletions from the set—a requirement for tracking UTXO spends in Bitcoin.
56+To accommodate this, Utreexo changes the storage requirement from the accumulator design in [^1] to $O(log_2(N))$,

vostrnad commented at 0:08 am on September 17, 2025:

Specifying the logarithm base is redundant in big O notation, as changing the base is equivalent to multiplying by a constant factor.

0To accommodate this, Utreexo changes the storage requirement from the accumulator design in [^1] to $O(log(N))$,

in bip-0181.md:79 in bd1e242587

74+The size of each tree corresponds to the power of two represented by the position of each set bit.
75+For example, the decimal number 21 (binary `0b10101`) contains three 1-bits, meaning three trees are needed in the forest:
76+a 16-element tree ($2^4$), a 4-element tree ($2^2$), and a 1-element tree ($2^0$), with gaps at the 8-element ($2^3$)
77+and 2-element ($2^1$) positions.
78+
79+Each of the hashes in the forest can be referred by an integer label. This labeling is a convention we find easiest

vostrnad commented at 0:08 am on September 17, 2025:

0Each of the hashes in the forest can be referred to by an integer label. This labeling is a convention we find easiest

in bip-0181.md:194 in bd1e242587

189+    return sha512_256(left + right)
190+```
191+
192+**treerows(numleaves):** Returns the minimum number of bits required to represent `numleaves - 1`. This corresponds to the height of the largest tree in the forest. Returns `0` if `numleaves` is `0`.
193+
194+The reason for taking the minimum number of bits required for `numleaves-1` and not `numleaves` is because when `numleaves` is a power of two, we'd get an off-by-one error.

vostrnad commented at 0:08 am on September 17, 2025:

It would be nice to have consistent spacing around the minus sign, there’s both numleaves - 1 and numleaves-1. Same goes for the equals sign below.

in bip-0181.md:419 in bd1e242587

414+be the hash for element at index `i` in `proof.targets`. Otherwise the returned roots will be invalid.
415+
416+The calculate roots algorithm is defined as `CalculateRoots(numleaves, []hash, proof) -> calculated_roots`:
417+
418+- Check if length of `proof.targets` is equal to the length of `[]hash`. Return early if they're not equal.
419+- map `proof.targets` to their hash.

vostrnad commented at 0:08 am on September 17, 2025:

0- Map `proof.targets` to their hash.

in bip-0181.md:432 in bd1e242587

427+  - Calculate parent position.
428+  - Insert parent position into the sorted `proof.targets`.
429+  - Map parent hash to the parent position.
430+- Return calculated_roots
431+
432+The algorithm implemented in python:

vostrnad commented at 0:09 am on September 17, 2025:

0The algorithm implemented in Python:

in bip-0181.md:476 in bd1e242587

471+
472+Inputs:
473+  - `acc`.
474+  - `hash` to be added.
475+
476+The Addition algorithm Add(`acc`, `hash`) is defined as:

vostrnad commented at 0:09 am on September 17, 2025:

0The addition algorithm Add(`acc`, `hash`) is defined as:

in bip-0181.md:480 in bd1e242587

475+
476+The Addition algorithm Add(`acc`, `hash`) is defined as:
477+
478+- From row 0 to and **including** `treerows(acc.numleaves)`
479+  - Break if there's no root at this row.
480+  - remove the last root from `acc.roots`.

vostrnad commented at 0:09 am on September 17, 2025:

0  - Remove the last root from `acc.roots`.

in bip-0181.md:486 in bd1e242587

481+    - Calculate the parent hash of the removed root and the `hash` to be added using *parent_hash*.
482+  - Make the result from `parent_hash` the new `hash`.
483+- Increment `acc.numleaves` by 1.
484+- Append `hash` to `acc.roots`.
485+
486+The algorithm implemented in python:

vostrnad commented at 0:09 am on September 17, 2025:

0The algorithm implemented in Python:

in bip-0181.md:506 in bd1e242587

501+- Inputs:
502+  - The accumulator state.
503+  - `[]hash` that are the hashes for the `proof.targets`.
504+  - `proof`.
505+
506+The Verification algorithm `Verify(acc, []hash, proof) -> bool` is defined as:

vostrnad commented at 0:09 am on September 17, 2025:

0The verification algorithm `Verify(acc, []hash, proof) -> bool` is defined as:

in bip-0181.md:515 in bd1e242587

510+- Get `root_idxs` from `getrootidxs`.
511+- Raise error if the length of `modified_roots` and `root_idxs` do not match.
512+- Attempt to match roots in modified_roots with roots in `acc`. Raise error if we don't find all the roots in the modified_roots in `acc`.
513+- Return `true`.
514+
515+The algorithm implemented in python:

vostrnad commented at 0:09 am on September 17, 2025:

0The algorithm implemented in Python:

in bip-0181.md:544 in bd1e242587

539+
540+- Inputs:
541+  - The accumulator state.
542+  - `proof`.
543+
544+The Deletion algorithm `Delete(acc, Proof) -> acc` is defined as:

vostrnad commented at 0:09 am on September 17, 2025:

0The deletion algorithm `Delete(acc, Proof) -> acc` is defined as:

in bip-0181.md:550 in bd1e242587

545+
546+- Get the modified indexes of the roots `root_idxes` from `getrootidxs`.
547+- Get modified_roots from `Calculate_Roots(acc.numleaves, []positions, Proof)`.
548+- Replace the matching indexes from the `root_idxes` in `acc.roots` with `modified_roots`.
549+
550+The algorithm implemented in python:

vostrnad commented at 0:09 am on September 17, 2025:

0The algorithm implemented in Python:

in bip-0182.md:21 in bd1e242587

16+## Abstract
17+
18+This BIP defines the rules for validating blocks and transactions using the
19+Utreexo accumulator. It is important to note that this BIP does not define the
20+Utreexo accumulator itself, for that see [Utreexo Accumulator Specification](bip-0181.md). This document is only concerned with
21+the general rules for validating blocks and transactions using the Utreexo,

vostrnad commented at 0:09 am on September 17, 2025:

0the general rules for validating blocks and transactions using Utreexo,

in bip-0182.md:63 in bd1e242587

58+#### UTXO Hash Preimages
59+
60+Individual UTXOs are represented as 32-byte hashes in the Utreexo accumulator. To obtain this
61+hash, you must compute the SHA-512/256 hash of the following data:
62+
63+| Name               | Type                     | Description                               |

vostrnad commented at 0:09 am on September 17, 2025:

Inconsistent use of terminating periods in the Description column.

in bip-0182.md:66 in bd1e242587

61+hash, you must compute the SHA-512/256 hash of the following data:
62+
63+| Name               | Type                     | Description                               |
64+| ------------------ | ------------------------ | ----------------------------------------- |
65+| Utreexo_Tag_V1     | 64-byte array            | The version tag to be prepended to the leafhash. |
66+| Utreexo_Tag_V1     | 64-byte array            | The version tag to be prepended to the leafhash. |

vostrnad commented at 0:09 am on September 17, 2025:

Duplicate table rows.

vostrnad commented at 0:16 am on September 17, 2025:

Sorry, didn’t notice this was intentional. You could consolidate this into one row that says “128-byte array”, or change the description of the second line to clarify the intended duplication.

in bip-0182.md:70 in bd1e242587

65+| Utreexo_Tag_V1     | 64-byte array            | The version tag to be prepended to the leafhash. |
66+| Utreexo_Tag_V1     | 64-byte array            | The version tag to be prepended to the leafhash. |
67+| BlockHash          | 32-byte array            | The hash of the block in which this tx was confirmed. |
68+| TXID               | 32-byte array            | The transaction's TXID                    |
69+| Vout               | 4-byte unsigned integer  | The output index of this UTXO             |
70+| Header code        | 4-byte unsigned integer  | The block height and iscoinbase. This is a value obtained by left shifting the block height that confirmed this transaction by one bit, and then OR-ing it with 1, only if this transaction is a coinbase. |

vostrnad commented at 0:09 am on September 17, 2025:

0| Header code        | 4-byte unsigned integer  | The block height and iscoinbase. This is a value obtained by left shifting the block height that confirmed this transaction by one bit, and then OR-ing it with 1 if this transaction is a coinbase. |

in bip-0182.md:72 in bd1e242587

67+| BlockHash          | 32-byte array            | The hash of the block in which this tx was confirmed. |
68+| TXID               | 32-byte array            | The transaction's TXID                    |
69+| Vout               | 4-byte unsigned integer  | The output index of this UTXO             |
70+| Header code        | 4-byte unsigned integer  | The block height and iscoinbase. This is a value obtained by left shifting the block height that confirmed this transaction by one bit, and then OR-ing it with 1, only if this transaction is a coinbase. |
71+| Amount             | 8-byte unsigned integer  | The amount in satoshis for this UTXO      |
72+| Output script size | varint                   | The output script length in bytes              |

vostrnad commented at 0:09 am on September 17, 2025:

“varint” is very often confused with “compact size”, are you sure you didn’t mean the latter? In any case you might want to link to some documentation for this format, as it might not be universally understood.

See: https://learnmeabitcoin.com/technical/general/compact-size/#varint

in bip-0182.md:85 in bd1e242587

80+purposes. This is added so that if there are changes in the preimage of the
81+hash, the version tag helps to avoid misinterpretation.
82+
83+The Utreexo version tag is the SHA512 hash of the string `UtreexoV1`, which is represented as the vector
84+`[85 116 114 101 101 120 111 86 49]` and hex `0x5574726565786f5631`.  (The resulting 64-byte output is
85+`5b832db8ca26c25be1c542d6cceddda8c145615cff5c35727fb3462610807e20ae534dc3f64299199931772e03787d18156eb3151e0ed1b3098bdc8445861885`).

vostrnad commented at 0:09 am on September 17, 2025:

Inconsistent use of 0x before a hexadecimal string.

in bip-0182.md:109 in bd1e242587

104+
105+The output index of the UTXO in the transaction.
106+
107+##### Header code
108+
109+This field stores the block height and a boolean for marking that the UTXO was

vostrnad commented at 0:09 am on September 17, 2025:

0This field stores the block height and a boolean for marking whether the UTXO was

in bip-0182.md:170 in bd1e242587

165+For this reason, we define which UTXOs are not inserted to the accumulator.  Any
166+variations here will result in Utreexo nodes with incompatible proofs.
167+
168+##### Provably unspendable transaction outputs
169+
170+There are outputs in the Bitcoin network that we can guarantee that they cannot

vostrnad commented at 0:09 am on September 17, 2025:

0There are outputs in the Bitcoin network we can guarantee cannot

in bip-0182.md:199 in bd1e242587

194+The Utreexo accumulator lacks associative properties during addition and the
195+ordering of which UTXO hash gets added first is consensus critical. For
196+the modification of the accumulator the steps are as follows:
197+
198+1. Batch remove the UTXOs that were spent in the block based on the algorithm
199+   defined in [Utreexo Accumulator Specification](bip-0181.md). Deletions itself are order-independent.

vostrnad commented at 0:09 am on September 17, 2025:

0   defined in [Utreexo Accumulator Specification](bip-0181.md). Deletions themselves are order-independent.

in bip-0182.md:239 in bd1e242587

234+| Accumulator Proof   | variable byte array | variable  | The Utreexo proof as defined in BIP-0181 |
235+| UTXO hash preimages | variable byte array | variable  | The UTXO data needed to validate all the transaction in the block |
236+
237+#### UTXO proof validation
238+
239+For each block, the UTXO proof must be provided with the bitcoin block for

vostrnad commented at 0:09 am on September 17, 2025:

0For each block, the UTXO proof must be provided with the Bitcoin block for

in bip-0182.md:241 in bd1e242587

236+
237+#### UTXO proof validation
238+
239+For each block, the UTXO proof must be provided with the bitcoin block for
240+validation to be possible. Without the UTXO proof, it's not possible to
241+validate that the inputs being referenced exists in the UTXO set.

vostrnad commented at 0:09 am on September 17, 2025:

0validate that the inputs being referenced exist in the UTXO set.

in bip-0182.md:268 in bd1e242587

263+Before `BIP-0030`, the Bitcoin consensus rules allowed for duplicate TXIDs. If two
264+transactions shared a same TXID, the transaction outputs of the succeeding
265+transaction would overwrite the previously created UTXOs. It was assumed that
266+TXIDs were unique but it was trivially easy to create a duplicate transaction that was
267+exactly the same, resulting in a duplicate `TXID` for coinbase transactions by re-using
268+the same bitcoin address.

vostrnad commented at 0:09 am on September 17, 2025:

0the same Bitcoin address.

in bip-0182.md:277 in bd1e242587

272+
273+`BIP-0034` introduces a rule that requires the block height to be included in the coinbase field
274+of the coinbase transaction. The main reason for the change was to make
275+coinbase transactions unique so that the expensive check of going through the
276+UTXO set wouldn't be needed. However, there were blocks in the past that had
277+random bytes that could be interpreted as block heights. The lowest implicated block

vostrnad commented at 0:09 am on September 17, 2025:

0random bytes that could be interpreted as block heights. The lowest impacted block

in bip-0182.md:286 in bd1e242587

281+Since Utreexo nodes only keep the UTXO set commitment, it's not possible to
282+perform the `BIP-0030` check. In theory, those blocks can't be reorged, because
283+of checkpoints, that goes back to block height 295,000 with the block hash
284+`00000000000000004d9b4ef50f0f9d686fd69db2e03af35a100370c64632a983`. Any chain that
285+doesn't include this block at height 295,000 isn't valid as removing this check
286+would be a hard-fork. We note, however, that after version `0.30`, Bitcoin Core

vostrnad commented at 0:09 am on September 17, 2025:

Bitcoin Core dropped the leading zero a few years ago:

0would be a hard-fork. We note, however, that after v30, Bitcoin Core

in bip-0182.md:293 in bd1e242587

288+against nodes during Initial Block Download. This is effectively a hard-fork,
289+that will probably never actually happen, however.
290+
291+Block 1,983,702 is the first block that Utreexo nodes would be in danger of a
292+consensus failure due to the inability to perform the BIP-0030 checks if someone were
293+to reuse coinbase transaction from block 164,384. However, this block will happen in roughly

vostrnad commented at 0:09 am on September 17, 2025:

0to reuse the coinbase transaction from block 164,384. However, this block will happen in roughly

in bip-0182.md:334 in bd1e242587

329+As the amount of Bitcoin users grow, the UTXO set grows with it.
330+
331+The UTXO set is currently around 10GB in 2025 and with pruning that's all it takes to maintain a full node.
332+However, as the UTXO set grows, the disk storage requirement will grow along with it and increase the barrier to running a full node.
333+
334+Currently, the UTXO set size is $O(log(N))$ where $N$ is the number of UTXOs.

vostrnad commented at 0:09 am on September 17, 2025:

I think this is meant to say:

0Currently, the UTXO set size is $O(N)$ where $N$ is the number of UTXOs.

in bip-0183.md:18 in bd1e242587

13+  Requires: 181, 182
14+```
15+
16+## Abstract
17+
18+Utreexo creates a compact representation of the UTXO set that only takes a couple of kilobytes.

vostrnad commented at 0:10 am on September 17, 2025:

0Utreexo creates a compact representation of the UTXO set that only takes up a couple of kilobytes.

in bip-0183.md:62 in bd1e242587

57+
58+### Pre-P2P: Bridge Building
59+
60+When introducing Utreexo into an existing network, there are two things needed before CSNs can operate.
61+First, archive nodes need to build proofs for old blocks to serve during the initial block download (IBD).
62+Second, nodes need to build and maintain the UTXO merkle forest, and an index of outpoints to leaves of that forest, so that they can build proofs for new transactions.

vostrnad commented at 0:10 am on September 17, 2025:

For some reason, “Merkle” is consistently capitalized in 181 and 182 but consistently lowercase in 183.

in bip-0183.md:63 in bd1e242587

58+### Pre-P2P: Bridge Building
59+
60+When introducing Utreexo into an existing network, there are two things needed before CSNs can operate.
61+First, archive nodes need to build proofs for old blocks to serve during the initial block download (IBD).
62+Second, nodes need to build and maintain the UTXO merkle forest, and an index of outpoints to leaves of that forest, so that they can build proofs for new transactions.
63+Both of these processes happen without any p2p messages by taking an already existing, synchronized archive full node and going through its stored block data.

vostrnad commented at 0:10 am on September 17, 2025:

0Both of these processes happen without any P2P messages by taking an already existing, synchronized archive full node and going through its stored block data.

in bip-0183.md:65 in bd1e242587

60+When introducing Utreexo into an existing network, there are two things needed before CSNs can operate.
61+First, archive nodes need to build proofs for old blocks to serve during the initial block download (IBD).
62+Second, nodes need to build and maintain the UTXO merkle forest, and an index of outpoints to leaves of that forest, so that they can build proofs for new transactions.
63+Both of these processes happen without any p2p messages by taking an already existing, synchronized archive full node and going through its stored block data.
64+
65+Once an archive and bridge node have been established, CSNs download blocks and inclusion proofs to IBD and maintain sync with the bitcoin network.

vostrnad commented at 0:10 am on September 17, 2025:

0Once an archive and bridge node have been established, CSNs download blocks and inclusion proofs to IBD and maintain sync with the Bitcoin network.

in bip-0183.md:120 in bd1e242587

115+
116+We introduce a new inventory vector type called `utreexoproofhash`, which makes up the extra information that a Utreexo node will receive.
117+
118+A hash with the type `utreexoproofhash` represents four Utreexo merkle tree positions, each of them little-endian serialized and taking up 8 bytes in the 32-byte hash.
119+When sending an inv message to a Utreexo node for a transaction, we append `utreexoproofhash` inventory vectors to represent the merkle tree positions for each of the UTXOs being referenced in the inputs of the transaction.
120+The Utreexo merkle tree positions are explained in detail in [Utreexo Accumulator Specification](bip-0181#Merkle Forest).

vostrnad commented at 0:10 am on September 17, 2025:

0The Utreexo merkle tree positions are explained in detail in [Utreexo Accumulator Specification](bip-0181.md#merkle-forest).

in bip-0183.md:170 in bd1e242587

165+
166+Note that while Node A sent the inv or the blockhash to Node B, Node B is free to ask for the Utreexo proof from a node other than Node A.
167+This allows a Utreexo node to be notified of new blocks from non-Utreexo nodes.
168+
169+Since there's no PoW required for the inclusion proof, the block may be valid and the proof may be invalid.
170+If the block header validation passed while the full block validation fails, Node B should request the inclusion proof from a different peer.

vostrnad commented at 0:10 am on September 17, 2025:

0If the block header validation passes while the full block validation fails, Node B should request the inclusion proof from a different peer.

in bip-0183.md:212 in bd1e242587

207+
208+#### Compact leaf data
209+
210+For a CSN to learn the data associated with a UTXO, it must ask for it from a peer that has it.
211+To authenticate this data, it is committed into the accumulator, and therefore cannot be changed by the peer.
212+The committed data is defined in [Utreexo - Transaction and block validation](bip-0182#UTXO Hash Preimages), but for some information in the leaf data, the receiving peer might already have it, so sending it again is a waste of bandwidth.

vostrnad commented at 0:10 am on September 17, 2025:

0The committed data is defined in [Utreexo - Transaction and block validation](bip-0182.md#utxo-hash-preimages), but for some information in the leaf data, the receiving peer might already have it, so sending it again is a waste of bandwidth.

in bip-0183.md:289 in bd1e242587

284+| target locations               | vector of varint values      | The Utreexo merkle tree locations of the leafdatas. MUST be in blockchain order. MUST include all the locations or none of the locations   |
285+| length of the leafdatas        | varint                       | The length of the leafdatas                                                                                                                |
286+| leafdatas                      | vector of compact leafdatas  | The preimage of the committed UTXOs requested by the MSG_GET_UTREEXO_PROOF. MUST be in blockchain order. See compact leaf data for details |
287+
288+The proof hashes MUST be in merkle forest tree ordering.
289+See BIP [Utreexo Accumulator Specification](bip-0181.md#Merkle Forest) for an explanation on how each of the hashes in the merkle forest are positioned.

vostrnad commented at 0:10 am on September 17, 2025:

0See BIP [Utreexo Accumulator Specification](bip-0181.md#merkle-forest) for an explanation on how each of the hashes in the merkle forest are positioned.

in bip-0183.md:292 in bd1e242587

287+
288+The proof hashes MUST be in merkle forest tree ordering.
289+See BIP [Utreexo Accumulator Specification](bip-0181.md#Merkle Forest) for an explanation on how each of the hashes in the merkle forest are positioned.
290+
291+Each of the target location represents the position of the leaf data at the same index.
292+While each leaf data represent a UTXO in a given block, not all are added as per [Utreexo - Validation Layer](bip-0182.md#Excluded UTXOs from the accumulator).

vostrnad commented at 0:10 am on September 17, 2025:

0While each leaf data represent a UTXO in a given block, not all are added as per [Utreexo - Validation Layer](bip-0182.md#excluded-utxos-from-the-accumulator).

in bip-0183.md:303 in bd1e242587

298+Its `cmdString` for P2PV1 is `getuproof`.
299+Its [BIP324 P2PV2](https://github.com/bitcoin/bips/blob/master/bip-0324.mediawiki#user-content-v2_Bitcoin_P2P_message_structure) message type is `30`.
300+
301+| Field                     | Type                        | Description                                                        |
302+|---------------------------|-----------------------------|--------------------------------------------------------------------|
303+| blockhash                 | 32 byte vector              | The hash of the bitcoin block that we want the inclusion proof for |

vostrnad commented at 0:10 am on September 17, 2025:

0| blockhash                 | 32 byte vector              | The hash of the Bitcoin block that we want the inclusion proof for |

in bip-0183.md:365 in bd1e242587

360+Its `cmdString` for P2PV1 is `utreexotx`.
361+Its [BIP324 P2PV2](https://github.com/bitcoin/bips/blob/master/bip-0324.mediawiki#user-content-v2_Bitcoin_P2P_message_structure) message type is `34`.
362+
363+| Field                      | Type                         | Description                                                                                                                                                                                                     |
364+|----------------------------|------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
365+| transaction                | MSG_TX                       | The bitcoin transaction. Unconfirmed inputs are marked by shifting the index by 1 and setting the LSB                                                                                                           |

vostrnad commented at 0:10 am on September 17, 2025:

0| transaction                | MSG_TX                       | The Bitcoin transaction. Unconfirmed inputs are marked by shifting the index by 1 and setting the LSB                                                                                                           |

luisschwab commented at 11:09 pm on September 19, 2025:

The direction of the shift should be explicit.

in bip-0183.md:369 in bd1e242587

364+|----------------------------|------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
365+| transaction                | MSG_TX                       | The bitcoin transaction. Unconfirmed inputs are marked by shifting the index by 1 and setting the LSB                                                                                                           |
366+| length of the proof hashes | varint                       | The length of the proof hashes                                                                                                                                                                                  |
367+| proof hashes               | vector of 32 byte hashes     | The vector of the requested Utreexo summaries                                                                                                                                                                   |
368+| length of the leafdatas    | varint                       | The length of the leafdatas                                                                                                                                                                                     |
369+| leafdatas                  | vector of compact leafdatas  | The preimage of the leafdatas referenced in the bitcoin transaction. MUST be in the order of the referenced inputs. Unconfirmed inputs do not have a corresponding leaf data. See compact leaf data for details |

vostrnad commented at 0:10 am on September 17, 2025:

0| leafdatas                  | vector of compact leafdatas  | The preimage of the leafdatas referenced in the Bitcoin transaction. MUST be in the order of the referenced inputs. Unconfirmed inputs do not have a corresponding leaf data. See compact leaf data for details |

in bip-0183.md:384 in bd1e242587

379+
380+This step is required because if the unconfirmed UTXO is not explicitly marked, then a malicious peer can omit the leaf data for a confirmed UTXO and mislead us into believing that the transaction is an orphan.
381+
382+#### MSG_UTREEXO_ROOT
383+
384+`MSG_UTREEXO_ROOT` is the utreexo accumulator state at a given height with a proof to a utreexo accumulator of the utreexo roots.

vostrnad commented at 0:10 am on September 17, 2025:

0`MSG_UTREEXO_ROOT` is the Utreexo accumulator state at a given height with a proof to a Utreexo accumulator of the Utreexo roots.

There are also several instances of lowercase “utreexo” in the table below.

luisschwab commented at 11:10 pm on September 19, 2025:

Should be hash, as there is no height on the message, only a block hash.

0`MSG_UTREEXO_ROOT` is the utreexo accumulator state at a given blockhash with a proof to a utreexo accumulator of the utreexo roots.

in bip-0183.md:405 in bd1e242587

400+
401+Because the size of the state needed to validate blocks is so small with Utreexo, nodes can perform IBD in parallel and out of order.
402+
403+For example, a computer could divide the task of validating 800,000 blocks into 100 tasks of 8,000 blocks each: blocks 1 through 800, 800 through 1600, 1600 through 2400, and so on.
404+
405+In order start the 1600 through 2400 IBD task, however, the node should know what the state of the utxo set is at block 1600, so that it can validate and modify the accumulator.

vostrnad commented at 0:10 am on September 17, 2025:

0For example, a computer could divide the task of validating 800,000 blocks into 100 tasks of 8,000 blocks each: blocks 1 through 800, 801 through 1600, 1601 through 2400, and so on.
1
2In order start the 1601 through 2400 IBD task, however, the node should know what the state of the UTXO set is at block 1600, so that it can validate and modify the accumulator.

in bip-0183.md:415 in bd1e242587

410+The node performing IBD tries out the state given for a block height, but checks that when that state is reached from the thread "below" that it properly links up, with the accumulator state arrived at through full validation matching the state given.
411+If that link up does not successfully happen, the IBD process should halt.
412+
413+These hints are statements of fact that are hard-coded into the program itself, and if they are false all bets are off about the program.
414+
415+Archive nodes create a forest of Linkup hints, so that they can prove, with respect to the Linkup forest roots in a node performing IBD, what their binary has claimed the utxo accumulator state to be at any block height.

vostrnad commented at 0:10 am on September 17, 2025:

0Archive nodes create a forest of Linkup hints, so that they can prove, with respect to the Linkup forest roots in a node performing IBD, what their binary has claimed the UTXO accumulator state to be at any block height.

luisschwab commented at 11:16 pm on September 19, 2025:

0Archive nodes create a forest of "linkup hints", so that they can prove, with respect to the Linkup forest roots in a node performing IBD, what their binary has claimed the utxo accumulator state to be at any block height.

in bip-0183.md:419 in bd1e242587

414+
415+Archive nodes create a forest of Linkup hints, so that they can prove, with respect to the Linkup forest roots in a node performing IBD, what their binary has claimed the utxo accumulator state to be at any block height.
416+
417+#### MSG_GET_UTREEXO_ROOT
418+
419+`MSG_GET_UTREEXO_ROOT` is used to request a utreexo accumulator state at a given height.

vostrnad commented at 0:10 am on September 17, 2025:

0`MSG_GET_UTREEXO_ROOT` is used to request a Utreexo accumulator state at a given height.

vostrnad commented at 0:11 am on September 17, 2025: contributor

It would perhaps be good if one or two other people gave it also a read

Here’s my read. I’ve suggested mainly formatting and capitalization changes, but at least two suggestions are quite important: the distinction between “varint” and “compact size”, and the broken cross-BIP links.

in bip-0183.md:139 in bd1e242587

134+
135+Below image illustrates how a Utreexo node would relay transactions with multiple inventory vectors of the type `utreexoproofhash`.
136+
137+![Utreexo TX relay multiple Utreexo proof hash vectors](bip-0183/utreexo-tx-relay-with-multiple-proofhash-inventory-vectors.png)
138+
139+It's possible to have an inv message with multiple txs as well.

luisschwab commented at 10:48 pm on September 19, 2025:

0It's possible to have an `inv` message with multiple transactions as well.

in bip-0183.md:148 in bd1e242587

143+
144+![Utreexo TX relay with multiple txs](bip-0183/utreexo-tx-relay-with-multiple-txs.png)
145+
146+### Block Propagation
147+
148+Legacy block propagation without Compact Blocks comprises of three steps:

luisschwab commented at 10:49 pm on September 19, 2025:

0Legacy block propagation without Compact Blocks is comprised of three steps:

in bip-0183.md:151 in bd1e242587

146+### Block Propagation
147+
148+Legacy block propagation without Compact Blocks comprises of three steps:
149+
150+1. Node A sends an inv message or a block header to Node B.
151+2. Node B makes a getdata request for the block.

luisschwab commented at 10:50 pm on September 19, 2025:

01. Node A sends an `inv` message or a block header to Node B.
12. Node B makes a `getdata` request for the block.

in bip-0183.md:154 in bd1e242587

149+
150+1. Node A sends an inv message or a block header to Node B.
151+2. Node B makes a getdata request for the block.
152+3. Node A sends the block data to Node B.
153+
154+Below image illustrates how a non-Utreexo node would relay blocks without using Compact Blocks.

luisschwab commented at 10:50 pm on September 19, 2025:

0The image below illustrates how a non-Utreexo node would relay blocks without using Compact Blocks.

in bip-0183.md:162 in bd1e242587

157+
158+The same block propagation with Utreexo nodes will look like so:
159+
160+1. Node A sends an inv message or a block header to Node B.
161+2. Node B makes a getdata request for the block.
162+3. Node B makes a getutreexoproof request for the block.

luisschwab commented at 10:50 pm on September 19, 2025:

01. Node A sends an `inv` message or a block header to Node B.
12. Node B makes a `getdata` request for the block.
23. Node B makes a `getutreexoproof` request for the block.

in bip-0183.md:173 in bd1e242587

168+
169+Since there's no PoW required for the inclusion proof, the block may be valid and the proof may be invalid.
170+If the block header validation passed while the full block validation fails, Node B should request the inclusion proof from a different peer.
171+If the new proof and the block pass validation, we can conclude that Node A is malicious and ban the peer.
172+
173+Below image illustrates how a Utreexo node would relay blocks without using Compact Blocks.

luisschwab commented at 10:51 pm on September 19, 2025:

0The image below illustrates how a Utreexo node would relay blocks without using Compact Blocks.

in bip-0183.md:185 in bd1e242587

180+1. Node A sends an inv message or a block header to Node B.
181+2. Node B makes a getdata request (MSG_UTREEXO_SUMMARY) for the given blockhash.
182+3. Node A sends the utreexoblocksummary message to Node B.
183+4. Node B calculates which proof hashes and leafdatas it needs to prove this block.
184+5. Node B makes a getdata request for the block to Node A.
185+6. Node B makes a getutreexoproof request for the block to Node A.

luisschwab commented at 10:55 pm on September 19, 2025:

01. Node A sends an `inv` message or a block header to Node B.
12. Node B makes a `getdata` request (MSG_UTREEXO_SUMMARY) for the given blockhash.
23. Node A sends the `usummary` message to Node B.
34. Node B calculates which proof hashes and leafdatas it needs to prove this block.
45. Node B makes a `getdata` request for the block to Node A.
56. Node B makes a `getuproof` request for the block to Node A.

in bip-0183.md:197 in bd1e242587

192+Should the proof and the block pass validation, we can conclude that Node A is malicious and ban the peer.
193+
194+All of the above propagation works the same with Compact Block propagation as well.
195+The requester would need to send a getdata request (MSG_UTREEXO_SUMMARY) after the Compact Block propagation has concluded for high-bandwidth Compact Block propagation and after the header/inv message was received from the broadcasting peer.
196+
197+Below image illustrates how a Utreexo node would relay blocks in a bandwidth efficient manner without using Compact Blocks.

luisschwab commented at 10:55 pm on September 19, 2025:

0The  image below illustrates how a Utreexo node would relay blocks in a bandwidth efficient manner without using Compact Blocks.

in bip-0183.md:189 in bd1e242587

184+5. Node B makes a getdata request for the block to Node A.
185+6. Node B makes a getutreexoproof request for the block to Node A.
186+7. Node A sends the block data to Node B.
187+8. Node A sends the requested inclusion proof data to Node B.
188+
189+As with the getutreexoproof message, Node B is free to ask for the utreexoblocksummary message from a node other than Node A.

luisschwab commented at 10:56 pm on September 19, 2025:

0As with the `getuproof` message, Node B is free to ask for the `usummary` message from a node other than Node A.

in bip-0183.md:177 in bd1e242587

172+
173+Below image illustrates how a Utreexo node would relay blocks without using Compact Blocks.
174+
175+![Non-Compact-Block Block Propagation with Utreexo Nodes](bip-0183/non-compact-block-utreexo-block-propagation.png)
176+
177+Since the inclusion proof is cached for each of the transaction in the mempool, it's possible to omit the proof hashes for the input UTXOs that we can already prove on our own.

luisschwab commented at 10:56 pm on September 19, 2025:

0Since the inclusion proof is cached for each of the transactions in the mempool, it's possible to omit the proof hashes for the input UTXOs that we can already prove on our own.

luisschwab commented at 10:58 pm on September 19, 2025: none

Some test vectors are in order as well.

in bip-0183.md:253 in bd1e242587

248+| 0x03  | ScriptHash          |
249+| 0x04  | WitnessV0ScriptHash |
250+
251+#### TTL Info
252+
253+For all UTXOs that get added to the Utreexo merkle forest, a TTL info exists for it and includes information necessary for efficiently caching and requesting proofs.

luisschwab commented at 11:01 pm on September 19, 2025:

0For any UTXO that gets added to the Utreexo Merkle forest exists a corresponding TTL Info. It includes the necessary information for efficiently caching and requesting proofs.

in bip-0183.md:258 in bd1e242587

253+For all UTXOs that get added to the Utreexo merkle forest, a TTL info exists for it and includes information necessary for efficiently caching and requesting proofs.
254+The TTL value provides information to determine which leaves should be cached and the death position is used to calculate which positions in the merkle forest we need to prove a block.
255+
256+| Field          | Type   | Description                                                                                                                                                          |
257+|----------------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|
258+| TTL            | varint | The time-to-live value of a leaf in the Utreexo merkle forest. The value is determined by the amount of leaves that were added to the accumulator since its creation |

luisschwab commented at 11:02 pm on September 19, 2025:

0| TTL            | varint | The time-to-live value of a leaf in the Utreexo Merkle forest. The value is determined by the amount of leaves that were added to the accumulator since its creation |

in bip-0183.md:262 in bd1e242587

257+|----------------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|
258+| TTL            | varint | The time-to-live value of a leaf in the Utreexo merkle forest. The value is determined by the amount of leaves that were added to the accumulator since its creation |
259+| death position | varint | The position in the Utreexo merkle forest when the leaf was removed                                                                                                  |
260+
261+#### Utreexo TTL
262+

luisschwab commented at 11:02 pm on September 19, 2025:

Missing a description here.

in bip-0183.md:259 in bd1e242587

254+The TTL value provides information to determine which leaves should be cached and the death position is used to calculate which positions in the merkle forest we need to prove a block.
255+
256+| Field          | Type   | Description                                                                                                                                                          |
257+|----------------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|
258+| TTL            | varint | The time-to-live value of a leaf in the Utreexo merkle forest. The value is determined by the amount of leaves that were added to the accumulator since its creation |
259+| death position | varint | The position in the Utreexo merkle forest when the leaf was removed                                                                                                  |

luisschwab commented at 11:04 pm on September 19, 2025:

0| death position | varint | The position of the leaf in the Utreexo Merkle forest at the moment it was removed                                                                                                  |

in bip-0183.md:265 in bd1e242587

260+
261+#### Utreexo TTL
262+
263+| Field        | Type                | Description                                                                                                                                                                                                                                                    |
264+|--------------|---------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
265+| block height | uint32              | The time-to-live value of a leaf in the Utreexo merkle forest. The value is determined by the amount of leaves that were added to the accumulator since its creation                                                                                           |

luisschwab commented at 11:05 pm on September 19, 2025:

Wrong description.

in bip-0183.md:266 in bd1e242587

261+#### Utreexo TTL
262+
263+| Field        | Type                | Description                                                                                                                                                                                                                                                    |
264+|--------------|---------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
265+| block height | uint32              | The time-to-live value of a leaf in the Utreexo merkle forest. The value is determined by the amount of leaves that were added to the accumulator since its creation                                                                                           |
266+| length       | varint              | The length of the TTLs                                                                                                                                                                                                                                         |

luisschwab commented at 11:06 pm on September 19, 2025:

0| length       | varint              | The length of the TTLInfo vector                                                                                                                                                                                                                                   |

in bip-0183.md:344 in bd1e242587

339+| Start height         | uint32 | The first block which the TTL message will be provided for                                                           |
340+| Max receive exponent | uint8  | Denotes how many TTLs should be provided in total. The provided TTL count will be $2^{Max Receive Exponent}$         |
341+
342+#### MSG_UTREEXO_SUMMARY
343+
344+`MSG_UTREEXO_SUMMARY` is the data needed to calculate the missing merkle forest positions required to validate a given block.

luisschwab commented at 11:06 pm on September 19, 2025:

0`MSG_UTREEXO_SUMMARY` is the data needed to calculate the missing Merkle forest positions required to validate a given block.

in bip-0183.md:354 in bd1e242587

349+| Field                      | Type                    | Description                                                                                                     |
350+|----------------------------|-------------------------|-----------------------------------------------------------------------------------------------------------------|
351+| blockhash                  | 32 byte vector          | The hash of the block that this Utreexo block summary is for                                                    |
352+| num adds                   | varint                  | The count of leaves added to the accumulator on the block this Utreexo block summary is for                     |
353+| length of target locations | varint                  | The length of the target locations                                                                              |
354+| target locations           | vector of uint64 values | The Utreexo merkle tree locations of the leafdatas. MUST be in blockchain order. MUST include all the locations |

luisschwab commented at 11:07 pm on September 19, 2025:

0| target locations           | vector of uint64 values | The Utreexo merkle tree locations of the leafdatas. MUST be in blockchain order. MUST include all locations |

in bip-0183.md:358 in bd1e242587

353+| length of target locations | varint                  | The length of the target locations                                                                              |
354+| target locations           | vector of uint64 values | The Utreexo merkle tree locations of the leafdatas. MUST be in blockchain order. MUST include all the locations |
355+
356+#### MSG_UTREEXO_TX
357+
358+`MSG_UTREEXO_TX` is the non-Utreexo Bitcoin transaction appended with the inclusion proof.

luisschwab commented at 11:08 pm on September 19, 2025:

0`MSG_UTREEXO_TX` is the non-Utreexo Bitcoin transaction appended with its inclusion proof.

in bip-0183.md:391 in bd1e242587

386+Its `cmdString` for P2PV1 is `uroot`.
387+Its [BIP324 P2PV2](https://github.com/bitcoin/bips/blob/master/bip-0324.mediawiki#user-content-v2_Bitcoin_P2P_message_structure) message type is `35`.
388+
389+| Field                      | Type                         | Description                                                                                                                                                                                                      |
390+|----------------------------|------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
391+| numleaves                  | varint                       | The number of leaves that was ever added to the accumulator at this block height. See [numleaves](bip-0181.md#Definitions)                                                                      |

luisschwab commented at 11:11 pm on September 19, 2025:

0| numleaves                  | varint                       | The total number of leaves that were added to the accumulator until this block hash. See [numleaves](bip-0181.md#Definitions)                                                                      |

in bip-0183.md:392 in bd1e242587

387+Its [BIP324 P2PV2](https://github.com/bitcoin/bips/blob/master/bip-0324.mediawiki#user-content-v2_Bitcoin_P2P_message_structure) message type is `35`.
388+
389+| Field                      | Type                         | Description                                                                                                                                                                                                      |
390+|----------------------------|------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
391+| numleaves                  | varint                       | The number of leaves that was ever added to the accumulator at this block height. See [numleaves](bip-0181.md#Definitions)                                                                      |
392+| target                     | varint                       | The position of the utreexo root in the optional accumulator of the utreexo roots                                                                                                                                |

luisschwab commented at 11:11 pm on September 19, 2025:

0| target                     | varint                       | The position of the Utreexo root in the optional accumulator of the Utreexo roots                                                                                                                                |

in bip-0183.md:393 in bd1e242587

388+
389+| Field                      | Type                         | Description                                                                                                                                                                                                      |
390+|----------------------------|------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
391+| numleaves                  | varint                       | The number of leaves that was ever added to the accumulator at this block height. See [numleaves](bip-0181.md#Definitions)                                                                      |
392+| target                     | varint                       | The position of the utreexo root in the optional accumulator of the utreexo roots                                                                                                                                |
393+| blockhash                  | 32 byte vector               | The blockhash for this utreexo accumulator state                                                                                                                                                                 |

luisschwab commented at 11:12 pm on September 19, 2025:

0| blockhash                  | 32 byte vector               | The blockhash for this Utreexo accumulator state                                                                                                                                                                 |

in bip-0183.md:394 in bd1e242587

389+| Field                      | Type                         | Description                                                                                                                                                                                                      |
390+|----------------------------|------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
391+| numleaves                  | varint                       | The number of leaves that was ever added to the accumulator at this block height. See [numleaves](bip-0181.md#Definitions)                                                                      |
392+| target                     | varint                       | The position of the utreexo root in the optional accumulator of the utreexo roots                                                                                                                                |
393+| blockhash                  | 32 byte vector               | The blockhash for this utreexo accumulator state                                                                                                                                                                 |
394+| length of the root hashes  | varint                       | The length of the root hashes                                                                                                                                                                                    |

luisschwab commented at 11:12 pm on September 19, 2025:

0| length of the root hashes  | varint                       | The length of the root hash vector                                                                                                                                                                                  |

in bip-0183.md:395 in bd1e242587

390+|----------------------------|------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
391+| numleaves                  | varint                       | The number of leaves that was ever added to the accumulator at this block height. See [numleaves](bip-0181.md#Definitions)                                                                      |
392+| target                     | varint                       | The position of the utreexo root in the optional accumulator of the utreexo roots                                                                                                                                |
393+| blockhash                  | 32 byte vector               | The blockhash for this utreexo accumulator state                                                                                                                                                                 |
394+| length of the root hashes  | varint                       | The length of the root hashes                                                                                                                                                                                    |
395+| root hashes                | vector of 32 byte hashes     | The utreexo roots for the UTXO set at the blockhash. See [roots](bip-0181.md#Definitions)                                                                                                       |

luisschwab commented at 11:13 pm on September 19, 2025:

0| root hashes                | vector of 32 byte hashes     | The Utreexo roots for the UTXO set at the blockhash. See [roots](bip-0181.md#Definitions)                                                                                                       |

in bip-0183.md:396 in bd1e242587

391+| numleaves                  | varint                       | The number of leaves that was ever added to the accumulator at this block height. See [numleaves](bip-0181.md#Definitions)                                                                      |
392+| target                     | varint                       | The position of the utreexo root in the optional accumulator of the utreexo roots                                                                                                                                |
393+| blockhash                  | 32 byte vector               | The blockhash for this utreexo accumulator state                                                                                                                                                                 |
394+| length of the root hashes  | varint                       | The length of the root hashes                                                                                                                                                                                    |
395+| root hashes                | vector of 32 byte hashes     | The utreexo roots for the UTXO set at the blockhash. See [roots](bip-0181.md#Definitions)                                                                                                       |
396+| length of the proof hashes | varint                       | The length of the proof hashes                                                                                                                                                                                   |

luisschwab commented at 11:13 pm on September 19, 2025:

0| length of the proof hashes | varint                       | The length of the proof hashes vector                                                                                                                                                                                |

in bip-0183.md:397 in bd1e242587

392+| target                     | varint                       | The position of the utreexo root in the optional accumulator of the utreexo roots                                                                                                                                |
393+| blockhash                  | 32 byte vector               | The blockhash for this utreexo accumulator state                                                                                                                                                                 |
394+| length of the root hashes  | varint                       | The length of the root hashes                                                                                                                                                                                    |
395+| root hashes                | vector of 32 byte hashes     | The utreexo roots for the UTXO set at the blockhash. See [roots](bip-0181.md#Definitions)                                                                                                       |
396+| length of the proof hashes | varint                       | The length of the proof hashes                                                                                                                                                                                   |
397+| proof hashes               | vector of 32 byte hashes     | The proof hashes needed to validate with the pre-committed utreexo accumulator of the utreexo roots                                                                                                              |

luisschwab commented at 11:13 pm on September 19, 2025:

0| proof hashes               | vector of 32 byte hashes     | The proof hashes needed to validate with the pre-committed Utreexo accumulator of the Utreexo roots                                                                                                              |

in bip-0183.md:399 in bd1e242587

394+| length of the root hashes  | varint                       | The length of the root hashes                                                                                                                                                                                    |
395+| root hashes                | vector of 32 byte hashes     | The utreexo roots for the UTXO set at the blockhash. See [roots](bip-0181.md#Definitions)                                                                                                       |
396+| length of the proof hashes | varint                       | The length of the proof hashes                                                                                                                                                                                   |
397+| proof hashes               | vector of 32 byte hashes     | The proof hashes needed to validate with the pre-committed utreexo accumulator of the utreexo roots                                                                                                              |
398+
399+This message is for implementing an out-of-order block validation node[^2] or softchains[^3].

luisschwab commented at 11:14 pm on September 19, 2025:

0This message is used for implementing an out-of-order block validation node[^2] or softchains[^3].

in bip-0183.md:401 in bd1e242587

396+| length of the proof hashes | varint                       | The length of the proof hashes                                                                                                                                                                                   |
397+| proof hashes               | vector of 32 byte hashes     | The proof hashes needed to validate with the pre-committed utreexo accumulator of the utreexo roots                                                                                                              |
398+
399+This message is for implementing an out-of-order block validation node[^2] or softchains[^3].
400+
401+Because the size of the state needed to validate blocks is so small with Utreexo, nodes can perform IBD in parallel and out of order.

luisschwab commented at 11:15 pm on September 19, 2025:

0Because the size of the state needed to validate blocks is so small with Utreexo, nodes can perform parallel and out of order IBD.

in bip-0183.md:426 in bd1e242587

421+Its `cmdString` for P2PV1 is `geturoot`.
422+Its [BIP324 P2PV2](https://github.com/bitcoin/bips/blob/master/bip-0324.mediawiki#user-content-v2_Bitcoin_P2P_message_structure) message type is `36`.
423+
424+| Field                      | Type                    | Description                                                                                                      |
425+|----------------------------|-------------------------|------------------------------------------------------------------------------------------------------------------|
426+| blockhash                  | 32 byte vector          | The hash of the block that the requested utreexo root message is for                                             |

luisschwab commented at 11:16 pm on September 19, 2025:

0| blockhash                  | 32 byte vector          | The hash of the block that the requested Utreexo root message is for                                             |

in bip-0183.md:450 in bd1e242587

445+
446+#### MSG_UTREEXO_FLAG
447+
448+Defined as `1 << 24`.
449+
450+It can be set with `MSG_TX` and `MSG_WITNESS_TX` to indicate in `getdata` messages that a Utreexo tx is desired.

luisschwab commented at 11:16 pm on September 19, 2025:

0It can be set with `MSG_TX` and `MSG_WITNESS_TX` to indicate in `getdata` messages that a Utreexo transaction is desired.

in bip-0183.md:462 in bd1e242587

457+
458+#### MSG_WITNESS_UTREEXO_TX
459+
460+Defined as `1090519041` or `1 << 30 | 1 << 24 | 1`.
461+
462+Used to indicate in a `getdata` message that a witness Utreexo tx is desired.

luisschwab commented at 11:17 pm on September 19, 2025:

0Used to indicate in a `getdata` message that a Utreexo transaction is desired.
1
2#### MSG_WITNESS_UTREEXO_TX
3
4Defined as `1090519041` or `1 << 30 | 1 << 24 | 1`.
5
6Used to indicate in a `getdata` message that a witness Utreexo transaction is desired.

in bip-0183.md:466 in bd1e242587

461+
462+Used to indicate in a `getdata` message that a witness Utreexo tx is desired.
463+
464+### Commitment scheme for TTL messages
465+
466+We choose an arbitrary height `X` and go through each of `TTL info` in all the the `Utreexo TTL` values up until that height.

luisschwab commented at 11:18 pm on September 19, 2025:

0We choose an arbitrary height `X` and go through each of `TTL Info`s in all of the `Utreexo TTL` values up until that height.

in bip-0183.md:468 in bd1e242587

463+
464+### Commitment scheme for TTL messages
465+
466+We choose an arbitrary height `X` and go through each of `TTL info` in all the the `Utreexo TTL` values up until that height.
467+
468+If the TTL in the `TTL info` is greater than the [numleaves](bip-0181.md#Definitions) value of the Utreexo accumulator at the chosen height `X`, we reset the `death position` and the `TTL` values to their default of 0.

luisschwab commented at 11:18 pm on September 19, 2025:

0If the TTL in the `TTL Info` is greater than the [numleaves](bip-0181.md#Definitions) value of the Utreexo accumulator at the chosen height `X`, we reset the `death position` and the `TTL` values to their default of 0.

in bip-0183.md:502 in bd1e242587

497+
498+## Rationale
499+
500+**Why is there a separate NODE_UTREEXO_ARCHIVE service bit from the NODE_UTREEXO service bit?**
501+
502+For archive nodes, we wanted the ability for a node to keep just the historical Utreexo proofs since the historical blocks can be served by any archival nodes.

luisschwab commented at 11:19 pm on September 19, 2025:

0For archival nodes, we wanted the ability for a node to keep just the historical Utreexo proofs since the historical blocks can be served by any archival node.

in bip-0183.md:524 in bd1e242587

519+
520+**Why are the positions in the Utreexo merkle forest communicated via inventory vectors instead of a separate message?**
521+
522+We decided to communicate the positions in the Utreexo merkle forest by inventory vectors instead of a separate message to avoid an extra round trip during the transaction propagation.
523+
524+As mentioned above in [Transaction Relay](#transaction-relay), non-Utreexo nodes propagate a transaction in these 3 steps:

luisschwab commented at 11:20 pm on September 19, 2025:

0As mentioned above in [Transaction Relay](#transaction-relay), non-Utreexo nodes propagate a transaction in 3 steps:

in bip-0183.md:535 in bd1e242587

530+The Utreexo nodes follow the same 3 steps because of the new MSG_UTREEXO_PROOF_HASH.
531+If we were to implement the following with a separate message, we would add a round trip and the entire transaction propagation would look like these 5 steps:
532+
533+  1. Receive the inventory message for the transaction.
534+  2. Send a message to get the positions in the Utreexo merkle forest for the transaction.
535+  3. Receive the positions in the Utreexo merkle forest.

luisschwab commented at 11:21 pm on September 19, 2025:

0  2. Send a message to get the positions in the Utreexo Merkle forest for the transaction.
1  3. Receive the positions in the Utreexo Merkle forest.

in bip-0183.md:428 in bd1e242587

423+
424+| Field                      | Type                    | Description                                                                                                      |
425+|----------------------------|-------------------------|------------------------------------------------------------------------------------------------------------------|
426+| blockhash                  | 32 byte vector          | The hash of the block that the requested utreexo root message is for                                             |
427+
428+### New Inventory Types

luisschwab commented at 0:01 am on September 20, 2025:

For all inventory types: be explicit about what needs to be provided and in what format (eg: blockhash, leaf positions, etc..).

in bip-0181.md:33 in d89952d09f outdated

28+This set is typically stored in a database that must be accessed frequently and cannot
29+be pruned. As a result, the cost of running a node is directly tied to the size
30+of the UTXO set. Since it can grow indefinitely, bounded only by block size, it represents a
31+long-term scalability concern.
32+
33+Utreexo is a dynamic accumulator that enables the UTXO set to be represented in just a few kilobytes,

ismaelsadeeq commented at 8:16 am on October 2, 2025:

Coming from https://github.com/cryptography-camp/workbook

The defined accumulator in BIP 181 is positive because it supports membership proofs.

0Utreexo is a dynamic positive accumulator that enables the UTXO set to be represented in just a few kilobytes

in bip-0181.md:55 in d89952d09f outdated

50+enabling efficient membership proofs without requiring storage of the entire set. In the context of Utreexo,
51+the accumulator tracks the current set of unspent transaction outputs (UTXOs).
52+
53+The Utreexo accumulator is based on an append-only Merkle tree design introduced in [^1],
54+which provides logarithmic-sized inclusion proofs. Utreexo extends this design to support dynamic updates,
55+specifically enabling deletions from the set—a requirement for tracking UTXO spends in Bitcoin.

ismaelsadeeq commented at 8:37 am on October 2, 2025:

Parsing through the linked paper it claimed that the accumulator defined there is sound and strong.

With the extension here to make that accumulator dynamic, I suppose it is still correct, sound and strong? Perhaps link to some resource on where that was explicitly studied

ismaelsadeeq commented at 8:51 am on October 2, 2025: member

I think our original justification (better performance with SHA512/256) mentioned in the BIP is sound. Happy to provide the benchmarks, they’re being worked on at the moment.

This point should also be added in the rationale along with the benchmarks when available

murchandamus added the label PR Author action required on Oct 6, 2025

murchandamus commented at 6:46 pm on October 6, 2025: contributor

It looks like there is still work in progress here. Please let me know when the review feedback has been resolved.

BIP 181, 182, 183: BIPs for Utreexo #1923

1. Security Advantages

2. Comparative Analysis: SHA-256 vs SHAKE256

3. Functional Example

4. Implementation Benefits

5. Technical Reference