sdaftuar
commented at 6:59 pm on October 18, 2023:
member
This is a draft implementation of the cluster mempool design described in #27677. I’m opening this as a draft PR now to share the branch I’m working on with others, so that we can start to think about in-progress projects (like package relay, package validation, and package rbf) in the context of this design. Also, I can use some help from others for parts of this work, including the interaction between the mempool and the wallet, and also reworking some of our existing test cases to fit a cluster-mempool world.
Note that the design of this implementation is subject to change as I continue to iterate on the code (to make the code more hygienic and robust, in particular). At this point though I think the performance is pretty reasonable and I’m not currently aware of any bugs. There are some microbenchmarks added here, and some improved fuzz tests; it would be great if others ran both of those on their own hardware as well and reported back on any findings.
This branch implements the following observable behavior changes:
Maintains a partitioning of the mempool into connected clusters
Each cluster is sorted (“linearized”) either using an optimal sort, or an ancestor-feerate-based one, depending on the size of the cluster (thanks to @sipa for this logic)
Transaction selection for mining is updated to use the cluster linearizations
Mempool eviction is updated to use the cluster linearizations
The RBF rules are updated to drop the requirement that no new inputs are introduced, and to change the feerate requirement to instead check that the mining score of a replacement transaction exceed the mining score of the conflicted transactions
The CPFP carveout rule is eliminated (it doesn’t make sense in a cluster-limited mempool)
The ancestor and descendant limits are no longer enforced.
New cluster count/cluster vsize limits are now enforced instead.
Some less observable behavior changes:
The cached ancestor and descendant data are dropped from the mempool, along with the multi_index indices that were maintained to sort the mempool by ancestor and descendant feerates. For compatibility (eg with wallet behavior or RPCs exposing this), this information is now calculated dynamically instead.
The ancestor and descendant walking algorithms are now implemented using epochs (resulting in a significant performance improvement, according to the benchmarks I’ve looked at)
Still to do:
More comparisons between this branch and master on historical data to compare validation speed (accepting loose transactions, processing RBF transactions, validating a block/postprocessing, updating the mempool for a reorg).
More historical data analysis to try to evaluate the likely impact of setting the cluster size limits to varying values (to motivate what values we should ultimately pick). [DONE, see this post]
Updating wallet code to be cluster-aware (including mini_miner and coin selection)
Rework many of our functional tests to be cluster-aware
Figure out what package validation and package RBF rules should be in this design
Rework the partially_downloaded_block fuzz target to not add duplicate transactions to the mempool (#29990).
Update RBF logic to ensure that replacements always strictly improve the mempool.
Figure out how we want to document our RBF policy (preserve historical references to BIP 125 or previous Bitcoin Core behaviors vs clean slate documentation?)
For discussion/feedback:
How significant is it to be dropping the CPFP carveout rule? Does that affect how we will ultimately want to stage new mempool deployment?
How well do the proposed RBF rules meet everyone’s use cases?
What design improvements can we make to the cluster tracking implementation?
The ZMQ callbacks that occur when a block is found will happen in a slightly different order, because we now will fully remove all transactions occurring in a block from the mempool before removing any conflicts. Is this a problem?
DrahtBot
commented at 6:59 pm on October 18, 2023:
contributor
The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.
#28132 (policy: Enable full-rbf by default by petertodd)
#28121 (include verbose “debug-message” field in testmempoolaccept response by pinheadmz)
#26593 (tracing: Only prepare tracepoint arguments when actually tracing by 0xB10C)
If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.
in doc/policy/mempool-replacements.md:29 in b0f771bb55 (outdated)
23- currently-unconfirmed transaction.
24-
25- *Rationale*: When RBF was originally implemented, the mempool did not keep track of
26- ancestor feerates yet. This rule was suggested as a temporary restriction.
27-
28-3. The replacement transaction pays an absolute fee of at least the sum paid by the original
instagibbs
commented at 7:11 pm on October 18, 2023:
might be easier to just “strike out” this rule and mark it deprecated instead of changing numbers since that’s become engineer lingo
sdaftuar
commented at 4:52 pm on December 7, 2023:
I’m not sure the best way to document things, but I added a to-do item to the OP so that we don’t lose track of figuring out how we want the docs to look. (Will have to conform all the code comments as well.)
DrahtBot added the label
CI failed
on Oct 18, 2023
instagibbs
commented at 3:13 pm on October 19, 2023:
shouldn’t this be 0?
in src/policy/rbf.cpp:65 in 3a42ba255a (outdated)
88- std::set<uint256> parents_of_conflicts;
89- for (const auto& mi : iters_conflicting) {
90- for (const CTxIn& txin : mi->GetTx().vin) {
91- parents_of_conflicts.insert(txin.prevout.hash);
92+ // Exit early if we're going to fail (see below)
93+ if (all_conflicts.size() > MAX_REPLACEMENT_CANDIDATES) {
instagibbs
commented at 3:40 pm on October 19, 2023:
Note: this is sticking with the same rule #5 instead of the number of affected clusters. It would be more ideal if it were the number of clusters, to allow for better usage of adversarial-ish batched CPFPs
sdaftuar
commented at 12:48 pm on October 25, 2023:
There is room to relax this rule some, so if this is important we can do so. I think the requirement is a bound on the number of clusters that would have to be re-sorted in order to accept the new transaction. We can approximate that as the number of clusters that would be non-empty as a result of removing all the conflicting transactions from the mempool, and only process replacements for which that is below some target.
That would be a more complex logic though, so before implementing it I wanted to have some sense of whether we need to. Has the historical 100-transaction-conflict limit been problematic for use cases in the past? Note also that in the new code, we are calculating the number of conflicts exactly (the old code used an approximation, which could be gamed by an adversary).
instagibbs
commented at 1:52 pm on October 25, 2023:
Ah! I wrote a huge response to this, then looked up our previous discussions, and realized I didn’t actually read the code: #27677 (comment)
IIUC now, this is only counting direct conflicts, and not the descendants that are booted.
I think that’s fine.
Actually no, the existing code comments were just misleading, looks like the issue still exists, see: #27677 (comment)
sdaftuar
commented at 1:59 pm on October 25, 2023:
Ah yes, in the new code I’m only counting direct conflicts right now, because every descendant of a direct conflict must be in the same cluster as that conflict. So this is already a relaxation of the existing rule.
instagibbs
commented at 2:01 pm on October 25, 2023:
I think the requirement is a bound on the number of clusters that would have to be re-sorted in order to accept the new transaction.
As an alternative, we drop the replacement limit to like, 10 or something, and then only count the direct conflicts, not the direct conflicts and all the descendants?
sdaftuar
commented at 5:00 pm on December 7, 2023:
I believe I (finally) actually fixed this behavior to count the number of direct conflicts.
This is still needed for the same reason as before: a transaction that is above minrelay, but would be in a chunk below minrelay. We could, for example, immediately evict below-minrelay chunks post-re-linearization, which would allow 0-fee parents; then maybe relax this.
in src/bench/mempool_stress.cpp:99 in c5d3c42da4 (outdated)
95+ }
96+ const auto testing_setup = MakeNoLogFileContext<const TestingSetup>(ChainType::MAIN);
97+ CTxMemPool& pool = *testing_setup.get()->m_node.mempool;
98+
99+ std::vector<CTransactionRef> transactions;
100+ // Create 1000 clusters of 100 transactions each
instagibbs
commented at 9:01 pm on October 19, 2023:
numbers in comments are off
shower thought: Should we/can we bound the number of clusters in addition to total memory in TrimToSize? I can’t think of a good way to do that that doesn’t complicate things quite a bit, and perhaps practical mempool sizes make this moot. Just something to consider in case I missed something obvious.
sdaftuar
commented at 7:24 pm on October 24, 2023:
The immediate downside to a cap on number of clusters is that singleton, high-feerate transactions would not be accepted. And I don’t think we need to – the only places where having more clusters makes us slower is in eviction and mining, and for both of those use cases we could improve performance (if we need to) by maintaining the relevant heap data structures (or something equivalent) as chunks are modified, rather than all at once.
For now in this branch I’ve created these from scratch each time, but if it turns out that performance is meaningfully impacted when the mempool is busy, then I can optimize this further by just using a bit more memory.
sdaftuar force-pushed
on Oct 19, 2023
glozow added the label
Mempool
on Oct 20, 2023
in src/txmempool.cpp:1481 in 26e831d5f2 (outdated)
1477+ // TODO: branch on size of fee to do this as 32-bit calculation
1478+ // instead? etc
1479+ return a.first->fee*b.first->size > b.first->fee*a.first->size;
1480+ };
1481+
1482+ std::make_heap(heap_chunks.begin(), heap_chunks.end(), cmp);
instagibbs
commented at 7:39 pm on October 20, 2023:
You probably didn’t mean to call make_heap in the loop for 3N work each time. fwiw I don’t see any performance difference between push_heaping all the elements on vs one make_heap.
once this is changed, add+trimming seems to be faster than in master regardless of topology tested
sdaftuar
commented at 5:03 pm on December 7, 2023:
Should be fixed now.
in src/node/miner.cpp:248 in 26e831d5f2 (outdated)
381+ // TODO: branch on size of fee to do this as 32-bit calculation
382+ // instead? etc
383+ return a.first->fee*b.first->size < b.first->fee*a.first->size;
384+ };
385+ // TODO: replace the heap with a priority queue
386+ std::make_heap(heap_chunks.begin(), heap_chunks.end(), cmp);
instagibbs
commented at 7:46 pm on October 20, 2023:
don’t ask why, but I’m getting a significant performance improvement (>10%) just push_heaping everything from scratch, and similarly with priority_queue
DrahtBot added the label
Needs rebase
on Oct 30, 2023
in src/bench/mempool_stress.cpp:261 in dd6684ab66 (outdated)
dd6684ab665bc8ae76b76fdd2e578fc77d562a52: While you’re touching this, can you rename MempoolCheck to MemPoolCheck, MempoolEviction to MemPoolEviction and ComplexMemPool to MemPoolComplex? That makes -filter=MemPool.* work
As a workaround, -filter=.*Mem.* does work.
Sjors
commented at 10:29 am on November 14, 2023:
member
It would be useful to add a mempool_backwards_compatibility.py test to illustrate how the new rules interact with older nodes. It could have two modern nodes and one v25 (or v26) node. Some of the tests you deleted in this branch could be moved there. E.g. the test could demonstrate how RBF rule 2 is not enforced when relaying to the new node, but it is when relaying to the v25 node.
Benchmarks on a 2019 MacBook Pro (2,3 GHz 8-Core Intel Core i9), plugged in:
573@@ -574,6 +574,8 @@ void SetupServerArgs(ArgsManager& argsman)
574 argsman.AddArg("-limitancestorsize=<n>", strprintf("Do not accept transactions whose size with all in-mempool ancestors exceeds <n> kilobytes (default: %u)", DEFAULT_ANCESTOR_SIZE_LIMIT_KVB), ArgsManager::ALLOW_ANY | ArgsManager::DEBUG_ONLY, OptionsCategory::DEBUG_TEST);
575 argsman.AddArg("-limitdescendantcount=<n>", strprintf("Do not accept transactions if any ancestor would have <n> or more in-mempool descendants (default: %u)", DEFAULT_DESCENDANT_LIMIT), ArgsManager::ALLOW_ANY | ArgsManager::DEBUG_ONLY, OptionsCategory::DEBUG_TEST);
576 argsman.AddArg("-limitdescendantsize=<n>", strprintf("Do not accept transactions if any ancestor would have more than <n> kilobytes of in-mempool descendants (default: %u).", DEFAULT_DESCENDANT_SIZE_LIMIT_KVB), ArgsManager::ALLOW_ANY | ArgsManager::DEBUG_ONLY, OptionsCategory::DEBUG_TEST);
577+ argsman.AddArg("-limitclustercount=<n>", strprintf("Do not accept transactions connected to <n> or more existing in-mempool transactions (default: %u)", DEFAULT_CLUSTER_LIMIT), ArgsManager::ALLOW_ANY | ArgsManager::DEBUG_ONLY, OptionsCategory::DEBUG_TEST);
I (naively) replaced ancestor and descendent limits in coin selection with the new cluster limit. At least the tests pass *.
When we drop these settings anyone who uses them will get an error when starting the node. That’s probably a good thing, since they should read about this change.
* = well, wallet_basic.py fails with:
Internal bug detected: Shared UTXOs among selection results
wallet/coinselection.h:340 (InsertInputs)
in src/txmempool.cpp:1857 in 54f39ca8f1 (outdated)
1549+
1550+void Cluster::RemoveTransaction(const CTxMemPoolEntry& entry)
1551+{
1552+ m_chunks[entry.m_loc.first].txs.erase(entry.m_loc.second);
1553+
1554+ // Chunk (or cluster) may now be empty, but this will get cleaned up
54f39ca8f101483f5f82707689ca49431d4091e5: what if the deleted transaction makes it so there are now two clusters? Is this also safe to ignore?
sdaftuar
commented at 1:33 am on December 10, 2023:
I wouldn’t say it’s safe to ignore, but the idea is that we often want to be able to batch deletions, and then clean things up in one pass. So the call sites should all be dealing with this issue and ensuring that we always clean up at some point.
(This is definitely an area where I expect that we’ll be re-engineering all this logic and trying to come up with a better abstraction layer so that this is more robust and easier to think about!)
54f39ca8f101483f5f82707689ca49431d4091e5: So you’re creating a chunk for each new transaction and then erasing it if the fee rate goes down. Why not the other way around?
in src/txmempool.cpp:1877 in 54f39ca8f1 (outdated)
1569+ m_chunks.emplace_back(txentry.get().GetModifiedFee(), txentry.get().GetTxSize());
1570+ m_chunks.back().txs.emplace_back(txentry);
1571+ while (m_chunks.size() >= 2) {
1572+ auto cur_iter = std::prev(m_chunks.end());
1573+ auto prev_iter = std::prev(cur_iter);
1574+ double feerate_prev = prev_iter->fee*cur_iter->size;
Sjors
commented at 2:27 pm on November 14, 2023:
member
Couple more comments / questions. To be continued…
in src/txmempool.h:440 in 065e18fff3 (outdated)
633@@ -632,6 +634,22 @@ class CTxMemPool
634 */
635 void UpdateTransactionsFromBlock(const std::vector<uint256>& vHashesToUpdate) EXCLUSIVE_LOCKS_REQUIRED(cs, cs_main) LOCKS_EXCLUDED(m_epoch);
636
637+ /**
638+ * Calculate whether cluster size limits would be exceeded if a new tx were
639+ * added to the mempool (assuming no conflicts).
065e18fff30a91c94c47d5c2fc65b13ddc38aa47: do you plan to relax this assumption so that a transaction in (a) full cluster(s) can be replaced?
Update: AcceptSingleTransaction skips ClusterSizeChecks if there’s a replacement, in which case ReplacementChecks checks the new cluster size. So this is not an issue.
0b284b5fd29c06d46c1ec60ea7e1bcd07f36feb1: can you order chunks by mining score?
sdaftuar
commented at 4:55 pm on December 7, 2023:
Chunks for a given cluster are already sorted in descending feerate (ie mining score) order, is that what you’re asking about or is there another issue I’m overlooking?
It would be good to explain the rationale for the miner score somewhere.
Why a * b and b * a?
And why is the fee rate not a_fee / a_size? (the inverse score?)
sipa
commented at 1:53 pm on November 20, 2023:
member
@Sjors I believe most if not all of this PR will be rewritten, and split up into several components. The goal here is just to give an idea of the high-level interactions with other changes (wallet behavior, package validation/relay/RBF, …). I don’t think a detailed line-by-line code review at this stage is a good use of your time.
0b284b5fd29c06d46c1ec60ea7e1bcd07f36feb1 : Could this use CompareMiningScore or does it have a different goal?
in doc/policy/mempool-replacements.md:49 in bf467f8286 (outdated)
45@@ -53,12 +46,10 @@ other consensus and policy rules, each of the following conditions are met:
46 significant portions of the node's mempool using replacements with multiple directly conflicting
47 transactions, each with large descendant sets.
48
49-6. The replacement transaction's feerate is greater than the feerates of all directly conflicting
50+5. The replacement transaction's mining score is greater than the mining score of all directly conflicting
bf467f8286425b692a1736ab6d417d0ba6074658 Maybe make this rule 7. That seems a bit more clear than saying “previously this rule referred to fee rate”
in doc/policy/mempool-replacements.md:27 in bf467f8286 (outdated)
27-
28-3. The replacement transaction pays an absolute fee of at least the sum paid by the original
29+2. The replacement transaction pays an absolute fee of at least the sum paid by the original
30 transactions.
31
32 *Rationale*: Only requiring the replacement transaction to have a higher feerate could allow an
bf467f8286425b692a1736ab6d417d0ba6074658: “a higher feerate or (using clusters) mining score”
Is the “Additionally” reasoning still valid?
in doc/policy/mempool-replacements.md:53 in bf467f8286 (outdated)
54- preferable for block-inclusion, compared to what would be removed from the mempool. This rule
55- predates ancestor feerate-based transaction selection.
56+ *Rationale*: Ensure that the new transaction is more appealing to mine than those being evicted.
57
58 This set of rules is similar but distinct from BIP125.
59
bf467f8286425b692a1736ab6d417d0ba6074658: history can be expanded:
* Cluster mempool introduced, dropping rule 2 and introducing rule 7.
  As of **v27.0** ([PR 28676](https://github.com/bitcoin/bitcoin/pull/28676)).
in src/policy/rbf.cpp:116 in bf467f8286 (outdated)
116+ // directly replaced, because descendant transactions which pay for the
117+ // parent will be reflected in the parent's chunk feerate.
118+ Cluster::Chunk &chunk = mi->m_cluster->m_chunks[mi->m_loc.first];
119+ CFeeRate original_feerate(chunk.fee, chunk.size);
120 if (replacement_feerate <= original_feerate) {
121 return strprintf("rejecting replacement %s; new feerate %s <= old feerate %s",
bf467f8286425b692a1736ab6d417d0ba6074658: ; new chunk feerate %s <= old chunk feerate
Sjors
commented at 3:00 pm on November 20, 2023:
member
I was wondering if we can drop RBF rule 5 (in a followup), but I’m guessing not really. My initial thinking was the cluster limit could be used instead. But CalculateMiningScoreOfReplacementTx only checks that the new cluster doesn’t get too big.
But does the new cluster system make it less of a burden to have large numbers of transactions appear and disappear from the mempool?
—
I’m also trying to understand if dropping the CPFP carveout is fine. The original scenario described on the mailistlist:
TX_important
 * output Alice <- child_a_1 <- child_a_2 <- … <- child_a_25
   (^ intentionally low fees so it doesn't confirm before timeout)
 * output Bob: child_b_1
   ^ high fee
The carveout rule allows Bob’s child_b_1, despite Alice having used up the 25 ancestor limit for TX_important. And Alice can’t use the carveout to just add child_a_26, because child_a_26 has more than one unconfirmed ancestor.
So what happens in the cluster world?
For simplicity, let’s set -limitclustercount to 26 (the default is 100 in this PR). Without the CPFP carveout, can Bob still insert child_b_1?
My understanding is that he can’t, because ClusterSizeChecks will fail.
Instead of failing, can we check if it’s possible to evict the lowest-value chunk? Though it introduces an implicit RBF-like mechanism… If it were possible, then child_a_25 would be evicted from the cluster as long as it has a lower fee rate than child_b_1.
Alice might make sure that the fee rates of child_a_1 … child_a_25 increase such that there’s only one chunk. That just requires Bob to use a higher fee rate.
Can you clarify what txs, orig_txs and cluster are for, and what the general strategy is here?
Sjors
commented at 5:16 pm on November 20, 2023:
member
I don’t think a detailed line-by-line code review at this stage is a good use of your time.
That’s not what I’m trying to do. I strategically / haphazardly picked a dozen lines to ask questions about to get a better understanding of the whole thing. Based on IRC chat today, I’ll wait now for the upcoming update.
DrahtBot removed the label
Needs rebase
on Dec 7, 2023
sdaftuar force-pushed
on Dec 8, 2023
sdaftuar force-pushed
on Dec 10, 2023
sdaftuar force-pushed
on Dec 10, 2023
sdaftuar force-pushed
on Dec 10, 2023
DrahtBot removed the label
CI failed
on Dec 10, 2023
DrahtBot added the label
Needs rebase
on Dec 18, 2023
achow101 referenced this in commit
7143d43884
on Feb 10, 2024
ariard
commented at 3:45 am on February 19, 2024:
member
Do you have a simulation environment or script, like you did for #25717, to observe the computational complexity diff compared to today’s mempool against different chain-of-transactions performance payloads?
It would be interesting to know the lowest-performance Linux host assumed for decentralization of the tx-relay network.
Assume always-on 24/7 internet and a vCPU instance, not bare metal.
Sjors
commented at 3:05 pm on February 19, 2024:
member
In the context of Stratum v2 #29432 I’m looking for a more efficient way to decide when to generate a new block template.
The current implementation looks at the mempool’s GetTransactionsUpdated() every -sv2interval seconds. If there was any update, it calls CreateNewBlock() on BlockAssembler. It then checks if fees increased by at least -sv2feedelta since the last template and if so, pushes it to connected clients.
GetTransactionsUpdated() changes all the time, especially with a big mempool, so not that useful of a filter. I’d like something like GetFeeEstimateAtTop(size vbytes=4000000). Would it be easy to keep track of that (approximate) number every time something is added or removed from the mempool?
Ideally I’d like to change the meaning of -sv2interval from “check every n seconds and push update if needed” to “push better templates immediately, but wait least n seconds between them”.
sipa
commented at 3:10 pm on February 19, 2024:
member
Would it be easy to keep track of that (approximate) number every time something is added or removed from the mempool?
After clustermempool, yes. Today doing that inherently requires running the mining algorithm to figure out what transactions to put there.
Sjors
commented at 3:12 pm on February 19, 2024:
member
Indeed, I meant on top of this PR (or a later incarnation).
ariard
commented at 5:56 pm on February 22, 2024:
member
It would be interesting to know the lowest-performance Linux host assumed for decentralization of the tx-relay network.
Assume always-on 24/7 internet and a vCPU instance, not bare metal.
Running this branch on mainnet on a 2 vCPU instance, with the following performance characteristics:
It would be very interesting to feed a cluster-supporting test node with all kinds of chains of transactions, and compare performance against a non-cluster test node, to check there is no substantial performance regression.
sdaftuar force-pushed
on Mar 7, 2024
sdaftuar force-pushed
on Mar 7, 2024
DrahtBot removed the label
Needs rebase
on Mar 7, 2024
in src/txmempool.h:839 in 96df5acec3 (outdated)
833@@ -764,6 +834,9 @@ class CTxMemPool
834 */
835 void UpdateForDescendants(txiter updateIt, cacheMap& cachedDescendants,
836 const std::set<uint256>& setExclude, std::set<uint256>& descendants_to_remove) EXCLUSIVE_LOCKS_REQUIRED(cs);
837+ // During reorg we add transactions back to mempool, must reconnect
838+ // clusters with in-mempool descendants.
839+ void UpdateClusterForDescendants(txiter updateIt) EXCLUSIVE_LOCKS_REQUIRED(cs);
I think the introduction of Cluster alters the worst-case computational complexity during a re-org, when we reconnect the disconnectpool to the current mempool. Before, we were strictly bounded by DEFAULT_ANCESTOR_LIMIT / DEFAULT_DESCENDANT_LIMIT. Now we have a DEFAULT_CLUSTER_LIMIT, where the worst-case graph traversal algorithm might have to visit more elements than the previous max of 25.
in src/txmempool.cpp:122 in 96df5acec3 (outdated)
117+ }
118+ if (clusters_to_merge.size() > 1) {
119+ // Merge the other clusters into this one, but keep this cluster as
120+ // first so that it's topologically sound.
121+ clusters_to_merge[0]->Merge(clusters_to_merge.begin()+1, clusters_to_merge.end(), true);
122+ // TODO: limit the size of the cluster, in case it got too big.
I think this introduces a transaction censorship vector for time-sensitive second layers. If you can re-org one block and construct a cluster such that a target descendant is not reconnected with other clusters, the non-cluster-connected target descendant might have a stale feerate. This descendant could be pinned at the bottom of the local mempool.
in src/txmempool.cpp:559 in 96df5acec3 (outdated)
554+ // Only one parent cluster: add to it.
555+ clusters_to_merge[0]->AddTransaction(*newit, true);
556+ cachedInnerUsage += clusters_to_merge[0]->GetMemoryUsage();
557+ } else {
558+ cachedInnerUsage -= clusters_to_merge[0]->GetMemoryUsage();
559+ clusters_to_merge[0]->Merge(clusters_to_merge.begin()+1, clusters_to_merge.end(), false);
I think a counter should be introduced here to avoid iterating over more than DEFAULT_CLUSTER_LIMIT entries in this work_queue. It’s in RemoveStaged, itself called in Finalize; I think you might temporarily have unbounded numbers of conflicting descendant transactions.
in src/txmempool.cpp:1545 in 96df5acec3 (outdated)
1540+void Cluster::RemoveTransaction(const CTxMemPoolEntry& entry)
1541+{
1542+ m_chunks[entry.m_loc.first].txs.erase(entry.m_loc.second);
1543+
1544+ // Chunk (or cluster) may now be empty, but this will get cleaned up
1545+ // when the cluster is re-sorted (or when the cluster is deleted) Note:
I think you will have computational threshold effects with whatever limits you pick for max cluster size and the chunk-ordering parameters. Namely: what is the minimal chunk modification that can provoke the maximum re-ordering of all in-cluster chunks?
in src/txmempool.cpp:1569 in 96df5acec3 (outdated)
1564+ double feerate_prev = prev_iter->fee*cur_iter->size;
1565+ double feerate_cur = cur_iter->fee*prev_iter->size;
1566+ // We only combine chunks if the feerate would go up; if two
1567+ // chunks have equal feerate, we prefer to keep the smaller
1568+ // chunksize (which is generally better for both mining and
1569+ // eviction).
I think this is a broken assumption. You’re using virtual bytes (GetTxSize()); however, an equivalent-feerate cluster can have a smaller total weight in witness units, and as such a higher feerate w.r.t. the consensus rules (MAX_BLOCK_WEIGHT), so it should be selected for a mining block template.
in src/txmempool.cpp:1605 in 96df5acec3 (outdated)
1600+ txs.push_back(chunk_tx.get());
1601+ }
1602+ }
1603+ // Sorting by ancestor count is equivalent to topological sort.
1604+ std::sort(txs.begin(), txs.end(), [](const CTxMemPoolEntry::CTxMemPoolEntryRef& a, const CTxMemPoolEntry::CTxMemPoolEntryRef& b) {
1605+ return a.get().GetCountWithAncestors() < b.get().GetCountWithAncestors();
Note that the topological sort is not well-defined. At the very least, you can have a and b each being the first child of a single ancestor while still being the 2nd and 3rd child of a common ancestor. A strict inequality here cannot be said to be equivalent.
in src/rpc/mempool.cpp:259 in ce41413592 (outdated)
246@@ -247,34 +247,81 @@ static RPCHelpMan testmempoolaccept()
247 };
248 }
249
250+static std::vector<RPCResult> ClusterDescription()
251+{
252+ return {
253+ RPCResult{RPCResult::Type::NUM, "vsize", "virtual transaction size as defined in BIP 141. This is different from actual serialized size for witness transactions as witness data is discounted."},
254+ RPCResult{RPCResult::Type::NUM, "txcount", "number of transactions (including this one)"},
255+ RPCResult{RPCResult::Type::NUM, "clusterid", "id of the cluster containing this tx"},
About the clusterid (m_next_cluster_id): I don’t think it’s a good idea to use it as a tie-breaker in CompareMiningScore, as it’s just a monotonic counter; it’s not informative for a mining score. You could at least use weight, or reception timestamp (e.g. the oldest tx-relay reception is the most well-propagated on the network, already in every miner’s mempool).
in src/rpc/mempool.cpp:426 in ce41413592 (outdated)
419@@ -370,6 +420,76 @@ UniValue MempoolToJSON(const CTxMemPool& pool, bool verbose, bool include_mempoo
420 }
421 }
422423+static RPCHelpMan getmempoolfeesize()
424+{
425+ return RPCHelpMan{"getmempoolfeesize",
426+ "Returns fee/size data for the whole mempool.",
11+/** Data structure storing a fee and size, ordered by increasing fee/size.
12+ *
13+ * The size of a FeeFrac cannot be zero unless the fee is also zero.
14+ *
15+ * FeeFracs have a total ordering, first by increasing feerate (ratio of fee over size), and then
16+ * by decreasing size. The empty FeeFrac (fee and size both 0) sorts last. So for example, the
This is not perfect, as you could use weight units, and even have to consider what the minimal satisfying witness is under given consensus rules for a given transaction, especially if you have multiple candidates for a spend given a witnessScript. You already have such transaction types, like second-stage LN transactions (either the revocation path or the preimage/timeout path).
in src/util/feefrac.h:36 in bb1bc54a7c (outdated)
31+ * The >> and << operators only compare feerate and treat equal feerate but different size as
32+ * equivalent. The empty FeeFrac is neither lower or higher in feerate than any other.
33+ *
34+ * These comparisons are only guaranteed to be correct when the product of the highest fee and
35+ * highest size does not exceed 2^64-1. If the fee is a number in sats, and size in bytes, then
36+ * this allows up to 46116.86 BTC at size 4M, and 1844674.4 BTC at size 100k).
I think this should be MAX_MONEY, as ReplacementChecks() in PreChecks() is done before PolicyScriptChecks() and ConsensusScriptChecks(), so an adversary does not have to own the solution to the scriptPubKey to fake 46116.86 or 1844674.4 BTC of value in your local mempool, exceed 2^64-1, and mess up the linearization.
ariard
commented at 3:55 am on March 8, 2024:
member
I’ll spend time reviewing the current CTxMemPool code paths w.r.t. DoS resistance before pursuing review further. It is very unclear whether the introduction of TxGraph improves on this front, as the design focuses on mining incentive-compatibility improvements first.
ariard
commented at 4:01 am on March 8, 2024:
member
Running this branch on mainnet on a 2 vCPU instance, with the following performance characteristics:
Got a disk space issue at block 501691 while syncing from genesis.
2024-02-23T10:53:41Z UpdateTip: new best=000000000000000000753b6b9821750d271e1730d7403a0658c507c88092bdf0 height=501691 version=0x20000000 log2_work=87.760876 tx=287312998 date='2017-12-30T07:48:59Z' progress=0.295839 cache=16.3MiB(151170txo)
2024-02-23T10:53:41Z New outbound-full-relay v1 peer connected: version: 70016, blocks=831673, peer=196
2024-02-23T10:53:47Z *** Disk space is too low!
2024-02-23T10:53:47Z Error: Disk space is too low!
Was configured as a prune node with default pruning settings.
2024-02-22T17:49:33Z Prune configured to target 550 MiB on disk for block and undo files.
DrahtBot added the label
Needs rebase
on Mar 9, 2024
DrahtBot added the label
CI failed
on Apr 3, 2024
DrahtBot
commented at 7:04 pm on April 3, 2024:
contributor
🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.
Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
Leave a comment here, if you need help tracking down a confusing failure.
DrahtBot removed the label
Needs rebase
on Apr 26, 2024
sdaftuar force-pushed
on Apr 26, 2024
sdaftuar force-pushed
on Apr 26, 2024
sdaftuar force-pushed
on Apr 26, 2024
sdaftuar force-pushed
on Apr 26, 2024
sdaftuar force-pushed
on Apr 26, 2024
sdaftuar force-pushed
on Apr 27, 2024
Dianagram009 approved
sdaftuar force-pushed
on Apr 28, 2024
DrahtBot removed the label
CI failed
on Apr 28, 2024
DrahtBot added the label
Needs rebase
on Apr 30, 2024
in
src/kernel/txgraph.cpp:508
in
c49e0444e5 (outdated)
502+ cluster->Clear();
503+
504+ // The first transaction gets to stay in the existing cluster.
505+ bool first = true;
506+ for (auto& txentry : txs) {
507+ if (txentry.get().m_cluster == nullptr) {
Note also that I expect this implementation of txgraph to be rewritten entirely, so the most relevant thing to review right now relating to this code is the header file, as the interface is the most important part (and the rest of the PR and the mempool code builds on top of that interface).
in
src/rpc/mempool.cpp:656
in
c49e0444e5 (outdated)
644@@ -547,6 +645,40 @@ static RPCHelpMan getmempooldescendants()
645 };
646 }
647
648+static RPCHelpMan getmempoolcluster()
649+{
650+ return RPCHelpMan{"getmempoolcluster",
651+ "\nReturns mempool data for given cluster\n",
652+ {
653+ {"id", RPCArg::Type::NUM, RPCArg::Optional::NO, "The cluster id (must be in mempool)"},
Sjors suggested here #28676 (review) that I add a way to do it by transaction hash, which would be easy. Let me know if you have other suggestions…
sdaftuar force-pushed
on May 8, 2024
sdaftuar force-pushed
on Jun 5, 2024
DrahtBot removed the label
Needs rebase
on Jun 5, 2024
sdaftuar force-pushed
on Jun 5, 2024
DrahtBot added the label
CI failed
on Jun 5, 2024
sdaftuar force-pushed
on Jun 5, 2024
DrahtBot removed the label
CI failed
on Jun 5, 2024
sdaftuar force-pushed
on Jun 5, 2024
DrahtBot added the label
Needs rebase
on Jun 7, 2024
sdaftuar force-pushed
on Jun 10, 2024
DrahtBot removed the label
Needs rebase
on Jun 10, 2024
DrahtBot added the label
CI failed
on Jun 10, 2024
DrahtBot added the label
Needs rebase
on Jun 11, 2024
sdaftuar force-pushed
on Jun 12, 2024
sdaftuar force-pushed
on Jun 12, 2024
sdaftuar force-pushed
on Jun 14, 2024
DrahtBot removed the label
Needs rebase
on Jun 14, 2024
DrahtBot removed the label
CI failed
on Jun 14, 2024
DrahtBot added the label
Needs rebase
on Jun 17, 2024
sdaftuar force-pushed
on Jun 18, 2024
DrahtBot removed the label
Needs rebase
on Jun 18, 2024
DrahtBot added the label
CI failed
on Jun 18, 2024
DrahtBot
commented at 8:03 pm on June 18, 2024:
contributor
🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.
Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
Leave a comment here, if you need help tracking down a confusing failure.
DrahtBot added the label
Needs rebase
on Jun 19, 2024
sdaftuar force-pushed
on Jun 26, 2024
DrahtBot removed the label
Needs rebase
on Jun 26, 2024
clusterlin: introduce cluster_linearize.h with Cluster and DepGraph types
This primarily adds the DepGraph class, which encapsulates precomputed
ancestor/descendant information for a given transaction cluster, with a
number of utility features (inspectors for set feerates, computing
reduced parents/children, adding transactions, adding dependencies) that
will be needed in future commits.
926da79d11
tests: Fuzzing framework for DepGraph class
This introduces a bespoke fuzzing-focused serialization format for DepGraphs,
and then tests that this format can represent any graph, roundtrips, and then
uses that to test the correctness of DepGraph itself.
This forms the basis for future fuzz tests that need to work with interesting
graphs.
0efd17c52e
clusterlin: add AncestorCandidateFinder class
This is a class that encapsulates precomputed ancestor set feerates and
presents an interface for getting the best remaining ancestor set.
89de978ff6
clusterlin: add SearchCandidateFinder class
Similar to AncestorCandidateFinder, this encapsulates the state needed for
finding good candidate sets using a search algorithm.
6da4d8ec58
clusterlin: add Linearize function
This adds a first version of the overall linearization interface, which given
a DepGraph constructs a good linearization, by incrementally including good
candidate sets (found using AncestorCandidateFinder and SearchCandidateFinder).
3f2af80683
bench: Candidate finding and linearization benchmarks
Add benchmarks for known bad graphs for the purpose of search (as
an upper bound on work per search iteration) and ancestor sorting
(as an upper bound on linearization work with no search iterations).
959b37c487
clusterlin: use bounded BFS exploration (optimization)
Switch to BFS exploration of the search tree in SearchCandidateFinder
instead of DFS exploration. This appears to behave better for real
world clusters.
As BFS has the downside of needing far larger search queues, switch
back to DFS temporarily when the queue grows too large.
7496980f6f
clusterlin: randomize the SearchCandidateFinder search order
To make search non-deterministic, change the BFS logic from always picking
the first queue item to randomly picking the first or second queue item.
87862a9f72
clusterlin: permit passing in existing linearization to Linearize
This implements the LIMO algorithm for linearizing by improving an existing
linearization. See
https://delvingbitcoin.org/t/limo-combining-the-best-parts-of-linearization-search-and-merging
for details.
443f7c31c3
clusterlin: add algorithms for connectedness/connected components
Add utility functions to DepGraph for finding connected components.
clusterlin: add MergeLinearizations function + fuzz test + benchmark
6e2b004b62
Add is_visited helper to epochguard
057808e355
Add txgraph module
67c09bdad3
add fuzz test for txgraph
c01dd0c8c2
Make CTxMemPoolEntry derive from TxEntry
64df69c425
Track clusters in mempool with TxGraph
ee84336fb2
Limit mempool size based on chunk feerate
Rather than evicting the transactions with the lowest descendant feerate,
instead evict transactions that have the lowest chunk feerate.
Once mining is implemented based on choosing transactions with highest chunk
feerate (see next commit), mining and eviction will be opposites, so that we
will evict the transactions that would be mined last.
7883064f04
Select transactions for blocks based on chunk feerate
862a8fd5b2
Add new (unused) limits for cluster size/count
8759f835e2
Do not allow mempool clusters to exceed configured limits
1ae45cfb64
policy: Remove CPFP carveout rule
The addition of a cluster size limit makes the CPFP carveout rule useless,
because carveout cannot be used to bypass the cluster size limit. Remove this
policy rule and update tests to no longer rely on the behavior.
5b2c4d342d
Implement new RBF logic for cluster mempool
With a total ordering on mempool transactions, we are now able to calculate a
transaction's mining score at all times. Use this to improve the RBF logic:
- we no longer enforce a "no new unconfirmed parents" rule
- we now require that the mempool's feerate diagram must improve in order
to accept a replacement
TODO: update functional test feature_rbf.py to cover all our new scenarios.
deadfd6b86
==== END CLUSTER IMPLEMENTATION ====
4e6440ae02
==== BEGIN MEMPOOL CLEANUP ====
7ddbd12ee6
Remove the ancestor and descendant indices from the mempool
9b52bf30bc
Use cluster linearization for transaction relay sort order
Previously, transaction batches were first sorted by ancestor count and then
feerate, to ensure transactions are announced in a topologically valid order,
while prioritizing higher feerate transactions. Ancestor count is a crude
topological sort criterion, so replace this with linearization order so that the
highest feerate transactions (as would be observed by the mining algorithm) are
relayed before lower feerate ones, in a topologically valid way.
This also fixes a test that only worked due to the ancestor-count-based sort
order.
c312e94dfb
Remove CTxMemPool::GetSortedDepthAndScore
The mempool clusters and linearization permit sorting the mempool topologically
without making use of ancestor counts.
ac15997109
Reimplement GetTransactionAncestry() to not rely on cached data
In preparation for removing ancestor data from CTxMemPoolEntry, recalculate the
ancestor statistics on demand wherever needed.
6b9b57141e
rpc: Calculate ancestor data from scratch for mempool rpc calls
5ed154663c
Remove dependency on cached ancestor data in mini-miner
8793cffb90
Stop enforcing ancestor size/count limits
The cluster limits should be sufficient.
03a49e50b6
Add test case for cluster size limits to v3 logic
0becdc70d4
Use mempool/txgraph to determine if a tx has descendants
Remove a reference to GetCountWithDescendants() in preparation for removing
this function and the associated cached state from the mempool.
bbcb8048ef
Calculate descendant information for mempool RPC output on-the-fly
This is in preparation for removing the cached descendant state from the
mempool.
2f5426ae9e
test: fix rbf carveout test in mempool_limit.py
Minimal fix to the test that the RBF carveout doesn't apply in certain package
validation cases. Now that RBF carveout doesn't exist, we can just test that
the cluster count limit is respected (in preparation for removing the
descendant limit altogether).
551d41d27a
Stop enforcing descendant size/count limits
Cluster size limits should be enough.
96d470fd72
Eliminate RBF workaround for CPFP carveout transactions
The new cluster mempool RBF rules take into account clusters sizes exactly, so
with the removal of descendant count enforcement this idea is obsolete.
c6e91be89d
wallet: Replace max descendantsize with clustersize
With the descendant size limits removed, replace the concept of "max number of
descendants of any ancestor of a given tx" with the cluster count of the cluster
that the transaction belongs to.
c4821e81e9
mempool: Remove unused function CalculateDescendantMaximum
bfb8714a8e
Eliminate use of cached ancestor data in miniminer_tests and v3_policy
0a9927d647
mempool: eliminate accessors to mempool entry ancestor/descendant cached state
329fd176c6
Remove unused members from CTxMemPoolEntry
56c33b907b
Remove mempool logic designed to maintain ancestor/descendant state
4000084160
mempool: addUnchecked no longer needs ancestors
589976a22a
Remove unused limits from CalculateMemPoolAncestors
b022093804
Make removeConflicts private
6da99e095b
==== END MEMPOOL CLEANUP ====
9e70ac28a1
==== BEGIN OPTIMIZATIONS ====
923852eebe
Rework removeForBlock so that clusters are only touched once
Also remove extra linearization that was happening and some logging
Update interface_zmq.py for new block connection behavior
857bb27f15
Simplify ancestor calculation functions
Now that ancestor calculation never fails (due to ancestor/descendant limits
being eliminated), we can eliminate the error handling from
CalculateMemPoolAncestors.
interface_zmq test is broken
ee9c0b6caf
Use txgraph to calculate ancestors
c544191fc7
Use txgraph to calculate descendants
cf609e6989
Move GetNumChildren() to be a mempool function
382ddb619b
Make getting parents/children a function of the mempool, not a mempool entry
99d086c133
Rewrite GatherClusters to use the txgraph implementation
d6ee17f417
Switch to using txgraph parents/children in public accessors
c604867de1
Stop tracking parents/children outside of txgraph
11be533b37
Remove unused argument to RemoveStaged
c2c232655a
Switch to using the faster CalculateDescendants
The only place we still use the older interface is in policy/rbf.cpp, where
it's helpful to incrementally calculate descendants to avoid calculating too
many at once (or cluttering the CalculateDescendants interface with a
calculation limit).
09f3ca251a
Rewrite removeRecursive to use vectors instead of sets
ff71fa68fe
Rewrite Expire to only invoke CalculateDescendants once
645669492f
Rewrite removeForReorg to use a vector instead of a set
88bb9d8214
Drop unnecessary set in TrimToSize
e2eb8046f7
RemoveStaged takes a vector, not a set
972ec6009a
Only use CalculateDescendants() with vectors, not sets
49cb7a70c7
Use faster CalculateMemPoolAncestors in rbf
e4ee25faf7
Use faster CMPA in rpc/mempool
3e4966e8a6
Eliminate need for ancestors in SingleV3Checks
43043c991b
Eliminate need for ancestors in PackageV3Checks
TO DO: Rewrite unit tests for PV3C to not lie about mempool parents, so that we
can push down the parent calculation into v3_policy from validation.
5c045fcaa3
Don't calculate ancestors except for RBF transactions
9479f87830
==== END OPTIMIZATIONS ====
bcafdf1556
==== BEGIN TESTS ====
b575eadcb7
bench: add more mempool benchmarks
Add benchmarks for:
- mempool update time when blocks are found
- adding a transaction
- performing the mempool's RBF calculation
- calculating mempool ancestors/descendants
95b4a5a4f9
fuzz: try to add more code coverage for mempool fuzzing
Including test coverage for mempool eviction and expiry
a0fb9cbfba
Pass through cluster size limits to TxGraph::check()
3153247470
Expose cluster information via rpc
57ed7264c6
doc: Update mempool_replacements.md to reflect feerate diagram checks
79a8b3dfce
test: add functional test for new cluster mempool RPCs
6532117bf4
fuzz: remove comparison between mini_miner block construction and miner
This is in preparation for eliminating the block template building happening in
mini_miner, in favor of directly using the linearizations done in the mempool.
e9a67a31c8
try to make ci happy with zmq cppflags
e5ba9df591
sdaftuar force-pushed
on Jun 28, 2024
fixup! Implement new RBF logic for cluster mempool
1f9de856b6
DrahtBot removed the label
CI failed
on Jun 28, 2024
This is a metadata mirror of the GitHub repository
bitcoin/bitcoin.
This site is not affiliated with GitHub.
Content is generated from a GitHub metadata backup.
generated: 2024-06-29 10:13 UTC
This site is hosted by @0xB10C More mirrored repositories can be found on mirror.b10c.me