qa: fix an off-by-one in utxo snapshot fuzz target and sanity check its snapshot data

darosior commented at 9:18 pm on February 19, 2025: member

The assumeutxo data for the fuzz target could change and invalidate the hash silently, preventing the fuzz target from reaching some code paths. Fix this by introducing a unit test which would break if the snapshot data the fuzz target relies on were to change.

In implementing this i noticed the height used for coins in the fuzz target is actually off-by-one (as if the first block in the created chain was the genesis but it’s block 1), so fix that too.

qa: correct off-by-one in utxo snapshot fuzz target

The chain starts at block 1, not genesis.

d1527f6b88

DrahtBot commented at 9:18 pm on February 19, 2025: contributor

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage & Benchmarks

For details see: https://corecheck.dev/bitcoin/bitcoin/pulls/31910.

Reviews

See the guideline for information on the review process.

Type	Reviewers
ACK	mzumsande, fjahr
Ignored review	dergoegge

If your review is incorrectly listed, please react with 👎 to this comment and the bot will ignore it on the next update.

Conflicts

No conflicts as of last run.

DrahtBot added the label Tests on Feb 19, 2025

darosior commented at 9:18 pm on February 19, 2025: member

cc @mzumsande. This was also suggested by @ryanofsky in #30329 (review).

darosior force-pushed on Feb 19, 2025

DrahtBot commented at 9:43 pm on February 19, 2025: contributor

🚧 At least one of the CI tasks failed. Debug: https://github.com/bitcoin/bitcoin/runs/37498360174

Try to run the tests locally, according to the documentation. However, a CI failure may still happen due to a number of reasons, for example:

Possibly due to a silent merge conflict (the changes in this pull request being incompatible with the current code in the target branch). If so, make sure to rebase on the latest commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the affected test.
An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

DrahtBot added the label CI failed on Feb 19, 2025

DrahtBot removed the label CI failed on Feb 19, 2025

dergoegge approved

dergoegge commented at 10:33 am on February 20, 2025: member

utACK a6d5ef3f604e646ae56c3a5c1aada48f9e134435

in src/test/validation_chainstatemanager_tests.cpp:848 in a6d5ef3f60 outdated

843+{
844+    const auto snapshot_height{200};
845+
846+    // Mine the same 200 blocks as the `utxo_snapshot` fuzz target.
847+    for (auto& block: CreateBlockChain(snapshot_height, m_node.chainman->GetParams())) {
848+        MineBlock(m_node, block);

maflcko commented at 10:50 am on February 20, 2025:

I don’t understand this pull request and I don’t think it is possible, but I may be missing something obvious.

My understanding is that the blockchain in the fuzzing binary may always be different, because it will have a weaker pow check, or is there something that ensures the pow check in the fuzzing binary in this fuzz target behaves identical to the one in the unit tests?

dergoegge commented at 11:42 am on February 20, 2025:

Actually valid PoW will pass the reduced fuzz PoW check as well. But you’re right I think the block hashes in the chain produced in the fuzz test and the unit test will be different.

We could move the assertion to the init function in the fuzz test, that should fail right now (i think).

darosior commented at 2:55 pm on February 20, 2025:

The fuzz mock hash[31] & 0x80 == 0 is equivalent to the pow check against regtest min difficulty used in the unit test. If one passes the other necessarily does. This mean both will mine the same chain. You can check this as they both arrive to the same block hash at height 200, with the same utxo set hash.

darosior commented at 3:22 pm on February 20, 2025:

Nevertheless this is not obvious and Niklas’ suggestion of having it in the initialization of the fuzz target makes sense so i just did that too.

maflcko commented at 4:47 pm on February 20, 2025:

The fuzz mock hash[31] & 0x80 == 0 is equivalent to the pow check against regtest min difficulty used in the unit test

Just for reference, I don’t think this is true. It simply happens to pass because it is the most likely outcome, given the number of regtest blocks, but that is not guaranteed. Block hashes do exist where this is not true. For example:

0
1{
2    const auto consensus = CreateChainParams(*m_node.args, ChainType::REGTEST)->GetConsensus();
3    unsigned int nBits{0x207fffff};
4    uint256 hash;
5    hash = uint256{"7fffff0000000000000000000000000000000000000000000000000000000000"};
6    Assert(CheckProofOfWork(hash, nBits, consensus));
7    hash = uint256{"7fffff0000000000000000000000000000000000000000000000000000000001"};
8    Assert(CheckProofOfWork(hash, nBits, consensus)); // fails in the unit test, passes in the fuzz test
9}

darosior commented at 5:13 pm on February 20, 2025:

0    hash = uint256{"7fffff0000000000000000000000000000000000000000000000000000000001"};
1    Assert(CheckProofOfWork(hash, nBits, consensus)); // fails in the unit test, passes in the fuzz test

I don’t think this actually passes in the fuzz test. The hex representation is in reversed byte order. hash[31] here is 0x7f. If it had its highest bit set (that is hash[31] & 0x80 != 0 then it would be necessarily be higher than the regtest max target.

All values with the highest bit set would be higher than the regtest maximum target (which is 7fffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffff). These are 8000000000000000000000000000000000000000000000000000000000000000 to ffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffff. Therefore i think my original point stands.

maflcko commented at 6:05 pm on February 20, 2025:

Yes, the high bit is always unset, otherwise the check would fail on both tests.

However that does not mean all remaining bits are always the same in the block hash on regtest and “fuzznet”. “fuzznet” doesn’t care about the remaining bits at all. regtest does check them.

You can copy-paste the c++ code above and check it for yourself locally.

darosior commented at 7:41 pm on February 20, 2025:

You are right, i missed how the genesis block’s nBits does not correspond to the powLimit of 7fffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffff but instead encodes a rounded down target of 7fffff0000000000000000000000000000000000000000000000000000000000. This means that all values from 7fffff0000000000000000000000000000000000000000000000000000000001 to 7fffffffffffffffffffffffffffffffffffffffffffffffffffffffffffffff would have the highest bit unset but also fail in the unit test. If the nBits didn’t round down the maximum target the two would match exactly.

Thanks for the patience.

maflcko commented at 10:51 am on February 20, 2025: member

left a question. Also, did you check the coverage is not decreased? Ref: https://maflcko.github.io/b-c-cov/fuzz.coverage/src/test/fuzz/utxo_snapshot.cpp.gcov.html

darosior force-pushed on Feb 20, 2025

darosior commented at 3:30 pm on February 20, 2025: member

Just pushed an update to have the sanity check in the fuzz target initialization instead. Despite the previous version being technically correct it was so for non-obvious reasons and it makes sense to have the sanity check next to where the value will be used.

You can check it serves its intended purpose with the following diff for instance:

 0diff --git a/src/test/util/mining.cpp b/src/test/util/mining.cpp
 1index e006a2da629..443254ca503 100644
 2--- a/src/test/util/mining.cpp
 3+++ b/src/test/util/mining.cpp
 4@@ -43,6 +43,7 @@ std::vector<std::shared_ptr<CBlock>> CreateBlockChain(size_t total_height, const
 5         coinbase_tx.vout.resize(1);
 6         coinbase_tx.vout[0].scriptPubKey = P2WSH_OP_TRUE;
 7         coinbase_tx.vout[0].nValue = GetBlockSubsidy(height + 1, params.GetConsensus());
 8+        coinbase_tx.nLockTime = height;
 9         coinbase_tx.vin[0].scriptSig = CScript() << (height + 1) << OP_0;
10         block.vtx = {MakeTransactionRef(std::move(coinbase_tx))};

Running the fuzz target with this diff would make the invalidation of the snapshot value in chainparams explicit:

0test/fuzz/utxo_snapshot.cpp:65 sanity_check_snapshot: Assertion `stats.hashBlock == cp_au_data.blockhash' failed.
1Aborted

Whereas if you apply the same diff on master the target will happily run and never be able to find a valid snapshot.

Also, did you check the coverage is not decreased?

I did not generate a full coverage report but did check it was still possible for the target to find a valid snapshot and also check the cov: value reported by libfuzzer was the same on master and this branch (4200 without sanitizers).

mzumsande commented at 4:54 pm on February 20, 2025: contributor

I did not generate a full coverage report but did check it was still possible for the target to find a valid snapshot and also check the cov: value reported by libfuzzer was the same on master and this branch (4200 without sanitizers).

More strictly, I’d expect that a fuzz seed that generated a valid snapshot before this change would also generate one with the change - the loop iteration that isn’t done anymore doesn’t make use of fuzzed_data_provider. So existing seeds shouldn’t be invalidated.

in src/test/util/mining.cpp:98 in 6083d2edba outdated

91@@ -92,6 +92,11 @@ COutPoint MineBlock(const NodeContext& node, std::shared_ptr<CBlock>& block)
92         assert(block->nNonce);
93     }
94 
95+    return ConnectBlock(node, block);
96+}
97+
98+COutPoint ConnectBlock(const NodeContext& node, const std::shared_ptr<CBlock>& block)

mzumsande commented at 9:42 pm on February 21, 2025:

I’d prefer to call this wrapper around ProcessNewBlock some other name (e.g. ProcessBlock() like net_processing does) to avoid confusion with the existing ConnectBlock in validation. The added function doesn’t necessarily attempt to connect the block, for example if a conflicting block arrived / was connected in between mining and calling ProcessNewBlock the real ConnectBlock may never be called.

darosior commented at 7:56 pm on February 25, 2025:

Done.

test util: split up ConnectBlock from MineBlock 3b85eba83a

fuzz: sanity check hardcoded snapshot in utxo_snapshot target

The assumeutxo data for the fuzz target could change and invalidate the hash silently, preventing
the fuzz target from reaching some code paths.

Fix this by sanity checking the snapshot values during initialization.

63b534f97e

darosior force-pushed on Feb 25, 2025

mzumsande commented at 11:09 pm on March 3, 2025: contributor

Code Review ACK 63b534f97e591d4e107fd5148909852eb2965d27

fanquake requested review from fjahr on Mar 17, 2025

in src/test/util/mining.h:36 in 3b85eba83a outdated

31@@ -32,6 +32,11 @@ COutPoint MineBlock(const node::NodeContext&,
32  **/
33 COutPoint MineBlock(const node::NodeContext&, std::shared_ptr<CBlock>& block);
34 
35+/**
36+ * Returns the generated coin (or Null if the block was invalid).

fjahr commented at 5:15 pm on March 18, 2025:

nit: Hm, this now has the exact same text here as MineBlock above and in general it’s doesn’t explain what happens internally. I think there should be something here or above that makes clear what the difference is. Maybe add something like “Expects the block submitted to have valid PoW.” or so.

darosior commented at 9:33 pm on March 18, 2025:

Thanks for the review.

I figured its name already implies what it’s doing and expecting (an entirely valid block, not only valid PoW), and didn’t want to just add a comment like “Process a block” (akin to PrepareBlock right below whose docstring states “Prepare a block”). I felt only the return value was not clear from the function declaration, hence why specifying it in the docstring.

fjahr commented at 9:04 pm on March 18, 2025: contributor

tACK 63b534f97e591d4e107fd5148909852eb2965d27

Reviewed the code and played around with the fuzzer changes a bit.

fanquake merged this on Mar 21, 2025

fanquake closed this on Mar 21, 2025

darosior deleted the branch on Mar 21, 2025

qa: fix an off-by-one in utxo snapshot fuzz target and sanity check its snapshot data #31910

Code Coverage & Benchmarks

Reviews

Conflicts