furszy
commented at 1:36 pm on January 25, 2023:
member
The current procedure for building the block filter index processes filters one at a time,
reading blocks, undo data, and previous headers from disk sequentially.
This PR introduces a new mechanism to perform the work concurrently, dividing the filter
generation workload among a pool of workers whose size can be configured by the user.
This significantly speeds up index construction.
The same concurrent processing model has been applied to the transactions index as well.
The newly introduced init flag -indexworkers=<n> enables the concurrent sync
behavior, where “n” is the number of worker threads spawned at startup to create ranges
of block filters during the initial sync process. The worker pool is destroyed once the
initial sync completes.
Note: by default, the parallelized sync process is not enabled.
Now the juicy part:
On my computer, with the node in debug mode and in IBD, with -indexworkers=4, the
block filter index generation took less than an hour, while on master the sync took more than 7 hours.
Important Note:
As the access to the block data on disk is protected by cs_main, this new feature runs substantially
faster when the node is not in IBD.
Testing Notes:
Sync your node without any index.
Restart the node with one of the indexes (-blockfilterindex or -txindex) and -connect=0 (to sync only the index, without running the net/validation threads. Since threads won’t be competing for cs_main, this will give you a more accurate result).
You’ll see a “[index name] is enabled at height [height]” log entry once it finishes. Then it’s just a matter of subtracting the index startup time from the “index synced” log time.
Keep in mind that threads are shared among all indexes you start. So if you run both indexes at the same time, your benchmark results cannot be compared against single-index runs.
If your review is incorrectly listed, please react with 👎 to this comment and the bot will ignore it on the next update.
Conflicts
Reviewers, this pull request conflicts with the following ones:
#33689 (http: replace WorkQueue and single threads handling for ThreadPool by furszy)
#31308 (ci, iwyu: Treat warnings as errors for specific directories by hebasto)
#31260 (scripted-diff: Type-safe settings retrieval by ryanofsky)
#29770 (index: Check all necessary block data is available before starting to sync by fjahr)
#29700 (kernel, refactor: return error status on all fatal errors by ryanofsky)
#17783 (common: Disallow calling IsArgSet() on ALLOW_LIST options by ryanofsky)
#17581 (refactor: Remove settings merge reverse precedence code by ryanofsky)
#17580 (refactor: Add ALLOW_LIST flags and enforce usage in CheckArgFlags by ryanofsky)
#17493 (util: Forbid ambiguous multiple assignments in config file by ryanofsky)
If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.
LLM Linter (✨ experimental)
Possible typos and grammar issues:
a posterior -> later [“a posterior” is incorrect here; use “later” (or “posterior”) to convey a subsequent check]
DrahtBot added the label
UTXO Db and Indexes
on Jan 25, 2023
furszy force-pushed
on Jan 25, 2023
furszy force-pushed
on Jan 25, 2023
furszy force-pushed
on Jan 25, 2023
Sjors
commented at 2:38 pm on January 26, 2023:
member
Cool, will take it for a spin…
furszy force-pushed
on Jan 28, 2023
furszy
commented at 3:18 pm on January 28, 2023:
member
Cool @Sjors, just pushed a small update. Found a little bug.
Going to add an important note to the PR description (because otherwise testing results will vary a lot):
As the access to the block data on disk is protected by cs_main, this new feature runs substantially
faster when the node is not in IBD.
(where “substantially” here means a full index sync, with 5 workers, in less than 20 minutes on my computer).
furszy force-pushed
on Jan 28, 2023
mzumsande
commented at 10:49 pm on January 29, 2023:
contributor
It’s probably much easier suggested than done, but did you attempt to implement parallelization in a more general way so that other indices could benefit from it as well? On first glance, txindex, and the indices suggested in PRs (#24539, #26951) seem to be parallelizable as well (not coinstatsindex though).
furszy
commented at 1:37 pm on January 30, 2023:
member
It’s probably much easier suggested than done, but did you attempt to implement parallelization in a more general way so that other indices could benefit from it as well?
Yeah, that is part of the plan. I started with the block filter index because it requires special treatment that txindex parallelization does not: block/undo data reading and block filter creation can be parallelized, but writing must be done sequentially due to the need to link filter headers to their predecessors to create the filter chain on disk.
My idea was to start reviewing this one, so the process gets as clean as possible, and then move forward with the generalization step. It’s usually more natural to abstract processes when the specific cases are well-defined.
furszy force-pushed
on Jan 30, 2023
w0xlt
commented at 4:42 pm on January 30, 2023:
contributor
Concept ACK
Perhaps the parallelization and index logic could be kept separate, so it could be reused in other indexes.
furszy force-pushed
on Jan 30, 2023
furszy force-pushed
on Jan 30, 2023
Sjors
commented at 1:47 pm on February 7, 2023:
member
Building the index on (AMD Ryzen 7950X, blocks stored on SSD):
master @ fe86616bb4ad0c4296d34299bc2e2f0fca1fe936: 35'15" (mostly 1 alternating CPU thread)
this PR (rebased)
n=8: 5'20" (uses about 8 of 32 CPU threads as expected)
n=32: 4'26" (pleasantly close to 100% CPU usage with a dip every 10 seconds, but it drops to only 1 CPU in the last minute or two)
I made sure to not load any wallets and disabled other indexes.
I didn’t test if the index was correct.
I wonder if, for users without this index, it would be faster to generate the index, rescan the wallet and then delete it again. Combined with #26951 you would only have to generate filters up to the age of the wallet (IIUC, cc @pstratem).
Note to self for future benchmarks: -txindex takes 1:07'14", -coinstatsindex takes 3.5 hours.
furszy
commented at 2:18 pm on February 7, 2023:
member
Could also give it a run rebased on top of #27006.
On master, the index initial sync is slower when the node is in IBD because the index thread has to compete for access to block data on disk through cs_main acquisition.
I didn’t test if the index was correct.
The PR contains a test verifying it.
Side note:
I’m working on generalizing the parallelization flow so other indexes, like txindex and #26951, can make use of it too.
DrahtBot added the label
Needs rebase
on Feb 17, 2023
furszy force-pushed
on Feb 21, 2023
furszy force-pushed
on Feb 21, 2023
DrahtBot removed the label
Needs rebase
on Feb 21, 2023
furszy force-pushed
on Feb 22, 2023
furszy force-pushed
on Feb 27, 2023
furszy renamed this:
index: blockfilter initial sync speedup, parallelize process
index: blockfilter and txindex initial sync speedup, parallelize process
on Feb 27, 2023
furszy renamed this:
index: blockfilter and txindex initial sync speedup, parallelize process
index: blockfilter initial sync speedup, parallelize process
on Feb 27, 2023
furszy force-pushed
on Feb 27, 2023
furszy
commented at 12:40 pm on February 28, 2023:
member
PR updated; most of the implementation has changed.
What’s new:
Decreased ThreadSync cs_main lock contention.
Removed CBlockIndex access from the child indexes internals.
Implemented generic workers pool.
Introduced a last-header cache for the block filter index, avoiding disk reads on every newly processed block.
Enabled parallel sync on the tx index.
Important Note:
The worker pool spawned by the -indexworkers init arg is shared among all enabled indexes that support parallel sync.
The implementation uses std::any mainly to simplify the patch set; the base-class-template form of it requires a larger set of changes.
Side note: at first glance, and without going too deep into the coinstats index implementation, I would say that it could also be parallelized. But I will leave it for a follow-up so as not to keep expanding the PR size.
Future (doesn’t need to be included here, just mentioning so the path is clear):
I’m working on decoupling the initial sync logic into a separate structure, so indexes subscribe to events instead of reading blocks from disk by themselves.
DrahtBot added the label
Needs rebase
on May 11, 2023
DrahtBot removed the label
Needs rebase
on May 11, 2023
furszy force-pushed
on May 11, 2023
DrahtBot added the label
Needs rebase
on Jun 12, 2023
furszy force-pushed
on Jun 21, 2023
DrahtBot removed the label
Needs rebase
on Jun 21, 2023
DrahtBot added the label
Needs rebase
on Jun 30, 2023
furszy force-pushed
on Jun 30, 2023
DrahtBot removed the label
Needs rebase
on Jun 30, 2023
DrahtBot added the label
CI failed
on Jun 30, 2023
DrahtBot removed the label
CI failed
on Jul 4, 2023
DrahtBot added the label
Needs rebase
on Jul 6, 2023
furszy force-pushed
on Aug 9, 2023
DrahtBot removed the label
Needs rebase
on Aug 9, 2023
DrahtBot added the label
CI failed
on Aug 29, 2023
furszy force-pushed
on Aug 29, 2023
DrahtBot
commented at 8:34 pm on September 4, 2023:
contributor
Could you mark this as draft while CI is red?
furszy force-pushed
on Sep 4, 2023
DrahtBot
commented at 5:47 am on September 6, 2023:
contributor
It looks like tsan failed, but there is no log, when there should be a log. Maybe it was accidentally removed by #27667 ?
DrahtBot added the label
Needs rebase
on Sep 12, 2023
furszy force-pushed
on Sep 12, 2023
DrahtBot removed the label
CI failed
on Sep 12, 2023
DrahtBot removed the label
Needs rebase
on Sep 12, 2023
DrahtBot added the label
Needs rebase
on Oct 2, 2023
furszy force-pushed
on Oct 18, 2023
DrahtBot removed the label
Needs rebase
on Oct 18, 2023
DrahtBot added the label
CI failed
on Oct 18, 2023
maflcko
commented at 9:16 am on October 25, 2023:
member
CI is still red. Also, how would this work with AU?
furszy force-pushed
on Oct 27, 2023
furszy force-pushed
on Oct 27, 2023
DrahtBot removed the label
CI failed
on Oct 27, 2023
Sjors
commented at 11:09 am on November 7, 2023:
member
That PR currently does not cleanly cherry-pick on top of this PR, nor can I (trivially) rebase this PR on top of it. Happy to try if you can make a branch.
I just tried it again, deleting the blockfilterindex and rebuilding it. My impression is that it’s going slower than before and I’m not seeing much CPU activity.
I also noticed the shutdown takes a very long time, with the indexer (?) threads sticking around for many minutes:
in
src/index/blockfilterindex.h:46
in
003f076a15 (outdated)
@@ -42,6 +42,9 @@ class BlockFilterIndex final : public BaseIndex
     /** cache of block hash to filter header, to avoid disk access when responding to getcfcheckpt. */
     std::unordered_map<uint256, uint256, FilterHeaderHasher> m_headers_cache GUARDED_BY(m_cs_headers_cache);

+    // Last computed header to avoid disk reads at every new block.
+    uint256 last_header{};
TheCharlatan
commented at 12:58 pm on November 24, 2023:
Nit: Might take this opportunity to add the <uint256.h> header?
TheCharlatan
commented at 3:32 pm on November 24, 2023:
If I run IWYU locally, it reports the following headers as missing:
#include <cstddef>    // for size_t
#include <algorithm>  // for max
#include <atomic>     // for atomic
#include <functional> // for function
#include <memory>     // for make_shared
#include <stdexcept>  // for runtime_error
#include <utility>    // for move, swap
#include <vector>     // for vector
Because I initially thought of pushing extra work into the thread pool queue so that the originator thread, once it finishes calculating its own tasks, can take on the remaining workload during its active wait. But I ended up implementing it differently to re-use the same piece of code for the single-thread approach (you can check it by looking for “// Otherwise, this is an active-wait, so we process blocks until all workers finish.”).
Still, I think that having a method to process tasks manually makes sense for a general thread pool implementation. But I could remove it if you feel strongly about it.
andrewtoth
commented at 1:00 pm on October 26, 2024:
This would make sense if we wanted to reuse this for CCheckQueue. Then, we could just loop and ProcessTask on the main thread once when we call Wait.
in
src/util/threadpool.h:85
in
46e74257f2 (outdated)
+            m_work_queue.pop();
+        }
+
+        // Execute the task without the lock
+        WITH_REVERSE_LOCK(wait_lock, task());
+    }
TheCharlatan
commented at 12:37 pm on November 25, 2023:
I don’t think it would be necessary for the tasks themselves, since you already demonstrate in the tests how to retrieve exceptions, but I think some kind of exception handling and information logging similar to that of util::TraceThread would still be nice. Did you choose not to use TraceThread on purpose?
I don’t think it would be necessary for the tasks themselves, since you already demonstrate in the tests how to retrieve exceptions, but I think some kind of exception handling and information logging similar to that of util::TraceThread would still be nice. Did you choose not to use TraceThread on purpose?
I probably wasn’t aware of TraceThread when this was implemented. Changing it. Thanks!
in
src/test/threadpool_tests.cpp:76
in
46e74257f2 (outdated)
+    // 6) Busy workers, help them by processing tasks from outside.
+
+    // Test case 1, submit tasks and verify completion.
+    {
+        int num_workers = 3;
+        int num_tasks = 50;
TheCharlatan
commented at 12:44 pm on November 25, 2023:
Could use this test to stress test the pool a bit with more tasks?
Could use this test to stress test the pool a bit with more tasks?
What about creating a fuzz test instead? I’m not sure about the benefit of adding more tasks here. I can only foresee other developers complaining about increased unit test times.
TheCharlatan
commented at 3:19 pm on September 16, 2025:
I made a small fuzz test now. If you think it is useful, and want to add it:
Will check it out, thanks! I was having fun with the code first :).
brunoerg
commented at 11:29 am on September 17, 2025:
@TheCharlatan I tried to run this target and got 0 exec/s, which is pretty bad. I was expecting poor performance due to the number of tasks and the pool overhead, btw.
in
src/test/threadpool_tests.cpp:29
in
46e74257f2 (outdated)
TheCharlatan
commented at 12:54 pm on November 25, 2023:
contributor
This all looks pretty promising. I left some feedback before continuing with the latter half of the PR.
in
src/index/base.h:29
in
7b2d4d471a (outdated)
 namespace Consensus {
 struct Params;
 }

+/** Number of concurrent jobs during the initial sync process */
+const int16_t INDEX_WORKERS_COUNT = 0;
TheCharlatan
commented at 4:22 pm on November 25, 2023:
+    }
+
     // Start threads
-    for (auto index : node.indexes) if (!index->StartBackgroundSync()) return false;
+    for (auto index : node.indexes) {
+        // todo: Only provide thread pool to indexes that supports parallel sync
TheCharlatan
commented at 4:46 pm on November 25, 2023:
In commit 7b2d4d471ae251d4c22184dda26b73993a65eff8: Could the AllowParallelSync method be moved to this commit?
Nitty nit: Keep the order of workers_count and work_chunk consistent?
What do you mean? Set workers_count first, then work_chunk, always in the same order?
in
src/index/base.cpp:247
in
727be9d500 (outdated)
+        it_start = WITH_LOCK(::cs_main, return NextSyncBlock(it_end, m_chainstate->m_chain));
+    }
+}

+// If we have only one block to process, run it directly.
+// Otherwise, this is an active-wait, so we process blocks until all workers finish.
TheCharlatan
commented at 10:38 pm on November 25, 2023:
Would “so we also process blocks in this thread until all workers finish.” be more accurate?
TheCharlatan
commented at 10:52 pm on November 25, 2023:
In commit de69506c090f530f34d78478db39898bd4e2cbba: I think the test introduction should be squashed into the following commit, or do you want to demonstrate something here first?
In commit de69506: I think the test introduction should be squashed into the following commit, or do you want to demonstrate something here first?
hmm, will squash them. I don’t remember why I split them (2 years ago).
TheCharlatan
commented at 11:01 pm on November 25, 2023:
contributor
Concept ACK.
Done with my first pass, still want to think some of the approaches here over a bit.
in
src/init.cpp:2117
in
373b0bb077 (outdated)
+    if (node.args->IsArgSet("-indexworkers")) {
+        int index_workers = node.args->GetIntArg("-indexworkers", INDEX_WORKERS_COUNT);
+        if (index_workers < 0 || index_workers > 100) return InitError(_("Invalid -indexworkers arg"));
+
+        thread_pool = std::make_shared<ThreadPool>();
+        thread_pool->Start(index_workers);
TheCharlatan
commented at 4:08 pm on November 26, 2023:
Since the constructor and Start are always called in succession in this patch set, you could make ThreadPool more RAII styled if the Start method were removed and instead placed in the constructor. Could also make sense to do the same with the destructor and Stop.
DrahtBot added the label
Needs rebase
on Dec 14, 2023
furszy
commented at 3:22 pm on February 5, 2024:
member
Focus is on #28955, which contains a good number of commits decoupled from this PR.
Will come back here after it.
achow101 referenced this in commit
0b96a1925e
on Mar 20, 2024
furszy force-pushed
on Mar 20, 2024
DrahtBot removed the label
Needs rebase
on Mar 20, 2024
DrahtBot added the label
CI failed
on Mar 21, 2024
DrahtBot
commented at 1:36 am on March 21, 2024:
contributor
🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.
Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
Leave a comment here, if you need help tracking down a confusing failure.
Sjors
commented at 5:44 pm on March 21, 2024:
member
Running it again, let’s see how quick it is…
I might set indexworkers=32 in my config file, which is great for the initial sync and if it needs to do a big catch-up. But does it cause much overhead when it’s up to date? Maybe the threads should spin down if there’s not much work.
What happens when there are multiple readBlockFromDisk calls around the same time? I don’t see any (obvious) thread locking happening in CAutoFile. Since I keep block files on a spinning disk (-blocksdir), I wonder if that potentially slows things down - compared to fetching one file in a single uninterrupted operation.
So far (block 200K) my spinning disk is making a ton of noise and CPU activity is negligible.
It took 4 hours and 23 minutes. That’s an improvement over the 5 hours 46 minutes without: #28955 (comment)
Note that last year I tested with SSD - which resulted in a 16x improvement #26966 (comment). This time I used a spinning disk. CPU activity was negligible all the way.
furszy
commented at 5:54 pm on March 21, 2024:
member
Running it again, let’s see how quick it is…
Since I might set indexworkers=32 in my config file, which is great for the initial sync and if it needs to do a big catchup. But does it cause much overhead when it’s up to date? Maybe the threads should spin down if there’s not much work.
Yeah, sure. The thread pool can be destroyed once the initial sync of all indexes finishes (once an index is synced, it starts receiving blocks through the validation signals and no longer uses the initial sync workers).
I’m currently tackling theCharlatan’s feedback, and thinking about some improvements. Will add this change too on the next push.
DrahtBot removed the label
CI failed
on Mar 21, 2024
furszy force-pushed
on Mar 23, 2024
furszy
commented at 1:34 pm on March 23, 2024:
member
Thanks for the in-depth review theCharlatan! Most comments were tackled. And thanks for testing Sjors!
Now that we’ve reached this point (after #28955 merge), I’m rethinking and polishing the design. I’m not totally convinced about the current implementation anymore. We’ve grown a lot since this was implemented two years ago.
Other than that, Sjors had a nice idea that I want to try out (or at least design this in a way so that the idea can be implemented in isolation in the future): instead of dividing the work based on block ranges, it can be divided based on block file ranges. This would minimize the needle movement on spinning disks.
DrahtBot added the label
CI failed
on Mar 23, 2024
DrahtBot
commented at 2:10 pm on March 23, 2024:
contributor
🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.
Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
Leave a comment here, if you need help tracking down a confusing failure.
Sjors
commented at 9:06 am on March 25, 2024:
member
Instead of dividing the work based on block ranges, it can be divided based on block file ranges. This would minimize the needle movement on spinning disks.
It’s possible that this can be achieved with block ranges too. But you have to make sure only one block range is read at any given time, i.e. the other threads should wait while a disk read is in progress. I suspect the problem lies in having 32 threads trying to read different things at the same time, with the operating system fetching a few kilobytes here, a few kilobytes there, etc. Though I haven’t measured this.
DrahtBot removed the label
CI failed
on Apr 4, 2024
DrahtBot
commented at 7:54 am on April 5, 2024:
contributor
🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.
Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
Leave a comment here, if you need help tracking down a confusing failure.
DrahtBot removed the label
CI failed
on Apr 5, 2024
DrahtBot
commented at 8:37 am on April 5, 2024:
contributor
🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.
Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
Leave a comment here, if you need help tracking down a confusing failure.
DrahtBot removed the label
CI failed
on Apr 5, 2024
DrahtBot added the label
CI failed
on Apr 5, 2024
furszy force-pushed
on Apr 7, 2024
DrahtBot
commented at 5:19 pm on April 17, 2024:
contributor
Are you still working on this? If not, this could be moved to draft for as long as the tsan CI is failing.
furszy marked this as a draft
on Apr 18, 2024
DrahtBot added the label
Needs rebase
on Apr 30, 2024
furszy force-pushed
on May 1, 2024
DrahtBot removed the label
Needs rebase
on May 1, 2024
DrahtBot removed the label
CI failed
on May 1, 2024
DrahtBot added the label
CI failed
on Jun 14, 2024
DrahtBot
commented at 2:16 pm on June 14, 2024:
contributor
🚧 At least one of the CI tasks failed. Make sure to run all tests locally, according to the
documentation.
Possibly this is due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
Leave a comment here, if you need help tracking down a confusing failure.
DrahtBot added the label
Needs rebase
on Aug 5, 2024
furszy renamed this:
index: blockfilter initial sync speedup, parallelize process
index: initial sync speedup, parallelize process
on Aug 7, 2024
furszy force-pushed
on Aug 14, 2024
furszy force-pushed
on Aug 14, 2024
DrahtBot removed the label
Needs rebase
on Aug 14, 2024
hebasto added the label
Needs CMake port
on Aug 16, 2024
maflcko removed the label
Needs CMake port
on Aug 29, 2024
DrahtBot added the label
Needs rebase
on Sep 2, 2024
furszy force-pushed
on Oct 1, 2024
DrahtBot removed the label
Needs rebase
on Oct 1, 2024
in
src/util/threadpool.h:72
in
81dcee9464 (outdated)
+        {
+            // Wait for the task or until the stop flag is set
+            m_condition.wait(wait_lock, [&]() EXCLUSIVE_LOCKS_REQUIRED(cs_work_queue) { return m_interrupt || !m_work_queue.empty(); });
+
+            // If stopped, exit worker.
+            if (m_interrupt && m_work_queue.empty()) {
andrewtoth
commented at 2:54 pm on October 23, 2024:
If we receive an interrupt, we don’t also want to wait for the work queue to be empty, right?
andrewtoth
commented at 4:52 pm on October 27, 2024:
I tried this and it causes memory errors, since the remaining futures will have a dangling ref to m_condition.
In commit “util: introduce general purpose thread pool” (a382501d798c02a4dc042bc0641ce7b99c2db40f)
I tried this and it causes memory errors, since the remaining futures will have a dangling ref to m_condition.
To be clear, you tried dropping the m_work_queue.empty() condition and that didn’t work, so the current code is correct? If so we could probably mark this thread resolved (if I am not misinterpreting).
andrewtoth
commented at 10:12 pm on April 9, 2025:
Err, I still think we would want to abandon the work queue if we are interrupted instead of waiting for it to finish, no? My naive approach of not waiting for the queue to be empty does not work though.
Hmm, sorry for the very very late response @andrewtoth. I missed this message completely.
I still think we would want to abandon the work queue if we are interrupted instead of waiting for it to finish, no? My naive approach of not waiting for the queue to be empty does not work though.
Yeah, I don’t think that’s safe. Other threads might be waiting on the tasks’ futures to complete, so exiting without notifying them would leave them blocked forever.
What we could do (and I think you mentioned this elsewhere) is keep track of all the tasks’ promises internally and fail them with an interruption error/exception if the thread pool gets interrupted. In this way we also avoid lingering objects.
andrewtoth
commented at 3:02 pm on October 23, 2024:
Can we modify this to be more generic and return a type? And add logic to collect all returned values into a shared vector which can then be atomically swapped out by an observer? Possibly not in this PR, but if this will be split out into a generic thread pool.
Can we modify this to be more generic and return a type? And add logic to collect all returned values into a shared vector which can then be atomically swapped out by an observer? Possibly not in this PR, but if this will be split out into a generic thread pool.
We could keep track of the task futures’ promises, if that’s what you’re referring to. In other words, this void() function is just a wrapper that executes a generic function which sets the result inside the caller’s future.
andrewtoth
commented at 6:11 pm on November 18, 2024:
contributor
I’m using the ThreadPool here in #31132 as a cherry-picked commit, modulo changing ThreadPool() {} to ThreadPool() = default;. Perhaps we could pull this out to a separate PR since it would be useful for both changes.
One request for the ThreadPool would be to track in-flight tasks being executed. That way we could write tests that ensure all tasks have been completed before continuing, even if we don’t have access to the futures.
DrahtBot added the label
Needs rebase
on Jan 22, 2025
furszy force-pushed
on Feb 25, 2025
DrahtBot removed the label
CI failed
on Feb 25, 2025
DrahtBot removed the label
Needs rebase
on Feb 25, 2025
in
src/test/threadpool_tests.cpp:53
in
349b09983d (outdated)
+
+BOOST_AUTO_TEST_CASE(threadpool_basic)
+{
+    // Test Cases
+    // 1) Submit tasks and verify completion.
+    // 2) Maintain all threads busy except one.
yancyribbens
commented at 10:37 pm on February 26, 2025:
// 2) Maintain all busy threads except one.
DrahtBot added the label
Needs rebase
on Mar 12, 2025
in
src/index/base.cpp:223
in
c4723fb985 (outdated)
-    if (!CustomAppend(block_info)) {
-        FatalErrorf("%s: Failed to write block %s to index database",
-                    __func__, pindex->GetBlockHash().ToString());
-        return;
-    }
+    if (!ProcessBlock(pindex)) break; // error logged internally
In commit “index: remove CBlockIndex access from node internals” (c4723fb)
It seems like this should be returning, not breaking. Would be good to change or clarify with a comment
Yeah, good catch!
I think in the end it doesn’t matter much because ProcessBlock aborts the node on failure, but still, this would have logged an extra “index enabled at height” line during shutdown.
In commit “index: remove CBlockIndex access from node internals” (c4723fb9857c624ac0e1e034dc6c29d7821f5b4a)
I’m not sure this is safe. It looks like if block height is 0 this is taking a reference to a temporary object that will go out scope.
In any case I think this could be simplified to const CBlockUndo& block_undo{*Assert(block.undo_data)} because it looks like pointer will be set above even if height is 0.
In commit “index: remove CBlockIndex access from node internals” (c4723fb)
I’m not sure this is safe. It looks like if block height is 0 this is taking a reference to a temporary object that will go out scope.
In any case I think this could be simplified to const CBlockUndo& block_undo{*Assert(block.undo_data)} because it looks like pointer will be set above even if height is 0.
In commit “util: introduce general purpose thread pool” (a382501d798c02a4dc042bc0641ce7b99c2db40f)
I don’t think it makes sense for m_interrupt to use the CThreadInterrupt class because the ThreadPool class already has its own mutex and condition variable and it would be wasteful to introduce more. I think you could just replace CThreadInterrupt with bool here, and replace m_interrupt(); with m_interrupt = true and replace m_interrupt.reset() with m_interrupt = false while holding the mutex.
In commit “util: introduce general purpose thread pool” (a382501)
I don’t think it makes sense for m_interrupt to use the CThreadInterrupt class because the ThreadPool class already has its own mutex and condition variable and it would be wasteful to introduce more. I think you could just replace CThreadInterrupt with bool here, and replace m_interrupt(); with m_interrupt = true and replace m_interrupt.reset() with m_interrupt = false while holding the mutex.
sure, done as suggested. Thanks!
in
src/sync.h:302
in
a382501d79 (outdated)
@@ -299,6 +299,7 @@ inline MutexType* MaybeCheckNotHeld(MutexType* m) LOCKS_EXCLUDED(m) LOCK_RETURNED(m)
 //! The above is detectable at compile-time with the -Wreturn-local-addr flag in
 //! gcc and the -Wreturn-stack-address flag in clang, both enabled by default.
 #define WITH_LOCK(cs, code) (MaybeCheckNotHeld(cs), [&]() -> decltype(auto) { LOCK(cs); code; }())
+#define WITH_REVERSE_LOCK(cs, code) ([&]() -> decltype(auto) { REVERSE_LOCK(cs); code; }())
In commit “util: introduce general purpose thread pool” (a382501)
This seems ok, but it is only used in one place, where WITH_REVERSE_LOCK is not much simpler than plain REVERSE_LOCK. Could consider dropping it.
Sure. I think I did it this way to be very explicit about the code that will be executed without the lock, but yeah, we could achieve the same outcome with another set of brackets too.
In commit “util: introduce general purpose thread pool” (a382501d798c02a4dc042bc0641ce7b99c2db40f)
Not important, but would suggest simplifying naming and just calling these members:
Mutex m_mutex;
std::condition_variable m_cv;
The cs_ prefix is an older convention that comes from Windows code, and as long as this class is going to have one mutex there isn’t really a reason to use a more complicated name.
In commit “index: implement index parallel sync” (bc0e5211e8a914e80e68e9c700b846a4cc3ef95b)
This doesn’t seem like a great use of shared_ptr because it makes the shutdown sequence more complicated than it needs to be. I think it would be clearer if, instead of taking std::shared_ptr<ThreadPool> references, they just used ThreadPool& and a new std::unique_ptr<ThreadPool> m_index_threads member was added to NodeContext. This way the threads could be stopped with an explicit reset() call instead of shutting down more unpredictably when the last index is destroyed.
Sounds good. Done as suggested. Thanks!
in
src/index/base.cpp:242
in
bc0e5211e8outdated
238+
239+ if (parallel_sync_enabled) {
240+ const int max_blocks_to_sync = m_tasks_per_worker * m_thread_pool->WorkersCount() + m_tasks_per_worker; // extra 'm_tasks_per_worker' due the active-wait.
241+ const int tip_height = WITH_LOCK(cs_main, return m_chainstate->m_chain.Height());
242+ const int remaining_blocks = tip_height - pindex_next->nHeight;
243+ work_chunk = remaining_blocks > max_blocks_to_sync ? m_tasks_per_worker : remaining_blocks / (m_thread_pool->WorkersCount() + 1);
In commit “index: implement index parallel sync” (bc0e5211e8a914e80e68e9c700b846a4cc3ef95b)
IMO, this would be clearer if it were called blocks_per_worker or blocks_per_chunk instead of tasks_per_worker. Not knowing that a task is a block made the code harder to understand when initially reading it.
sure. Done as suggested.
in
src/index/base.cpp:238
in
bc0e5211e8outdated
234+ int work_chunk = 1;
235+ int workers_count = 0;
236+ std::vector<std::future<std::vector<std::any>>> futures;
237+ const CBlockIndex* it_start = pindex_next;
238+
239+ if (parallel_sync_enabled) {
In commit “index: implement index parallel sync” (bc0e5211e8a914e80e68e9c700b846a4cc3ef95b)
Would be really helpful to have a comment saying how this works at a high level. Would suggest something like “If parallel sync is enabled, use WorkersCount()+1 threads (including the current thread) to each process block ranges of up to m_tasks_per_worker blocks. The blocks in each range are processed in sequence by calling the index’s CustomProcessBlock method which returns std::any values that are collected into vectors. As the threads finish their work, the std::any values are processed in order by calling the index’s CustomPostProcessBlocks method, and the process repeats until no blocks are remaining to be processed and post-processed.”
I guess at a high level this seems reasonable, but perhaps too rigid. Like if there are 3 threads processing block ranges 1-10, 11-20, 21-30, and the first 2 threads finish while the third thread is slow. Why should the loop need to wait for the third thread before beginning to process blocks 31-50 and there are two idle threads doing nothing?
It seems like this idleness could be avoided by moving the CustomPostProcessBlocks calls into the worker threads. So that whenever each worker thread finishes processing blocks, it then opportunistically calls CustomPostProcessBlocks to post-process any blocks that are available (given the ordering constraint for post-processing). This way all the worker threads would continuously have work to do, and I suspect the resulting code might be simpler too since there would just be a single phase of work, not alternating Processing/PostProcessing phases.
Would be really helpful to have a comment saying how this works at a high level. Would suggest something like “If parallel sync is enabled, use WorkersCount()+1 threads (including the current thread) to each process block ranges of up to m_tasks_per_worker blocks. The blocks in each range are processed in sequence by calling the index’s CustomProcessBlock method which returns std::any values that are collected into vectors. As the threads finish their work, the std::any values are processed in order by calling the index’s CustomPostProcessBlocks method, and the process repeats until no blocks are remaining to be processed and post-processed.”
Sure done!
I guess at a high level this seems reasonable, but perhaps too rigid. Like if there are 3 threads processing block ranges 1-10, 11-20, 21-30, and the first 2 threads finish while the third thread is slow. Why should the loop need to wait for the third thread before beginning to process blocks 31-50 and there are two idle threads doing nothing?
It seems like this idleness could be avoided by moving the CustomPostProcessBlocks calls into the worker threads. So that whenever each worker thread finishes processing blocks, it then opportunistically calls CustomPostProcessBlocks to post-process any blocks that are available (given the ordering constraint for post-processing). This way all the worker threads would continuously have work to do, and I suspect the resulting code might be simpler too since there would just be a single phase of work, not alternating Processing/PostProcessing phases.
Spent a few days implementing this suggestion. It started out small but turned into a larger change than I initially expected. That said, I liked the direction and felt it was worth the extra effort. The process runs faster now.
Let me know what you think.
Design-wise, I kept everything within the Sync method, but could also encapsulate it into a separate class if preferred.
ryanofsky approved
ryanofsky
commented at 9:50 pm on April 9, 2025:
contributor
Approach ACK 349b09983d994cb46faeed12b123ae2269c6c516 and I reviewed most of the code. Seems like a nice design and good approach. Plan to finish reviewing later.
furszy force-pushed
on Jun 5, 2025
furszy
commented at 8:02 pm on June 5, 2025:
member
Thanks for the review, andrewtoth and ryanofsky!
Addressed most of the suggestions, but not all yet. Will finish the rest soon.
DrahtBot removed the label
Needs rebase
on Jun 5, 2025
DrahtBot added the label
CI failed
on Jun 5, 2025
DrahtBot
commented at 10:09 pm on June 5, 2025:
contributor
🚧 At least one of the CI tasks failed.
Task TSan, depends, gui: https://github.com/bitcoin/bitcoin/runs/43571812170
LLM reason (✨ experimental): The CI failure is caused by a segmentation fault occurring in CBlockIndex::GetBlockPos().
Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:
Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.
Leave a comment here, if you need help tracking down a confusing failure.
furszy force-pushed
on Jun 5, 2025
furszy force-pushed
on Jun 6, 2025
DrahtBot removed the label
CI failed
on Jun 6, 2025
furszy
commented at 9:04 pm on June 6, 2025:
member
Decoupled part of this work inside #32694 - combining it with part of #24230.
achow101 referenced this in commit
19765dca19
on Jun 12, 2025
DrahtBot added the label
Needs rebase
on Jun 12, 2025
furszy force-pushed
on Jun 13, 2025
DrahtBot removed the label
Needs rebase
on Jun 13, 2025
in
src/index/blockfilterindex.cpp:295
in
53bc8b9663outdated
writings -> writing [incorrect verb form in "Error writings filters"]
DrahtBot added the label
CI failed
on Jun 17, 2025
DrahtBot removed the label
CI failed
on Jun 17, 2025
DrahtBot added the label
CI failed
on Jun 20, 2025
DrahtBot
commented at 0:34 am on June 20, 2025:
contributor
🚧 At least one of the CI tasks failed.
Task tidy: https://github.com/bitcoin/bitcoin/runs/44013275482
LLM reason (✨ experimental): The CI failed due to compilation errors caused by an incorrect macro invocation in the source code.
Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:
Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.
Leave a comment here, if you need help tracking down a confusing failure.
furszy force-pushed
on Jun 20, 2025
DrahtBot removed the label
CI failed
on Jun 20, 2025
furszy force-pushed
on Jun 23, 2025
furszy force-pushed
on Jun 23, 2025
furszy
commented at 7:15 pm on June 23, 2025:
member
Updated based on the feedback. Thanks!
I believe I’ve addressed all the comments, but let me know if I missed anything.
furszy marked this as ready for review
on Jun 23, 2025
DrahtBot added the label
CI failed
on Jun 23, 2025
DrahtBot
commented at 8:34 pm on June 23, 2025:
contributor
🚧 At least one of the CI tasks failed.
Task TSan, depends, gui: https://github.com/bitcoin/bitcoin/runs/44630110161
LLM reason (✨ experimental): Data race detected in operator delete caused the failure of threadpool_tests.
Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:
Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.
Leave a comment here, if you need help tracking down a confusing failure.
furszy force-pushed
on Jun 23, 2025
furszy force-pushed
on Jun 24, 2025
furszy force-pushed
on Jun 24, 2025
furszy force-pushed
on Jun 24, 2025
in
src/index/base.cpp:249
in
a3e7d97b89outdated
249+// CustomPostProcessBlocks. This continues until all blocks have been fully processed and committed.
250+//
251+// Reorgs are detected and handled before syncing begins, ensuring the index starts aligned with the active chain.
252 void BaseIndex::Sync()
253 {
254+ if (m_synced) return; // we are sync, nothing to do
“// we are sync, nothing to do” → “// we are synced, nothing to do” [‘sync’ is incorrect here; use past participle ‘synced’]
“// Log progress every often” → “// Log progress every so often” [missing ‘so’ in the idiom]
“// Commit changes every often” → “// Commit changes every so often” [same idiomatic error]
DrahtBot removed the label
CI failed
on Jun 25, 2025
furszy force-pushed
on Jun 25, 2025
DrahtBot added the label
CI failed
on Jun 25, 2025
furszy force-pushed
on Jun 26, 2025
furszy force-pushed
on Jun 26, 2025
furszy force-pushed
on Jun 26, 2025
furszy force-pushed
on Jun 26, 2025
DrahtBot removed the label
CI failed
on Jun 26, 2025
in
src/index/base.h:144
in
f6b7da2493outdated
140@@ -130,6 +141,26 @@ class BaseIndex : public CValidationInterface
141 /// Update the internal best block index as well as the prune lock.
142 void SetBestBlockIndex(const CBlockIndex* block);
143
144+ /// If 'AllowParallelSync()' retrieves true, 'ProcessBlock()' will run concurrently in batches.
retrieves -> returns [‘retrieves true’ is awkward; standard terminology is ‘returns true’]
will be already be logged -> will already be logged [duplicate “be”]
achow101 referenced this in commit
528f79f010
on Jul 8, 2025
DrahtBot added the label
Needs rebase
on Jul 8, 2025
furszy force-pushed
on Jul 8, 2025
DrahtBot removed the label
Needs rebase
on Jul 8, 2025
DrahtBot added the label
Needs rebase
on Jul 14, 2025
Sjors
commented at 5:28 pm on July 14, 2025:
member
DrahtBot removed the label
Needs rebase
on Jul 14, 2025
DrahtBot added the label
CI failed
on Jul 15, 2025
DrahtBot
commented at 1:26 am on July 15, 2025:
contributor
🚧 At least one of the CI tasks failed.
Task TSan, depends, gui: https://github.com/bitcoin/bitcoin/runs/45962526520
LLM reason (✨ experimental): The CI failure is caused by a data race detected in operator delete, leading to a crash during threadpool_tests.
Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:
Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.
Leave a comment here, if you need help tracking down a confusing failure.
furszy force-pushed
on Jul 15, 2025
furszy force-pushed
on Jul 15, 2025
DrahtBot removed the label
CI failed
on Jul 15, 2025
Sjors
commented at 8:48 am on July 17, 2025:
member
furszy
commented at 1:48 pm on July 17, 2025:
member
I tried to use this to boost the silent payment indexer, but I don’t know what I’m doing :-) Sjors#96
@Sjors, see https://github.com/furszy/bitcoin-core/commits/2025_bip352_blind_fix.
Note: I only spent a few minutes with it and the tests seem to pass (I’m partially afk these days). Let me know how it goes and I can check it in detail next week.
The first commit there (413ce51bf10326a6c56bd4250f9a9f19fde44ed4) is merely a code improvement + cleanup, because you don’t need to re-read the undo data inside the child class anymore since #32694.
And the last commit (b8883fa1fd76fbf3c3d3c10f702bea24abb42dc9) enables it by overriding CustomProcessBlock() (same as it was implemented for the tx index parallelization e8add40fbfbda9955f9b1cf998732ca03787eb72).
in
src/util/threadpool.h:44
in
7d984c3b2doutdated
39+ std::function<void()> task;
40+ {
41+ // Wait for the task or until the stop flag is set
42+ m_cv.wait(wait_lock,[&]() EXCLUSIVE_LOCKS_REQUIRED(m_mutex) { return m_interrupt.load() || !m_work_queue.empty(); });
43+
44+ // If stopped, exit worker.
ismaelsadeeq
commented at 8:44 pm on July 21, 2025:
In “util: introduce general purpose thread pool” 7d984c3b2dca085ef7f49d21568b06c7b89c7807
Hmm I can imagine that this can be blocking when you want to stop instantly; but I guess there is a reason why you did not return here and then empty the work queue in Stop.
Hmm I can imagine that this can be blocking when you want to stop instantly; but I guess there is a reason why you did not return here and then empty the work queue in Stop.
Yes. We need to fulfill all promises so there are no dangling futures waiting for the worker to finish executing the task.
In the future, we could avoid this by tracking all the promises and triggering a “shutdown” exception during stop.
in
src/util/threadpool.h:49
in
7d984c3b2doutdated
44+ // If stopped, exit worker.
45+ if (m_interrupt && m_work_queue.empty()) {
46+ return;
47+ }
48+
49+ // Pop the task
ismaelsadeeq
commented at 8:45 pm on July 21, 2025:
In “util: introduce general purpose thread pool” 7d984c3b2dca085ef7f49d21568b06c7b89c7807
I think this and other verbose comments can be removed; they are quite obvious.
What might need comment is Submit due to the abstraction there.
518@@ -517,6 +519,7 @@ void SetupServerArgs(ArgsManager& argsman, bool can_listen_ipc)
519 strprintf("Maintain an index of compact filters by block (default: %s, values: %s).", DEFAULT_BLOCKFILTERINDEX, ListBlockFilterTypes()) +
520 " If <type> is not supplied or if <type> = 1, indexes for all known types are enabled.",
521 ArgsManager::ALLOW_ANY, OptionsCategory::OPTIONS);
522+ argsman.AddArg("-indexworkers=<n>", strprintf("Number of worker threads spawned for the indexes initial sync process (default: %d).", INDEX_WORKERS_COUNT), ArgsManager::ALLOW_ANY, OptionsCategory::OPTIONS);
ismaelsadeeq
commented at 8:56 pm on July 21, 2025:
In " init: provide thread pool to indexes " 82fa9d29653b445118fc2d03e2ced520a5d4c7dc
Define the max and then use it here and in checking the bounds
Since the maximum number of threads for indexes should depend on how many threads Core has at runtime and the number of available processors, I don’t think hardcoding it here is the best approach. I agree with adding it on the error side: #26966 (review)
Wonder if it should be specified that threadpool is shared among all indexers (that support multithreading)
Done as suggested.
in
src/index/base.h:32
in
82fa9d2965outdated
22 namespace interfaces {
23 class Chain;
24 } // namespace interfaces
25
26+/** Number of concurrent jobs during the initial sync process */
27+static constexpr int16_t INDEX_WORKERS_COUNT = 0;
ismaelsadeeq
commented at 8:56 pm on July 21, 2025:
In " init: provide thread pool to indexes " 82fa9d29653b445118fc2d03e2ced520a5d4c7dc
Parallel sync is disabled by default. We’re currently not tracking the number of threads spawned by Core, so I chose not to make assumptions here (don’t want indexes threads competing with net/validation ones). It’s safer to let users specify the appropriate number for their setup. In the future, we could improve this by adding thread tracking object/mechanism and picking up the best number for their machine.
ismaelsadeeq
commented at 9:54 pm on July 22, 2025:
Makes sense. I think we can make the bound not just an arbitrary number but tie it to the number of cores on the machine.
That will prevent a footgun whereby users spawn more threads than the machine can handle, leading to degraded performance due to lots of context switching.
ismaelsadeeq
commented at 9:03 pm on July 21, 2025:
member
Concept ACK, did a quick pass through this.
Will do in-depth review soon
Now the juicy part:
What is the step to reproduce your result?
Side note to self could be useful to try benchmarking this using benchkit so that anyone can reproduce using the yaml config?
furszy
commented at 5:58 pm on July 23, 2025:
member
What is the step to reproduce your result?
Sync your node without any index.
Restart the node with block filter or txindex enabled and let it run (you could also set -connect=0 to sync only the index, without running the net/validation threads. Since threads won’t be competing for cs_main, this will give you a more accurate result).
You’ll see a “[index name] is enabled at height [height]” log entry once it finishes. Then it’s just a matter of subtracting the index startup time from the “index synced” log time.
Unfortunately, there’s no “stop at block” option like we have for chain sync, so this process is a bit more manual and has some variances depending on where/how you run it. But you will see an overall significant speedup anyway.
Note: I should update my results in the PR description. Those were compiled three years ago on a small VPS, and we’ve introduced many changes since then.
Why not just +1 in each task like you do in the next test block?
Hmm, good question. I was probably not only testing that all tasks were executed, but also that each of them was executed only once (if they were all doing the same, it would be hard to know if they were all executed). Or.. maybe I just wanted to mention Gauss somewhere in our code.
I did this one 3 years ago so.. it is hard to know. But it doesn’t hurt to have it.
in
src/test/threadpool_tests.cpp:94
in
d2138bef76outdated
88+
89+ for (auto& fut : futures) fut.wait();
90+ BOOST_CHECK_EQUAL(counter.load(), num_tasks);
91+
92+ blocker.set_value();
93+ for (auto& t : blocking_tasks) t.wait(); // ensure blocking tasks finish too
would there be any benefit here to incrementing the counter at the end of each blocking task and check that they executed properly when unblocked?
It’s subtle but we’re doing the same with the wait here. The blocking_tasks vector contains the futures whose promises are set only when the worker finishes executing the task (meaning it has run all its code).
in
src/test/threadpool_tests.cpp:202
in
d2138bef76outdated
170+ for (int i = 0; i < num_tasks; i++) {
171+ threadPool.Submit([&counter]() {
172+ counter.fetch_add(1);
173+ });
174+ }
175+ std::this_thread::sleep_for(std::chrono::milliseconds{100});
is this sleep to give tasks a chance to execute if the blocking breaks?
Good eye.
IIRC, I added it to wait until the workers actually get blocked. Otherwise, the thread pool queue size would be greater than the expected value (because the workers had not yet consumed the blocking tasks).
Still, we could remove this by adding some ready_promises, as done in test 2. A bit more code, but less fragile.
in
src/test/threadpool_tests.cpp:209
in
d2138bef76outdated
177+
178+ // Now process manually
179+ for (int i = 0; i < num_tasks; i++) {
180+ threadPool.ProcessTask();
181+ }
182+ BOOST_CHECK_EQUAL(counter.load(), num_tasks);
Could anything be gained by checking the counter after each ProcessTask()?
I don’t think so. We could maybe improve it by making the task return something and checking the futures’ promises, but I’m not totally convinced it will add much value.
in
src/init.cpp:2170
in
b360102619outdated
2164@@ -2160,7 +2165,19 @@ bool StartIndexBackgroundSync(NodeContext& node)
2165 }
2166 }
2167
2168+ if (node.args->IsArgSet("-indexworkers")) {
2169+ int index_workers = node.args->GetIntArg("-indexworkers", INDEX_WORKERS_COUNT);
2170+ if (index_workers < 0 || index_workers > MAX_INDEX_WORKERS_COUNT) return InitError(Untranslated(strprintf("Invalid -indexworkers arg. Must be a number in-between 1 and %d", MAX_INDEX_WORKERS_COUNT)));
What’s the benefit of defining this as a lambda instead of just moving the code inside func_worker? It doesn’t seem to be called anywhere else …?
It was like that at first, but I found it harder to follow and wanted to simplify func_worker as much as possible.
You can try moving it back in and you will surely see what I’m referring to.
in
src/index/base.h:147
in
76aa1c2da9outdated
142@@ -138,6 +143,26 @@ class BaseIndex : public CValidationInterface
143 /// Update the internal best block index as well as the prune lock.
144 void SetBestBlockIndex(const CBlockIndex* block);
145
146+ /// If 'AllowParallelSync()' returns true, 'ProcessBlock()' will run concurrently in batches.
147+ /// The 'std::any' result will be passed to 'PostProcessBlocks()' so the index can process
The corresponding line in the serial/legacy processing path asserts the block data (I don’t care much about that) but I do think you should keep the variable for the filter type if ever one day a new filter index is added:
142@@ -138,6 +143,26 @@ class BaseIndex : public CValidationInterface
143 /// Update the internal best block index as well as the prune lock.
144 void SetBestBlockIndex(const CBlockIndex* block);
145
146+ /// If 'AllowParallelSync()' returns true, 'ProcessBlock()' will run concurrently in batches.
147+ /// The 'std::any' result will be passed to 'PostProcessBlocks()' so the index can process
148+ /// async result batches in a synchronous fashion (if required).
149+ [[nodiscard]] virtual std::any CustomProcessBlock(const interfaces::BlockInfo& block_info) {
150+ // If parallel sync is enabled, the child class must implement this method.
151+ if (AllowParallelSync()) return std::any();
I wonder if this condition should be a bit more attention-getting, since it should never execute right? Either log something or Assume() for the benefit of future index developers?
I wonder if this condition should be a bit more attention-getting, since it should never execute right? Either log something or Assume() for the benefit of future index developers?
Hmm, or we could remove this line and call CustomAppend by default. Then it would be up to the child index to decide whether it needs sequential ordering during parallel sync or not. This way, the tx and BIP352 indexes would require fewer lines of code.
in
src/util/threadpool.h:76
in
d2138bef76outdated
71+ if (!m_workers.empty()) throw std::runtime_error("Thread pool already started");
72+ m_interrupt.store(false); // Reset
73+
74+ // Create workers
75+ for (int i = 0; i < num_workers; i++) {
76+ m_workers.emplace_back(&util::TraceThread, "threadpool_worker_" + util::ToString(i), [this] { WorkerThread(); });
I’d like to see ThreadPool reused in the codebase (for example, as http workers) which makes me think the class should also have a custom name property for logging and process monitoring. (e.g. index_worker_thread_1 and http_worker_thread_1)
Built and tested on macos/arm64 as well as Debian/x86. Left several questions and suggestions. I really like ThreadPool as a util and look forward to using it for http workers as well, to share the code.
I’m currently rebuilding block filter and tx indexes with this branch to compare against master, which after 48 hours was only about 70% complete…
DrahtBot requested review from w0xlt
on Jul 25, 2025
DrahtBot requested review from TheCharlatan
on Jul 25, 2025
DrahtBot requested review from ryanofsky
on Jul 25, 2025
DrahtBot requested review from ismaelsadeeq
on Jul 25, 2025
furszy force-pushed
on Jul 25, 2025
pinheadmz
commented at 11:01 am on July 27, 2025:
member
Did a rough benchmark test. Froze a full node at height 906551 by restarting with -noconnect and also -txindex -blockfilterindex. First test was from master, after 48 hours I aborted the process after reaching only:
This is bike shedding now so ok to ignore – but when we set thread names on Linux they are truncated to 15 characters. So I dunno I guess “thread” and “worker” are redundant in the name of a thread?
So I dunno I guess “thread” and “worker” are redundant in the name of a thread?
Sure, done. Now the previous “indexes_threadpool_worker_[num]” will be “indexes_pool_[num]”.
pinheadmz approved
pinheadmz
commented at 2:53 pm on July 29, 2025:
member
re-ACK 9bba5ef0cc5b6890eba2be3a6ed429c4fea5a28f
Changes since last review are minimal responses to my own review suggestions. Built on macos/arm64 and debian/x86. Ran functional and unit tests, ran with -indexworkers=16 on mainnet fullnode with >900000 blocks
DashCoreAutoGuix referenced this in commit
55c5a2a348
on Jul 31, 2025
furszy force-pushed
on Aug 7, 2025
DrahtBot added the label
CI failed
on Aug 8, 2025
DrahtBot
commented at 0:19 am on August 8, 2025:
contributor
🚧 At least one of the CI tasks failed.
Task TSan, depends, no gui: https://github.com/bitcoin/bitcoin/runs/47637510992
LLM reason (✨ experimental): The CI failure is due to the threadpool_tests timing out.
Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:
Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.
Leave a comment here, if you need help tracking down a confusing failure.
furszy force-pushed
on Aug 8, 2025
furszy force-pushed
on Aug 8, 2025
DrahtBot removed the label
CI failed
on Aug 8, 2025
DashCoreAutoGuix referenced this in commit
3ee82f3b2b
on Aug 9, 2025
ismaelsadeeq
commented at 11:19 am on August 12, 2025:
member
I ran the node on mainnet with -noconnect -txindex -blockfilterindex -indexworkers=4
Although I was not able to later reproduce the crash, it seems to happen intermittently. cc @furszy
furszy force-pushed
on Aug 12, 2025
furszy
commented at 8:34 pm on August 12, 2025:
member
Although I was not able to later reproduce the crash it seems to be happen intermittently
Thanks for the report! Check it now if you can. There was a very subtle bug where the worker threads might have accessed an index Sync() local variable after its destruction.
DrahtBot added the label
Needs rebase
on Aug 19, 2025
furszy force-pushed
on Aug 20, 2025
DrahtBot added the label
CI failed
on Aug 20, 2025
DrahtBot
commented at 11:26 am on August 20, 2025:
contributor
🚧 At least one of the CI tasks failed.
Task MSan, depends: https://github.com/bitcoin/bitcoin/runs/48475050585
LLM reason (✨ experimental): The CI failure is caused by a test timeout due to the index not reaching the expected height within the allowed time.
Try to run the tests locally, according to the documentation. However, a CI failure may still
happen due to a number of reasons, for example:
Possibly due to a silent merge conflict (the changes in this pull request being
incompatible with the current code in the target branch). If so, make sure to rebase on the latest
commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the
affected test.
An intermittent issue.
Leave a comment here, if you need help tracking down a confusing failure.
DrahtBot removed the label
Needs rebase
on Aug 20, 2025
furszy force-pushed
on Aug 30, 2025
furszy force-pushed
on Aug 30, 2025
DrahtBot removed the label
CI failed
on Aug 30, 2025
furszy force-pushed
on Sep 1, 2025
furszy force-pushed
on Sep 17, 2025
furszy force-pushed
on Sep 17, 2025
furszy force-pushed
on Sep 18, 2025
in
src/util/threadpool.h:160
in
9dbea51ab5outdated
andrewtoth
commented at 5:09 pm on October 6, 2025:
If we wait() or get() on the returned future, does that imply a release/acquire memory ordering or a relaxed memory ordering? I can’t seem to find out what this means for other non-atomic memory that was written on the worker thread before the task is completed.
If we wait() or get() on the returned future, does that imply a release/acquire memory ordering or a relaxed memory ordering? I can’t seem to find out what this means for other non-atomic memory that was written on the worker thread before the task is completed.
For the returned value, wait()/get() provide release/acquire semantics. All updates performed by the task should be visible after get() returns.
Now, if the task modifies other objects that might be accessed concurrently by threads that don’t wait on the future, I’m pretty sure you need to explicitly synchronize access to those objects.
Maybe show an example of how you’re using it, and we could reason about it.
andrewtoth
commented at 7:10 pm on October 6, 2025:
andrewtoth
commented at 7:22 pm on October 6, 2025:
All updates performed by the task should be visible after get() returns.
Hmm so in that case, then the diff I linked above is correct? The main thread is the one waiting on the future, and the non-synchronized vector is modified only by the worker thread. So the change to the vector should be visible on the main thread.
For the returned value, wait()/get() provide release/acquire semantics.
So you mean not for the returned value, but for any thread that calls get()/wait()? Having the return value be visible is sufficient with just a relaxed memory ordering.
So I’m confused whether other memory modified by the thread will be visible (release/acquire), or just the returned value (relaxed).
Sure, so I tried updating #31132 to use the thread pool here. It makes the change simpler if we were to already have this thread pool merged. However, the threads write to non-synchronized shared vectors, which the main thread will read, so we need to make sure the writes happen before the reads. I initially stored the futures and waited on them like this: andrewtoth@c256f1b#diff-3eff97d24109f473292b38a6495c5b51e8bda9f4bff9b691cf255968ba80d85dR151-R164. Since there is no happens-before guarantee though, I think we still need a completion semaphore that each thread releases instead.
Have you tried something like (haven’t tried it yet, will give it a run tomorrow):
```diff
diff --git a/src/inputfetcher.h b/src/inputfetcher.h
--- a/src/inputfetcher.h	(revision c256f1b457cbd5b900aa34703eb5853d2449bcde)
+++ b/src/inputfetcher.h	(date 1759781954442)
@@ -43,15 +43,6 @@
      */
     std::atomic<size_t> m_input_counter{0};
 
-    /**
-     * The vector of vectors of outpoint:coin pairs.
-     * Each thread writes the coins it fetches to the vector at its thread
-     * index. This way multiple threads can write concurrently to different
-     * vectors in a thread safe way. After all threads are finished, the main
-     * thread can loop through all vectors and write the coins to the cache.
-     */
-    std::vector<std::vector<std::pair<COutPoint, Coin>>> m_coins{};
-
     /**
      * The set of txids of all txs in the block being fetched.
      * This is used to filter out inputs that are created in the block,
@@ -69,15 +60,15 @@
     const size_t m_worker_thread_count;
     ThreadPool m_thread_pool{"inputfetcher"};
 
-    void Work(size_t thread_index) noexcept
+    std::vector<std::pair<COutPoint, Coin>> Work(size_t thread_index) noexcept
     {
         const auto inputs_count{m_inputs.size()};
-        auto& coins{m_coins[thread_index]};
+        std::vector<std::pair<COutPoint, Coin>> coins;
         try {
             while (true) {
                 const auto input_index{m_input_counter.fetch_add(1, std::memory_order_relaxed)};
                 if (input_index >= inputs_count) {
-                    return;
+                    return {};
                 }
                 const auto [tx_index, vin_index] = m_inputs[input_index];
                 const auto& outpoint{m_block->vtx[tx_index]->vin[vin_index].prevout};
@@ -96,7 +87,7 @@
                     // Missing an input. This block will fail validation.
                     // Skip remaining inputs.
                     m_input_counter.store(inputs_count, std::memory_order_relaxed);
-                    return;
+                    return {};
                 }
             }
         } catch (const std::runtime_error&) {
@@ -104,6 +95,8 @@
             // Skip remaining inputs.
             m_input_counter.store(inputs_count, std::memory_order_relaxed);
         }
+
+        return coins;
     }
 
 public:
@@ -116,10 +109,6 @@
             return;
         }
         m_thread_pool.Start(worker_thread_count);
-        m_coins.reserve(worker_thread_count + 1);
-        for (size_t n{0}; n < worker_thread_count + 1; ++n) {
-            m_coins.emplace_back();
-        }
     }
 
     //! Fetch all block inputs from db, and insert into cache.
@@ -148,25 +137,30 @@
 
         // Set the input counter and wake threads.
         m_input_counter.store(0, std::memory_order_relaxed);
-        std::vector<std::future<void>> futures;
+        std::vector<std::future<std::vector<std::pair<COutPoint, Coin>>>> futures;
         futures.reserve(m_worker_thread_count);
         for (size_t n{0}; n < m_worker_thread_count; ++n) {
             futures.emplace_back(m_thread_pool.Submit([this, n]() {
-                Work(n);
+                return Work(n);
             }));
         }
 
         // Have the main thread work too while we wait for other threads
-        Work(m_worker_thread_count);
+        std::vector<std::vector<std::pair<COutPoint, Coin>>> coins;
+        coins.reserve(m_worker_thread_count + 1);
+        coins[m_worker_thread_count] = Work(m_worker_thread_count);
 
         // Wait for all worker threads to complete
-        for (const auto& future : futures) {
-            future.wait();
+        for (size_t i = 0; i < futures.size(); i++) {
+            coins[i] = futures[i].get();
         }
 
         // At this point all threads are done writing to m_coins, so we can
        // safely read from it and insert the fetched coins into the cache.
-        for (auto& thread_coins : m_coins) {
+        for (auto& thread_coins : coins) {
+            if (thread_coins.empty()) {
+                // TODO: report failure..
+            }
             for (auto&& [outpoint, coin] : thread_coins) {
                 cache.EmplaceCoinInternalDANGER(std::move(outpoint),
                                                 std::move(coin),
```
andrewtoth
commented at 8:24 pm on October 6, 2025:
It went well, thanks! 0030dc5
Sorry for hijacking the PR :)
hehe np, I owe you a few reviews anyway.
in src/util/threadpool.h:43 in 0afb05cad2 (outdated)
38+ WAIT_LOCK(m_mutex, wait_lock);
39+ for (;;) {
40+ std::function<void()> task;
41+ {
42+ // Wait only if needed; avoid sleeping when a new task was submitted while we were processing another one.
43+ if (!m_interrupt.load() && m_work_queue.empty()) {
andrewtoth
commented at 4:11 pm on October 7, 2025:
Any reason we can’t use memory_order_relaxed on loads and stores of m_interrupt?
I recall briefly thinking about it, but going down that path would mean guarding all m_interrupt accesses with m_mutex too. And that seemed like an unnecessary overhead for methods like Submit that should be performing a lightweight interruption check before submission.
andrewtoth
commented at 8:28 pm on October 7, 2025:
going down that path would mean guarding all m_interrupt accesses with m_mutex too
Hmm I don’t see why we would need to do that? If we wanted to make m_interrupt a non-atomic bool we would need to do that. But if it’s atomic we can use relaxed since it’s just a flag and not acting as a fence to any other non-atomic memory?
Hmm, but what about the visibility of the flag update?
If all m_interrupt loads were relaxed, Submit() could read a stale false value even after Stop() set it to true, right?
This would allow a task to be enqueued during shutdown, leaving a lingering task that never gets executed, which would hang the caller on the future’s get()/wait() call forever and stall the shutdown procedure.
andrewtoth
commented at 9:16 pm on October 7, 2025:
I don’t see how memory ordering solves that? Consider this exact line where the Submit thread is suspended:
```cpp
auto Submit(T task) -> std::future<decltype(task())>
{
    if (m_workers.empty() || m_interrupt.load()) throw std::runtime_error("No active workers; cannot accept new tasks");
    // ---> Stop() is called right here on another thread, setting m_interrupt to true,
    //      and all threads exit before this thread continues execution
    using TaskType = std::packaged_task<decltype(task())()>;
```
I think we need to say instead that all public methods must be called on the same thread. That way we can use relaxed ordering, since Submit and Stop cannot be called by different threads so this won’t happen.
andrewtoth
commented at 10:40 pm on October 7, 2025:
If all m_interrupt loads were relaxed, Submit() could read a stale false value even after Stop() set it to true, right?
I don’t think it could? Once a write to true becomes visible to a thread, it cannot be read again as false until something writes false to it again. This is independent of memory ordering. Relaxed memory ordering can reorder when writes to different data become visible to a thread. For instance:
```cpp
atomic<bool> x{false}, y{false};

// thread 1 stores true to x first then y
x.store(true, relaxed);
y.store(true, relaxed);

// thread 2
y.load(relaxed); // Could be true
x.load(relaxed); // Could still be false even if y above is already visible as true
y.load(relaxed); // Cannot be false again after already reading true above
```
Of course this is only relevant if there are other threads that will be reading and writing. Since the same thread will be calling Stop, Start, and Submit, there is no need to be concerned with this.
The only reason m_interrupt needs to be atomic is because it is also being read from worker threads, and those are reading it while synchronized with m_mutex.
andrewtoth
commented at 11:28 pm on October 7, 2025:
Thinking about this more, since m_workers is not synchronized it implies Stop and Submit must be called on the same thread. Therefore m_workers.empty() is sufficient to test if the pool is stopped since Stop() must complete before a subsequent Submit(). So m_interrupt is redundant in if (m_workers.empty() || m_interrupt.load()), and m_interrupt can just be made a regular bool guarded by m_mutex. Start would have to set it with the mutex locked, but that is infrequent compared to all the loads done in Submit and WorkerThread.
Nice thread of thoughts! Will try to go over the comments.
Regarding Submit being called only from the initial controller thread:
Right now, we are actually submitting tasks from threads that are not the initial one. Each index starts its own “background sync” thread that is responsible for computing the tasks submitted to the thread pool.
I don’t think it’s reasonable to expect a generic thread pool to only accept submissions from a single thread. We must be able to submit tasks from any thread. I’ve adapted the code to reflect this.
Regarding the Start and Stop restriction to the initial controller thread:
Agree. In fact, calling Stop from a worker thread would actually deadlock. I’ve added documentation clearly expressing this requirement.
Could extend this and check the thread id during shutdown too, but it seemed like overkill.
Regarding Submit after shutdown:
Yeah, good eye there. I think the simplest solution is to move the interruption check inside the mutex guard, just before enqueuing the task. This is not really going to happen in practice anyway, and even if it does, the extra overhead of creating and destroying the wrapper is minimal since the app will be shutting down.
Thanks for the thread of thoughts! I have updated the class to reflect all of this.
andrewtoth
commented at 3:24 pm on October 8, 2025:
Ok, but now we also need std::vector<std::thread> m_workers; guarded by m_mutex. Since it can be written by Stop and also read in Submit. So it also needs to be locked at the end of Stop and in WorkersCount.
True. Pushed. Thanks!
in src/init.cpp:536 in 99a4d06154 (outdated)
@@ -530,6 +533,7 @@ void SetupServerArgs(ArgsManager& argsman, bool can_listen_ipc)
533 strprintf("Maintain an index of compact filters by block (default: %s, values: %s).", DEFAULT_BLOCKFILTERINDEX, ListBlockFilterTypes()) +
534 " If <type> is not supplied or if <type> = 1, indexes for all known types are enabled.",
535 ArgsManager::ALLOW_ANY, OptionsCategory::OPTIONS);
536+ argsman.AddArg("-indexworkers=<n>", strprintf("Number of worker threads spawned for the initial index synchronization (default: %d). These threads are shared across all indexes", INDEX_WORKERS_COUNT), ArgsManager::ALLOW_ANY, OptionsCategory::OPTIONS);
mzumsande
commented at 8:17 pm on October 7, 2025:
I tested this for signet on two computers, and while parallel sync (5 threads) was a great speedup on an SSD, I also observed a slowdown on an HDD compared to master (by a factor of 2).
Presumably that’s because reading the blocks from disk is the main bottleneck on an HDD, and with parallel indexing there is a lot of jumping back and forth, increasing seek time.
Should it be mentioned in the -indexworkers help that it is not advisable to use this option on a HDD?
andrewtoth
commented at 11:50 am on October 8, 2025:
This would require m_workers to be synchronized somehow, since it is written in Stop and Start but read in Submit.
Edit: actually this will work, since m_workers won’t be written until after the signal inside the last task. But, if Stop was called manually before waiting for the signal it could cause a deadlock.
Is there value in making the ThreadPool safe for recursive task submission? We might not be able to implement certain divide and conquer algorithms, but we might not need to.
andrewtoth
commented at 1:23 pm on October 8, 2025:
I don’t think so. In that case, should your test have some warnings that this can be unsafe if the thread pool is stopped or destroyed before all calls to Submit have returned?
We can modify the test to make the parent task wait for the child task to complete before returning; Stop waits for all workers to complete before clearing the workers vector. That is, of course, if @furszy determines that there is any point to a recursive test at all, since we are not designing for that.
Yeah, I found the test worthwhile. Taken. Thanks for sharing! I had something similar on the fuzz test.
Allowing task submission from any thread should be a must for me. It otherwise defeats the purpose of having a shared thread pool if you can only submit tasks from a single spot.
This also pushed me further. Added another test for the case where the pool is about to shut down while workers are all busy and a thread is concurrently submitting a new task.
in src/util/threadpool.h:76 in 0afb05cad2 (outdated)
48+ // If stopped and no work left, exit worker
49+ if (m_interrupt.load() && m_work_queue.empty()) {
50+ return;
51+ }
52+
53+ task = std::move(m_work_queue.front());
Is it not beneficial for a generic ThreadPool to support assigning tasks in batches to threads? I expect this to be useful when submitting many small tasks.
I don’t think that’s the thread pool’s responsibility. Generic thread pools are designed to execute tasks efficiently, but they don’t dictate their granularity (mainly because they don’t know the content of the task). If batching is beneficial, the caller should handle it before submission (that’s actually what I’m doing here for the indexes).
furszy force-pushed
on Oct 8, 2025
furszy force-pushed
on Oct 8, 2025
furszy force-pushed
on Oct 8, 2025
in src/util/threadpool.h:124 in e780f6fdf3 (outdated)
119+ */
120+ void Stop() EXCLUSIVE_LOCKS_REQUIRED(!m_mutex)
121+ {
122+ std::vector<std::thread> threads_to_join;
123+ // Notify workers and join them.
124+ // Note: even when m_interrupt is atomic, it must be modified while holding the same mutex
andrewtoth
commented at 3:57 pm on October 8, 2025:
Oops, updated. Thanks.
Also moved the condition variable comment above the member declaration (this caused a nasty bug that we shouldn’t forget).
furszy force-pushed
on Oct 8, 2025
in src/util/threadpool.h:154 in 3c4c3cbe09 (outdated)
146+ using TaskType = std::packaged_task<decltype(task())()>;
147+ auto ptr_task = std::make_shared<TaskType>(std::move(task));
148+ std::future<decltype(task())> future = ptr_task->get_future();
149+ {
150+ LOCK(m_mutex);
151+ if (m_workers.empty() || m_interrupt) {
andrewtoth
commented at 4:07 pm on October 8, 2025:
These 2 checks are redundant since they are both set atomically. Should just pick one for clarity.
andrewtoth
commented at 4:10 pm on October 8, 2025:
Actually, we must use the worker count check, since we can Start with 0 worker threads but still set interrupt to false, so this would deadlock with only the interrupt check.
Maybe guard against starting with 0 threads and then pick either?
I’m tempted to leave it as is, mainly because we might want to add Interrupt() and Resume() methods in the future that don’t kill the worker threads. We might want to clear the queue due to an early failure in one of our tasks without having to recreate the threads.
furszy force-pushed
on Oct 8, 2025
furszy force-pushed
on Oct 8, 2025
furszy force-pushed
on Oct 8, 2025
furszy force-pushed
on Oct 8, 2025
furszy force-pushed
on Oct 9, 2025
furszy force-pushed
on Oct 9, 2025
furszy
commented at 6:41 pm on October 9, 2025:
member
Updated with a fuzz test case from @TheCharlatan. Thanks!
brunoerg
commented at 5:13 pm on October 10, 2025:
contributor
mzumsande
commented at 7:22 pm on October 10, 2025:
I don’t understand the need for the sanity cleanup. Shouldn’t the logic from WorkerThread guarantee that m_work_queue is empty at this point, so that we could just assert that here?
Yeah, right now it is not really necessary. Have explained the context and the rationale here: #26966 (review)
in src/test/threadpool_tests.cpp:52 in 3943630a4c (outdated)
46+}
47+
48+BOOST_AUTO_TEST_CASE(threadpool_basic)
49+{
50+ // Test Cases
51+ // 1) Submit tasks and verify completion.
mzumsande
commented at 8:33 pm on October 10, 2025:
in src/util/threadpool.h:167 in 3943630a4c (outdated)
169+
170+ /**
171+ * @brief Execute a single queued task synchronously.
172+ * Removes one task from the queue and executes it on the calling thread.
173+ */
174+ void ProcessTask() EXCLUSIVE_LOCKS_REQUIRED(!m_mutex)
mzumsande
commented at 8:45 pm on October 10, 2025:
Is this meant to be used by tests only, or are there possible other use cases? If it’s the latter, it might be helpful to return a bool that indicates whether a task was executed or not (empty queue).
@@ -2252,7 +2259,19 @@ bool StartIndexBackgroundSync(NodeContext& node)
2259         }
2260     }
2261
2262+    if (node.args->IsArgSet("-indexworkers")) {
2263+        int index_workers = node.args->GetIntArg("-indexworkers", INDEX_WORKERS_COUNT);
2264+        if (index_workers < 0 || index_workers > MAX_INDEX_WORKERS_COUNT) return InitError(Untranslated(strprintf("Invalid -indexworkers arg. Must be a number between 0 and %d", MAX_INDEX_WORKERS_COUNT)));
mzumsande
commented at 9:00 pm on October 10, 2025:
-indexworkers=0 is allowed according to the doc, but it will trigger the assert num_workers > 0 in the ThreadPool. Shouldn’t create a ThreadPool in this case.
in src/util/threadpool.h:154 in 3943630a4c (outdated)
149+ */
150+ template<class T> EXCLUSIVE_LOCKS_REQUIRED(!m_mutex)
151+ auto Submit(T task) -> std::future<decltype(task())>
152+ {
153+ using TaskType = std::packaged_task<decltype(task())()>;
154+ auto ptr_task = std::make_shared<TaskType>(std::move(task));
ryanofsky
commented at 9:08 pm on October 10, 2025:
In commit “util: introduce general purpose thread pool” (3943630a4cea56b040377b33476803745055510c)
Would be nice to avoid shared_ptr here. Apparently it is needed because std::function objects are required to be copyable and packaged tasks aren’t copyable. Could fix this by avoiding std::function which would also simplify the Submit() function:
```diff
--- a/src/util/threadpool.h
+++ b/src/util/threadpool.h
@@ -47,7 +47,7 @@ class ThreadPool {
 private:
     std::string m_name;
     Mutex m_mutex;
-    std::queue<std::function<void()>> m_work_queue GUARDED_BY(m_mutex);
+    std::queue<std::packaged_task<void()>> m_work_queue GUARDED_BY(m_mutex);
     std::condition_variable m_cv;
     // Note: m_interrupt must be modified while holding the same mutex used by threads waiting on the condition variable.
     // This ensures threads blocked on m_cv reliably observe the change and proceed correctly without missing signals.
@@ -59,7 +59,7 @@
     {
         WAIT_LOCK(m_mutex, wait_lock);
         for (;;) {
-            std::function<void()> task;
+            std::packaged_task<void()> task;
             {
                 // Wait only if needed; avoid sleeping when a new task was submitted while we were processing another one.
                 if (!m_interrupt && m_work_queue.empty()) {
@@ -135,7 +135,7 @@
         {
             // Sanity cleanup: release any std::function captured shared_ptrs
             LOCK(m_mutex);
-            std::queue<std::function<void()>> empty;
+            std::queue<std::packaged_task<void()>> empty;
             m_work_queue.swap(empty);
         }
         // Note: m_interrupt is left true until next Start()
@@ -147,21 +147,17 @@
      * Enqueues a callable to be executed by one of the worker threads.
      * Returns a `std::future` that can be used to retrieve the task’s result.
      */
-    template<class T> EXCLUSIVE_LOCKS_REQUIRED(!m_mutex)
-    auto Submit(T task) -> std::future<decltype(task())>
+    template<class F> EXCLUSIVE_LOCKS_REQUIRED(!m_mutex)
+    auto Submit(F&& fn)
     {
-        using TaskType = std::packaged_task<decltype(task())()>;
-        auto ptr_task = std::make_shared<TaskType>(std::move(task));
-        std::future<decltype(task())> future = ptr_task->get_future();
+        std::packaged_task task{std::forward<F>(fn)};
+        auto future{task.get_future()};
         {
             LOCK(m_mutex);
             if (m_workers.empty() || m_interrupt) {
                 throw std::runtime_error("No active workers; cannot accept new tasks");
             }
-            m_work_queue.emplace([ptr_task]() mutable {
-                (*ptr_task)();
-                ptr_task.reset(); // Explicitly release packaged_task and the stored function obj.
-            });
+            m_work_queue.emplace(std::move(task));
         }
         m_cv.notify_one();
         return future;
@@ -173,7 +169,7 @@
      */
     void ProcessTask() EXCLUSIVE_LOCKS_REQUIRED(!m_mutex)
     {
-        std::function<void()> task;
+        std::packaged_task<void()> task;
         {
             LOCK(m_mutex);
             if (m_work_queue.empty()) return;
```
ha, that’s awesome. The queue’s std::packaged_task<void()> change broke my mind at first, because the task obviously has a different return value..
I see the trick now: the packaged_task<R()> is wrapped into another packaged_task<void()>, which forwards the call internally. That’s elegant.
in src/util/threadpool.h:139 in 3943630a4c (outdated)
ryanofsky
commented at 9:29 pm on October 10, 2025:
In commit “util: introduce general purpose thread pool” (3943630a4cea56b040377b33476803745055510c)
I’m a little unclear what this swap is trying to do. I would expect the point of swapping into a temporary vector would be to destroy the callables without holding m_mutex (since they could own resources and take time to destroy). But it looks like the empty vector is destroyed while m_mutex is still locked, so that isn’t happening. Also I don’t understand why m_mutex is locked twice in this function with a notify in between. I would expect it to look more like:
I’m a little unclear what this swap is trying to do.
Also I don’t understand why m_mutex is locked twice in this function with a notify in between
The idea behind the swap was more about ensuring the queue is empty after joining the threads, mainly to avoid any lingering future that would block callers indefinitely. That’s why I used the “sanity” wording there.
I did it after joining the threads because we’re currently waiting for all pending tasks to complete before stopping the worker loop. If we cleaned the queue prior to joining, all futures would throw a future_error - which could be fine too, but we’d probably want to throw something different, like a custom TaskInterruptedException, and adapt the submitter code to handle possible interruptions.
A bit more context, I added this sanity swap after finding #26966 (review). But it is not really necessary anymore with the latest changes. Could also just drop it.
I would expect the point of swapping into a temporary vector would be to destroy the callables without holding m_mutex (since they could own resources and take time to destroy)
That’s an oversight, yeah. Thanks!
mzumsande
commented at 9:35 pm on October 10, 2025:
contributor
Reviewed only the first two commits so far.
It could make sense to introduce the threadpool in an extra PR first, since it is very generic and can be used for multiple things, in case there are some reviewers who don’t feel comfortable ACKing the pretty involved index changes but would want to ACK the threadpool changes. (Just a suggestion, not necessary for my sake, since I happen to know the index code.)
in src/test/threadpool_tests.cpp:12 in 3943630a4c (outdated)
nit: since it’s strongly typed we don’t need to include the type name in the variable anymore
in src/util/threadpool.h:46 in 3943630a4c (outdated)
40+ *
41+ * - `Stop()` prevents further task submission and wakes all worker threads.
42+ * Workers finish processing all remaining queued tasks before exiting,
43+ * guaranteeing that no caller waits forever on a pending future.
44+ */
45+class ThreadPool {
I think it might be clearer if this commit came later in the series, after the groundwork is established. In my experience, it’s often easier to review when the problem definition is presented before the solution.
As it stands, this commit introduces code whose purpose isn’t immediately clear without reading ahead to future commits.
I’d like to echo the concern other reviewers raised about separating the thread pool from the script parallelization work. Since script parallelization is already quite complex, would you consider starting with a simpler 2-threaded implementation in this PR, and introducing the general-purpose thread pool in a separate PR?
Given the review bandwidth challenges this PR has faced over the years, breaking it into smaller, more focused pieces might help reviewers feel more confident providing ACKs.
I’d like to echo the concern other reviewers raised about separating the thread pool from the script parallelization work. Since script parallelization is already quite complex, would you consider starting with a simpler 2-threaded implementation in this PR, and introducing the general-purpose thread pool in a separate PR?
Adding 2 threads or n threads represents the exact same work. The code complexity comes from the two-step procedure that allows sequential dumps after parallel block data construction (a restriction that comes from the filters headers-chain structure). It is not related to the number of threads introduced; that is simply controlled by the thread pool introduced in the first commit.
I could split the thread pool into another PR. That would be easy to do. I’m just concerned about getting into a high-level reviewer discussion with not much substance. Such simple PRs tend to allow that. One can easily lose focus and end up discussing nuances endlessly.
andrewtoth
commented at 5:52 pm on October 15, 2025:
Such simple PRs
The thread pool, tests, and fuzz harness are 560 lines. More than half the additions to this PR. I don’t think it’s fair to say that would be a simple PR.
I don’t think the fuzz and tests are hard to grasp, but fair. I should have said that it is simple in comparison to the other changes, which require further contextual knowledge and not just multi-threading.
We can try splitting the thread pool. I’m definitely not opposed to it, just have some concerns.
The review effort would be the same for 2 or nproc threads, but the implications would be limited with the former - we cannot have e.g. complete deadlock if at most two threads are stuck on indexing. And the recent failures on CI indicate that deadlock concerns aren’t unfounded.
That’s not what happened. Tasks do not depend on each other (they actually never did); they run in isolation, so threads cannot deadlock with each other. They are not waiting for any resource held by other threads.
The issue on the CI was not related to the number of threads; they were all finishing and exiting. It would have occurred with 2 as well. I was just not accounting for the fact that tasks can finish at the same time in the new approach, so the opportunistic post-processing procedure was not triggered.
in src/index/txindex.h:33 in 7868fae75d (outdated)
@@ -29,13 +29,19 @@ class TxIndex final : public BaseIndex
29
30     BaseIndex::DB& GetDB() const override;
31
32+    std::any CustomProcessBlock(const interfaces::BlockInfo& block) override {
33+        return CustomAppend(block);
Parallel run:
2025-10-16T13:58:05Z initload thread exit
2025-10-16T13:58:35Z Syncing txindex with block chain from height 189999
2025-10-16T13:59:12Z Syncing txindex with block chain from height 211999
2025-10-16T13:59:42Z Syncing txindex with block chain from height 223999
2025-10-16T14:00:13Z Syncing txindex with block chain from height 235999
….
2025-10-16T14:16:01Z Syncing txindex with block chain from height 373999
2025-10-16T14:16:41Z Syncing txindex with block chain from height 377999
2025-10-16T14:17:37Z Syncing txindex with block chain from height 379999
2025-10-16T14:18:11Z Syncing txindex with block chain from height 381999
———
Sequential run, ~48 minutes:
2025-10-16T14:21:09Z initload thread exit
2025-10-16T14:21:40Z Syncing txindex with block chain from height 135999
2025-10-16T14:22:10Z Syncing txindex with block chain from height 157999
2025-10-16T14:22:43Z Syncing txindex with block chain from height 179999
2025-10-16T14:23:18Z Syncing txindex with block chain from height 185999
2025-10-16T14:23:49Z Syncing txindex with block chain from height 191999
….
2025-10-16T15:08:25Z Syncing txindex with block chain from height 378999
2025-10-16T15:08:56Z Syncing txindex with block chain from height 379999
2025-10-16T15:09:59Z Syncing txindex with block chain from height 381999
in src/test/threadpool_tests.cpp:22 in 3943630a4c (outdated)
17+ for (size_t i = 0; i < futures.size(); ++i) {
18+ if (futures[i].wait_for(TIMEOUT_SECS) != std::future_status::ready) {
19+ throw std::runtime_error("Timeout waiting for: " + context + ", task index " + util::ToString(i));
20+ }
21+ }
22+}
I’m finding this commit difficult to review in depth due to its complexity. Like before, I’m seeing the solution (out-of-order processing with task tracking and opportunistic post-processing) before experiencing the problem it solves.
Could we break this down into focused, smaller commits, where low-risk changes are separated from high-risk ones and the commits tell a story, so the pain is experienced before a solution is proposed?
l0rinc
commented at 8:51 pm on October 12, 2025:
contributor
I started a detailed review of this PR and I’m enthusiastic about the concept - it addresses a real problem and the potential performance gains are significant.
Strong Concept ACK!
However, I’ve encountered some concerns that make me hesitant to continue reviewing the current implementation in depth:
The txindex parallelization appears to be broken (showing 3-13% slowdowns rather than speedups), and this went unnoticed for years
The complexity of the changes makes thorough review very challenging
A shared thread pool introduces additional complexity when IO-bound and CPU-bound tasks compete for the same resources - this typically requires work-stealing mechanisms or separate pools to prevent CPU starvation.
Reproducer
Not sure how other reviewers have reproduced these results, but I needed something reliable that I can run on different platforms with different numbers of threads and storage types.
I have automated it to make it easier to replicate on different platforms, this is a variation of the script I used:
Given that this PR has struggled to attract reviewers over the past two years, I think we need a different approach.
This is a risky change that touches critical index infrastructure - let’s do it in tiny, focused steps instead.
I’d like to propose keeping this PR open as a draft tracking PR that contains the full end-to-end implementation for reference and discussion. Meanwhile, we can merge the work through a series of smaller, focused PRs:
Start with a minimal implementation: single index (blockfilterindex), hardcoded 2 threads, minimal configurability
Get that reviewed, tested, and merged - let’s see what users think
Then add the thread pool infrastructure in a separate PR (if still needed - we might find simpler per-index solutions work better)
Then add txindex and coinstatsindex (and eventually chainstate) support once we have confidence in the approach (and fix the current implementation issues).
I’m happy to help in any way I can and I don’t want to discourage you, but I strongly disagree with the current direction, so that’s an Approach NACK from me.
Breaking this into smaller, more manageable chunks would significantly improve the chances of getting this important work merged - let me know how I can help!
C++20 coroutines (with work-stealing threadpool, suspending on I/O operations) might be worth investigating for this scenario - they could provide a cleaner way to handle the async I/O patterns here. I haven't used them in C++ myself, so I'm not sure if they're a good fit, but I'd be happy to experiment with them if there's interest.
andrewtoth
commented at 1:48 pm on October 13, 2025:
contributor
I also measured the txindex with -indexworkers=15 on a 32 vcpu machine and it was slightly slower than master.
I guess this makes sense since each thread needs to write the index data to leveldb, and the actual work done by the CPU is minimal (serializing then computing offsets). So each thread is waiting for the other threads to finish writing.
For blockfilter the filter computation is more intensive, and each filter is written to a separate file; only after the filter is written is its location written to the shared leveldb.
Edit: Just tried again on latest push f13033233d3f2953f179612ac0582b653ac629da on a node synced to 913866:
branch (f13033233d3f2953f179612ac0582b653ac629da) - time was 90 minutes 41 seconds
master (becf1500131805bd6a47486cd5bc5bdb55839211) - time was 84 minutes 42 seconds
So master was ~7% faster than this PR with 5 workers on an SSD.
Maybe we don’t enable parallel sync for txindex?
Note: #30039 was merged since many of the earlier benchmarks were made. That greatly speeds up the txindex creation.
furszy force-pushed
on Oct 13, 2025
furszy
commented at 3:48 pm on October 13, 2025:
member
Haven’t read the latest comments yet (will do soon), but I’ve been working on an overhaul of the approach this weekend. I got enlightened after last week’s review.
With the latest code on Signet, I synced the block filter index up to block height 240593 on an SSD (following the testing steps in the PR description). Results:
furszy
commented at 3:06 pm on October 14, 2025:
member
I also measured the txindex with -indexworkers=15 on a 32 vcpu machine and it was slightly slower than master. I guess this makes sense since each thread needs to write the index data to leveldb, and the actual work done by the CPU is minimal (serializing then computing offsets). So each thread is waiting for the other threads to finish writing. For blockfilter the filter computation is more intensive, and then each filter is written to a separate file. Only after the filter is written the location of the filter is written to the shared leveldb.
Edit: Just tried again on latest push f130332 on a node synced to 913866:
branch (f130332) - time was 90 minutes 41 seconds master (becf150) - time was 84 minutes 42 seconds
So master was ~7% faster than this PR with 5 workers on an SSD.
Maybe we don’t enable parallel sync for txindex?
I’m still working on the current approach but synced the tx index on the same environment as before; bare hardware running on an SSD, synchronizing Signet up to block height 240593 (following the testing steps in the PR description). Results:
Parallel Mode (5 workers): ~5 minutes.
2025-10-13T19:54:55Z initload thread exit
2025-10-13T19:55:28Z Syncing txindex with block chain from height 149999
2025-10-13T19:55:59Z Syncing txindex with block chain from height 197999
2025-10-13T19:56:32Z Syncing txindex with block chain from height 224999
2025-10-13T19:57:19Z Syncing txindex with block chain from height 225999
2025-10-13T19:58:13Z Syncing txindex with block chain from height 231999
2025-10-13T19:58:57Z Syncing txindex with block chain from height 236999
2025-10-13T19:59:09Z txindex is enabled at height 240593
Sequential Mode: 13 minutes.
2025-10-13T20:00:02Z initload thread exit
2025-10-13T20:00:34Z Syncing txindex with block chain from height 64999
2025-10-13T20:01:04Z Syncing txindex with block chain from height 84999
2025-10-13T20:01:35Z Syncing txindex with block chain from height 130999
2025-10-13T20:02:05Z Syncing txindex with block chain from height 182999
2025-10-13T20:02:36Z Syncing txindex with block chain from height 192999
2025-10-13T20:03:16Z Syncing txindex with block chain from height 200999
2025-10-13T20:03:47Z Syncing txindex with block chain from height 212999
2025-10-13T20:04:20Z Syncing txindex with block chain from height 223999
2025-10-13T20:05:21Z Syncing txindex with block chain from height 226999
2025-10-13T20:05:54Z Syncing txindex with block chain from height 227999
2025-10-13T20:06:29Z Syncing txindex with block chain from height 228999
2025-10-13T20:06:59Z Syncing txindex with block chain from height 229999
2025-10-13T20:07:39Z Syncing txindex with block chain from height 230999
2025-10-13T20:08:39Z Syncing txindex with block chain from height 231999
2025-10-13T20:09:22Z Syncing txindex with block chain from height 232999
2025-10-13T20:10:09Z Syncing txindex with block chain from height 233999
2025-10-13T20:10:48Z Syncing txindex with block chain from height 234999
2025-10-13T20:11:38Z Syncing txindex with block chain from height 236999
2025-10-13T20:12:14Z Syncing txindex with block chain from height 237999
2025-10-13T20:13:02Z Syncing txindex with block chain from height 239999
2025-10-13T20:13:06Z txindex is enabled at height 240593
———
Furthermore, this is something I haven’t pushed but this is from the batching DB writes branch:
Parallel Mode (5 workers), batching db writes: ~2 minutes 30 seconds
2025-10-13T20:30:54Z initload thread exit
2025-10-13T20:31:25Z Syncing txindex with block chain from height 181999
2025-10-13T20:31:56Z Syncing txindex with block chain from height 217999
2025-10-13T20:32:29Z Syncing txindex with block chain from height 225999
2025-10-13T20:33:14Z Syncing txindex with block chain from height 230999
2025-10-13T20:33:50Z Syncing txindex with block chain from height 236999
2025-10-13T20:33:54Z txindex is enabled at height 240593
So I’d guess it’s not only related to disk access; the difference might also be tied to your CPU virtualization technology, @andrewtoth?
Moreover, I mentioned this somewhere earlier in this PR, but it probably got lost over the years: this PR also prepares the ground for batching DB writes across blocks, which will improve sequential runs as well and benefit every user.
Also, parallelization is disabled by default. I don’t think it’s worth removing it for the tx index just because some users might not be able to take advantage of it. It seems to me that the benefits for those who can run it greatly outweigh the limitations for those who can’t (they’re not losing anything with this PR, since sequential mode is still available).
furszy force-pushed
on Oct 14, 2025
furszy force-pushed
on Oct 14, 2025
l0rinc
commented at 8:17 pm on October 14, 2025:
contributor
what’s the reason for the frequent pushes here recently?
furszy
commented at 8:44 pm on October 14, 2025:
member
what’s the reason for the frequent pushes here recently?
I wrote it above, #26966 (comment). I’m working on a few changes that improve parallelism across indexes and the structure of the code; they work properly locally but are not stable on the CI yet. I will update the PR with the modifications once finished.
furszy force-pushed
on Oct 15, 2025
furszy force-pushed
on Oct 15, 2025
furszy
commented at 6:20 pm on October 15, 2025:
member
Way forward
Given that this PR has struggled to attract reviewers over the past two years, I think we need a different approach.
This is a risky change that touches critical index infrastructure - let’s do it in tiny, focused steps instead.
I’d like to propose keeping this PR open as a draft tracking PR that contains the full end-to-end implementation for reference and discussion. Meanwhile, we can merge the work through a series of smaller, focused PRs:
Start with a minimal implementation: single index (blockfilterindex), hardcoded 2 threads, minimal configurability
Get that reviewed, tested, and merged - let’s see what users think
Then add the thread pool infrastructure in a separate PR (if still needed - we might find simpler per-index solutions work better)
Then add txindex and coinstatsindex (and eventually chainstate) support once we have confidence in the approach (and fix the current implementation issues).
For the “hardcoded 2 threads” step: as I wrote above, adding 2 threads or n threads represents exactly the same work. The code complexity comes from the two-step procedure that allows sequential dumps after parallel block data construction (a restriction imposed by the filters’ headers-chain structure; the headers are linked to each other). It is not related to the number of threads introduced, which is simply controlled by the thread pool introduced in the first commit. That point simply doesn’t make sense to me.
Furthermore, describing this as “critical index infrastructure” is not entirely accurate. We are not dealing with consensus-related code here; this code lives at a different level. Existing tests already verify both its behavior and outcomes, and the previous sequential sync behavior is retained. This PR introduces a new optional feature which, in the worst-case scenario, can run slower than sequential sync if it is badly configured by the user (something that could also happen with other options, such as setting a high number of block signature verification threads).
Also, about the “Given that this PR has struggled to attract reviewers over the past two years”:
I don’t think that is fair to say at this point. This PR has gotten quite good reviews lately and momentum has been ramping up nicely. That’s mainly why I ended up modifying part of it and simplifying some workflows (an update will come in the next comment).
I think the problem in past years was mainly the lack of urgency for this new feature, plus the fact that reviewing it requires understanding not only what the indexes are and how they are composed, but also the underlying structure of the block filters chain.
I think it is just a matter of coordination between people who have worked on this code before and anyone else who wants to spend time understanding it deeply. We are happily starting to get the ball rolling in that direction now.
furszy
commented at 6:27 pm on October 15, 2025:
member
Now that the CI is happy again. Update regarding the introduced changes:
Before, we had queues at two different levels: one at the thread pool level and another at the index level. The initial rationale was to granularly control the way in which the process interrupts them when needed: not draining the index’s pending_tasks queue as soon as the first task fails to process or a shutdown is triggered. But, after talking with mzumsande last week, I realized that this was over-engineering for not much gain and was harming parallelism across indexes: top-level index queues were racing with each other to take control of the underlying worker threads.
By unifying these two queues, we let the thread pool decide how and when to handle tasks on its own, with no index interference at all. This allows for much better parallelism among any piece of code sharing the same workers (not just indexes) and also for further improvements down the road, like swapping a single spot for a lock-free structure or prioritizing certain tasks at the worker level (the latter was l0rinc’s idea from two weeks ago).
So, in short, the code should be cleaner and run faster now, using fewer mutexes. There is also a possible follow-up improvement: moving the final post-processing round to the last task being executed (in case more than one task finishes at the same time near the end of sync). I didn’t do it here because it comes with extra code complexity that I don’t think is worth adding to this PR; for the same reason, I don’t think parallelizing the coinstats index here is a good idea, nor batching DB writes. Better to go in steps, as the gains introduced here are already a solid step forward.
furszy force-pushed
on Oct 17, 2025
furszy
commented at 3:42 pm on October 21, 2025:
member
Small update: per discussion, I will push the thread pool in an isolated PR in the coming week, replacing the current http server WorkQueue and all its related code. It is an easy dedup that adds test coverage to an area that was never unit- or fuzz-tested, and it gives the repo a first user of the class so that others can adopt it.
Also, I’m investigating the tx index slowdown with @andrewtoth, trying to understand the differences between our setups. Worst-case scenario, we can disable parallel sync for it (will update accordingly in the coming week).
That being said, this refactoring enables batching DB writes as well, which will speed up the tx index anyway. But I prefer to do that in a follow-up to avoid expanding this PR further.
Last note: I reworked the large commit, splitting it into smaller chunks. We can come back to this one after (or while) the http server thread-handling PR moves forward. But I will focus on moving that one first, per feedback.
index: split block processing into process and post-process phases
No behavior changes.
This separation lays the groundwork for parallelizing the block data
digest phase, while preserving sequential consistency for indexes
that require it before dumping the data to disk.
And it will also allow us to batch database and file writes across
multiple blocks.
Essentially, it introduces a two-phase block processing mechanism using
'CustomProcessBlock()' and 'CustomPostProcessBlocks()':
1) 'CustomProcessBlock()' will be in charge of digesting block data and
returning a custom result object per index.
The goal is to encapsulate all procedures that can safely run
concurrently here.
2) 'CustomPostProcessBlocks()' receives all digested objects in order,
allowing the index to perform any subsequent steps that require sequentiality.
For example, the block filter index requires sequentiality in the headers-chain
construction, as each record depends on its predecessor hash.
Important note: at present, the code executes entirely synchronously.
b8048e8c0a
index: add support for processing batches of blocks
No behavior changes. The index still processes one block at a time.
The code was refactored to allow handling multiple blocks in the
processing phase.
Introducing the concept of a "Task" which merely represents a range
of blocks and their resulting digests.
In the next commit, we will take advantage of this to process batches
of blocks concurrently.
5eaf553d04
Index: introduce SyncContext - move logging & last locator timers
Move progress related timers (logging and locator write) into a shared
SyncContext struct with atomic members. This does not change behavior
but prepares the code for safe multi-threaded execution in the next
commit.
275d0aee75
index: encapsulate Task processing code into RunTask
No-behavior changes.
This refactor moves the existing processing logic from BaseIndex::Sync
into BaseIndex::RunTask, simplifying the main sync loop and laying
groundwork for parallelization of block processing in the upcoming commit.
The goal is to have BaseIndex::RunTask running concurrently.
66237215d1
util: introduce general purpose thread pool
83db5cd4ab
init: provide thread pool to indexes
And add option to customize thread pool workers count
4525317af8
index: implement index parallel sync
This introduces parallel sync for the indexes initial sync,
distributing the workload across multiple threads.
When enabled, the chain is divided into fixed-size block ranges called
"tasks". Worker threads consume tasks concurrently, calling
CustomProcessBlock over their assigned range, storing results in a
shared context while opportunistically batching and dumping the collected
information to disk sequentially (when needed).
Since large reorgs are improbable during initial sync (headers-chain PoW
dictates the valid chain), reorgs are detected only before syncing begins
and once it completes. Any new blocks connected during the process are
caught up sequentially at the end. This, together with the fact that we
no longer depend on an intermediate "index best block" that might be out
of sync with the index m_best_block_index, allows us to remove the
index_reorg_crash test, as it is no longer possible to call Rewind on a
block that is not the index tip.
72562c5163
index: enable block filter index parallel sync
It also adds coverage for initial sync from a particular block,
mimicking a node restart.
This is a metadata mirror of the GitHub repository
bitcoin/bitcoin.
This site is not affiliated with GitHub.
Content is generated from a GitHub metadata backup.
generated: 2025-10-31 15:13 UTC
This site is hosted by @0xB10C More mirrored repositories can be found on mirror.b10c.me