bench: add support for custom data directory

furszy commented at 9:01 pm on September 29, 2024: member

Expands the benchmark framework with the existing -testdatadir arg, enabling the ability to change the benchmark data directory.

This is useful for running benchmarks on different storage devices, and not just under the OS /tmp/ directory.

A good use case is #28574, where we are benchmarking the wallet migration process on an HDD.

DrahtBot commented at 9:01 pm on September 29, 2024: contributor

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage & Benchmarks

For details see: https://corecheck.dev/bitcoin/bitcoin/pulls/31000.

Reviews

See the guideline for information on the review process.

Type	Reviewers
ACK	tdb3, hodlinator, maflcko, pablomartin4btc, achow101

If your review is incorrectly listed, please react with 👎 to this comment and the bot will ignore it on the next update.

Conflicts

Reviewers, this pull request conflicts with the following ones:

#31260 (WIP: scripted-diff: Type-safe settings retrieval by ryanofsky)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

DrahtBot added the label Tests on Sep 29, 2024

DrahtBot added the label CI failed on Sep 29, 2024

DrahtBot commented at 10:24 pm on September 29, 2024: contributor

🚧 At least one of the CI tasks failed. Debug: https://github.com/bitcoin/bitcoin/runs/30825058976

Try to run the tests locally, according to the documentation. However, a CI failure may still happen due to a number of reasons, for example:

Possibly due to a silent merge conflict (the changes in this pull request being incompatible with the current code in the target branch). If so, make sure to rebase on the latest commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the affected test.
An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

furszy force-pushed on Sep 30, 2024

DrahtBot removed the label CI failed on Sep 30, 2024

maflcko commented at 5:43 am on September 30, 2024: member

Just as a note, it should already be possible to pick the device via an env var, see https://en.cppreference.com/w/cpp/filesystem/temp_directory_path#Notes.

EricMujjona approved

EricMujjona commented at 8:57 am on September 30, 2024: none

Efficient code

furszy commented at 8:42 pm on September 30, 2024: member

Just as a note, it should already be possible to pick the device via an env var, see https://en.cppreference.com/w/cpp/filesystem/temp_directory_path#Notes.

Hmm, thats a good observation. Even when that may be possible now, I’m not sure we should start recommending the mixture of env vars and binary args at the user interface level. It seems more natural to plug the already existing -testdatadir arg to the benchmark framework. Still, if we move towards the env var direction, the next step should be revert #26564?.

maflcko commented at 5:35 am on October 1, 2024: member

the next step should be revert #26564?.

IIUC the motivation for 26564 was to have a fully static and fixed path for the unit test datadir. My understanding is that changing the temp storage device was just a side-effect and not the primary motivation. (My comment was just a note to say that your goal is already achievable today)

It seems more natural to plug the already existing -testdatadir arg to the benchmark framework.

I agree. I’ve also suggested this in #30737 (review)

furszy commented at 6:01 pm on October 1, 2024: member

the next step should be revert #26564?.

IIUC the motivation for 26564 was to have a fully static and fixed path for the unit test datadir. My understanding is that changing the temp storage device was just a side-effect and not the primary motivation. (My comment was just a note to say that your goal is already achievable today)

Practically speaking, we’re ultimately talking about pretty much the same feature. It’s about customizing the test root directory path so it can be inspected or used in another context. The thing is, 26564 introduced several different features in the same commit: the test no cleanup, the directory name specialization using the test name, and the -testdatadir arg. And what I meant is that this last point can be achieved by setting the environment variable you mentioned. Yet, it seems we agree on not doing it, but it’s curious that this can still be used nowadays.

in src/bench/bench_bitcoin.cpp:65 in 1ba225c6ce outdated

60@@ -60,6 +61,17 @@ static uint8_t parsePriorityLevel(const std::string& str) {
61     return levels;
62 }
63 
64+// Parses test setup related arguments
65+static std::vector<std::string> parseTestSetupArgs(const ArgsManager& argsman) {

hodlinator commented at 7:44 am on October 3, 2024:

Braces on new lines for classes, functions, methods.

Can see you are following style of function above, but function below follows the dev-notes.

Don’t feel as strongly about capitalization.

0static std::vector<std::string> ParseTestSetupArgs(const ArgsManager& argsman)
1{

If you want to make parsePriorityLevel conform to the dev-notes as well, I support it.

furszy commented at 6:43 pm on October 3, 2024:

Done. Added the jump line. I don’t feel strong about the capitalization neither but it looks incorrect to introduce it now when all other functions aren’t using it.

hodlinator commented at 10:22 pm on October 5, 2024:

I don’t feel strong about the capitalization neither but it looks incorrect to introduce it now when all other functions aren’t using it.

(SetupBenchArgs is capitalized, main must be non-capitalized, so that only leaves parseAsymptote to change beyond the function under discussion).

in src/bench/bench_bitcoin.cpp:68 in 1ba225c6ce outdated

60@@ -60,6 +61,17 @@ static uint8_t parsePriorityLevel(const std::string& str) {
61     return levels;
62 }
63 
64+// Parses test setup related arguments
65+static std::vector<std::string> parseTestSetupArgs(const ArgsManager& argsman) {
66+    std::vector<std::string> args;
67+
68+    // Test data directory

hodlinator commented at 8:07 am on October 3, 2024:

nit: Redundant comment.

furszy commented at 6:45 pm on October 3, 2024:

Have reworked this function in a more general manner just so we can add new args in the future with just one line of parsing code.

hodlinator commented at 10:27 pm on October 5, 2024:

A bit heavy for one arg as it is right now, but I like that it avoids repeating the name. :+1:

in src/bench/bench_bitcoin.cpp:40 in 1ba225c6ce outdated

36@@ -37,6 +37,7 @@ static void SetupBenchArgs(ArgsManager& argsman)
37     argsman.AddArg("-sanity-check", "Run benchmarks for only one iteration with no output", ArgsManager::ALLOW_ANY, OptionsCategory::OPTIONS);
38     argsman.AddArg("-priority-level=<l1,l2,l3>", strprintf("Run benchmarks of one or multiple priority level(s) (%s), default: '%s'",
39                                                            benchmark::ListPriorities(), DEFAULT_PRIORITY), ArgsManager::ALLOW_ANY, OptionsCategory::OPTIONS);
40+    argsman.AddArg("-testdatadir", "Run benchmarks on a custom directory (default: </tmp/>)", ArgsManager::ALLOW_ANY, OptionsCategory::DEBUG_TEST);

hodlinator commented at 8:19 am on October 3, 2024:

Why not make SetupUnitTestArgs available and call it here instead? Delegates the responsibility of documenting the default value to that part of the code.

(Could maybe rename SetupUnitTestArgs -> SetupTestArgs or static BasicTestingSetup::RegisterArgs to make it less unit test-specific and communicate how broadly it is used).

furszy commented at 7:15 pm on October 3, 2024:

Why not make SetupUnitTestArgs available and call it here instead? Delegates the responsibility of documenting the default value to that part of the code.

(Could maybe rename SetupUnitTestArgs -> SetupTestArgs or static BasicTestingSetup::RegisterArgs to make it less unit test-specific and communicate how broadly it is used).

I think we should be careful and explicit about the features we add to other binaries. One of them could end up clashing with another functionality or be removed from a binary that was using it without us noticing it.

hodlinator commented at 10:16 pm on October 5, 2024:

Agree a SetupTestArgs may not communicate clearly enough that the function is used in a broader context. How about calling it SetupCommonTestArgs and adding a comment about it being used in benchmarks as well? That, together with no longer marking the function static (private to the compilation unit), might mitigate your concerns?

furszy commented at 11:55 pm on October 5, 2024:

Agree a SetupTestArgs may not communicate clearly enough that the function is used in a broader context. How about calling it SetupCommonTestArgs and adding a comment about it being used in benchmarks as well? That, together with no longer marking the function static (private to the compilation unit), might mitigate your concerns?

I’m not so in favor of mentioning upper level binaries or frameworks in lower level files but no problem in applying the suggestion.

furszy commented at 0:01 am on October 6, 2024:

Hmm well, I’m still a bit reluctant to it. It would add test/util/setup_common.h dependency to the benchmark binary entry point just for a single line of code.

hodlinator commented at 9:09 pm on October 10, 2024:

Adding a dependency on setup_common.h is a case of making implicit dependencies explicit in my view. What the current version signifies is a different argument description (starting with “Run benchmarks " like the others) and a default directory value that can’t be as clear as the original. We shouldn’t be duplicating code snippets to avoid #includes. Even if it minimizes the size of this specific PR, going forward one has to make sure to keep both the original and duplicated place up to date which I argue is a worse state of tech debt.

I’m happy to hear stronger counter-arguments if you feel it’s worth your time, but I can understand if you want to drop this subject too.

Summarizing my intent:

 0diff --git a/src/bench/bench_bitcoin.cpp b/src/bench/bench_bitcoin.cpp
 1index 3d649a8436..3ae3f7dd4c 100644
 2--- a/src/bench/bench_bitcoin.cpp
 3+++ b/src/bench/bench_bitcoin.cpp
 4@@ -8,6 +8,7 @@
 5 #include <tinyformat.h>
 6 #include <util/fs.h>
 7 #include <util/string.h>
 8+#include <test/util/setup_common.h>
 9 
10 #include <chrono>
11 #include <cstdint>
12@@ -37,7 +38,7 @@ static void SetupBenchArgs(ArgsManager& argsman)
13     argsman.AddArg("-sanity-check", "Run benchmarks for only one iteration with no output", ArgsManager::ALLOW_ANY, OptionsCategory::OPTIONS);
14     argsman.AddArg("-priority-level=<l1,l2,l3>", strprintf("Run benchmarks of one or multiple priority level(s) (%s), default: '%s'",
15                                                            benchmark::ListPriorities(), DEFAULT_PRIORITY), ArgsManager::ALLOW_ANY, OptionsCategory::OPTIONS);
16-    argsman.AddArg("-testdatadir", "Run benchmarks on a custom directory (default: </tmp/>)", ArgsManager::ALLOW_ANY, OptionsCategory::DEBUG_TEST);
17+    SetupCommonTestArgs(argsman);
18 }
19 
20 // parses a comma separated list like "10,20,30,50"
21diff --git a/src/test/util/setup_common.cpp b/src/test/util/setup_common.cpp
22index fa89ceb332..2471290fe9 100644
23--- a/src/test/util/setup_common.cpp
24+++ b/src/test/util/setup_common.cpp
25@@ -87,8 +87,8 @@ struct NetworkSetup
26 };
27 static NetworkSetup g_networksetup_instance;
28 
29-/** Register test-only arguments */
30-static void SetupUnitTestArgs(ArgsManager& argsman)
31+/** Register common test and bench arguments */
32+void SetupCommonTestArgs(ArgsManager& argsman)
33 {
34     argsman.AddArg("-testdatadir", strprintf("Custom data directory (default: %s<random_string>)", fs::PathToString(fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / "")),
35                    ArgsManager::ALLOW_ANY, OptionsCategory::DEBUG_TEST);
36@@ -126,7 +126,7 @@ BasicTestingSetup::BasicTestingSetup(const ChainType chainType, TestOpts opts)
37     gArgs.ClearPathCache();
38     {
39         SetupServerArgs(*m_node.args);
40-        SetupUnitTestArgs(*m_node.args);
41+        SetupCommonTestArgs(*m_node.args);
42         std::string error;
43         if (!m_node.args->ParseParameters(arguments.size(), arguments.data(), error)) {
44             m_node.args->ClearArgs();
45diff --git a/src/test/util/setup_common.h b/src/test/util/setup_common.h
46index f5a3c0dcee..4d8b9370ca 100644
47--- a/src/test/util/setup_common.h
48+++ b/src/test/util/setup_common.h
49@@ -45,6 +45,8 @@ extern const std::function<std::string()> G_TEST_GET_FULL_NAME;
50 
51 static constexpr CAmount CENT{1000000};
52 
53+void SetupCommonTestArgs(ArgsManager& argsman);
54+
55 struct TestOpts {
56     std::vector<const char*> extra_args{};
57     bool coins_db_in_memory{true};

furszy commented at 4:26 pm on October 13, 2024:

I do think that referencing upper-level binaries in lower-level framework code isn’t ideal, and duplicating a single line for an argument that rarely changes behavior is harmless. Especially when the small deduplication comes with other unnecessary stuff and isn’t truly a duplication as the help text is customized for benchmark binary users. But, that said, we’re already breaking the first point elsewhere in setup_common.h, so np. Will re-touch a bit your suggestion and push it.

in src/bench/bench.cpp:36 in 1ba225c6ce outdated

32+ * Retrieves the available test setup command line arguments that may be used
33+ * in the benchmark. They will be used only if the benchmark utilizes a
34+ * 'BasicTestingSetup' or any child of it.
35+ */
36+static std::function<std::vector<const char*>()> g_bench_command_line_args{};
37+

hodlinator commented at 8:23 am on October 3, 2024:

(Please either remove empty line after local global here or add an empty line after static std::string g_running_benchmark_name).

furszy commented at 7:19 pm on October 3, 2024:

done.

hodlinator commented at 9:07 am on October 3, 2024: contributor

Concept ACK 1ba225c6ce689defb7063dd68f62e1e90fcdda5e

Good to see these globals implemented for bench. As already mentioned, I’ve been working on a partially overlapping change in #30737.

furszy force-pushed on Oct 3, 2024

furszy commented at 7:21 pm on October 3, 2024: member

Thanks for the review @hodlinator. Updated per feedback.

Good to see these globals implemented for bench. As already mentioned, I’ve been working on a partially overlapping change in #30737.

Cool. It wasn’t on my radar. Adding it now.

hodlinator commented at 10:32 pm on October 5, 2024: contributor

git range-diff master 1ba225c ded1a6c

Still feel like it would be good to call a common function adding -testdatadir to an ArgsManager instead of duplicating the functionality between test/bench (and having a different description-string).

pablomartin4btc commented at 2:59 am on October 9, 2024: member

tACK ded1a6cc498ffde0695eb3ee6828b7c513ccc277

 0./build/src/bench/bench_bitcoin -filter=Xor -testdatadir=/tmp/btc
 1Warning, results might be unstable:
 2* DEBUG defined
 3* CPU frequency scaling enabled: CPU 0 between 400.0 and 4,700.0 MHz
 4* CPU governor is 'powersave' but should be 'performance'
 5* Turbo is enabled, CPU frequency will fluctuate
 6
 7Recommendations
 8* Make sure you compile for Release
 9* Use 'pyperf system tune' before benchmarking. See https://github.com/psf/pyperf
10
11|             ns/byte |              byte/s |    err% |     total | benchmark
12|--------------------:|--------------------:|--------:|----------:|:----------
13|                9.24 |      108,266,754.90 |    1.5% |      0.01 | `Xor`

 0
 1./build/src/bench/bench_bitcoin -filter=Xor -testdatadir=/media/pablo/USB\ DISK/tmp/btc
 2Warning, results might be unstable:
 3* DEBUG defined
 4* CPU frequency scaling enabled: CPU 0 between 400.0 and 4,700.0 MHz
 5* CPU governor is 'powersave' but should be 'performance'
 6* Turbo is enabled, CPU frequency will fluctuate
 7
 8Recommendations
 9* Make sure you compile for Release
10* Use 'pyperf system tune' before benchmarking. See https://github.com/psf/pyperf
11
12|             ns/byte |              byte/s |    err% |     total | benchmark
13|--------------------:|--------------------:|--------:|----------:|:----------
14|               11.52 |       86,792,532.79 |   28.6% |      0.01 | :wavy_dash: `Xor` (Unstable with ~45.2 iters. Increase `minEpochIterations` to e.g. 452)

I think it’s a nice feature, even this can be achieved by changing the env var (e.g. TMPDIR=/my_temp ./build/src/bench/bench_bitcoin), using -testdatadir feels more consistent with other binaries use cases.

DrahtBot requested review from hodlinator on Oct 9, 2024

tdb3 commented at 3:47 am on October 12, 2024: contributor

Approach ACK Useful improvement.

Did a quick sanity check for now (ramdisk at /mnt/tmp/). Planning to circle back.

0$ build/src/bench/bench_bitcoin -testdatadir=/mnt/tmp/
1$ ls /mnt/tmp/test_common\ bitcoin
2AssembleBlock                 DuplicateInputs        MempoolCheck              WalletBalanceDirty                             WalletIsMineDescriptors
3BlockAssemblerAddPackageTxns  LoadExternalBlockFile  MempoolEviction           WalletBalanceMine                              WalletIsMineMigratedDescriptors
4BlockFilterIndexSync          LogWithDebug           ReadBlockFromDiskTest     WalletBalanceWatch                             WalletLoadingDescriptors
5BlockToJsonVerbose            LogWithoutDebug        ReadRawBlockFromDiskTest  WalletCreateEncrypted
6BlockToJsonVerboseWrite       LogWithoutThreadNames  RpcMempool                WalletCreatePlain
7CheckBlockIndex               LogWithoutWriteToFile  WalletAvailableCoins      WalletCreateTxUseOnlyPresetInputs
8ComplexMemPool                LogWithThreadNames     WalletBalanceClean        WalletCreateTxUsePresetInputsAndCoinSelection

furszy force-pushed on Oct 13, 2024

in src/bench/bench_bitcoin.cpp:31 in 15cfeebb47 outdated

27@@ -27,6 +28,7 @@ static const std::string DEFAULT_PRIORITY{"all"};
28 static void SetupBenchArgs(ArgsManager& argsman)
29 {
30     SetupHelpOptions(argsman);
31+    SetupCommonTestArgs(argsman);

l0rinc commented at 9:08 am on October 15, 2024:

I quickly glanced over this PR only, do I understand it correctly that we’re treating benchmarks as tests here (I’m not sure I agree with that concept) and adding parameters to the benchmark setup that only apply to a smaller subset of the benchmarks? Could we configure them via env variables instead and call the benchmark with e.g. TEST_DATA_DIR=a/b/c bench instead? As an example, if a few of our tests require a for example a timestamp for whatever reason, we wouldn’t add it to the testing framework as an additional parameter, right? So if my understanding is correct, I’m leaning towards a concept NACK - please let me know if I misunderstood it.

furszy commented at 2:23 pm on October 15, 2024:

I quickly glanced over this PR only, do I understand it correctly that we’re treating benchmarks as tests here (I’m not sure I agree with that concept)

I’m not sure this is the best place for the conversation, but generally speaking, benchmarks are a form of testing. The difference is that they evaluate performance against previous runs rather than behavior.

adding parameters to the benchmark setup that only apply to a smaller subset of the benchmarks?

This applies to every benchmark requiring a node context. We currently have 33 of them.

Could we configure them via env variables instead and call the benchmark with e.g. TEST_DATA_DIR=a/b/c bench instead?

Please check the conversation above #31000 (comment).

As an example, if a few of our tests require a for example a timestamp for whatever reason, we wouldn’t add it to the testing framework as an additional parameter, right?

Unlike the hardware device type the benchmark runs on, which can’t be standardized for all users, software level variables like timestamps can be set in a general manner if needed. I don’t think there will be many args like this one.

So if my understanding is correct, I’m leaning towards a concept NACK - please let me know if I misunderstood it.

I think you understood it. Just have a different opinion.

l0rinc commented at 3:00 pm on October 15, 2024:

I did see that conversation, but I don’t think it’s the benchmark’s responsibility to reveal the internals of a certain group of tests.

hodlinator approved

hodlinator commented at 9:52 am on October 15, 2024: contributor

ACK 15cfeebb47587af6ce7be72fd52c57a0483d86d2

Concept

Thanks @furszy for making the dependency explicit! I don’t fully understand the upper/lower level argument as bench_bitcoin has a link dependency on test_util already.

Using ENV-vars instead as suggested in other comment seems too sneaky for me, and contradicts my goal of having things explicit. (IMO one original sin of C was main not taking environment args as a parameter). 31 tests are using BasicTestingSetup so I think it’s common enough to be supported at a benchmark executor level.

Tested

 0₿ build/src/bench/bench_bitcoin -testdatadir=foo/
 1...
 2Test directory (will not be deleted): "/home/hodlinator/bitcoin/foo/test_common bitcoin/WalletIsMineDescriptors/datadir"
 3...
 4
 5₿ ls foo/test_common\ bitcoin/
 6AssembleBlock                 ComplexMemPool         LogWithoutWriteToFile     RpcMempool            WalletCreateEncrypted                          WalletLoadingDescriptors
 7BlockAssemblerAddPackageTxns  DuplicateInputs        LogWithThreadNames        WalletAvailableCoins  WalletCreatePlain
 8BlockFilterIndexSync          LoadExternalBlockFile  MempoolCheck              WalletBalanceClean    WalletCreateTxUseOnlyPresetInputs
 9BlockToJsonVerbose            LogWithDebug           MempoolEviction           WalletBalanceDirty    WalletCreateTxUsePresetInputsAndCoinSelection
10BlockToJsonVerboseWrite       LogWithoutDebug        ReadBlockFromDiskTest     WalletBalanceMine     WalletIsMineDescriptors
11CheckBlockIndex               LogWithoutThreadNames  ReadRawBlockFromDiskTest  WalletBalanceWatch    WalletIsMineMigratedDescriptors

DrahtBot requested review from tdb3 on Oct 15, 2024

DrahtBot requested review from pablomartin4btc on Oct 15, 2024

maflcko commented at 3:02 pm on October 16, 2024: member

lgtm ACK 15cfeebb47587af6ce7be72fd52c57a0483d86d2

pablomartin4btc approved

pablomartin4btc commented at 8:18 pm on October 16, 2024: member

re-ACK 15cfeebb47587af6ce7be72fd52c57a0483d86d2

Changes since my last review: removing the -testdatadir arg from bench-bitcoin and reusing it from the utils setup_common.

achow101 commented at 9:02 pm on October 23, 2024: member

In 221410bc14616eaf7017e5097cd6848fcb4f8e1b “bench: specialize working directory name” , the commit message says

After this commit, benchmarks are contained within its own directory: /<OS_tmp_dir>/test_common bitcoin/<benchmark_name>/<random_uint256>/

However I am not observing this behavior. It omits the benchmark name.

The benchmark name is included when -testdatadir is used.

furszy commented at 9:34 pm on October 23, 2024: member

In 221410b “bench: specialize working directory name” , the commit message says

After this commit, benchmarks are contained within its own directory: /<OS_tmp_dir>/test_common bitcoin/<benchmark_name>/<random_uint256>/

However I am not observing this behavior. It omits the benchmark name.

The benchmark name is included when -testdatadir is used.

Yeah, that’s because the data directory is erased after execution when -testdatadir isn’t used. And the same applies to unit tests. That’s why I didn’t change it, though I could for consistency, making it easier to find the test data directory in case of failure - which is when the directory is not erased.

furszy force-pushed on Oct 23, 2024

furszy commented at 9:43 pm on October 23, 2024: member

Updated per feedback. Thanks achow.

pablomartin4btc approved

pablomartin4btc commented at 4:28 pm on October 24, 2024: member

re-ACK 5757fdf0dc74ec9d6bbefba937d6e23b09652605

A small improvement since my last review, adding (also) benchmark name/ test name to the test common dir when -testdatadir is not specified.

DrahtBot requested review from maflcko on Oct 24, 2024

DrahtBot requested review from hodlinator on Oct 24, 2024

maflcko commented at 5:08 pm on October 24, 2024: member

Needs rebase

furszy force-pushed on Oct 24, 2024

in src/test/util/setup_common.cpp:144 in 80d35bc207 outdated

142     if (!m_node.args->IsArgSet("-testdatadir")) {
143-        // By default, the data directory has a random name
144+        // By default, the data directory has a random name on each test run
145         const auto rand_str{g_rng_temp_path.rand256().ToString()};
146-        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / rand_str;
147+        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / test_name / rand_str;

hodlinator commented at 8:39 am on October 25, 2024:

(Since with the latest non-rebase push you are now touching 2/4 lines mentioning TEST_DIR_PATH_ELEMENT, iff you were to re-touch again, it would be nice if you also renamed it to TESTS_DIR_NAME as per @ryanofsky’s suggestion).

hodlinator approved

hodlinator commented at 9:17 am on October 25, 2024: contributor

re-ACK 80d35bc20797e59571e82fe6919705b33dcc40cd

Commit message improvements.
test_path -> test_name (improved naming).
Made test_name be included as part of randomized test directory when not setting -testdatadir. Including it is helpful for locating test data of a specific test when multiple tests fail. :+1:

Verified correct unit test directory behavior using build/src/test/test_bitcoin -- -testdatadir=/tmp/foo/, then not passing the -testdatadir and simultaneously running lsof |grep bitcoin |grep /tmp.

DrahtBot requested review from pablomartin4btc on Oct 25, 2024

maflcko commented at 9:46 am on October 25, 2024: member

Not sure about the CI failure. IIRC it happened after this push: https://github.com/bitcoin/bitcoin/compare/15cfeebb47587af6ce7be72fd52c57a0483d86d2..5757fdf0dc74ec9d6bbefba937d6e23b09652605

The logs don’t say anything except: validation_chainstatemanager_tests ...Exit code 0xc0000409 2024-10-24T23:23:18.2978457Z ***Exception: 9.33 sec and the debug log is truncated (presumably when the exception happens).

It would be good if ctest/Windows actually printed the exception and error message instead of silently hiding it.

cc @hebasto

I checked for a MAX_PATH violation, but while plausible, I couldn’t find evidence:

0>>> len(r'C:\Users\RUNNER~1\AppData\Local\Temp\test_common bitcoin\validation_chainstatemanager_tests\chainstatemanager_rebalance_caches\f487e4068280f4a48936848359cd7101a5000f55b4c7d05c531cd317544df024\regtest\chainstate')
1210

maflcko commented at 9:21 am on October 28, 2024: member

Looks like that fixed it. However, it would be good to understand why it fixed it. Looking at the diff that triggered it, I fail to see how (https://github.com/bitcoin/bitcoin/pull/31000#issuecomment-2437357680)

hodlinator commented at 9:18 am on October 29, 2024: contributor

Looks like that fixed it. However, it would be good to understand why it fixed it. Looking at the diff that triggered it, I fail to see how (#31000 (comment))

Running on another branch with -testdatadir=foo I get files like: foo/test_common bitcoin/validation_chainstatemanager_tests/chainstatemanager_snapshot_completion_hash_mismatch/datadir/regtest/chainstate_snapshot_INVALID/MANIFEST-000004

Combined with C:\Users\RUNNER~1\AppData\Local\Temp\test_common bitcoin\validation_chainstatemanager_tests\chainstatemanager_rebalance_caches\f487e4068280f4a48936848359cd7101a5000f55b4c7d05c531cd317544df024 from above into something like C:\Users\RUNNER~1\AppData\Local\Temp/test_common bitcoin/validation_chainstatemanager_tests/chainstatemanager_snapshot_completion_hash_mismatch/f487e4068280f4a48936848359cd7101a5000f55b4c7d05c531cd317544df024/datadir/regtest/chainstate_snapshot_INVALID/MANIFEST-000004 we get 268 chars, while MAX_PATH is 260.

Edit: Does not sufficiently explain the CI failure as CI does not set -testdatadir and so would not end up with the random 256 bit hex values in the paths, as pointed out by maflcko in the following comment.

maflcko commented at 9:24 am on October 29, 2024: member

Running on another branch with -testdatadir=foo

The Windows CI doesn’t set the flag and the push/diff I linked to didn’t change the non -testdatadir code path, but the previous commit passed and the next one failed. Just asking to understand what is going on with the CI.

in src/test/util/setup_common.cpp:144 in ebdace5aaf outdated

140@@ -141,7 +141,7 @@ BasicTestingSetup::BasicTestingSetup(const ChainType chainType, TestOpts opts)
141     const std::string test_name{G_TEST_GET_FULL_NAME ? G_TEST_GET_FULL_NAME() : ""};
142     if (!m_node.args->IsArgSet("-testdatadir")) {
143         // By default, the data directory has a random name on each test run
144-        const auto rand_str{g_rng_temp_path.rand256().ToString()};
145+        const auto rand_str{util::ToString(g_rng_temp_path.rand32())};

tdb3 commented at 12:28 pm on October 29, 2024:

nit: for awareness, may want to mention in OP that this changes directory names from 256-bit randoms to 32-bit randoms. I’m not seeing an issue with it (virtually non-existent likelihood of collision).

tdb3 approved

tdb3 commented at 12:31 pm on October 29, 2024: contributor

ACK ebdace5aaf74626da83eeee1f966acaad3ed0c35

Light code review and tested.

 0$ build/src/bench/bench_bitcoin -testdatadir=/mnt/31000/
 1...
 2$ ls /mnt/31000/test_common\ bitcoin/
 3AssembleBlock                 LoadExternalBlockFile  ReadBlockFromDiskTest     WalletCreateEncrypted
 4BlockAssemblerAddPackageTxns  LogWithDebug           ReadRawBlockFromDiskTest  WalletCreatePlain
 5BlockFilterIndexSync          LogWithThreadNames     RpcMempool                WalletCreateTxUseOnlyPresetInputs
 6BlockToJsonVerbose            LogWithoutDebug        WalletAvailableCoins      WalletCreateTxUsePresetInputsAndCoinSelection
 7BlockToJsonVerboseWrite       LogWithoutThreadNames  WalletBalanceClean        WalletIsMineDescriptors
 8CheckBlockIndex               LogWithoutWriteToFile  WalletBalanceDirty        WalletIsMineMigratedDescriptors
 9ComplexMemPool                MempoolCheck           WalletBalanceMine         WalletLoadingDescriptors
10DuplicateInputs               MempoolEviction        WalletBalanceWatch

DrahtBot requested review from hodlinator on Oct 29, 2024

hebasto commented at 4:00 pm on October 29, 2024: member

re: #31000 (comment)

It would be good if ctest/Windows actually printed the exception and error message instead of silently hiding it.

I’m not aware of the specific case discussed above, but exceptions in tests are generally printed as follows (with a patched code):

 0> ctest --test-dir build-static -j 8 -C Release -R amount --output-on-failure
 1Internal ctest changing into directory: C:/Users/hebasto/bitcoin/build-static
 2Test project C:/Users/hebasto/bitcoin/build-static
 3    Start 10: amount_tests
 41/1 Test [#10](/bitcoin-bitcoin/10/): amount_tests .....................***Failed    0.04 sec
 5Running 4 test cases...
 6Entering test module "Bitcoin Core Test Suite"
 7C:\Users\hebasto\bitcoin\src\test\addrman_tests.cpp(63): Test suite "addrman_tests" is skipped because disabled
 8C:\Users\hebasto\bitcoin\src\test\allocator_tests.cpp(16): Test suite "allocator_tests" is skipped because disabled
 9C:\Users\hebasto\bitcoin\src\test\amount_tests.cpp(12): Entering test suite "amount_tests"
10C:\Users\hebasto\bitcoin\src\test\amount_tests.cpp(14): Entering test case "MoneyRangeTest"
11C:\Users\hebasto\bitcoin\src\test\amount_tests.cpp(14): Leaving test case "MoneyRangeTest"; testing time: 14us
12C:\Users\hebasto\bitcoin\src\test\amount_tests.cpp(23): Entering test case "GetFeeTest"
13unknown location(0): fatal error: in "amount_tests/GetFeeTest": class std::runtime_error: A runtime error occurred
14C:\Users\hebasto\bitcoin\src\test\amount_tests.cpp(59): last checkpoint
15...

maflcko commented at 8:07 am on October 30, 2024: member

It would be good if ctest/Windows actually printed the exception and error message instead of silently hiding it.

I’m not aware of the specific case discussed above, but exceptions in tests are generally printed as follows (with a patched code):

Maybe the log is truncated in CI and only the head is printed, as opposed to the tail, which would be more useful?

furszy force-pushed on Nov 5, 2024

furszy commented at 7:19 pm on November 5, 2024: member

Sorry for the delay. @maflcko, the issue seems to be that the exception occurs in the test base class constructor. Which is not part of the actual test code and runs prior to it. So it might not be captured by the framework, throwing the “unknown location error”. Just dropped the fix and pushed a commit to verify it. Let’s see what the CI prints now.

furszy force-pushed on Nov 5, 2024

furszy force-pushed on Nov 6, 2024

DrahtBot added the label CI failed on Nov 6, 2024

DrahtBot commented at 4:08 pm on November 6, 2024: contributor

🚧 At least one of the CI tasks failed. Debug: https://github.com/bitcoin/bitcoin/runs/32605835023

Try to run the tests locally, according to the documentation. However, a CI failure may still happen due to a number of reasons, for example:

Possibly due to a silent merge conflict (the changes in this pull request being incompatible with the current code in the target branch). If so, make sure to rebase on the latest commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the affected test.
An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

furszy force-pushed on Nov 6, 2024

furszy force-pushed on Nov 7, 2024

furszy commented at 9:06 pm on November 7, 2024: member

@maflcko, have tried a good number of different things to catch the error but didn’t have much luck in the CI. ctest seems to be swallowing the error there. It occurs on a different place, not during the initial root directory creation. It probably occurs in one of the blk**.dat or the ldb files, they are also created inside the TestChain100Setup constructor.

Still, we could at least move forward with something like https://github.com/bitcoin/bitcoin/pull/31000/commits/fd5b9fc36879ead0efb12748cf8c4b9e60d7c59c, which prints a better error message when the base dir exceeds the max path limit. And could also even limit the path size further, reducing it by the longest path we have: MAX_PATH - len(/indexes/blockfilter/basic/db/MANIFEST-000002).

This seems orthogonal to this PR, probably should move it to another PR, but happy to hear your thoughts too.

maflcko commented at 9:39 am on November 8, 2024: member

Yeah, don’t worry about the ctest bug. It is unrelated and should be fixed somewhere else. I was mostly wondering why a force push that does not change behavior (https://github.com/bitcoin/bitcoin/pull/31000#issuecomment-2437357680) suddenly makes the CI trip. Why would CI pass before and not after, when -testdatadir isn’t set. Maybe someone can run the two commits locally on Windows?

furszy commented at 5:58 pm on November 8, 2024: member

Yeah, don’t worry about the ctest bug. It is unrelated and should be fixed somewhere else. I was mostly wondering why a force push that does not change behavior (#31000 (comment)) suddenly makes the CI trip. Why would CI pass before and not after, when -testdatadir isn’t set. Maybe someone can run the two commits locally on Windows?

Ok, that force-push actually contained a behavior change. It changed the default test datadir path. Moved from /<OS_tmp_dir>/test_common bitcoin/<random_uint256>/ to /<OS_tmp_dir>/test_common bitcoin/<test_name>/< random_uint256>/. This makes it easier to find any unit/benchmark/fuzzing test run when a failure occurs. Instead of going through all the random uint256 directories to see all the tests executions, we can just open the specific test name folder.

Thus why I changed the random uint256 to be a uint32 in the fix commit. It doesn’t need to be a long random number anymore.

furszy force-pushed on Nov 8, 2024

furszy commented at 6:19 pm on November 8, 2024: member

Updated. Ready to go.

DrahtBot removed the label CI failed on Nov 8, 2024

maflcko commented at 12:22 pm on November 9, 2024: member

Ok, that force-push actually contained a behavior change.

Ugh, I missed that, I didn’t see the ! negation. Generally writing a simple if with an else branch as if (!a) ... is confusing, because each branch will have a (double) negation. It would be better to just write if (a) and have a single negation in the else branch. But you didn’t change that code, so it can be done in a follow-up.

in src/test/util/setup_common.cpp:144 in 4bbcc70446 outdated

142     if (!m_node.args->IsArgSet("-testdatadir")) {
143-        // By default, the data directory has a random name
144-        const auto rand_str{g_rng_temp_path.rand256().ToString()};
145-        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / rand_str;
146+        // By default, the data directory has a random name on each test run
147+        const auto rand_str{util::ToString(g_rng_temp_path.rand32())};

maflcko commented at 12:25 pm on November 9, 2024:

nit: My preference would be to use the current time (maybe in nanosecond precision) and make the path time/test_name. This way, it is easy to find the last test execution easily, if there is more than one.

With the current approach, one would have to look up which rand32 corresponds to the last execution.

furszy commented at 4:30 pm on November 9, 2024:

sure. Done as suggested.

maflcko approved

maflcko commented at 12:26 pm on November 9, 2024: member

re-ACK 4bbcc704464a470f54f372f93b2649d67b7a60c1

Only change is the default behavior change, which I somehow missed.

DrahtBot requested review from tdb3 on Nov 9, 2024

furszy force-pushed on Nov 9, 2024

in src/test/util/setup_common.cpp:143 in 5f2c7c5143 outdated

143     if (!m_node.args->IsArgSet("-testdatadir")) {
144-        // By default, the data directory has a random name
145-        const auto rand_str{g_rng_temp_path.rand256().ToString()};
146-        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / rand_str;
147+        // By default, the data directory has a random name on each test run
148+        const auto rand_str{util::ToString(GetTime<std::chrono::nanoseconds>().count())};

tdb3 commented at 5:04 pm on November 9, 2024:

non-blocking nit: Since time is being used now, the name isn’t quite as random as before. Could update the comment and rand_str name.

furszy commented at 5:31 pm on November 9, 2024:

sure, pushed. Thanks

tdb3 approved

tdb3 commented at 5:26 pm on November 9, 2024: contributor

re ACK b842ad61c7d9507db415b5d6225f5444f6bf2654

Liking the use of time rather than random.

Re-ran #31000#pullrequestreview-2401711134

0'tmp/test_common bitcoin/BlockAssemblerAddPackageTxns/1731173028231274080'

DrahtBot requested review from maflcko on Nov 9, 2024

furszy force-pushed on Nov 9, 2024

tdb3 approved

tdb3 commented at 5:34 pm on November 9, 2024: contributor

re ACK 5adb4a44776de3f9295cd6bf965f8beba0e4e368

Change since last push is a simple removal of a semi-accurate comment.

in src/test/util/setup_common.cpp:144 in 5adb4a4477 outdated

142     if (!m_node.args->IsArgSet("-testdatadir")) {
143-        // By default, the data directory has a random name
144-        const auto rand_str{g_rng_temp_path.rand256().ToString()};
145-        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / rand_str;
146+        const auto rand_str{util::ToString(GetTime<std::chrono::nanoseconds>().count())};
147+        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / test_name / rand_str;

maflcko commented at 10:21 am on November 11, 2024:

I don’t think it is right to use mockable time here. Also, GetTime is deprecated, so what about TicksSinceEpoch<std::chrono::nanoseconds>(SystemClock::now())?

Also, I think it would be better to switch test_name / rand_str to rand_str / test_name, so that all unit tests from the same time are bundled together. This would also be identical to the functional test runner.

furszy commented at 3:23 pm on November 11, 2024:

I don’t think it is right to use mockable time here. Also, GetTime is deprecated, so what about TicksSinceEpochstd::chrono::nanoseconds(SystemClock::now())?

Yeah sure. This is usually called before mocking the time, but agree to change it.

Also, I think it would be better to switch test_name / rand_str to rand_str / test_name, so that all unit tests from the same time are bundled together. This would also be identical to the functional test runner.

That won’t work as you expect. Think of each unit test as a separate child class inheriting from the testing setup class and implementing a run() method. Each one constructs a new instance of the setup class, thereby creating a new rand_str.

I think we might be able to switch to time / test_name, creating a global testing class with a different Boost framework hook, but I’d prefer to leave that for a follow-up since it’s somewhat unrelated to this PR and will require additional investigation.

furszy commented at 9:18 pm on November 14, 2024:

Done in #31291

DrahtBot requested review from maflcko on Nov 11, 2024

furszy force-pushed on Nov 11, 2024

maflcko commented at 3:33 pm on November 11, 2024: member

re-ACK 6c7b3bf0ce7a6238457dd44ee1d5d744b3663a52

DrahtBot requested review from tdb3 on Nov 11, 2024

tdb3 approved

tdb3 commented at 4:09 pm on November 11, 2024: contributor

code review and light test re ACK 6c7b3bf0ce7a6238457dd44ee1d5d744b3663a52

build/src/bench/bench_bitcoin: … /tmp/test_common bitcoin/BlockAssemblerAddPackageTxns/1731341243673972130/regtest/blocks

Update uses TicksSinceEpoch with SystemClock::now() instead of deprecated GetTime().

in src/test/util/setup_common.cpp:142 in 6c7b3bf0ce outdated

142     if (!m_node.args->IsArgSet("-testdatadir")) {
143-        // By default, the data directory has a random name
144-        const auto rand_str{g_rng_temp_path.rand256().ToString()};
145-        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / rand_str;
146+        const auto now{TicksSinceEpoch<std::chrono::nanoseconds>(SystemClock::now())};
147+        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / test_name / util::ToString(now);

hodlinator commented at 4:19 pm on November 11, 2024:

nit: Could potentially move the ToString call up with the rest of the calculation for that value.

0        const auto now{util::ToString(TicksSinceEpoch<std::chrono::nanoseconds>(SystemClock::now()))};
1        m_path_root = fs::temp_directory_path() / TEST_DIR_PATH_ELEMENT / test_name / now;

furszy commented at 4:35 pm on November 11, 2024:

I prefer the current approach to not over-extend the first line. Even with the ToString call, the second line is simpler to read.

in src/test/util/setup_common.cpp:144 in 6c7b3bf0ce outdated

137@@ -139,10 +138,10 @@ BasicTestingSetup::BasicTestingSetup(const ChainType chainType, TestOpts opts)
138     // data directories use a random name that doesn't overlap with other tests.
139     SeedRandomForTest(SeedRand::FIXED_SEED);
140 
141+    const std::string test_name{G_TEST_GET_FULL_NAME ? G_TEST_GET_FULL_NAME() : ""};
142     if (!m_node.args->IsArgSet("-testdatadir")) {
143-        // By default, the data directory has a random name
144-        const auto rand_str{g_rng_temp_path.rand256().ToString()};

hodlinator commented at 4:21 pm on November 11, 2024:

As g_rng_temp_path is no longer used, it should be removed completely from line 78-79.

furszy commented at 4:32 pm on November 11, 2024:

Yeah good, done as suggested. Thanks.

hodlinator changes_requested

DrahtBot requested review from hodlinator on Nov 11, 2024

test, bench: specialize working directory name

Since G_TEST_GET_FULL_NAME is not initialized in the benchmark framework,
benchmarks using the unit test setup run in the same directory without
any clear distinction between them.
This poses an extra complication for locating any specific benchmark
directory during a failure.

In master, unit tests and benchmarks run in the following path:
/<OS_tmp_dir>/test_common bitcoin/<random_uint256>/

After this commit, unit tests and benchmarks are contained within its
own directory:
/<OS_tmp_dir>/test_common bitcoin/<test_name>/<time_in_nanoseconds>/

This makes it easier to find any benchmark run when a failure occurs.

ad9c2cceda

bench: add support for custom data directory

Expands the benchmark framework with the existing '-testdatadir' arg,
enabling the ability to change the benchmark data directory.

This is useful for running benchmarks on different storage devices, and
not just under the OS /tmp/ directory.

fa66e0887c

furszy force-pushed on Nov 11, 2024

furszy commented at 4:36 pm on November 11, 2024: member

Updated per feedback. Thanks. Removed unused global g_rng_temp_path from the setup common file.

tdb3 approved

tdb3 commented at 6:19 pm on November 11, 2024: contributor

re ACK fa66e0887ca1a1445d8b18ba1fadb12b2d911048

hodlinator approved

hodlinator commented at 8:01 pm on November 11, 2024: contributor

re-ACK fa66e0887ca1a1445d8b18ba1fadb12b2d911048

git range-diff master 80d35bc fa66e08

Decimal time since epoch replacing random 256-bit value

Sample of recent now-value is 1731341566994144855.

According to https://www.epochconverter.com/ we won’t get another digit until year 2286, so path lengths seem quite stable.

Switching to hex representation does give us constant lengths from now until unsigned 64-bit max, but only shaves off 3 chars, which is not that big of a win.

Determinism

The paths were made more deterministic in 97e16f57042cab07e5e73f6bed19feec2006e4f7, but I suspect it’s not that much of a priority.

I tried to follow through on the determinism intent in my #30737 but the approach in your PR should also fix #30696 (might be worth mentioning that you’re fixing that issue as a bonus in the PR summary! :) ).

`TEST_DIR_PATH_ELEMENT`

Could potentially still rename TEST_DIR_PATH_ELEMENT -> TESTS_DIR_NAME as previously mentioned.

Testing

Ran build/src/test/test_bitcoin and lsof |grep "/tmp.*bitcoin" in parallel.

maflcko commented at 3:46 pm on November 13, 2024: member

re-ACK fa66e0887ca1a1445d8b18ba1fadb12b2d911048

Only change is removing unused g_rng_temp_path

pablomartin4btc approved

pablomartin4btc commented at 4:20 pm on November 13, 2024: member

re-ACK fa66e0887ca1a1445d8b18ba1fadb12b2d911048

Since my last review: removal of g_rng_temp_path, renaming and moving definition of SetupCommonTestArgs to src/test/util/setup_common.h and using the timestamp for the test dir path instead of a random id.

achow101 commented at 11:45 pm on November 13, 2024: member

ACK fa66e0887ca1a1445d8b18ba1fadb12b2d911048

achow101 merged this on Nov 14, 2024

achow101 closed this on Nov 14, 2024

furszy deleted the branch on Nov 14, 2024

TheCharlatan referenced this in commit a73b2bd0f0 on Nov 14, 2024

fanquake referenced this in commit e122309958 on Nov 20, 2024

bench: add support for custom data directory #31000

Code Coverage & Benchmarks

Reviews

Conflicts

Concept

Tested

Decimal time since epoch replacing random 256-bit value

Determinism

TEST_DIR_PATH_ELEMENT

Testing

`TEST_DIR_PATH_ELEMENT`