Don’t wipe coins cache when full and instead evict LRU clean entries #31102

pull andrewtoth wants to merge 7 commits into bitcoin:master from andrewtoth:prune-cache changing 12 files +309 −92
  1. andrewtoth commented at 2:56 pm on October 16, 2024: contributor

    Depends on #28939.

    This only wipes the coins cache if memory usage is greater than total allowed cache size. Instead, it does a non-wiping sync to disk and keeps all unspent coins in the cache. These are tracked in a linked list of clean entries, and when spent are removed from the clean linked list and appended to the dirty linked list. This results in the head of the clean linked list containing the oldest clean entry.

    When the cache grows to above the large size threshold, clean entries beginning from the head of the linked list are evicted until the cache is below the threshold. This lets the cache maintain a size at or below the large threshold as long as clean entries exist in the cache. When there are no more clean entries, the non-wiping sync removes all spent entries and cleans all non-spent entries. This reduces the size of the cache and allows the previously dirty unspent entries to be evicted as needed.

    The pool allocator is modified to add an accounting field to track the amount of free bytes. However, if the in-memory footprint of the cache grows larger than allowed it will wipe and reallocate the cache. This will happen if the mempool fills up and the cache was using the unused mempool space. It will also be reallocated when resizing caches during assumeutxo syncing.

  2. memusage: move MallocUsage into separate file
    That way it becomes usable in other code like allocators/pool
    42a68d41bf
  3. memusage: pool keeps track of its memory usage
    That way all containers that use the pool have accurate memory tracking.
    
    Add test to show memory is accurately tracked, even when nodes can't be allocated by the pool.
    The more accurate memory estimation interferes shows that our memory estimation for Windows
    is off, as the real allocated memory is much higher. This adapts the memusage_test test so it still
    works with the more correct estimation.
    49010d48e2
  4. DrahtBot commented at 2:56 pm on October 16, 2024: contributor

    The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

    Code Coverage

    For detailed information about the code coverage, see the test coverage report.

    Reviews

    See the guideline for information on the review process. A summary of reviews will appear here.

    Conflicts

    Reviewers, this pull request conflicts with the following ones:

    • #30906 (refactor: prohibit direct flags access in CCoinsCacheEntry and remove invalid tests by l0rinc)
    • #30849 (refactor: migrate bool GetCoin to return optional<Coin> by l0rinc)
    • #30673 (coins: remove logic for spent-and-FRESH cache entries and writing non-DIRTY entries by andrewtoth)
    • #30611 (validation: write chainstate to disk every hour by andrewtoth)
    • #30610 (validation: do not wipe utxo cache for stats/scans/snapshots by sipa)
    • #29641 (scripted-diff: Use LogInfo over LogPrintf [WIP, NOMERGE, DRAFT] by maflcko)
    • #28531 (improve MallocUsage() accuracy by LarryRuane)

    If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

  5. DrahtBot renamed this:
    Don't wipe coins cache when full and instead evict LRU clean entries
    Don't wipe coins cache when full and instead evict LRU clean entries
    on Oct 16, 2024
  6. andrewtoth force-pushed on Oct 16, 2024
  7. DrahtBot commented at 4:28 pm on October 16, 2024: contributor

    🚧 At least one of the CI tasks failed. Debug: https://github.com/bitcoin/bitcoin/runs/31623076099

    Try to run the tests locally, according to the documentation. However, a CI failure may still happen due to a number of reasons, for example:

    • Possibly due to a silent merge conflict (the changes in this pull request being incompatible with the current code in the target branch). If so, make sure to rebase on the latest commit of the target branch.

    • A sanitizer issue, which can only be found by compiling with the sanitizer and running the affected test.

    • An intermittent issue.

    Leave a comment here, if you need help tracking down a confusing failure.

  8. DrahtBot added the label CI failed on Oct 16, 2024
  9. in src/coins.cpp:46 in 34807641d8 outdated
    41@@ -42,6 +42,10 @@ size_t CCoinsViewCache::DynamicMemoryUsage() const {
    42     return memusage::DynamicUsage(cacheCoins) + cachedCoinsUsage;
    43 }
    44 
    45+size_t CCoinsViewCache::DynamicMemoryAvailableSpace() const {
    


    l0rinc commented at 4:29 pm on October 16, 2024:
    nit: can we unify the formatting here? edit: also, std::size_t vs size_t vs 64 bit and signed and unsigned conversions … can we unify these in a separate commit or pr to avoid distractioins and compiler failures
  10. in src/validation.h:488 in 0422b9bc61 outdated
    483@@ -484,7 +484,9 @@ class CoinsViews {
    484 enum class CoinsCacheSizeState
    485 {
    486     //! The coins cache is in immediate need of a flush.
    487-    CRITICAL = 2,
    488+    CRITICAL = 3,
    489+    //! The coins cache is at >= 99% capacity.
    


    l0rinc commented at 4:36 pm on October 16, 2024:
    it seemed to me that often we’re only only checking the memory after we’re finished with certain batches. 99% seems to close to 100% and I’m concerned the second one may be triggered while we’re handling the first one somehow. Would it be safer to lower this to 95%?
  11. in src/coins.h:144 in 298b2f1e30 outdated
    139+    }
    140+
    141+    inline void RemoveFromLinkedList() noexcept
    142+    {
    143+        if (m_next == nullptr) return;
    144+            m_next->second.m_prev = m_prev;
    


    l0rinc commented at 4:37 pm on October 16, 2024:
    formatting is really confusing here (looks like it’s part of the condition, but there’s a semicolon)
  12. l0rinc commented at 4:43 pm on October 16, 2024: contributor
    Since this depends on other PRs, would it make sense to draft it until those are merged and CI fixed? Edit: dependent PR was closed, I’ll review it here instead
  13. l0rinc commented at 11:04 pm on October 16, 2024: contributor

    While the build apparently hasn’t decided how to handle casts, the benchmarks are looking quite promising already:

    0hyperfine \
    1--runs 1 \
    2--export-json /mnt/my_storage/ibd_full-prune.json \
    3--parameter-list COMMIT 48cf3da636089873ba7280e0d5b22eb81811d194,0391844fa123a47ddf479e865f9eec63f28a4c6c \
    4--prepare 'rm -rf /mnt/my_storage/BitcoinData/* && git checkout {COMMIT} && git clean -fxd && git reset --hard && cmake -B build -DCMAKE_BUILD_TYPE=Release -DBUILD_UTIL=OFF -DBUILD_TX=OFF -DBUILD_TESTS=OFF -DENABLE_WALLET=OFF -DINSTALL_MAN=OFF && cmake --build build -j$(nproc)' \
    5'COMMIT={COMMIT} ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=400000 -dbcache=1000 -printtoconsole=0'
    
    0Benchmark 1: COMMIT=48cf3da636089873ba7280e0d5b22eb81811d194 ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=400000 -dbcache=1000 -printtoconsole=0
    1  Time (abs ≡):        3234.108 s               [User: 2907.369 s, System: 286.357 s]
    2
    3Benchmark 2: COMMIT=0391844fa123a47ddf479e865f9eec63f28a4c6c ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=400000 -dbcache=1000 -printtoconsole=0
    4  Time (abs ≡):        2877.113 s               [User: 2945.860 s, System: 247.616 s]
    5
    6Summary
    7  'COMMIT=0391844fa123a47ddf479e865f9eec63f28a4c6c ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=400000 -dbcache=1000 -printtoconsole=0' ran
    8    1.12 times faster than 'COMMIT=48cf3da636089873ba7280e0d5b22eb81811d194 ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=400000 -dbcache=1000 -printtoconsole=0'
    

    i.e. until 400k blocks we may be 12% faster. Will run more thorough benches later.


    edit:

    0hyperfine --runs 1 --export-json /mnt/my_storage/ibd_full-prune.json --parameter-list COMMIT 48cf3da636089873ba7280e0d5b22eb81811d194,590ffa2508460d7dc6c528b69889c4d696d8147d  --prepare 'rm -rf /mnt/my_storage/BitcoinData/* && git checkout {COMMIT} && git clean -fxd && git reset --hard && cmake -B build -DCMAKE_BUILD_TYPE=Release -DBUILD_UTIL=OFF -DBUILD_TX=OFF -DBUILD_TESTS=OFF -DENABLE_WALLET=OFF -DINSTALL_MAN=OFF && cmake --build build -j$(nproc)' 'COMMIT={COMMIT} ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=500000 -dbcache=2000 -printtoconsole=0'
    
    0Benchmark 1: COMMIT=48cf3da636089873ba7280e0d5b22eb81811d194 ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=500000 -dbcache=2000 -printtoconsole=0
    1  Time (abs ≡):        6948.336 s               [User: 6962.943 s, System: 720.546 s]
    2
    3Benchmark 2: COMMIT=590ffa2508460d7dc6c528b69889c4d696d8147d ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=500000 -dbcache=2000 -printtoconsole=0
    4  Time (abs ≡):        7706.105 s               [User: 7380.817 s, System: 696.713 s]
    5
    6Summary
    7  'COMMIT=48cf3da636089873ba7280e0d5b22eb81811d194 ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=500000 -dbcache=2000 -printtoconsole=0' ran
    8    1.11 times faster than 'COMMIT=590ffa2508460d7dc6c528b69889c4d696d8147d ./build/src/bitcoind -datadir=/mnt/my_storage/BitcoinData -stopatheight=500000 -dbcache=2000 -printtoconsole=0'
    

    i.e. 11% slower for 500k and 2000 dbcache :/. We will have to find out when it’s faster and when it’s slower.

  14. memusage: track available memory in pool allocator free list 49df654e22
  15. memusage: get available memory usage in coins cache 327ae18e9a
  16. memusage: add extra large cache size state and threshold getters c592611bd9
  17. coins: track clean cache entries in clean linked list dc3e3c1fd0
  18. coins: evict LRU clean entries when cache is large 590ffa2508
  19. andrewtoth force-pushed on Oct 16, 2024
  20. andrewtoth marked this as a draft on Oct 18, 2024
  21. andrewtoth commented at 3:11 pm on October 22, 2024: contributor

    Not sure this approach is worth it anymore. For a full IBD I got about 2% faster. It seems that keeping non-dirty entries in the cache to be read from if spent is not that much benefit, since the likelihood of one of those entries being spent before evicted is not high enough.

    I think #31132 is a better approach, because cache misses are fetched in parallel much faster so the misses are not as important.

    I still think #28939 should be revived though. It is important to also track the memory used outside of the chunks. Right now that memory is not accounted for and for a large bucket size the memory used is significant.

  22. andrewtoth closed this on Oct 22, 2024

  23. l0rinc commented at 11:30 am on January 2, 2025: contributor

    I think #31132 is a better approach, because cache misses are fetched in parallel much faster so the misses are not as important.

    I see the two changes as orthogonal - the main goal of this change as far as I understand is to avoid ReallocateCache calls (deleting unlikely entries to avoid filling the cache), while the #31132 is mostly about cache warming (adding likely ones to the cache in parallel).

    But as you’ve suggested, let’s start with reviving #28939 - I’ll separate it to a new PR.


github-metadata-mirror

This is a metadata mirror of the GitHub repository bitcoin/bitcoin. This site is not affiliated with GitHub. Content is generated from a GitHub metadata backup.
generated: 2025-01-10 03:12 UTC

This site is hosted by @0xB10C
More mirrored repositories can be found on mirror.b10c.me