refactor: Replace RecursiveMutex with Mutex in CTxMemPool #19306

hebasto commented at 4:15 PM on June 17, 2020: member

This PR replaces RecursiveMutex CTxMemPool::cs with Mutex CTxMemPool::cs.

All of the related code branches are covered by appropriate lock assertions to ensure that the mutex locking policy has not been changed by accident.
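As a rough illustration of what such lock assertions check, here is a minimal sketch in standard C++. It is not the code from this PR: `CheckedMutex`, `HeldByCurrentThread()` and `Pool` are hypothetical names, and Bitcoin Core relies on its own macros (`AssertLockHeld`, `AssertLockNotHeld`) rather than on this class.

```cpp
#include <atomic>
#include <cassert>
#include <mutex>
#include <thread>

// Hypothetical mutex wrapper that can answer "does the calling thread hold me?".
class CheckedMutex
{
    std::mutex m_mutex;
    std::atomic<std::thread::id> m_owner{std::thread::id{}};

public:
    void lock()
    {
        m_mutex.lock();
        m_owner.store(std::this_thread::get_id());
    }
    void unlock()
    {
        m_owner.store(std::thread::id{});
        m_mutex.unlock();
    }
    bool HeldByCurrentThread() const { return m_owner.load() == std::this_thread::get_id(); }
};

// Hypothetical container guarded by a non-recursive mutex.
class Pool
{
    mutable CheckedMutex m_cs;
    int m_count{0};

    // Precondition: the caller already holds m_cs.
    void AddLocked()
    {
        assert(m_cs.HeldByCurrentThread());
        ++m_count;
    }

public:
    // Precondition: the caller does NOT hold m_cs (locking twice would be a bug).
    void Add()
    {
        assert(!m_cs.HeldByCurrentThread());
        std::lock_guard<CheckedMutex> lock(m_cs);
        AddLocked();
    }

    int Count() const
    {
        assert(!m_cs.HeldByCurrentThread());
        std::lock_guard<CheckedMutex> lock(m_cs);
        return m_count;
    }
};
```

With assertions like these on every code path, an accidental change of the locking policy fails loudly in debug builds instead of silently re-locking.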

Related to #19303.

DrahtBot added the label GUI on Jun 17, 2020

DrahtBot added the label Mempool on Jun 17, 2020

DrahtBot added the label Mining on Jun 17, 2020

DrahtBot added the label P2P on Jun 17, 2020

DrahtBot added the label Refactoring on Jun 17, 2020

DrahtBot added the label RPC/REST/ZMQ on Jun 17, 2020

DrahtBot added the label Tests on Jun 17, 2020

DrahtBot added the label TX fees and policy on Jun 17, 2020

DrahtBot added the label Validation on Jun 17, 2020

DrahtBot commented at 11:20 PM on June 17, 2020: contributor

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Conflicts

Reviewers, this pull request conflicts with the following ones:

#19826 (Pass mempool reference to chainstate constructor by MarcoFalke)
#19806 (validation: UTXO snapshot activation by jamesob)
#19791 ([net processing] Move Misbehaving() to PeerManager by jnewbery)
#19753 (p2p: don't add AlreadyHave transactions to recentRejects by troygiorshev)
#19652 (Add thread safety annotations to Mempool{Info}ToJSON() by hebasto)
#19647 (Add thread safety annotations to CTxMemPool methods by hebasto)
#19645 (Allow updating mempool-txn with cheaper witnesses by ariard)
#19610 (p2p: refactor AlreadyHave(), CInv::type, INV/TX processing by jonatack)
#19572 (ZMQ: Create "sequence" notifier, enabling client-side mempool tracking by instagibbs)
#19544 (refactor: Add ParseBool to rpc/util by fjahr)
#19498 (Tidy up ProcessOrphanTx by jnewbery)
#19488 (Refactor mempool.dat to be extensible, and store missing info by luke-jr)
#19478 (Remove CTxMempool::mapLinks data structure member by JeremyRubin)
#19339 (validation: re-delegate absurd fee checking from mempool to clients by gzhao408)
#19093 (RPC: Return transaction fee from testmempoolaccept by rajarshimaitra)
#18191 (Change UpdateForDescendants to use Epochs by JeremyRubin)
#18017 (txmempool: split epoch logic into class by ajtowns)
#13990 (Allow fee estimation to work with lower fees by ajtowns)
#12677 (RPC: Add ancestor{count,size,fees} to listunspent output by luke-jr)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

DrahtBot cross-referenced this on Jun 17, 2020 from issue net: Avoid redundant and confusing FAILED log by MarcoFalke

DrahtBot cross-referenced this on Jun 18, 2020 from issue test: Check that peers with forcerelay permission are not asked to feefilter by MarcoFalke

DrahtBot cross-referenced this on Jun 18, 2020 from issue Overhaul transaction request logic by sipa

DrahtBot cross-referenced this on Jun 18, 2020 from issue refactor: replace CConnman pointers by references in net_processing.cpp by theStack

fanquake removed the label GUI on Jun 18, 2020

fanquake removed the label Mining on Jun 18, 2020

fanquake removed the label P2P on Jun 18, 2020

fanquake removed the label RPC/REST/ZMQ on Jun 18, 2020

fanquake removed the label TX fees and policy on Jun 18, 2020

fanquake removed the label Tests on Jun 18, 2020

fanquake removed the label Validation on Jun 18, 2020

DrahtBot cross-referenced this on Jun 18, 2020 from issue RPC: Return transaction fee from testmempoolaccept by rajarshimaitra

DrahtBot cross-referenced this on Jun 18, 2020 from issue net: Use mockable time for ping/pong, add tests by MarcoFalke

DrahtBot cross-referenced this on Jun 18, 2020 from issue coins: allow cache resize after init by jamesob

DrahtBot cross-referenced this on Jun 18, 2020 from issue rpc: remove deprecated CRPCCommand constructor by MarcoFalke

DrahtBot cross-referenced this on Jun 18, 2020 from issue Change UpdateForDescendants to use Epochs by JeremyRubin

DrahtBot cross-referenced this on Jun 18, 2020 from issue txmempool: split epoch logic into class by ajtowns

hebasto cross-referenced this on Jun 18, 2020 from issue Replace all of the RecursiveMutex instances with the Mutex ones by hebasto

hebasto commented at 9:04 AM on June 18, 2020: member

From IRC:

<sipa> eh, i agree - that change isn't a step in the right direction

@sipa Which direction is right?

hebasto commented at 9:28 AM on June 18, 2020: member

From IRC:

<jeremyrubin> I think it's basically getting rid of a recursive mutex for code that's still designed to take a recursive mutex
<gwillen> it's better than a true recursive mutex because it's not possible to recurse by accident, you have to declare at call time which behavior you want (although better if you had to declare statically at compile time)
<sipa> it looks like that
<jeremyrubin> The correct refactor would be to make the code not do anything fancy with locks, or to just leave it
<jeremyrubin> gwillen: I think the chances of a bug or error in custom logic is higher than a recursive mutex
<jeremyrubin> accidental recursion seems unlikely...
<jeremyrubin> and accidental recursion would be a bug lock or no
<gwillen> (sorry I mean, accidental mutex recursion, that is, calling a function while holding a mutex, not expecting the callee to also lock it, resulting in the callee violating the caller's invariants)
<gwillen> (this is the fundamental problem of recursive mutexes)
<gwillen> (and I assume the motivation behind hebasto's refactor)

Correct, @gwillen :)
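The accidental mutex recursion described above can be sketched in standard C++. This is an illustrative example only, not code from Bitcoin Core: the `Ledger` class and its invariant (the two balances always sum to 100) are assumptions made for the demonstration.

```cpp
#include <cassert>
#include <mutex>

// Hypothetical class whose invariant is: m_a + m_b == 100 whenever m_cs is free.
class Ledger
{
    mutable std::recursive_mutex m_cs;
    int m_a{60};
    int m_b{40};

public:
    // Locks m_cs and reports whether the invariant holds.
    // A reader would expect this to be true every time the lock is acquired.
    bool Consistent() const
    {
        std::lock_guard<std::recursive_mutex> lock(m_cs);
        return m_a + m_b == 100;
    }

    // Moves `amount` from a to b and returns what Consistent() observed
    // when it was called in the middle of the update.
    bool Transfer(int amount)
    {
        std::lock_guard<std::recursive_mutex> lock(m_cs);
        m_a -= amount;                             // invariant temporarily broken
        const bool seen_mid_update = Consistent(); // re-locks silently on the same thread
        m_b += amount;                             // invariant restored
        return seen_mid_update;
    }
};
```

Here `Transfer()` returns `false`: the nested call acquired the lock without complaint and saw a half-finished update. With a plain `std::mutex`, the nested `lock()` would be undefined behavior (in practice usually a deadlock), so the mistake would surface instead of going unnoticed.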

hebasto commented at 9:31 AM on June 18, 2020: member

Looking for Concept (N)ACKs before splitting this into smaller pull requests that are easier to review.

hebasto marked this as ready for review on Jun 18, 2020

DrahtBot cross-referenced this on Jun 18, 2020 from issue Allow fee estimation to work with lower fees by ajtowns

MarcoFalke commented at 11:29 AM on June 18, 2020: member

Currently this adds a lot of code complexity. Also, it adds mental complexity to write code that doesn't crash or deadlock/UB whereas the RecursiveMutex just works (TM).

Maybe there is a way to achieve the same without the added complexity? For example, always force the caller to take the lock for the right scope. This would also solve issues of non-atomic RPC responses or at least make them more visible. Though, it makes caller code slightly more verbose.
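One way to picture "force the caller to take the lock for the right scope" is sketched below in standard C++. All names (`Pool`, `Add`, `Count`, `TotalFee`, `Snapshot`) are hypothetical, and passing a `std::unique_lock` as proof of ownership is just one possible technique; Bitcoin Core expresses the same requirement with Clang thread safety annotations such as `EXCLUSIVE_LOCKS_REQUIRED` instead.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <utility>
#include <vector>

// Hypothetical pool whose methods never lock internally.
class Pool
{
    std::vector<int> m_fees;

public:
    std::mutex cs; // public so that callers choose the locking scope

    // Every method takes the caller's lock as evidence that `cs` is held.
    void Add(const std::unique_lock<std::mutex>& held, int fee)
    {
        assert(held.mutex() == &cs && held.owns_lock());
        m_fees.push_back(fee);
    }

    std::size_t Count(const std::unique_lock<std::mutex>& held) const
    {
        assert(held.mutex() == &cs && held.owns_lock());
        return m_fees.size();
    }

    int TotalFee(const std::unique_lock<std::mutex>& held) const
    {
        assert(held.mutex() == &cs && held.owns_lock());
        int total = 0;
        for (const int fee : m_fees) total += fee;
        return total;
    }
};

// Caller side: one lock scope covers both reads, so the pair is a consistent snapshot.
std::pair<std::size_t, int> Snapshot(Pool& pool)
{
    std::unique_lock<std::mutex> lock(pool.cs);
    return {pool.Count(lock), pool.TotalFee(lock)};
}
```

The caller code is more verbose, as noted above, but the scope over which the result is atomic is visible at the call site.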

hebasto commented at 11:38 AM on June 18, 2020: member

> Currently this adds a lot of code complexity. Also, it adds mental complexity to write code that doesn't crash or deadlock/UB whereas the RecursiveMutex just works (TM).

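When a function locks a RecursiveMutex, there is no guarantee that the protected invariants hold at the point of locking, because the calling thread may already hold the lock in the middle of an update. This adds mental complexity when reading the code.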

> Maybe there is a way to achieve the same without the added complexity? For example, always force the caller to take the lock for the right scope. This would also solve issues of non-atomic RPC responses or at least make them more visible.

I assume these steps are much easier with a non-recursive mutex, no?

DrahtBot cross-referenced this on Jun 19, 2020 from issue RPC: Add ancestor{count,size,fees} to listunspent output by luke-jr

DrahtBot added the label Needs rebase on Jun 19, 2020

gwillen commented at 7:34 PM on June 19, 2020: contributor

@hebasto Do you have any thoughts on "function takes bool locked" versus "split function into _locked and _unlocked variants"?

In some of the cases here, I see that the latter would require some (maybe substantial) refactoring to avoid code duplication, and I assume that's why you didn't go that route. But that would not only eliminate conditional locking (which is scary), it would probably allow the use of RAII locks. I'm curious to hear your thoughts.
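The two styles being compared can be sketched side by side in standard C++. This is a hypothetical example (the `Pool` class and its method names are made up for illustration), not the code touched by this PR.

```cpp
#include <cassert>
#include <mutex>
#include <set>

// Hypothetical pool guarded by a non-recursive mutex.
class Pool
{
    mutable std::mutex m_cs;
    std::set<int> m_txids;

    // "_locked" variant: the caller must already hold m_cs.
    bool ExistsLocked(int txid) const { return m_txids.count(txid) > 0; }

public:
    // Style A, "function takes bool locked": locking is decided by a runtime flag.
    // Passing the wrong flag either races (true while unlocked) or double-locks.
    bool ExistsFlag(int txid, bool already_locked) const
    {
        if (!already_locked) m_cs.lock();
        const bool found = m_txids.count(txid) > 0;
        if (!already_locked) m_cs.unlock();
        return found;
    }

    // Style B, split variants: the public entry point takes the lock with RAII
    // and delegates to the "_locked" variant.
    bool Exists(int txid) const
    {
        std::lock_guard<std::mutex> lock(m_cs);
        return ExistsLocked(txid);
    }

    // Code that already holds the lock calls the "_locked" variant directly.
    bool AddIfMissing(int txid)
    {
        std::lock_guard<std::mutex> lock(m_cs);
        if (ExistsLocked(txid)) return false;
        m_txids.insert(txid);
        return true;
    }
};
```

In style B the choice between the variants is made at compile time by which function is called, so there is no conditional locking and every acquisition is scoped.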

hebasto commented at 12:41 PM on June 20, 2020: member

@gwillen

> @hebasto Do you have any thoughts on "function takes bool locked" versus "split function into _locked and _unlocked variants"?

For new code I prefer clean "split function into _locked and _unlocked variants" like #19238 (review). Unfortunately, that is not the case in this PR.

> In some of the cases here, I see that the latter would require some (maybe substantial) refactoring to avoid code duplication, and I assume that's why you didn't go that route.

You are correct. I've used the "function takes bool locked" approach in 72f7486b5ebe96762c5d5a68849c61e58c812ffd and 21787abedd7a0808c0175a0b1d795df97fd3b970 as a quick-and-dirty solution to fix broken tests.

> But that would not only eliminate conditional locking (which is scary), it would probably allow the use of RAII locks. I'm curious to hear your thoughts.

Hmm... In 21787abedd7a0808c0175a0b1d795df97fd3b970 I've had to drop the RAII lock in favor of an ENTER_CRITICAL_SECTION() / LEAVE_CRITICAL_SECTION() pair. That is why I don't like this approach.
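The cost of giving up the RAII lock shows up on early-exit paths. The sketch below uses plain `std::mutex::lock()` / `unlock()` as a stand-in for the ENTER_CRITICAL_SECTION() / LEAVE_CRITICAL_SECTION() pair; the functions and globals are hypothetical and are not taken from the commit mentioned above.

```cpp
#include <cassert>
#include <mutex>

std::mutex g_cs;
int g_value{0};

// Manual pair with conditional locking: every exit path must repeat the unlock.
bool SetManual(int v, bool already_locked)
{
    if (!already_locked) g_cs.lock();
    if (v < 0) {
        if (!already_locked) g_cs.unlock(); // forgetting this line leaks the lock
        return false;
    }
    g_value = v;
    if (!already_locked) g_cs.unlock();
    return true;
}

// RAII lock: released automatically on every exit path, including exceptions.
bool SetRaii(int v)
{
    std::lock_guard<std::mutex> lock(g_cs);
    if (v < 0) return false;
    g_value = v;
    return true;
}
```

Both functions behave the same when written correctly; the difference is that the manual version stays correct only as long as every future early return remembers the unlock.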

hebasto force-pushed on Jun 20, 2020

hebasto commented at 1:04 PM on June 20, 2020: member

Rebased 4e00526c689c4164d02d8ca76331f3ed5da7b13c -> c9e7d011d69bb1ef965945bf90d7441165430808 (pr19306.01 -> pr19306.02) due to the conflict with #19293.

DrahtBot removed the label Needs rebase on Jun 20, 2020

DrahtBot cross-referenced this on Jun 20, 2020 from issue validation: re-delegate absurd fee checking from mempool to clients by glozow

DrahtBot added the label Needs rebase on Jun 21, 2020

hebasto force-pushed on Jun 23, 2020

hebasto commented at 5:30 PM on June 23, 2020: member

Rebased c9e7d011d69bb1ef965945bf90d7441165430808 -> 3fc8fa23fc4ff2978c89bba46f08e746b6e4c154 (pr19306.02 -> pr19306.03) due to the conflicts with #18027 and #19198.

DrahtBot removed the label Needs rebase on Jun 23, 2020

DrahtBot cross-referenced this on Jun 23, 2020 from issue net processing: Move orphan reprocessing to a global by jnewbery

DrahtBot cross-referenced this on Jun 23, 2020 from issue build: Do not include server symbols in wallet by MarcoFalke

DrahtBot added the label Needs rebase on Jun 24, 2020

hebasto force-pushed on Jun 24, 2020

hebasto commented at 6:17 PM on June 24, 2020: member

Rebased 3fc8fa23fc4ff2978c89bba46f08e746b6e4c154 -> ff3d969891b0687219906f18e66c5bb499915968 (pr19306.03 -> pr19306.04) due to the conflict with https://github.com/bitcoin-core/gui/pull/11.

DrahtBot removed the label Needs rebase on Jun 24, 2020

DrahtBot cross-referenced this on Jun 25, 2020 from issue Use wtxid for transaction relay by sdaftuar

DrahtBot cross-referenced this on Jun 27, 2020 from issue test: Add test for wtxid transaction relay by fjahr

DrahtBot added the label Needs rebase on Jul 1, 2020

hebasto force-pushed on Jul 3, 2020

hebasto commented at 9:12 AM on July 3, 2020: member

Rebased ff3d969891b0687219906f18e66c5bb499915968 -> 511670449669116df5488cd4f807de620e55a7e3 (pr19306.04 -> pr19306.05) due to the conflict with #19331.

DrahtBot removed the label Needs rebase on Jul 3, 2020

DrahtBot cross-referenced this on Jul 9, 2020 from issue Accurately account for mempool index memory by sipa

DrahtBot cross-referenced this on Jul 10, 2020 from issue Remove CTxMempool::mapLinks data structure member by JeremyRubin

DrahtBot cross-referenced this on Jul 10, 2020 from issue doc: Use precise permission flags where possible by MarcoFalke

DrahtBot added the label Needs rebase on Jul 11, 2020

hebasto force-pushed on Jul 16, 2020

hebasto commented at 6:57 PM on July 16, 2020: member

Rebased 511670449669116df5488cd4f807de620e55a7e3 -> 0c03cea32d3ab30a58e62bbe42af6ebef016ede4 (pr19306.05 -> pr19306.06) due to the conflicts with #19174 and #19474.

DrahtBot removed the label Needs rebase on Jul 16, 2020

DrahtBot cross-referenced this on Jul 16, 2020 from issue Tidy up ProcessOrphanTx by jnewbery

DrahtBot cross-referenced this on Jul 16, 2020 from issue Refactor mempool.dat to be extensible, and store missing info by luke-jr

DrahtBot cross-referenced this on Jul 17, 2020 from issue Revert "refactor: replace CConnman pointers by references in net_processing.cpp" by laanwj

DrahtBot cross-referenced this on Jul 18, 2020 from issue refactor: Add ParseBool to rpc/util by fjahr

DrahtBot cross-referenced this on Jul 19, 2020 from issue validation: Warm coins cache during prevalidation to connect blocks faster by andrewtoth

DrahtBot cross-referenced this on Jul 20, 2020 from issue Remove mempool global by MarcoFalke

DrahtBot added the label Needs rebase on Jul 22, 2020

hebasto force-pushed on Jul 29, 2020

hebasto commented at 11:32 AM on July 29, 2020: member

Rebased 0c03cea32d3ab30a58e62bbe42af6ebef016ede4 -> 656cba72f475603497e318ed3f01db4ab694b2af (pr19306.06 -> pr19306.07) due to the conflicts with #18044 and #18637.

DrahtBot removed the label Needs rebase on Jul 29, 2020

DrahtBot cross-referenced this on Jul 29, 2020 from issue Pass mempool pointer to UnloadBlockIndex/GetCoinsCacheSizeState by MarcoFalke

DrahtBot cross-referenced this on Jul 29, 2020 from issue Deduplicate parent txid loop of requested transactions and missing parents of orphan transactions by sdaftuar

DrahtBot cross-referenced this on Jul 29, 2020 from issue ZMQ: Create "sequence" notifier, enabling client-side mempool tracking by instagibbs

DrahtBot cross-referenced this on Jul 29, 2020 from issue Enable fetching of orphan parents from wtxid peers by sipa

DrahtBot cross-referenced this on Jul 29, 2020 from issue Erlay: bandwidth-efficient transaction relay protocol by naumenkogs

DrahtBot cross-referenced this on Jul 29, 2020 from issue [RFC] Package-relay: sender-initiated by ariard

DrahtBot cross-referenced this on Jul 30, 2020 from issue Disable fee estimation in blocksonly mode (by removing the fee estimates global) by darosior

DrahtBot added the label Needs rebase on Jul 30, 2020

hebasto force-pushed on Aug 2, 2020

hebasto commented at 9:50 PM on August 2, 2020: member

Rebased 656cba72f475603497e318ed3f01db4ab694b2af -> e23248c51c092269a33cde7ad0ff70a815876396 (pr19306.07 -> pr19306.08) due to the conflicts with #18011, #19569, and #19604.

hebasto cross-referenced this on Aug 2, 2020 from issue Add thread safety annotations to CTxMemPool methods by hebasto

hebasto commented at 10:47 PM on August 2, 2020: member

Some commits have been split out into #19647, so please start reviewing from there.

DrahtBot cross-referenced this on Aug 3, 2020 from issue Allow updating mempool-txn with cheaper witnesses by ariard

DrahtBot cross-referenced this on Aug 3, 2020 from issue refactor: Keep mempool interface in validation by MarcoFalke

hebasto cross-referenced this on Aug 3, 2020 from issue Avoid locking CTxMemPool::cs recursively in Mempool{Info}ToJSON() by hebasto

DrahtBot removed the label Needs rebase on Aug 3, 2020

DrahtBot cross-referenced this on Aug 7, 2020 from issue Run clang-tidy -*,performance-* by Warchant

DrahtBot cross-referenced this on Aug 8, 2020 from issue p2p: refactor AlreadyHave(), CInv::type, INV/TX processing by jonatack

DrahtBot added the label Needs rebase on Aug 10, 2020

Wiphawee112 approved

Wiphawee112 commented at 1:57 PM on August 19, 2020: none

hebasto force-pushed on Aug 21, 2020

hebasto commented at 6:56 AM on August 21, 2020: member

Rebased e23248c51c092269a33cde7ad0ff70a815876396 -> 95b4daa91ea4eecd3f345a20f285fb0528f5070d (pr19306.08 -> pr19306.09) due to the conflict with #19569.

DrahtBot removed the label Needs rebase on Aug 21, 2020

DrahtBot cross-referenced this on Aug 21, 2020 from issue p2p: don't add AlreadyHave transactions to recentRejects by troygiorshev

DrahtBot cross-referenced this on Aug 21, 2020 from issue Net processing: move ProcessMessage() to PeerLogicValidation by jnewbery

DrahtBot added the label Needs rebase on Aug 24, 2020

hebasto force-pushed on Aug 24, 2020

hebasto commented at 7:14 PM on August 24, 2020: member

Rebased 95b4daa91ea4eecd3f345a20f285fb0528f5070d -> 1faf43ac3b2cb2a116f501e56c2cd6fed903409c (pr19306.09 -> pr19306.10) due to the conflict with #19704.

DrahtBot removed the label Needs rebase on Aug 24, 2020

DrahtBot cross-referenced this on Aug 24, 2020 from issue [net processing] Move Misbehaving() to PeerManager by jnewbery

DrahtBot cross-referenced this on Aug 26, 2020 from issue validation: UTXO snapshot activation by jamesob

DrahtBot cross-referenced this on Aug 28, 2020 from issue Pass mempool reference to chainstate constructor by MarcoFalke

DrahtBot added the label Needs rebase on Aug 31, 2020

hebasto cross-referenced this on Sep 1, 2020 from issue Avoid locking CTxMemPool::cs recursively in simple cases by hebasto

laanwj referenced this in commit 99a8eb6051 on Sep 4, 2020

hebasto marked this as a draft on Sep 4, 2020

hebasto cross-referenced this on Sep 4, 2020 from issue Avoid locking CTxMemPool::cs recursively in some cases by hebasto

refactor: Prevent double lock in MempoolToJSON() 8f1767d115

refactor: Add thread context to AcceptToMemoryPool() call sites c903bf7389

refactor: Prevent double lock in AcceptToMemoryPool() 7878cfc4af

refactor: Specify CChainState::FlushStateToDisk() by mempool mutex state b3d44b9dca

refactor: Avoid excessive mempool locking in FlushStateToDiskHelper() afcf9161ce

refactor: Remove excessive locking in CTxMemPool methods 52ec60d7bd

refactor: Make CTxMemPool::ClearPrioritisation() private f2825fa7b5

refactor: Add negative thread safety annotations to CTxMemPool ec98daf0ab

refactor: Prevent double lock in CTxMemPool::check() 26da318c12

refactor: Prevent double lock in CTxMemPool::clear() 87d624c46c

refactor: Prevent double lock in CTxMemPool::isSpent() b9b8dfd957

refactor: Prevent double lock in CTxMemPool::GetMinFee() 12922a0b65

refactor: Prevent double lock in CTxMemPool::GetTransactionAncestry() a249d59dc7

refactor: Prevent double lock in CTxMemPool::IsLoaded() 89dfadc589

refactor: Prevent double lock in CTxMemPool::size() e1eab65372

refactor: Prevent double lock in CTxMemPool::exists() ae48014680

refactor: Prevent double lock in CTxMemPool::get() ce19fb90d4

refactor: Prevent double lock in CTxMemPool::infoAll() 5eac81656b

refactor: Prevent double lock in CTxMemPool::DynamicMemoryUsage() 1b68b74485

refactor: Prevent double lock in CTxMemPool::RemoveUnbroadcastTx() a2eb36cb98

refactor: Prevent double lock in CTxMemPool::GetUnbroadcastTxs() cbcdf19e49

refactor: Prevent double lock in PartiallyDownloadedBlock::InitData() 5639ecbb7d

refactor: Prevent double lock in BlockAssembler::CreateNewBlock(() c37916a81b

refactor: Prevent double lock in CheckInputsFromMempoolAndCache() e6a0279063

refactor: Replace RecursiveMutex with Mutex in CTxMemPool a887d73dcb

hebasto force-pushed on Sep 6, 2020

hebasto commented at 12:46 PM on September 6, 2020: member

Rebased 1faf43ac3b2cb2a116f501e56c2cd6fed903409c -> a887d73dcb05d59067635aff91baf85e0c7c7396 (pr19306.10 -> pr19306.12) due to the merge conflicts.

hebasto marked this as ready for review on Sep 6, 2020

DrahtBot removed the label Needs rebase on Sep 6, 2020

hebasto cross-referenced this on Sep 6, 2020 from issue Avoid locking CTxMemPool::cs recursively in CTxMemPool::DynamicMemoryUsage() by hebasto

hebasto marked this as a draft on Sep 6, 2020

sidhujag referenced this in commit 45b8840fd3 on Sep 9, 2020

hebasto closed this on Aug 24, 2021

bitcoin locked this on Aug 24, 2022