refactor: pass CTxMemPool and CFeeRate in-param objects by const reference #23076

pull jonatack wants to merge 1 commits into bitcoin:master from jonatack:feerate-and-txmempool-fixups changing 6 files +14 −12
  1. jonatack commented at 2:51 pm on September 23, 2021: contributor

    Rationale

    • maintainability: no need to change further as these classes evolve
    • clarity for developers reading the code and this description
    • performance in some cases

    Excerpts from Scott Meyers’ “Effective C++, Third Edition” (2005), Item 20, pages 86-94, and “Effective Modern C++” (2015), Item 41, pages 281-292:

    • Avoid passing objects of user-defined types by value.

    • One might conclude that all small types are good candidates for pass-by-value, even if they’re user-defined. However, just because an object is small doesn’t mean that calling its copy constructor is inexpensive. Many objects–most STL containers among them–contain little more than a pointer, but copying such objects entails copying everything they point to. That can be very expensive.

    • Even when small objects have inexpensive copy constructors, there can be performance issues. Some compilers treat built-in and user-defined types differently, even if they have the same underlying representation. When that type of thing happens, you can be better off passing such objects by reference, because compilers will certainly put pointers (the implementation of references) into registers.

    • Another reason why small user-defined types are not necessarily good pass-by-value candidates is that, being user-defined, their size is subject to change. A type that’s small now may be bigger in the future if its internal implementation changes. Things can even change when switching to a different C++ implementation.

    • In general, the only types for which you can reasonably assume that pass-by-value is inexpensive are built-in types and STL iterator and function object types. For everything else, prefer pass-by-reference-to-const over pass-by-value.

    • Exception: possibly consider pass-by-value of user-defined types for copyable parameters that are cheap to move, if they are always copied anyway in their implementation.

    • When there are chains of function calls, each of which passes-by-value, the cost for the entire chain of calls accumulates and may not be something you can tolerate. With by-reference passing, chains of calls don’t incur this kind of accumulated overhead.

    • The most practical approach may be to adopt a “guilty until proven innocent” policy for pass-by-value of user-defined types.

  2. laanwj added the label Refactoring on Sep 23, 2021
  3. JeremyRubin commented at 5:17 am on September 24, 2021: contributor

    Is there any point in passing cfeerate by const reference vs just const value?

    On Thu, Sep 23, 2021, 7:52 AM Jon Atack @.***> wrote:

    and add missing feerate and txmempool headers.

    You can view, comment on, or merge this pull request online at:

    #23076 Commit Summary

    File Changes

    Patch Links:

    — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/bitcoin/bitcoin/pull/23076, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAGYN67CNE6N4Z6HYN73JHTUDM5J7ANCNFSM5ET4AYJA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

  4. laanwj commented at 11:05 am on September 24, 2021: member

    Is there any point in passing cfeerate by const reference vs just const

    Agree. A CFeeRate is basically a wrapper around a CAmount, so 8 bytes. Seems for constant use it’s better to pass it by value, instead of adding a level of indirection by reference.

  5. jonatack commented at 2:10 pm on September 24, 2021: contributor

    Yes a CAmount is 8 bytes, and https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#Rf-in suggests a threshold of 2-3 words, e.g. 2 * sizeof(void*).

    So, I wondered why elsewhere than this patch CFeeRate params are passed by reference to const. For instance, in feerate.h, fees.{h,cpp}, policy.{h,cpp} andtxmempool.{h,cpp} (granted, it could all be an oversight).

    I found more in-depth discussion in Scott Meyers’ “Effective C++, Third Edition” (2005), Item 20, pages 86-94, and “Effective Modern C++” (2015), Item 41, pages 281-292. Maybe it is somewhat dated. Apologies if it is obvious info. Some excerpts:

    • Avoid passing objects of user-defined types by value.

    • One might conclude that all small types are good candidates for pass-by-value, even if they’re user-defined. However, just because an object is small doesn’t mean that calling its copy constructor is inexpensive. Many objects–most STL containers among them–contain little more than a pointer, but copying such objects entails copying everything they point to. That can be very expensive.

    • Even when small objects have inexpensive copy constructors, there can be performance issues. Some compilers treat built-in and user-defined types differently, even if they have the same underlying representation. When that type of thing happens, you can be better off passing such objects by reference, because compilers will certainly put pointers (the implementation of references) into registers.

    • Another reason why small user-defined types are not necessarily good pass-by-value candidates is that, being user-defined, their size is subject to change. A type that’s small now may be bigger in the future if its internal implementation changes. Things can even change when switching to a different C++ implementation.

    • In general, the only types for which you can reasonably assume that pass-by-value is inexpensive are built-in types and STL iterator and function object types. For everything else, prefer pass-by-reference-to-const over pass-by-value.

    • Exception: possibly consider pass-by-value of user-defined types for copyable parameters that are cheap to move, if they are always copied anyway in their implementation.

    • When there are chains of function calls, each of which passes-by-value, the cost for the entire chain of calls accumulates and may not be something you can tolerate. With by-reference passing, chains of calls don’t incur this kind of accumulated overhead.

    • The most practical approach may be to adopt a “guilty until proven innocent” policy for pass-by-value of user-defined types.

  6. jonatack commented at 2:22 pm on September 24, 2021: contributor
    It looks like there are 18 lines of CFeeRate params passed by reference to const, and 7 in this patch passed by value. If the 18 should be changed rather than these 7, it would remain a reasonably small change.
  7. DrahtBot commented at 3:29 pm on September 25, 2021: contributor

    The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

    Conflicts

    Reviewers, this pull request conflicts with the following ones:

    • #23475 (wallet: add config to prioritize a solution that doesn’t create change in coin selection by brunoerg)

    If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

  8. laanwj commented at 1:20 pm on September 27, 2021: member

    Yes I think the main thing to consider is maintainability: if CFeeRate might grow larger at some point in the future, we do not want to change all of these again. FWIW, the story behind CAmount historically being passed by-reference is that it makes it easier to share code with coins that have non-POD amounts (at the time, Freicoin). I don’t think we actually care about this, but for the info.

    Anyhow:

    • I think the chance of more fields being added to CFeeRate is relatively small
    • however, CFeeRate is never passed to really performance critical functions where copying a few more bytes would really make a difference. Nor the extra level of indirection, for that matter.

    So to be honest I don’t feel strongly either way. It would be somewhat nice to be consistent but that’s all.

  9. JeremyRubin commented at 6:18 pm on September 27, 2021: contributor
    i don’t particularly care either just figured it might be better to pass by const value.
  10. jonatack commented at 9:41 am on September 29, 2021: contributor

    Yes, ISTM the advantages of this change are small amounts (pun intended) of:

    • maintainability (no need to change further as CFeeRate evolves)
    • clarity for developers reading the code and this discussion
    • and perhaps even performance…

    With by-reference passing, chains of calls don’t incur this kind of accumulated overhead.

    …as the CFeeRate comparison operators in feerate.h are reference to const, when feerates are passed into functions to be compared by invoking one of those operators, like in policy/rbf.{h,cpp} in this diff, performance might be not worse and indeed (very slightly) better if the call chain remains reference to const rather than introducing a needless copy in the middle. At least, performance is a wash or shouldn’t be an issue.

    I tried writing some benchmarks as a sanity check but I didn’t see anything conclusive (the difference may be too small to see and my machine isn’t very stable for benchmarking anyway): https://github.com/jonatack/bitcoin/commits/bench-passing-CAmount-and-CFeeRate

  11. jonatack commented at 10:16 am on September 29, 2021: contributor

    FWIW, the story behind CAmount historically being passed by-reference is that it makes it easier to share code with coins that have non-POD amounts (at the time, Freicoin).

    Thanks for the context! Been wondering for some time why that was.

  12. PastaPastaPasta approved
  13. PastaPastaPasta commented at 7:30 am on January 31, 2022: contributor

    utACK 2e4a6cd3c8eb8fafd940814f5f11a18a53060185

    This is a simply change, const is good, and based on the discussion I think that passing by reference is also fine/good. Additionally, it seems this PR has none / maybe 1 conflicting PR, so imo we should just merge it.

  14. DrahtBot added the label Needs rebase on Mar 23, 2022
  15. jonatack force-pushed on Mar 24, 2022
  16. jonatack commented at 1:14 pm on March 24, 2022: contributor
    Rebased. git range-diff e7b6272 2e4a6cd 4f9c44b
  17. DrahtBot removed the label Needs rebase on Mar 24, 2022
  18. DrahtBot added the label Needs rebase on Mar 25, 2022
  19. w0xlt approved
  20. w0xlt commented at 2:42 am on March 26, 2022: contributor

    Code Review ACK 4f9c44b.

    I agree that it’s a better approach to pass user-defined types by reference.

  21. jonatack force-pushed on Apr 29, 2022
  22. jonatack commented at 1:41 pm on April 29, 2022: contributor
    Rebased per git range-diff 26296eb 4f9c44b 3c7a3c1.
  23. DrahtBot removed the label Needs rebase on Apr 29, 2022
  24. DrahtBot added the label Needs rebase on Jun 7, 2022
  25. jonatack renamed this:
    Pass CFeeRate and CTxMemPool in-params by reference to const
    refactor: avoid unnecessarily copying CTxMemPool and CFeeRate objects
    on Jul 30, 2022
  26. jonatack force-pushed on Jul 30, 2022
  27. DrahtBot removed the label Needs rebase on Jul 30, 2022
  28. jonatack commented at 1:04 pm on July 30, 2022: contributor
    Rebased, 2 previous ACKs by @PastaPastaPasta and @w0xlt (thanks!)
  29. in src/policy/rbf.cpp:59 in f5b094df60 outdated
    55@@ -56,7 +56,7 @@ RBFTransactionState IsRBFOptInEmptyMempool(const CTransaction& tx)
    56 }
    57 
    58 std::optional<std::string> GetEntriesForConflicts(const CTransaction& tx,
    59-                                                  CTxMemPool& pool,
    60+                                                  const CTxMemPool& pool,
    


    MarcoFalke commented at 1:30 pm on July 30, 2022:

    & doesn’t create a copy of CTxMemPool, so I think the pull title is wrong.

    Also, I wonder if this is something that can be enforced with clang-tidy, if we want it. Maybe https://clang.llvm.org/extra/clang-tidy/checks/readability/non-const-parameter.html ?


    jonatack commented at 2:09 pm on July 30, 2022:
    Thanks Marco, updated the title and commit message and testing your clang-tidy idea.

    jonatack commented at 3:40 pm on July 30, 2022:
    Dropped the tidy check as it proposed making a pointer to const value that needs to be mutable.

    MarcoFalke commented at 3:43 pm on July 30, 2022:

    Yeah, I don’t know if there is a tidy check for this. But when the reason for the changes in this pull are “consistency” #23076 (comment) , we should enforce this “consistency” to avoid incremental fixups to existing code.

    Same for the CFeeRate change.


    jonatack commented at 3:48 pm on July 30, 2022:
    Many changes seem to be based on copying or grepping the existing code. For example, if CFeeRate in-params are all passed in the same fashion rather than, say, 2/3 by const ref and 1/3 by value, ISTM that alone may help to maintain consistency.

    MarcoFalke commented at 3:50 pm on July 30, 2022:
    How do you prevent a change from introducing this tomorrow? And then a follow-up pull to that change in tomorrow+n days?

    jonatack commented at 3:54 pm on July 30, 2022:
    Removed consistency from the pull description summary (I think there is ample remaining rationale).

    MarcoFalke commented at 4:00 pm on July 30, 2022:

    My comment was meant generally, I only filled in a random rationale as an example.

    When the reason for the changes in this pull are “$rationale”, we should ideally enforce this “$rationale” to avoid incremental fixups to existing or future code.


    jonatack commented at 4:03 pm on July 30, 2022:

    How do you prevent a change from introducing this tomorrow? And then a follow-up pull to that change in tomorrow+n days?

    This is true for many changes that are merged.

    There was plenty of rationale provided in the PR description and it seems pointless to continue to spend more time here.

  30. jonatack renamed this:
    refactor: avoid unnecessarily copying CTxMemPool and CFeeRate objects
    refactor: pass CTxMemPool and CFeeRate in-param objects by const reference
    on Jul 30, 2022
  31. refactor: pass CTxMemPool and CFeeRate in-params objects by const reference
    as they are classes / user-defined types.
    
    Rationale:
    - maintainability, no need to change further as these classes evolve
    - consistency
    - clarity for developers reading the code and this description
    - performance in some cases
    
    Excerpts from:
    - Scott Meyers, "Effective C++, Third Edition" (2005), Item 20, pages 86-94
    - Scott Meyers, "Effective Modern C++" (2015), Item 41, pages 281-292
    
    - Avoid passing objects of user-defined types by value.
    
    - One might conclude that all small types are good candidates for pass-by-value,
      even if they're user-defined. However, just because an object is small doesn't
      mean that calling its copy constructor is inexpensive.  Many objects--most STL
      containers among them--contain little more than a pointer, but copying such
      objects entails copying everything they point to. That can be very expensive.
    
    - Even when small objects have inexpensive copy constructors, there can be
      performance issues. Some compilers treat built-in and user-defined types
      differently, even if they have the same underlying representation. When that
      type of thing happens, you can be better off passing such objects by
      reference, because compilers will certainly put pointers (the implementation
      of references) into registers.
    
    - Another reason why small user-defined types are not necessarily good
      pass-by-value candidates is that, being user-defined, their size is subject to
      change. A type that's small now may be bigger in the future if its internal
      implementation changes.  Things can even change when switching to a different
      C++ implementation.
    
    - In general, the only types for which you can reasonably assume that
      pass-by-value is inexpensive are built-in types and STL iterator and function
      object types. For everything else, prefer pass-by-reference-to-const over
      pass-by-value.
    
    - Exception: possibly consider pass-by-value of user-defined types for copyable
      parameters that are cheap to move, if they are always copied anyway in their
      implementation.
    
    - When there are chains of function calls, each of which passes-by-value, the
      cost for the entire chain of calls accumulates and may not be something you can
      tolerate. With by-reference passing, chains of calls don't incur this kind of
      accumulated overhead.
    
    - The most practical approach may be to adopt a "guilty until proven innocent"
      policy for pass-by-value of user-defined types.
    f5cf04564d
  32. jonatack force-pushed on Jul 30, 2022
  33. jonatack force-pushed on Jul 30, 2022
  34. jonatack force-pushed on Jul 30, 2022
  35. jonatack commented at 4:02 pm on July 30, 2022: contributor
    Closing as this simple change has been open with ACKs for almost a year without merge. If it was desired, it would have been merged.
  36. jonatack closed this on Jul 30, 2022

  37. ryanofsky commented at 3:05 pm on August 1, 2022: contributor

    Code review ACK f5cf04564d5a0e044b59d508d51b2d1cbcd1afd5, but I also agree with closing the issue since there doesn’t seem to have been agreement on it.

    FWIW, I prefer passing passing by const reference just because it clearly expresses semantics of “this argument will only be read” and like reserving passing by value for meaning “this argument might be moved/copied/inserted somewhere.” But there are cases where functions are called in loops and copying by value is more performant. Funny enough, I just saw something about how this conflict is supposed to be avoided in google’s experimental c++ replacement by some new language rule https://www.foonathan.net/2022/07/carbon-calling-convention/

  38. bitcoin locked this on Aug 1, 2023

github-metadata-mirror

This is a metadata mirror of the GitHub repository bitcoin/bitcoin. This site is not affiliated with GitHub. Content is generated from a GitHub metadata backup.
generated: 2024-09-29 01:12 UTC

This site is hosted by @0xB10C
More mirrored repositories can be found on mirror.b10c.me