net/p2p: change nScore and nBestScore data types to int64_t #24090

pull RandyMcMillan wants to merge 1 commits into bitcoin:master from RandyMcMillan:1642450390-issue-24049 changing 2 files +8 −8

RandyMcMillan commented at 8:29 PM on January 17, 2022: contributor

Changing nScore and nBestScore to a fixed-width integer guarantees to be the same size on any architecture and avoids UB.

Addresses issue: #24049
DrahtBot added the label P2P on Jan 17, 2022
shaavan commented at 2:15 PM on January 18, 2022: contributor

Concept ACK

It makes sense to convert nScore to int64_t from int to avoid UB. However, I shall take a further look in the codebase to check if doing so might not be causing a yet unforeseen issue.

in src/net.cpp:143 in a1071e90b7 outdated

 139 | @@ -140,7 +140,7 @@ bool GetLocal(CService& addr, const CNetAddr *paddrPeer)
 140 |          LOCK(cs_mapLocalHost);
 141 |          for (const auto& entry : mapLocalHost)
 142 |          {
 143 | -            int nScore = entry.second.nScore;
 144 | +            int64_t nScore = entry.second.nScore;

mzumsande commented at 3:27 PM on January 18, 2022:

Because of the line nBestScore = nScore; below, I think that nBestScore should be changed to int64_t as well.

RandyMcMillan commented at 10:51 PM on January 18, 2022:

done - good catch! thanks!

mzumsande commented at 3:53 PM on January 18, 2022: contributor

Concept ACK
RandyMcMillan force-pushed on Jan 18, 2022
RandyMcMillan commented at 10:48 PM on January 18, 2022: contributor

81d20b8e2f0097867b234bdc02cb2d7aa95b4a51

Changed nBestScore type to int64_t per @mzumsande comment.
RandyMcMillan renamed this:
~~net/p2p: change nScore data type to int64_t~~
net/p2p: change nScore and nBestScore data types to int64_t
on Jan 18, 2022
RandyMcMillan commented at 11:03 PM on January 18, 2022: contributor
Concept ACK

It makes sense to convert nScore to int64_t from int to avoid UB. However, I shall take a further look in the codebase to check if doing so might not be causing a yet unforeseen issue.

The scope of nBestScore is limited to the function:
```
bool GetLocal(CService& addr, const CNetAddr *paddrPeer)
```
https://github.com/bitcoin/bitcoin/blob/81d20b8e2f0097867b234bdc02cb2d7aa95b4a51/src/net.cpp#L137

which returns a boolean based on the evaluation:

return nBestScore >= 0;
w0xlt approved
w0xlt commented at 3:29 AM on January 19, 2022: contributor

crACK 81d20b8

Changing nScore and nBestScore to a fixed-width integer guarantees to be the same size on any architecture and avoids UB.
shaavan commented at 2:35 PM on January 19, 2022: contributor
Since the type of nScore is changed to int64_t, I think it’s logical to change the return type of GetnScore from int to int64_t.

Line 194 in src/net.cpp:
```
static int GetnScore(const CService& addr)
{
    LOCK(cs_mapLocalHost);
    const auto it = mapLocalHost.find(addr);
    return (it != mapLocalHost.end()) ? it->second.nScore : 0;
}
```
src/net: change nScore data type to int64_t 1e0703baaa
RandyMcMillan force-pushed on Jan 19, 2022
RandyMcMillan commented at 4:06 PM on January 19, 2022: contributor
1e0703baaab58915f43831c32bfd789015a8d483

updated function:
```
static int GetnScore(const CService& addr)
```
return type to int64_t per @shaavan comment.
theStack approved
theStack commented at 5:01 PM on January 19, 2022: contributor

Concept and code-review ACK 1e0703baaab58915f43831c32bfd789015a8d483
RandyMcMillan commented at 5:23 PM on January 19, 2022: contributor

yes - bravo to @shaavan for his thorough code review!
w0xlt approved
w0xlt commented at 5:36 PM on January 19, 2022: contributor

reACK 1e0703b
shaavan approved
shaavan commented at 5:20 AM on January 20, 2022: contributor
ACK 1e0703baaab58915f43831c32bfd789015a8d483

Changes since my last review:
- Changed the return type of GetnScore from int to int64_t to match the nScore's type.
maflcko commented at 10:31 AM on January 24, 2022: member

An alternative would be to saturate the int on overflow
RandyMcMillan commented at 6:06 PM on January 28, 2022: contributor

agree - clamping the int is more prudent for memory usage.
RandyMcMillan marked this as a draft on Jan 30, 2022
RandyMcMillan commented at 10:29 PM on January 30, 2022: contributor

@w0xlt - I like the idea of saturating (clamping) this variable - open to suggestions/patches. Not sure which size is appropriate.
maflcko cross-referenced this on Feb 1, 2022 from issue util: Add SaturatingAdd helper by maflcko
maflcko referenced this in commit e44423c9d3 on Feb 21, 2022
maflcko commented at 2:52 PM on February 21, 2022: member

Can rebase and use SaturatingAdd?
luke-jr commented at 5:21 AM on March 10, 2022: member

Saturating means once an IP reaches max, it won't ever get swapped out for a newer IP, right?

Maybe instead, when we would increment beyond max, we should subtract the lowest (>LOCAL_MAX) current score from all mapLocalHost entries? I think that would preserve the current behaviour.

guarantees to be the same size

Does this matter?
RandyMcMillan commented at 5:23 AM on March 10, 2022: contributor

@luke-jr - if you have an idea - post a patch - I will apply it.
jonatack commented at 10:02 AM on March 10, 2022: contributor

ACK 1e0703baaab58915f43831c32bfd789015a8d483 code review, rebased, debug build, ran unit tests.

That said, changing the one line that increments nScore in SeenLocal() to use SaturatingAdd (recently added in #24224) looks to me like a preferable and more minimal change.

An additional reason might be that if any math operations to nScore are added in the future, the C++ tendency to auto-convert to int might be a good reason to leave nScore as int here, if I'm not confused and lacking caffeine.

luke-jr commented at 3:50 PM on March 10, 2022: member

My thought is something like this:

--- a/src/net.cpp
+++ b/src/net.cpp
@@ -349,6 +349,20 @@ bool SeenLocal(const CService& addr)
     LOCK(g_maplocalhost_mutex);
     const auto it = mapLocalHost.find(addr);
     if (it == mapLocalHost.end()) return false;
+    if (it->second.nScore == std::numeric_limits<int>::max()) {
+        int lowest_inc_score = std::numeric_limits<int>::max();
+        for (auto& localinfo : mapLocalHost) {
+            if (localinfo.second.nScore > LOCAL_MAX && localinfo.second.nScore < lowest_inc_score) {
+                lowest_inc_score = localinfo.second.nScore;
+            }
+        }
+        lowest_inc_score -= LOCAL_MAX;
+        for (auto& localinfo : mapLocalHost) {
+            if (localinfo.second.nScore > LOCAL_MAX) {
+                localinfo.second.nScore -= lowest_inc_score;
+            }
+        }
+    }
     ++it->second.nScore;
     return true;
 }

But I'm not 100% sure of it yet.

RandyMcMillan closed this on Mar 19, 2022
fanquake cross-referenced this on Aug 8, 2022 from issue net: signed-integer-overflow in LocalServiceInfo by Crypt-iQ
hebasto cross-referenced this on Oct 27, 2022 from issue Fix #24049: signed integer overflow in `SeenLocal` by ViralTaco

ViralTaco commented at 9:24 PM on November 2, 2022: none

My thought is something like this:

--- a/src/net.cpp
+++ b/src/net.cpp
@@ -349,6 +349,20 @@ bool SeenLocal(const CService& addr)
     LOCK(g_maplocalhost_mutex);
     const auto it = mapLocalHost.find(addr);
     if (it == mapLocalHost.end()) return false;
+    if (it->second.nScore == std::numeric_limits<int>::max()) {
+        int lowest_inc_score = std::numeric_limits<int>::max();
+        for (auto& localinfo : mapLocalHost) {
+            if (localinfo.second.nScore > LOCAL_MAX && localinfo.second.nScore < lowest_inc_score) {
+                lowest_inc_score = localinfo.second.nScore;
+            }
+        }
+        lowest_inc_score -= LOCAL_MAX;
+        for (auto& localinfo : mapLocalHost) {
+            if (localinfo.second.nScore > LOCAL_MAX) {
+                localinfo.second.nScore -= lowest_inc_score;
+            }
+        }
+    }
     ++it->second.nScore;
     return true;
 }

But I'm not 100% sure of it yet.

Does the code inside the if (INT_MAX == it->second.nscore) need to have linear ([or] is it binomial?) time complexity? You're updating EVERY node in a map because the nScore in ONE of the is equal to INT_MAX? If you only intend to keep the ordering: there are many ways to do it in constant, time.

All of that, and it happens to be a breaking change.

  ++reinterpret_cast<unsigned&>(it->second.nScore);

Keeps everything unchanged, gets rid of the undefined behavior making it (until c++20: implementation specified; since c++20: well-defined) behavior.

What do you think?

EDIT: Removed a paragraph since the problem I described in it couldn't happen without a data race, anyway.

hebasto cross-referenced this on Apr 1, 2023 from issue #24049 Issue: Update nScore datatype by DevAgrawal1112
bitcoin locked this on Nov 2, 2023

Contributors

Labels

Linked (view graph)

#24049 net: signed-integer-overflow in LocalServiceInfo #24224 util: Add SaturatingAdd helper #26399 Fix #24049: signed integer overflow in `SeenLocal`#27386 #24049 Issue: Update nScore datatype