Four languages have word lists encoded in non-normalized Unicode. See canonical and compatibility equivalence.
This makes display and comparison with input fail on some terminals (admittedly, I can only report on alacritty).
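To illustrate the comparison failure, here is a minimal Python sketch. The word "café" is only an illustrative example, not an entry taken from the word lists, and `same_word` is a hypothetical helper name. It shows that two strings that render identically compare as different until both are normalized to the same form.

```python
import unicodedata

# The same visible word in two encodings (illustrative example only,
# not taken from the actual word lists).
composed = "caf\u00e9"      # precomposed "é" (U+00E9)
decomposed = "cafe\u0301"   # "e" followed by combining acute accent (U+0301)


def same_word(a: str, b: str) -> bool:
    """Compare two strings after normalizing both to NFC (hypothetical helper)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# A plain comparison looks at code points, so it treats the strings as different.
raw_equal = composed == decomposed
# After normalization the two spellings compare equal.
normalized_equal = same_word(composed, decomposed)

print(raw_equal, normalized_equal)
```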
I suggest replacing all these files with normalized versions.
I obtained these normalized versions with bash and Raku (Raku always outputs normalized Unicode):
0bip-0039 $ for f in *.txt; do raku -e "print q{$f}.IO.slurp" > /tmp/$f; mv {/tmp/,}$f; done
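For anyone without Raku installed, roughly the same result could be obtained with Python's standard library. This is only a sketch under assumptions: the function names `normalize_file` and `normalize_directory` are hypothetical, and the default form `NFC` is my assumption about which normalization form is wanted, so it is exposed as a parameter. Files are read and written as bytes so that line endings are left untouched.

```python
import unicodedata
from pathlib import Path


def normalize_file(path: Path, form: str = "NFC") -> bool:
    """Rewrite `path` in place if needed; return True if its content changed.

    `form` defaults to NFC, which is an assumption about the desired form.
    """
    original = path.read_bytes().decode("utf-8")
    normalized = unicodedata.normalize(form, original)
    if normalized == original:
        return False  # already normalized, leave the file alone
    path.write_bytes(normalized.encode("utf-8"))
    return True


def normalize_directory(directory: Path, form: str = "NFC") -> list:
    """Normalize every *.txt file in `directory`; return names of changed files."""
    return [
        p.name
        for p in sorted(directory.glob("*.txt"))
        if normalize_file(p, form)
    ]
```

Run from the word list directory, `normalize_directory(Path("."))` returns the names of the files that were rewritten, which also makes it easy to see which lists were affected.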