As shown below, the stability metric is not terrible, but not great. I believe the issue is multithreading, since code coverage runs with gcov show variance in mutex-related code.
See section 8 of, e.g., https://afl-1.readthedocs.io/en/latest/user_guide.html
<img width="567" alt="Screen Shot 2021-07-25 at 4 48 45 PM" src="https://user-images.githubusercontent.com/967816/126917370-564ca07b-20b2-44f8-bbaf-19600a1be402.png">