As shown below, the stability metric is not terrible, but not great. I believe the issue is multithreading, since code coverage runs with gcov show variance in mutex-related code.
See section 8 of, e.g., https://afl-1.readthedocs.io/en/latest/user_guide.html