What would be the thresholds for each color (is green 80% - 95%, or 70% to 95%)?
It’s just the computer vision score, which is [0,100], scaled to [0,120] and used as the hue in HSL.
Does red sort of imply wrong so implicitly in our current visual lexicon that people don’t even consider the other suggestions (sometimes the right suggestion is further down the list)?
Red-green was just the first gradient that occurred to me. I don’t have an informed opinion about whether red in particular imparts a bias or, if so, whether that bias is relevant given that the extension would be opt-in.
Would the colors be adjusted by the user for accessibility (r/g color blindness, etc)?
This is a good point, and taken with your second one argues for a different color gradient, or at least a color-blindness mode.
In general, whether it’s color-coding or just exposing the raw number, as OP requested, I think it’s nice to be able to distinguish between different the different cases illustrated by my examples. Then again, I confess that in addition to not being a designer, I’m also not a statistician, and thus @kueda’s comment above is a bit lost on me:
I’m told the score should not be considered a metric of “confidence” or “probability” and it should mainly be used for ordering outputs
i.e. I’m unclear on whether the magnitude of the differences in scores is relevant in any way.