When benchmarks go bad - what I learned from measuring performance wrong