When benchmarks go bad - what I learned from measuring performance wrong