Base Match Query: Billy Sue Test Match Query #1: Billy Sue and Test Match

Question

0

Editorial Team

Asked: June 17, 20262026-06-17T20:34:55+00:00 2026-06-17T20:34:55+00:00

Base Match Query: Billy Sue Test Match Query #1: Billy Sue and Test Match

0

Base Match Query: Billy Sue

Test Match Query #1: Billy Sue and

Test Match Query #2: Billy and Sue

We end up with identical scores between Base and #1, but Base and #2 have similar yet different scores.

Using the analyze API, the stop word and is removed on both test queries, but the start_offset and end_offset token properties differ for Sue between the Base query and Test Query #2.

Essentially, the pre-stop-word-removal distance between the remaining tokens is recorded and has a small yet finite impact on scoring.

The Question

Is there a way to delay the calculation of the start_offset and end_offset properties of tokens until after stop-words are removed, or otherwise prevent removed stop-words from influencing scoring in any fashion?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-17T20:34:56+00:00

Perhaps disable position increments on the stop word filterand see if that helps? Especially if your mapping has some kind of filter after the stop word filter, you’ll get strange artifacts from the position increments

E.g. something like this:

"analyzer": {
   "analyzer_example":{
      "tokenizer":"standard",
      "filter":["standard", "lowercase", "filter_stop"]
    }
},
"filter": { 
   "filter_stop":{
      "type":"stop",
      "enable_position_increments":"false"
    }
}

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Base Match Query: Billy Sue Test Match Query #1: Billy Sue and Test Match

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply