Minimal Solr score for inclusion in results?
I have created a collection of medical terms, using all default Solr (7.5) settings. The documents came from CSV files and I used bin/post
with default settings.
When I submit a silly query I may not get the number of rows I have requested.
http://host/solr/collection/select?fl=anyLabel,score&q=anyLabel:(astronaut%20%20football%20felafel)&rows=9999&wt=csv
Is there some score threshold? In this case, the lowest score is ~ 8. I've run other less-silly queries that return reasonable results down to scores of 2 or 3.
Why was this result truncated after a result with a score of 8? Do I have any control over that?
anyLabel,score
football,16.0328
astronaut haemolytic anaemia,15.470738
astronaut hemolytic anemia,15.470738
canadian football,14.440538
american football,14.440538
football field,14.440538
astronaut-bone demineralization syndrome,14.188901
indoor football arena,13.135968
australian rules football,13.135968
canadian football - sport,13.135968
american football - sport,13.135968
aussie rules football,13.135968
indoor football court,13.135968
astronaut-bone demineralization syndrome (disorder),13.103226
australian rules football ground,12.04758
indoor football arena (environment),12.04758
indoor american football arena,12.04758
american or canadian football,12.04758
american or canadian football field,11.12575
accidentally kicked during football game,11.12575
australian rules football ground (environment),11.12575
canadian football - sport (qualifier value),11.12575
american or canadian football - sport,11.12575
american football - sport (qualifier value),11.12575
australian rules football (qualifier value),11.12575
"american or canadian football, device",11.12575
accidentally stepped on during football game,10.334962
american or canadian football field (environment),10.334962
accidentally kicked during football game (event),10.334962
american or canadian football - sport (qualifier value),9.649129
"american or canadian football, device (physical object)",9.649129
accidentally stepped on during football game (event),9.649129
"place of occurrence of accident or poisoning, football field",8.518538
"place of occurrence of accident or poisoning, football field (environment)",8.047099
solr
add a comment |
I have created a collection of medical terms, using all default Solr (7.5) settings. The documents came from CSV files and I used bin/post
with default settings.
When I submit a silly query I may not get the number of rows I have requested.
http://host/solr/collection/select?fl=anyLabel,score&q=anyLabel:(astronaut%20%20football%20felafel)&rows=9999&wt=csv
Is there some score threshold? In this case, the lowest score is ~ 8. I've run other less-silly queries that return reasonable results down to scores of 2 or 3.
Why was this result truncated after a result with a score of 8? Do I have any control over that?
anyLabel,score
football,16.0328
astronaut haemolytic anaemia,15.470738
astronaut hemolytic anemia,15.470738
canadian football,14.440538
american football,14.440538
football field,14.440538
astronaut-bone demineralization syndrome,14.188901
indoor football arena,13.135968
australian rules football,13.135968
canadian football - sport,13.135968
american football - sport,13.135968
aussie rules football,13.135968
indoor football court,13.135968
astronaut-bone demineralization syndrome (disorder),13.103226
australian rules football ground,12.04758
indoor football arena (environment),12.04758
indoor american football arena,12.04758
american or canadian football,12.04758
american or canadian football field,11.12575
accidentally kicked during football game,11.12575
australian rules football ground (environment),11.12575
canadian football - sport (qualifier value),11.12575
american or canadian football - sport,11.12575
american football - sport (qualifier value),11.12575
australian rules football (qualifier value),11.12575
"american or canadian football, device",11.12575
accidentally stepped on during football game,10.334962
american or canadian football field (environment),10.334962
accidentally kicked during football game (event),10.334962
american or canadian football - sport (qualifier value),9.649129
"american or canadian football, device (physical object)",9.649129
accidentally stepped on during football game (event),9.649129
"place of occurrence of accident or poisoning, football field",8.518538
"place of occurrence of accident or poisoning, football field (environment)",8.047099
solr
add a comment |
I have created a collection of medical terms, using all default Solr (7.5) settings. The documents came from CSV files and I used bin/post
with default settings.
When I submit a silly query I may not get the number of rows I have requested.
http://host/solr/collection/select?fl=anyLabel,score&q=anyLabel:(astronaut%20%20football%20felafel)&rows=9999&wt=csv
Is there some score threshold? In this case, the lowest score is ~ 8. I've run other less-silly queries that return reasonable results down to scores of 2 or 3.
Why was this result truncated after a result with a score of 8? Do I have any control over that?
anyLabel,score
football,16.0328
astronaut haemolytic anaemia,15.470738
astronaut hemolytic anemia,15.470738
canadian football,14.440538
american football,14.440538
football field,14.440538
astronaut-bone demineralization syndrome,14.188901
indoor football arena,13.135968
australian rules football,13.135968
canadian football - sport,13.135968
american football - sport,13.135968
aussie rules football,13.135968
indoor football court,13.135968
astronaut-bone demineralization syndrome (disorder),13.103226
australian rules football ground,12.04758
indoor football arena (environment),12.04758
indoor american football arena,12.04758
american or canadian football,12.04758
american or canadian football field,11.12575
accidentally kicked during football game,11.12575
australian rules football ground (environment),11.12575
canadian football - sport (qualifier value),11.12575
american or canadian football - sport,11.12575
american football - sport (qualifier value),11.12575
australian rules football (qualifier value),11.12575
"american or canadian football, device",11.12575
accidentally stepped on during football game,10.334962
american or canadian football field (environment),10.334962
accidentally kicked during football game (event),10.334962
american or canadian football - sport (qualifier value),9.649129
"american or canadian football, device (physical object)",9.649129
accidentally stepped on during football game (event),9.649129
"place of occurrence of accident or poisoning, football field",8.518538
"place of occurrence of accident or poisoning, football field (environment)",8.047099
solr
I have created a collection of medical terms, using all default Solr (7.5) settings. The documents came from CSV files and I used bin/post
with default settings.
When I submit a silly query I may not get the number of rows I have requested.
http://host/solr/collection/select?fl=anyLabel,score&q=anyLabel:(astronaut%20%20football%20felafel)&rows=9999&wt=csv
Is there some score threshold? In this case, the lowest score is ~ 8. I've run other less-silly queries that return reasonable results down to scores of 2 or 3.
Why was this result truncated after a result with a score of 8? Do I have any control over that?
anyLabel,score
football,16.0328
astronaut haemolytic anaemia,15.470738
astronaut hemolytic anemia,15.470738
canadian football,14.440538
american football,14.440538
football field,14.440538
astronaut-bone demineralization syndrome,14.188901
indoor football arena,13.135968
australian rules football,13.135968
canadian football - sport,13.135968
american football - sport,13.135968
aussie rules football,13.135968
indoor football court,13.135968
astronaut-bone demineralization syndrome (disorder),13.103226
australian rules football ground,12.04758
indoor football arena (environment),12.04758
indoor american football arena,12.04758
american or canadian football,12.04758
american or canadian football field,11.12575
accidentally kicked during football game,11.12575
australian rules football ground (environment),11.12575
canadian football - sport (qualifier value),11.12575
american or canadian football - sport,11.12575
american football - sport (qualifier value),11.12575
australian rules football (qualifier value),11.12575
"american or canadian football, device",11.12575
accidentally stepped on during football game,10.334962
american or canadian football field (environment),10.334962
accidentally kicked during football game (event),10.334962
american or canadian football - sport (qualifier value),9.649129
"american or canadian football, device (physical object)",9.649129
accidentally stepped on during football game (event),9.649129
"place of occurrence of accident or poisoning, football field",8.518538
"place of occurrence of accident or poisoning, football field (environment)",8.047099
solr
solr
edited Nov 15 '18 at 18:34
Mark Miller
asked Nov 15 '18 at 14:56
Mark MillerMark Miller
1,927624
1,927624
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
There is no minimum score - anything above 0
is considered a match in some way, and is included as long as the rows
and start
parameter make sense compared to the numFound
value in the response.
In general scores are not comparable between requests, and extrapolating scores to mean "a document with half the score of another document is just 50% as relevant" does not make sense either.
The score will also depend on the similarity algorithm used, which can differ between Solr versions. For 7.5 it's a BM25 Similarity.
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53322175%2fminimal-solr-score-for-inclusion-in-results%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
There is no minimum score - anything above 0
is considered a match in some way, and is included as long as the rows
and start
parameter make sense compared to the numFound
value in the response.
In general scores are not comparable between requests, and extrapolating scores to mean "a document with half the score of another document is just 50% as relevant" does not make sense either.
The score will also depend on the similarity algorithm used, which can differ between Solr versions. For 7.5 it's a BM25 Similarity.
add a comment |
There is no minimum score - anything above 0
is considered a match in some way, and is included as long as the rows
and start
parameter make sense compared to the numFound
value in the response.
In general scores are not comparable between requests, and extrapolating scores to mean "a document with half the score of another document is just 50% as relevant" does not make sense either.
The score will also depend on the similarity algorithm used, which can differ between Solr versions. For 7.5 it's a BM25 Similarity.
add a comment |
There is no minimum score - anything above 0
is considered a match in some way, and is included as long as the rows
and start
parameter make sense compared to the numFound
value in the response.
In general scores are not comparable between requests, and extrapolating scores to mean "a document with half the score of another document is just 50% as relevant" does not make sense either.
The score will also depend on the similarity algorithm used, which can differ between Solr versions. For 7.5 it's a BM25 Similarity.
There is no minimum score - anything above 0
is considered a match in some way, and is included as long as the rows
and start
parameter make sense compared to the numFound
value in the response.
In general scores are not comparable between requests, and extrapolating scores to mean "a document with half the score of another document is just 50% as relevant" does not make sense either.
The score will also depend on the similarity algorithm used, which can differ between Solr versions. For 7.5 it's a BM25 Similarity.
answered Nov 15 '18 at 18:17
MatsLindhMatsLindh
25.7k32341
25.7k32341
add a comment |
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53322175%2fminimal-solr-score-for-inclusion-in-results%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown