The site search sometimes highlights words as hits which are not hits.
Examples:
- In the results for {fled}, the word "flamboyant" is also highlighted.
- In the results for {sled}, the words "slipped", "slot", and "slaughter" are also highlighted.
- In the results for {cred}, the words "create", "CraigsList", "creepy", etc. are also highlighted.
- In the results for {bling}, the words "black", "blop", and "blouse" are also highlighted.
- In the results for {ching}, the words "challenge", "Chatterjee", "Christine", etc. are also highlighted.
- In the results for {wooing}, the words "wooden" and "WooTube" are also highlighted.
Similarly in {
fred}, {
fling}, {
sting}, {
aping}.
User's uninformed speculation as to the cause: (I understand that no bug report is complete without this.) I suppose the code sees the terminal "ed" of "fled" (for example), figures that "fled" is the past tense form of a verb "fl", then helpfully tries to highlight alternatively inflected forms, which it identifies by pattern "fl*" (in glob syntax).
Search results themselves are fine: The posts returned by the search do all actually seem to contain the searched-for words — for example, the results for {fled} have "flamboyant" highlighted, but they don't include posts that have the word "flamboyant" but not the word "fled".
Further evidence that this is just about the highlighting and not the search proper is the results for {
natural thing}, in which even stopwords such as "the" and "their" are highlighted. (I add the word "natural" to the search only to reduce the number of hits below the threshold where the search will refuse to return results.)
Counterexamples showing some limits of the phenomenon: It doesn't happen when there's just one letter before the presumed suffix, e.g., in the results for {
med}, not all words starting with "m" are highlighted (though the word "meds" is), and in the results for {
bing}, not all words starting with "b" are highlighted.
In the results for {
tees}, the word "TeeFury" is highlighted, but the word "Texas" is not, suggesting that terminal "s" is treated as a productive suffix but terminal "es" is not.
posted by stebulus to Bugs at 2:35 PM (57 comments total)
7 users marked this as a favorite
posted by pb (staff) at 2:38 PM on August 11, 2012