Punctuation buglet May 26, 2009 1:10 PM   Subscribe

Just a tiny buglet, but while tags can contain non-alphanumeric characters, the search doesn't recognise them...
posted by twine42 to Bugs at 1:10 PM (27 comments total)

Funny that it does know there is one tag that matches, but it doesn't show it.
posted by smackfu at 1:19 PM on May 26, 2009


Tags without alphanumeric characters were grandfathered in, and we try to remove/replace when we spot them. We should have an illegal character cleaning party one of these days.
posted by pb (staff) at 1:29 PM on May 26, 2009


"Illegal characters" have a cleaning party?
The party for "character cleaning" is illegal?
This party, it cleans illegal characters?
posted by Night_owl at 1:45 PM on May 26, 2009


He posits the cleaning of illegal characters by illegal characters, you git.
posted by Mister_A at 1:49 PM on May 26, 2009


What's for dinner, mother?
posted by lazaruslong at 1:52 PM on May 26, 2009


"Illegal characters" have a cleaning party?

Like in a Luc Besson film!
posted by Artw at 1:53 PM on May 26, 2009 [2 favorites]


We should have an illegal character cleaning party one of these days.

Sure, that sounds like it would be a lot more fun than a book burning party, but would it really? Because I had a book burning party once and nobody showed up (possibly because it didn't sound very fun). And nobody showed up to my birthday party that year either (probably because it was a surprise party and I was planning it myself so nobody really knew when or where it was). I guess my point is that, looking back on it, becoming a professional party planner was a bad career choice for me.
posted by burnmp3s at 1:59 PM on May 26, 2009


I shouldn't have answered this off the top of my head. I just checked and apostrophes and underscores are the exception to the English alphanumeric rule, they are allowed. I updated the search results page accordingly, so everything should be set. Thanks for letting us know.
posted by pb (staff) at 2:17 PM on May 26, 2009


pb: We should have an illegal character cleaning party one of these days.

*warily eyes accent mark in own first name*

These are not the illegal characters you are looking for.
posted by Kattullus at 2:46 PM on May 26, 2009 [2 favorites]


An illegal character purge would be the ideal Final Solution.
posted by DU at 2:49 PM on May 26, 2009


Can't you just put them in camps? For their own protection, of course..
posted by cj_ at 5:54 PM on May 26, 2009




oooh!...I found/reported tag non-alpha characters bug. 15th January this year.
posted by bystander at 9:33 PM on May 26, 2009


U+262D is the hammer and sickle... how do I make this display here?
posted by Meatbomb at 4:14 AM on May 27, 2009


9773;
posted by Meatbomb at 4:15 AM on May 27, 2009


☭ That is just cut and paste from elsewhere.
posted by Meatbomb at 4:17 AM on May 27, 2009


☯ 9775;
posted by Meatbomb at 4:18 AM on May 27, 2009


From personal experience, Meatbomb, it seems as though the only way to get unicode characters to display here is to use their entity names; I don't think you can use entity codes at all (at least they seem to be stripped out). And I don't think that all unicode characters actually have entity names; I think it's just a limited set.
posted by koeselitz at 7:22 AM on May 27, 2009


♣ ♦ ♥ ♠

Huh. You seem to be right, koeslitz. Funny that if you find the chars elsewhere you can cut and paste them in with no prob...

decodeunicode.org has a very cool interface and seemingly complete list.
But it does not actually display the symbols in a cut and pasteable format...

Anyone have an easier solution?

posted by Meatbomb at 7:52 AM on May 27, 2009


I keep TwitterKeys in my bookmarks bar for those times I need a hammer and sickle character. It's just a popup with a bunch of characters you can cut and paste. It has Twitter in the title, but it works just about anywhere. See also: ⌘C ⌘V Character.
posted by pb (staff) at 8:17 AM on May 27, 2009 [2 favorites]


Here's an old page with the entity names for a basic set of characters that doesn't include things like ☯ and ☭, but has often-useful stuff.

Some character entity names that are sometimes useful:
Punctuation and symbols:



Â.......................................displays as Â
—.......................................displays as —
â.......................................displays as â
µ.......................................displays as µ
´.......................................displays as ´
·......................................displays as ·
Æ.......................................displays as Æ
 ........................................displays as  
æ.......................................displays as æ
–.......................................displays as –
à......................................displays as à
¬.........................................displays as ¬
À......................................displays as À
¬.........................................displays as ¬
&.........................................displays as &
ñ......................................displays as ñ
Å.......................................displays as Å
Ñ......................................displays as Ñ
å.......................................displays as å
ó......................................displays as ó
Ã......................................displays as Ã
Ó......................................displays as Ó
ã......................................displays as ã
ô.......................................displays as ô
ä........................................displays as ä
Ô.......................................displays as Ô
Ä........................................displays as Ä
ò......................................displays as ò
¦......................................displays as ¦
Ò......................................displays as Ò
ç......................................displays as ç
ª........................................displays as ª
Ç......................................displays as Ç
º........................................displays as º
¸.......................................displays as ¸
Ø......................................displays as Ø
¢........................................displays as ¢
ø......................................displays as ø
©........................................displays as ©
Õ......................................displays as Õ
¤......................................displays as ¤
õ......................................displays as õ
†......................................displays as †
ö........................................displays as ö
‡......................................displays as ‡
Ö........................................displays as Ö
°.........................................displays as °
¶........................................displays as ¶
÷......................................displays as ÷
‰......................................displays as ‰
É......................................displays as É
±......................................displays as ±
é......................................displays as é
£.......................................displays as £
Ê.......................................displays as Ê
"........................................displays as "
ê.......................................displays as ê
».......................................displays as »
è......................................displays as è
”.......................................displays as ”
È......................................displays as È
®.........................................displays as ®
ð.........................................displays as ð
’.......................................displays as ’
Ð.........................................displays as Ð
§........................................displays as §
Ë........................................displays as Ë
­.........................................displays as ­
ë........................................displays as ë
¹........................................displays as ¹
€........................................displays as €
²........................................displays as ²
½......................................displays as ½
³........................................displays as ³
¼......................................displays as ¼
ß.......................................displays as ß
¾......................................displays as ¾
Þ.......................................displays as Þ
>..........................................displays as >
þ.......................................displays as þ
í......................................displays as í
×.......................................displays as ×
Í......................................displays as Í
™.......................................displays as ™
î.......................................displays as î
ú......................................displays as ú
Î.......................................displays as Î
Ú......................................displays as Ú
¡.......................................displays as ¡
û.......................................displays as û
ì......................................displays as ì
Û.......................................displays as Û
Ì......................................displays as Ì
ù......................................displays as ù
¿......................................displays as ¿
Ù......................................displays as Ù
ï........................................displays as ï
¨.........................................displays as ¨
Ï........................................displays as Ï
ü........................................displays as ü
«.......................................displays as «
Ü........................................displays as Ü
“.......................................displays as “
ý......................................displays as ý
‘.......................................displays as ‘
Ý......................................displays as Ý
&lt;..........................................displays as <
&yen;.........................................displays as ¥
&macr;........................................displays as ¯
&yuml;........................................displays as ÿ

Greek characters:

&Alpha; and &alpha;.......................display as Α and α
&Beta; and &beta;.........................display as Β and β
&Gamma; and &gamma;.......................display as Γ and γ
&Delta; and &delta;.......................display as Δ and δ
&Epsilon; and &epsilon;...................display as Ε and ε
&Zeta; and &zeta;.........................display as Ζ and ζ
&Eta; and &eta;...........................display as Η and η
&Theta; and &theta;.......................display as Θ and θ
&Iota; and &iota;.........................display as Ι and ι
&Kappa; and &kappa;.......................display as Κ and κ
&Lambda; and &lambda;.....................display as Λ and λ
&Mu; and &mu;.............................display as Μ and μ
&Nu; and &nu;.............................display as Ν and ν
&Xi; and &xi;.............................display as Ξ and ξ
&Omicron; and &omicron;...................display as Ο and ο
&Pi; and &pi;.............................display as Π and π
&Rho; and &rho;...........................display as Ρ and ρ
&Sigma; and &sigma; and &sigmaf;......display as Σ and σ and ς
&Tau; and &tau;...........................display as Τ and τ
&Upsilon; and &upsilon;...................display as Υ and υ
&Chi; and &chi;...........................display as Χ and χ
&Phi; and &phi;...........................display as Φ and φ
&Psi; and &psi;...........................display as Ψ and ψ
&Omega; and &omega;.......................display as Ω and ω


I have a little script set up that autoreplaces certain things (like “ ” for " and — for --) so that I can do accents and such on Metafilter. Works nicely.
posted by koeselitz at 10:47 AM on May 27, 2009


Speaking of "illegal characters", I've noticed that post titles have their dashes stripped out, but then the spaces are turned into dashes. For example: "help me with my mother-in-law" turns into "help-me-with-my-motherinlaw", when ""help-me-with-my-mother-in-law" makes the more sense.
posted by philomathoholic at 11:16 AM on May 27, 2009


Meatbomb et al:

The following code will autogenerate a popup window with a large variety of unicode chars. It can be modified to add whatever ranges you're interested in. Then take out all the line breaks and put javascript: at the front and you can turn it into a bookmarklet.

var y=null, x=0, coderanges={};
coderanges['symbols']=['2600','26ff'];
coderanges['greek']=['0374','03ff'];
coderanges['cyrillic']=['0400','04ff'];
coderanges['dingbats']=['2701','27BE'];
var k = window.open('','','width=400,height=600');
k.document.write('<style>h2 {font-family: sans-serif; font-size: small; display: inline; padding: 0.25em; margin: 1em; background: #c90} .h { display: none } .s { height: 500px; overflow-y: auto } td { font-size: xx-large !important; }</style>');
for (y in coderanges) {
k.document.write('<h2 id="' + y + '" onclick="top.opener.show(\''+y+'\')">'+y+'</h2>');
}
for (y in coderanges) {
k.document.write('<div class="h" id="' + y + '-chars"><table><tr>');
cellcount=1;
for (x = parseInt(coderanges[y][0],16); x <= parseInt(coderanges[y][1],16); x++) {
if (cellcount % 8 == 0) {
k.document.write('</tr><tr>');
}
k.document.write('<td>'+ ""+x + ';</td>');
cellcount++;
}
k.document.write('</tr></table></div>');
}
function show(w) {
t=k.document.getElementsByTagName('DIV');
for (x=0; x < t.length; x++) {
t[x].className='h';
}
k.document.getElementById(w+'-chars').className='s';
}

posted by Deathalicious at 11:22 AM on May 27, 2009 [2 favorites]


Your script got munged.

k.document.write('<td>'+ ""+x + ';</td>');

This should have an ampersand and pound sign inside the ""
posted by demiurge at 7:35 PM on May 27, 2009


Deathalicious-- cool stuff. But it stripped out the ampersand and number sign inside the double quotes. Doesn't work without those.
posted by blenderfish at 7:37 PM on May 27, 2009


Darn it! Darn it to heck!

Once more, with feeling:

var y=null, x=0, coderanges={};
coderanges['symbols']=['2600','26ff'];
coderanges['greek']=['0374','03ff'];
coderanges['cyrillic']=['0400','04ff'];
coderanges['dingbats']=['2701','27BE'];
var k = window.open('','','width=400,height=600');
k.document.write('<style>h2 {font-family: sans-serif; font-size: small; display: inline; padding: 0.25em; margin: 1em; background: #c90} .h { display: none } .s { height: 500px; overflow-y: auto } td { font-size: xx-large !important; }</style>');
for (y in coderanges) {
k.document.write('<h2 id="' + y + '" onclick="top.opener.show(\''+y+'\')">'+y+'</h2>');
}
for (y in coderanges) {
k.document.write('<div class="h" id="' + y + '-chars"><table><tr>');
cellcount=1;
for (x = parseInt(coderanges[y][0],16); x <= parseInt(coderanges[y][1],16); x++) {
if (cellcount % 8 == 0) {
k.document.write('</tr><tr>');
}
k.document.write('<td>'+ ""+x + ';</td>');
cellcount++;
}
k.document.write('</tr></table></div>');
}
function show(w) {
t=k.document.getElementsByTagName('DIV');
for (x=0; x < t.length; x++) {
t[x].className='h';
}
k.document.getElementById(w+'-chars').className='s';
}

posted by Deathalicious at 10:28 PM on May 27, 2009


Oh for the love of...administrators, hope me?
posted by Deathalicious at 10:29 PM on May 27, 2009


« Older tag = tags   |   Monkeys and Typewriters Newer »

You are not logged in, either login or create an account to post comments