Punctuation buglet May 26, 2009 1:10 PM
Just a tiny buglet, but while tags can contain non-alphanumeric characters, the search doesn't recognise them...
Tags with non-alphanumeric characters were grandfathered in, and we try to remove or replace those characters when we spot them. We should have an illegal character cleaning party one of these days.
posted by pb (staff) at 1:29 PM on May 26, 2009
"Illegal characters" have a cleaning party?
The party for "character cleaning" is illegal?
This party, it cleans illegal characters?
posted by Night_owl at 1:45 PM on May 26, 2009
He posits the cleaning of illegal characters by illegal characters, you git.
posted by Mister_A at 1:49 PM on May 26, 2009
What's for dinner, mother?
posted by lazaruslong at 1:52 PM on May 26, 2009
"Illegal characters" have a cleaning party?
Like in a Luc Besson film!
posted by Artw at 1:53 PM on May 26, 2009 [2 favorites]
We should have an illegal character cleaning party one of these days.
Sure, that sounds like it would be a lot more fun than a book burning party, but would it really? Because I had a book burning party once and nobody showed up (possibly because it didn't sound very fun). And nobody showed up to my birthday party that year either (probably because it was a surprise party and I was planning it myself so nobody really knew when or where it was). I guess my point is that, looking back on it, becoming a professional party planner was a bad career choice for me.
posted by burnmp3s at 1:59 PM on May 26, 2009
I shouldn't have answered this off the top of my head. I just checked, and apostrophes and underscores are the exceptions to the English alphanumeric rule: they are allowed. I updated the search results page accordingly, so everything should be set. Thanks for letting us know.
posted by pb (staff) at 2:17 PM on May 26, 2009
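A minimal sketch of the rule described above, assuming a tag is legal when it contains only English letters, digits, apostrophes, and underscores. The function names and the regular expression are hypothetical; the site's actual check is not shown in this thread.

```javascript
// Hypothetical check, not the site's real code.
// Assumption: a tag is one token of ASCII letters, digits, apostrophes, and underscores.
const TAG_PATTERN = /^[A-Za-z0-9'_]+$/;

// Returns true when every character in the tag is allowed.
function isLegalTag(tag) {
  return TAG_PATTERN.test(tag);
}

// Lists each distinct character that a cleaning pass would have to remove or replace.
function illegalCharacters(tag) {
  return [...new Set(tag.match(/[^A-Za-z0-9'_]/g) || [])];
}
```

Something like `illegalCharacters` could drive the "cleaning party": run it over the grandfathered tags and review whatever comes back non-empty.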
pb: We should have an illegal character cleaning party one of these days.
*warily eyes accent mark in own first name*
These are not the illegal characters you are looking for.
posted by Kattullus at 2:46 PM on May 26, 2009 [2 favorites]
An illegal character purge would be the ideal Final Solution.
posted by DU at 2:49 PM on May 26, 2009
Can't you just put them in camps? For their own protection, of course..
posted by cj_ at 5:54 PM on May 26, 2009
Illegal characters, a party cleaning.
posted by Fiasco da Gama at 9:15 PM on May 26, 2009 [1 favorite]
oooh!...I found/reported tag non-alpha characters bug. 15th January this year.
posted by bystander at 9:33 PM on May 26, 2009
U+262D is the hammer and sickle... how do I make this display here?
posted by Meatbomb at 4:14 AM on May 27, 2009
From personal experience, Meatbomb, it seems as though the only way to get unicode characters to display here is to use their entity names; I don't think you can use entity codes at all (at least they seem to be stripped out). And I don't think that all unicode characters actually have entity names; I think it's just a limited set.
posted by koeselitz at 7:22 AM on May 27, 2009
♣ ♦ ♥ ♠
Huh. You seem to be right, koeselitz. Funny that if you find the chars elsewhere you can cut and paste them in with no prob...
decodeunicode.org has a very cool interface and seemingly complete list.
But it does not actually display the symbols in a cut and pasteable format...
Anyone have an easier solution?
☯
posted by Meatbomb at 7:52 AM on May 27, 2009
I keep TwitterKeys in my bookmarks bar for those times I need a hammer and sickle character. It's just a popup with a bunch of characters you can cut and paste. It has Twitter in the title, but it works just about anywhere. See also: ⌘C ⌘V Character.
posted by pb (staff) at 8:17 AM on May 27, 2009 [2 favorites]
Here's an old page with the entity names for a basic set of characters that doesn't include things like ☯ and ☭, but has often-useful stuff.
Some character entity names that are sometimes useful:
Punctuation and symbols:
&Acirc;.........displays as Â
&mdash;.........displays as —
&acirc;.........displays as â
&micro;.........displays as µ
&acute;.........displays as ´
&middot;........displays as ·
&AElig;.........displays as Æ
&nbsp;..........displays as a non-breaking space
&aelig;.........displays as æ
&ndash;.........displays as –
&agrave;........displays as à
&not;...........displays as ¬
&Agrave;........displays as À
&amp;...........displays as &
&ntilde;........displays as ñ
&Aring;.........displays as Å
&Ntilde;........displays as Ñ
&aring;.........displays as å
&oacute;........displays as ó
&Atilde;........displays as Ã
&Oacute;........displays as Ó
&atilde;........displays as ã
&ocirc;.........displays as ô
&auml;..........displays as ä
&Ocirc;.........displays as Ô
&Auml;..........displays as Ä
&ograve;........displays as ò
&brvbar;........displays as ¦
&Ograve;........displays as Ò
&ccedil;........displays as ç
&ordf;..........displays as ª
&Ccedil;........displays as Ç
&ordm;..........displays as º
&cedil;.........displays as ¸
&Oslash;........displays as Ø
&cent;..........displays as ¢
&oslash;........displays as ø
&copy;..........displays as ©
&Otilde;........displays as Õ
&curren;........displays as ¤
&otilde;........displays as õ
&dagger;........displays as †
&ouml;..........displays as ö
&Dagger;........displays as ‡
&Ouml;..........displays as Ö
&deg;...........displays as °
&para;..........displays as ¶
&divide;........displays as ÷
&permil;........displays as ‰
&Eacute;........displays as É
&plusmn;........displays as ±
&eacute;........displays as é
&pound;.........displays as £
&Ecirc;.........displays as Ê
&quot;..........displays as "
&ecirc;.........displays as ê
&raquo;.........displays as »
&egrave;........displays as è
&rdquo;.........displays as ”
&Egrave;........displays as È
&reg;...........displays as ®
&eth;...........displays as ð
&rsquo;.........displays as ’
&ETH;...........displays as Ð
&sect;..........displays as §
&Euml;..........displays as Ë
&shy;...........displays as a soft hyphen (normally invisible)
&euml;..........displays as ë
&sup1;..........displays as ¹
&euro;..........displays as €
&sup2;..........displays as ²
&frac12;........displays as ½
&sup3;..........displays as ³
&frac14;........displays as ¼
&szlig;.........displays as ß
&frac34;........displays as ¾
&THORN;.........displays as Þ
&gt;............displays as >
&thorn;.........displays as þ
&iacute;........displays as í
&times;.........displays as ×
&Iacute;........displays as Í
&trade;.........displays as ™
&icirc;.........displays as î
&uacute;........displays as ú
&Icirc;.........displays as Î
&Uacute;........displays as Ú
&iexcl;.........displays as ¡
&ucirc;.........displays as û
&igrave;........displays as ì
&Ucirc;.........displays as Û
&Igrave;........displays as Ì
&ugrave;........displays as ù
&iquest;........displays as ¿
&Ugrave;........displays as Ù
&iuml;..........displays as ï
&uml;...........displays as ¨
&Iuml;..........displays as Ï
&uuml;..........displays as ü
&laquo;.........displays as «
&Uuml;..........displays as Ü
&ldquo;.........displays as “
&yacute;........displays as ý
&lsquo;.........displays as ‘
&Yacute;........displays as Ý
&lt;............displays as <
&yen;...........displays as ¥
&macr;..........displays as ¯
&yuml;..........displays as ÿ
Greek characters:
&Alpha; and &alpha;.........display as Α and α
&Beta; and &beta;...........display as Β and β
&Gamma; and &gamma;.........display as Γ and γ
&Delta; and &delta;.........display as Δ and δ
&Epsilon; and &epsilon;.....display as Ε and ε
&Zeta; and &zeta;...........display as Ζ and ζ
&Eta; and &eta;.............display as Η and η
&Theta; and &theta;.........display as Θ and θ
&Iota; and &iota;...........display as Ι and ι
&Kappa; and &kappa;.........display as Κ and κ
&Lambda; and &lambda;.......display as Λ and λ
&Mu; and &mu;...............display as Μ and μ
&Nu; and &nu;...............display as Ν and ν
&Xi; and &xi;...............display as Ξ and ξ
&Omicron; and &omicron;.....display as Ο and ο
&Pi; and &pi;...............display as Π and π
&Rho; and &rho;.............display as Ρ and ρ
&Sigma; and &sigma; and &sigmaf;....display as Σ and σ and ς
&Tau; and &tau;.............display as Τ and τ
&Upsilon; and &upsilon;.....display as Υ and υ
&Chi; and &chi;.............display as Χ and χ
&Phi; and &phi;.............display as Φ and φ
&Psi; and &psi;.............display as Ψ and ψ
&Omega; and &omega;.........display as Ω and ω
I have a little script set up that autoreplaces certain things (like &ldquo; and &rdquo; for " and &mdash; for --) so that I can do accents and such on Metafilter. Works nicely.
posted by koeselitz at 10:47 AM on May 27, 2009
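A minimal sketch of the kind of autoreplace script described above, assuming it swaps straight double quotes for the &ldquo; and &rdquo; entity names and a double hyphen for &mdash;. The function name `autoreplace` and the alternate open/close rule for quotes are assumptions; the actual script is not shown in this thread.

```javascript
// Hypothetical autoreplace helper, not koeselitz's actual script.
// Assumption: straight double quotes come in pairs, so they alternate open/close.
function autoreplace(text) {
  let open = true;
  return text
    // A double hyphen becomes the em dash entity name.
    .replace(/--/g, "&mdash;")
    // Each straight double quote becomes an opening or closing curly-quote entity in turn.
    .replace(/"/g, () => {
      const entity = open ? "&ldquo;" : "&rdquo;";
      open = !open;
      return entity;
    });
}
```

The alternating rule is deliberately naive: an unbalanced quote earlier in the text flips every later pair, so a real script would probably want something smarter.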
Speaking of "illegal characters", I've noticed that post titles have their dashes stripped out, but then the spaces are turned into dashes. For example: "help me with my mother-in-law" turns into "help-me-with-my-motherinlaw", when "help-me-with-my-mother-in-law" makes more sense.
posted by philomathoholic at 11:16 AM on May 27, 2009
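A rough sketch of the difference described above, assuming the URL slug is built by lowercasing the title, dropping punctuation, and joining words with hyphens. Both function names are hypothetical and the lowercasing step is a guess; the site's real slug code is not shown in this thread.

```javascript
// Hypothetical slug helpers, not the site's real code.

// Reproduces the reported behaviour: hyphens are stripped with the other
// punctuation, then runs of whitespace become single hyphens.
function slugStrippingHyphens(title) {
  return title
    .toLowerCase()
    .replace(/[^a-z0-9\s]/g, "")
    .trim()
    .replace(/\s+/g, "-");
}

// Suggested alternative: keep existing hyphens and treat them as word
// separators, collapsing any mix of spaces and hyphens into one hyphen.
function slugKeepingHyphens(title) {
  return title
    .toLowerCase()
    .replace(/[^a-z0-9\s-]/g, "")
    .trim()
    .replace(/[\s-]+/g, "-")
    .replace(/^-+|-+$/g, "");
}
```

With the title from the comment, the first function yields "help-me-with-my-motherinlaw" and the second yields "help-me-with-my-mother-in-law".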
Meatbomb et al:
The following code will autogenerate a popup window with a large variety of unicode chars. It can be modified to add whatever ranges you're interested in. Then take out all the line breaks and put javascript: at the front and you can turn it into a bookmarklet.
var y=null, x=0, coderanges={};
coderanges['symbols']=['2600','26ff'];
coderanges['greek']=['0374','03ff'];
coderanges['cyrillic']=['0400','04ff'];
coderanges['dingbats']=['2701','27BE'];
var k = window.open('','','width=400,height=600');
k.document.write('<style>h2 {font-family: sans-serif; font-size: small; display: inline; padding: 0.25em; margin: 1em; background: #c90} .h { display: none } .s { height: 500px; overflow-y: auto } td { font-size: xx-large !important; }</style>');
for (y in coderanges) {
  k.document.write('<h2 id="' + y + '" onclick="top.opener.show(\''+y+'\')">'+y+'</h2>');
}
for (y in coderanges) {
  k.document.write('<div class="h" id="' + y + '-chars"><table><tr>');
  cellcount=1;
  for (x = parseInt(coderanges[y][0],16); x <= parseInt(coderanges[y][1],16); x++) {
    if (cellcount % 8 == 0) {
      k.document.write('</tr><tr>');
    }
    k.document.write('<td>'+ ""+x + ';</td>');
    cellcount++;
  }
  k.document.write('</tr></table></div>');
}
function show(w) {
  t=k.document.getElementsByTagName('DIV');
  for (x=0; x < t.length; x++) {
    t[x].className='h';
  }
  k.document.getElementById(w+'-chars').className='s';
}
posted by Deathalicious at 11:22 AM on May 27, 2009 [2 favorites]
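The bookmarklet step described above (take out the line breaks, put javascript: at the front) can be sketched as a small helper. The name `toBookmarklet` is hypothetical. It assumes the source has no `//` line comments and that every statement ends with a semicolon or a brace, as the script above does; otherwise joining the lines would change what the code means.

```javascript
// Hypothetical helper that turns multi-line script source into a bookmarklet URL.
// Assumptions: no "//" comments, and every statement ends in ";" or "}".
function toBookmarklet(source) {
  const oneLine = source
    .split(/\r?\n/)                      // break into lines
    .map((line) => line.trim())          // drop indentation
    .filter((line) => line.length > 0)   // drop blank lines
    .join(" ");                          // rejoin on a single line
  return "javascript:" + oneLine;
}
```

The resulting string goes into the URL field of a browser bookmark.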
Your script got munged.
k.document.write('<td>'+ ""+x + ';</td>');
This should have an ampersand and pound sign inside the ""
posted by demiurge at 7:35 PM on May 27, 2009
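A small sketch of the fix described above: the table cell needs an ampersand and a number sign in front of the code point, so that it becomes a decimal numeric character reference. Building that prefix from character codes is one way to keep it from being swallowed when the script is pasted into a comment box; that workaround and the helper names are assumptions, not something the thread confirms.

```javascript
// Build the two-character prefix from character codes: 38 is "&", 35 is "#".
// Hypothetical workaround so the prefix never appears literally in the source.
const PREFIX = String.fromCharCode(38, 35);

// Returns a decimal numeric character reference for a code point.
function numericReference(codePoint) {
  return PREFIX + codePoint + ";";
}

// The table cell line from the script, with the prefix restored.
function cell(codePoint) {
  return "<td>" + numericReference(codePoint) + "</td>";
}
```

In the script itself, the munged line would then read `k.document.write(cell(x));`.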
Deathalicious-- cool stuff. But it stripped out the ampersand and number sign inside the double quotes. Doesn't work without those.
posted by blenderfish at 7:37 PM on May 27, 2009
Darn it! Darn it to heck!
Once more, with feeling:
var y=null, x=0, coderanges={};
coderanges['symbols']=['2600','26ff'];
coderanges['greek']=['0374','03ff'];
coderanges['cyrillic']=['0400','04ff'];
coderanges['dingbats']=['2701','27BE'];
var k = window.open('','','width=400,height=600');
k.document.write('<style>h2 {font-family: sans-serif; font-size: small; display: inline; padding: 0.25em; margin: 1em; background: #c90} .h { display: none } .s { height: 500px; overflow-y: auto } td { font-size: xx-large !important; }</style>');
for (y in coderanges) {
  k.document.write('<h2 id="' + y + '" onclick="top.opener.show(\''+y+'\')">'+y+'</h2>');
}
for (y in coderanges) {
  k.document.write('<div class="h" id="' + y + '-chars"><table><tr>');
  cellcount=1;
  for (x = parseInt(coderanges[y][0],16); x <= parseInt(coderanges[y][1],16); x++) {
    if (cellcount % 8 == 0) {
      k.document.write('</tr><tr>');
    }
    k.document.write('<td>'+ ""+x + ';</td>');
    cellcount++;
  }
  k.document.write('</tr></table></div>');
}
function show(w) {
  t=k.document.getElementsByTagName('DIV');
  for (x=0; x < t.length; x++) {
    t[x].className='h';
  }
  k.document.getElementById(w+'-chars').className='s';
}
posted by Deathalicious at 10:28 PM on May 27, 2009
Oh for the love of...administrators, hope me?
posted by Deathalicious at 10:29 PM on May 27, 2009
posted by smackfu at 1:19 PM on May 26, 2009