<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

      <title>Comments on: more better tag</title>
      <link>http://metatalk.metafilter.com/21937/more-better-tag/</link>
      <description>Comments on MetaTalk post more better tag</description>
	  	  <pubDate>Sun, 05 Aug 2012 13:29:00 -0800</pubDate>
      <lastBuildDate>Sun, 05 Aug 2012 13:29:00 -0800</lastBuildDate>
      <language>en-us</language>
	  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
	  <ttl>60</ttl>

<item>
  	<title>more better tag</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag</link>	
  	<description>Can we get a frequency count for users following / tracking tags? &lt;br /&gt;&lt;br /&gt; Since My Mefi allows users to filter by tags, it seems like a useful feature to know how popular tags are. We already have the tag cloud, but I&apos;m more curious about the middle-range of tags. Not that I plan to spam the most popular tags, but knowing for example, if &quot;ubuntu&quot; is a more followed tag than just &quot;linux&quot; would be useful when deciding how to tag a post about an Ubuntu announcement or something. 

I realize there&apos;s probably some privacy concerns about the data, but I think a few heuristics could resolve that (ie, drop all tags below a threshold from the count).

Thoughts?</description>
  	<guid isPermaLink="false">post:metatalk.metafilter.com,2012:site.21937</guid>
  	<pubDate>Sun, 05 Aug 2012 13:24:30 -0800</pubDate>
  	<dc:creator>pwnguin</dc:creator>
</item>
<item>
  	<title>By: axiom</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013437</link>	
  	<description>Well, this isn&apos;t exactly what you&apos;re asking for (it&apos;s not following data, rather usage), but tag popularity came up &lt;a href=&quot;http://metatalk.metafilter.com/21924/How-does-the-Related-Posts-widget-work#1012791&quot;&gt;the other day&lt;/a&gt;. It&apos;s in the infodump. I made &lt;a href=&quot;http://imgur.com/a/0wzev&quot;&gt;graphs&lt;/a&gt;.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013437</guid>
  	<pubDate>Sun, 05 Aug 2012 13:29:00 -0800</pubDate>
  	<dc:creator>axiom</dc:creator>
</item>
<item>
  	<title>By: pb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013441</link>	
  	<description>I&apos;ll need to discuss this with everyone before I run any numbers. We&apos;ve had a lot of concern about tags and privacy in the past as you mention, and My MeFi is a private section of the site. So we&apos;ll give it some thought.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013441</guid>
  	<pubDate>Sun, 05 Aug 2012 13:36:41 -0800</pubDate>
  	<dc:creator>pb</dc:creator>
</item>
<item>
  	<title>By: axiom</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013442</link>	
  	<description>There are a &lt;i&gt;lot&lt;/i&gt; of tags, too: 149242 unique tags for the blue, and 127640 for the green (I didn&apos;t run the others dumps since they&apos;re comparatively tiny, but I can if you really want to know). It&apos;s power law, though, so the vast majority of tags hardly get used at all -- i.e., the most popular 1000 tags get used roughly 100+ times, while the other hundred-plus thousand get used less than a hundred times apiece.&lt;br&gt;
&lt;br&gt;
But what&apos;s to stop you from tagging your hypothetical post with both Ubuntu and linux?</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013442</guid>
  	<pubDate>Sun, 05 Aug 2012 13:44:26 -0800</pubDate>
  	<dc:creator>axiom</dc:creator>
</item>
<item>
  	<title>By: Tell Me No Lies</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013450</link>	
  	<description>I ran the numbers a few years ago and it came out to roughly 5% of the tags being used with any regularity.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013450</guid>
  	<pubDate>Sun, 05 Aug 2012 14:45:14 -0800</pubDate>
  	<dc:creator>Tell Me No Lies</dc:creator>
</item>
<item>
  	<title>By: Tell Me No Lies</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013451</link>	
  	<description>Oh, and just for reference linux ranks 142 and the most used tags list and ubuntu ranks 694.&lt;br&gt;
&lt;br&gt;
If would be nice if one of the mods could run distributions on the tags followed vs. tags used.  If there is a correlation then this whole request is moot.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013451</guid>
  	<pubDate>Sun, 05 Aug 2012 14:59:51 -0800</pubDate>
  	<dc:creator>Tell Me No Lies</dc:creator>
</item>
<item>
  	<title>By: pwnguin</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013453</link>	
  	<description>&lt;em&gt;But what&apos;s to stop you from tagging your hypothetical post with both Ubuntu and linux?&lt;/em&gt;&lt;br&gt;
&lt;br&gt;
Basically it comes down to two things: what tags should I follow, and what tags should I apply to a post? There&apos;s both concord and conflict between these two problems; if no tags are applied to a post, then there&apos;s no filter to be constructed. On the other hand, placing every word in the post as a tag* would nearly solve the poster&apos;s problem but is bound to be much noisier. So the challenge is finding some threshold for useful tags without overwhelming anyone&apos;s filter (or mefi&apos;s DB indexing).&lt;br&gt;
&lt;br&gt;
There&apos;s currently very little feedback at the moment for posters. Let&apos;s use &lt;a href=&quot;http://www.metafilter.com/118601/Municipal-Bankruptcy-in-the-United-States&quot;&gt;this post&lt;/a&gt; as an example, since my last one failed a bit. There&apos;s ten tags there that are only used by this post. By far the largest tag used is &quot;city&quot;, which has many many hits but I doubt there&apos;s many followers who aren&apos;t also following &quot;new york&quot;. And &quot;economic&quot; rather than economics, because it&apos;s part of the phrase economic downturn. It&apos;s an interesting post to me, but fell through my fairly large set of general filters. the man of twists and turns and I failed to come up with a common tag when we should have.&lt;br&gt;
&lt;br&gt;
Although, just now I did have an amusing idea. What every tag boils down to is an agreement between two (or more) parties on whether it belongs or not. And there&apos;s a &lt;a href=&quot;http://gwap.com&quot;&gt;flash game site&lt;/a&gt; that does this sort of thing for music, images, and video. Perhaps we could have mefi the tagging game some day, though I suspect not. Still would be useful to validate the current model.&lt;br&gt;
&lt;br&gt;
* I vaguely recall there being a limit of 20 tags?</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013453</guid>
  	<pubDate>Sun, 05 Aug 2012 15:28:22 -0800</pubDate>
  	<dc:creator>pwnguin</dc:creator>
</item>
<item>
  	<title>By: pb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013457</link>	
  	<description>&lt;em&gt;I vaguely recall there being a limit of 20 tags?&lt;/em&gt;&lt;br&gt;
&lt;br&gt;
Just looked&amp;mdash;we have a limit of 50 tags.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013457</guid>
  	<pubDate>Sun, 05 Aug 2012 15:55:16 -0800</pubDate>
  	<dc:creator>pb</dc:creator>
</item>
<item>
  	<title>By: unliteral</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013461</link>	
  	<description>Do you have a character limit too? I seem to recall not being able to have all the tags I wanted even though I don&apos;t think there were 50.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013461</guid>
  	<pubDate>Sun, 05 Aug 2012 17:36:42 -0800</pubDate>
  	<dc:creator>unliteral</dc:creator>
</item>
<item>
  	<title>By: pb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013462</link>	
  	<description>You&apos;re right unliteral, I looked too quickly. We have a 50 character limit on the length of a single tag. There&apos;s no limit on the number of tags you can add. I misread it a bit ago.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013462</guid>
  	<pubDate>Sun, 05 Aug 2012 17:52:29 -0800</pubDate>
  	<dc:creator>pb</dc:creator>
</item>
<item>
  	<title>By: Night_owl</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013463</link>	
  	<description>Is there a character limit in the tag input box on the &quot;post&quot; pages?</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013463</guid>
  	<pubDate>Sun, 05 Aug 2012 18:15:29 -0800</pubDate>
  	<dc:creator>Night_owl</dc:creator>
</item>
<item>
  	<title>By: pb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013464</link>	
  	<description>No, there&apos;s no limit there.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013464</guid>
  	<pubDate>Sun, 05 Aug 2012 18:22:20 -0800</pubDate>
  	<dc:creator>pb</dc:creator>
</item>
<item>
  	<title>By: Blasdelb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013482</link>	
  	<description>&quot;&lt;em&gt;Just looked&#8212;we have a limit of 50 tags.&lt;/em&gt;&quot;&lt;br&gt;
&lt;br&gt;
If you do its not working as &lt;a href=&quot;http://www.metafilter.com/113930/You-know-what-every-kitchen-needs-A-Bloonderbooss-or-a-Boomashootn-and-Swedish-Chef-shows-us-why&quot;&gt;this post has 140&lt;/a&gt;, &lt;a href=&quot;http://www.metafilter.com/117898/What-divorced-readers-did-with-their-wedding-rings&quot;&gt;this post has 56&lt;/a&gt;, &lt;a href=&quot;http://www.metafilter.com/113750/How-Corporations-Corrupt-Science-at-the-Publics-Expense&quot;&gt;this post has 71&lt;/a&gt;, and &lt;a href=&quot;http://www.metafilter.com/103049/Vanguard-of-American-Journalism&quot;&gt;this post has 103&lt;/a&gt;.&lt;br&gt;
&lt;br&gt;
To be fair they&apos;re pretty epic posts.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013482</guid>
  	<pubDate>Mon, 06 Aug 2012 00:57:39 -0800</pubDate>
  	<dc:creator>Blasdelb</dc:creator>
</item>
<item>
  	<title>By: pb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013505</link>	
  	<description>Yeah, there&apos;s no limit on the number of tags you can add. I misread the code.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013505</guid>
  	<pubDate>Mon, 06 Aug 2012 06:40:35 -0800</pubDate>
  	<dc:creator>pb</dc:creator>
</item>
<item>
  	<title>By: Blasdelb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013517</link>	
  	<description>I wonder, is there an easy way to figure out which posts have the most tags?</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013517</guid>
  	<pubDate>Mon, 06 Aug 2012 07:29:47 -0800</pubDate>
  	<dc:creator>Blasdelb</dc:creator>
</item>
<item>
  	<title>By: pb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013535</link>	
  	<description>We don&apos;t provide any tools for that. But &lt;a href=&quot;http://stuff.metafilter.com/infodump/&quot;&gt;the infodump&lt;/a&gt; knows.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013535</guid>
  	<pubDate>Mon, 06 Aug 2012 08:44:11 -0800</pubDate>
  	<dc:creator>pb</dc:creator>
</item>
<item>
  	<title>By: pb</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013589</link>	
  	<description>We discussed this and decided to share the data. They&apos;re aggregate stats and aren&apos;t linked to any specific user. And I only included tags that have two or more matches because it&apos;s more likely that single-use tags could be tied to a specific user. With that out of the way, here are the numbers: &lt;br&gt;
&lt;br&gt;
&lt;a href=&quot;http://ask.metafilter.com/home/myask&quot;&gt;My Ask MeFi&lt;/a&gt; started in April 2008 and 2,625 members have set preferences for this feature. Here are the &lt;a href=&quot;http://cdn.mefi.us/images/metatalk/top-my-ask-tags.txt&quot;&gt;top tags at My Ask&lt;/a&gt;. &lt;br&gt;
&lt;br&gt;
&lt;a href=&quot;http://www.metafilter.com/home/mymefi&quot;&gt;My MeFi&lt;/a&gt; started in August 2011 and 505 members have set preferences there. Here are the &lt;a href=&quot;http://cdn.mefi.us/images/metatalk/top-my-mefi-tags.txt&quot;&gt;top tags at My MeFi&lt;/a&gt;. You can also specify tags to exclude at My MeFi. Here are the &lt;a href=&quot;http://cdn.mefi.us/images/metatalk/top-my-mefi-excluded-tags.txt&quot;&gt;top excluded tags at My MeFi&lt;/a&gt;.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013589</guid>
  	<pubDate>Mon, 06 Aug 2012 10:04:02 -0800</pubDate>
  	<dc:creator>pb</dc:creator>
</item>
<item>
  	<title>By: axiom</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013799</link>	
  	<description>That&apos;s pretty interesting. The excluded list is so short!&lt;br&gt;
&lt;br&gt;
I can&apos;t believe there are mefites excluding the &lt;code&gt;cats&lt;/code&gt; tag. That&apos;s not a bannable offense?</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013799</guid>
  	<pubDate>Mon, 06 Aug 2012 20:57:13 -0800</pubDate>
  	<dc:creator>axiom</dc:creator>
</item>
<item>
  	<title>By: pwnguin</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013808</link>	
  	<description>Awesome! Obviously MyMefi is just one use case for tags, but it&apos;s interesting how it gets used or misused. Since it was implied that the distributions should be the same, taking a quick look at the AskMefi dataset, here&apos;s a few observations: &lt;br&gt;
&lt;br&gt;
The common ontology problem of pluralization strikes: &quot;book&quot; is used 1215 times without the tag &quot;books&quot;, but only 2 people subscribe to &quot;book.&quot; Given there&apos;s only ten posts tagged &quot;mac&quot; and &quot;book, I&apos;m leaning towards user error than a heavy antimac spam preference. &lt;br&gt;
&lt;br&gt;
Moving and apartments are popular tags, but few people subscribe to them.&lt;br&gt;
&lt;br&gt;
Similarly, 761 posts tagged Christmas, but no subscribers at all. Also completely unwatched is help, microsoft, office, online, and pain. I&apos;m sure there&apos;s a consulting business in there somewhere.&lt;br&gt;
&lt;br&gt;
The most popular single letter tag is C. N half as popular, and Y registers two followers.&lt;br&gt;
&lt;br&gt;
Guitar is a crazy popular tag even though it&apos;s rarely used. Similarly, as one might expect given the popularity of the site among ref librarians, answering questions about literature is more popular than asking them. &lt;br&gt;
&lt;br&gt;
The most frequent tag separation errors are writing and sex. Tags are separated by space, so if your subscription is &quot;writing, sex&quot; you get &quot;writing,&quot; and &quot;sex&quot;. Usually it&apos;s not a common thing, but I did see more people subscribe to feminism with a comma than without!&lt;br&gt;
&lt;br&gt;
Maybe fishbike will come along and do a more robust analysis of things like rank variation between datasets, but it&apos;s pretty clear they&apos;re not the same distribution.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013808</guid>
  	<pubDate>Mon, 06 Aug 2012 22:16:47 -0800</pubDate>
  	<dc:creator>pwnguin</dc:creator>
</item>
<item>
  	<title>By: unliteral</title>
  	<link>http://metatalk.metafilter.com/21937/more-better-tag#1013812</link>	
  	<description>The information about the PVRblog at &lt;a href=&quot;http://metafilter.net/&quot;&gt;metafilter.net&lt;/a&gt; is out of date. It is no longer a Retired Project.</description>
  	<guid isPermaLink="false">comment:metatalk.metafilter.com,2012:site.21937-1013812</guid>
  	<pubDate>Mon, 06 Aug 2012 22:21:20 -0800</pubDate>
  	<dc:creator>unliteral</dc:creator>
</item>

    </channel>
</rss>