<?xml version="1.0" encoding="iso-8859-1"?>

<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
<title type="text">Joseph Reagle</title>
<subtitle type="html"><![CDATA[
Open Communities, Media, Source, and Standards
]]></subtitle>
<id>http://reagle.org/joseph/blog/method/swartz-outsider-contributions</id>
<link rel="alternate" type="text/html" href="http://reagle.org/joseph/blog" />
<link rel="self" type="application/atom+xml" href="http://reagle.org/joseph/blog/method/swartz-outsider-contributions?flav=atom" />


<author>
<name>Joseph Reagle</name>
<uri>http://reagle.org/joseph/blog/method/swartz-outsider-contributions</uri>
<email></email>
</author>
<rights>Copyright 2003-2010 Joseph Reagle</rights>
<generator uri="http://pyblosxom.sourceforge.net/" version="1.4.3 01/10/2008">
PyBlosxom http://pyblosxom.sourceforge.net/ 1.4.3 01/10/2008
</generator>

<updated>2006-09-04T15:08:36Z</updated>
<!-- icon?  logo?  -->

<entry>
<title type="html">Outsider Contributions</title>
<category term="" />
<id>http://reagle.org/joseph/blog/2006/09/04/swartz-outsider-contributions</id>
<updated>2006-09-04T15:08:36Z</updated>
<published>2006-09-04T15:08:36Z</published>
<link rel="alternate" type="text/html" href="http://reagle.org/joseph/blog/method/swartz-outsider-contributions.html" />
<content type="html">

&lt;p&gt;When I make a substantive contribution to Wikipedia, I tend to edit
&quot;off-line&quot; until I&apos;m satisfied with the text, and then post it in a single
chunk. While I am only a WikiGnome in any case, the typical Wikipedia metric
of &quot;edit counts&quot; would underestimate the contribution made by people who edit
in a similar fashion. My own &lt;a
href=&quot;http://reagle.org/joseph/blog/method/wikipedia-history-scraping?showcomments=yes&quot;&gt;simple
Python script&lt;/a&gt; exhibits this problem. To get some sense of the substance
of any given edit, one would have to go beyond screen-scraping and perform
analysis on the Wikipedia database -- something beyond my desktop computer.
Fortunately, Aaron Swartz purchased &quot;some time on a computer cluster&quot; and
came up with the following &lt;a
href=&quot;http://www.aaronsw.com/weblog/whowriteswikipedia&quot;&gt;novel result&lt;/a&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;When you put it all together, the story become clear: an outsider makes
  one edit to add a chunk of information, then insiders make several edits
  tweaking and reformatting it. In addition, insiders rack up thousands of
  edits doing things like changing the name of a category across the entire
  site -- the kind of thing only insiders deeply care about. As a result,
  insiders account for the vast majority of the edits. But it&apos;s the outsiders
  who provide nearly all of the content.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I&apos;m looking forward to seeing these findings replicated.&lt;/p&gt;
</content>
</entry>
</feed>
