2024-01-10
A few weeks ago I spoke with journalist Stephen Harrison for his article “Is copying from Wikipedia plagiarism?”
For those interested, I provide the full interview below with Stephen’s permission.
There are two overlapping and complementary frameworks involved: legal and scholarly.
Legally, Wikipedia allows users to use Wikipedia’s exclusive right to copy, modify, or publish Wikipedia content if the users attribute the source and share the result under the same terms. Wikipedia content is not “public domain” but licensed via the “Creative Commons Attribution-ShareAlike” license.
Academic conventions require that scholars attribute others’ ideas via citation and others’ prose via quote marks.
For example, on page 81 of Oxman’s thesis, she took over 200 words verbatim from the Wikipedia article “Weaving” without attribution or quotation. This is both an infringement of Wikipedia’s copyright and academic plagiarism.
Wikipedia has an article, “Plagiarism from Wikipedia,” that includes incidences from 2008 through to Oxman (added by a Wikipedian on January 06).
The fact that AI is doing this does not mean it is without issue, nor does it obviate centuries of legal and scholarly rules. Copyright owners are suing AI companies presently, and we would be better served by AI that could identify the source of its claims for many reasons.
Wikipedia is not public domain.
I have been disappointed by the conversations about this issue because of the confusion of terms — much of which is purposeful to muddy the waters.
For example, critiques of Neri Oxman, formerly of MIT, and Claudine Gay, of Harvard, have stated they “plagiarized” their dissertations. This implies they stole the central ideas of their dissertations. I don’t think this is the case. Instead, their dissertations contain plagiarized prose. This is a lesser, but still significant infraction.
You are right; different fields have different conventions and norms associated with citations and quotations. Even so, journalism is still concerned with ensuring that attributions are honest and accurate.
In academia, citing an encyclopedia as an authoritative source on a topic has never been appropriate. Wikipedia doesn’t change that. In fact, since at least 2006, the prominent “Cite this article” link in Wikipedia’s sidebar takes you to a page warning you about doing so, but if you do, it provides conveniently formatted citations in the major formats (e.g., APA, Chicago, and AMA).
I suppose a singular author might be more upset to see their prose used elsewhere and misattributed, but that doesn’t change the legal or scholarly issues.
There are degrees of plagiarism: are ideas or words being taken without attribution, what is the extent of the plagiarism, and was it the result of intentional action or carelessness?
Tools that can easily detect plagiarism have become a weapon, as we’ve seen in the cases of Claudine Gay and Neri Oxman. There’s little evidence of this presently, but I would hope sensible people can distinguish between inevitable mistakes of copy-editing, of taking of others words out of laziness, and taking credit for others novel ideas.
The word and concern goes back to the Romans — or so Wikipedia tells me! Anthony Grafton (1997) has an excellent book on one mechanism scholars have adopted for attribution: The Footnote: A Curious History. Laurie Stearns (1999) also has an excellent chapter, “Copy Wrong: Plagiarism And Copyright Law,” in a collection on the topic. More recently, Susan Blum (2009) wrote an ethnography of attitudes about plagiarism on campus, My word! Plagiarism and College Culture. Unfortunately, I don’t recall if they spoke to the history of academic integrity policies at universities, but such policies have existed since I was a student decades ago.