Cheating & Plagiarism | Research

Wikipedia Tops List of Plagiarized Sources

Where are students finding the materials they plagiarize in their papers? According to a new study, WIkipedia tops the list for both secondary and college students. But as a category, encyclopedia sites are among the least popular sources, coming in behind four other types of information outlets, including both academic sites and paper mills.

The study, Plagiarism and the Web: A Comparison of Internet Sources for Secondary and Higher Education Students, analyzed more than 33.5 million papers--about 9 million from secondary students and 24 million for post-secondary students--submitted to the Turnitin service from iParadigms over a one-year period (June 2010 to June 2011). In those papers, iParadigms' researchers found 128 million "content matches" from a wide variety of Web sources.

Top Individual Sources Plagiarized by Students
Among papers from secondary students, Wikipedia was used in 7.99 percent of the cases of matched text, just beating out Yahoo Answers, which came in at 7.55 percent. The remainder of the top five individual sources for plagiarism among secondary students included Answers.com (3.37 percent), eNotes (2.9 percent), and Slideshare (2.38 percent).

Among papers submitted by students in higher education, Wikipedia was by far the most plagiarized individual site, at 10.74 percent. Yahoo Answers was a distant second at 3.9 percent. Slideshare came in a close third at 3.87 percent, Answers.com at 3.57 percent, and Appapers.com at 3.11 percent.

Top Categories for Plagiarism
But surprisingly, encyclopedia sites were not at the top of the list of broader categories for plagiarized material. According to the report, the bulk of "matched content" in papers came from social and content sharing sites--31 percent in secondary education, 26 percent in higher education. Sites in this category included Facebook, Yahoo Answers, Answers.com, SlideShare, and others.

"By a margin of 5 percent, secondary students rely more on social and content sharing sites," according to the report. "The most obvious conclusion to draw is that younger students do not have as sound an understanding of what constitutes a proper source for written work. Instead, they are sourcing material from familiar sites."

Homework and academic sites came in second, at 23 percent among secondary students and 22 percent among students in higher education. Researchers described this category of sites as "academic, educational, and homework help sites that offer legitimate educational content. Sites such as nih.gov, medlibrary.org, coursehero.com, and bookrags.com are included in this category."

Cheat sites and paper mills made up the third most popular category for plagiarists, according to the report. These were more popular among students in higher education, at 20 percent, than secondary students, at 14 percent.

Traditional online publications (magazine and news sites) came in fourth at 12 percent among secondary students and 17 percent among higher education students.

And encyclopedias (including Wikipedia, Britannica, and Encyclopedia.com) rounded out the top five at 11 percent in secondary education and 12 percent in post-secondary education.

Other sources included shopping and review sites, accounting for 9 percent of content matches among papers from secondary students and 3 percent from college students.

A Call to Action
The report offered three suggestions to help curtail plagiarism, including:

  1. Designing writing assignments that focus on personal experience or current events and that are submitted in stages;
  2. Instructing students in proper citation; and
  3. Using plagiarism detection in a formative manner to allow students to see where their references are improperly cited and to make those corrections before the final paper is submitted.

The complete 2011 Plagiarism and the Web report is freely available as a PDF via Turnitin.com.

About the Author

David Nagel is the executive producer for 1105 Media's online K-12 and higher education publications and electronic newsletters. He can be reached at dnagel@1105media.com. He can now be followed on Twitter at http://twitter.com/THEJournalDave (K-12) or http://twitter.com/CampusTechDave (higher education).

Comments

Sun, Dec 18, 2011 C. Mytko

The funny thing is - the content on Wikipedia is available under a Creative Commons Attribution-ShareAlike 3.0 license. If the kids learned how to attribute it properly, it wouldn't even be plagiarism! I, too, used to forbid Wikipedia as a source in my classroom. But now I tell the kids to start there and use it as a lesson in determining validity and backing up information with secondary sources.

Sat, Nov 12, 2011 Michael Mississippi

Students are going to use Wikipedia so I address it directly when I discuss papers and plagiarism. I tell them Wikipedia is a fine place to START researching a topic, but it cannot be where they finish. Wikipedia remains inconsistently accurate, but it is a pretty good place to begin exploring a topic. I stress that the best way to use the site is to read Wikipedia as a way of generating ideas, and then to use the references on each page to begin their actual research.

Tue, Nov 8, 2011 Margaret

With middle school students, I share a variety of sites with them that I think will help them in their research. Often I will do this during class. I also teach them about how to find out who is sponsoring some of these sites and how that can negatively impact their research. Also, I have students come to my classroom to use books that I have gathered for them. WIth my college students, I use some of the same techniques.

Tue, Nov 8, 2011 Margaret

With middle school students, I share a variety of sites with them that I think will help them in their research. Often I will do this during class. I also teach them about how to find out who is sponsoring some of these sites and how that can negatively impact their research. Also, I have students come to my classroom to use books that I have gathered for them. WIth my college students, I use some of the same techniques.

Tue, Nov 8, 2011

When you tell students they can't use Wikipedia, it simply means they won't cite it. Trust me, they are still using it. When you require that they use at least one book, they come racing into the library the day the paper is due and add the first book they find to their bibliography, but the bulk of the information is still from Wikipedia and other questionable sources. We need to do far more than tell them not to use it if we expect to address the problem.

Tue, Nov 8, 2011 Marsha Claus Calhoun, GA

While obtaining my M.A., we were forbidden to use Wikipedia due to the nature of the website. I now forbid my students to use the site as well. Many times I have looked up items and found the informatin on Wikipedia to be incorrect.

Thu, Nov 3, 2011 Gregory Kohs Pennsylvania

Below comment should read "below", rather than "above". Suggestion box -- allow comments to be edited for 5 minutes by those who post them.

Thu, Nov 3, 2011 Gregory Kohs Pennsylvania

Above comment should read "their site", rather than "their".

Thu, Nov 3, 2011 Gregory Kohs Pennsylvania

David, have you never looked at Answers.com? I'd estimate that 80% of the content on their is simply copied from Wikipedia. Example: http://www.answers.com/topic/monarch-butterfly So, your report here, if anything, underestimates the problem of plagiarism from Wikipedia.

Add your Comment

Your Name:(optional)
Your Email:(optional)
Your Location:(optional)
Comment:
Please type the letters/numbers you see above

White Papers:

  • Dallas County School District Overcomes Coverage Gaps with MOTOTRBO Digital Radios PDF screen shot

    Dallas County School District needed to quickly resolve their radio coverage gaps throughout the county for more efficient transportation communication and to ensure student safety. Download this whitepaper to see how this district has found their solution with improved coverage area, clear audio and private communications, in addition to improved efficiency and student safety. Read more...