{"id":237320,"date":"2024-06-27T22:29:46","date_gmt":"2024-06-27T22:29:46","guid":{"rendered":"https:\/\/michigandigitalnews.com\/index.php\/2024\/06\/27\/university-examiners-fail-to-spot-chatgpt-answers-in-real-world-test\/"},"modified":"2025-06-25T17:15:56","modified_gmt":"2025-06-25T17:15:56","slug":"university-examiners-fail-to-spot-chatgpt-answers-in-real-world-test","status":"publish","type":"post","link":"https:\/\/michigandigitalnews.com\/index.php\/2024\/06\/27\/university-examiners-fail-to-spot-chatgpt-answers-in-real-world-test\/","title":{"rendered":"University examiners fail to spot ChatGPT answers in real-world test"},"content":{"rendered":"<div id=\"\">\n<figure class=\"ArticleImage\">\n<div class=\"Image__Wrapper\"><img fetchpriority=\"high\" decoding=\"async\" class=\"Image\" width=\"1350\" height=\"900\" alt=\"\" src=\"https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg\" sizes=\"(min-width: 1288px) 837px, (min-width: 1024px) calc(57.5vw + 55px), (min-width: 415px) calc(100vw - 40px), calc(70vw + 74px)\" srcset=\"https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=300 300w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=400 400w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=500 500w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=600 600w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=700 700w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=800 800w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=837 837w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=900 900w, 
https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1003 1003w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1100 1100w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1200 1200w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1300 1300w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1400 1400w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1500 1500w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1600 1600w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1674 1674w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1700 1700w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1800 1800w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=1900 1900w, https:\/\/images.newscientist.com\/wp-content\/uploads\/2024\/06\/25114153\/SEI_209782933.jpg?width=2006 2006w\" loading=\"eager\" fetchpriority=\"high\" data-image-context=\"Article\" data-image-id=\"2436938\" data-caption=\"Exams taken in person make it harder for students to cheat using AI\" data-credit=\"Trish Gant \/ Alamy\"\/><\/div><figcaption class=\"ArticleImageCaption\">\n<div class=\"ArticleImageCaption__CaptionWrapper\">\n<p class=\"ArticleImageCaption__Title\">Exams taken in person make it harder for students to cheat using AI<\/p>\n<p class=\"ArticleImageCaption__Credit\">Trish Gant \/ Alamy<\/p>\n<\/div>\n<\/figcaption><\/figure>\n<\/p>\n<p>Ninety-four per cent of university exam submissions created using ChatGPT weren\u2019t detected as 
being generated by artificial intelligence, and these submissions tended to get higher scores than real students\u2019 work.<\/p>\n<p><a href=\"https:\/\/www.reading.ac.uk\/pcls\/staff\/peter-scarfe\">Peter Scarfe<\/a> at the University of Reading, UK, and his colleagues used ChatGPT to produce answers to 63 assessment questions on five modules across the university\u2019s psychology undergraduate degrees. Students sat these exams at home, so they were allowed to look at notes and references, and they could potentially have used AI although this wasn\u2019t permitted.<\/p>\n<p>The AI-generated answers were submitted alongside real students\u2019 work, and accounted for, on average, 5 per cent of the total scripts marked by academics. The markers weren\u2019t informed that they were checking the work of 33 fake students \u2013 whose names were themselves generated by ChatGPT.<\/p>\n<p>The assessments included two types of questions: short answers and longer essays. The prompts given to ChatGPT began with the words \u201cIncluding references to academic literature but not a separate reference section\u201d, then copied the exam question.<\/p>\n<p><span class=\"js-content-prompt-opportunity\"\/><\/p>\n<p>Across all modules, only 6 per cent of the AI submissions were flagged as potentially not being a student\u2019s own work \u2013 though in some modules, no AI-generated work was flagged as suspicious. \u201cOn average, the AI responses gained higher grades than our real student submissions,\u201d says Scarfe, though there was some variability across modules.<\/p>\n<p>\u201cCurrent AI tends to struggle with more abstract reasoning and integration into information,\u201d he adds. But across all 63 AI submissions, there was an 83.4 per cent chance that the AI work outscored that of the students.<\/p>\n<p>The researchers claim that their work is the largest and most robust study of its kind to date. 
Although the study only checked work on the University of Reading\u2019s psychology degree, Scarfe believes it is a concern for the whole academic sector. \u201cI have no reason to think that other subject areas wouldn\u2019t have just the same kind of issue,\u201d he says.<\/p>\n<p>\u201cThe results show exactly what I\u2019d expect to see,\u201d says <a href=\"https:\/\/thomaslancaster.co.uk\/\">Thomas Lancaster<\/a> at Imperial College London. \u201cWe know that generative AI can produce reasonable sounding responses to simple, constrained textual questions.\u201d He points out that unsupervised assessments including short answers have always been susceptible to cheating.<\/p>\n<p>The workload for academics expected to mark work also doesn\u2019t help their ability to pick up AI fakery. \u201cTime-pressured markers of short answer questions are highly unlikely to raise AI misconduct cases on a whim,\u201d says Lancaster. \u201cI am sure this isn\u2019t the only institution where this is happening.\u201d<\/p>\n<p>Tackling it at source is going to be near-impossible, says Scarfe. So the sector must instead reconsider what it is assessing. 
\u201cI think it\u2019s going to take the sector as a whole to acknowledge the fact that we\u2019re going to have to be building AI into the assessments we give to our students,\u201d he says.<\/p>\n<\/div>\n<p><a href=\"https:\/\/www.newscientist.com\/article\/2436888-university-examiners-fail-to-spot-chatgpt-answers-in-real-world-test\/?utm_campaign=RSS%7CNSNS&#038;utm_source=NSNS&#038;utm_medium=RSS&#038;utm_content=home\">Source link<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Exams taken in person make it harder for students to cheat using AI Trish Gant \/ Alamy Ninety-four per cent of university exam submissions<\/p>\n","protected":false},"author":1,"featured_media":237321,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_uf_show_specific_survey":0,"_uf_disable_surveys":false,"footnotes":""},"categories":[177],"tags":[],"_links":{"self":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/237320"}],"collection":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/comments?post=237320"}],"version-history":[{"count":0,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/237320\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media\/237321"}],"wp:attachment":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media?parent=237320"}],"wp:term":[{"taxonomy"
:"category","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/categories?post=237320"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/tags?post=237320"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}