Dan Bowen and Ray Fleming
27 June 2024
Content warning! This episode talks about an academic research paper titled "ChatGPT is bulls**t", and we've not edited the word out - in fact, we've gone to town with it, talking about the different types of it (in the strictest academic sense). So you may not want to play this in the car on your school run! The news item discussed is: Student crafts elaborate AI scheme to pass university exam, gets arrested https://cybernews.com/news/turkish-student-found-using-ai-arrested/ This week's papers discussed are: Developing evaluative judgement for a time of generative artificial intelligence https://www.tandfonline.com/doi/full/10.1080/02602938.2024.2335321 Prompting Large Language Models for Zero-shot Essay Scoring via Multi-trait Specialization https://arxiv.org/abs/2404.04941 Working Alongside, Not Against, AI Writing Tools in the Composition Classroom: a Dialectical Retrospective https://uen.pressbooks.pub/teachingandgenerativeai/chapter/working-alongside-not-against-ai-writing-tools-in-the-composition-classroom-a-dialectical-retrospective/ GPT versus Resident Physicians — A Benchmark Based on Official Board Scores https://ai.nejm.org/doi/pdf/10.1056/AIdbp2300192 Evaluating General Vision-Language Models for Clinical Medicine https://www.medrxiv.org/content/10.1101/2024.04.12.24305744v1 Re-evaluating GPT-4's bar exam performance https://link.springer.com/article/10.1007/s10506-024-09396-9 Automated Social Science: Language Models as Scientist and Subjects https://arxiv.org/abs/2404.11794 Large language models cannot replace human participants because they cannot portray identity groups https://arxiv.org/abs/2402.01908 I also mentioned the article about the the Infinite Focus Group, which is here: https://www.ai-mindset.ai/ai-mindset-newsletter/the-infinite-focus-group The impact of large language models on university students' literacy development https://www.tandfonline.com/doi/epdf/10.1080/07294360.2024.2332259?needAccess=true Do teachers spot AI? Evaluating the detectability of AI-generated texts among student essays https://www.sciencedirect.com/science/article/pii/S2666920X24000109 Feedback sources in essay writing: peer-generated or AI-generated feedback? https://link.springer.com/article/10.1186/s41239-024-00455-4 ChatGPT is b******t https://link.springer.com/epdf/10.1007/s10676-024-09775-5?sharing_token=0CIhP_zo5-plierRq8kkDPe4RwlQNchNByi7wbcMAY77xTOWyddkW01qGFs1m5zuuoZGBctVlsJF8SbYqcxWi-XzgEYEPiw7xwWi4bMYXJ_1JARDrER9JGdWZOW-UGSkrk_tXPjPh-XWvFNoiFzNlnDUUUEBAztiX9PtP2p6jfI%3D
0:00 / 0:00