Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Hicke, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11203  [pdf

    cs.CY cs.CL

    The Life Cycle of Large Language Models: A Review of Biases in Education

    Authors: Jinsook Lee, Yann Hicke, Renzhe Yu, Christopher Brooks, René F. Kizilcec

    Abstract: Large Language Models (LLMs) are increasingly adopted in educational contexts to provide personalized support to students and teachers. The unprecedented capacity of LLM-based applications to understand and generate natural language can potentially improve instructional effectiveness and learning outcomes, but the integration of LLMs in education technology has renewed concerns over algorithmic bi… ▽ More

    Submitted 3 June, 2024; originally announced July 2024.

    Comments: 20 pages, 2 figures, preprint for British Journal of Educational Technology submission

  2. arXiv:2311.14707  [pdf, other

    cs.CY cs.LG

    Knowledge Tracing Challenge: Optimal Activity Sequencing for Students

    Authors: Yann Hicke

    Abstract: Knowledge tracing is a method used in education to assess and track the acquisition of knowledge by individual learners. It involves using a variety of techniques, such as quizzes, tests, and other forms of assessment, to determine what a learner knows and does not know about a particular subject. The goal of knowledge tracing is to identify gaps in understanding and provide targeted instruction t… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Journal ref: Course Project, 2022

  3. arXiv:2311.02775  [pdf, other

    cs.LG cs.AI cs.CL

    AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs

    Authors: Yann Hicke, Anmol Agarwal, Qianou Ma, Paul Denny

    Abstract: Responding to the thousands of student questions on online QA platforms each semester has a considerable human cost, particularly in computing courses with rapidly growing enrollments. To address the challenges of scalable and intelligent question-answering (QA), we introduce an innovative solution that leverages open-source Large Language Models (LLMs) from the LLaMA-2 family to ensure data priva… ▽ More

    Submitted 18 December, 2023; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: Updates for camera-ready submission

    Journal ref: NeurIPS Workshop on Generative AI for Education (GAIED), 2023

  4. arXiv:2307.04276  [pdf, other

    cs.CL

    Automated Essay Scoring in Argumentative Writing: DeBERTeachingAssistant

    Authors: Yann Hicke, Tonghua Tian, Karan Jha, Choong Hee Kim

    Abstract: Automated Essay scoring has been explored as a research and industry problem for over 50 years. It has drawn a lot of attention from the NLP community because of its clear educational value as a research area that can engender the creation of valuable time-saving tools for educators around the world. Yet, these tools are generally focused on detecting good grammar, spelling mistakes, and organizat… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Journal ref: LAK23: Workshop on Partnerships for Cocreating Educational Content, 2023

  5. arXiv:2307.04274  [pdf, other

    cs.CL cs.LG

    Assessing the efficacy of large language models in generating accurate teacher responses

    Authors: Yann Hicke, Abhishek Masand, Wentao Guo, Tushaar Gangavarapu

    Abstract: (Tack et al., 2023) organized the shared task hosted by the 18th Workshop on Innovative Use of NLP for Building Educational Applications on generation of teacher language in educational dialogues. Following the structure of the shared task, in this study, we attempt to assess the generative abilities of large language models in providing informative and helpful insights to students, thereby simula… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Journal ref: ACL, Innovative Use of NLP for Building Educational Applications Workshop, 2023

  6. arXiv:2306.08997   

    cs.CL cs.AI cs.LG

    Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models

    Authors: Sarah J. Zhang, Samuel Florin, Ariel N. Lee, Eamon Niknafs, Andrei Marginean, Annie Wang, Keith Tyser, Zad Chin, Yann Hicke, Nikhil Singh, Madeleine Udell, Yoon Kim, Tonio Buonassisi, Armando Solar-Lezama, Iddo Drori

    Abstract: We curate a comprehensive dataset of 4,550 questions and solutions from problem sets, midterm exams, and final exams across all MIT Mathematics and Electrical Engineering and Computer Science (EECS) courses required for obtaining a degree. We evaluate the ability of large language models to fulfill the graduation requirements for any MIT major in Mathematics and EECS. Our results demonstrate that… ▽ More

    Submitted 24 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Did not receive permission to release the data or model fine-tuned on the data

  7. arXiv:2211.12112  [pdf, other

    cs.CV cs.AI cs.LG

    Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

    Authors: Vitali Petsiuk, Alexander E. Siemenn, Saisamrit Surbehera, Zad Chin, Keith Tyser, Gregory Hunter, Arvind Raghavan, Yann Hicke, Bryan A. Plummer, Ori Kerret, Tonio Buonassisi, Kate Saenko, Armando Solar-Lezama, Iddo Drori

    Abstract: We provide a new multi-task benchmark for evaluating text-to-image models. We perform a human evaluation comparing the most common open-source (Stable Diffusion) and commercial (DALL-E 2) models. Twenty computer science AI graduate students evaluated the two models, on three tasks, at three difficulty levels, across ten prompts each, providing 3,600 ratings. Text-to-image generation has seen rapid… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 Workshop on Human Evaluation of Generative Models (HEGM)

  8. arXiv:2206.05442  [pdf, ps, other

    cs.LG

    From Human Days to Machine Seconds: Automatically Answering and Generating Machine Learning Final Exams

    Authors: Iddo Drori, Sarah J. Zhang, Reece Shuttleworth, Sarah Zhang, Keith Tyser, Zad Chin, Pedro Lantigua, Saisamrit Surbehera, Gregory Hunter, Derek Austin, Leonard Tang, Yann Hicke, Sage Simhon, Sathwik Karnik, Darnell Granberry, Madeleine Udell

    Abstract: A final exam in machine learning at a top institution such as MIT, Harvard, or Cornell typically takes faculty days to write, and students hours to solve. We demonstrate that large language models pass machine learning finals at a human level, on finals available online after the models were trained, and automatically generate new human-quality final exam questions in seconds. Previous work has de… ▽ More

    Submitted 28 June, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: 9 pages