Zum Hauptinhalt springen

Showing 1–38 of 38 results for author: Leinonen, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11994  [pdf, other

    cs.HC cs.AI cs.CY

    Evaluating Contextually Personalized Programming Exercises Created with Generative AI

    Authors: Evanfiya Logacheva, Arto Hellas, James Prather, Sami Sarsa, Juho Leinonen

    Abstract: Programming skills are typically developed through completing various hands-on exercises. Such programming problems can be contextualized to students' interests and cultural backgrounds. Prior research in educational psychology has demonstrated that context personalization of exercises stimulates learners' situational interests and positively affects their engagement. However, creating a varied an… ▽ More

    Submitted 11 June, 2024; originally announced July 2024.

    Comments: 19 pages, 12 figures. Accepted for publication at ICER 2024

  2. arXiv:2407.09231  [pdf, ps, other

    cs.CY cs.HC

    Prompts First, Finally

    Authors: Brent N. Reeves, James Prather, Paul Denny, Juho Leinonen, Stephen MacNeil, Brett A. Becker, Andrew Luxton-Reilly

    Abstract: Generative AI (GenAI) and large language models in particular, are disrupting Computer Science Education. They are proving increasingly capable at more and more challenges. Some educators argue that they pose a serious threat to computing education, and that we should ban their use in the classroom. While there are serious GenAI issues that remain unsolved, it may be useful in the present moment t… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 4 pages

  3. arXiv:2407.04873  [pdf, ps, other

    cs.AI cs.CY

    Evaluating Language Models for Generating and Judging Programming Feedback

    Authors: Charles Koutcheme, Nicola Dainese, Arto Hellas, Sami Sarsa, Juho Leinonen, Syed Ashraf, Paul Denny

    Abstract: The emergence of large language models (LLMs) has transformed research and practice in a wide range of domains. Within the computing education research (CER) domain, LLMs have received plenty of attention especially in the context of learning programming. Much of the work on LLMs in CER has however focused on applying and evaluating proprietary models. In this article, we evaluate the efficiency o… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  4. arXiv:2406.04817  [pdf, other

    cs.CY cs.HC

    Experiences from Integrating Large Language Model Chatbots into the Classroom

    Authors: Arto Hellas, Juho Leinonen, Leo Leppänen

    Abstract: In the present study, we provided students an unfiltered access to a state-of-the-art large language model (LLM) chatbot. The chatbot was intentionally designed to mimic proprietary commercial chatbots such as ChatGPT where the chatbot has not been tailored for the educational context; the underlying engine was OpenAI GPT-4. The chatbot was integrated into online learning materials of three course… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 7 pages, 1 figure, 5 tables

  5. arXiv:2405.17739  [pdf, other

    cs.AI cs.HC

    The Widening Gap: The Benefits and Harms of Generative AI for Novice Programmers

    Authors: James Prather, Brent Reeves, Juho Leinonen, Stephen MacNeil, Arisoa S. Randrianasolo, Brett Becker, Bailey Kimmel, Jared Wright, Ben Briggs

    Abstract: Novice programmers often struggle through programming problem solving due to a lack of metacognitive awareness and strategies. Previous research has shown that novices can encounter multiple metacognitive difficulties while programming. Novices are typically unaware of how these difficulties are hindering their progress. Meanwhile, many novices are now programming with generative AI (GenAI), which… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to ICER 2024

  6. arXiv:2405.05347  [pdf, other

    cs.SE cs.AI cs.CL cs.CY

    Benchmarking Educational Program Repair

    Authors: Charles Koutcheme, Nicola Dainese, Sami Sarsa, Juho Leinonen, Arto Hellas, Paul Denny

    Abstract: The emergence of large language models (LLMs) has sparked enormous interest due to their potential application across a range of educational tasks. For example, recent work in programming education has used LLMs to generate learning resources, improve error messages, and provide feedback on code. However, one factor that limits progress within the field is that much of the research uses bespoke da… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 15 pages, 2 figures, 3 tables. Non-archival report presented at the NeurIPS'23 Workshop on Generative AI for Education (GAIED)

  7. arXiv:2405.05253  [pdf, other

    cs.CL cs.AI cs.CY

    Open Source Language Models Can Provide Feedback: Evaluating LLMs' Ability to Help Students Using GPT-4-As-A-Judge

    Authors: Charles Koutcheme, Nicola Dainese, Sami Sarsa, Arto Hellas, Juho Leinonen, Paul Denny

    Abstract: Large language models (LLMs) have shown great potential for the automatic generation of feedback in a wide range of computing contexts. However, concerns have been voiced around the privacy and ethical implications of sending student work to proprietary models. This has sparked considerable interest in the use of open source LLMs in education, but the quality of the feedback that such open models… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 7 pages, 4 figures, 2 tables. Accepted for publication at the 29th annual ACM conference on Innovation and Technology in Computer Science Education (ITiCSE 2024)

  8. arXiv:2405.01477  [pdf, other

    cs.SE cs.HC

    "Sometimes You Just Gotta Risk It for the Biscuit": A Portrait of Student Risk-Taking

    Authors: Juho Leinonen, Paul Denny

    Abstract: Understanding how individuals, including students, make decisions involving risk is a fundamental aspect of behavioral research. Despite the ubiquity of risk in various aspects of life, limited empirical work has explored student risk-taking behavior in computing education. This study aims to partially replicate prior research on risk-taking behavior in software engineers while focusing on student… ▽ More

    Submitted 2 May, 2024; v1 submitted 1 April, 2024; originally announced May 2024.

    Comments: 7 pages, 1 figure, 4 tables

  9. arXiv:2403.09409  [pdf, ps, other

    cs.HC cs.AI cs.CL

    "Like a Nesting Doll": Analyzing Recursion Analogies Generated by CS Students using Large Language Models

    Authors: Seth Bernstein, Paul Denny, Juho Leinonen, Lauren Kan, Arto Hellas, Matt Littlefield, Sami Sarsa, Stephen MacNeil

    Abstract: Grasping complex computing concepts often poses a challenge for students who struggle to anchor these new ideas to familiar experiences and understandings. To help with this, a good analogy can bridge the gap between unfamiliar concepts and familiar ones, providing an engaging way to aid understanding. However, creating effective educational analogies is difficult even for experienced instructors.… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 7 pages, 2 figures, ITiCSE 2024 preprint

  10. arXiv:2403.06050  [pdf, other

    cs.HC cs.CY cs.SE

    Explaining Code with a Purpose: An Integrated Approach for Developing Code Comprehension and Prompting Skills

    Authors: Paul Denny, David H. Smith IV, Max Fowler, James Prather, Brett A. Becker, Juho Leinonen

    Abstract: Reading, understanding and explaining code have traditionally been important skills for novices learning programming. As large language models (LLMs) become prevalent, these foundational skills are more important than ever given the increasing need to understand and evaluate model-generated code. Brand new skills are also needed, such as the ability to formulate clear prompts that can elicit inten… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted to ITiCSE 2024

  11. arXiv:2401.10759  [pdf, other

    cs.HC cs.AI

    Interactions with Prompt Problems: A New Way to Teach Programming with Large Language Models

    Authors: James Prather, Paul Denny, Juho Leinonen, David H. Smith IV, Brent N. Reeves, Stephen MacNeil, Brett A. Becker, Andrew Luxton-Reilly, Thezyrie Amarouche, Bailey Kimmel

    Abstract: Large Language Models (LLMs) have upended decades of pedagogy in computing education. Students previously learned to code through \textit{writing} many small problems with less emphasis on code reading and comprehension. Recent research has shown that free code generation tools powered by LLMs can solve introductory programming problems presented in natural language with ease. In this paper, we pr… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: accepted for CHI 2024

  12. arXiv:2311.16017  [pdf, other

    cs.HC cs.AI

    Decoding Logic Errors: A Comparative Study on Bug Detection by Students and Large Language Models

    Authors: Stephen MacNeil, Paul Denny, Andrew Tran, Juho Leinonen, Seth Bernstein, Arto Hellas, Sami Sarsa, Joanne Kim

    Abstract: Identifying and resolving logic errors can be one of the most frustrating challenges for novices programmers. Unlike syntax errors, for which a compiler or interpreter can issue a message, logic errors can be subtle. In certain conditions, buggy code may even exhibit correct behavior -- in other cases, the issue might be about how a problem statement has been interpreted. Such errors can be hard t… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  13. arXiv:2311.05943  [pdf, other

    cs.HC

    Prompt Problems: A New Programming Exercise for the Generative AI Era

    Authors: Paul Denny, Juho Leinonen, James Prather, Andrew Luxton-Reilly, Thezyrie Amarouche, Brett A. Becker, Brent N. Reeves

    Abstract: Large Language Models (LLMs) are revolutionizing the field of computing education with their powerful code-generating capabilities. Traditional pedagogical practices have focused on code writing tasks, but there is now a shift in importance towards code reading, comprehension and evaluation of LLM-generated code. Alongside this shift, an important new skill is emerging -- the ability to solve prog… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: Accepted to SIGCSE'24. arXiv admin note: substantial text overlap with arXiv:2307.16364

  14. arXiv:2311.03002  [pdf, other

    cs.LG stat.ML

    Estimating treatment effects from single-arm trials via latent-variable modeling

    Authors: Manuel Haussmann, Tran Minh Son Le, Viivi Halla-aho, Samu Kurki, Jussi V. Leinonen, Miika Koskinen, Samuel Kaski, Harri Lähdesmäki

    Abstract: Randomized controlled trials (RCTs) are the accepted standard for treatment effect estimation but they can be infeasible due to ethical reasons and prohibitive costs. Single-arm trials, where all patients belong to the treatment group, can be a viable alternative but require access to an external control group. We propose an identifiable deep latent-variable model for this scenario that can also a… ▽ More

    Submitted 4 March, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Published at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  15. arXiv:2310.00658  [pdf, other

    cs.CY cs.AI cs.HC

    The Robots are Here: Navigating the Generative AI Revolution in Computing Education

    Authors: James Prather, Paul Denny, Juho Leinonen, Brett A. Becker, Ibrahim Albluwi, Michelle Craig, Hieke Keuning, Natalie Kiesler, Tobias Kohn, Andrew Luxton-Reilly, Stephen MacNeil, Andrew Peterson, Raymond Pettit, Brent N. Reeves, Jaromir Savelka

    Abstract: Recent advancements in artificial intelligence (AI) are fundamentally reshaping computing, with large language models (LLMs) now effectively being able to generate and interpret source code and natural language instructions. These emergent capabilities have sparked urgent questions in the computing education community around how educators should adapt their pedagogy to address the challenges and t… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: 39 pages of content + 12 pages of references and appendices

  16. arXiv:2309.10444  [pdf, other

    cs.AI cs.CL

    Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models

    Authors: Qiming Bao, Juho Leinonen, Alex Yuxuan Peng, Wanjun Zhong, Gaël Gendron, Timothy Pistotti, Alice Huang, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Large language models exhibit superior capabilities in processing and understanding language, yet their applications in educational contexts remain underexplored. Learnersourcing enhances learning by engaging students in creating their own educational content. When learnersourcing multiple-choice questions, creating explanations for the solution of a question is a crucial step; it helps other stud… ▽ More

    Submitted 10 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: The short version (v4) was accepted as a non-archival workshop paper at AGI@ICLR 2024; the full version is under review

  17. arXiv:2307.16364  [pdf, other

    cs.HC cs.AI

    Promptly: Using Prompt Problems to Teach Learners How to Effectively Utilize AI Code Generators

    Authors: Paul Denny, Juho Leinonen, James Prather, Andrew Luxton-Reilly, Thezyrie Amarouche, Brett A. Becker, Brent N. Reeves

    Abstract: With their remarkable ability to generate code, large language models (LLMs) are a transformative technology for computing education practice. They have created an urgent need for educators to rethink pedagogical approaches and teaching strategies for newly emerging skill sets. Traditional approaches to learning programming have focused on frequent and repeated practice at writing code. The ease w… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  18. arXiv:2306.10509  [pdf, other

    cs.HC cs.AI

    Can We Trust AI-Generated Educational Content? Comparative Analysis of Human and AI-Generated Learning Resources

    Authors: Paul Denny, Hassan Khosravi, Arto Hellas, Juho Leinonen, Sami Sarsa

    Abstract: As an increasing number of students move to online learning platforms that deliver personalized learning experiences, there is a great need for the production of high-quality educational content. Large language models (LLMs) appear to offer a promising solution to the rapid creation of learning materials at scale, reducing the burden on instructors. In this study, we investigated the potential for… ▽ More

    Submitted 3 July, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

  19. arXiv:2306.05715  [pdf, ps, other

    cs.CY cs.AI cs.CL cs.HC cs.SE

    Exploring the Responses of Large Language Models to Beginner Programmers' Help Requests

    Authors: Arto Hellas, Juho Leinonen, Sami Sarsa, Charles Koutcheme, Lilja Kujanpää, Juha Sorva

    Abstract: Background and Context: Over the past year, large language models (LLMs) have taken the world by storm. In computing education, like in other walks of life, many opportunities and threats have emerged as a consequence. Objectives: In this article, we explore such opportunities and threats in a specific area: responding to student programmers' help requests. More specifically, we assess how good… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 13 pages, 1 figure. To be published in Proceedings of the 2023 ACM Conference on International Computing Education Research V.1 (ICER '23 V1)

  20. arXiv:2306.02608  [pdf, other

    cs.CY cs.AI cs.HC

    Computing Education in the Era of Generative AI

    Authors: Paul Denny, James Prather, Brett A. Becker, James Finnie-Ansley, Arto Hellas, Juho Leinonen, Andrew Luxton-Reilly, Brent N. Reeves, Eddie Antonio Santos, Sami Sarsa

    Abstract: The computing education community has a rich history of pedagogical innovation designed to support students in introductory courses, and to support teachers in facilitating student learning. Very recent advances in artificial intelligence have resulted in code generation models that can produce source code from natural language problem descriptions -- with impressive accuracy in many cases. The wi… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted for publication as a Contributed Article in Communications of the ACM (CACM)

  21. arXiv:2304.12891  [pdf, other

    physics.ao-ph cs.LG eess.IV

    Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification

    Authors: Jussi Leinonen, Ulrich Hamann, Daniele Nerini, Urs Germann, Gabriele Franch

    Abstract: Diffusion models have been widely adopted in image generation, producing higher-quality and more diverse samples than generative adversarial networks (GANs). We introduce a latent diffusion model (LDM) for precipitation nowcasting - short-term forecasting based on the latest observational data. The LDM is more stable and requires less computation to train than GANs, albeit with more computationall… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 18 pages, 6 figures. Submitted for publication

    ACM Class: I.2.10; J.2

  22. arXiv:2304.03938  [pdf, other

    cs.CY cs.AI cs.CL cs.HC cs.SE

    Comparing Code Explanations Created by Students and Large Language Models

    Authors: Juho Leinonen, Paul Denny, Stephen MacNeil, Sami Sarsa, Seth Bernstein, Joanne Kim, Andrew Tran, Arto Hellas

    Abstract: Reasoning about code and explaining its purpose are fundamental skills for computer scientists. There has been extensive research in the field of computing education on the relationship between a student's ability to explain code and other skills such as writing and tracing code. In particular, the ability to describe at a high-level of abstraction how code will behave over all possible inputs cor… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: 8 pages, 3 figures. To be published in Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education V. 1

  23. arXiv:2304.02491  [pdf, other

    cs.HC cs.AI

    "It's Weird That it Knows What I Want": Usability and Interactions with Copilot for Novice Programmers

    Authors: James Prather, Brent N. Reeves, Paul Denny, Brett A. Becker, Juho Leinonen, Andrew Luxton-Reilly, Garrett Powell, James Finnie-Ansley, Eddie Antonio Santos

    Abstract: Recent developments in deep learning have resulted in code-generation models that produce source code from natural language and code-based prompts with high accuracy. This is likely to have profound effects in the classroom, where novices learning to code can now use free tools to automatically suggest solutions to programming exercises and assignments. However, little is currently known about how… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 26 pages, 2 figures, TOCHI

  24. arXiv:2301.07509  [pdf, other

    cs.HC cs.CY

    Coverage of Course Topics in Learnersourced SQL Exercises

    Authors: Nea Pirttinen, Arto Hellas, Juho Leinonen

    Abstract: Learnersourcing is a common task in modern computing classrooms, where it is used, for example, for the creation of educational resources such as multiple-choice questions and programming exercises. One less studied type of learnersourced artefact is SQL exercises. In this work, we explore how well different SQL topics are covered by learnersourced SQL exercises. Covering most course topics would… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

  25. Automatically Generating CS Learning Materials with Large Language Models

    Authors: Stephen MacNeil, Andrew Tran, Juho Leinonen, Paul Denny, Joanne Kim, Arto Hellas, Seth Bernstein, Sami Sarsa

    Abstract: Recent breakthroughs in Large Language Models (LLMs), such as GPT-3 and Codex, now enable software developers to generate code based on a natural language prompt. Within computer science education, researchers are exploring the potential for LLMs to generate code explanations and programming assignments using carefully crafted prompts. These advances may enable students to interact with code in ne… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: In Proceedings of the 54th ACM Technical Symposium on Computing Science Education

  26. arXiv:2211.04715  [pdf, other

    cs.HC

    Robosourcing Educational Resources -- Leveraging Large Language Models for Learnersourcing

    Authors: Paul Denny, Sami Sarsa, Arto Hellas, Juho Leinonen

    Abstract: In this article, we introduce and evaluate the concept of robosourcing for creating educational content. Robosourcing lies in the intersection of crowdsourcing and large language models, where instead of a crowd of humans, requests to large language models replace some of the work traditionally performed by the crowd. Robosourcing includes a human-in-the-loop to provide priming (input) as well as… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  27. arXiv:2211.02265  [pdf, other

    cs.SE cs.HC

    Experiences from Using Code Explanations Generated by Large Language Models in a Web Software Development E-Book

    Authors: Stephen MacNeil, Andrew Tran, Arto Hellas, Joanne Kim, Sami Sarsa, Paul Denny, Seth Bernstein, Juho Leinonen

    Abstract: Advances in natural language processing have resulted in large language models (LLMs) that are capable of generating understandable and sensible written text. Recent versions of these models, such as OpenAI Codex and GPT-3, can generate code and code explanations. However, it is unclear whether and how students might engage with such explanations. In this paper, we report on our experiences genera… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  28. arXiv:2211.01001  [pdf, other

    physics.ao-ph cs.LG

    Thunderstorm nowcasting with deep learning: a multi-hazard data fusion model

    Authors: Jussi Leinonen, Ulrich Hamann, Ioannis V. Sideris, Urs Germann

    Abstract: Predictions of thunderstorm-related hazards are needed in several sectors, including first responders, infrastructure management and aviation. To address this need, we present a deep learning model that can be adapted to different hazard types. The model can utilize multiple data sources; we use data from weather radar, lightning detection, satellite visible/infrared imagery, numerical weather pre… ▽ More

    Submitted 15 March, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: 17 pages, 3 figures (main text); 13 pages, 10 figures, 1 table (supplement). Accepted for publication in Geophysical Research Letters

    ACM Class: I.2.10; J.2

  29. arXiv:2210.11630  [pdf, ps, other

    cs.HC cs.AI cs.CL cs.SE

    Using Large Language Models to Enhance Programming Error Messages

    Authors: Juho Leinonen, Arto Hellas, Sami Sarsa, Brent Reeves, Paul Denny, James Prather, Brett A. Becker

    Abstract: A key part of learning to program is learning to understand programming error messages. They can be hard to interpret and identifying the cause of errors can be time-consuming. One factor in this challenge is that the messages are typically intended for an audience that already knows how to program, or even for programming environments that then use the information to highlight areas in code. Rese… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 7 pages, accepted for publication at SIGCSE TS 2023

  30. arXiv:2206.11861  [pdf, other

    cs.SE cs.AI cs.CL

    Automatic Generation of Programming Exercises and Code Explanations using Large Language Models

    Authors: Sami Sarsa, Paul Denny, Arto Hellas, Juho Leinonen

    Abstract: This article explores the natural language generation capabilities of large language models with application to the production of two types of learning resources common in programming courses. Using OpenAI Codex as the large language model, we create programming exercises (including sample solutions and test cases) and code explanations, assessing these qualitatively and quantitatively. Our result… ▽ More

    Submitted 26 June, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: 18 pages, 1 figure, accepted in ICER

  31. arXiv:2203.10114  [pdf, other

    physics.ao-ph cs.LG

    Seamless lightning nowcasting with recurrent-convolutional deep learning

    Authors: Jussi Leinonen, Ulrich Hamann, Urs Germann

    Abstract: A deep learning model is presented to nowcast the occurrence of lightning at a five-minute time resolution 60 minutes into the future. The model is based on a recurrent-convolutional architecture that allows it to recognize and predict the spatiotemporal development of convection, including the motion, growth and decay of thunderstorm cells. The predictions are performed on a stationary grid, with… ▽ More

    Submitted 27 September, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: 21 pages, 9 figures. Accepted to Artificial Intelligence for the Earth Sciences. Changes after the previous version are in response to the comments received from one remaining anonymous reviewer

    ACM Class: I.2.10; J.2

    Journal ref: Artif. Intell. Earth Syst., 1, e220043 (2022)

  32. arXiv:2112.15072  [pdf, other

    cs.LG

    Empirical Evaluation of Deep Learning Models for Knowledge Tracing: Of Hyperparameters and Metrics on Performance and Replicability

    Authors: Sami Sarsa, Juho Leinonen, Arto Hellas

    Abstract: We review and evaluate a body of deep learning knowledge tracing (DLKT) models with openly available and widely-used data sets, and with a novel data set of students learning to program. The evaluated knowledge tracing models include Vanilla-DKT, two Long Short-Term Memory Deep Knowledge Tracing (LSTM-DKT) variants, two Dynamic Key-Value Memory Network (DKVMN) variants, and Self-Attentive Knowledg… ▽ More

    Submitted 5 April, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

    Comments: 70 pages, 8 figures, submitted to JEDM, added acknowledgments, modified after first round of review

    ACM Class: K.3; I.2

  33. Improvements to short-term weather prediction with recurrent-convolutional networks

    Authors: Jussi Leinonen

    Abstract: The Weather4cast 2021 competition gave the participants a task of predicting the time evolution of two-dimensional fields of satellite-based meteorological data. This paper describes the author's efforts, after initial success in the first stage of the competition, to improve the model further in the second stage. The improvements consisted of a shallower model variant that is competitive against… ▽ More

    Submitted 24 November, 2021; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: 6 pages, 4 figures. Accepted to the session "Bigdata Cup Challenges: IARAI's Weather4cast Competition" at IEEE Big Data Conference 2021

  34. arXiv:2111.02121  [pdf, other

    cs.LG physics.ao-ph

    Spatiotemporal Weather Data Predictions with Shortcut Recurrent-Convolutional Networks: A Solution for the Weather4cast challenge

    Authors: Jussi Leinonen

    Abstract: This paper presents the neural network model that was used by the author in the Weather4cast 2021 Challenge Stage 1, where the objective was to predict the time evolution of satellite-based weather data images. The network is based on an encoder-forecaster architecture making use of gated recurrent units (GRU), residual blocks and a contracting/expanding architecture with shortcuts similar to U-Ne… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 6 pages, 5 figures. To be published in the proceedings of the 1st workshop on Complex Data Challenges in Earth Observation (CDCEO) 2021. Associated code can be found at https://github.com/jleinonen/weather4cast-stage1

  35. arXiv:2103.01752  [pdf, other

    cs.CY

    Morning or Evening? An Examination of Circadian Rhythms of CS1 Students

    Authors: Albina Zavgorodniaia, Raj Shrestha, Juho Leinonen, Arto Hellas, John Edwards

    Abstract: Circadian rhythms are the cycles of our internal clock that play a key role in governing when we sleep and when we are active. A related concept is chronotype, which is a person's natural tendency toward activity at certain times of day and typically governs when the individual is most alert and productive. In this work we investigate chronotypes in the setting of an Introductory Computer Programm… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

  36. arXiv:2008.08315  [pdf, other

    cs.CL

    FinChat: Corpus and evaluation setup for Finnish chat conversations on everyday topics

    Authors: Katri Leino, Juho Leinonen, Mittul Singh, Sami Virpioja, Mikko Kurimo

    Abstract: Creating open-domain chatbots requires large amounts of conversational data and related benchmark tasks to evaluate them. Standardized evaluation tasks are crucial for creating automatic evaluation metrics for model development; otherwise, comparing the models would require resource-expensive human evaluation. While chatbot challenges have recently managed to provide a plethora of such resources f… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

  37. arXiv:2005.10374  [pdf, other

    eess.IV cs.LG physics.ao-ph stat.ML

    Stochastic Super-Resolution for Downscaling Time-Evolving Atmospheric Fields with a Generative Adversarial Network

    Authors: Jussi Leinonen, Daniele Nerini, Alexis Berne

    Abstract: Generative adversarial networks (GANs) have been recently adopted for super-resolution, an application closely related to what is referred to as "downscaling" in the atmospheric sciences: improving the spatial resolution of low-resolution images. The ability of conditional GANs to generate an ensemble of solutions for a given input lends itself naturally to stochastic downscaling, but the stochast… ▽ More

    Submitted 19 October, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted for publication in IEEE Transactions in Geoscience and Remote Sensing

    ACM Class: I.4.3; I.5.1; J.2; I.2.10

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, 59 (9), 7211-7223, 2021

  38. arXiv:1904.07633  [pdf, ps, other

    cs.LG

    HARK Side of Deep Learning -- From Grad Student Descent to Automated Machine Learning

    Authors: Oguzhan Gencoglu, Mark van Gils, Esin Guldogan, Chamin Morikawa, Mehmet Süzen, Mathias Gruber, Jussi Leinonen, Heikki Huttunen

    Abstract: Recent advancements in machine learning research, i.e., deep learning, introduced methods that excel conventional algorithms as well as humans in several complex tasks, ranging from detection of objects in images and speech recognition to playing difficult strategic games. However, the current methodology of machine learning research and consequently, implementations of the real-world applications… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

    Comments: 13 pages