Zum Hauptinhalt springen

Showing 1–30 of 30 results for author: Kanervisto, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.12399  [pdf, other

    cs.LG cs.AI cs.CV

    Diffusion for World Modeling: Visual Details Matter in Atari

    Authors: Eloi Alonso, Adam Jelley, Vincent Micheli, Anssi Kanervisto, Amos Storkey, Tim Pearce, François Fleuret

    Abstract: World models constitute a promising approach for training reinforcement learning agents in a safe and sample-efficient manner. Recent world models predominantly operate on sequences of discrete latent variables to model environment dynamics. However, this compression into a compact discrete representation may ignore visual details that are important for reinforcement learning. Concurrently, diffus… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 25 pages, 11 figures, 10 tables

  2. arXiv:2402.03575  [pdf, other

    cs.AI cs.HC

    Toward Human-AI Alignment in Large-Scale Multi-Player Games

    Authors: Sugandha Sharma, Guy Davidson, Khimya Khetarpal, Anssi Kanervisto, Udit Arora, Katja Hofmann, Ida Momennejad

    Abstract: Achieving human-AI alignment in complex multi-agent games is crucial for creating trustworthy AI agents that enhance gameplay. We propose a method to evaluate this alignment using an interpretable task-sets framework, focusing on high-level behavioral tasks instead of low-level policies. Our approach has three components. First, we analyze extensive human gameplay data from Xbox's Bleeding Edge (1… ▽ More

    Submitted 18 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2312.02405  [pdf, other

    cs.AI

    BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks

    Authors: Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Rohin Shah

    Abstract: The MineRL BASALT competition has served to catalyze advances in learning from human feedback through four hard-to-specify tasks in Minecraft, such as create and photograph a waterfall. Given the completion of two years of BASALT competitions, we offer to the community a formalized benchmark through the BASALT Evaluation and Demonstrations Dataset (BEDD), which serves as a resource for algorithm d… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks Oral. Dataset links are available on Github: https://github.com/minerllabs/basalt-benchmark

  4. arXiv:2312.02312  [pdf, other

    cs.LG cs.AI cs.CV

    Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games

    Authors: Lukas Schäfer, Logan Jones, Anssi Kanervisto, Yuhan Cao, Tabish Rashid, Raluca Georgescu, Dave Bignell, Siddhartha Sen, Andrea Treviño Gavito, Sam Devlin

    Abstract: Video games have served as useful benchmarks for the decision making community, but going beyond Atari games towards training agents in modern games has been prohibitively expensive for the vast majority of the research community. Recent progress in the research, development and open release of large vision models has the potential to amortize some of these costs across the community. However, it… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Preprint

  5. arXiv:2303.13512  [pdf, other

    cs.AI

    Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

    Authors: Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing Hu, Tangjie Lv, Federico Malato, Florian Leopold, Amogh Raut, Ville Hautamäki, Andrew Melnik, Shu Ishida, João F. Henriques, Robert Klassert, Walter Laurito, Ellen Novoseller , et al. (5 additional authors not shown)

    Abstract: To facilitate research in the direction of fine-tuning foundation models from human feedback, we held the MineRL BASALT Competition on Fine-Tuning from Human Feedback at NeurIPS 2022. The BASALT challenge asks teams to compete to develop algorithms to solve tasks with hard-to-specify reward functions in Minecraft. Through this competition, we aimed to promote the development of algorithms that use… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  6. arXiv:2301.10677  [pdf, other

    cs.AI cs.LG stat.ML

    Imitating Human Behaviour with Diffusion Models

    Authors: Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

    Abstract: Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their ex… ▽ More

    Submitted 3 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Published in ICLR 2023

    Journal ref: ICLR 2023

  7. arXiv:2205.09123  [pdf, other

    cs.LG

    A2C is a special case of PPO

    Authors: Shengyi Huang, Anssi Kanervisto, Antonin Raffin, Weixun Wang, Santiago Ontañón, Rousslan Fernand Julien Dossa

    Abstract: Advantage Actor-critic (A2C) and Proximal Policy Optimization (PPO) are popular deep reinforcement learning algorithms used for game AI in recent years. A common understanding is that A2C and PPO are separate algorithms because PPO's clipped objective appears significantly different than A2C's objective. In this paper, however, we show A2C is a special case of PPO. We present theoretical justifica… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  8. arXiv:2205.07060  [pdf, other

    cs.AI cs.CR cs.LG

    GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters

    Authors: Anssi Kanervisto, Tomi Kinnunen, Ville Hautamäki

    Abstract: Playing games with cheaters is not fun, and in a multi-billion-dollar video game industry with hundreds of millions of players, game developers aim to improve the security and, consequently, the user experience of their games by preventing cheating. Both traditional software-based methods and statistical systems have been successful in protecting against cheating, but recent advances in the automa… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted to IEEE Transactions on Games. Source code available at https://github.com/miffyli/gan-aimbots

  9. arXiv:2204.07123  [pdf, other

    cs.AI

    Retrospective on the 2021 BASALT Competition on Learning from Human Feedback

    Authors: Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries, Alexandra Souly, Chan Jun Shern, Daniel del Castillo, Tom Lieberum

    Abstract: We held the first-ever MineRL Benchmark for Agents that Solve Almost-Lifelike Tasks (MineRL BASALT) Competition at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021). The goal of the competition was to promote research towards agents that use learning from human feedback (LfHF) techniques to solve open-world tasks. Rather than mandating the use of LfHF techniques,… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted to the PMLR NeurIPS 2021 Demo & Competition Track volume

  10. arXiv:2203.11889  [pdf, other

    cs.LG cs.AI cs.NE cs.SC stat.ML

    Insights From the NeurIPS 2021 NetHack Challenge

    Authors: Eric Hambro, Sharada Mohanty, Dmitrii Babaev, Minwoo Byeon, Dipam Chakraborty, Edward Grefenstette, Minqi Jiang, Daejin Jo, Anssi Kanervisto, Jongmin Kim, Sungwoong Kim, Robert Kirk, Vitaly Kurin, Heinrich Küttler, Taehwon Kwon, Donghoon Lee, Vegard Mella, Nantas Nardelli, Ivan Nazarov, Nikita Ovsov, Jack Parker-Holder, Roberta Raileanu, Karolis Ramanauskas, Tim Rocktäschel, Danielle Rothermel , et al. (4 additional authors not shown)

    Abstract: In this report, we summarize the takeaways from the first NeurIPS 2021 NetHack Challenge. Participants were tasked with developing a program or agent that can win (i.e., 'ascend' in) the popular dungeon-crawler game of NetHack by interacting with the NetHack Learning Environment (NLE), a scalable, procedurally generated, and challenging Gym environment for reinforcement learning (RL). The challeng… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: Under review at PMLR for the NeuRIPS 2021 Competition Workshop Track, 10 pages + 10 in appendices

  11. arXiv:2202.10583  [pdf, other

    cs.LG cs.AI

    MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned

    Authors: Anssi Kanervisto, Stephanie Milani, Karolis Ramanauskas, Nicholay Topin, Zichuan Lin, Junyou Li, Jianing Shi, Deheng Ye, Qiang Fu, Wei Yang, Weijun Hong, Zhongyue Huang, Haicheng Chen, Guangjun Zeng, Yue Lin, Vincent Micheli, Eloi Alonso, François Fleuret, Alexander Nikulin, Yury Belousov, Oleg Svidchenko, Aleksei Shpilman

    Abstract: Reinforcement learning competitions advance the field by providing appropriate scope and support to develop solutions toward a specific problem. To promote the development of more broadly applicable methods, organizers need to enforce the use of general techniques, the use of sample-efficient methods, and the reproducibility of the results. While beneficial for the research community, these restri… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: Under review for PMLR volume on NeurIPS 2021 competitions

  12. arXiv:2201.09709  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Optimizing Tandem Speaker Verification and Anti-Spoofing Systems

    Authors: Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi

    Abstract: As automatic speaker verification (ASV) systems are vulnerable to spoofing attacks, they are typically used in conjunction with spoofing countermeasure (CM) systems to improve security. For example, the CM can first determine whether the input is human speech, then the ASV can determine whether this speech matches the speaker's identity. The performance of such a tandem system can be measured with… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing. Published version available at: https://ieeexplore.ieee.org/document/9664367

    Journal ref: in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 477-488, 2022

  13. arXiv:2107.02195  [pdf, other

    cs.LG cs.AI

    Agents that Listen: High-Throughput Reinforcement Learning with Multiple Sensory Systems

    Authors: Shashank Hegde, Anssi Kanervisto, Aleksei Petrenko

    Abstract: Humans and other intelligent animals evolved highly sophisticated perception systems that combine multiple sensory modalities. On the other hand, state-of-the-art artificial agents rely mostly on visual inputs or structured low-dimensional observations provided by instrumented environments. Learning to act based on combined visual and auditory inputs is still a new topic of research that has not b… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: To appear in IEEE Conference on Games 2021. Video demonstrations and experiment can be found at https://sites.google.com/view/sound-rl

  14. arXiv:2107.01969  [pdf, other

    cs.LG cs.AI

    The MineRL BASALT Competition on Learning from Human Feedback

    Authors: Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William Guss, Sharada Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca Dragan

    Abstract: The last decade has seen a significant increase of interest in deep learning research, with many public successes that have demonstrated its potential. As such, these systems are now being incorporated into commercial products. With this comes an additional challenge: how can we build AI systems that solve tasks where there is not a crisp, well-defined specification? While multiple solutions have… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: NeurIPS 2021 Competition Track

  15. arXiv:2107.00703  [pdf, other

    cs.LG cs.AI

    Distilling Reinforcement Learning Tricks for Video Games

    Authors: Anssi Kanervisto, Christian Scheller, Yanick Schraner, Ville Hautamäki

    Abstract: Reinforcement learning (RL) research focuses on general solutions that can be applied across different domains. This results in methods that RL practitioners can use in almost any domain. However, recent studies often lack the engineering steps ("tricks") which may be needed to effectively use RL, such as reward shaping, curriculum learning, and splitting a large task into smaller chunks. Such tri… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: To appear in IEEE Conference on Games 2021. Experiment code is available at https://github.com/Miffyli/rl-human-prior-tricks

  16. arXiv:2106.03748  [pdf, other

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Towards robust and domain agnostic reinforcement learning competitions

    Authors: William Hebgen Guss, Stephanie Milani, Nicholay Topin, Brandon Houghton, Sharada Mohanty, Andrew Melnik, Augustin Harter, Benoit Buschmaas, Bjarne Jaster, Christoph Berganski, Dennis Heitkamp, Marko Henning, Helge Ritter, Chengjie Wu, Xiaotian Hao, Yiming Lu, Hangyu Mao, Yihuan Mao, Chao Wang, Michal Opanowicz, Anssi Kanervisto, Yanick Schraner, Christian Scheller, Xiren Zhou, Lu Liu , et al. (4 additional authors not shown)

    Abstract: Reinforcement learning competitions have formed the basis for standard research benchmarks, galvanized advances in the state-of-the-art, and shaped the direction of the field. Despite this, a majority of challenges suffer from the same fundamental problems: participant solutions to the posed challenge are usually domain-specific, biased to maximally exploit compute resources, and not guaranteed to… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: 20 pages, several figures, published PMLR

  17. arXiv:2104.10753  [pdf, other

    cs.RO cs.AI cs.CV

    Multi-task Learning with Attention for End-to-end Autonomous Driving

    Authors: Keishi Ishihara, Anssi Kanervisto, Jun Miura, Ville Hautamäki

    Abstract: Autonomous driving systems need to handle complex scenarios such as lane following, avoiding collisions, taking turns, and responding to traffic signals. In recent years, approaches based on end-to-end behavioral cloning have demonstrated remarkable performance in point-to-point navigational scenarios, using a realistic simulator and standard benchmarks. Offline imitation learning is readily avail… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021 Workshop on Autonomous Driving

  18. arXiv:2104.00698  [pdf, other

    cs.AI

    Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search

    Authors: Dylan Ashley, Anssi Kanervisto, Brendan Bennett

    Abstract: We present AlphaChute: a state-of-the-art algorithm that achieves superhuman performance in the ancient game of Chutes and Ladders. We prove that our algorithm converges to the Nash equilibrium in constant time, and therefore is -- to the best of our knowledge -- the first such formal solution to this game. Surprisingly, despite all this, our implementation of AlphaChute remains relatively straigh… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

  19. arXiv:2012.01244  [pdf, other

    cs.AI cs.NE

    General Characterization of Agents by States they Visit

    Authors: Anssi Kanervisto, Tomi Kinnunen, Ville Hautamäki

    Abstract: Behavioural characterizations (BCs) of decision-making agents, or their policies, are used to study outcomes of training algorithms and as part of the algorithms themselves to encourage unique policies, match expert policy or restrict changes to policy per update. However, previously presented solutions are not applicable in general, either due to lack of expressive power, computational constraint… ▽ More

    Submitted 28 October, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: Deep Reinforcement Learning Workshop, NeurIPS 2021

  20. arXiv:2005.03374  [pdf, other

    cs.AI cs.LG

    Playing Minecraft with Behavioural Cloning

    Authors: Anssi Kanervisto, Janne Karttunen, Ville Hautamäki

    Abstract: MineRL 2019 competition challenged participants to train sample-efficient agents to play Minecraft, by using a dataset of human gameplay and a limit number of steps the environment. We approached this task with behavioural cloning by predicting what actions human players would take, and reached fifth place in the final ranking. Despite being a simple algorithm, we observed the performance of such… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: To appear in Post Proceedings of the Competitions & Demonstrations Track @ NeurIPS2019. Source code available at https://github.com/Miffyli/minecraft-bc

  21. arXiv:2004.00981  [pdf, other

    cs.AI

    Benchmarking End-to-End Behavioural Cloning on Video Games

    Authors: Anssi Kanervisto, Joonas Pussinen, Ville Hautamäki

    Abstract: Behavioural cloning, where a computer is taught to perform a task based on demonstrations, has been successfully applied to various video games and robotics tasks, with and without reinforcement learning. This also includes end-to-end approaches, where a computer plays a video game like humans do: by looking at the image displayed on the screen, and sending keystrokes to the game. As a general app… ▽ More

    Submitted 18 May, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: To appear in IEEE Conference on Games 2020. Experiment code available at https://github.com/joonaspu/video-game-behavioural-cloning and https://github.com/joonaspu/ViControl

  22. arXiv:2004.00980  [pdf, other

    cs.AI

    Action Space Shaping in Deep Reinforcement Learning

    Authors: Anssi Kanervisto, Christian Scheller, Ville Hautamäki

    Abstract: Reinforcement learning (RL) has been successful in training agents in various learning environments, including video-games. However, such work modifies and shrinks the action space from the game's original. This is to avoid trying "pointless" actions and to ease the implementation. Currently, this is mostly done based on intuition, with little systematic research supporting the design decisions. I… ▽ More

    Submitted 26 May, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: To appear in IEEE Conference on Games 2020. Experiment code is available at https://github.com/Miffyli/rl-action-space-shaping

  23. arXiv:2002.03801  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning

    Authors: Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi

    Abstract: The spoofing countermeasure (CM) systems in automatic speaker verification (ASV) are not typically used in isolation of each other. These systems can be combined, for example, into a cascaded system where CM produces first a decision whether the input is synthetic or bona fide speech. In case the CM decides it is a bona fide sample, then the ASV system will consider it for speaker verification. En… ▽ More

    Submitted 8 April, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Odyssey 2020 The Speaker and Language Recognition Workshop. Code available at https://github.com/Miffyli/asv-cm-reinforce

  24. arXiv:1907.03164  [pdf, other

    cs.LG eess.AS stat.ML

    Towards Debugging Deep Neural Networks by Generating Speech Utterances

    Authors: Bilal Soomro, Anssi Kanervisto, Trung Ngo Trong, Ville Hautamäki

    Abstract: Deep neural networks (DNN) are able to successfully process and classify speech utterances. However, understanding the reason behind a classification by DNN is difficult. One such debugging method used with image classification DNNs is activation maximization, which generates example-images that are classified as one of the classes. In this work, we evaluate applicability of this method to speech… ▽ More

    Submitted 6 July, 2019; originally announced July 2019.

    Comments: Accepted to Interspeech 2019

  25. arXiv:1905.04192  [pdf, other

    cs.LG cs.AI cs.SD stat.ML

    Do Autonomous Agents Benefit from Hearing?

    Authors: Abraham Woubie, Anssi Kanervisto, Janne Karttunen, Ville Hautamaki

    Abstract: Mapping states to actions in deep reinforcement learning is mainly based on visual information. The commonly used approach for dealing with visual information is to extract pixels from images and use them as state representation for reinforcement learning agent. But, any vision only agent is handicapped by not being able to sense audible cues. Using hearing, animals are able to sense targets that… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

  26. arXiv:1905.00741  [pdf, other

    cs.LG cs.AI cs.RO

    From Video Game to Real Robot: The Transfer between Action Spaces

    Authors: Janne Karttunen, Anssi Kanervisto, Ville Kyrki, Ville Hautamäki

    Abstract: Deep reinforcement learning has proven to be successful for learning tasks in simulated environments, but applying same techniques for robots in real-world domain is more challenging, as they require hours of training. To address this, transfer learning can be used to train the policy first in a simulated environment and then transfer it to physical agent. As the simulation never matches reality p… ▽ More

    Submitted 23 March, 2020; v1 submitted 2 May, 2019; originally announced May 2019.

    Comments: Two first authors contributed equally. Accepted by ICASSP 2020

  27. arXiv:1811.03293  [pdf, other

    eess.AS cs.SD

    Who Do I Sound Like? Showcasing Speaker Recognition Technology by YouTube Voice Search

    Authors: Ville Vestman, Bilal Soomro, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen

    Abstract: The popularization of science can often be disregarded by scientists as it may be challenging to put highly sophisticated research into words that general public can understand. This work aims to help presenting speaker recognition research to public by proposing a publicly appealing concept for showcasing recognition systems. We leverage data from YouTube and use it in a large-scale voice search… ▽ More

    Submitted 10 February, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: Accepted for presentation in ICASSP 2019

  28. arXiv:1807.10110  [pdf, other

    cs.AI cs.LG

    ToriLLE: Learning Environment for Hand-to-Hand Combat

    Authors: Anssi Kanervisto, Ville Hautamäki

    Abstract: We present Toribash Learning Environment (ToriLLE), a learning environment for machine learning agents based on the video game Toribash. Toribash is a MuJoCo-like environment of two humanoid character fighting each other hand-to-hand, controlled by changing actuation modes of the joints. Competitive nature of Toribash as well its focused domain provide a platform for evaluating self-play methods,… ▽ More

    Submitted 4 June, 2019; v1 submitted 26 July, 2018; originally announced July 2018.

    Comments: https://github.com/Miffyli/ToriLLE . Accepted to IEEE Conference on Games 2019

  29. arXiv:1804.08910  [pdf, other

    cs.SD cs.CY eess.AS

    Perceptual Evaluation of the Effectiveness of Voice Disguise by Age Modification

    Authors: Rosa González Hautamäki, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen

    Abstract: Voice disguise, purposeful modification of one's speaker identity with the aim of avoiding being identified as oneself, is a low-effort way to fool speaker recognition, whether performed by a human or an automatic speaker verification (ASV) system. We present an evaluation of the effectiveness of age stereotypes as a voice disguise strategy, as a follow up to our recent work where 60 native Finnis… ▽ More

    Submitted 28 May, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Accepted to Speaker Odyssey 2018: The Speaker and Language Recognition Workshop

  30. arXiv:1609.04938  [pdf, other

    cs.CV cs.CL cs.LG cs.NE

    Image-to-Markup Generation with Coarse-to-Fine Attention

    Authors: Yuntian Deng, Anssi Kanervisto, Jeffrey Ling, Alexander M. Rush

    Abstract: We present a neural encoder-decoder model to convert images into presentational markup based on a scalable coarse-to-fine attention mechanism. Our method is evaluated in the context of image-to-LaTeX generation, and we introduce a new dataset of real-world rendered mathematical expressions paired with LaTeX markup. We show that unlike neural OCR techniques using CTC-based models, attention-based a… ▽ More

    Submitted 13 June, 2017; v1 submitted 16 September, 2016; originally announced September 2016.

    Comments: Accepted by ICML 2017