Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Polosukhin, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:1807.03168  [pdf, ps, other

    cs.LG cs.AI cs.PL stat.ML

    NAPS: Natural Program Synthesis Dataset

    Authors: Maksym Zavershynskyi, Alex Skidanov, Illia Polosukhin

    Abstract: We present a program synthesis-oriented dataset consisting of human written problem statements and solutions for these problems. The problem statements were collected via crowdsourcing and the program solutions were extracted from human-written solutions in programming competitions, accompanied by input/output examples. We propose using this dataset for the program synthesis tasks aimed for workin… ▽ More

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: 4 pages, 5 tables in 2nd Workshop on Neural Abstract Machines & Program Induction (NAMPI), @ICML 2018

  2. arXiv:1802.04335  [pdf, other

    cs.AI cs.CL cs.PL

    Neural Program Search: Solving Programming Tasks from Description and Examples

    Authors: Illia Polosukhin, Alexander Skidanov

    Abstract: We present a Neural Program Search, an algorithm to generate programs from natural language description and a small number of input/output examples. The algorithm combines methods from Deep Learning and Program Synthesis fields by designing rich domain-specific language (DSL) and defining efficient search algorithm guided by a Seq2Tree model on it. To evaluate the quality of the approach we also p… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

    Comments: 9 pages, 3 figures, ICLR workshop

  3. TensorFlow Estimators: Managing Simplicity vs. Flexibility in High-Level Machine Learning Frameworks

    Authors: Heng-Tze Cheng, Zakaria Haque, Lichan Hong, Mustafa Ispir, Clemens Mewald, Illia Polosukhin, Georgios Roumpos, D Sculley, Jamie Smith, David Soergel, Yuan Tang, Philipp Tucker, Martin Wicke, Cassandra Xia, Jianwei Xie

    Abstract: We present a framework for specifying, training, evaluating, and deploying machine learning models. Our focus is on simplifying cutting edge machine learning for practitioners in order to bring such technologies into production. Recognizing the fast evolution of the field of deep learning, we make no attempt to capture the design space of all possible model architectures in a domain- specific lang… ▽ More

    Submitted 8 August, 2017; originally announced August 2017.

    Comments: 8 pages, Appeared at KDD 2017, August 13--17, 2017, Halifax, NS, Canada

  4. arXiv:1706.03762  [pdf, other

    cs.CL cs.LG

    Attention Is All You Need

    Authors: Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin

    Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experi… ▽ More

    Submitted 1 August, 2023; v1 submitted 12 June, 2017; originally announced June 2017.

    Comments: 15 pages, 5 figures

  5. arXiv:1611.01839  [pdf, other

    cs.CL

    Hierarchical Question Answering for Long Documents

    Authors: Eunsol Choi, Daniel Hewlett, Alexandre Lacoste, Illia Polosukhin, Jakob Uszkoreit, Jonathan Berant

    Abstract: We present a framework for question answering that can efficiently scale to longer documents while maintaining or even improving performance of state-of-the-art models. While most successful approaches for reading comprehension rely on recurrent neural networks (RNNs), running them over long documents is prohibitively slow because it is difficult to parallelize over sequences. Inspired by how peop… ▽ More

    Submitted 8 February, 2017; v1 submitted 6 November, 2016; originally announced November 2016.

  6. arXiv:1608.03542  [pdf, other

    cs.CL

    WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia

    Authors: Daniel Hewlett, Alexandre Lacoste, Llion Jones, Illia Polosukhin, Andrew Fandrianto, Jay Han, Matthew Kelcey, David Berthelot

    Abstract: We present WikiReading, a large-scale natural language understanding task and publicly-available dataset with 18 million instances. The task is to predict textual values from the structured knowledge base Wikidata by reading the text of the corresponding Wikipedia articles. The task contains a rich variety of challenging classification and extraction sub-tasks, making it well-suited for end-to-end… ▽ More

    Submitted 15 March, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

    Journal ref: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2016, pp. 1535-1545