Showing 1–41 of 41 results for author: Frank, S

Searching in archive cs.
  1. arXiv:2408.10340  [pdf, other]

    stat.ML cs.LG q-fin.ST stat.AP

    Can an unsupervised clustering algorithm reproduce a categorization system?

    Authors: Nathalia Castellanos, Dhruv Desai, Sebastian Frank, Stefano Pasquali, Dhagash Mehta

    Abstract: Peer analysis is a critical component of investment management, often relying on expert-provided categorization systems. These systems' consistency is questioned when they do not align with cohorts from unsupervised clustering algorithms optimized for various metrics. We investigate whether unsupervised clustering can reproduce ground truth classes in a labeled dataset, showing that success depend…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 tables, 28 figures

  2. arXiv:2408.09604  [pdf, other]

    q-bio.PE cs.LG physics.bio-ph

    Circuit design in biology and machine learning. I. Random networks and dimensional reduction

    Authors: Steven A. Frank

    Abstract: A biological circuit is a neural or biochemical cascade, taking inputs and producing outputs. How have biological circuits learned to solve environmental challenges over the history of life? The answer certainly follows Dobzhansky's famous quote that "nothing in biology makes sense except in the light of evolution." But that quote leaves out the mechanistic basis by which natural selection's tri…

    Submitted 18 August, 2024; originally announced August 2024.

  3. arXiv:2404.17358  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Adversarial Consistency and the Uniqueness of the Adversarial Bayes Classifier

    Authors: Natalie S. Frank

    Abstract: Adversarial training is a common technique for learning robust classifiers. Prior work showed that convex surrogate losses are not statistically consistent in the adversarial context -- or in other words, a minimizing sequence of the adversarial surrogate risk will not necessarily minimize the adversarial classification error. We connect the consistency of adversarial surrogate losses to propertie…

    Submitted 15 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 18 pages, v2: fixed typos

  4. arXiv:2404.16956  [pdf, other]

    cs.LG math.ST stat.ML

    A Notion of Uniqueness for the Adversarial Bayes Classifier

    Authors: Natalie S. Frank

    Abstract: We propose a new notion of uniqueness for the adversarial Bayes classifier in the setting of binary classification. Analyzing this concept produces a simple procedure for computing all adversarial Bayes classifiers for a well-motivated family of one dimensional data distributions. This characterization is then leveraged to show that as the perturbation radius increases, certain the regularity of a…

    Submitted 17 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 49 pages, 7 figures. v2: fixed typos, notation errors, and a mistake in example 7

  5. arXiv:2308.15084  [pdf, other]

    cs.SE

    Introducing Interactions in Multi-Objective Optimization of Software Architectures

    Authors: Vittorio Cortellessa, J. Andres Diaz-Pace, Daniele Di Pompeo, Sebastian Frank, Pooyan Jamshidi, Michele Tucci, André van Hoorn

    Abstract: Software architecture optimization aims to enhance non-functional attributes like performance and reliability while meeting functional requirements. Multi-objective optimization employs metaheuristic search techniques, such as genetic algorithms, to explore feasible architectural changes and propose alternatives to designers. However, the resource-intensive process may not always align with practi…

    Submitted 29 August, 2023; originally announced August 2023.

  6. arXiv:2305.14585  [pdf, other]

    cs.LG

    Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models

    Authors: Andrew Engel, Zhichao Wang, Natalie S. Frank, Ioana Dumitriu, Sutanay Choudhury, Anand Sarwate, Tony Chiang

    Abstract: A recent trend in explainable AI research has focused on surrogate modeling, where neural networks are approximated as simpler ML algorithms such as kernel machines. A second trend has been to utilize kernel functions in various explain-by-example or data attribution tasks. In this work, we combine these two trends to analyze approximate empirical neural tangent kernels (eNTK) for data attribution…

    Submitted 11 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 figures, 3 tables. Updated 3/11/2024: various additions/clarifications after ICLR review. Accepted as a Spotlight paper at ICLR 2024

  7. arXiv:2210.13134  [pdf, other]

    cs.CL cs.CV

    Multilingual Multimodal Learning with Machine Translated Text

    Authors: Chen Qiu, Dan Oneata, Emanuele Bugliarello, Stella Frank, Desmond Elliott

    Abstract: Most vision-and-language pretraining research focuses on English tasks. However, the creation of multilingual multimodal evaluation datasets (e.g. Multi30K, xGQA, XVNLI, and MaRVL) poses a new challenge in finding high-quality training data that is both multilingual and multimodal. In this paper, we investigate whether machine translating English multimodal data can be an effective proxy for the l…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  8. Automatic differentiation and the optimization of differential equation models in biology

    Authors: Steven A. Frank

    Abstract: A computational revolution unleashed the power of artificial neural networks. At the heart of that revolution is automatic differentiation, which calculates the derivative of a performance measure relative to a large number of parameters. Differentiation enhances the discovery of improved performance in large models, an achievement that was previously difficult or impossible. Recently, a second co…

    Submitted 11 October, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

  9. arXiv:2206.09099   

    cs.LG math.ST

    The Consistency of Adversarial Training for Binary Classification

    Authors: Natalie S. Frank, Jonathan Niles-Weed

    Abstract: Robustness to adversarial perturbations is of paramount concern in modern machine learning. One of the state-of-the-art methods for training robust classifiers is adversarial training, which involves minimizing a supremum-based surrogate risk. The statistical consistency of surrogate risks is well understood in the context of standard machine learning, but not in the adversarial setting. In this p…

    Submitted 17 May, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: There was an error in the main theorem of the paper (Theorem 7)

  10. arXiv:2206.09098  [pdf, ps, other]

    cs.LG math.ST

    Existence and Minimax Theorems for Adversarial Surrogate Risks in Binary Classification

    Authors: Natalie S. Frank, Jonathan Niles-Weed

    Abstract: Adversarial training is one of the most popular methods for training models robust to adversarial attacks; however, it is not well understood from a theoretical perspective. We prove existence, regularity, and minimax theorems for adversarial surrogate risks. Our results explain some empirical observations on adversarial robustness from prior work and suggest new directions in algorithm devel…

    Submitted 10 December, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: 42 pages. version 2: corrects several errors and employs a significantly different proof technique. version 3: modifies the arXiv author list but has no other changes. version 4: improved exposition and fixed typos

  11. arXiv:2204.07833  [pdf, other]

    q-bio.QM cs.LG

    Optimizing differential equations to fit data and predict outcomes

    Authors: Steven A. Frank

    Abstract: Many scientific problems focus on observed patterns of change or on how to design a system to achieve particular dynamics. Those problems often require fitting differential equation models to target trajectories. Fitting such models can be difficult because each evaluation of the fit must calculate the distance between the model and target patterns at numerous points along a trajectory. The gradie…

    Submitted 16 April, 2022; originally announced April 2022.

  12. arXiv:2203.10020  [pdf, other]

    cs.CL

    Challenges and Strategies in Cross-Cultural NLP

    Authors: Daniel Hershcovich, Stella Frank, Heather Lent, Miryam de Lhoneux, Mostafa Abdou, Stephanie Brandl, Emanuele Bugliarello, Laura Cabello Piqueras, Ilias Chalkidis, Ruixiang Cui, Constanza Fierro, Katerina Margatina, Phillip Rust, Anders Søgaard

    Abstract: Various efforts in the Natural Language Processing (NLP) community have been made to accommodate linguistic diversity and serve speakers of many different languages. However, it is important to acknowledge that speakers, and the content they produce and require, vary not just by language, but also by culture. Although language and culture are tightly linked, there are important differences. Analogo…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: ACL 2022 - Theme track

  13. arXiv:2203.06937  [pdf, ps, other]

    cs.CL

    Modelling word learning and recognition using visually grounded speech

    Authors: Danny Merkx, Sebastiaan Scholten, Stefan L. Frank, Mirjam Ernestus, Odette Scharenborg

    Abstract: Background: Computational models of speech recognition often assume that the set of target words is already given. This implies that these models do not learn to recognise speech from scratch without prior knowledge and explicit supervision. Visually grounded speech models learn to recognise speech without prior knowledge by exploiting statistical dependencies between spoken and visual input. Whil…

    Submitted 14 March, 2022; originally announced March 2022.

  14. arXiv:2202.10292  [pdf, other]

    cs.CL cs.CV cs.LG

    Seeing the advantage: visually grounding word embeddings to better capture human semantic knowledge

    Authors: Danny Merkx, Stefan L. Frank, Mirjam Ernestus

    Abstract: Distributional semantic models capture word-level meaning that is useful in many natural language processing tasks and have even been shown to capture cognitive aspects of word meaning. The majority of these models are purely text based, even though the human sensory experience is much richer. In this paper we create visually grounded word embeddings by combining English text and images and compar…

    Submitted 21 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics (CMCL) 2022

  15. arXiv:2112.01694  [pdf, other]

    cs.LG stat.ML

    On the Existence of the Adversarial Bayes Classifier (Extended Version)

    Authors: Pranjal Awasthi, Natalie S. Frank, Mehryar Mohri

    Abstract: Adversarial robustness is a critical property in a variety of modern machine learning applications. While it has been the subject of several recent theoretical studies, many important questions related to adversarial robustness are still open. In this work, we study a fundamental question regarding Bayes optimality for adversarial robustness. We provide general sufficient conditions under which th…

    Submitted 28 August, 2023; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 27 pages, 3 figures. Version 2: Corrects 2 errors in the paper "On the Existence of the Adversarial Bayes Classifier" published in NeurIPS. Version 3: Update to acknowledgements

  16. arXiv:2109.06129  [pdf, other]

    cs.CV cs.CL

    Can Language Models Encode Perceptual Structure Without Grounding? A Case Study in Color

    Authors: Mostafa Abdou, Artur Kulmizev, Daniel Hershcovich, Stella Frank, Ellie Pavlick, Anders Søgaard

    Abstract: Pretrained language models have been shown to encode relational information, such as the relations between entities or concepts in knowledge-bases -- (Paris, Capital, France). However, simple relations of this type can often be recovered heuristically and the extent to which models implicitly reflect topological structure that is grounded in the world, such as perceptual structure, is unknown. To expl…

    Submitted 14 September, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: CoNLL 2021

  17. arXiv:2109.04448  [pdf, other]

    cs.CL cs.CV

    Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers

    Authors: Stella Frank, Emanuele Bugliarello, Desmond Elliott

    Abstract: Pretrained vision-and-language BERTs aim to learn representations that combine information from both modalities. We propose a diagnostic method based on cross-modal input ablation to assess the extent to which these models actually integrate cross-modal information. This method involves ablating inputs from one modality, either entirely or selectively based on cross-modal grounding alignments, and…

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  18. Semantic sentence similarity: size does not always matter

    Authors: Danny Merkx, Stefan L. Frank, Mirjam Ernestus

    Abstract: This study addresses the question of whether visually grounded speech recognition (VGS) models learn to capture sentence semantics without access to any prior linguistic knowledge. We produce synthetic and natural spoken versions of a well-known semantic textual similarity database and show that our VGS model produces embeddings that correlate well with human semantic similarity judgements. Our resul…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: This paper has been accepted at Interspeech 2021 where it will be presented and appear in the conference proceedings in September 2021

    Journal ref: Proc. Interspeech 2021

  19. arXiv:2101.11587  [pdf]

    cs.CY cs.AI cs.CV

    The Work of Art in an Age of Mechanical Generation

    Authors: Steven J. Frank

    Abstract: Can we define what it means to be "creative," and if so, can our definition drive artificial intelligence (AI) systems to feats of creativity indistinguishable from human efforts? This mixed question is considered from technological and social perspectives. Beginning with an exploration of the value we attach to authenticity in works of art, the article considers the ability of AI to detect forger…

    Submitted 10 August, 2022; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: This is the author's final version; the article has been accepted for publication in Leonardo Journal

    Journal ref: Leonardo (2022) 55(4): 378-381

  20. arXiv:2010.14544  [pdf, other]

    q-bio.PE cs.IT

    The fundamental equations of change in statistical ensembles and biological populations

    Authors: Steven A. Frank, Frank J. Bruggeman

    Abstract: A recent article in Nature Physics unified key results from thermodynamics, statistics, and information theory. The unification arose from a general equation for the rate of change in the information content of a system. The general equation describes the change in the moments of an observable quantity over a probability distribution. One term in the equation describes the change in the probabilit…

    Submitted 27 October, 2020; originally announced October 2020.

  21. arXiv:2010.07143  [pdf, other]

    q-bio.NC cs.LG

    A Graph Neural Network Framework for Causal Inference in Brain Networks

    Authors: Simon Wein, Wilhelm Malloni, Ana Maria Tomé, Sebastian M. Frank, Gina-Isabelle Henze, Stefan Wüst, Mark W. Greenlee, Elmar W. Lang

    Abstract: A central question in neuroscience is how self-organizing dynamic interactions in the brain emerge on their relatively static structural backbone. Due to the complexity of spatial and temporal dependencies between different brain areas, fully comprehending the interplay between structure and function is still challenging and an area of intense research. In this paper we present a graph neural netw…

    Submitted 14 October, 2020; originally announced October 2020.

  22. arXiv:2008.05458  [pdf]

    eess.SP cs.LG

    Deep-Learning-Based, Multi-Timescale Load Forecasting in Buildings: Opportunities and Challenges from Research to Deployment

    Authors: Sakshi Mishra, Stephen M. Frank, Anya Petersen, Robert Buechler, Michelle Slovensky

    Abstract: Electricity load forecasting for buildings and campuses is becoming increasingly important as the penetration of distributed energy resources (DERs) grows. Efficient operation and dispatch of DERs require reasonably accurate predictions of future energy consumption in order to conduct near-real-time optimized dispatch of on-site generation and storage assets. Electric utilities have traditionally…

    Submitted 16 December, 2021; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: 13 pages, 4 figures

  23. arXiv:2006.02174  [pdf, other]

    cs.CL cs.AI cs.LG

    CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning

    Authors: Alessandro Suglia, Ioannis Konstas, Andrea Vanzo, Emanuele Bastianelli, Desmond Elliott, Stella Frank, Oliver Lemon

    Abstract: Approaches to Grounded Language Learning typically focus on a single task-based final performance measure that may not depend on desirable properties of the learned hidden representations, such as their ability to predict salient attributes or to generalise to unseen situations. To remedy this, we present GROLLA, an evaluation framework for Grounded Language Learning with Attributes with three sub…

    Submitted 3 June, 2020; originally announced June 2020.

    Comments: Accepted to the Annual Conference of the Association for Computational Linguistics (ACL) 2020

  24. arXiv:2005.10600  [pdf]

    cs.CV cs.AI

    A Neural Network Looks at Leonardo's(?) Salvator Mundi

    Authors: Steven J. Frank, Andrea M. Frank

    Abstract: We use convolutional neural networks (CNNs) to analyze authorship questions surrounding the works of Leonardo da Vinci -- in particular, Salvator Mundi, the world's most expensive painting and among the most controversial. Trained on the works of an artist under study and visually comparable works of other artists, our system can identify likely forgeries and shed light on attribution controversie…

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: This is the author's final version. The article has been accepted for publication in Leonardo (MIT Press)

  25. Human Sentence Processing: Recurrence or Attention?

    Authors: Danny Merkx, Stefan L. Frank

    Abstract: Recurrent neural networks (RNNs) have long been an architecture of interest for computational models of human sentence processing. The recently introduced Transformer architecture outperforms RNNs on many natural language processing tasks but little is known about its ability to model human language processing. We compare Transformer- and RNN-based language models' ability to account for measures…

    Submitted 4 May, 2021; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: This paper will appear in the proceedings of CMCL 2021 to be held June 10th

    Journal ref: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics (CMCL) 2021

  26. A Unified Architecture for Data-Driven Metadata Tagging of Building Automation Systems

    Authors: Sakshi Mishra, Andrew Glaws, Dylan Cutler, Stephen Frank, Muhammad Azam, Farzam Mohammadi, Jean-Simon Venne

    Abstract: This article presents a Unified Architecture for automated point tagging of Building Automation System data, based on a combination of data-driven approaches. Advanced energy analytics applications, including fault detection and diagnostics and supervisory control, have emerged as a significant opportunity for improving the performance of our built environment. Effective application of these analyti…

    Submitted 11 September, 2020; v1 submitted 26 February, 2020; originally announced March 2020.

    Comments: 19 pages, 9 figures, accepted for publication in Automation in Construction

  27. Resource-Frugal Classification and Analysis of Pathology Slides Using Image Entropy

    Authors: Steven J. Frank

    Abstract: Pathology slides of lung malignancies are classified using resource-frugal convolution neural networks (CNNs) that may be deployed on mobile devices. In particular, the challenging task of distinguishing adenocarcinoma (LUAD) and squamous-cell carcinoma (LUSC) lung cancer subtypes is approached in two stages. First, whole-slide histopathology images are downsampled to a size too large for CNN anal…

    Submitted 2 December, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

    Journal ref: Biomedical Signal Processing and Control, vol. 66, April 2021, 102388

  28. arXiv:2002.05107  [pdf]

    cs.CV

    Analysis of Dutch Master Paintings with Convolutional Neural Networks

    Authors: Steven J. Frank, Andrea M. Frank

    Abstract: Trained on the works of an artist under study and visually comparable works of other artists, convolutional neural networks can identify forgeries and provide attributions. They can also assign classification probabilities within a painting, revealing mixed authorship and identifying regions painted by different hands.

    Submitted 16 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

  29. arXiv:1910.05291  [pdf, other]

    cs.CL cs.AI cs.LG

    The Emergence of Compositional Languages for Numeric Concepts Through Iterated Learning in Neural Agents

    Authors: Shangmin Guo, Yi Ren, Serhii Havrylov, Stella Frank, Ivan Titov, Kenny Smith

    Abstract: Since first introduced, computer simulation has been an increasingly important tool in evolutionary linguistics. Recently, with the development of deep learning techniques, research in grounded language learning has also started to focus on facilitating the emergence of compositional languages without pre-defined elementary linguistic knowledge. In this work, we explore the emergence of compositio…

    Submitted 11 October, 2019; originally announced October 2019.

  30. Language learning using Speech to Image retrieval

    Authors: Danny Merkx, Stefan L. Frank, Mirjam Ernestus

    Abstract: Humans learn language by interaction with their environment and listening to other humans. It should also be possible for computational models to learn language directly from speech but so far most approaches require text. We improve on existing neural network approaches to create visually grounded embeddings for spoken utterances. Using a combination of a multi-layer GRU, importance sampling, cyc…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: Submitted to InterSpeech 2019

    Journal ref: Proc. Interspeech 2019

  31. arXiv:1907.12436  [pdf]

    cs.CV cs.LG eess.IV

    Salient Slices: Improved Neural Network Training and Performance with Image Entropy

    Authors: Steven J. Frank, Andrea M. Frank

    Abstract: As a training and analysis strategy for convolutional neural networks (CNNs), we slice images into tiled segments and use, for training and prediction, segments that both satisfy a criterion of information diversity and contain sufficient content to support classification. In particular, we utilize image entropy as the diversity criterion. This ensures that each tile carries as much information di…

    Submitted 4 May, 2020; v1 submitted 29 July, 2019; originally announced July 2019.

    Comments: Final version; article will be published in Neural Computation 32, 1222-1237 (June 2020)

  32. arXiv:1904.00825  [pdf, other]

    q-bio.PE cond-mat.stat-mech cs.IT

    Simple unity among the fundamental equations of science

    Authors: Steven A. Frank

    Abstract: The Price equation describes the change in populations. Change concerns some value, such as biological fitness, information or physical work. The Price equation reveals universal aspects for the nature of change, independently of the meaning ascribed to values. By understanding those universal aspects, we can see more clearly why fundamental mathematical results in different disciplines often shar…

    Submitted 4 August, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

    Comments: arXiv admin note: text overlap with arXiv:1810.09262

  33. Learning semantic sentence representations from visually grounded language without lexical knowledge

    Authors: Danny Merkx, Stefan Frank

    Abstract: Current approaches to learning semantic representations of sentences often use prior word-level knowledge. The current study aims to leverage visual information in order to capture sentence level semantics without the need for word embeddings. We use a multimodal sentence encoder trained on a corpus of images with matching text captions to produce visually grounded sentence embeddings. Deep Neural…

    Submitted 27 March, 2019; originally announced March 2019.

    Journal ref: Natural Language Engineering, Volume 25 - Issue 4 - July 2019

  34. arXiv:1810.09262  [pdf, other]

    q-bio.PE cond-mat.stat-mech cs.IT

    The Price equation program: simple invariances unify population dynamics, thermodynamics, probability, information and inference

    Authors: Steven A. Frank

    Abstract: The fundamental equations of various disciplines often seem to share the same basic structure. Natural selection increases information in the same way that Bayesian updating increases information. Thermodynamics and the forms of common probability distributions express maximum increase in entropy, which appears mathematically as loss of information. Physical mechanics follows paths of change that…

    Submitted 14 December, 2018; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: Version 3: added figure illustrating geometry; added table of symbols and two tables summarizing mathematical relations; this version accepted for publication in Entropy

    Journal ref: 2018. Entropy 20:978

  35. arXiv:1809.08758  [pdf, other]

    cs.CV

    Low Frequency Adversarial Perturbation

    Authors: Chuan Guo, Jared S. Frank, Kilian Q. Weinberger

    Abstract: Adversarial images aim to change a target model's decision by minimally perturbing a target image. In the black-box setting, the absence of gradient information often renders this search problem costly in terms of query complexity. In this paper we propose to restrict the search for adversarial images to a low frequency domain. This approach is readily compatible with many existing black-box attac…

    Submitted 22 July, 2019; v1 submitted 24 September, 2018; originally announced September 2018.

    Comments: 9 pages, 9 figures. Accepted to UAI 2019

  36. arXiv:1710.07177  [pdf, other]

    cs.CL cs.CV

    Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description

    Authors: Desmond Elliott, Stella Frank, Loïc Barrault, Fethi Bougares, Lucia Specia

    Abstract: We present the results from the second shared task on multimodal machine translation and multilingual image description. Nine teams submitted 19 systems to two tasks. The multimodal translation task, in which the source sentence is supplemented by an image, was extended with a new language (French) and two new test sets. The multilingual image description task was changed such that at test time, o…

    Submitted 19 October, 2017; originally announced October 2017.

    Journal ref: Proceedings of the Second Conference on Machine Translation, 2017, pp. 215--233

  37. Lexical representation explains cortical entrainment during speech comprehension

    Authors: Stefan Frank, Jinbiao Yang

    Abstract: Results from a recent neuroimaging study on spoken sentence comprehension have been interpreted as evidence for cortical entrainment to hierarchical syntactic structure. We present a simple computational model that predicts the power spectra from this study, even though the model's linguistic knowledge is restricted to the lexical level, and word-level representations are not combined into higher-…

    Submitted 10 January, 2018; v1 submitted 18 June, 2017; originally announced June 2017.

    Comments: Submitted for publication

  38. arXiv:1605.00459  [pdf, ps, other]

    cs.CL cs.CV

    Multi30K: Multilingual English-German Image Descriptions

    Authors: Desmond Elliott, Stella Frank, Khalil Sima'an, Lucia Specia

    Abstract: We introduce the Multi30K dataset to stimulate multilingual multimodal research. Recent advances in image description have been demonstrated on English-language datasets almost exclusively, but image description should not be limited to English. This dataset extends the Flickr30K dataset with i) German translations created by professional translators over a subset of the English descriptions, and…

    Submitted 2 May, 2016; originally announced May 2016.

  39. arXiv:1510.04709  [pdf, ps, other]

    cs.CL cs.CV cs.LG cs.NE

    Multilingual Image Description with Neural Sequence Models

    Authors: Desmond Elliott, Stella Frank, Eva Hasler

    Abstract: In this paper we present an approach to multi-language image description bringing together insights from neural machine translation and neural image description. To create a description of an image for a given target language, our sequence generation models condition on feature vectors from the image, the description from the source language, and/or a multimodal vector computed over the image and…

    Submitted 18 November, 2015; v1 submitted 15 October, 2015; originally announced October 2015.

    Comments: Under review as a conference paper at ICLR 2016

  40. arXiv:1509.04473  [pdf, other]

    cs.CL

    Splitting Compounds by Semantic Analogy

    Authors: Joachim Daiber, Lautaro Quiroz, Roger Wechsler, Stella Frank

    Abstract: Compounding is a highly productive word-formation process in some languages that is often problematic for natural language processing applications. In this paper, we investigate whether distributional semantics in the form of word embeddings can enable a deeper, i.e., more knowledge-rich, processing of compounds than the standard string-based methods. We present an unsupervised approach that explo…

    Submitted 15 September, 2015; originally announced September 2015.

    Journal ref: Proceedings of the 1st Deep Machine Translation Workshop. Prague, Czech Republic. 2015

  41. arXiv:1412.1285  [pdf, other]

    q-bio.PE cs.NE physics.bio-ph

    The inductive theory of natural selection: summary and synthesis

    Authors: Steven A. Frank

    Abstract: The theory of natural selection has two forms. Deductive theory describes how populations change over time. One starts with an initial population and some rules for change. From those assumptions, one calculates the future state of the population. Deductive theory predicts how populations adapt to environmental challenge. Inductive theory describes the causes of change in populations. One starts w…

    Submitted 12 November, 2016; v1 submitted 3 December, 2014; originally announced December 2014.

    Comments: Version 2: Changed title. Noted that condensed and simplified version of this manuscript will be published as book chapter with original title "The inductive theory of natural selection." See footnote on title page of pdf