Showing 1–36 of 36 results for author: Robinson, N

Searching in archive cs.
  1. arXiv:2406.12725  [pdf]

    cs.CL cs.AI

    Can Large Language Models Code Like a Linguist?: A Case Study in Low Resource Sound Law Induction

    Authors: Atharva Naik, Kexun Zhang, Nathaniel Robinson, Aravind Mysore, Clayton Marr, Hong Sng Rebecca Byrnes, Anna Cai, Kalvin Chang, David Mortensen

    Abstract: Historical linguists have long written a kind of incompletely formalized "program" that converts reconstructed words in an ancestor language into words in one of its attested descendants that consist of a series of ordered string rewrite functions (called sound laws). They do this by observing pairs of words in the reconstructed language (protoforms) and the descendant language (reflexes) and co…

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2405.05376  [pdf, other]

    cs.CL

    Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages

    Authors: Nathaniel R. Robinson, Raj Dabre, Ammon Shurtz, Rasul Dent, Onenamiyi Onesi, Claire Bizon Monroc, Loïc Grobol, Hasan Muhammad, Ashi Garg, Naome A. Etori, Vijay Murari Tiyyala, Olanrewaju Samuel, Matthew Dean Stutzman, Bismarck Bamfo Odoom, Sanjeev Khudanpur, Stephen D. Richardson, Kenton Murray

    Abstract: A majority of language technologies are tailored for a small number of high-resource languages, while relatively many low-resource languages are neglected. One such group, Creole languages, have long been marginalized in academic study, though their speakers could benefit from machine translation (MT). These languages are predominantly used in much of Latin America, Africa and the Caribbean. We pr…

    Submitted 13 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: NAACL 2024

  3. arXiv:2403.13169  [pdf, other]

    cs.CL

    Wav2Gloss: Generating Interlinear Glossed Text from Speech

    Authors: Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel R. Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R. Mortensen, Lori Levin

    Abstract: Thousands of the world's languages are in danger of extinction--a tremendous threat to cultural identities and human language diversity. Interlinear Glossed Text (IGT) is a form of linguistic annotation that can support documentation and resource creation for these languages' communities. IGT typically consists of (1) transcriptions, (2) morphological segmentation, (3) glosses, and (4) free transl…

    Submitted 5 June, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: ACL 2024 camera ready version

  4. arXiv:2402.01582  [pdf]

    cs.CL

    Automating Sound Change Prediction for Phylogenetic Inference: A Tukanoan Case Study

    Authors: Kalvin Chang, Nathaniel R. Robinson, Anna Cai, Ting Chen, Annie Zhang, David R. Mortensen

    Abstract: We describe a set of new methods to partially automate linguistic phylogenetic inference given (1) cognate sets with their respective protoforms and sound laws, (2) a mapping from phones to their articulatory features and (3) a typological database of sound changes. We train a neural network on these sound change data to weight articulatory distances between phones and predict intermediate sound c…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to LChange 2023

  5. arXiv:2312.00183  [pdf, other]

    cs.CE cs.AI

    RNA-KG: An ontology-based knowledge graph for representing interactions involving RNA molecules

    Authors: Emanuele Cavalleri, Alberto Cabri, Mauricio Soto-Gomez, Sara Bonfitto, Paolo Perlasca, Jessica Gliozzo, Tiffany J. Callahan, Justin Reese, Peter N Robinson, Elena Casiraghi, Giorgio Valentini, Marco Mesiti

    Abstract: The "RNA world" represents a novel frontier for the study of fundamental biological processes and human diseases and is paving the way for the development of new drugs tailored to the patient's biomolecular characteristics. Although scientific data about coding and non-coding RNA molecules are continuously produced and available from public repositories, they are scattered across different databas…

    Submitted 30 November, 2023; originally announced December 2023.

  6. arXiv:2309.17169  [pdf]

    cs.CL cs.AI

    An evaluation of GPT models for phenotype concept recognition

    Authors: Tudor Groza, Harry Caufield, Dylan Gration, Gareth Baynam, Melissa A Haendel, Peter N Robinson, Christopher J Mungall, Justin T Reese

    Abstract: Objective: Clinical deep phenotyping and phenotype annotation play a critical role in both the diagnosis of patients with rare disorders as well as in building computationally-tractable knowledge in the rare disorders field. These processes rely on using ontology concepts, often from the Human Phenotype Ontology, in conjunction with a phenotype concept recognition task (supported usually by machin…

    Submitted 22 November, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

  7. arXiv:2309.07423  [pdf, other]

    cs.CL

    ChatGPT MT: Competitive for High- (but not Low-) Resource Languages

    Authors: Nathaniel R. Robinson, Perez Ogayo, David R. Mortensen, Graham Neubig

    Abstract: Large language models (LLMs) implicitly learn to perform a range of language tasks, including machine translation (MT). Previous studies explore aspects of LLMs' MT capabilities. However, there exist a wide variety of languages for which recent LLM MT performance has never before been evaluated. Without published experimental evidence on the matter, it is difficult for speakers of the world's dive…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 27 pages, 9 figures, 14 tables

  8. arXiv:2308.06435  [pdf]

    cs.RO cs.HC

    A Brief Wellbeing Training Session Delivered by a Humanoid Social Robot: A Pilot Randomized Controlled Trial

    Authors: Nicole Robinson, Jennifer Connolly, Gavin Suddrey, David J. Kavanagh

    Abstract: Mental health and psychological distress are rising in adults, showing the importance of wellbeing promotion, support, and technique practice that is effective and accessible. Interactive social robots have been tested to deliver health programs but have not been explored to deliver wellbeing technique training in detail. A pilot randomised controlled trial was conducted to explore the feasibility…

    Submitted 11 August, 2023; originally announced August 2023.

  9. Robotic Vision for Human-Robot Interaction and Collaboration: A Survey and Systematic Review

    Authors: Nicole Robinson, Brendan Tidd, Dylan Campbell, Dana Kulić, Peter Corke

    Abstract: Robotic vision for human-robot interaction and collaboration is a critical process for robots to collect and interpret detailed information related to human actions, goals, and preferences, enabling robots to provide more useful services to people. This survey and systematic review presents a comprehensive analysis on robotic vision in human-robot interaction and collaboration over the last 10 yea…

    Submitted 28 July, 2023; originally announced July 2023.

    Journal ref: ACM Transactions on Human-Robot Interaction (2023) Volume 12 Issue 1 Article No 12 pp 1-66

  10. arXiv:2307.05727  [pdf]

    cs.AI cs.CE

    An Open-Source Knowledge Graph Ecosystem for the Life Sciences

    Authors: Tiffany J. Callahan, Ignacio J. Tripodi, Adrianne L. Stefanski, Luca Cappelletti, Sanya B. Taneja, Jordan M. Wyrwa, Elena Casiraghi, Nicolas A. Matentzoglu, Justin Reese, Jonathan C. Silverstein, Charles Tapley Hoyt, Richard D. Boyce, Scott A. Malec, Deepak R. Unni, Marcin P. Joachimiak, Peter N. Robinson, Christopher J. Mungall, Emanuele Cavalleri, Tommaso Fontana, Giorgio Valentini, Marco Mesiti, Lucas A. Gillenwater, Brook Santangelo, Nicole A. Vasilevsky, Robert Hoehndorf, et al. (7 additional authors not shown)

    Abstract: Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data, but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to construct them automatically. However, tackling complex biomedical integrat…

    Submitted 30 January, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

  11. arXiv:2304.02711  [pdf, other]

    cs.AI cs.LG

    Structured prompt interrogation and recursive extraction of semantics (SPIRES): A method for populating knowledge bases using zero-shot learning

    Authors: J. Harry Caufield, Harshad Hegde, Vincent Emonet, Nomi L. Harris, Marcin P. Joachimiak, Nicolas Matentzoglu, HyeongSik Kim, Sierra A. T. Moxon, Justin T. Reese, Melissa A. Haendel, Peter N. Robinson, Christopher J. Mungall

    Abstract: Creating knowledge bases and ontologies is a time-consuming task that relies on manual curation. AI/NLP approaches can assist expert curators in populating these knowledge bases, but current approaches rely on extensive training data, and are not able to populate arbitrarily complex nested knowledge schemas. Here we present Structured Prompt Interrogation and Recursive Extraction of Semantics (S…

    Submitted 22 December, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Updated 2023-12-22

  12. arXiv:2304.02541  [pdf, other]

    cs.CL

    PWESuite: Phonetic Word Embeddings and Tasks They Facilitate

    Authors: Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nathaniel Carlson, Nathaniel Robinson, Mrinmaya Sachan, David Mortensen

    Abstract: Mapping words into a fixed-dimensional vector space is the backbone of modern NLP. While most word embedding methods successfully encode semantic information, they overlook phonetic information that is crucial for many tasks. We develop three methods that use articulatory features to build phonetically informed word embeddings. To address the inconsistent evaluation of existing phonetic word embed…

    Submitted 26 March, 2024; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: LREC-COLING 2024

  13. arXiv:2302.10800  [pdf]

    q-bio.QM cs.AI cs.LG

    KG-Hub -- Building and Exchanging Biological Knowledge Graphs

    Authors: J Harry Caufield, Tim Putman, Kevin Schaper, Deepak R Unni, Harshad Hegde, Tiffany J Callahan, Luca Cappelletti, Sierra AT Moxon, Vida Ravanmehr, Seth Carbon, Lauren E Chan, Katherina Cortes, Kent A Shefchek, Glass Elsarboukh, James P Balhoff, Tommaso Fontana, Nicolas Matentzoglu, Richard M Bruskiewich, Anne E Thessen, Nomi L Harris, Monica C Munoz-Torres, Melissa A Haendel, Peter N Robinson, Marcin P Joachimiak, Christopher J Mungall, et al. (1 additional author not shown)

    Abstract: Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of knowledge graphs is lacking. Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of knowledge graphs. Features include a simp…

    Submitted 31 January, 2023; originally announced February 2023.

  14. arXiv:2212.05626  [pdf, other]

    cs.RO

    Human-Robot Team Performance Compared to Full Robot Autonomy in 16 Real-World Search and Rescue Missions: Adaptation of the DARPA Subterranean Challenge

    Authors: Nicole Robinson, Jason Williams, David Howard, Brendan Tidd, Fletcher Talbot, Brett Wood, Alex Pitt, Navinda Kottege, Dana Kulić

    Abstract: Human operators in human-robot teams are commonly perceived to be critical for mission success. To explore the direct and perceived impact of operator input on task success and team performance, 16 real-world missions (10 hrs) were conducted based on the DARPA Subterranean Challenge. These missions were to deploy a heterogeneous team of robots for a search task to locate and identify artifacts suc…

    Submitted 11 December, 2022; originally announced December 2022.

    Comments: Submitted to Transactions on Human-Robot Interaction

  15. arXiv:2209.06295  [pdf, other]

    cs.CL

    Data-adaptive Transfer Learning for Translation: A Case Study in Haitian and Jamaican

    Authors: Nathaniel R. Robinson, Cameron J. Hogan, Nancy Fulda, David R. Mortensen

    Abstract: Multilingual transfer techniques often improve low-resource machine translation (MT). Many of these techniques are applied without considering data characteristics. We show in the context of Haitian-to-English translation that transfer effectiveness is correlated with amount of training data and relationships between knowledge-sharing languages. Our experiments suggest that for some languages beyo…

    Submitted 13 September, 2022; originally announced September 2022.

  16. arXiv:2209.04732  [pdf]

    cs.DB cs.AI

    Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality

    Authors: Tiffany J. Callahan, Adrianne L. Stefanski, Jordan M. Wyrwa, Chenjie Zeng, Anna Ostropolets, Juan M. Banda, William A. Baumgartner Jr., Richard D. Boyce, Elena Casiraghi, Ben D. Coleman, Janine H. Collins, Sara J. Deakyne-Davies, James A. Feinstein, Melissa A. Haendel, Asiyah Y. Lin, Blake Martin, Nicolas A. Matentzoglu, Daniella Meeker, Justin Reese, Jessica Sinclair, Sanya B. Taneja, Katy E. Trinkley, Nicole A. Vasilevsky, Andrew Williams, Xingman A. Zhang, et al. (7 additional authors not shown)

    Abstract: Background: Common data models solve many challenges of standardizing electronic health record (EHR) data, but are unable to semantically integrate all the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide computable representations of biological knowledge and enable the integration of heterogeneous data. However, mapping EHR data to OB…

    Submitted 30 January, 2023; v1 submitted 10 September, 2022; originally announced September 2022.

    Comments: Supplementary Material is included at the end of the manuscript

    ACM Class: J.3

  17. arXiv:2208.11856  [pdf, other]

    cs.RO

    Design and Implementation of a Human-Robot Joint Action Framework using Augmented Reality and Eye Gaze

    Authors: Wesley P. Chan, Morgan Crouch, Khoa Hoang, Charlie Chen, Nicole Robinson, Elizabeth Croft

    Abstract: When humans work together to complete a joint task, each person builds an internal model of the situation and how it will evolve. Efficient collaboration is dependent on how these individual models overlap to form a shared mental model among team members, which is important for collaborative processes in human-robot teams. The development and maintenance of an accurate shared mental model requires…

    Submitted 24 August, 2022; originally announced August 2022.

  18. arXiv:2207.09889  [pdf, other]

    cs.CL cs.SD eess.AS

    When Is TTS Augmentation Through a Pivot Language Useful?

    Authors: Nathaniel Robinson, Perez Ogayo, Swetha Gangu, David R. Mortensen, Shinji Watanabe

    Abstract: Developing Automatic Speech Recognition (ASR) for low-resource languages is a challenge due to the small amount of transcribed audio data. For many such languages, audio and text are available separately, but not audio with transcriptions. Using text, speech can be synthetically produced via text-to-speech (TTS) systems. However, many low-resource languages do not have quality TTS systems either.…

    Submitted 20 July, 2022; originally announced July 2022.

  19. arXiv:2206.06444  [pdf]

    cs.AI cs.CY stat.AP

    A method for comparing multiple imputation techniques: a case study on the U.S. National COVID Cohort Collaborative

    Authors: Elena Casiraghi, Rachel Wong, Margaret Hall, Ben Coleman, Marco Notaro, Michael D. Evans, Jena S. Tronieri, Hannah Blau, Bryan Laraway, Tiffany J. Callahan, Lauren E. Chan, Carolyn T. Bramante, John B. Buse, Richard A. Moffitt, Til Sturmer, Steven G. Johnson, Yu Raymond Shao, Justin Reese, Peter N. Robinson, Alberto Paccanaro, Giorgio Valentini, Jared D. Huling, Kenneth Wilkins, Tell Bennet, et al. (12 additional authors not shown)

    Abstract: Healthcare datasets obtained from Electronic Health Records have proven to be extremely useful to assess associations between patients' predictors and outcomes of interest. However, these datasets often suffer from missing values in a high proportion of cases and the simple removal of these cases may introduce severe bias. For these reasons, several multiple imputation algorithms have been propose…

    Submitted 25 September, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  20. arXiv:2203.15172  [pdf, other]

    cs.NE cs.RO

    Assessing Evolutionary Terrain Generation Methods for Curriculum Reinforcement Learning

    Authors: David Howard, Josh Kannemeyer, Davide Dolcetti, Humphrey Munn, Nicole Robinson

    Abstract: Curriculum learning allows complex tasks to be mastered via incremental progression over 'stepping stone' goals towards a final desired behaviour. Typical implementations learn locomotion policies for challenging environments through gradual complexification of a terrain mesh generated through a parameterised noise function. To date, researchers have predominantly generated terrains from a limited…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 8 pages, 8 figures, 2022 Genetic and Evolutionary Computing Conference (GECCO'22)

  21. arXiv:2203.10403  [pdf]

    cs.CR

    An Exploratory Study into Vulnerability Chaining Blindness Terminology and Viability

    Authors: Nikki Robinson

    Abstract: To tie together the concepts of linkage blindness and the inability to link vulnerabilities together in a Vulnerability Management Program (VMP), the researcher postulated new terminology. The terminology of vulnerability chaining blindness is proposed to understand the underlying issues behind vulnerability management and vulnerabilities that can be used in combination. The general problem is tha…

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: 24 pages

  22. arXiv:2110.06196  [pdf, other]

    cs.LG cs.DC

    GRAPE for Fast and Scalable Graph Processing and random walk-based Embedding

    Authors: Luca Cappelletti, Tommaso Fontana, Elena Casiraghi, Vida Ravanmehr, Tiffany J. Callahan, Carlos Cano, Marcin P. Joachimiak, Christopher J. Mungall, Peter N. Robinson, Justin Reese, Giorgio Valentini

    Abstract: Graph Representation Learning (GRL) methods opened new avenues for addressing complex, real-world problems represented by graphs. However, many graphs used in these applications comprise millions of nodes and billions of edges and are beyond the capabilities of current methods and software implementations. We present GRAPE, a software resource for graph processing and embedding that can scale with…

    Submitted 7 May, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    ACM Class: D.m; E.2; I.2.6; I.5.5

  23. arXiv:2109.09908  [pdf, other]

    cs.RO

    A Proposed Set of Communicative Gestures for Human Robot Interaction and an RGB Image-based Gesture Recognizer Implemented in ROS

    Authors: Jia Chuan A. Tan, Wesley P. Chan, Nicole L. Robinson, Elizabeth A. Croft, Dana Kulic

    Abstract: We propose a set of communicative gestures and develop a gesture recognition system with the aim of facilitating more intuitive Human-Robot Interaction (HRI) through gestures. First, we propose a set of commands commonly used for human-robot interaction. Next, an online user study with 190 participants was performed to investigate if there was an agreed set of gestures that people intuitively use…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 6 pages, 5 figures, 3 tables, ICRA 2022 Conference

  24. arXiv:2105.02786  [pdf, other]

    cs.NE cs.LG eess.SP q-bio.NC

    LGGNet: Learning from Local-Global-Graph Representations for Brain-Computer Interface

    Authors: Yi Ding, Neethu Robinson, Chengxuan Tong, Qiuhao Zeng, Cuntai Guan

    Abstract: Neuropsychological studies suggest that co-operative activities among different brain functional areas drive high-level cognitive processes. To learn the brain activities within and among different functional areas of the brain, we propose LGGNet, a novel neurologically inspired graph neural network, to learn local-global-graph representations of electroencephalography (EEG) for Brain-Computer Int…

    Submitted 5 December, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  25. TSception: Capturing Temporal Dynamics and Spatial Asymmetry from EEG for Emotion Recognition

    Authors: Yi Ding, Neethu Robinson, Su Zhang, Qiuhao Zeng, Cuntai Guan

    Abstract: The high temporal resolution and the asymmetric spatial activations are essential attributes of electroencephalogram (EEG) underlying emotional processes in the brain. To learn the temporal dynamics and spatial asymmetry of EEG towards accurate and generalized emotion recognition, we propose TSception, a multi-scale convolutional neural network that can classify emotions from EEG. TSception consis…

    Submitted 24 April, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: Accepted as a regular paper in IEEE Transactions on Affective Computing. A version after proof-reading. Some typos in the Early Access version of IEEE Xplore are corrected

    Journal ref: IEEE Transactions on Affective Computing 2022

  26. arXiv:2104.01233  [pdf, other]

    cs.OH cs.AI cs.LG eess.SP

    FBCNet: A Multi-view Convolutional Neural Network for Brain-Computer Interface

    Authors: Ravikiran Mane, Effie Chew, Karen Chua, Kai Keng Ang, Neethu Robinson, A. P. Vinod, Seong-Whan Lee, Cuntai Guan

    Abstract: Lack of adequate training samples and noisy high-dimensional features are key challenges faced by Motor Imagery (MI) decoding algorithms for electroencephalogram (EEG) based Brain-Computer Interface (BCI). To address these challenges, inspired by neuro-physiological signatures of MI, this paper proposes a novel Filter-Bank Convolutional Network (FBCNet) for MI classification. FBCNet employs a mu…

    Submitted 17 March, 2021; originally announced April 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  27. Skillful Precipitation Nowcasting using Deep Generative Models of Radar

    Authors: Suman Ravuri, Karel Lenc, Matthew Willson, Dmitry Kangin, Remi Lam, Piotr Mirowski, Megan Fitzsimons, Maria Athanassiadou, Sheleem Kashem, Sam Madge, Rachel Prudden, Amol Mandhane, Aidan Clark, Andrew Brock, Karen Simonyan, Raia Hadsell, Niall Robinson, Ellen Clancy, Alberto Arribas, Shakir Mohamed

    Abstract: Precipitation nowcasting, the high-resolution forecasting of precipitation up to two hours ahead, supports the real-world socio-economic needs of many sectors reliant on weather-dependent decision-making. State-of-the-art operational nowcasting methods typically advect precipitation fields with radar-based wind estimates, and struggle to capture important non-linear events such as convective initi…

    Submitted 2 April, 2021; originally announced April 2021.

    Comments: 46 pages, 17 figures, 2 tables

  28. A Review of Evaluation Practices of Gesture Generation in Embodied Conversational Agents

    Authors: Pieter Wolfert, Nicole Robinson, Tony Belpaeme

    Abstract: Embodied conversational agents (ECA) are often designed to produce nonverbal behavior to complement or enhance their verbal communication. One such form of nonverbal behavior is co-speech gesturing, which involves movements that the agent makes with its arms and hands that are paired with verbal communication. Co-speech gestures for ECAs can be created using different generation methods, divided i…

    Submitted 1 March, 2022; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: 11 pages, accepted for publication in IEEE Transactions on Human-Machine Systems

  29. arXiv:2012.05983  [pdf, other]

    cs.CL cs.AI

    Towards Neural Programming Interfaces

    Authors: Zachary C. Brown, Nathaniel Robinson, David Wingate, Nancy Fulda

    Abstract: It is notoriously difficult to control the behavior of artificial neural networks such as generative neural language models. We recast the problem of controlling natural language generation as that of learning to interface with a pretrained language model, just as Application Programming Interfaces (APIs) control the behavior of programs by altering hyperparameters. In this new paradigm, a special…

    Submitted 17 February, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 24 pages total (13 for main paper and references, 11 for Appendix 1), accepted for publication in Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

    Journal ref: Neural Information Processing Systems 33 (2020) 17416-17428

  30. PhenoTagger: A Hybrid Method for Phenotype Concept Recognition using Human Phenotype Ontology

    Authors: Ling Luo, Shankai Yan, Po-Ting Lai, Daniel Veltri, Andrew Oler, Sandhya Xirasagar, Rajarshi Ghosh, Morgan Similuk, Peter N. Robinson, Zhiyong Lu

    Abstract: Automatic phenotype concept recognition from unstructured text remains a challenging task in biomedical text mining research. Previous works that address the task typically use dictionary-based matching methods, which can achieve high precision but suffer from lower recall. Recently, machine learning-based methods have been proposed to identify biomedical concepts, which can recognize more unseen…

    Submitted 25 January, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted by Bioinformatics

  31. arXiv:2005.04988  [pdf, ps, other]

    physics.ao-ph cs.LG stat.ML

    A review of radar-based nowcasting of precipitation and applicable machine learning techniques

    Authors: Rachel Prudden, Samantha Adams, Dmitry Kangin, Niall Robinson, Suman Ravuri, Shakir Mohamed, Alberto Arribas

    Abstract: A 'nowcast' is a type of weather forecast which makes predictions in the very short term, typically less than two hours - a period in which traditional numerical weather prediction can be limited. This type of weather prediction has important applications for commercial aviation; public and outdoor events; and the construction industry, power utilities, and ground transportation services that cond…

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: 17 pages This work has been submitted to Monthly Weather Review. Copyright in this work may be transferred without further notice

  32. arXiv:2004.02965  [pdf, other]

    eess.SP cs.LG stat.ML

    TSception: A Deep Learning Framework for Emotion Detection Using EEG

    Authors: Yi Ding, Neethu Robinson, Qiuhao Zeng, Duo Chen, Aung Aung Phyo Wai, Tih-Shih Lee, Cuntai Guan

    Abstract: In this paper, we propose a deep learning framework, TSception, for emotion detection from electroencephalogram (EEG). TSception consists of temporal and spatial convolutional layers, which learn discriminative representations in the time and channel domains simultaneously. The temporal learner consists of multi-scale 1D convolutional kernels whose lengths are related to the sampling rate of the E…

    Submitted 7 April, 2020; v1 submitted 1 April, 2020; originally announced April 2020.

    Comments: Authors information updated only. Accepted to be published in: 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, July 19--24, 2020, part of 2020 IEEE World Congress on Computational Intelligence (IEEE WCCI 2020)

  33. arXiv:2003.00971  [pdf]

    cs.CR cs.DB cs.NI

    Graphing Website Relationships for Risk Prediction: Identifying Derived Threats to Users Based on Known Indicators

    Authors: Philip H. Kulp, Nikki E. Robinson

    Abstract: The hypothesis for the study was that the relationship based on referrer links and the number of hops to a malicious site could indicate the risk to another website. We chose Receiver Operating Characteristics (ROC) analysis as the method of comparing true positive and false positive rates for captured web traffic to test the predictive capabilities of our model. Known threat indicators were used…

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 10 pages, 3 figures, 3 tables

    ACM Class: C.2.4; H.2.4; E.2; C.2.1

  34. arXiv:2001.04930  [pdf]

    cs.CR cs.CY

    Shades of Perception- User Factors in Identifying Password Strength

    Authors: Jason M. Pittman, Nikki Robinson

    Abstract: The purpose of this study was to measure whether participant education, profession, and technical skill level exhibited a relationship with identification of password strength. Participants reviewed 50 passwords and labeled each as weak or strong. A Chi-square test of independence was used to measure relationships between education, profession, and technical skill level relative to the frequency of we…

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 18 pages, 6 tables, 6 figures

  35. arXiv:1908.03356  [pdf, other]

    cs.DC cs.GL

    Seven Principles for Effective Scientific Big-Data Systems

    Authors: Niall H. Robinson, Joe Hamman, Ryan Abernathey

    Abstract: We should be in a golden age of scientific discovery, given that we have more data and more compute power available than ever before, plus a new generation of algorithms that can learn effectively from data. But paradoxically, in many data-driven fields, the eureka moments are becoming increasingly rare. Scientists are struggling to keep pace with the explosion in the volume and complexity of scie…

    Submitted 25 June, 2020; v1 submitted 9 August, 2019; originally announced August 2019.

  36. arXiv:1604.03688  [pdf, other]

    cs.MM

    A Practical Approach to Spatiotemporal Data Compression

    Authors: Niall H. Robinson, Rachel Prudden, Alberto Arribas

    Abstract: Datasets representing the world around us are becoming ever more unwieldy as data volumes grow. This is largely due to increased measurement and modelling resolution, but the problem is often exacerbated when data are stored at spuriously high precisions. In an effort to facilitate analysis of these datasets, computationally intensive calculations are increasingly being performed on specialised re…

    Submitted 27 April, 2016; v1 submitted 13 April, 2016; originally announced April 2016.