Zum Hauptinhalt springen

Showing 1–41 of 41 results for author: Clark, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.19729  [pdf

    cs.HC cs.AI

    A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications

    Authors: Steph Buongiorno, Corey Clark

    Abstract: External knowledge graphs (KGs) can be used to augment large language models (LLMs), while simultaneously providing an explainable knowledge base of facts that can be inspected by a human. This approach may be particularly valuable in domains where explainability is critical, like human trafficking data analysis. However, creating KGs can pose challenges. KGs parsed from documents may comprise exp… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  2. arXiv:2404.19721  [pdf

    cs.AI cs.CL

    PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games

    Authors: Steph Buongiorno, Lawrence Jake Klinkert, Tanishq Chawla, Zixin Zhuang, Corey Clark

    Abstract: This research introduces Procedural Artificial Narrative using Generative AI (PANGeA), a structured approach for leveraging large language models (LLMs), guided by a game designer's high-level criteria, to generate narrative content for turn-based role-playing video games (RPGs). Distinct from prior applications of LLMs used for video game design, PANGeA innovates by not only generating game level… ▽ More

    Submitted 9 July, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

  3. arXiv:2402.14879  [pdf, other

    cs.CL cs.AI

    Driving Generative Agents With Their Personality

    Authors: Lawrence J. Klinkert, Stephanie Buongiorno, Corey Clark

    Abstract: This research explores the potential of Large Language Models (LLMs) to utilize psychometric values, specifically personality information, within the context of video game character development. Affective Computing (AC) systems quantify a Non-Player character's (NPC) psyche, and an LLM can take advantage of the system's information by using the values for prompt generation. The research shows an L… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 9 Pages, 4 figures, Draft

  4. arXiv:2312.17172  [pdf, other

    cs.CV cs.AI cs.CL

    Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

    Authors: Jiasen Lu, Christopher Clark, Sangho Lee, Zichen Zhang, Savya Khosla, Ryan Marten, Derek Hoiem, Aniruddha Kembhavi

    Abstract: We present Unified-IO 2, the first autoregressive multimodal model that is capable of understanding and generating image, text, audio, and action. To unify different modalities, we tokenize inputs and outputs -- images, text, audio, action, bounding boxes, etc., into a shared semantic space and then process them with a single encoder-decoder transformer model. Since training with such diverse moda… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 38 pages, 20 figures

  5. arXiv:2312.09067  [pdf, other

    cs.CV cs.AI cs.CL cs.RO

    Holodeck: Language Guided Generation of 3D Embodied AI Environments

    Authors: Yue Yang, Fan-Yun Sun, Luca Weihs, Eli VanderBilt, Alvaro Herrasti, Winson Han, Jiajun Wu, Nick Haber, Ranjay Krishna, Lingjie Liu, Chris Callison-Burch, Mark Yatskar, Aniruddha Kembhavi, Christopher Clark

    Abstract: 3D simulated environments play a critical role in Embodied AI, but their creation requires expertise and extensive manual effort, restricting their diversity and scope. To mitigate this limitation, we present Holodeck, a system that generates 3D environments to match a user-supplied prompt fully automatedly. Holodeck can generate diverse scenes, e.g., arcades, spas, and museums, adjust the designs… ▽ More

    Submitted 22 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Published in CVPR 2024, 21 pages, 27 figures, 2 tables

  6. arXiv:2311.03026  [pdf, other

    cs.CL

    Detecting Agreement in Multi-party Conversational AI

    Authors: Laura Schauer, Jason Sweeney, Charlie Lyttle, Zein Said, Aron Szeles, Cale Clark, Katie McAskill, Xander Wickham, Tom Byars, Daniel Hernández Garcia, Nancie Gunson, Angus Addlesee, Oliver Lemon

    Abstract: Today, conversational systems are expected to handle conversations in multi-party settings, especially within Socially Assistive Robots (SARs). However, practical usability remains difficult as there are additional challenges to overcome, such as speaker recognition, addressee recognition, and complex turn-taking. In this paper, we present our work on a multi-party conversational system, which inv… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Proceedings of the workshop on advancing GROup UNderstanding and robots aDaptive behaviour (GROUND), 2023

  7. arXiv:2310.16941  [pdf, other

    cs.RO cs.LG cs.MA

    Exploring Behavior Discovery Methods for Heterogeneous Swarms of Limited-Capability Robots

    Authors: Connor Mattson, Jeremy C. Clark, Daniel S. Brown

    Abstract: We study the problem of determining the emergent behaviors that are possible given a functionally heterogeneous swarm of robots with limited capabilities. Prior work has considered behavior search for homogeneous swarms and proposed the use of novelty search over either a hand-specified or learned behavior space followed by clustering to return a taxonomy of emergent behaviors to the user. In this… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 11 pages, 9 figures, To be published in Proceedings IEEE International Symposium on Multi-Robot & Multi-Agent Systems (MRS 2023)

  8. arXiv:2310.13140  [pdf, other

    cs.CR

    Blind Evaluation Framework for Fully Homomorphic Encryption and Privacy-Preserving Machine Learning

    Authors: Hunjae "Timothy" Lee, Corey Clark

    Abstract: In the domain of Privacy-Preserving Machine Learning (PPML), Fully Homomorphic Encryption (FHE) is often used for encrypted computation to allow secure and privacy-preserving outsourcing of machine learning modeling. While FHE enables encrypted arithmetic operations, execution of programmatic logic such as control structures or conditional programming have remained a challenge. As a result, progre… ▽ More

    Submitted 27 August, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: changes made from previous version include change in the name of the title and restructuring/re-organization of the original paper

  9. arXiv:2306.04791  [pdf, other

    cs.LG cs.AI

    XInsight: Revealing Model Insights for GNNs with Flow-based Explanations

    Authors: Eli Laird, Ayesh Madushanka, Elfi Kraka, Corey Clark

    Abstract: Progress in graph neural networks has grown rapidly in recent years, with many new developments in drug discovery, medical diagnosis, and recommender systems. While this progress is significant, many networks are `black boxes' with little understanding of the `what' exactly the network is learning. Many high-stakes applications, such as drug discovery, require human-intelligible explanations from… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: eXplainable Artificial Intelligence. 1st World Conference on eXplainable Artificial Intelligence, xAI-2023, Lisbon, Portugal

  10. arXiv:2303.16133  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models

    Authors: Adyasha Maharana, Amita Kamath, Christopher Clark, Mohit Bansal, Aniruddha Kembhavi

    Abstract: As general purpose vision models get increasingly effective at a wide set of tasks, it is imperative that they be consistent across the tasks they support. Inconsistent AI models are considered brittle and untrustworthy by human users and are more challenging to incorporate into larger systems that take dependencies on their outputs. Measuring consistency between very heterogeneous tasks that migh… ▽ More

    Submitted 21 February, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: TMLR 2024; Project Website: https://adymaharana.github.io/cococon/

  11. arXiv:2211.09887  [pdf, other

    eess.IV cs.LG physics.med-ph

    Spherical convolutional neural networks can improve brain microstructure estimation from diffusion MRI data

    Authors: Leevi Kerkelä, Kiran Seunarine, Filip Szczepankiewicz, Chris A. Clark

    Abstract: Diffusion magnetic resonance imaging is sensitive to the microstructural properties of brain tissue. However, estimating clinically and scientifically relevant microstructural properties from the measured signals remains a highly challenging inverse problem that machine learning may help solve. This study investigated if recently developed rotationally invariant spherical convolutional neural netw… ▽ More

    Submitted 26 February, 2024; v1 submitted 17 November, 2022; originally announced November 2022.

  12. arXiv:2211.09778  [pdf, other

    cs.CV cs.CL

    I Can't Believe There's No Images! Learning Visual Tasks Using only Language Supervision

    Authors: Sophia Gu, Christopher Clark, Aniruddha Kembhavi

    Abstract: Many high-level skills that are required for computer vision tasks, such as parsing questions, comparing and contrasting semantics, and writing descriptions, are also required in other domains such as natural language processing. In this paper, we ask whether it is possible to learn those skills from text data and then transfer them to vision tasks without ever training on visual training data. Ke… ▽ More

    Submitted 18 August, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: website (https://prior.allenai.org/projects/close), code (https://github.com/allenai/close)

  13. arXiv:2206.08916  [pdf, other

    cs.CV

    Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks

    Authors: Jiasen Lu, Christopher Clark, Rowan Zellers, Roozbeh Mottaghi, Aniruddha Kembhavi

    Abstract: We propose Unified-IO, a model that performs a large variety of AI tasks spanning classical computer vision tasks, including pose estimation, object detection, depth estimation and image generation, vision-and-language tasks such as region captioning and referring expression, to natural language processing tasks such as question answering and paraphrasing. Developing a single unified model for suc… ▽ More

    Submitted 4 October, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  14. arXiv:2206.01718  [pdf, other

    cs.CV cs.CL

    A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge

    Authors: Dustin Schwenk, Apoorv Khandelwal, Christopher Clark, Kenneth Marino, Roozbeh Mottaghi

    Abstract: The Visual Question Answering (VQA) task aspires to provide a meaningful testbed for the development of AI models that can jointly reason over visual and natural language inputs. Despite a proliferation of VQA datasets, this goal is hindered by a set of common limitations. These include a reliance on relatively simplistic questions that are repetitive in both concepts and linguistic structure, lit… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  15. arXiv:2203.11447  [pdf, other

    cs.CV eess.IV

    Manipulating UAV Imagery for Satellite Model Training, Calibration and Testing

    Authors: Jasper Brown, Cameron Clark, Sabrina Lomax, Khalid Rafique, Salah Sukkarieh

    Abstract: Modern livestock farming is increasingly data driven and frequently relies on efficient remote sensing to gather data over wide areas. High resolution satellite imagery is one such data source, which is becoming more accessible for farmers as coverage increases and cost falls. Such images can be used to detect and track animals, monitor pasture changes, and understand land use. Many of the data dr… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 16 pages, 7 figures, 2 tables

  16. arXiv:2202.02317  [pdf, other

    cs.CV cs.CL

    Webly Supervised Concept Expansion for General Purpose Vision Models

    Authors: Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi

    Abstract: General Purpose Vision (GPV) systems are models that are designed to solve a wide array of visual tasks without requiring architectural changes. Today, GPVs primarily learn both skills and concepts from large fully supervised datasets. Scaling GPVs to tens of thousands of concepts by acquiring data to learn each concept for every skill quickly becomes prohibitive. This work presents an effective a… ▽ More

    Submitted 20 July, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: ECCV 2022

  17. arXiv:2112.00800  [pdf, other

    cs.CL cs.AI

    Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text

    Authors: Christopher Clark, Jordi Salvador, Dustin Schwenk, Derrick Bonafilia, Mark Yatskar, Eric Kolve, Alvaro Herrasti, Jonghyun Choi, Sachin Mehta, Sam Skjonsberg, Carissa Schoenick, Aaron Sarnat, Hannaneh Hajishirzi, Aniruddha Kembhavi, Oren Etzioni, Ali Farhadi

    Abstract: Communicating with humans is challenging for AIs because it requires a shared understanding of the world, complex semantics (e.g., metaphors or analogies), and at times multi-modal gestures (e.g., pointing with a finger, or an arrow in a diagram). We investigate these challenges in the context of Iconary, a collaborative game of drawing and guessing based on Pictionary, that poses a novel challeng… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: In EMNLP 2021

  18. Automated Aerial Animal Detection When Spatial Resolution Conditions Are Varied

    Authors: Jasper Brown, Yongliang Qiao, Cameron Clark, Sabrina Lomax, Khalid Rafique, Salah Sukkarieh

    Abstract: Knowing where livestock are located enables optimized management and mustering. However, Australian farms are large meaning that many of Australia's livestock are unmonitored which impacts farm profit, animal welfare and the environment. Effective animal localisation and counting by analysing satellite imagery overcomes this management hurdle however, high resolution satellite imagery is expensive… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: 20 pages, 9 figures, 4 tables in appendix

    Journal ref: Computers and Electronics in Agriculture, Volume 193, 2022

  19. arXiv:2102.04958  [pdf

    cs.HC cs.AI cs.CL cs.MA cs.MM

    Hallmarks of Human-Machine Collaboration: A framework for assessment in the DARPA Communicating with Computers Program

    Authors: Robyn Kozierok, John Aberdeen, Cheryl Clark, Christopher Garay, Bradley Goodman, Tonia Korves, Lynette Hirschman, Patricia L. McDermott, Matthew W. Peterson

    Abstract: There is a growing desire to create computer systems that can communicate effectively to collaborate with humans on complex, open-ended activities. Assessing these systems presents significant challenges. We describe a framework for evaluating systems engaged in open-ended complex scenarios where evaluators do not have the luxury of comparing performance to a single right answer. This framework ha… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 20 pages, 21 figures

    Report number: MITRE Document Number: MTR210002 ACM Class: I.2.11; I.2.7; H.5.2

  20. arXiv:2011.03856  [pdf, other

    cs.LG cs.CL cs.CV

    Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles

    Authors: Christopher Clark, Mark Yatskar, Luke Zettlemoyer

    Abstract: Many datasets have been shown to contain incidental correlations created by idiosyncrasies in the data collection process. For example, sentence entailment datasets can have spurious word-class correlations if nearly all contradiction sentences contain the word "not", and image recognition datasets can have tell-tale object-background correlations if dogs are always indoors. In this paper, we prop… ▽ More

    Submitted 7 November, 2020; originally announced November 2020.

    Comments: In EMNLP Findings

  21. arXiv:1912.09503  [pdf, other

    cs.RO cs.NE

    Multi-Robot Path Planning Via Genetic Programming

    Authors: Alexandre Trudeau, Christopher M. Clark

    Abstract: This paper presents a Genetic Programming (GP) approach to solving multi-robot path planning (MRPP) problems in single-lane workspaces, specifically those easily mapped to graph representations. GP's versatility enables this approach to produce programs optimizing for multiple attributes rather than a single attribute such as path length or completeness. When optimizing for the number of time step… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: ARMS 2019 Workshop (AAMAS)

  22. arXiv:1911.00417  [pdf, other

    cs.SD cs.LG eess.AS

    Long-distance Detection of Bioacoustic Events with Per-channel Energy Normalization

    Authors: Vincent Lostanlen, Kaitlin Palmer, Elly Knight, Christopher Clark, Holger Klinck, Andrew Farnsworth, Tina Wong, Jason Cramer, Juan Pablo Bello

    Abstract: This paper proposes to perform unsupervised detection of bioacoustic events by pooling the magnitudes of spectrogram frames after per-channel energy normalization (PCEN). Although PCEN was originally developed for speech recognition, it also has beneficial effects in enhancing animal vocalizations, despite the presence of atmospheric absorption and intermittent noise. We prove that PCEN generalize… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: 5 pages, 3 figures. Presented at the 3rd International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE). 25--26 October 2019, New York, NY, USA

  23. arXiv:1909.03683  [pdf, other

    cs.CL cs.CV cs.LG

    Don't Take the Easy Way Out: Ensemble Based Methods for Avoiding Known Dataset Biases

    Authors: Christopher Clark, Mark Yatskar, Luke Zettlemoyer

    Abstract: State-of-the-art models often make use of superficial patterns in the data that do not generalize well to out-of-domain or adversarial settings. For example, textual entailment models often learn that particular key words imply entailment, irrespective of context, and visual question answering models learn to predict prototypical answers, without considering evidence in the image. In this paper, w… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: In EMNLP 2019

  24. arXiv:1908.09183  [pdf, other

    cs.LG stat.ML

    Deriving a Quantitative Relationship Between Resolution and Human Classification Error

    Authors: Josiah I. Clark, Caroline A. Clark

    Abstract: For machine learning perception problems, human-level classification performance is used as an estimate of top algorithm performance. Thus, it is important to understand as precisely as possible the factors that impact human-level performance. Knowing this 1) provides a benchmark for model performance, 2) tells a project manager what type of data to obtain for human labelers in order to get accura… ▽ More

    Submitted 24 August, 2019; originally announced August 2019.

  25. arXiv:1905.10044  [pdf, ps, other

    cs.CL

    BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions

    Authors: Christopher Clark, Kenton Lee, Ming-Wei Chang, Tom Kwiatkowski, Michael Collins, Kristina Toutanova

    Abstract: In this paper we study yes/no questions that are naturally occurring --- meaning that they are generated in unprompted and unconstrained settings. We build a reading comprehension dataset, BoolQ, of such questions, and show that they are unexpectedly challenging. They often query for complex, non-factoid information, and require difficult entailment-like inference to solve. We also explore the eff… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: In NAACL 2019

  26. Computer vision-based framework for extracting geological lineaments from optical remote sensing data

    Authors: Ehsan Farahbakhsh, Rohitash Chandra, Hugo K. H. Olierook, Richard Scalzo, Chris Clark, Steven M. Reddy, R. Dietmar Muller

    Abstract: The extraction of geological lineaments from digital satellite data is a fundamental application in remote sensing. The location of geological lineaments such as faults and dykes are of interest for a range of applications, particularly because of their association with hydrothermal mineralization. Although a wide range of applications have utilized computer vision techniques, a standard workflow… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: 17 pages, 10 figures, 2 tables

  27. arXiv:1802.08872  [pdf

    cs.LG cs.CV

    Deep learning for conifer/deciduous classification of airborne LiDAR 3D point clouds representing individual trees

    Authors: Hamid Hamraz, Nathan B. Jacobs, Marco A. Contreras, Chase H. Clark

    Abstract: The purpose of this study was to investigate the use of deep learning for coniferous/deciduous classification of individual trees from airborne LiDAR data. To enable efficient processing by a deep convolutional neural network (CNN), we designed two discrete representations using leaf-off and leaf-on LiDAR data: a digital surface model with four channels (DSMx4) and a set of four 2D views (4x2D). A… ▽ More

    Submitted 24 February, 2018; originally announced February 2018.

    Comments: Under review as of the date of submission

  28. arXiv:1802.05365  [pdf, other

    cs.CL

    Deep contextualized word representations

    Authors: Matthew E. Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, Luke Zettlemoyer

    Abstract: We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses vary across linguistic contexts (i.e., to model polysemy). Our word vectors are learned functions of the internal states of a deep bidirectional language model (biLM), which is pre-trained on a large text corpus. We show th… ▽ More

    Submitted 22 March, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: NAACL 2018. Originally posted to openreview 27 Oct 2017. v2 updated for NAACL camera ready

  29. arXiv:1710.10723  [pdf, other

    cs.CL

    Simple and Effective Multi-Paragraph Reading Comprehension

    Authors: Christopher Clark, Matt Gardner

    Abstract: We consider the problem of adapting neural paragraph-level question answering models to the case where entire documents are given as input. Our proposed solution trains models to produce well calibrated confidence scores for their results on individual paragraphs. We sample multiple paragraphs from the documents during training, and use a shared-normalization training objective that encourages the… ▽ More

    Submitted 7 November, 2017; v1 submitted 29 October, 2017; originally announced October 2017.

    Comments: 11 pages, updated a reference

  30. arXiv:1704.04760  [pdf

    cs.AR cs.LG cs.NE

    In-Datacenter Performance Analysis of a Tensor Processing Unit

    Authors: Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, Rick Boyle, Pierre-luc Cantin, Clifford Chao, Chris Clark, Jeremy Coriell, Mike Daley, Matt Dau, Jeffrey Dean, Ben Gelb, Tara Vazir Ghaemmaghami, Rajendra Gottipati, William Gulland, Robert Hagmann, C. Richard Ho, Doug Hogberg , et al. (50 additional authors not shown)

    Abstract: Many architects believe that major improvements in cost-energy-performance must now come from domain-specific hardware. This paper evaluates a custom ASIC---called a Tensor Processing Unit (TPU)---deployed in datacenters since 2015 that accelerates the inference phase of neural networks (NN). The heart of the TPU is a 65,536 8-bit MAC matrix multiply unit that offers a peak throughput of 92 TeraOp… ▽ More

    Submitted 16 April, 2017; originally announced April 2017.

    Comments: 17 pages, 11 figures, 8 tables. To appear at the 44th International Symposium on Computer Architecture (ISCA), Toronto, Canada, June 24-28, 2017

  31. arXiv:1607.08482  [pdf

    cs.SD

    Early and Late Time Acoustic Measures for Underwater Seismic Airgun Signals In Long-Term Acoustic Data Sets

    Authors: Peter Dugan, Melania Guerra, Dimitri Ponirakis, Holger Klinck, Christopher W. Clark

    Abstract: This work presents a new toolkit for describing the acoustic properties of the ocean environment before, during and after a sound event caused by an underwater seismic air-gun. The toolkit uses existing sound measures, but uniquely applies these to capture the early time period (actual pulse) and late time period (reverberation and multiple arrivals). In total, 183 features are produced for each a… ▽ More

    Submitted 5 May, 2016; originally announced July 2016.

    Comments: Camera copy version of the paper for publication in IEEE explore. Paper was withdrawn by the co-authors for submission to JASA Express Letters

  32. arXiv:1605.00983  [pdf

    cs.DC

    Phase 3: DCL System Using Deep Learning Approaches for Land-based or Ship-based Real-Time Recognition and Localization of Marine Mammals - Bioacoustic Applicaitons

    Authors: Peter J. Dugan, Christopher W. Clark, Yann André LeCun, Sofie M. Van Parijs

    Abstract: Goals of this research phase is to investigate advanced detection and classification pardims useful for data-mining passive large passive acoustic archives. Technical objectives are to develop and refine a High Performance Computing, Acoustic Data Accelerator (HPC-ADA) along with MATLAB based software based on time series acoustic signal Detection cLassification using Machine learning Algorithms,… ▽ More

    Submitted 5 May, 2016; v1 submitted 3 May, 2016; originally announced May 2016.

    Comments: National Oceanic Partnership Program (NOPP) sponsored by ONR and NFWF

    Report number: N000141210585

  33. arXiv:1605.00982  [pdf

    cs.DC

    Phase 4: DCL System Using Deep Learning Approaches for Land-Based or Ship-Based Real-Time Recognition and Localization of Marine Mammals - Distributed Processing and Big Data Applications

    Authors: Peter J. Dugan, Christopher W. Clark, Yann André LeCun, Sofie M. Van Parijs

    Abstract: While the animal bioacoustics community at large is collecting huge amounts of acoustic data at an unprecedented pace, processing these data is problematic. Currently in bioacoustics, there is no effective way to achieve high performance computing using commericial off the shelf (COTS) or government off the shelf (GOTS) tools. Although several advances have been made in the open source and commerc… ▽ More

    Submitted 5 May, 2016; v1 submitted 3 May, 2016; originally announced May 2016.

    Comments: National Oceanic Partnership Program (NOPP) sponsored by ONR and NFWF

    Report number: N000141210585

  34. arXiv:1605.00972  [pdf

    cs.CV

    Phase 2: DCL System Using Deep Learning Approaches for Land-based or Ship-based Real-Time Recognition and Localization of Marine Mammals - Machine Learning Detection Algorithms

    Authors: Peter J. Dugan, Christopher W. Clark, Yann André LeCun, Sofie M. Van Parijs

    Abstract: Overarching goals for this work aim to advance the state of the art for detection, classification and localization (DCL) in the field of bioacoustics. This goal is primarily achieved by building a generic framework for detection-classification (DC) using a fast, efficient and scalable architecture, demonstrating the capabilities of this system using on a variety of low-frequency mid-frequency ceta… ▽ More

    Submitted 5 May, 2016; v1 submitted 3 May, 2016; originally announced May 2016.

    Comments: National Oceanic Partnership Program (NOPP) sponsored by ONR and NFWF: N000141210585

    Report number: N000141210585

  35. arXiv:1605.00971  [pdf

    cs.DC

    Phase 1: DCL System Research Using Advanced Approaches for Land-based or Ship-based Real-Time Recognition and Localization of Marine Mammals - HPC System Implementation

    Authors: Peter J. Dugan, Christopher W. Clark, Yann André LeCun, Sofie M. Van Parijs

    Abstract: We aim to investigate advancing the state of the art of detection, classification and localization (DCL) in the field of bioacoustics. The two primary goals are to develop transferable technologies for detection and classification in: (1) the area of advanced algorithms, such as deep learning and other methods; and (2) advanced systems, capable of real-time and archival and processing. This projec… ▽ More

    Submitted 5 May, 2016; v1 submitted 3 May, 2016; originally announced May 2016.

    Comments: Year 1 National Oceanic Partnership Program Report, sponsored ONR, NFWF. N000141210585

    Report number: N000141210585

  36. arXiv:1512.06141  [pdf, other

    stat.AP cs.SI physics.soc-ph

    The Role of Race, Ethnicity, and Gender in the Congressional Cosponsorship Network

    Authors: Alison Craig, Skyler J. Cranmer, Bruce A. Desmarais, Christopher J. Clark, Vincent G. Moscardelli

    Abstract: Previous research indicates that race, ethnicity, and gender influence legislative behavior in important ways. The bulk of this research, however, focuses on the way these characteristics shape an individual legislator's behavior, making it less clear how they account for relationships between legislators. We study the cosponsorship process in order to understand the race and gender based dynamics… ▽ More

    Submitted 18 December, 2015; originally announced December 2015.

  37. arXiv:1509.03591  [pdf

    cs.DC

    High Performance Computer Acoustic Data Accelerator: A New System for Exploring Marine Mammal Acoustics for Big Data Applications

    Authors: Peter Dugan, John Zollweg, Marian Popescu, Denise Risch, Herve Glotin, Yann LeCun, and Christopher Clark

    Abstract: This paper presents a new software model designed for distributed sonic signal detection runtime using machine learning algorithms called DeLMA. A new algorithm--Acoustic Data-mining Accelerator (ADA)--is also presented. ADA is a robust yet scalable solution for efficiently processing big sound archives using distributing computing technologies. Together, DeLMA and the ADA algorithm provide a powe… ▽ More

    Submitted 11 September, 2015; originally announced September 2015.

    Comments: Seven pages, submitted at International Conference on Machine Learning 2014, Workshop uLearnBio, unsupervised learning for bioacoustic applications

    MSC Class: 68-04

  38. arXiv:1412.3409  [pdf, other

    cs.AI cs.LG cs.NE

    Teaching Deep Convolutional Neural Networks to Play Go

    Authors: Christopher Clark, Amos Storkey

    Abstract: Mastering the game of Go has remained a long standing challenge to the field of AI. Modern computer Go systems rely on processing millions of possible future positions to play well, but intuitively a stronger and more 'humanlike' way to play the game would be to rely on pattern recognition abilities rather then brute force computation. Following this sentiment, we train deep convolutional neural n… ▽ More

    Submitted 27 January, 2015; v1 submitted 10 December, 2014; originally announced December 2014.

    Comments: 9 pages, 8 figures, 5 tables. Corrected typos, minor adjustment to table format

  39. arXiv:1305.3635  [pdf

    cs.CV

    Bioacoustic Signal Classification Based on Continuous Region Processing, Grid Masking and Artificial Neural Network

    Authors: Mohammad Pourhomayoun, Peter Dugan, Marian Popescu, Christopher Clark

    Abstract: In this paper, we develop a novel method based on machine-learning and image processing to identify North Atlantic right whale (NARW) up-calls in the presence of high levels of ambient and interfering noise. We apply a continuous region algorithm on the spectrogram to extract the regions of interest, and then use grid masking techniques to generate a small feature set that is then used in an artif… ▽ More

    Submitted 17 June, 2013; v1 submitted 15 May, 2013; originally announced May 2013.

    Comments: To be Submitted to "ICML 2013 Workshop on Machine Learning for Bioacoustics", 6 pages, 8 figures

  40. arXiv:1305.3633  [pdf

    cs.CV

    Classification for Big Dataset of Bioacoustic Signals Based on Human Scoring System and Artificial Neural Network

    Authors: Mohammad Pourhomayoun, Peter Dugan, Marian Popescu, Denise Risch, Hal Lewis, Christopher Clark

    Abstract: In this paper, we propose a method to improve sound classification performance by combining signal features, derived from the time-frequency spectrogram, with human perception. The method presented herein exploits an artificial neural network (ANN) and learns the signal features based on the human perception knowledge. The proposed method is applied to a large acoustic dataset containing 24 months… ▽ More

    Submitted 17 June, 2013; v1 submitted 15 May, 2013; originally announced May 2013.

    Comments: To be Submitted to "ICML 2013 Workshop on Machine Learning for Bioacoustics", 6 pages, 4 figures

  41. arXiv:1305.3250  [pdf

    cs.CV

    Bioacoustical Periodic Pulse Train Signal Detection and Classification using Spectrogram Intensity Binarization and Energy Projection

    Authors: Marian Popescu, Peter J. Dugan, Mohammad Pourhomayoun, Denise Risch, Harold W. Lewis III, Christopher W. Clark

    Abstract: The following work outlines an approach for automatic detection and recognition of periodic pulse train signals using a multi-stage process based on spectrogram edge detection, energy projection and classification. The method has been implemented to automatically detect and recognize pulse train songs of minke whales. While the long term goal of this work is to properly identify and detect minke s… ▽ More

    Submitted 28 June, 2013; v1 submitted 14 May, 2013; originally announced May 2013.

    Comments: ICML 2013 Workshop on Machine Learning for Bioacoustics, 2013, 6 pages