Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Miwa, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03790  [pdf, other

    cs.CL

    End-to-End Trainable Soft Retriever for Low-resource Relation Extraction

    Authors: Kohei Makino, Makoto Miwa, Yutaka Sasaki

    Abstract: This study addresses a crucial challenge in instance-based relation extraction using text generation models: end-to-end training in target relation extraction task is not applicable to retrievers due to the non-differentiable nature of instance selection. We propose a novel End-to-end TRAinable Soft K-nearest neighbor retriever (ETRASK) by the neural prompting method that utilizes a soft, differen… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: preprint

  2. arXiv:2302.05392  [pdf, other

    cs.CL cs.LG

    Span-based Named Entity Recognition by Generating and Compressing Information

    Authors: Nhung T. H. Nguyen, Makoto Miwa, Sophia Ananiadou

    Abstract: The information bottleneck (IB) principle has been proven effective in various NLP applications. The existing work, however, only used either generative or information compression models to improve the performance of the target task. In this paper, we propose to combine the two types of IB models into one system to enhance Named Entity Recognition (NER). For one type of IB model, we incorporate tw… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: The paper has 13 pages but the main content is in 9 pages. There are two figures and 9 tables. The paper is accepted as a long paper at EACL 2023

  3. arXiv:2204.00511  [pdf, other

    cs.CL cs.LG

    Learning Disentangled Representations of Negation and Uncertainty

    Authors: Jake Vasilakes, Chrysoula Zerva, Makoto Miwa, Sophia Ananiadou

    Abstract: Negation and uncertainty modeling are long-standing tasks in natural language processing. Linguistic theory postulates that expressions of negation and uncertainty are semantically independent from each other and the content they modify. However, previous works on representation learning do not explicitly model this independence. We therefore attempt to disentangle the representations of negation,… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: Accepted to ACL 2022. 18 pages, 7 figures. Code and data are available at https://github.com/jvasilakes/disentanglement-vae

  4. arXiv:2110.04077  [pdf, ps, other

    cs.CV cs.AI eess.IV

    Physical Context and Timing Aware Sequence Generating GANs

    Authors: Hayato Futase, Tomoki Tsujimura, Tetsuya Kajimoto, Hajime Kawarazaki, Toshiyuki Suzuki, Makoto Miwa, Yutaka Sasaki

    Abstract: Generative Adversarial Networks (GANs) have shown remarkable successes in generating realistic images and interpolating changes between images. Existing models, however, do not take into account physical contexts behind images in generating the images, which may cause unrealistic changes. Furthermore, it is difficult to generate the changes at a specific timing and they often do not match with act… ▽ More

    Submitted 28 September, 2021; originally announced October 2021.

  5. arXiv:2106.14157  [pdf, other

    cs.CL

    Analyzing Research Trends in Inorganic Materials Literature Using NLP

    Authors: Fusataka Kuniyoshi, Jun Ozawa, Makoto Miwa

    Abstract: In the field of inorganic materials science, there is a growing demand to extract knowledge such as physical properties and synthesis processes of materials by machine-reading a large number of papers. This is because materials researchers refer to many papers in order to come up with promising terms of experiments for material synthesis. However, there are only a few systems that can extract mate… ▽ More

    Submitted 27 June, 2021; originally announced June 2021.

    Comments: Accepted to ECML-PKDD2021. Preprint

  6. arXiv:2106.09900  [pdf, other

    cs.CL

    A Neural Edge-Editing Approach for Document-Level Relation Graph Extraction

    Authors: Kohei Makino, Makoto Miwa, Yutaka Sasaki

    Abstract: In this paper, we propose a novel edge-editing approach to extract relation information from a document. We treat the relations in a document as a relation graph among entities in this approach. The relation graph is iteratively constructed by editing edges of an initial graph, which might be a graph extracted by another system or an empty graph. The way to edit edges is to classify them in a clos… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at the Findings of the Association for Computational Linguistics (Findings-ACL2021), 2021. 10 pages, 6 figures, 8 tables

  7. arXiv:2104.08225  [pdf, other

    cs.CL

    Distantly Supervised Relation Extraction with Sentence Reconstruction and Knowledge Base Priors

    Authors: Fenia Christopoulou, Makoto Miwa, Sophia Ananiadou

    Abstract: We propose a multi-task, probabilistic approach to facilitate distantly supervised relation extraction by bringing closer the representations of sentences that contain the same Knowledge Base pairs. To achieve this, we bias the latent space of sentences via a Variational Autoencoder (VAE) that is trained jointly with a relation classifier. The latent code guides the pair representations and influe… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: 16 pages, 9 figures, Accepted as a long paper at NAACL 2021

  8. arXiv:2002.07339  [pdf, other

    cs.CL

    Annotating and Extracting Synthesis Process of All-Solid-State Batteries from Scientific Literature

    Authors: Fusataka Kuniyoshi, Kohei Makino, Jun Ozawa, Makoto Miwa

    Abstract: The synthesis process is essential for achieving computational experiment design in the field of inorganic materials chemistry. In this work, we present a novel corpus of the synthesis process for all-solid-state batteries and an automated machine reading system for extracting the synthesis processes buried in the scientific literature. We define the representation of the synthesis processes using… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

    Comments: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France

  9. arXiv:1910.10281  [pdf, other

    cs.CL

    A Search-based Neural Model for Biomedical Nested and Overlapping Event Detection

    Authors: Kurt Espinosa, Makoto Miwa, Sophia Ananiadou

    Abstract: We tackle the nested and overlapping event detection task and propose a novel search-based neural network (SBNN) structured prediction model that treats the task as a search problem on a relation graph of trigger-argument structures. Unlike existing structured prediction tasks such as dependency parsing, the task targets to detect DAG structures, which constitute events, from the relation graph. W… ▽ More

    Submitted 24 October, 2019; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: Accepted at EMNLP-IJCNLP 2019

  10. arXiv:1909.00228  [pdf, other

    cs.CL

    Connecting the Dots: Document-level Neural Relation Extraction with Edge-oriented Graphs

    Authors: Fenia Christopoulou, Makoto Miwa, Sophia Ananiadou

    Abstract: Document-level relation extraction is a complex human process that requires logical inference to extract relationships between named entities in text. Existing approaches use graph-based neural models with words as nodes and edges as relations between them, to encode relations across sentences. These models are node-based, i.e., they form pair representations based solely on the two target node re… ▽ More

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: 12 pages, 5 figures, 6 tables. Accepted in EMNLP-IJCNLP 2019

  11. arXiv:1906.04684  [pdf, other

    cs.CL cs.IR

    Inter-sentence Relation Extraction with Document-level Graph Convolutional Neural Network

    Authors: Sunil Kumar Sahu, Fenia Christopoulou, Makoto Miwa, Sophia Ananiadou

    Abstract: Inter-sentence relation extraction deals with a number of complex semantic relationships in documents, which require local, non-local, syntactic and semantic dependencies. Existing methods do not fully exploit such dependencies. We present a novel inter-sentence relation extraction model that builds a labelled edge graph convolutional neural network model on a document-level graph. The graph is co… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted in Association for Computational Linguistics (ACL) 2019 8 pages, 3 figures, 3 tables

  12. arXiv:1903.12650  [pdf, ps, other

    cs.LG stat.ML

    Yet Another Accelerated SGD: ResNet-50 Training on ImageNet in 74.7 seconds

    Authors: Masafumi Yamazaki, Akihiko Kasagi, Akihiro Tabuchi, Takumi Honda, Masahiro Miwa, Naoto Fukumoto, Tsuguchika Tabaru, Atsushi Ike, Kohta Nakashima

    Abstract: There has been a strong demand for algorithms that can execute machine learning as faster as possible and the speed of deep learning has accelerated by 30 times only in the past two years. Distributed deep learning using the large mini-batch is a key technology to address the demand and is a great challenge as it is difficult to achieve high scalability on large clusters without compromising accur… ▽ More

    Submitted 29 March, 2019; originally announced March 2019.

  13. arXiv:1902.07023  [pdf, other

    cs.CL

    A Walk-based Model on Entity Graphs for Relation Extraction

    Authors: Fenia Christopoulou, Makoto Miwa, Sophia Ananiadou

    Abstract: We present a novel graph-based neural network model for relation extraction. Our model treats multiple pairs in a sentence simultaneously and considers interactions among them. All the entities in a sentence are placed as nodes in a fully-connected graph structure. The edges are represented with position-aware contexts around the entity pairs. In order to consider different relation paths between… ▽ More

    Submitted 13 March, 2020; v1 submitted 19 February, 2019; originally announced February 2019.

    Comments: 8 pages, 2 figures, 2 tables

    Journal ref: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2018, pages 81-88

  14. arXiv:1805.05593  [pdf, other

    cs.CL

    Enhancing Drug-Drug Interaction Extraction from Texts by Molecular Structure Information

    Authors: Masaki Asada, Makoto Miwa, Yutaka Sasaki

    Abstract: We propose a novel neural method to extract drug-drug interactions (DDIs) from texts using external drug molecular structure information. We encode textual drug pairs with convolutional neural networks and their molecular pairs with graph convolutional networks (GCNs), and then we concatenate the outputs of these two networks. In the experiments, we show that GCNs can predict DDIs from the molecul… ▽ More

    Submitted 15 May, 2018; originally announced May 2018.

    Comments: accepted as a short paper at ACL2018

  15. arXiv:1706.05122  [pdf, ps, other

    cs.CL cs.AI cs.IR

    Bib2vec: An Embedding-based Search System for Bibliographic Information

    Authors: Takuma Yoneda, Koki Mori, Makoto Miwa, Yutaka Sasaki

    Abstract: We propose a novel embedding model that represents relationships among several elements in bibliographic information with high representation ability and flexibility. Based on this model, we present a novel search system that shows the relationships among the elements in the ACL Anthology Reference Corpus. The evaluation results show that our model can achieve a high prediction ability and produce… ▽ More

    Submitted 5 April, 2018; v1 submitted 15 June, 2017; originally announced June 2017.

    Comments: EACL2017 extended version. The demonstration is available at http://tti-coin.jp/demo/bib2vec/

    Journal ref: Proceedings of the EACL 2017 Software Demonstrations, Valencia, Spain, April 3-7 2017, pages 112-115

  16. arXiv:1601.00770  [pdf, other

    cs.CL cs.LG

    End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures

    Authors: Makoto Miwa, Mohit Bansal

    Abstract: We present a novel end-to-end neural model to extract entities and relations between them. Our recurrent neural network based model captures both word sequence and dependency tree substructure information by stacking bidirectional tree-structured LSTM-RNNs on bidirectional sequential LSTM-RNNs. This allows our model to jointly represent both entities and relations with shared parameters in a singl… ▽ More

    Submitted 7 June, 2016; v1 submitted 5 January, 2016; originally announced January 2016.

    Comments: Accepted for publication at the Association for Computational Linguistics (ACL), 2016. 13 pages, 1 figure, 6 tables

  17. arXiv:1503.00095  [pdf, ps, other

    cs.CL

    Task-Oriented Learning of Word Embeddings for Semantic Relation Classification

    Authors: Kazuma Hashimoto, Pontus Stenetorp, Makoto Miwa, Yoshimasa Tsuruoka

    Abstract: We present a novel learning method for word embeddings designed for relation classification. Our word embeddings are trained by predicting words between noun pairs using lexical relation-specific features on a large unlabeled corpus. This allows us to explicitly incorporate relation-specific information into the word embeddings. The learned word embeddings are then used to construct feature vector… ▽ More

    Submitted 22 June, 2015; v1 submitted 28 February, 2015; originally announced March 2015.

    Comments: The Nineteenth Conference on Computational Natural Language Learning (CoNLL 2015)

  18. arXiv:0807.2701  [pdf, other

    cs.IT

    A Cutting Plane Method based on Redundant Rows for Improving Fractional Distance

    Authors: Makoto Miwa, Tadashi Wadayama, Ichi Takumi

    Abstract: In this paper, an idea of the cutting plane method is employed to improve the fractional distance of a given binary parity check matrix. The fractional distance is the minimum weight (with respect to l1-distance) of vertices of the fundamental polytope. The cutting polytope is defined based on redundant rows of the parity check matrix and it plays a key role to eliminate unnecessary fractional v… ▽ More

    Submitted 17 July, 2008; originally announced July 2008.

    Comments: 8 pages, To be presented at Turbo Coding 2008