Zum Hauptinhalt springen

Showing 1–23 of 23 results for author: Mori, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17185  [pdf, other

    cs.CL

    Vaporetto: Efficient Japanese Tokenization Based on Improved Pointwise Linear Classification

    Authors: Koichi Akabe, Shunsuke Kanda, Yusuke Oda, Shinsuke Mori

    Abstract: This paper proposes an approach to improve the runtime efficiency of Japanese tokenization based on the pointwise linear classification (PLC) framework, which formulates the whole tokenization process as a sequence of linear classification problems. Our approach optimizes tokenization by leveraging the characteristics of the PLC framework and the task definition. Our approach involves (1) composin… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2404.03161  [pdf, other

    cs.CV cs.CL cs.MM

    BioVL-QR: Egocentric Biochemical Video-and-Language Dataset Using Micro QR Codes

    Authors: Taichi Nishimura, Koki Yamamoto, Yuto Haneji, Keiya Kajimura, Chihiro Nishiwaki, Eriko Daikoku, Natsuko Okuda, Fumihito Ono, Hirotaka Kameko, Shinsuke Mori

    Abstract: This paper introduces a biochemical vision-and-language dataset, which consists of 24 egocentric experiment videos, corresponding protocols, and video-and-language alignments. The key challenge in the wet-lab domain is detecting equipment, reagents, and containers is difficult because the lab environment is scattered by filling objects on the table and some objects are indistinguishable. Therefore… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 6 pages

  3. arXiv:2404.02523  [pdf, other

    cs.CV cs.AI

    Text-driven Affordance Learning from Egocentric Vision

    Authors: Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, Shinsuke Mori

    Abstract: Visual affordance learning is a key component for robots to understand how to interact with objects. Conventional approaches in this field rely on pre-defined objects and actions, falling short of capturing diverse interactions in realworld scenarios. The key idea of our approach is employing textual instruction, targeting various affordances for a wide range of objects. This approach covers both… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2403.16909  [pdf, other

    cs.AI cs.CL cs.CY

    Towards Algorithmic Fidelity: Mental Health Representation across Demographics in Synthetic vs. Human-generated Data

    Authors: Shinka Mori, Oana Ignat, Andrew Lee, Rada Mihalcea

    Abstract: Synthetic data generation has the potential to impact applications and domains with scarce data. However, before such data is used for sensitive tasks such as mental health, we need an understanding of how different demographics are represented in it. In our paper, we analyze the potential of producing synthetic data using GPT-3 by exploring the various stressors it attributes to different race an… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 14 pages, 16 figures

  5. arXiv:2403.16483  [pdf, other

    cs.CL

    Automatic Construction of a Large-Scale Corpus for Geoparsing Using Wikipedia Hyperlinks

    Authors: Keyaki Ohno, Hirotaka Kameko, Keisuke Shirai, Taichi Nishimura, Shinsuke Mori

    Abstract: Geoparsing is the task of estimating the latitude and longitude (coordinates) of location expressions in texts. Geoparsing must deal with the ambiguity of the expressions that indicate multiple locations with the same notation. For evaluating geoparsing systems, several corpora have been proposed in previous work. However, these corpora are small-scale and suffer from the coverage of location expr… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  6. arXiv:2312.00532  [pdf, other

    cs.CV

    DeepDR: Deep Structure-Aware RGB-D Inpainting for Diminished Reality

    Authors: Christina Gsaxner, Shohei Mori, Dieter Schmalstieg, Jan Egger, Gerhard Paar, Werner Bailer, Denis Kalkofen

    Abstract: Diminished reality (DR) refers to the removal of real objects from the environment by virtually replacing them with their background. Modern DR frameworks use inpainting to hallucinate unobserved regions. While recent deep learning-based inpainting is promising, the DR use case is complicated by the need to generate coherent structure and 3D geometry (i.e., depth), in particular for advanced appli… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 11 pages, 8 figures + 13 pages, 10 figures supplementary. Accepted at 3DV 2024

  7. arXiv:2311.00967  [pdf, other

    cs.RO cs.AI cs.CL

    Vision-Language Interpreter for Robot Task Planning

    Authors: Keisuke Shirai, Cristian C. Beltran-Hernandez, Masashi Hamaya, Atsushi Hashimoto, Shohei Tanaka, Kento Kawaharazuka, Kazutoshi Tanaka, Yoshitaka Ushiku, Shinsuke Mori

    Abstract: Large language models (LLMs) are accelerating the development of language-guided robot planners. Meanwhile, symbolic planners offer the advantage of interpretability. This paper proposes a new task that bridges these two trends, namely, multimodal planning problem specification. The aim is to generate a problem description (PD), a machine-readable file used by the planners to find a plan. By gener… ▽ More

    Submitted 19 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: ICRA 2024

  8. arXiv:2305.19497  [pdf, other

    cs.CL

    Towards Flow Graph Prediction of Open-Domain Procedural Texts

    Authors: Keisuke Shirai, Hirotaka Kameko, Shinsuke Mori

    Abstract: Machine comprehension of procedural texts is essential for reasoning about the steps and automating the procedures. However, this requires identifying entities within a text and resolving the relationships between the entities. Previous work focused on the cooking domain and proposed a framework to convert a recipe text into a flow graph (FG) representation. In this work, we propose a framework ba… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: RepL4NLP 2023

  9. arXiv:2305.12544  [pdf, other

    cs.CL cs.AI

    Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models

    Authors: Oana Ignat, Zhijing Jin, Artem Abzaliev, Laura Biester, Santiago Castro, Naihao Deng, Xinyi Gao, Aylin Gunal, Jacky He, Ashkan Kazemi, Muhammad Khalifa, Namho Koh, Andrew Lee, Siyang Liu, Do June Min, Shinka Mori, Joan Nwatu, Veronica Perez-Rosas, Siqi Shen, Zekun Wang, Winston Wu, Rada Mihalcea

    Abstract: Recent progress in large language models (LLMs) has enabled the deployment of many generative NLP applications. At the same time, it has also led to a misleading public discourse that ``it's all been solved.'' Not surprisingly, this has, in turn, made many NLP researchers -- especially those at the beginning of their careers -- worry about what NLP research area they should focus on. Has it all be… ▽ More

    Submitted 15 March, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted at COLING 2024

  10. arXiv:2209.10134  [pdf, other

    cs.MM cs.CL cs.CV

    Recipe Generation from Unsegmented Cooking Videos

    Authors: Taichi Nishimura, Atsushi Hashimoto, Yoshitaka Ushiku, Hirotaka Kameko, Shinsuke Mori

    Abstract: This paper tackles recipe generation from unsegmented cooking videos, a task that requires agents to (1) extract key events in completing the dish and (2) generate sentences for the extracted events. Our task is similar to dense video captioning (DVC), which aims at detecting events thoroughly and generating sentences for them. However, unlike DVC, in recipe generation, recipe story awareness is c… ▽ More

    Submitted 18 February, 2024; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted at ACM TOMM; ACM Transactions on Multimedia Computing, Communications, and Applications

  11. arXiv:2209.05840  [pdf, other

    cs.CL cs.AI

    Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows

    Authors: Keisuke Shirai, Atsushi Hashimoto, Taichi Nishimura, Hirotaka Kameko, Shuhei Kurita, Yoshitaka Ushiku, Shinsuke Mori

    Abstract: We present a new multimodal dataset called Visual Recipe Flow, which enables us to learn each cooking action result in a recipe text. The dataset consists of object state changes and the workflow of the recipe text. The state change is represented as an image pair, while the workflow is represented as a recipe flow graph (r-FG). The image pairs are grounded in the r-FG, which provides the cross-mo… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: COLING 2022

  12. arXiv:2012.14124  [pdf, other

    cs.CL cs.AI

    Neural Text Generation with Artificial Negative Examples

    Authors: Keisuke Shirai, Kazuma Hashimoto, Akiko Eriguchi, Takashi Ninomiya, Shinsuke Mori

    Abstract: Neural text generation models conditioning on given input (e.g. machine translation and image captioning) are usually trained by maximum likelihood estimation of target text. However, the trained models suffer from various types of errors at inference time. In this paper, we propose to suppress an arbitrary type of errors by training the text generation model in a reinforcement learning framework,… ▽ More

    Submitted 28 December, 2020; originally announced December 2020.

  13. arXiv:1905.13438  [pdf, ps, other

    cs.CL cs.AI

    Content Word-based Sentence Decoding and Evaluating for Open-domain Neural Response Generation

    Authors: Tianyu Zhao, Shinsuke Mori, Tatsuya Kawahara

    Abstract: Various encoder-decoder models have been applied to response generation in open-domain dialogs, but a majority of conventional models directly learn a mapping from lexical input to lexical output without explicitly modeling intermediate representations. Utilizing language hierarchy and modeling intermediate information have been shown to benefit many language understanding and generation tasks. Mo… ▽ More

    Submitted 26 June, 2019; v1 submitted 31 May, 2019; originally announced May 2019.

    Comments: 13 pages, 2 figures, 8 tables (rejected by ACL 2019)

  14. arXiv:1707.07466  [pdf, ps, other

    physics.soc-ph cs.SI physics.data-an

    The Pitman-Yor process and an empirical study of choice behavior

    Authors: Masato Hisakado, Fumiaki Sano, Shintaro Mori

    Abstract: This study discusses choice behavior using a voting model in which voters can obtain information from a finite number of previous $r$ voters. Voters vote for a candidate with a probability proportional to the previous vote ratio, which is visible to the voters. We obtain the Pitman sampling formula as the equilibrium distribution of $r$ votes. We present the model as a process of posting on a bull… ▽ More

    Submitted 11 December, 2017; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: 29 pages,11 figures

  15. arXiv:1705.10962  [pdf, other

    cs.CL

    Analysis of the Effect of Dependency Information on Predicate-Argument Structure Analysis and Zero Anaphora Resolution

    Authors: Koichiro Yoshino, Shinsuke Mori, Satoshi Nakamura

    Abstract: This paper investigates and analyzes the effect of dependency information on predicate-argument structure analysis (PASA) and zero anaphora resolution (ZAR) for Japanese, and shows that a straightforward approach of PASA and ZAR works effectively even if dependency information was not available. We constructed an analyzer that directly predicts relationships of predicates and arguments with their… ▽ More

    Submitted 31 May, 2017; originally announced May 2017.

  16. arXiv:1605.04278  [pdf, ps, other

    cs.CL

    Universal Dependencies for Learner English

    Authors: Yevgeni Berzak, Jessica Kenney, Carolyn Spadine, Jing Xian Wang, Lucia Lam, Keiko Sophie Mori, Sebastian Garza, Boris Katz

    Abstract: We introduce the Treebank of Learner English (TLE), the first publicly available syntactic treebank for English as a Second Language (ESL). The TLE provides manually annotated POS tags and Universal Dependency (UD) trees for 5,124 sentences from the Cambridge First Certificate in English (FCE) corpus. The UD annotations are tied to a pre-existing error annotation of the FCE, whereby full syntactic… ▽ More

    Submitted 7 June, 2016; v1 submitted 13 May, 2016; originally announced May 2016.

    Comments: Updated parsing experiments to EWT v1.3, improved grammatical error marking, minor revisions. To appear in ACL 2016

  17. arXiv:1504.00458  [pdf, ps, other

    physics.soc-ph cs.SI

    Information cascade on networks

    Authors: Masato Hisakado, Shintaro Mori

    Abstract: In this paper, we discuss a voting model by considering three different kinds of networks: a random graph, the Barabási-Albert(BA) model, and a fitness model. A voting model represents the way in which public perceptions are conveyed to voters. Our voting model is constructed by using two types of voters--herders and independents--and two candidates. Independents conduct voting based on their fund… ▽ More

    Submitted 15 December, 2015; v1 submitted 2 April, 2015; originally announced April 2015.

    Comments: 31 pages, 7 figures. arXiv admin note: text overlap with arXiv:1203.3274

  18. arXiv:1503.03964  [pdf, ps, other

    cs.AI cs.LG physics.data-an stat.ML

    Interactive Restless Multi-armed Bandit Game and Swarm Intelligence Effect

    Authors: Shunsuke Yoshida, Masato Hisakado, Shintaro Mori

    Abstract: We obtain the conditions for the emergence of the swarm intelligence effect in an interactive game of restless multi-armed bandit (rMAB). A player competes with multiple agents. Each bandit has a payoff that changes with a probability $p_{c}$ per round. The agents and player choose one of three options: (1) Exploit (a good bandit), (2) Innovate (asocial learning for a good bandit among $n_{I}$ ran… ▽ More

    Submitted 13 March, 2015; originally announced March 2015.

    Comments: 18 pages, 4 figures

    Journal ref: New generation computing, vol.34, No. 3, 291-306, 2016

  19. arXiv:1307.5118  [pdf, ps, other

    stat.ML cs.LG

    Model-Based Policy Gradients with Parameter-Based Exploration by Least-Squares Conditional Density Estimation

    Authors: Syogo Mori, Voot Tangkaratt, Tingting Zhao, Jun Morimoto, Masashi Sugiyama

    Abstract: The goal of reinforcement learning (RL) is to let an agent learn an optimal control policy in an unknown environment so that future expected rewards are maximized. The model-free RL approach directly learns the policy based on data samples. Although using many samples tends to improve the accuracy of policy learning, collecting a large number of samples is often expensive in practice. On the other… ▽ More

    Submitted 18 July, 2013; originally announced July 2013.

  20. arXiv:1211.3193  [pdf, ps, other

    physics.soc-ph cs.SI

    Collective Adoption of Max-Min Strategy in an Information Cascade Voting Experiment

    Authors: Shintaro Mori, Masato Hisakado, Taiki Takahashi

    Abstract: We consider a situation where one has to choose an option with multiplier m. The multiplier is inversely proportional to the number of people who have chosen the option and is proportional to the return if it is correct. If one does not know the correct option, we call him a herder, and then there is a zero-sum game between the herder and other people who have set the multiplier. The max-min strat… ▽ More

    Submitted 26 June, 2013; v1 submitted 13 November, 2012; originally announced November 2012.

    Comments: 25 pages,9 figures

    Journal ref: J. Phys. Soc. Jpn. 82, 084004 (2013)

  21. arXiv:1203.3274  [pdf, ps, other

    physics.soc-ph cond-mat.stat-mech cs.SI

    Two kinds of Phase transitions in a Voting model

    Authors: Masato Hisakado, Shintaro Mori

    Abstract: In this paper, we discuss a voting model with two candidates, C_0 and C_1. We consider two types of voters--herders and independents. The voting of independents is based on their fundamental values; on the other hand, the voting of herders is based on the number of previous votes. We can identify two kinds of phase transitions. One is an information cascade transition similar to a phase transition… ▽ More

    Submitted 26 July, 2012; v1 submitted 15 March, 2012; originally announced March 2012.

    Comments: 24 pages, 6 figures

    Journal ref: J.Phys.A45,(2012)345002-345016

  22. arXiv:1112.2816  [pdf, ps, other

    physics.soc-ph cond-mat.stat-mech cs.SI

    Phase transition to two-peaks phase in an information cascade voting experiment

    Authors: Shintaro Mori, Masato Hisakado, Taiki Takahashi

    Abstract: Observational learning is an important information aggregation mechanism. However, it occasionally leads to a state in which an entire population chooses a sub-optimal option. When it occurs and whether it is a phase transition remain unanswered. To address these questions, we performed a voting experiment in which subjects answered a two-choice quiz sequentially with and without information about… ▽ More

    Submitted 11 November, 2012; v1 submitted 13 December, 2011; originally announced December 2011.

    Comments: 11 pages, 9 figures, 3 tables

    Journal ref: Phys. Rev. E 86, 026109 (2012)

  23. arXiv:1101.3122  [pdf, ps, other

    physics.soc-ph cond-mat.stat-mech cs.SI

    Digital herders and phase transition in a voting model

    Authors: Masato Hisakado, Shintaro Mori

    Abstract: In this paper, we discuss a voting model with two candidates, C_1 and C_2. We set two types of voters--herders and independents. The voting of independent voters is based on their fundamental values; on the other hand, the voting of herders is based on the number of votes. Herders always select the majority of the previous $r$ votes, which is visible to them. We call them digital herders. We can a… ▽ More

    Submitted 19 May, 2011; v1 submitted 16 January, 2011; originally announced January 2011.

    Comments: 26 pages, 10 figures