Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Hachiya, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2007.10394  [pdf, other

    cs.LG cs.AI stat.ML

    Translation Between Waves, wave2wave

    Authors: Tsuyoshi Okita, Hirotaka Hachiya, Sozo Inoue, Naonori Ueda

    Abstract: The understanding of sensor data has been greatly improved by advanced deep learning methods with big data. However, available sensor data in the real world are still limited, which is called the opportunistic sensor problem. This paper proposes a new variant of neural machine translation seq2seq to deal with continuous signal waves by introducing the window-based (inverse-) representation to adap… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  2. Exchangeable deep neural networks for set-to-set matching and learning

    Authors: Yuki Saito, Takuma Nakamura, Hirotaka Hachiya, Kenji Fukumizu

    Abstract: Matching two different sets of items, called heterogeneous set-to-set matching problem, has recently received attention as a promising problem. The difficulties are to extract features to match a correct pair of different sets and also preserve two types of exchangeability required for set-to-set matching: the pair of sets, as well as the items in each set, should be exchangeable. In this study, w… ▽ More

    Submitted 28 January, 2021; v1 submitted 22 October, 2019; originally announced October 2019.

  3. arXiv:1301.3966  [pdf, ps, other

    cs.LG stat.ML

    Efficient Sample Reuse in Policy Gradients with Parameter-based Exploration

    Authors: Tingting Zhao, Hirotaka Hachiya, Voot Tangkaratt, Jun Morimoto, Masashi Sugiyama

    Abstract: The policy gradient approach is a flexible and powerful reinforcement learning method particularly for problems with continuous actions such as robot control. A common challenge in this scenario is how to reduce the variance of policy gradient estimates for reliable policy updates. In this paper, we combine the following three ideas and give a highly effective policy gradient method: (a) the polic… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

  4. Feature Selection via L1-Penalized Squared-Loss Mutual Information

    Authors: Wittawat Jitkrittum, Hirotaka Hachiya, Masashi Sugiyama

    Abstract: Feature selection is a technique to screen out less important features. Many existing supervised feature selection algorithms use redundancy and relevancy as the main criteria to select features. However, feature interaction, potentially a key characteristic in real-world problems, has not received much attention. As an attempt to take feature interaction into account, we propose L1-LSMI, an L1-re… ▽ More

    Submitted 6 October, 2012; originally announced October 2012.

    Comments: 25 pages

  5. arXiv:1206.4634  [pdf

    cs.LG cs.GR stat.ML

    Artist Agent: A Reinforcement Learning Approach to Automatic Stroke Generation in Oriental Ink Painting

    Authors: Ning Xie, Hirotaka Hachiya, Masashi Sugiyama

    Abstract: Oriental ink painting, called Sumi-e, is one of the most appealing painting styles that has attracted artists around the world. Major challenges in computer-based Sumi-e simulation are to abstract complex scene information and draw smooth and natural brush strokes. To automatically find such strokes, we propose to model the brush as a reinforcement learning agent, and learn desired brush-trajector… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  6. arXiv:1203.3497  [pdf

    cs.LG stat.ML

    Parametric Return Density Estimation for Reinforcement Learning

    Authors: Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashima, Hirotaka Hachiya, Toshiyuki Tanaka

    Abstract: Most conventional Reinforcement Learning (RL) algorithms aim to optimize decision-making rules in terms of the expected returns. However, especially for risk management purposes, other risk-sensitive criteria such as the value-at-risk or the expected shortfall are sometimes preferred in real applications. Here, we describe a parametric method for estimating density of the returns, which allows us… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-368-375