Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Azuma, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.06721  [pdf, other

    cs.CL cs.LG

    Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative

    Authors: Sho Shimoyama, Tetsuro Morimura, Kenshi Abe, Toda Takamichi, Yuta Tomomatsu, Masakazu Sugiyama, Asahi Hentona, Yuuki Azuma, Hirotaka Ninomiya

    Abstract: Dialog policies, which determine a system's action based on the current state at each dialog turn, are crucial to the success of the dialog. In recent years, reinforcement learning (RL) has emerged as a promising option for dialog policy learning (DPL). In RL-based DPL, dialog policies are updated according to rewards. The manual construction of fine-grained rewards, such as state-action-based one… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.