Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Authors:
Lifeng Han,
Gareth J. F. Jones,
Alan F. Smeaton,
Paolo Bolzoni
Abstract:
Chinese character decomposition has been used as a feature to enhance Machine Translation (MT) models, combining radicals into character and word level models. Recent work has investigated ideograph or stroke level embedding. However, questions remain about which decomposition level of Chinese character representations, radicals or strokes, is best suited for MT. To investigate the impact of Chinese decomposition embedding in detail, i.e., radical, stroke, and intermediate levels, and how well these decompositions represent the meaning of the original character sequences, we carry out analysis with both automated and human evaluation of MT. Furthermore, we investigate whether the combination of decomposed Multiword Expressions (MWEs) can enhance model learning. MWE integration into MT has seen more than a decade of exploration. However, decomposed MWEs have not previously been explored.
Submitted 9 April, 2021;
originally announced April 2021.
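To make the idea of radical-level decomposition in the abstract above concrete, here is a minimal sketch. The lookup table `COMPONENTS` and the function `decompose` are hypothetical names introduced for illustration only; the three entries are hand-written examples, not the decomposition resource or the levels used in the paper.

```python
# Hypothetical, hand-written component table (illustration only).
# A real system would load a full decomposition resource instead.
COMPONENTS = {
    "好": ["女", "子"],
    "明": ["日", "月"],
    "林": ["木", "木"],
}


def decompose(text, table=COMPONENTS):
    """Replace each character by its listed components.

    Characters that are missing from the table are kept unchanged.
    """
    pieces = []
    for ch in text:
        pieces.extend(table.get(ch, [ch]))
    return pieces
```

Calling `decompose("明好")` on this toy table yields the component sequence for both characters, which could then be fed to an MT model in place of (or alongside) the original characters.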
Solving Orienteering with Category Constraints Using Prioritized Search
Authors:
Paolo Bolzoni,
Sven Helmer
Abstract:
We develop an approach for solving rooted orienteering problems with category constraints as found in tourist trip planning and logistics. It is based on expanding partial solutions in a systematic way, prioritizing promising ones, which reduces the search space we have to traverse. The category constraints help reduce the space we have to explore even further. We implement an algorithm that computes the optimal solution and also illustrate how our approach can be turned into an approximation algorithm, yielding much faster run times and guaranteeing lower bounds on the quality of the solution found. We demonstrate the effectiveness of our algorithms by comparing them to the state-of-the-art approach and an optimal algorithm based on dynamic programming, showing that our technique clearly outperforms these methods.
Submitted 14 February, 2017;
originally announced February 2017.
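The prioritized expansion of partial solutions described in the abstract above can be illustrated with a generic best-first search. This is a sketch under stated assumptions, not the authors' algorithm: the function name `best_first_orienteering`, its parameters, and the optimistic bound are all hypothetical, and the bound is only valid if distances satisfy the triangle inequality and scores are non-negative.

```python
import heapq
from itertools import count


def best_first_orienteering(dist, score, category, max_per_cat,
                            budget, start, end):
    """Best-first search for a start -> ... -> end path within budget.

    Assumptions (not from the paper): dist is a metric distance matrix,
    scores are non-negative, and max_per_cat caps visits per category.
    Returns (best_score, best_path); best_path is None if infeasible.
    """
    pois = [v for v in range(len(dist)) if v not in (start, end)]

    def bound(s, cost, last, path, counts):
        # Optimistic bound: for every category, add the best scores of
        # still-reachable nodes, limited by the remaining category capacity.
        by_cat = {}
        for v in pois:
            if v in path:
                continue
            if cost + dist[last][v] + dist[v][end] > budget:
                continue
            by_cat.setdefault(category[v], []).append(score[v])
        extra = 0
        for cat, scores in by_cat.items():
            room = max_per_cat.get(cat, 0) - counts.get(cat, 0)
            if room > 0:
                extra += sum(sorted(scores, reverse=True)[:room])
        return s + extra

    best_score, best_path = -1, None
    if dist[start][end] <= budget:
        best_score, best_path = 0, [start, end]

    tie = count()  # unique tie-breaker so the heap never compares paths
    first = (-bound(0, 0, start, [start], {}), next(tie), 0, 0, [start], {})
    heap = [first]
    while heap:
        neg_ub, _, s, cost, path, counts = heapq.heappop(heap)
        if -neg_ub <= best_score:
            break  # no queued partial solution can beat the incumbent
        last = path[-1]
        for v in pois:
            cat = category[v]
            if v in path or counts.get(cat, 0) >= max_per_cat.get(cat, 0):
                continue
            new_cost = cost + dist[last][v]
            if new_cost + dist[v][end] > budget:
                continue
            new_s, new_path = s + score[v], path + [v]
            new_counts = dict(counts)
            new_counts[cat] = counts.get(cat, 0) + 1
            if new_s > best_score:
                best_score, best_path = new_s, new_path + [end]
            ub = bound(new_s, new_cost, v, new_path, new_counts)
            heapq.heappush(
                heap, (-ub, next(tie), new_s, new_cost, new_path, new_counts))
    return best_score, best_path
```

The search stops as soon as the most promising queued partial solution cannot improve on the best complete path found so far, which is where the category caps help: they tighten the bound and cut off expansions early.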