Zum Hauptinhalt springen

Showing 1–25 of 25 results for author: Kawaguchi, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.05493  [pdf, other

    cs.SD eess.AS

    Stream-based Active Learning for Anomalous Sound Detection in Machine Condition Monitoring

    Authors: Tuan Vu Ho, Kota Dohi, Yohei Kawaguchi

    Abstract: This paper introduces an active learning (AL) framework for anomalous sound detection (ASD) in machine condition monitoring system. Typically, ASD models are trained solely on normal samples due to the scarcity of anomalous data, leading to decreased accuracy for unseen samples during inference. AL is a promising solution to solve this problem by enabling the model to learn new concepts more effec… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: Accepted as a conference paper in INTERSPEECH 2024

  2. arXiv:2406.07250  [pdf, other

    eess.AS cs.LG cs.SD

    Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

    Authors: Tomoya Nishida, Noboru Harada, Daisuke Niizumi, Davide Albertini, Roberto Sannino, Simone Pradolini, Filippo Augusti, Keisuke Imoto, Kota Dohi, Harsh Purohit, Takashi Endo, Yohei Kawaguchi

    Abstract: We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2024 Challenge Task 2: First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring. Continuing from last year's DCASE 2023 Challenge Task 2, we organize the task as a first-shot problem under domain generalization required settings. The main goal of the first-shot… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: anomaly detection, acoustic condition monitoring, domain shift, first-shot problem, DCASE Challenge. arXiv admin note: text overlap with arXiv:2305.07828

  3. arXiv:2403.16610  [pdf, ps, other

    eess.AS cs.CR cs.LG cs.SD

    Distributed collaborative anomalous sound detection by embedding sharing

    Authors: Kota Dohi, Yohei Kawaguchi

    Abstract: To develop a machine sound monitoring system, a method for detecting anomalous sound is proposed. In this paper, we explore a method for multiple clients to collaboratively learn an anomalous sound detection model while keeping their raw data private from each other. In the context of industrial machine anomalous sound detection, each client possesses data from different machines or different oper… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2309.01013  [pdf, other

    cs.LG

    Streaming Active Learning for Regression Problems Using Regression via Classification

    Authors: Shota Horiguchi, Kota Dohi, Yohei Kawaguchi

    Abstract: One of the challenges in deploying a machine learning model is that the model's performance degrades as the operating environment changes. To maintain the performance, streaming active learning is used, in which the model is retrained by adding a newly annotated sample to the training dataset if the prediction of the sample is not certain enough. Although many streaming active learning methods hav… ▽ More

    Submitted 15 December, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024

  5. arXiv:2305.17758  [pdf, ps, other

    cs.SD eess.AS

    CAPTDURE: Captioned Sound Dataset of Single Sources

    Authors: Yuki Okamoto, Kanta Shimonishi, Keisuke Imoto, Kota Dohi, Shota Horiguchi, Yohei Kawaguchi

    Abstract: In conventional studies on environmental sound separation and synthesis using captions, datasets consisting of multiple-source sounds with their captions were used for model training. However, when we collect the captions for multiple-source sound, it is not easy to collect detailed captions for each sound source, such as the number of sound occurrences and timbre. Therefore, it is difficult to ex… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Accepted to INTERSPEECH2023

  6. arXiv:2305.15859  [pdf, ps, other

    cs.SD eess.AS

    Anomalous Sound Detection Based on Sound Separation

    Authors: Kanta Shimonishi, Kota Dohi, Yohei Kawaguchi

    Abstract: This paper proposes an unsupervised anomalous sound detection method using sound separation. In factory environments, background noise and non-objective sounds obscure desired machine sounds, making it challenging to detect anomalous sounds. Therefore, using sounds not mixed with background noise or non-purpose sounds in the detection system is desirable. We compared two versions of our proposed m… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to INTERSPEECH2023

  7. arXiv:2305.07828  [pdf, other

    cs.SD cs.LG eess.AS

    Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

    Authors: Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Ryo Tanabe, Takashi Endo, Yohei Kawaguchi

    Abstract: We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2023 Challenge Task 2: ``First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring''. The main goal is to enable rapid deployment of ASD systems for new kinds of machines without the need for hyperparameter tuning. In the past ASD tasks, developed methods tuned h… ▽ More

    Submitted 2 November, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: anomaly detection, acoustic condition monitoring, domain shift, first-shot problem, DCASE Challenge, Accepted in DCASE2023 Workshop

  8. arXiv:2304.02221  [pdf, ps, other

    cs.LG eess.AS

    Zero-shot domain adaptation of anomalous samples for semi-supervised anomaly detection

    Authors: Tomoya Nishida, Takashi Endo, Yohei Kawaguchi

    Abstract: Semi-supervised anomaly detection~(SSAD) is a task where normal data and a limited number of anomalous data are available for training. In practical situations, SSAD methods suffer adapting to domain shifts, since anomalous data are unlikely to be available for the target domain in the training phase. To solve this problem, we propose a domain adaptation method for SSAD where no anomalous data are… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  9. arXiv:2206.05876  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques

    Authors: Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Takashi Endo, Masaaki Yamamoto, Yohei Kawaguchi

    Abstract: We present the task description and discussion on the results of the DCASE 2022 Challenge Task 2: ``Unsupervised anomalous sound detection (ASD) for machine condition monitoring applying domain generalization techniques''. Domain shifts are a critical problem for the application of ASD systems. Because domain shifts can change the acoustic characteristics of data, a model trained in a source domai… ▽ More

    Submitted 21 November, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2106.04492

  10. arXiv:2206.05460  [pdf, other

    cs.LG cs.AI eess.AS

    Hierarchical Conditional Variational Autoencoder Based Acoustic Anomaly Detection

    Authors: Harsh Purohit, Takashi Endo, Masaaki Yamamoto, Yohei Kawaguchi

    Abstract: This paper aims to develop an acoustic signal-based unsupervised anomaly detection method for automatic machine monitoring. Existing approaches such as deep autoencoder (DAE), variational autoencoder (VAE), conditional variational autoencoder (CVAE) etc. have limited representation capabilities in the latent space and, hence, poor anomaly detection performance. Different models have to be trained… ▽ More

    Submitted 11 June, 2022; originally announced June 2022.

  11. arXiv:2206.02432  [pdf, other

    eess.AS cs.CL cs.SD

    Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors

    Authors: Shota Horiguchi, Shinji Watanabe, Paola Garcia, Yuki Takashima, Yohei Kawaguchi

    Abstract: A method to perform offline and online speaker diarization for an unlimited number of speakers is described in this paper. End-to-end neural diarization (EEND) has achieved overlap-aware speaker diarization by formulating it as a multi-label classification problem. It has also been extended for a flexible number of speakers by introducing speaker-wise attractors. However, the output number of spea… ▽ More

    Submitted 22 December, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted to IEEE/ACM TASLP

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 706-720, 2023

  12. arXiv:2205.13879  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    MIMII DG: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization Task

    Authors: Kota Dohi, Tomoya Nishida, Harsh Purohit, Ryo Tanabe, Takashi Endo, Masaaki Yamamoto, Yuki Nikaido, Yohei Kawaguchi

    Abstract: We present a machine sound dataset to benchmark domain generalization techniques for anomalous sound detection (ASD). Domain shifts are differences in data distributions that can degrade the detection performance, and handling them is a major issue for the application of ASD systems. While currently available datasets for ASD tasks assume that occurrences of domain shifts are known, in practice, t… ▽ More

    Submitted 21 November, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

  13. arXiv:2204.07353  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Anomalous Sound Detection Based on Machine Activity Detection

    Authors: Tomoya Nishida, Kota Dohi, Takashi Endo, Masaaki Yamamoto, Yohei Kawaguchi

    Abstract: We have developed an unsupervised anomalous sound detection method for machine condition monitoring that utilizes an auxiliary task -- detecting when the target machine is active. First, we train a model that detects machine activity by using normal data with machine activity labels and then use the activity-detection error as the anomaly score for a given sound clip if we have access to the groun… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: 5 pages, 2 figures, 1 table

  14. arXiv:2112.00209  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Environmental Sound Extraction Using Onomatopoeic Words

    Authors: Yuki Okamoto, Shota Horiguchi, Masaaki Yamamoto, Keisuke Imoto, Yohei Kawaguchi

    Abstract: An onomatopoeic word, which is a character sequence that phonetically imitates a sound, is effective in expressing characteristics of sound such as duration, pitch, and timbre. We propose an environmental-sound-extraction method using onomatopoeic words to specify the target sound to be extracted. By this method, we estimate a time-frequency mask from an input mixture spectrogram and an onomatopoe… ▽ More

    Submitted 16 February, 2022; v1 submitted 30 November, 2021; originally announced December 2021.

    Comments: Accepted to ICASSP2022

  15. arXiv:2111.06539  [pdf, other

    eess.AS cs.SD

    Disentangling Physical Parameters for Anomalous Sound Detection Under Domain Shifts

    Authors: Kota Dohi, Takashi Endo, Yohei Kawaguchi

    Abstract: To develop a sound-monitoring system for machines, a method for detecting anomalous sound under domain shifts is proposed. A domain shift occurs when a machine's physical parameters change. Because a domain shift changes the distribution of normal sound data, conventional unsupervised anomaly detection methods can output false positives. To solve this problem, the proposed method constrains some l… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: 4 pages, 4 figures

  16. arXiv:2110.04694  [pdf, other

    eess.AS cs.CL cs.SD

    Multi-Channel End-to-End Neural Diarization with Distributed Microphones

    Authors: Shota Horiguchi, Yuki Takashima, Paola Garcia, Shinji Watanabe, Yohei Kawaguchi

    Abstract: Recent progress on end-to-end neural diarization (EEND) has enabled overlap-aware speaker diarization with a single neural network. This paper proposes to enhance EEND by using multi-channel signals from distributed microphones. We replace Transformer encoders in EEND with two types of encoders that process a multi-channel input: spatio-temporal and co-attention encoders. Both are independent of t… ▽ More

    Submitted 28 March, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted to ICASSP 2022

  17. arXiv:2107.03602  [pdf, other

    cs.CV

    Case-based Similar Image Retrieval for Weakly Annotated Large Histopathological Images of Malignant Lymphoma Using Deep Metric Learning

    Authors: Noriaki Hashimoto, Yusuke Takagi, Hiroki Masuda, Hiroaki Miyoshi, Kei Kohno, Miharu Nagaishi, Kensaku Sato, Mai Takeuchi, Takuya Furuta, Keisuke Kawamoto, Kyohei Yamada, Mayuko Moritsubo, Kanako Inoue, Yasumasa Shimasaki, Yusuke Ogura, Teppei Imamoto, Tatsuzo Mishina, Ken Tanaka, Yoshino Kawaguchi, Shigeo Nakamura, Koichi Ohshima, Hidekata Hontani, Ichiro Takeuchi

    Abstract: In the present study, we propose a novel case-based similar image retrieval (SIR) method for hematoxylin and eosin (H&E)-stained histopathological images of malignant lymphoma. When a whole slide image (WSI) is used as an input query, it is desirable to be able to retrieve similar cases by focusing on image patches in pathologically important regions such as tumor cells. To address this problem, w… ▽ More

    Submitted 27 January, 2023; v1 submitted 8 July, 2021; originally announced July 2021.

    ACM Class: H.3.3; I.2.1; J.3

  18. arXiv:2107.01545  [pdf, other

    eess.AS cs.CL cs.SD

    Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors

    Authors: Shota Horiguchi, Shinji Watanabe, Paola Garcia, Yawen Xue, Yuki Takashima, Yohei Kawaguchi

    Abstract: Attractor-based end-to-end diarization is achieving comparable accuracy to the carefully tuned conventional clustering-based methods on challenging datasets. However, the main drawback is that it cannot deal with the case where the number of speakers is larger than the one observed during training. This is because its speaker counting relies on supervised learning. In this work, we introduce an un… ▽ More

    Submitted 23 September, 2021; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: Accepted to ASRU 2021

  19. arXiv:2106.04492  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions

    Authors: Yohei Kawaguchi, Keisuke Imoto, Yuma Koizumi, Noboru Harada, Daisuke Niizumi, Kota Dohi, Ryo Tanabe, Harsh Purohit, Takashi Endo

    Abstract: We present the task description and discussion on the results of the DCASE 2021 Challenge Task 2. In 2020, we organized an unsupervised anomalous sound detection (ASD) task, identifying whether a given sound was normal or anomalous without anomalous training data. In 2021, we organized an advanced unsupervised ASD task under domain-shift conditions, which focuses on the inevitable problem of the p… ▽ More

    Submitted 27 September, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Accepted to DCASE 2021 Workshop

  20. arXiv:2105.02702  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    MIMII DUE: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection with Domain Shifts due to Changes in Operational and Environmental Conditions

    Authors: Ryo Tanabe, Harsh Purohit, Kota Dohi, Takashi Endo, Yuki Nikaido, Toshiki Nakamura, Yohei Kawaguchi

    Abstract: In this paper, we introduce MIMII DUE, a new dataset for malfunctioning industrial machine investigation and inspection with domain shifts due to changes in operational and environmental conditions. Conventional methods for anomalous sound detection face practical challenges because the distribution of features changes between the training and operational phases (called domain shift) due to variou… ▽ More

    Submitted 27 September, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted to IEEE WASPAA 2021

  21. arXiv:2103.08801  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Flow-based Self-supervised Density Estimation for Anomalous Sound Detection

    Authors: Kota Dohi, Takashi Endo, Harsh Purohit, Ryo Tanabe, Yohei Kawaguchi

    Abstract: To develop a machine sound monitoring system, a method for detecting anomalous sound is proposed. Exact likelihood estimation using Normalizing Flows is a promising technique for unsupervised anomaly detection, but it can fail at out-of-distribution detection since the likelihood is affected by the smoothness of the data. To improve the detection performance, we train the model to assign higher li… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 5 pages, 1 figure, accepted in ICASSP 2021

  22. arXiv:2009.12042  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Deep Autoencoding GMM-based Unsupervised Anomaly Detection in Acoustic Signals and its Hyper-parameter Optimization

    Authors: Harsh Purohit, Ryo Tanabe, Takashi Endo, Kaori Suefusa, Yuki Nikaido, Yohei Kawaguchi

    Abstract: Failures or breakdowns in factory machinery can be costly to companies, so there is an increasing demand for automatic machine inspection. Existing approaches to acoustic signal-based unsupervised anomaly detection, such as those using a deep autoencoder (DA) or Gaussian mixture model (GMM), have poor anomaly-detection performance. In this work, we propose a new method based on a deep autoencoding… ▽ More

    Submitted 25 September, 2020; originally announced September 2020.

    Comments: 5 pages, to appear in DCASE 2020 Workshop

  23. arXiv:2006.05822  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

    Authors: Yuma Koizumi, Yohei Kawaguchi, Keisuke Imoto, Toshiki Nakamura, Yuki Nikaido, Ryo Tanabe, Harsh Purohit, Kaori Suefusa, Takashi Endo, Masahiro Yasuda, Noboru Harada

    Abstract: In this paper, we present the task description and discuss the results of the DCASE 2020 Challenge Task 2: Unsupervised Detection of Anomalous Sounds for Machine Condition Monitoring. The goal of anomalous sound detection (ASD) is to identify whether the sound emitted from a target machine is normal or anomalous. The main challenge of this task is to detect unknown anomalous sounds under the condi… ▽ More

    Submitted 8 August, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Submitted to DCASE2020 Workshop

  24. arXiv:2005.09234  [pdf, other

    eess.AS cs.LG cs.SD

    Anomalous sound detection based on interpolation deep neural network

    Authors: Kaori Suefusa, Tomoya Nishida, Harsh Purohit, Ryo Tanabe, Takashi Endo, Yohei Kawaguchi

    Abstract: As the labor force decreases, the demand for labor-saving automatic anomalous sound detection technology that conducts maintenance of industrial equipment has grown. Conventional approaches detect anomalies based on the reconstruction errors of an autoencoder. However, when the target machine sound is non-stationary, a reconstruction error tends to be large independent of an anomaly, and its varia… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: 5 pages, 8 figures, published in ICASSP 2020

  25. arXiv:1909.09347  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    MIMII Dataset: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection

    Authors: Harsh Purohit, Ryo Tanabe, Kenji Ichige, Takashi Endo, Yuki Nikaido, Kaori Suefusa, Yohei Kawaguchi

    Abstract: Factory machinery is prone to failure or breakdown, resulting in significant expenses for companies. Hence, there is a rising interest in machine monitoring using different sensors including microphones. In the scientific community, the emergence of public datasets has led to advancements in acoustic detection and classification of scenes and events, but there are no public datasets that focus on… ▽ More

    Submitted 20 September, 2019; originally announced September 2019.

    Comments: 5 pages, to appear in DCASE 2019 Workshop