-
Synthetic Multimodal Dataset for Empowering Safety and Well-being in Home Environments
Authors:
Takanori Ugai,
Shusaku Egami,
Swe Nwe Nwe Htun,
Kouji Kozaki,
Takahiro Kawamura,
Ken Fukuda
Abstract:
This paper presents a synthetic multimodal dataset of daily activities that fuses video data from a 3D virtual space simulator with knowledge graphs depicting the spatiotemporal context of the activities. The dataset is developed for the Knowledge Graph Reasoning Challenge for Social Issues (KGRC4SI), which focuses on identifying and addressing hazardous situations in the home environment. The dataset is available to the public as a valuable resource for researchers and practitioners developing innovative solutions recognizing human behaviors to enhance safety and well-being in
Submitted 26 January, 2024;
originally announced January 2024.
-
RDF-star2Vec: RDF-star Graph Embeddings for Data Mining
Authors:
Shusaku Egami,
Takanori Ugai,
Masateru Oota,
Kyoumoto Matsushita,
Takahiro Kawamura,
Kouji Kozaki,
Ken Fukuda
Abstract:
Knowledge Graphs (KGs) such as Resource Description Framework (RDF) data represent relationships between various entities through the structure of triples (<subject, predicate, object>). Knowledge graph embedding (KGE) is crucial in machine learning applications, specifically in node classification and link prediction tasks. KGE remains a vital research topic within the semantic web community. RDF-star introduces the concept of a quoted triple (QT), a specific form of triple employed either as the subject or object within another triple. Moreover, RDF-star permits a QT to act as compositional entities within another QT, thereby enabling the representation of recursive, hyper-relational KGs with nested structures. However, existing KGE models fail to adequately learn the semantics of QTs and entities, primarily because they do not account for RDF-star graphs containing multi-leveled nested QTs and QT-QT relationships. This study introduces RDF-star2Vec, a novel KGE model specifically designed for RDF-star graphs. RDF-star2Vec introduces graph walk techniques that enable probabilistic transitions between a QT and its compositional entities. Feature vectors for QTs, entities, and relations are derived from generated sequences through the structured skip-gram model. Additionally, we provide a dataset and a benchmarking framework for data mining tasks focused on complex RDF-star graphs. Evaluative experiments demonstrated that RDF-star2Vec yielded superior performance compared to recent extensions of RDF2Vec in various tasks including classification, clustering, entity relatedness, and QT similarity.
Submitted 25 December, 2023;
originally announced December 2023.
-
Martian time-series unraveled: A multi-scale nested approach with factorial variational autoencoders
Authors:
Ali Siahkoohi,
Rudy Morel,
Randall Balestriero,
Erwan Allys,
Grégory Sainton,
Taichi Kawamura,
Maarten V. de Hoop
Abstract:
Unsupervised source separation involves unraveling an unknown set of source signals recorded through a mixing operator, with limited prior knowledge about the sources, and only access to a dataset of signal mixtures. This problem is inherently ill-posed and is further challenged by the variety of timescales exhibited by sources in time series data from planetary space missions. As such, a systematic multi-scale unsupervised approach is needed to identify and separate sources at different timescales. Existing methods typically rely on a preselected window size that determines their operating timescale, limiting their capacity to handle multi-scale sources. To address this issue, we propose an unsupervised multi-scale clustering and source separation framework by leveraging wavelet scattering spectra that provide a low-dimensional representation of stochastic processes, capable of distinguishing between different non-Gaussian stochastic processes. Nested within this representation space, we develop a factorial variational autoencoder that is trained to probabilistically cluster sources at different timescales. To perform source separation, we use samples from clusters at multiple timescales obtained via the factorial variational autoencoder as prior information and formulate an optimization problem in the wavelet scattering spectra representation space. When applied to the entire seismic dataset recorded during the NASA InSight mission on Mars, containing sources varying greatly in timescale, our approach disentangles such different sources, e.g., minute-long transient one-sided pulses (known as "glitches") and structured ambient noises resulting from atmospheric activities that typically last for tens of minutes, and provides an opportunity to conduct further investigations into the isolated sources.
Submitted 30 July, 2024; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Unearthing InSights into Mars: Unsupervised Source Separation with Limited Data
Authors:
Ali Siahkoohi,
Rudy Morel,
Maarten V. de Hoop,
Erwan Allys,
Grégory Sainton,
Taichi Kawamura
Abstract:
Source separation involves the ill-posed problem of retrieving a set of source signals that have been observed through a mixing operator. Solving this problem requires prior knowledge, which is commonly incorporated by imposing regularity conditions on the source signals, or implicitly learned through supervised or unsupervised methods from existing data. While data-driven methods have shown great promise in source separation, they often require large amounts of data, which rarely exists in planetary space missions. To address this challenge, we propose an unsupervised source separation scheme for domains with limited data access that involves solving an optimization problem in the wavelet scattering covariance representation space, an interpretable, low-dimensional representation of stationary processes. We present a real-data example in which we remove transient, thermally-induced microtilts (known as glitches) from data recorded by a seismometer during NASA's InSight mission on Mars. Thanks to the wavelet scattering covariances' ability to capture non-Gaussian properties of stochastic processes, we are able to separate glitches using only a few glitch-free data snippets.
Submitted 31 May, 2023; v1 submitted 27 January, 2023;
originally announced January 2023.
-
Report on the First Knowledge Graph Reasoning Challenge 2018 -- Toward the eXplainable AI System
Authors:
Takahiro Kawamura,
Shusaku Egami,
Koutarou Tamura,
Yasunori Hokazono,
Takanori Ugai,
Yusuke Koyanagi,
Fumihito Nishino,
Seiji Okajima,
Katsuhiko Murakami,
Kunihiko Takamatsu,
Aoi Sugiura,
Shun Shiramatsu,
Shawn Zhang,
Kouji Kozaki
Abstract:
A new challenge for knowledge graph reasoning started in 2018. Deep learning has promoted the application of artificial intelligence (AI) techniques to a wide variety of social problems. Accordingly, being able to explain the reason for an AI decision is becoming important to ensure the secure and safe use of AI techniques. Thus, we, the Special Interest Group on Semantic Web and Ontology of the Japanese Society for AI, organized a challenge calling for techniques that reason and/or estimate which characters are criminals while providing a reasonable explanation based on an open knowledge graph of a well-known Sherlock Holmes mystery story. This paper presents a summary report of the first challenge held in 2018, including the knowledge graph construction, the techniques proposed for reasoning and/or estimation, the evaluation metrics, and the results. The first prize went to an approach that formalized the problem as a constraint satisfaction problem and solved it using a lightweight formal method; the second prize went to an approach that used SPARQL and rules; the best resource prize went to a submission that constructed word embedding of characters from all sentences of Sherlock Holmes novels; and the best idea prize went to a discussion multi-agents model. We conclude this paper with the plans and issues for the next challenge in 2019.
Submitted 21 August, 2019;
originally announced August 2019.
-
Research Activity Classification based on Time Series Bibliometrics
Authors:
Takahiro Kawamura,
Yasuhiro Yamashita,
Katsuji Matsumura
Abstract:
Bibliometrics such as the number of papers and times cited are often used to compare researchers based on specific criteria. The criteria, however, differ across research domains and are set by empirical laws. Moreover, it has been argued that a simple sum of metric values works to the advantage of older researchers. Therefore, this paper attempts to construct features from time series data of bibliometrics, and then classify the researchers according to those features. Specifically, time series patterns are extracted from bibliographic data sets, and then a model to classify whether the researchers are "distinguished" or not is created by a machine learning technique. The experiments achieved an F-measure of 80.0% in the classification of 114 researchers in two research domains based on the data sets of the Japan Science and Technology Agency and Elsevier's Scopus. In the future, we will verify the approach on a larger number of researchers in several domains, and then use it to discover "distinguished" researchers who are not widely known.
Submitted 4 August, 2017;
originally announced August 2017.
-
Context-based Barrier Notification Service Toward Outdoor Support for the Elderly
Authors:
Keisuke Umezu,
Takahiro Kawamura,
Akihiko Ohsuga
Abstract:
The aging of society is becoming a global problem, not only in advanced countries. Under such circumstances, the participation of elderly people in social activities is considered highly desirable from various perspectives, including a decrease in social welfare costs. Thus, we propose a mobile service that notifies users of nearby barriers outdoors, to lower the anxiety of elderly people and promote their social activities. Barrier-free maps exist in some areas, but they are static and updated annually at best. However, there are also temporary barriers such as road repairs and parked bicycles, and not every barrier matters to every elderly person. That is, elderly people differ in their conditions and their willingness to go out, so a barrier for one elderly person is not necessarily a barrier for another. Therefore, we first collect barrier information in a user-participatory manner, select the items the user needs to know, and then provide them in a timely manner via a mobile phone equipped with GPS. This paper describes the public experiment that we conducted in Tokyo, and confirms the usability of the service and the accuracy of the information filtering.
Submitted 11 July, 2013;
originally announced July 2013.