Zum Hauptinhalt springen

Showing 1–40 of 40 results for author: Ishii, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.04042  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Scaling Law of Sim2Real Transfer Learning in Expanding Computational Materials Databases for Real-World Predictions

    Authors: Shunya Minami, Yoshihiro Hayashi, Stephen Wu, Kenji Fukumizu, Hiroki Sugisawa, Masashi Ishii, Isao Kuwajima, Kazuya Shiratori, Ryo Yoshida

    Abstract: To address the challenge of limited experimental materials data, extensive physical property databases are being developed based on high-throughput computational experiments, such as molecular dynamics simulations. Previous studies have shown that fine-tuning a predictor pretrained on a computational database to a real system can result in models with outstanding generalization capabilities compar… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 22 pages, 6 figures

  2. arXiv:2405.17842  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation

    Authors: Akio Hayakawa, Masato Ishii, Takashi Shibuya, Yuki Mitsufuji

    Abstract: In this study, we aim to construct an audio-video generative model with minimal computational cost by leveraging pre-trained single-modal generative models for audio and video. To achieve this, we propose a novel method that guides each single-modal model to cooperatively generate well-aligned samples across modalities. Specifically, given two pre-trained base diffusion models, we train a lightwei… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.14598  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

    Authors: Shiqi Yang, Zhi Zhong, Mengjie Zhao, Shusuke Takahashi, Masato Ishii, Takashi Shibuya, Yuki Mitsufuji

    Abstract: In recent years, with the realistic generation results and a wide range of personalized applications, diffusion-based generative models gain huge attention in both visual and audio generation areas. Compared to the considerable advancements of text2image or text2audio generation, research in audio2visual or visual2audio generation has been relatively slow. The recent audio-visual generation method… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 10 pages

  4. arXiv:2402.11840  [pdf, other

    cs.CV

    An Endoscopic Chisel: Intraoperative Imaging Carves 3D Anatomical Models

    Authors: Jan Emily Mangulabnan, Roger D. Soberanis-Mukul, Timo Teufel, Manish Sahu, Jose L. Porras, S. Swaroop Vedula, Masaru Ishii, Gregory Hager, Russell H. Taylor, Mathias Unberath

    Abstract: Purpose: Preoperative imaging plays a pivotal role in sinus surgery where CTs offer patient-specific insights of complex anatomy, enabling real-time intraoperative navigation to complement endoscopy imaging. However, surgery elicits anatomical changes not represented in the preoperative model, generating an inaccurate basis for navigation during surgery progression. Methods: We propose a first v… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  5. Mining experimental data from Materials Science literature with Large Language Models: an evaluation study

    Authors: Luca Foppiano, Guillaume Lambard, Toshiyuki Amagasa, Masashi Ishii

    Abstract: This study is dedicated to assessing the capabilities of large language models (LLMs) such as GPT-3.5-Turbo, GPT-4, and GPT-4-Turbo in extracting structured information from scientific documents in materials science. To this end, we primarily focus on two critical tasks of information extraction: (i) a named entity recognition (NER) of studied materials and physical properties and (ii) a relation… ▽ More

    Submitted 30 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 40 pages: 5 figures and 1 table in the body. 32 Tables in the Appendix / Supplementary materials

    Journal ref: Science and Technology of Advanced Materials: Methods (2024)

  6. arXiv:2312.04779  [pdf, other

    eess.IV cs.CV cs.LG

    Image Synthesis-based Late Stage Cancer Augmentation and Semi-Supervised Segmentation for MRI Rectal Cancer Staging

    Authors: Saeko Sasuga, Akira Kudo, Yoshiro Kitamura, Satoshi Iizuka, Edgar Simo-Serra, Atsushi Hamabe, Masayuki Ishii, Ichiro Takemasa

    Abstract: Rectal cancer is one of the most common diseases and a major cause of mortality. For deciding rectal cancer treatment plans, T-staging is important. However, evaluating the index from preoperative MRI images requires high radiologists' skill and experience. Therefore, the aim of this study is to segment the mesorectum, rectum, and rectal cancer region so that the system can predict T-stage from se… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 10 pages, 7 figures, Accepted to Data Augmentation, Labeling, and Imperfections (DALI) at MICCAI 2022

  7. arXiv:2310.14364  [pdf, other

    cs.CV

    A Quantitative Evaluation of Dense 3D Reconstruction of Sinus Anatomy from Monocular Endoscopic Video

    Authors: Jan Emily Mangulabnan, Roger D. Soberanis-Mukul, Timo Teufel, Isabela Hernández, Jonas Winter, Manish Sahu, Jose L. Porras, S. Swaroop Vedula, Masaru Ishii, Gregory Hager, Russell H. Taylor, Mathias Unberath

    Abstract: Generating accurate 3D reconstructions from endoscopic video is a promising avenue for longitudinal radiation-free analysis of sinus anatomy and surgical outcomes. Several methods for monocular reconstruction have been proposed, yielding visually pleasant 3D anatomical structures by retrieving relative camera poses with structure-from-motion-type algorithms and fusion of monocular depth estimates.… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  8. arXiv:2309.10923  [pdf, other

    cs.CL cond-mat.supr-con cs.DB cs.LG

    Semi-automatic staging area for high-quality structured data extraction from scientific literature

    Authors: Luca Foppiano, Tomoya Mato, Kensei Terashima, Pedro Ortiz Suarez, Taku Tou, Chikako Sakai, Wei-Sheng Wang, Toshiyuki Amagasa, Yoshihiko Takano, Masashi Ishii

    Abstract: We propose a semi-automatic staging area for efficiently building an accurate database of experimental physical properties of superconductors from literature, called SuperCon2, to enrich the existing manually-built superconductor database SuperCon. Here we report our curation interface (SuperCon2 Interface) and a workflow managing the state transitions of each examined record, to validate the data… ▽ More

    Submitted 16 November, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: 5 tables, 6 figures, 18 pages

  9. arXiv:2309.03395  [pdf, other

    cs.RO

    The Quiet Eye Phenomenon in Minimally Invasive Surgery

    Authors: Alaa Eldin Abdelaal, Rachelle Van Rumpt, Sayem Nazmuz Zaman, Irene Tong, Anthony Jarc, Gary L. Gallia, Masaru Ishii, Gregory D. Hager, Septimiu E. Salcudean

    Abstract: In this paper, we report our discovery of a gaze behavior called Quiet Eye (QE) in minimally invasive surgery. The QE behavior has been extensively studied in sports training and has been associated with higher level of expertise in multiple sports. We investigated the QE behavior in two independently collected data sets of surgeons performing tasks in a sinus surgery setting and a robotic surgery… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  10. arXiv:2303.15780  [pdf, other

    cs.CV

    Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion

    Authors: Hiromichi Kamata, Yuiko Sakuma, Akio Hayakawa, Masato Ishii, Takuya Narihira

    Abstract: We propose a high-quality 3D-to-3D conversion method, Instruct 3D-to-3D. Our method is designed for a novel task, which is to convert a given 3D scene to another scene according to text instructions. Instruct 3D-to-3D applies pretrained Image-to-Image diffusion models for 3D-to-3D conversion. This enables the likelihood maximization of each viewpoint image and high-quality 3D generation. In additi… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Project page: https://sony.github.io/Instruct3Dto3D-doc/

  11. arXiv:2303.13121  [pdf, other

    cs.CV

    DetOFA: Efficient Training of Once-for-All Networks for Object Detection Using Path Filter

    Authors: Yuiko Sakuma, Masato Ishii, Takuya Narihira

    Abstract: We address the challenge of training a large supernet for the object detection task, using a relatively small amount of training data. Specifically, we propose an efficient supernet-based neural architecture search (NAS) method that uses search space pruning. The search space defined by the supernet is pruned by removing candidate models that are predicted to perform poorly. To effectively remove… ▽ More

    Submitted 19 October, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to ICCV workshop 2023

  12. arXiv:2212.02024  [pdf, other

    cs.CV cs.LG

    Fine-grained Image Editing by Pixel-wise Guidance Using Diffusion Models

    Authors: Naoki Matsunaga, Masato Ishii, Akio Hayakawa, Kenji Suzuki, Takuya Narihira

    Abstract: Our goal is to develop fine-grained real-image editing methods suitable for real-world applications. In this paper, we first summarize four requirements for these methods and propose a novel diffusion-based image editing framework with pixel-wise guidance that satisfies these requirements. Specifically, we train pixel-classifiers with a few annotated data and then infer the segmentation map of a t… ▽ More

    Submitted 31 May, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: Accepted by AI for Content Creation (AI4CC) workshop at CVPR 2023

  13. arXiv:2210.15600  [pdf, other

    cs.CL cond-mat.supr-con cs.LG

    Automatic extraction of materials and properties from superconductors scientific literature

    Authors: Luca Foppiano, Pedro Baptista de Castro, Pedro Ortiz Suarez, Kensei Terashima, Yoshihiko Takano, Masashi Ishii

    Abstract: The automatic extraction of materials and related properties from the scientific literature is gaining attention in data-driven materials science (Materials Informatics). In this paper, we discuss Grobid-superconductors, our solution for automatically extracting superconductor material names and respective properties from text. Built as a Grobid module, it combines machine learning and heuristic a… ▽ More

    Submitted 22 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: 20 pages, 11 figures, 8 tables

    Journal ref: STAM:M, 2023, VOL. 3, NO. 1, 2153633

  14. arXiv:2209.12130  [pdf, other

    physics.comp-ph cs.DC physics.flu-dyn

    Scalable adaptive algorithms for next-generation multiphase flow simulations

    Authors: Kumar Saurabh, Masado Ishii, Makrand A. Khanwale, Hari Sundar, Baskar Ganapathysubramanian

    Abstract: High-fidelity flow simulations are indispensable when analyzing systems exhibiting multiphase flow phenomena. The accuracy of multiphase flow simulations is strongly contingent upon the finest mesh resolution used to represent the fluid-fluid interfaces. However, the increased resolution comes at a higher computational cost. In this work, we propose algorithmic advances that aim to reduce the comp… ▽ More

    Submitted 3 April, 2023; v1 submitted 24 September, 2022; originally announced September 2022.

    Comments: 12 pages, 9 figures; Accepted for publication in Proceedings of 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

  15. arXiv:2202.09487  [pdf, other

    cs.CV cs.AI cs.RO

    SAGE: SLAM with Appearance and Geometry Prior for Endoscopy

    Authors: Xingtong Liu, Zhaoshuo Li, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath

    Abstract: In endoscopy, many applications (e.g., surgical navigation) would benefit from a real-time method that can simultaneously track the endoscope and reconstruct the dense 3D geometry of the observed anatomy from a monocular endoscopic video. To this end, we develop a Simultaneous Localization and Mapping system by combining the learning-based appearance and optimizable geometry priors and factor grap… ▽ More

    Submitted 22 February, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: Accepted to ICRA 2022

  16. Frailty Care Robot for Elderly and Its Application for Physical and Psychological Support

    Authors: Yoichi Yamazaki, Masayuki Ishii, Takahiro Ito, Takuya Hashimoto

    Abstract: To achieve continuous frail care in the daily lives of the elderly, we propose AHOBO, a frail care robot for the elderly at home. Two types of support systems by AHOBO were implemented to support the elderly in both physical health and psychological aspects. For physical health frailty care, we focused on blood pressure and developed a support system for blood pressure measurement with AHOBO. For… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

    Comments: 9 pages, 15 figures, J. Adv. Comput. Intell. Intell. Inform.(JACIII)

    Journal ref: J. Adv. Comput. Intell. Intell. Inform., Vol.25, No.6, pp. 944-952, 2021

  17. arXiv:2111.09353  [pdf, other

    cs.DC cs.CE cs.CY

    Case study of SARS-CoV-2 transmission risk assessment in indoor environments using cloud computing resources

    Authors: Kumar Saurabh, Santi Adavani, Kendrick Tan, Masado Ishii, Boshun Gao, Adarsh Krishnamurthy, Hari Sundar, Baskar Ganapathysubramanian

    Abstract: Complex flow simulations are conventionally performed on HPC clusters. However, the limited availability of HPC resources and steep learning curve of executing on traditional supercomputer infrastructure has drawn attention towards deploying flow simulation software on the cloud. We showcase how a complex computational framework -- that can evaluate COVID-19 transmission risk in various indoor cla… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted for publication at SuperCompCloud: 5th Workshop on Interoperability of Supercomputing and Cloud Technologies

  18. Scalable adaptive PDE solvers in arbitrary domains

    Authors: Kumar Saurabh, Masado Ishii, Milinda Fernando, Boshun Gao, Kendrick Tan, Ming-Chen Hsu, Adarsh Krishnamurthy, Hari Sundar, Baskar Ganapathysubramanian

    Abstract: Efficiently and accurately simulating partial differential equations (PDEs) in and around arbitrarily defined geometries, especially with high levels of adaptivity, has significant implications for different application domains. A key bottleneck in the above process is the fast construction of a `good' adaptively-refined mesh. In this work, we present an efficient novel octree-based adaptive discr… ▽ More

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: 16 pages. Accepted for publication at Supercomputing '21: The International Conference for High Performance Computing, Networking, Storage, and Analysis

  19. arXiv:2105.07660  [pdf

    cs.CV

    Global Wheat Head Dataset 2021: more diversity to improve the benchmarking of wheat head localization methods

    Authors: Etienne David, Mario Serouart, Daniel Smith, Simon Madec, Kaaviya Velumani, Shouyang Liu, Xu Wang, Francisco Pinto Espinosa, Shahameh Shafiee, Izzat S. A. Tahir, Hisashi Tsujimoto, Shuhei Nasuda, Bangyou Zheng, Norbert Kichgessner, Helge Aasen, Andreas Hund, Pouria Sadhegi-Tehran, Koichi Nagasawa, Goro Ishikawa, Sébastien Dandrifosse, Alexis Carlier, Benoit Mercatoris, Ken Kuroki, Haozhou Wang, Masanori Ishii , et al. (10 additional authors not shown)

    Abstract: The Global Wheat Head Detection (GWHD) dataset was created in 2020 and has assembled 193,634 labelled wheat heads from 4,700 RGB images acquired from various acquisition platforms and 7 countries/institutions. With an associated competition hosted in Kaggle, GWHD has successfully attracted attention from both the computer vision and agricultural science communities. From this first experience in 2… ▽ More

    Submitted 3 June, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: 8 pages, 2 figures, 1 table

  20. arXiv:2103.08193  [pdf, other

    cs.LG

    Semi-supervised learning by selective training with pseudo labels via confidence estimation

    Authors: Masato Ishii

    Abstract: We propose a novel semi-supervised learning (SSL) method that adopts selective training with pseudo labels. In our method, we generate hard pseudo-labels and also estimate their confidence, which represents how likely each pseudo-label is to be correct. Then, we explicitly select which pseudo-labeled data should be used to update the model. Specifically, assuming that loss on incorrectly pseudo-la… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

  21. arXiv:2103.04037  [pdf, other

    cs.CV cs.CL

    Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision

    Authors: Andrew Shin, Masato Ishii, Takuya Narihira

    Abstract: Transformer architectures have brought about fundamental changes to computational linguistic field, which had been dominated by recurrent neural networks for many years. Its success also implies drastic changes in cross-modal tasks with language and vision, and many researchers have already tackled the issue. In this paper, we review some of the most critical milestones in the field, as well as ov… ▽ More

    Submitted 9 November, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Accepted for publication by International Journal of Computer Vision (IJCV)

  22. arXiv:2102.06725  [pdf, other

    cs.LG cs.CV

    Neural Network Libraries: A Deep Learning Framework Designed from Engineers' Perspectives

    Authors: Takuya Narihira, Javier Alonsogarcia, Fabien Cardinaux, Akio Hayakawa, Masato Ishii, Kazunori Iwaki, Thomas Kemp, Yoshiyuki Kobayashi, Lukas Mauch, Akira Nakamura, Yukio Obuchi, Andrew Shin, Kenji Suzuki, Stephen Tiedmann, Stefan Uhlich, Takuya Yashima, Kazuki Yoshiyama

    Abstract: While there exist a plethora of deep learning tools and frameworks, the fast-growing complexity of the field brings new demands and challenges, such as more flexible network design, speedy computation on distributed setting, and compatibility between different tools. In this paper, we introduce Neural Network Libraries (https://nnabla.org), a deep learning framework designed from engineer's perspe… ▽ More

    Submitted 21 June, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: https://nnabla.org

  23. arXiv:2101.10842  [pdf, other

    cs.CV cs.LG

    Source-free Domain Adaptation via Distributional Alignment by Matching Batch Normalization Statistics

    Authors: Masato Ishii, Masashi Sugiyama

    Abstract: In this paper, we propose a novel domain adaptation method for the source-free setting. In this setting, we cannot access source data during adaptation, while unlabeled target data and a model pretrained with source data are given. Due to lack of source data, we cannot directly match the data distributions between domains unlike typical domain adaptation algorithms. To cope with this problem, we p… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

  24. arXiv:2010.11741  [pdf, other

    eess.AS cond-mat.dis-nn cs.AR cs.LG cs.SD

    Ultra-low power on-chip learning of speech commands with phase-change memories

    Authors: Venkata Pavan Kumar Miriyala, Masatoshi Ishii

    Abstract: Embedding artificial intelligence at the edge (edge-AI) is an elegant solution to tackle the power and latency issues in the rapidly expanding Internet of Things. As edge devices typically spend most of their time in sleep mode and only wake-up infrequently to collect and process sensor data, non-volatile in-memory computing (NVIMC) is a promising approach to design the next generation of edge-AI… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  25. arXiv:2008.12321  [pdf, other

    cs.CV

    Learning Representations of Endoscopic Videos to Detect Tool Presence Without Supervision

    Authors: David Z. Li, Masaru Ishii, Russell H. Taylor, Gregory D. Hager, Ayushi Sinha

    Abstract: In this work, we explore whether it is possible to learn representations of endoscopic video frames to perform tasks such as identifying surgical tool presence without supervision. We use a maximum mean discrepancy (MMD) variational autoencoder (VAE) to learn low-dimensional latent representations of endoscopic videos and manipulate these representations to distinguish frames containing tools from… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: 10 pages, 4 figures, CLIP 2020

  26. arXiv:2003.08502  [pdf, other

    cs.CV

    Reconstructing Sinus Anatomy from Endoscopic Video -- Towards a Radiation-free Approach for Quantitative Longitudinal Assessment

    Authors: Xingtong Liu, Maia Stiber, Jindan Huang, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath

    Abstract: Reconstructing accurate 3D surface models of sinus anatomy directly from an endoscopic video is a promising avenue for cross-sectional and longitudinal analysis to better understand the relationship between sinus anatomy and surgical outcomes. We present a patient-specific, learning-based method for 3D reconstruction of sinus surface anatomy directly and only from endoscopic videos. We demonstrate… ▽ More

    Submitted 2 July, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Accepted to MICCAI 2020

  27. arXiv:2003.00619  [pdf, other

    cs.CV

    Extremely Dense Point Correspondences using a Learned Feature Descriptor

    Authors: Xingtong Liu, Yiping Zheng, Benjamin Killeen, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath

    Abstract: High-quality 3D reconstructions from endoscopy video play an important role in many clinical applications, including surgical navigation where they enable direct video-CT registration. While many methods exist for general multi-view 3D reconstruction, these methods often fail to deliver satisfactory performance on endoscopic video. Part of the reason is that local descriptors that establish pair-w… ▽ More

    Submitted 27 March, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

    Comments: The work has been accepted for publication in CVPR 2020

  28. arXiv:1909.03101  [pdf, other

    cs.CV

    Self-supervised Dense 3D Reconstruction from Monocular Endoscopic Video

    Authors: Xingtong Liu, Ayushi Sinha, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath

    Abstract: We present a self-supervised learning-based pipeline for dense 3D reconstruction from full-length monocular endoscopic videos without a priori modeling of anatomy or shading. Our method only relies on unlabeled monocular endoscopic videos and conventional multi-view stereo algorithms, and requires neither manual interaction nor patient CT in both training and application phases. In a cross-patient… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

  29. arXiv:1908.02750  [pdf, other

    physics.comp-ph cs.GT cs.LG physics.flu-dyn

    A physics-informed reinforcement learning approach for the interfacial area transport in two-phase flow

    Authors: Zhuoran Dang, Mamoru Ishii

    Abstract: The prediction of interfacial structure in two-phase flow systems is difficult and challenging. In this paper, a novel physics-informed reinforcement learning-aided framework (PIRLF) for the interfacial area transport is proposed. A Markov Decision Process that describes the bubble transport is established by assuming that the development of two-phase flow is a stochastic process with Markov prope… ▽ More

    Submitted 4 October, 2020; v1 submitted 6 August, 2019; originally announced August 2019.

    Journal ref: International Journal of Heat and Mass Transfer, Volume 192, 15 August 2022, 122919

  30. arXiv:1904.00291  [pdf, other

    cs.CV physics.app-ph physics.data-an physics.flu-dyn

    Two-phase flow regime prediction using LSTM based deep recurrent neural network

    Authors: Zhuoran Dang, Mamoru Ishii

    Abstract: Long short-term memory (LSTM) and recurrent neural network (RNN) has achieved great successes on time-series prediction. In this paper, a methodology of using LSTM-based deep-RNN for two-phase flow regime prediction is proposed, motivated by previous research on constructing deep RNN. The method is featured with fast response and accuracy. The built RNN networks are trained and tested with time-se… ▽ More

    Submitted 30 March, 2019; originally announced April 2019.

  31. arXiv:1903.05312  [pdf, other

    cs.LG stat.ML

    Zero-shot Domain Adaptation Based on Attribute Information

    Authors: Masato Ishii, Takashi Takenouchi, Masashi Sugiyama

    Abstract: In this paper, we propose a novel domain adaptation method that can be applied without target data. We consider the situation where domain shift is caused by a prior change of a specific factor and assume that we know how the prior changes between source and target domains. We call this factor an attribute, and reformulate the domain adaptation problem to utilize the attribute prior instead of tar… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: 14 pages

  32. arXiv:1902.07766  [pdf, other

    cs.CV stat.ML

    Dense Depth Estimation in Monocular Endoscopy with Self-supervised Learning Methods

    Authors: Xingtong Liu, Ayushi Sinha, Masaru Ishii, Gregory D. Hager, Austin Reiter, Russell H. Taylor, Mathias Unberath

    Abstract: We present a self-supervised approach to training convolutional neural networks for dense depth estimation from monocular endoscopy data without a priori modeling of anatomy or shading. Our method only requires monocular endoscopic videos and a multi-view stereo method, e.g., structure from motion, to supervise learning in a sparse manner. Consequently, our method requires neither manual labeling… ▽ More

    Submitted 29 October, 2019; v1 submitted 20 February, 2019; originally announced February 2019.

    Comments: Accepted to IEEE Transactions on Medical Imaging

  33. arXiv:1806.10748  [pdf, other

    cs.CV cs.GR cs.LG

    Towards automatic initialization of registration algorithms using simulated endoscopy images

    Authors: Ayushi Sinha, Masaru Ishii, Russell H. Taylor, Gregory D. Hager, Austin Reiter

    Abstract: Registering images from different modalities is an active area of research in computer aided medical interventions. Several registration algorithms have been developed, many of which achieve high accuracy. However, these results are dependent on many factors, including the quality of the extracted features or segmentations being registered as well as the initial alignment. Although several methods… ▽ More

    Submitted 27 June, 2018; originally announced June 2018.

    Comments: 4 pages, 4 figures

    ACM Class: J.2; J.3; I.2.6; I.2.10; I.3.3; I.3.7

  34. Self-supervised Learning for Dense Depth Estimation in Monocular Endoscopy

    Authors: Xingtong Liu, Ayushi Sinha, Mathias Unberath, Masaru Ishii, Gregory Hager, Russell H. Taylor, Austin Reiter

    Abstract: We present a self-supervised approach to training convolutional neural networks for dense depth estimation from monocular endoscopy data without a priori modeling of anatomy or shading. Our method only requires sequential data from monocular endoscopic videos and a multi-view stereo reconstruction method, e.g. structure from motion, that supervises learning in a sparse but accurate manner. Consequ… ▽ More

    Submitted 26 July, 2018; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: 11 pages, 5 figures

  35. Endoscopic navigation in the absence of CT imaging

    Authors: Ayushi Sinha, Xingtong Liu, Austin Reiter, Masaru Ishii, Gregory D. Hager, Russell H. Taylor

    Abstract: Clinical examinations that involve endoscopic exploration of the nasal cavity and sinuses often do not have a reference image to provide structural context to the clinician. In this paper, we present a system for navigation during clinical endoscopic exploration in the absence of computed tomography (CT) scans by making use of shape statistics from past CT scans. Using a deformable registration al… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

    Comments: 8 pages, 3 figures, MICCAI 2018

    ACM Class: G.3; I.4.m; J.3

  36. arXiv:1805.11178  [pdf, other

    cs.CV

    Towards computational fluorescence microscopy: Machine learning-based integrated prediction of morphological and molecular tumor profiles

    Authors: Alexander Binder, Michael Bockmayr, Miriam Hägele, Stephan Wienert, Daniel Heim, Katharina Hellweg, Albrecht Stenzinger, Laura Parlow, Jan Budczies, Benjamin Goeppert, Denise Treue, Manato Kotani, Masaru Ishii, Manfred Dietel, Andreas Hocke, Carsten Denkert, Klaus-Robert Müller, Frederick Klauschen

    Abstract: Recent advances in cancer research largely rely on new developments in microscopic or molecular profiling techniques offering high level of detail with respect to either spatial or molecular features, but usually not both. Here, we present a novel machine learning-based computational approach that allows for the identification of morphological tissue features and the prediction of molecular proper… ▽ More

    Submitted 28 May, 2018; originally announced May 2018.

  37. Anatomically Constrained Video-CT Registration via the V-IMLOP Algorithm

    Authors: Seth D. Billings, Ayushi Sinha, Austin Reiter, Simon Leonard, Masaru Ishii, Gregory D. Hager, Russell H. Taylor

    Abstract: Functional endoscopic sinus surgery (FESS) is a surgical procedure used to treat acute cases of sinusitis and other sinus diseases. FESS is fast becoming the preferred choice of treatment due to its minimally invasive nature. However, due to the limited field of view of the endoscope, surgeons rely on navigation systems to guide them within the nasal cavity. State of the art navigation systems rep… ▽ More

    Submitted 25 October, 2016; originally announced October 2016.

    Comments: 8 pages, 4 figures, MICCAI

    Journal ref: Medical Image Computing and Computer-Assisted Intervention -- MICCAI 2016: 19th International Conference, Athens, Greece, October 17-21, 2016, Proceedings, Part III. Vol. 9902, pp. 133-141

  38. arXiv:1610.04276  [pdf, ps, other

    cs.CY

    Perspectives on Surgical Data Science

    Authors: S. Swaroop Vedula, Masaru Ishii, Gregory D. Hager

    Abstract: The availability of large amounts of data together with advances in analytical techniques afford an opportunity to address difficult challenges in ensuring that healthcare is safe, effective, efficient, patient-centered, equitable, and timely. Surgical care and training stand to tremendously gain through surgical data science. Herein, we discuss a few perspectives on the scope and objectives for s… ▽ More

    Submitted 13 October, 2016; originally announced October 2016.

    Comments: Workshop on Surgical Data Science, Heidelberg, Germany, June 20, 2016

  39. arXiv:1412.6163  [pdf

    cs.CV

    Automated Objective Surgical Skill Assessment in the Operating Room Using Unstructured Tool Motion

    Authors: Piyush Poddar, Narges Ahmidi, S. Swaroop Vedula, Lisa Ishii, Gregory D. Hager, Masaru Ishii

    Abstract: Previous work on surgical skill assessment using intraoperative tool motion in the operating room (OR) has focused on highly-structured surgical tasks such as cholecystectomy. Further, these methods only considered generic motion metrics such as time and number of movements, which are of limited instructive value. In this paper, we developed and evaluated an automated approach to the surgical skil… ▽ More

    Submitted 18 December, 2014; originally announced December 2014.

  40. arXiv:cs/0111027  [pdf

    cs.NI

    Upgrade of Spring-8 Beamline Network with Vlan Technology Over Gigabit Ethernet

    Authors: M. Ishii, T. Fukui, Y. Furukawa, T. Nakatani, T. Ohata, R. Tanaka

    Abstract: The beamline network system at SPring-8 consists of three LANs; a BL-LAN for beamline component control, a BL-USER-LAN for beamline experimental users and an OA-LAN for the information services. These LANs are interconnected by a firewall system. Since the network traffic and the number of beamlines have increased, we upgraded the backbone of BL-USER-LAN from Fast Ethernet to Gigabit Ethernet. A… ▽ More

    Submitted 17 December, 2001; v1 submitted 9 November, 2001; originally announced November 2001.

    Comments: 3 pages, 2 figure, 8th International Conference on Accelerator and Large Experimental Physics Control Systems (PSN TUAP056), San Jose, CA, USA, November 27-30

    ACM Class: C.2.1

    Journal ref: eConf C011127 (2001) TUAP056