Zum Hauptinhalt springen

Showing 1–29 of 29 results for author: Cheng, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.15057  [pdf

    cs.LG

    Subgroup Analysis via Model-based Rule Forest

    Authors: I-Ling Cheng, Chan Hsu, Chantung Ku, Pei-Ju Lee, Yihuang Kang

    Abstract: Machine learning models are often criticized for their black-box nature, raising concerns about their applicability in critical decision-making scenarios. Consequently, there is a growing demand for interpretable models in such contexts. In this study, we introduce Model-based Deep Rule Forests (mobDRF), an interpretable representation learning algorithm designed to extract transparent models from… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  2. arXiv:2404.15293  [pdf, other

    eess.IV cs.GR q-bio.NC

    Interactive Manipulation and Visualization of 3D Brain MRI for Surgical Training

    Authors: Siddharth Jha, Zichen Gui, Benjamin Delbos, Richard Moreau, Arnaud Leleve, Irene Cheng

    Abstract: In modern medical diagnostics, magnetic resonance imaging (MRI) is an important technique that provides detailed insights into anatomical structures. In this paper, we present a comprehensive methodology focusing on streamlining the segmentation, reconstruction, and visualization process of 3D MRI data. Segmentation involves the extraction of anatomical regions with the help of state-of-the-art de… ▽ More

    Submitted 24 March, 2024; originally announced April 2024.

  3. arXiv:2403.06107  [pdf, other

    cs.CV

    Textureless Object Recognition: An Edge-based Approach

    Authors: Frincy Clement, Kirtan Shah, Dhara Pancholi, Gabriel Lugo Bustillo, Irene Cheng

    Abstract: Textureless object recognition has become a significant task in Computer Vision with the advent of Robotics and its applications in manufacturing sector. It has been challenging to obtain good accuracy in real time because of its lack of discriminative features and reflectance properties which makes the techniques for textured object recognition insufficient for textureless objects. A lot of work… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:1910.14255

  4. arXiv:2403.05658  [pdf, other

    cs.CV cs.AI cs.MM

    Feature CAM: Interpretable AI in Image Classification

    Authors: Frincy Clement, Ji Yang, Irene Cheng

    Abstract: Deep Neural Networks have often been called the black box because of the complex, deep architecture and non-transparency presented by the inner layers. There is a lack of trust to use Artificial Intelligence in critical and high-precision fields such as security, finance, health, and manufacturing industries. A lot of focused work has been done to provide interpretable models, intending to deliver… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  5. arXiv:2203.00314  [pdf, other

    cs.CL

    VScript: Controllable Script Generation with Visual Presentation

    Authors: Ziwei Ji, Yan Xu, I-Tsun Cheng, Samuel Cahyawijaya, Rita Frieske, Etsuko Ishii, Min Zeng, Andrea Madotto, Pascale Fung

    Abstract: In order to offer a customized script tool and inspire professional scriptwriters, we present VScript. It is a controllable pipeline that generates complete scripts, including dialogues and scene descriptions, as well as presents visually using video retrieval. With an interactive interface, our system allows users to select genres and input starting words that control the theme and development of… ▽ More

    Submitted 13 October, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Journal ref: AACL Demo (2022)

  6. Subjective and Objective Visual Quality Assessment of Textured 3D Meshes

    Authors: Jinjiang Guo, Vincent Vidal, Irene Cheng, Anup Basu, Atilla Baskurt, Guillaume Lavoue

    Abstract: Objective visual quality assessment of 3D models is a fundamental issue in computer graphics. Quality assessment metrics may allow a wide range of processes to be guided and evaluated, such as level of detail creation, compression, filtering, and so on. Most computer graphics assets are composed of geometric surfaces on which several texture images can be mapped to 11 make the rendering more reali… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

  7. arXiv:2010.03710  [pdf

    stat.ML cs.IR cs.LG

    Topic Diffusion Discovery Based on Deep Non-negative Autoencoder

    Authors: Sheng-Tai Huang, Yihuang Kang, Shao-Min Hung, Bowen Kuo, I-Ling Cheng

    Abstract: Researchers have been overwhelmed by the explosion of research articles published by various research communities. Many research scholarly websites, search engines, and digital libraries have been created to help researchers identify potential research topics and keep up with recent progress on research of interests. However, it is still difficult for researchers to keep track of the research topi… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  8. arXiv:2007.12496  [pdf, other

    eess.IV cs.CV

    Parkinson's Disease Detection with Ensemble Architectures based on ILSVRC Models

    Authors: Tahjid Ashfaque Mostafa, Irene Cheng

    Abstract: In this work, we explore various neural network architectures using Magnetic Resonance (MR) T1 images of the brain to identify Parkinson's Disease (PD), which is one of the most common neurodegenerative and movement disorders. We propose three ensemble architectures combining some winning Convolutional Neural Network models of ImageNet Large Scale Visual Recognition Challenge (ILSVRC). All of our… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.00682

  9. arXiv:2007.00682  [pdf, other

    eess.IV cs.LG

    Parkinson's Disease Detection Using Ensemble Architecture from MR Images

    Authors: Tahjid Ashfaque Mostafa, Irene Cheng

    Abstract: Parkinson's Disease(PD) is one of the major nervous system disorders that affect people over 60. PD can cause cognitive impairments. In this work, we explore various approaches to identify Parkinson's using Magnetic Resonance (MR) T1 images of the brain. We experiment with ensemble architectures combining some winning Convolutional Neural Network models of ImageNet Large Scale Visual Recognition C… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  10. arXiv:2001.09631  [pdf, other

    eess.IV cs.LG stat.ML

    An Unsupervised Generative Neural Approach for InSAR Phase Filtering and Coherence Estimation

    Authors: Subhayan Mukherjee, Aaron Zimmer, Xinyao Sun, Parwant Ghuman, Irene Cheng

    Abstract: Phase filtering and pixel quality (coherence) estimation is critical in producing Digital Elevation Models (DEMs) from Interferometric Synthetic Aperture Radar (InSAR) images, as it removes spatial inconsistencies (residues) and immensely improves the subsequent unwrapping. Large amount of InSAR data facilitates Wide Area Monitoring (WAM) over geographical regions. Advances in parallel computing h… ▽ More

    Submitted 9 August, 2020; v1 submitted 27 January, 2020; originally announced January 2020.

    Comments: to be published in a future issue of IEEE Geoscience and Remote Sensing Letters

  11. Adaptive Dithering Using Curved Markov-Gaussian Noise in the Quantized Domain for Mapping SDR to HDR Image

    Authors: Subhayan Mukherjee, Guan-Ming Su, Irene Cheng

    Abstract: High Dynamic Range (HDR) imaging is gaining increased attention due to its realistic content, for not only regular displays but also smartphones. Before sufficient HDR content is distributed, HDR visualization still relies mostly on converting Standard Dynamic Range (SDR) content. SDR images are often quantized, or bit depth reduced, before SDR-to-HDR conversion, e.g. for video transmission. Quant… ▽ More

    Submitted 20 January, 2020; originally announced January 2020.

    Comments: 2018 International Conference on Smart Multimedia

  12. arXiv:2001.06961  [pdf, other

    eess.IV cs.LG stat.ML

    CNN-Based Real-Time Parameter Tuning for Optimizing Denoising Filter Performance

    Authors: Subhayan Mukherjee, Navaneeth Kamballur Kottayil, Xinyao Sun, Irene Cheng

    Abstract: We propose a novel direction to improve the denoising quality of filtering-based denoising algorithms in real time by predicting the best filter parameter value using a Convolutional Neural Network (CNN). We take the use case of BM3D, the state-of-the-art filtering-based denoising algorithm, to demonstrate and validate our approach. We propose and train a simple, shallow CNN to predict in real tim… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: 2019 International Conference on Image Analysis and Recognition

  13. arXiv:2001.06956  [pdf

    eess.IV cs.LG stat.ML

    CNN-based InSAR Coherence Classification

    Authors: Subhayan Mukherjee, Aaron Zimmer, Xinyao Sun, Parwant Ghuman, Irene Cheng

    Abstract: Interferometric Synthetic Aperture Radar (InSAR) imagery based on microwaves reflected off ground targets is becoming increasingly important in remote sensing for ground movement estimation. However, the reflections are contaminated by noise, which distorts the signal's wrapped phase. Demarcation of image regions based on degree of contamination ("coherence") is an important component of the InSAR… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: 2018 IEEE SENSORS

  14. arXiv:2001.06954  [pdf

    eess.IV cs.LG stat.ML

    CNN-based InSAR Denoising and Coherence Metric

    Authors: Subhayan Mukherjee, Aaron Zimmer, Navaneeth Kamballur Kottayil, Xinyao Sun, Parwant Ghuman, Irene Cheng

    Abstract: Interferometric Synthetic Aperture Radar (InSAR) imagery for estimating ground movement, based on microwaves reflected off ground targets is gaining increasing importance in remote sensing. However, noise corrupts microwave reflections received at satellite and contaminates the signal's wrapped phase. We introduce Convolutional Neural Networks (CNNs) to this problem domain and show the effectivene… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

    Comments: 2018 IEEE SENSORS

  15. Potential of deep features for opinion-unaware, distortion-unaware, no-reference image quality assessment

    Authors: Subhayan Mukherjee, Giuseppe Valenzise, Irene Cheng

    Abstract: Image Quality Assessment algorithms predict a quality score for a pristine or distorted input image, such that it correlates with human opinion. Traditional methods required a non-distorted "reference" version of the input image to compare with, in order to predict this score. However, recent "No-reference" methods circumvent this requirement by modelling the distribution of clean image features,… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: International Conference on Smart Multimedia (Springer), 16-18 December 2019, San Diego, California, USA

  16. arXiv:1909.03120  [pdf, other

    eess.IV cs.CV

    DeepInSAR: A Deep Learning Framework for SAR Interferometric Phase Restoration and Coherence Estimation

    Authors: Xinyao Sun, Aaron Zimmer, Subhayan Mukherjee, Navaneeth Kamballur Kottayil, Parwant Ghuman, Irene Cheng

    Abstract: Over the past decade, Interferometric Synthetic Aperture Radar (InSAR) has become a successful remote sensing technique. However, during the acquisition step, microwave reflections received at satellite are usually disturbed by strong noise, leading to a noisy single-look complex (SLC) SAR image. The quality of their interferometric phase is even worse. InSAR phase filtering is an ill-posed proble… ▽ More

    Submitted 27 May, 2020; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: 19 pages

  17. arXiv:1907.06333  [pdf, ps, other

    cs.LG stat.ML

    Myers-Briggs Personality Classification and Personality-Specific Language Generation Using Pre-trained Language Models

    Authors: Sedrick Scott Keh, I-Tsun Cheng

    Abstract: The Myers-Briggs Type Indicator (MBTI) is a popular personality metric that uses four dichotomies as indicators of personality traits. This paper examines the use of pre-trained language models to predict MBTI personality types based on scraped labeled texts. The proposed model reaches an accuracy of $0.47$ for correctly predicting all 4 types and $0.86$ for correctly predicting at least 2 types.… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

  18. arXiv:1907.01723  [pdf

    stat.ML cs.LG stat.AP

    Towards Interpretable Deep Extreme Multi-label Learning

    Authors: Yihuang Kang, I-Ling Cheng, Wenjui Mao, Bowen Kuo, Pei-Ju Lee

    Abstract: Many Machine Learning algorithms, such as deep neural networks, have long been criticized for being "black-boxes"-a kind of models unable to provide how it arrive at a decision without further efforts to interpret. This problem has raised concerns on model applications' trust, safety, nondiscrimination, and other ethical issues. In this paper, we discuss the machine learning interpretability of a… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: 6 pages

  19. arXiv:1905.00469  [pdf

    eess.IV cs.CV cs.LG cs.MM stat.ML

    Fully Automatic Brain Tumor Segmentation using a Normalized Gaussian Bayesian Classifier and 3D Fluid Vector Flow

    Authors: Tao Wang, Irene Cheng, Anup Basu

    Abstract: Brain tumor segmentation from Magnetic Resonance Images (MRIs) is an important task to measure tumor responses to treatments. However, automatic segmentation is very challenging. This paper presents an automatic brain tumor segmentation method based on a Normalized Gaussian Bayesian classification and a new 3D Fluid Vector Flow (FVF) algorithm. In our method, a Normalized Gaussian Mixture Model (N… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

    Comments: ICIP 2010

  20. A Fast Segmentation-free Fully Automated Approach to White Matter Injury Detection in Preterm Infants

    Authors: Subhayan Mukherjee, Irene Cheng, Steven Miller, Jessie Guo, Vann Chau, Anup Basu

    Abstract: White Matter Injury (WMI) is the most prevalent brain injury in the preterm neonate leading to developmental deficits. However, detecting WMI in Magnetic Resonance (MR) images of preterm neonate brains using traditional WM segmentation-based methods is difficult mainly due to lack of reliable preterm neonate brain atlases to guide segmentation. Hence, we propose a segmentation-free, fast, unsuperv… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Journal ref: Medical and Biological Engineering and Computing (Springer), 2018

  21. arXiv:1807.04386  [pdf

    cs.IR cs.LG stat.ML

    Topic Diffusion Discovery based on Sparseness-constrained Non-negative Matrix Factorization

    Authors: Yihuang Kang, Keng-Pei Lin, I-Ling Cheng

    Abstract: Due to recent explosion of text data, researchers have been overwhelmed by ever-increasing volume of articles produced by different research communities. Various scholarly search websites, citation recommendation engines, and research databases have been created to simplify the text search tasks. However, it is still difficult for researchers to be able to identify potential research topics withou… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

  22. arXiv:1806.07489  [pdf, other

    cs.CV

    Towards the identification of Parkinson's Disease using only T1 MR Images

    Authors: Sara Soltaninejad, Irene Cheng, Anup Basu

    Abstract: Parkinson's Disease (PD) is one of the most common types of neurological diseases caused by progressive degeneration of dopamin- ergic neurons in the brain. Even though there is no fixed cure for this neurodegenerative disease, earlier diagnosis followed by earlier treatment can help patients have a better quality of life. Magnetic Resonance Imag- ing (MRI) has been one of the most popular diagnos… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: ICSM 2018

  23. Segmentation of Arterial Walls in Intravascular Ultrasound Cross-Sectional Images Using Extremal Region Selection

    Authors: Mehdi Faraji, Irene Cheng, Iris Naudin, Anup Basu

    Abstract: Intravascular Ultrasound (IVUS) is an intra-operative imaging modality that facilitates observing and appraising the vessel wall structure of the human coronary arteries. Segmentation of arterial wall boundaries from the IVUS images is not only crucial for quantitative analysis of the vessel walls and plaque characteristics, but is also necessary for generating 3D reconstructed models of the arter… ▽ More

    Submitted 10 June, 2018; originally announced June 2018.

    Comments: 15 pages, 5 figures, published in Elsevier Ultrasonics

  24. arXiv:1803.04053  [pdf, other

    cs.MM cs.CV

    Learning Local Distortion Visibility From Image Quality Data-sets

    Authors: Navaneeth Kamballur Kottayil, Giuseppe Valenzise, Frederic Dufaux, Irene Cheng

    Abstract: Accurate prediction of local distortion visibility thresholds is critical in many image and video processing applications. Existing methods require an accurate modeling of the human visual system, and are derived through pshycophysical experiments with simple, artificial stimuli. These approaches, however, are difficult to generalize to natural images with complex types of distortion. In this pape… ▽ More

    Submitted 11 March, 2018; originally announced March 2018.

  25. Blind High Dynamic Range Quality estimation by disentangling perceptual and noise features in images

    Authors: Navaneeth Kamballur Kottayil, Giuseppe Valenzise, Frederic Dufaux, Irene Cheng

    Abstract: Assessing the visual quality of High Dynamic Range (HDR) images is an unexplored and an interesting research topic that has become relevant with the current boom in HDR technology. We propose a new convolutional neural network based model for No reference image quality assessment(NR-IQA) on HDR data. This model predicts the amount and location of noise, perceptual influence of image pixels on the… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

  26. Investigation of Gaze Patterns in Multi View Laparoscopic Surgery

    Authors: Navaneeth Kamballur Kottayil, Rositsa Bogdanova, Irene Cheng, Anup Basu, Bin Zheng

    Abstract: Laparoscopic Surgery (LS) is a modern surgical technique whereby the surgery is performed through an incision with tools and camera as opposed to conventional open surgery. This promises minimal recovery times and less hemorrhaging. Multi view LS is the latest development in the field, where the system uses multiple cameras to give the surgeon more information about the surgical site, potentially… ▽ More

    Submitted 30 November, 2017; originally announced December 2017.

    Journal ref: 38th Annual International Conference of the IEEE EMBC, Orlando, FL, 2016, pp. 4031-4034

  27. A Color Intensity Invariant Low Level Feature Optimization Framework for Image Quality Assessment

    Authors: Navaneeth K. Kottayil, Irene Cheng, Frederic Dufaux, Anup Basu

    Abstract: Image Quality Assessment (IQA) algorithms evaluate the perceptual quality of an image using evaluation scores that assess the similarity or difference between two images. We propose a new low-level feature based IQA technique, which applies filter-bank decomposition and center-surround methodology. Differing from existing methods, our model incorporates color intensity adaptation and frequency sca… ▽ More

    Submitted 30 November, 2017; originally announced December 2017.

    Journal ref: Signal, Image and Video Processing 10.6 (2016):1169-1176

  28. Highlighting objects of interest in an image by integrating saliency and depth

    Authors: Subhayan Mukherjee, Irene Cheng, Anup Basu

    Abstract: Stereo images have been captured primarily for 3D reconstruction in the past. However, the depth information acquired from stereo can also be used along with saliency to highlight certain objects in a scene. This approach can be used to make still images more interesting to look at, and highlight objects of interest in the scene. We introduce this novel direction in this paper, and discuss the the… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

  29. Entropy-difference based stereo error detection

    Authors: Subhayan Mukherjee, Irene Cheng, Ram Mohana Reddy Guddeti, Anup Basu

    Abstract: Stereo depth estimation is error-prone; hence, effective error detection methods are desirable. Most such existing methods depend on characteristics of the stereo matching cost curve, making them unduly dependent on functional details of the matching algorithm. As a remedy, we propose a novel error detection approach based solely on the input image and its depth map. Our assumption is that, entrop… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.