Showing 1–41 of 41 results for author: Nguyen, T Q

  1. arXiv:2407.04489  [pdf, other]

    cs.CV

    Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model

    Authors: Duy M. H. Nguyen, An T. Le, Trung Q. Nguyen, Nghiem T. Diep, Tai Nguyen, Duy Duong-Tran, Jan Peters, Li Shen, Mathias Niepert, Daniel Sonntag

    Abstract: Prompt learning methods are gaining increasing attention due to their ability to customize large vision-language models to new domains using pre-trained contextual knowledge and minimal training data. However, existing works typically rely on optimizing unified prompt inputs, often struggling with fine-grained classification tasks due to insufficient discriminative attributes. To tackle this, we c…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Version 1

  2. arXiv:2404.11152  [pdf, other]

    eess.IV cs.CV

    Multi-target and multi-stage liver lesion segmentation and detection in multi-phase computed tomography scans

    Authors: Abdullah F. Al-Battal, Soan T. M. Duong, Van Ha Tang, Quang Duc Tran, Steven Q. H. Truong, Chien Phan, Truong Q. Nguyen, Cheolhong An

    Abstract: Multi-phase computed tomography (CT) scans use contrast agents to highlight different anatomical structures within the body to improve the probability of identifying and detecting anatomical structures of interest and abnormalities such as liver lesions. Yet, detecting these lesions remains a challenging task as these lesions vary significantly in their size, shape, texture, and contrast with resp…

    Submitted 17 April, 2024; originally announced April 2024.

  3. arXiv:2310.01413  [pdf]

    eess.IV cs.AI cs.CV

    A multi-institutional pediatric dataset of clinical radiology MRIs by the Children's Brain Tumor Network

    Authors: Ariana M. Familiar, Anahita Fathi Kazerooni, Hannah Anderson, Aliaksandr Lubneuski, Karthik Viswanathan, Rocky Breslow, Nastaran Khalili, Sina Bagheri, Debanjan Haldar, Meen Chul Kim, Sherjeel Arif, Rachel Madhogarhia, Thinh Q. Nguyen, Elizabeth A. Frenkel, Zeinab Helili, Jessica Harrison, Keyvan Farahani, Marius George Linguraru, Ulas Bagci, Yury Velichko, Jeffrey Stevens, Sarah Leary, Robert M. Lober, Stephani Campion, Amy A. Smith, et al. (15 additional authors not shown)

    Abstract: Pediatric brain and spinal cancers remain the leading cause of cancer-related death in children. Advancements in clinical decision-support in pediatric neuro-oncology utilizing the wealth of radiology imaging data collected through standard care, however, has significantly lagged other domains. Such data is ripe for use with predictive analytics such as artificial intelligence (AI) methods, which…

    Submitted 2 October, 2023; originally announced October 2023.

  4. arXiv:2309.17166  [pdf, other]

    cs.CV cs.AI

    Advances in Kidney Biopsy Lesion Assessment through Dense Instance Segmentation

    Authors: Zhan Xiong, Junling He, Pieter Valkema, Tri Q. Nguyen, Maarten Naesens, Jesper Kers, Fons J. Verbeek

    Abstract: Renal biopsies are the gold standard for diagnosis of kidney diseases. Lesion scores made by renal pathologists are semi-quantitative and exhibit high inter-observer variability. Automating lesion classification within segmented anatomical structures can provide decision support in quantification analysis and reduce the inter-observer variability. Nevertheless, classifying lesions in regions-of-in…

    Submitted 28 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: 16 pages, 15 figures, 6 tables, Journal

  5. arXiv:2306.03460  [pdf, other]

    cs.LG cs.CL cs.HC

    Natural Language Commanding via Program Synthesis

    Authors: Apurva Gandhi, Thong Q. Nguyen, Huitian Jiao, Robert Steen, Ameya Bhatawdekar

    Abstract: We present Semantic Interpreter, a natural language-friendly AI system for productivity software such as Microsoft Office that leverages large language models (LLMs) to execute user intent across application features. While LLMs are excellent at understanding user intent expressed as natural language, they are not sufficient for fulfilling application-specific user intent that requires more than t…

    Submitted 6 June, 2023; originally announced June 2023.

  6. arXiv:2305.18361  [pdf, other]

    eess.IV cs.CV

    Deep learning network to correct axial and coronal eye motion in 3D OCT retinal imaging

    Authors: Yiqian Wang, Alexandra Warter, Melina Cavichini, Varsha Alex, Dirk-Uwe G. Bartsch, William R. Freeman, Truong Q. Nguyen, Cheolhong An

    Abstract: Optical Coherence Tomography (OCT) is one of the most important retinal imaging technique. However, involuntary motion artifacts still pose a major challenge in OCT imaging that compromises the quality of downstream analysis, such as retinal layer segmentation and OCT Angiography. We propose deep learning based neural networks to correct axial and coronal motion artifacts in OCT based on a single…

    Submitted 26 May, 2023; originally announced May 2023.

  7. arXiv:2303.14381  [pdf, other]

    cs.CV

    3D Facial Imperfection Regeneration: Deep learning approach and 3D printing prototypes

    Authors: Phuong D. Nguyen, Thinh D. Le, Duong Q. Nguyen, Thanh Q. Nguyen, Li-Wei Chou, H. Nguyen-Xuan

    Abstract: This study explores the potential of a fully convolutional mesh autoencoder model for regenerating 3D nature faces with the presence of imperfect areas. We utilize deep learning approaches in graph processing and analysis to investigate the capabilities model in recreating a filling part for facial scars. Our approach in dataset creation is able to generate a facial scar rationally in a virtual sp…

    Submitted 25 March, 2023; originally announced March 2023.

  8. arXiv:2112.03946  [pdf]

    q-fin.ST cs.LG cs.NE

    Generative Adversarial Network (GAN) and Enhanced Root Mean Square Error (ERMSE): Deep Learning for Stock Price Movement Prediction

    Authors: Ashish Kumar, Abeer Alsadoon, P. W. C. Prasad, Salma Abdullah, Tarik A. Rashid, Duong Thu Hang Pham, Tran Quoc Vinh Nguyen

    Abstract: The prediction of stock price movement direction is significant in financial circles and academic. Stock price contains complex, incomplete, and fuzzy information which makes it an extremely difficult task to predict its development trend. Predicting and analysing financial data is a nonlinear, time-dependent problem. With rapid development in machine learning and deep learning, this task can be p…

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: 18 pages. Multimed Tools Appl, 2021

  9. arXiv:2108.02929  [pdf]

    cs.CV

    VinaFood21: A Novel Dataset for Evaluating Vietnamese Food Recognition

    Authors: Thuan Trong Nguyen, Thuan Q. Nguyen, Dung Vo, Vi Nguyen, Ngoc Ho, Nguyen D. Vo, Kiet Van Nguyen, Khang Nguyen

    Abstract: Vietnam is such an attractive tourist destination with its stunning and pristine landscapes and its top-rated unique food and drink. Among thousands of Vietnamese dishes, foreigners and native people are interested in easy-to-eat tastes and easy-to-do recipes, along with reasonable prices, mouthwatering flavors, and popularity. Due to the diversity and almost all the dishes have significant simila…

    Submitted 5 August, 2021; originally announced August 2021.

  10. arXiv:2106.13849  [pdf, other]

    cs.CV eess.IV

    A CNN Segmentation-Based Approach to Object Detection and Tracking in Ultrasound Scans with Application to the Vagus Nerve Detection

    Authors: Abdullah F. Al-Battal, Yan Gong, Lu Xu, Timothy Morton, Chen Du, Yifeng Bu, Imanuel R Lerman, Radhika Madhavan, Truong Q. Nguyen

    Abstract: Ultrasound scanning is essential in several medical diagnostic and therapeutic applications. It is used to visualize and analyze anatomical features and structures that influence treatment plans. However, it is both labor intensive, and its effectiveness is operator dependent. Real-time accurate and robust automatic detection and tracking of anatomical structures while scanning would significantly…

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: 7 pages, 4 figures, submitted to the IEEE EMBC 2021 conference

  11. arXiv:2105.01691  [pdf, other]

    cs.CL

    Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution

    Authors: Toan Q. Nguyen, Kenton Murray, David Chiang

    Abstract: In this paper, we investigate the driving factors behind concatenation, a simple but effective data augmentation method for low-resource neural machine translation. Our experiments suggest that discourse context is unlikely the cause for the improvement of about +1 BLEU across four language pairs. Instead, we demonstrate that the improvement comes from three other factors unrelated to discourse: c…

    Submitted 2 July, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: Accepted at IWSLT 2021

  12. Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

    Authors: Julia Kreutzer, Isaac Caswell, Lisa Wang, Ahsan Wahab, Daan van Esch, Nasanbayar Ulzii-Orshikh, Allahsera Tapo, Nishant Subramani, Artem Sokolov, Claytone Sikasote, Monang Setyawan, Supheakmungkol Sarin, Sokhar Samb, Benoît Sagot, Clara Rivera, Annette Rios, Isabel Papadimitriou, Salomey Osei, Pedro Ortiz Suarez, Iroro Orife, Kelechi Ogueji, Andre Niyongabo Rubungo, Toan Q. Nguyen, Mathias Müller, André Müller, et al. (27 additional authors not shown)

    Abstract: With the success of large-scale pre-training and multilingual modeling in Natural Language Processing (NLP), recent years have seen a proliferation of large, web-mined text datasets covering hundreds of languages. We manually audit the quality of 205 language-specific corpora released with five major public datasets (CCAligned, ParaCrawl, WikiMatrix, OSCAR, mC4). Lower-resource corpora have system…

    Submitted 21 February, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Accepted at TACL; pre-MIT Press publication version

    Journal ref: Transactions of the Association for Computational Linguistics (2022) 10: 50-72

  13. arXiv:2103.04447  [pdf, ps, other]

    cs.DM cs.DS math.CO

    Termination of Multipartite Graph Series Arising from Complex Network Modelling

    Authors: Matthieu Latapy, Thi Ha Duong Phan, Christophe Crespelle, Thanh Qui Nguyen

    Abstract: An intense activity is nowadays devoted to the definition of models capturing the properties of complex networks. Among the most promising approaches, it has been proposed to model these graphs via their clique incidence bipartite graphs. However, this approach has, until now, severe limitations resulting from its incapacity to reproduce a key property of this object: the overlapping nature of cli…

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: Published in LNCS, proceedings of the 4th International Conference on Combinatorial Optimization and Applications (COCOA), 2010

  14. arXiv:2102.04990  [pdf, other]

    cs.CV cs.CL

    In Defense of Scene Graphs for Image Captioning

    Authors: Kien Nguyen, Subarna Tripathi, Bang Du, Tanaya Guha, Truong Q. Nguyen

    Abstract: The mainstream image captioning models rely on Convolutional Neural Network (CNN) image features to generate captions via recurrent models. Recently, image scene graphs have been used to augment captioning models so as to leverage their structural semantics, such as object entities, relationships and attributes. Several studies have noted that the naive use of scene graphs from a black-box scene g…

    Submitted 17 August, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: Accepted to ICCV 2021

  15. arXiv:2010.01835  [pdf, other]

    physics.comp-ph cs.LG hep-ex hep-ph

    Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

    Authors: Cheng Chen, Olmo Cerri, Thong Q. Nguyen, Jean-Roch Vlimant, Maurizio Pierini

    Abstract: We present a fast simulation application based on a Deep Neural Network, designed to create large analysis-specific datasets. Taking as an example the generation of W+jet events produced in sqrt(s)= 13 TeV proton-proton collisions, we train a neural network to model detector resolution effects as a transfer function acting on an analysis-specific set of relevant features, computed at generation le…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 15 pages, 12 figures

  16. arXiv:2005.01598  [pdf, other]

    hep-ex cs.LG hep-ph

    Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark

    Authors: Oliver Knapp, Guenther Dissertori, Olmo Cerri, Thong Q. Nguyen, Jean-Roch Vlimant, Maurizio Pierini

    Abstract: We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the problem of detecting new physics processes in proton-proton collisions at the Large Hadron Collider. Anomaly detection based on ALAD matches performances reached by Variational Autoencoders, with a substantial improvement in some cases. Training the ALAD algorithm on 4.4 fb-1 of 8 TeV CMS Open Data, we show how a data-driv…

    Submitted 3 October, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 16 pages, 9 figures

  17. arXiv:1910.14659  [pdf, other]

    cs.CL cs.LG eess.AS stat.ML

    Masked Language Model Scoring

    Authors: Julian Salazar, Davis Liang, Toan Q. Nguyen, Katrin Kirchhoff

    Abstract: Pretrained masked language models (MLMs) require finetuning for most NLP tasks. Instead, we evaluate MLMs out of the box via their pseudo-log-likelihood scores (PLLs), which are computed by masking tokens one by one. We show that PLLs outperform scores from autoregressive language models like GPT-2 in a variety of tasks. By rescoring ASR and NMT hypotheses, RoBERTa reduces an end-to-end LibriSpeec…

    Submitted 31 December, 2020; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: ACL 2020 camera-ready (presented July 2020)

    Journal ref: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020), 2699-2712

  18. arXiv:1910.06717  [pdf, other]

    cs.CL cs.LG stat.ML

    Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

    Authors: Kenton Murray, Jeffery Kinnison, Toan Q. Nguyen, Walter Scheirer, David Chiang

    Abstract: Neural sequence-to-sequence models, particularly the Transformer, are the state of the art in machine translation. Yet these neural networks are very sensitive to architecture and hyperparameter settings. Optimizing these settings by grid or random search is computationally expensive because it requires many training runs. In this paper, we incorporate architecture search into a single training ru…

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: The 3rd Workshop on Neural Generation and Translation (WNGT 2019)

  19. arXiv:1910.05895  [pdf, other]

    cs.CL cs.LG stat.ML

    Transformers without Tears: Improving the Normalization of Self-Attention

    Authors: Toan Q. Nguyen, Julian Salazar

    Abstract: We evaluate three simple, normalization-centric changes to improve Transformer training. First, we show that pre-norm residual connections (PreNorm) and smaller initializations enable warmup-free, validation-based training with large learning rates. Second, we propose $\ell_2$ normalization with a single scale parameter (ScaleNorm) for faster training and better performance. Finally, we reaffirm t…

    Submitted 29 December, 2019; v1 submitted 13 October, 2019; originally announced October 2019.

    Comments: Accepted to IWSLT 2019 (oral); code is available at https://github.com/tnq177/transformers_without_tears

  20. arXiv:1901.07838  [pdf, other]

    cs.CV

    Toward Joint Image Generation and Compression using Generative Adversarial Networks

    Authors: Byeongkeun Kang, Subarna Tripathi, Truong Q. Nguyen

    Abstract: In this paper, we present a generative adversarial network framework that generates compressed images instead of synthesizing raw RGB images and compressing them separately. In the real world, most images and videos are stored and transferred in a compressed format to save storage capacity and data transfer bandwidth. However, since typical generative adversarial networks generate raw RGB images,…

    Submitted 23 January, 2019; originally announced January 2019.

  21. Random Forest with Learned Representations for Semantic Segmentation

    Authors: Byeongkeun Kang, Truong Q. Nguyen

    Abstract: In this work, we present a random forest framework that learns the weights, shapes, and sparsities of feature representations for real-time semantic segmentation. Typical filters (kernels) have predetermined shapes and sparsities and learn only weights. A few feature extraction methods fix weights and learn only shapes and sparsities. These predetermined constraints restrict learning and extractin…

    Submitted 23 January, 2019; originally announced January 2019.

  22. arXiv:1811.10276  [pdf, other]

    hep-ex cs.LG hep-ph

    Variational Autoencoders for New Physics Mining at the Large Hadron Collider

    Authors: Olmo Cerri, Thong Q. Nguyen, Maurizio Pierini, Maria Spiropulu, Jean-Roch Vlimant

    Abstract: Using variational autoencoders trained on known physics processes, we develop a one-sided threshold test to isolate previously unseen processes as outlier events. Since the autoencoder training does not depend on any specific new physics signature, the proposed procedure doesn't make specific assumptions on the nature of new physics. An event selection based on this algorithm would be complementar…

    Submitted 13 June, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 29 pages, 12 figures, 5 tables

    Journal ref: J. High Energ. Phys. (2019) 2019: 36

  23. arXiv:1807.00083  [pdf, other]

    hep-ex cs.LG hep-ph physics.data-an

    Topology classification with deep learning to improve real-time event selection at the LHC

    Authors: Thong Q. Nguyen, Daniel Weitekamp III, Dustin Anderson, Roberto Castello, Olmo Cerri, Maurizio Pierini, Maria Spiropulu, Jean-Roch Vlimant

    Abstract: We show how event topology classification based on deep learning could be used to improve the purity of data samples selected in real time at the Large Hadron Collider. We consider different data representations, on which different kinds of multi-class classifiers are trained. Both raw data and high-level features are utilized. In the considered examples, a filter based on the classifier's scor…

    Submitted 2 September, 2019; v1 submitted 29 June, 2018; originally announced July 2018.

    Comments: This is a pre-print of an article published in Computing and Software for Big Science. The final authenticated version is available online at: https://doi.org/10.1007/s41781-019-0028-1

    Journal ref: Comput Softw Big Sci (2019) 3: 12

  24. arXiv:1805.10558  [pdf, other]

    cs.CV

    DPW-SDNet: Dual Pixel-Wavelet Domain Deep CNNs for Soft Decoding of JPEG-Compressed Images

    Authors: Honggang Chen, Xiaohai He, Linbo Qing, Shuhua Xiong, Truong Q. Nguyen

    Abstract: JPEG is one of the widely used lossy compression methods. JPEG-compressed images usually suffer from compression artifacts including blocking and blurring, especially at low bit-rates. Soft decoding is an effective solution to improve the quality of compressed images without changing codec or introducing extra coding bits. Inspired by the excellent performance of the deep convolutional neural netw…

    Submitted 26 May, 2018; originally announced May 2018.

    Comments: CVPRW 2018

  25. arXiv:1803.04477  [pdf, other]

    cs.CV

    Correction by Projection: Denoising Images with Generative Adversarial Networks

    Authors: Subarna Tripathi, Zachary C. Lipton, Truong Q. Nguyen

    Abstract: Generative adversarial networks (GANs) transform low-dimensional latent vectors into visually plausible images. If the real dataset contains only clean images, then ostensibly, the manifold learned by the GAN should contain only clean images. In this paper, we propose to denoise corrupted images by finding the nearest point on the GAN manifold, recovering latent vectors by minimizing distances in…

    Submitted 12 March, 2018; originally announced March 2018.

  26. arXiv:1802.01458  [pdf, other]

    eess.IV cs.CV math.ST stat.ML

    Image denoising with generalized Gaussian mixture model patch priors

    Authors: Charles-Alban Deledalle, Shibin Parameswaran, Truong Q. Nguyen

    Abstract: Patch priors have become an important component of image restoration. A powerful approach in this category of restoration algorithms is the popular Expected Patch Log-Likelihood (EPLL) algorithm. EPLL uses a Gaussian mixture model (GMM) prior learned on clean image patches as a way to regularize degraded patches. In this paper, we show that a generalized Gaussian mixture model (GGMM) captures the…

    Submitted 11 June, 2018; v1 submitted 5 February, 2018; originally announced February 2018.

  27. arXiv:1710.08124  [pdf, other]

    cs.CV

    Accelerating GMM-based patch priors for image restoration: Three ingredients for a 100$\times$ speed-up

    Authors: Shibin Parameswaran, Charles-Alban Deledalle, Loïc Denis, Truong Q. Nguyen

    Abstract: Image restoration methods aim to recover the underlying clean image from corrupted observations. The Expected Patch Log-likelihood (EPLL) algorithm is a powerful image restoration method that uses a Gaussian mixture model (GMM) prior on the patches of natural images. Although it is very effective for restoring images, its high runtime complexity makes EPLL ill-suited for most practical application…

    Submitted 23 October, 2017; originally announced October 2017.

  28. arXiv:1710.01329  [pdf, other]

    cs.CL

    Improving Lexical Choice in Neural Machine Translation

    Authors: Toan Q. Nguyen, David Chiang

    Abstract: We explore two solutions to the problem of mistranslating rare words in neural machine translation. First, we argue that the standard output layer, which computes the inner product of a vector representing the context with all possible output word embeddings, rewards frequent words disproportionately, and we propose to fix the norms of both vectors to a constant value. Second, we integrate a simpl…

    Submitted 17 April, 2018; v1 submitted 3 October, 2017; originally announced October 2017.

    Comments: Accepted at NAACL HLT 2018

  29. arXiv:1708.09803  [pdf, other]

    cs.CL

    Transfer Learning across Low-Resource, Related Languages for Neural Machine Translation

    Authors: Toan Q. Nguyen, David Chiang

    Abstract: We present a simple method to improve neural translation of a low-resource language pair using parallel data from a related, also low-resource, language pair. The method is based on the transfer method of Zoph et al., but whereas their method ignores any source vocabulary overlap, ours exploits it. First, we split words using Byte Pair Encoding (BPE) to increase vocabulary overlap. Then, we train…

    Submitted 21 September, 2017; v1 submitted 31 August, 2017; originally announced August 2017.

  30. Depth Adaptive Deep Neural Network for Semantic Segmentation

    Authors: Byeongkeun Kang, Yeejin Lee, Truong Q. Nguyen

    Abstract: In this work, we present the depth-adaptive deep neural network using a depth map for semantic segmentation. Typical deep neural networks receive inputs at the predetermined locations regardless of the distance from the camera. This fixed receptive field presents a challenge to generalize the features of objects at various distances in neural networks. Specifically, the predetermined receptive fie…

    Submitted 29 January, 2018; v1 submitted 5 August, 2017; originally announced August 2017.

    Comments: IEEE Transactions on Multimedia, 2018

  31. arXiv:1703.10645  [pdf, other]

    cs.CV

    Relevance Subject Machine: A Novel Person Re-identification Framework

    Authors: Igor Fedorov, Ritwik Giri, Bhaskar D. Rao, Truong Q. Nguyen

    Abstract: We propose a novel method called the Relevance Subject Machine (RSM) to solve the person re-identification (re-id) problem. RSM falls under the category of Bayesian sparse recovery algorithms and uses the sparse representation of the input video under a pre-defined dictionary to identify the subject in the video. Our approach focuses on the multi-shot re-id problem, which is the prevalent problem…

    Submitted 30 March, 2017; originally announced March 2017.

    Comments: Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  32. arXiv:1605.02057  [pdf, other]

    cs.CV

    Robust Bayesian Method for Simultaneous Block Sparse Signal Recovery with Applications to Face Recognition

    Authors: Igor Fedorov, Ritwik Giri, Bhaskar D. Rao, Truong Q. Nguyen

    Abstract: In this paper, we present a novel Bayesian approach to recover simultaneously block sparse signals in the presence of outliers. The key advantage of our proposed method is the ability to handle non-stationary outliers, i.e. outliers which have time varying support. We validate our approach with empirical results showing the superiority of the proposed method over competing approaches in synthetic…

    Submitted 10 May, 2016; v1 submitted 6 May, 2016; originally announced May 2016.

    Comments: To appear in ICIP 2016

  33. arXiv:1603.02345  [pdf, other]

    cs.CV

    Hand Segmentation for Hand-Object Interaction from Depth map

    Authors: Byeongkeun Kang, Kar-Han Tan, Nan Jiang, Hung-Shuo Tai, Daniel Tretter, Truong Q. Nguyen

    Abstract: Hand segmentation for hand-object interaction is a necessary preprocessing step in many applications such as augmented reality, medical application, and human-robot interaction. However, typical methods are based on color information which is not robust to objects with skin color, skin pigment difference, and light condition variations. Thus, we propose hand segmentation method for hand-object int…

    Submitted 9 January, 2018; v1 submitted 7 March, 2016; originally announced March 2016.

  34. Joint Defogging and Demosaicking

    Authors: Y. J. Lee, K. Hirakawa, T. Q. Nguyen

    Abstract: Image defogging is a technique used extensively for enhancing visual quality of images in bad weather condition. Even though defogging algorithms have been well studied, defogging performance is degraded by demosaicking artifacts and sensor noise amplification in distant scenes. In order to improve visual quality of restored images, we propose a novel approach to perform defogging and demosaicking…

    Submitted 9 February, 2016; originally announced February 2016.

  35. Adaptive Image Denoising by Mixture Adaptation

    Authors: Enming Luo, Stanley H. Chan, Truong Q. Nguyen

    Abstract: We propose an adaptive learning procedure to learn patch-based image priors for image denoising. The new algorithm, called the Expectation-Maximization (EM) adaptation, takes a generic prior learned from a generic external database and adapts it to the noisy image to generate a specific prior. Different from existing methods that combine internal and external statistics in ad-hoc ways, the propose…

    Submitted 24 June, 2016; v1 submitted 18 January, 2016; originally announced January 2016.

    Comments: 15 pages

  36. arXiv:1510.00981  [pdf, other]

    cs.CV

    Efficient Hand Articulations Tracking using Adaptive Hand Model and Depth map

    Authors: Byeongkeun Kang, Yeejin Lee, Truong Q. Nguyen

    Abstract: Real-time hand articulations tracking is important for many applications such as interacting with virtual / augmented reality devices or tablets. However, most of existing algorithms highly rely on expensive and high power-consuming GPUs to achieve real-time processing. Consequently, these systems are inappropriate for mobile and wearable devices. In this paper, we propose an efficient hand tracki…

    Submitted 17 October, 2015; v1 submitted 4 October, 2015; originally announced October 2015.

    Comments: Advances in Visual Computing: 11th International Symposium on Visual Computing (ISVC'15)

  37. arXiv:1509.03001  [pdf, other]

    cs.CV

    Real-time Sign Language Fingerspelling Recognition using Convolutional Neural Networks from Depth map

    Authors: Byeongkeun Kang, Subarna Tripathi, Truong Q. Nguyen

    Abstract: Sign language recognition is important for natural and convenient communication between deaf community and hearing majority. We take the highly efficient initial step of automatic fingerspelling recognition system using convolutional neural networks (CNNs) from depth maps. In this work, we consider relatively larger number of classes compared with the previous literature. We train CNNs for the cla…

    Submitted 14 October, 2015; v1 submitted 9 September, 2015; originally announced September 2015.

    Comments: 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)

  38. Long-Range Motion Trajectories Extraction of Articulated Human Using Mesh Evolution

    Authors: Yuanyuan Wu, Xiaohai He, Byeongkeun Kang, Haiying Song, Truong Q. Nguyen

    Abstract: This letter presents a novel approach to extract reliable dense and long-range motion trajectories of articulated human in a video sequence. Compared with existing approaches that emphasize temporal consistency of each tracked point, we also consider the spatial structure of tracked points on the articulated human. We treat points as a set of vertices, and build a triangle mesh to join them in ima…

    Submitted 28 March, 2016; v1 submitted 30 June, 2015; originally announced June 2015.

    Comments: IEEE Signal Processing Letters

  39. arXiv:1501.02155  [pdf, ps, other]

    math.MG cs.LO

    A formal proof of the Kepler conjecture

    Authors: Thomas Hales, Mark Adams, Gertrud Bauer, Dat Tat Dang, John Harrison, Truong Le Hoang, Cezary Kaliszyk, Victor Magron, Sean McLaughlin, Thang Tat Nguyen, Truong Quang Nguyen, Tobias Nipkow, Steven Obua, Joseph Pleso, Jason Rute, Alexey Solovyev, An Hoai Thi Ta, Trung Nam Tran, Diep Thi Trieu, Josef Urban, Ky Khac Vu, Roland Zumkeller

    Abstract: This article describes a formal proof of the Kepler conjecture on dense sphere packings in a combination of the HOL Light and Isabelle proof assistants. This paper constitutes the official published account of the now completed Flyspeck project.

    Submitted 9 January, 2015; originally announced January 2015.

    Comments: 21 pages

  40. Adaptive Image Denoising by Targeted Databases

    Authors: Enming Luo, Stanley H. Chan, Truong Q. Nguyen

    Abstract: We propose a data-dependent denoising procedure to restore noisy images. Different from existing denoising algorithms which search for patches from either the noisy image or a generic database, the new algorithm finds patches from a database that contains only relevant patches. We formulate the denoising problem as an optimal filter design problem and make two contributions. First, we determine th…

    Submitted 3 November, 2014; v1 submitted 30 June, 2014; originally announced July 2014.

    Comments: 15 pages, 13 figures, 2 tables, journal

  41. arXiv:1407.3840  [pdf, ps, other]

    cs.CV

    Depth Reconstruction from Sparse Samples: Representation, Algorithm, and Sampling

    Authors: Lee-Kang Liu, Stanley H. Chan, Truong Q. Nguyen

    Abstract: The rapid development of 3D technology and computer vision applications have motivated a thrust of methodologies for depth acquisition and estimation. However, most existing hardware and software methods have limited performance due to poor depth precision, low resolution and high computational cost. In this paper, we present a computationally efficient method to recover dense depth maps from spar…

    Submitted 11 February, 2015; v1 submitted 14 July, 2014; originally announced July 2014.