Showing 1–22 of 22 results for author: Quan, T

Searching in archive cs.
  1. arXiv:2404.09296  [pdf, other]

    cs.CL

    Cross-Data Knowledge Graph Construction for LLM-enabled Educational Question-Answering System: A Case Study at HCMUT

    Authors: Tuan Bui, Oanh Tran, Phuong Nguyen, Bao Ho, Long Nguyen, Thang Bui, Tho Quan

    Abstract: In today's rapidly evolving landscape of Artificial Intelligence, large language models (LLMs) have emerged as a vibrant research topic. LLMs find applications in various fields and contribute significantly. Despite their powerful language capabilities, similar to pre-trained language models (PLMs), LLMs still face challenges in remembering events, incorporating new information, and addressing dom…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 8 pages, 7 figures

  2. arXiv:2403.02715  [pdf, other]

    cs.CL cs.AI

    Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

    Authors: Sang T. Truong, Duc Q. Nguyen, Toan Nguyen, Dong D. Le, Nhi N. Truong, Tho Quan, Sanmi Koyejo

    Abstract: Recent advancements in large language models (LLMs) have underscored their importance in the evolution of artificial intelligence. However, despite extensive pretraining on multilingual datasets, available open-sourced LLMs exhibit limited effectiveness in processing Vietnamese. The challenge is exacerbated by the absence of systematic benchmark datasets and metrics tailored for Vietnamese LLM eva…

    Submitted 26 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 51 pages

    MSC Class: 68T50

  3. LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training

    Authors: Khoi M. Le, Trinh Pham, Tho Quan, Anh Tuan Luu

    Abstract: Paraphrases are texts that convey the same meaning while using different words or sentence structures. It can be used as an automatic data augmentation tool for many Natural Language Processing tasks, especially when dealing with low-resource languages, where data shortage is a significant problem. To generate a paraphrase in multilingual settings, previous studies have leveraged the knowledge fro…

    Submitted 23 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: First two authors contribute equally. Accepted at AAAI 2024

  4. arXiv:2312.01612  [pdf, other]

    cs.LG cs.AI

    xNeuSM: Explainable Neural Subgraph Matching with Graph Learnable Multi-hop Attention Networks

    Authors: Duc Q. Nguyen, Thanh Toan Nguyen, Tho Quan

    Abstract: Subgraph matching is a challenging problem with a wide range of applications in database systems, biochemistry, and cognitive science. It involves determining whether a given query graph is present within a larger target graph. Traditional graph-matching algorithms provide precise results but face challenges in large graph instances due to the NP-complete problem, limiting their practical applicab…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 33 pages, 8 figures, 6 tables

  5. arXiv:2312.01592  [pdf, other]

    cs.CL

    Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment

    Authors: Cong-Duy Nguyen, The-Anh Vu-Le, Thong Nguyen, Tho Quan, Luu Anh Tuan

    Abstract: Language models have been supervised with both language-only objective and visual grounding in existing studies of visual-grounded language learning. However, due to differences in the distribution and scale of visual-grounded datasets and language corpora, the language model tends to mix up the context of the tokens that occurred in the grounded data with those that do not. As a result, during re…

    Submitted 9 January, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

  6. arXiv:2310.18648  [pdf, other]

    cs.SE

    Generative Artificial Intelligence for Software Engineering -- A Research Agenda

    Authors: Anh Nguyen-Duc, Beatriz Cabrero-Daniel, Adam Przybylek, Chetan Arora, Dron Khanna, Tomas Herda, Usman Rafiq, Jorge Melegati, Eduardo Guerra, Kai-Kristian Kemell, Mika Saari, Zheying Zhang, Huy Le, Tho Quan, Pekka Abrahamsson

    Abstract: Generative Artificial Intelligence (GenAI) tools have become increasingly prevalent in software development, offering assistance to various managerial and technical project activities. Notable examples of these tools include OpenAI's ChatGPT, GitHub Copilot, and Amazon CodeWhisperer. Although many recent publications have explored and evaluated the application of GenAI, a comprehensive understandin…

    Submitted 28 October, 2023; originally announced October 2023.

  7. arXiv:2305.06511  [pdf, other]

    eess.IV cs.CV

    ParamNet: A Dynamic Parameter Network for Fast Multi-to-One Stain Normalization

    Authors: Hongtao Kang, Die Luo, Li Chen, Junbo Hu, Tingwei Quan, Shaoqun Zeng, Shenghua Cheng, Xiuli Liu

    Abstract: In practice, digital pathology images are often affected by various factors, resulting in very large differences in color and brightness. Stain normalization can effectively reduce the differences in color and brightness of digital pathology images, thus improving the performance of computer-aided diagnostic systems. Conventional stain normalization methods rely on one or several reference images,…

    Submitted 16 July, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

  8. arXiv:2304.09383  [pdf, other]

    eess.IV cs.CV cs.GR

    Denoising Diffusion Medical Models

    Authors: Pham Ngoc Huy, Tran Minh Quan

    Abstract: In this study, we introduce a generative model that can synthesize a large number of radiographical image/label pairs, and thus is asymptotically favorable to downstream activities such as segmentation in bio-medical image analysis. Denoising Diffusion Medical Model (DDMM), the proposed technique, can create realistic X-ray images and associated segmentations on a small number of annotated dataset…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE ISBI 2023

  9. Neural Radiance Projection

    Authors: Pham Ngoc Huy, Tran Minh Quan

    Abstract: The proposed method, Neural Radiance Projection (NeRP), addresses the three most fundamental shortages of training such a convolutional neural network on X-ray image segmentation: dealing with missing/limited human-annotated datasets; ambiguity on the per-pixel label; and the imbalance across positive- and negative- classes distribution. By harnessing a generative adversarial network, we can synth…

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: Accepted to IEEE ISBI 2022

    Report number: 10.1109/ISBI52829.2022.9761457

    Journal ref: 10.1109/ISBI52829.2022.9761457

  10. arXiv:2203.02433  [pdf, ps, other]

    cs.LG cs.NE math.OC stat.ML

    The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights

    Authors: Maxime Gasse, Quentin Cappart, Jonas Charfreitag, Laurent Charlin, Didier Chételat, Antonia Chmiela, Justin Dumouchelle, Ambros Gleixner, Aleksandr M. Kazachkov, Elias Khalil, Pawel Lichocki, Andrea Lodi, Miles Lubin, Chris J. Maddison, Christopher Morris, Dimitri J. Papageorgiou, Augustin Parjadis, Sebastian Pokutta, Antoine Prouvost, Lara Scavuzzo, Giulia Zarpellon, Linxin Yang, Sha Lai, Akang Wang, Xiaodong Luo, et al. (16 additional authors not shown)

    Abstract: Combinatorial optimization is a well-established area in operations research and computer science. Until recently, its methods have focused on solving problem instances in isolation, ignoring that they often stem from related data distributions in practice. However, recent years have seen a surge of interest in using machine learning as a new approach for solving combinatorial problems, either dir…

    Submitted 17 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: NeurIPS 2021 competition. arXiv admin note: text overlap with arXiv:2112.12251 by other authors

  11. arXiv:2201.00815  [pdf, other]

    cs.CR

    00

    Authors: Nguyen Thoi Minh Quan

    Abstract: What is the funniest number in cryptography (Episode 2)? 0 [1]. The reason is that $\forall x, x \cdot 0 = 0$, i.e., the equation is satisfied no matter what $x$ is. We'll use zero to attack zero-knowledge proof (ZKP). In particular, we'll discuss a critical issue in a cutting-edge ZKP PLONK [2] C++ implementation which allows an attacker to create a forged proof that all verifiers will accept. We…

    Submitted 14 December, 2021; originally announced January 2022.

  12. arXiv:2109.10616  [pdf, other]

    cs.CL

    Enriching and Controlling Global Semantics for Text Summarization

    Authors: Thong Nguyen, Anh Tuan Luu, Truc Lu, Tho Quan

    Abstract: Recently, Transformer-based models have been proven effective in the abstractive summarization task by creating fluent and informative summaries. Nevertheless, these models still suffer from the short-range dependency problem, causing them to produce summaries that miss the key points of document. In this paper, we attempt to address this issue by introducing a neural topic model empowered with no…

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Accepted to the main EMNLP 2021 conference

  13. StainNet: a fast and robust stain normalization network

    Authors: Hongtao Kang, Die Luo, Weihua Feng, Junbo Hu, Shaoqun Zeng, Tingwei Quan, Xiuli Liu

    Abstract: Stain normalization often refers to transferring the color distribution of the source image to that of the target image and has been widely used in biomedical image analysis. The conventional stain normalization is regarded as constructing a pixel-by-pixel color mapping model, which only depends on one reference image, and can not accurately achieve the style transformation between image datasets.…

    Submitted 23 July, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: 14 pages, 8 figures

    Journal ref: Front. Med. 8:746307 (2021)

  14. arXiv:2005.07058  [pdf, other]

    cs.CV cs.LG eess.IV

    Reinforced Coloring for End-to-End Instance Segmentation

    Authors: Tuan Tran Anh, Khoa Nguyen-Tuan, Tran Minh Quan, Won-Ki Jeong

    Abstract: Instance segmentation is one of the actively studied research topics in computer vision in which many objects of interest should be separated individually. While many feed-forward networks produce high-quality segmentation on different types of images, their results often suffer from topological errors (merging or splitting) for segmentation of many objects, requiring post-processing. Existing ite…

    Submitted 18 May, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

  15. arXiv:1909.01132  [pdf]

    cs.SI cs.LG stat.ML

    PageRank algorithm for Directed Hypergraph

    Authors: Loc Tran, Tho Quan, An Mai

    Abstract: During the last two decades, we easily see that the World Wide Web's link structure is modeled as the directed graph. In this paper, we will model the World Wide Web's link structure as the directed hypergraph. Moreover, we will develop the PageRank algorithm for this directed hypergraph. Due to the lack of the World Wide Web directed hypergraph datasets, we will apply the PageRank algorithm to t…

    Submitted 6 September, 2022; v1 submitted 29 August, 2019; originally announced September 2019.

    MSC Class: 68T10

  16. arXiv:1905.00195  [pdf, other]

    cs.CL

    Nested Variational Autoencoder for Topic Modeling on Microtexts with Word Vectors

    Authors: Trung Trinh, Tho Quan, Trung Mai

    Abstract: Most of the information on the Internet is represented in the form of microtexts, which are short text snippets such as news headlines or tweets. These sources of information are abundant, and mining these data could uncover meaningful insights. Topic modeling is one of the popular methods to extract knowledge from a collection of documents; however, conventional topic models such as latent Dirich…

    Submitted 15 September, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: 27 pages, 9 figures, under review at Expert Systems

  17. Combination of Domain Knowledge and Deep Learning for Sentiment Analysis of Short and Informal Messages on Social Media

    Authors: Khuong Vo, Tri Nguyen, Dang Pham, Mao Nguyen, Minh Truong, Trung Mai, Tho Quan

    Abstract: Sentiment analysis has been emerging recently as one of the major natural language processing (NLP) tasks in many applications. Especially, as social media channels (e.g. social networks or forums) have become significant sources for brands to observe user opinions about their products, this task is thus increasingly crucial. However, when applied with real data obtained from social media, we noti…

    Submitted 20 December, 2019; v1 submitted 16 February, 2019; originally announced February 2019.

    Comments: A preprint of an article accepted for publication by Inderscience in IJCVR in September 2018

    Journal ref: International Journal of Computational Vision and Robotics, 2019 Vol.9 No.5, pp.458 - 485

  18. Towards Autoencoding Variational Inference for Aspect-based Opinion Summary

    Authors: Tai Hoang, Huy Le, Tho Quan

    Abstract: Aspect-based Opinion Summary (AOS), consisting of aspect discovery and sentiment classification steps, has recently been emerging as one of the most crucial data mining tasks in e-commerce systems. Along this direction, the LDA-based model is considered as a notably suitable approach, since this model offers both topic modeling and sentiment classification. However, unlike traditional topic modeli…

    Submitted 6 June, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

    Comments: 20 pages, 11 figures

    Journal ref: Applied Artificial Intelligence, 33 (2019) 796-816

  19. arXiv:1807.07777  [pdf]

    cs.IR

    Semantic Document Clustering on Named Entity Features

    Authors: Tru H. Cao, Vuong M. Ngo, Dung T. Hong, Tho T. Quan

    Abstract: Keyword-based information processing has limitations due to simple treatment of words. In this paper, we introduce named entities as objectives into document clustering, which are the key elements defining document semantics and in many cases are of user concerns. First, the traditional keyword-based vector space model is adapted with vectors defined over spaces of entity names, types, name-type p…

    Submitted 20 July, 2018; originally announced July 2018.

    Comments: 7 pages, PAKDD workshops

  20. Combination of Domain Knowledge and Deep Learning for Sentiment Analysis

    Authors: Khuong Vo, Dang Pham, Mao Nguyen, Trung Mai, Tho Quan

    Abstract: The emerging technique of deep learning has been widely applied in many different areas. However, when adopted in a certain specific domain, this technique should be combined with domain knowledge to improve efficiency and accuracy. In particular, when analyzing the applications of deep learning in sentiment analysis, we found that the current approaches are suffering from the following drawbacks:…

    Submitted 15 February, 2019; v1 submitted 22 June, 2018; originally announced June 2018.

    Comments: Accepted to MIWAI 2017

  21. Compressed Sensing MRI Reconstruction using a Generative Adversarial Network with a Cyclic Loss

    Authors: Tran Minh Quan, Thanh Nguyen-Duc, Won-Ki Jeong

    Abstract: Compressed Sensing MRI (CS-MRI) has provided theoretical foundations upon which the time-consuming MRI acquisition process can be accelerated. However, it primarily relies on iterative numerical solvers which still hinders their adaptation in time-critical applications. In addition, recent advances in deep neural networks have shown their potential in computer vision and image processing, but thei…

    Submitted 15 March, 2018; v1 submitted 3 September, 2017; originally announced September 2017.

    Comments: submitted to IEEE Transactions on Medical Imaging

    Journal ref: IEEE Trans. Med. Imaging 37(6): 1488-1497 (2018)

  22. FusionNet: A deep fully residual convolutional neural network for image segmentation in connectomics

    Authors: Tran Minh Quan, David G. C. Hildebrand, Won-Ki Jeong

    Abstract: Electron microscopic connectomics is an ambitious research direction with the goal of studying comprehensive brain connectivity maps by using high-throughput, nano-scale microscopy. One of the main challenges in connectomics research is developing scalable image analysis algorithms that require minimal user intervention. Recently, deep learning has drawn much attention in computer vision because o…

    Submitted 26 December, 2016; v1 submitted 15 December, 2016; originally announced December 2016.