Showing 1–50 of 205 results for author: Wong, W

Searching in archive cs.
  1. arXiv:2407.01926  [pdf]

    physics.med-ph cs.CV

    Chemical Shift Encoding based Double Bonds Quantification in Triglycerides using Deep Image Prior

    Authors: Chaoxing Huang, Ziqiang Yu, Zijian Gao, Qiuyi Shen, Queenie Chan, Vincent Wai-Sun Wong, Winnie Chiu-Wing Chu, Weitian Chen

    Abstract: This study evaluated a deep learning-based method using Deep Image Prior (DIP) to quantify triglyceride double bonds from chemical-shift encoded multi-echo gradient echo images without network training. We employed a cost function based on signal constraints to iteratively update the neural network on a single dataset. The method was validated using phantom experiments and in vivo scans. Results s…

    Submitted 3 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.09317  [pdf, other]

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham, et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. For RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources…

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.07574  [pdf, other]

    cs.SI cs.LG

    Biharmonic Distance of Graphs and its Higher-Order Variants: Theoretical Properties with Applications to Centrality and Clustering

    Authors: Mitchell Black, Lucy Lin, Amir Nayyeri, Weng-Keen Wong

    Abstract: Effective resistance is a distance between vertices of a graph that is both theoretically interesting and useful in applications. We study a variant of effective resistance called the biharmonic distance. While the effective resistance measures how well-connected two vertices are, we prove several theoretical results supporting the idea that the biharmonic distance measures how important an edge i…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  4. arXiv:2406.06543  [pdf, other]

    cs.AR cs.LG cs.NE eess.SP

    SparrowSNN: A Hardware/software Co-design for Energy Efficient ECG Classification

    Authors: Zhanglu Yan, Zhenyu Bai, Tulika Mitra, Weng-Fai Wong

    Abstract: Heart disease is one of the leading causes of death worldwide. Given its high risk and often asymptomatic nature, real-time continuous monitoring is essential. Unlike traditional artificial neural networks (ANNs), spiking neural networks (SNNs) are well-known for their energy efficiency, making them ideal for wearable devices and energy-constrained edge computing platforms. However, current energy…

    Submitted 6 May, 2024; originally announced June 2024.

  5. arXiv:2405.17940  [pdf, other]

    cs.RO cs.AI

    World Models for General Surgical Grasping

    Authors: Hongbin Lin, Bin Li, Chun Wai Wong, Juan Rojas, Xiangyu Chu, Kwok Wai Samuel Au

    Abstract: Intelligent vision control systems for surgical robots should adapt to unknown and diverse objects while being robust to system disturbances. Previous methods did not meet these requirements due to mainly relying on pose estimation and feature tracking. We propose a world-model-based deep reinforcement learning framework "Grasp Anything for Surgery" (GAS), that learns a pixel-level visuomotor poli…

    Submitted 28 May, 2024; originally announced May 2024.

    Journal ref: Robotics: Science and Systems 2024

  6. arXiv:2405.12386  [pdf, other]

    stat.ML cs.LG stat.AP stat.CO

    Particle swarm optimization with Applications to Maximum Likelihood Estimation and Penalized Negative Binomial Regression

    Authors: Sisi Shao, Junhyung Park, Weng Kee Wong

    Abstract: General purpose optimization routines such as nlminb, optim (R) or nlmixed (SAS) are frequently used to estimate model parameters in nonstandard distributions. This paper presents Particle Swarm Optimization (PSO) as an alternative to many of the current algorithms used in statistics. We find that PSO can not only reproduce the same results as the above routines, but it can also produce results that…

    Submitted 20 May, 2024; originally announced May 2024.

  7. arXiv:2405.04206  [pdf, other]

    cs.AR cs.AI cs.LG

    NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN Accelerator

    Authors: Mohit Upadhyay, Rohan Juneja, Weng-Fai Wong, Li-Shiuan Peh

    Abstract: Attention mechanisms are becoming increasingly popular, being used in neural network models in multiple domains such as natural language processing (NLP) and vision applications, especially at the edge. However, attention layers are difficult to map onto existing neuro accelerators since they have a much higher density of non-linear operations, which leads to inefficient utilization of today's vect…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 6 pages, 8 figures

    ACM Class: B.2.4

  8. Table-Lookup MAC: Scalable Processing of Quantised Neural Networks in FPGA Soft Logic

    Authors: Daniel Gerlinghoff, Benjamin Chen Ming Choong, Rick Siow Mong Goh, Weng-Fai Wong, Tao Luo

    Abstract: Recent advancements in neural network quantisation have yielded remarkable outcomes, with three-bit networks reaching state-of-the-art full-precision accuracy in complex tasks. These achievements present valuable opportunities for accelerating neural networks by computing in reduced precision. Implementing it on FPGAs can take advantage of bit-level reconfigurability, which is not available on con…

    Submitted 17 March, 2024; originally announced March 2024.

  9. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  10. arXiv:2403.04036  [pdf, other]

    cs.LG cs.AI eess.SP

    Unsupervised Contrastive Learning for Robust RF Device Fingerprinting Under Time-Domain Shift

    Authors: Jun Chen, Weng-Keen Wong, Bechir Hamdaoui

    Abstract: Radio Frequency (RF) device fingerprinting has been recognized as a potential technology for enabling automated wireless device identification and classification. However, it faces a key challenge due to the domain shift that could arise from variations in the channel conditions and environmental settings, potentially degrading the accuracy of RF-based device classification when testing and traini…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 6 pages, 5 figures, accepted by 2024 IEEE International Conference on Communications (ICC)

  11. arXiv:2403.00192  [pdf, other]

    cs.IT

    Block-MDS QC-LDPC Codes for Information Reconciliation in Key Distribution

    Authors: Lev Tauz, Debarnab Mitra, Jayanth Shreekumar, Murat Can Sarihan, Chee Wei Wong, Lara Dolecek

    Abstract: Quantum key distribution (QKD) is a popular protocol that provides information-theoretically secure keys to multiple parties. Two important post-processing steps of QKD are 1) the information reconciliation (IR) step, where parties reconcile mismatches in generated keys through classical communication, and 2) the privacy amplification (PA) step, where parties distill their common key into a new se…

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 7 pages, 1 figure, submitted to the International Symposium on Information Theory (ISIT) 2024

  12. arXiv:2402.15525  [pdf, other]

    cs.CL cs.CY

    Detecting misinformation through Framing Theory: the Frame Element-based Model

    Authors: Guan Wang, Rebecca Frederick, Jinglong Duan, William Wong, Verica Rupar, Weihua Li, Quan Bai

    Abstract: In this paper, we delve into the rapidly evolving challenge of misinformation detection, with a specific focus on the nuanced manipulation of narrative frames - an under-explored area within the AI community. The potential for Generative AI models to generate misleading narratives underscores the urgency of this problem. Drawing from communication and framing theories, we posit that the presentati…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 17 pages, 9 figures, 7 tables

  13. arXiv:2402.13297  [pdf, other]

    q-bio.QM cs.AI

    Integrating Deep Learning and Synthetic Biology: A Co-Design Approach for Enhancing Gene Expression via N-terminal Coding Sequences

    Authors: Zhanglu Yan, Weiran Chu, Yuhua Sheng, Kaiwen Tang, Shida Wang, Yanfeng Liu, Weng-Fai Wong

    Abstract: N-terminal coding sequence (NCS) influences gene expression by impacting the translation initiation rate. The NCS optimization problem is to find an NCS that maximizes gene expression. The problem is important in genetic engineering. However, current methods for NCS optimization such as rational design and statistics-guided approaches are labor-intensive and yield only relatively small improvements. T…

    Submitted 20 February, 2024; originally announced February 2024.

  14. arXiv:2402.13249  [pdf, other]

    cs.CL cs.AI

    TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

    Authors: Liyan Tang, Igor Shalyminov, Amy Wing-mei Wong, Jon Burnsky, Jake W. Vincent, Yu'an Yang, Siffi Singh, Song Feng, Hwanjun Song, Hang Su, Lijia Sun, Yi Zhang, Saab Mansour, Kathleen McKeown

    Abstract: Single document news summarization has seen substantial progress on faithfulness in recent years, driven by research on the evaluation of factual consistency, or hallucinations. We ask whether these advances carry over to other text summarization domains. We propose a new evaluation benchmark on topic-focused dialogue summarization, generated by LLMs of varying sizes. We provide binary sentence-le…

    Submitted 31 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: NAACL 2024; Linguistic annotations available at https://github.com/amazon-science/tofueval

  15. arXiv:2402.10456  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    Generative Modeling for Tabular Data via Penalized Optimal Transport Network

    Authors: Wenhui Sophia Lu, Chenyang Zhong, Wing Hung Wong

    Abstract: The task of precisely learning the probability distribution of rows within tabular data and producing authentic synthetic samples is both crucial and non-trivial. Wasserstein generative adversarial network (WGAN) marks a notable improvement in generative modeling, addressing the challenges faced by its predecessor, generative adversarial network. However, due to the mixed data types and multimodal…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 37 pages, 23 figures

  16. arXiv:2402.01900  [pdf, other]

    stat.ML cs.LG

    Distributional Off-policy Evaluation with Bellman Residual Minimization

    Authors: Sungee Hong, Zhengling Qi, Raymond K. W. Wong

    Abstract: We consider the problem of distributional off-policy evaluation which serves as the foundation of many distributional reinforcement learning (DRL) algorithms. In contrast to most existing works (that rely on supremum-extended statistical distances such as supremum-Wasserstein distance), we study the expectation-extended statistical distance for quantifying the distributional Bellman residuals and…

    Submitted 2 February, 2024; originally announced February 2024.

  17. arXiv:2401.16623  [pdf, other]

    cs.DS cs.IT

    Towards Optimal Grammars for RNA Structures

    Authors: Evarista Onokpasa, Sebastian Wild, Prudence W. H. Wong

    Abstract: In past work (Onokpasa, Wild, Wong, DCC 2023), we showed that (a) for joint compression of RNA sequence and structure, stochastic context-free grammars are the best known compressors and (b) that grammars which have better compression ability also show better performance in ab initio structure prediction. Previous grammars were manually curated by human experts. In this work, we develop a framewor…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: to be presented at DCC 2024

  18. arXiv:2401.01433  [pdf, other]

    cs.IT eess.SP

    Multiple Access Techniques for Intelligent and Multi-Functional 6G: Tutorial, Survey, and Outlook

    Authors: Bruno Clerckx, Yijie Mao, Zhaohui Yang, Mingzhe Chen, Ahmed Alkhateeb, Liang Liu, Min Qiu, Jinhong Yuan, Vincent W. S. Wong, Juan Montojo

    Abstract: Multiple access (MA) is a crucial part of any wireless system and refers to techniques that make use of the resource dimensions to serve multiple users/devices/machines/services, ideally in the most efficient way. Given the needs of multi-functional wireless networks for integrated communications, sensing, localization, computing, coupled with the surge of machine learning / artificial intelligenc…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: submitted for publication in Proceedings of the IEEE

  19. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  20. arXiv:2312.03642  [pdf, other]

    cs.LG

    Transformer-Powered Surrogates Close the ICF Simulation-Experiment Gap with Extremely Limited Data

    Authors: Matthew L. Olson, Shusen Liu, Jayaraman J. Thiagarajan, Bogdan Kustowski, Weng-Keen Wong, Rushil Anirudh

    Abstract: Recent advances in machine learning, specifically transformer architecture, have led to significant advancements in commercial domains. These powerful models have demonstrated superior capability to learn complex relationships and often generalize better to new data and problems. This paper presents a novel transformer-powered approach for enhancing prediction accuracy in multi-modal output scenar…

    Submitted 28 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: MLST

  21. arXiv:2312.00975  [pdf]

    physics.med-ph cs.LG

    Noisy probing dose facilitated dose prediction for pencil beam scanning proton therapy: physics enhances generalizability

    Authors: Lian Zhang, Jason M. Holmes, Zhengliang Liu, Hongying Feng, Terence T. Sio, Carlos E. Vargas, Sameer R. Keole, Kristin Stützer, Sheng Li, Tianming Liu, Jiajian Shen, William W. Wong, Sujay A. Vora, Wei Liu

    Abstract: Purpose: Prior AI-based dose prediction studies in photon and proton therapy often neglect underlying physics, limiting their generalizability to handle outlier clinical cases, especially for pencil beam scanning proton therapy (PBSPT). Our aim is to design a physics-aware and generalizable AI-based PBSPT dose prediction method that has the underlying physics considered to achieve high generalizab…

    Submitted 1 December, 2023; originally announced December 2023.

  22. HongTu: Scalable Full-Graph GNN Training on Multiple GPUs (via communication-optimized CPU data offloading)

    Authors: Qiange Wang, Yao Chen, Weng-Fai Wong, Bingsheng He

    Abstract: Full-graph training on graph neural networks (GNN) has emerged as a promising training method for its effectiveness. Full-graph training requires extensive memory and computation resources. To accelerate this training process, researchers have proposed employing multi-GPU processing. However, the scalability of existing frameworks is limited as they necessitate maintaining the training data for eve…

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 28 pages, 11 figures, SIGMOD 2024

  23. arXiv:2310.08873  [pdf, other]

    cs.RO cs.AI

    Interactive Navigation in Environments with Traversable Obstacles Using Large Language and Vision-Language Models

    Authors: Zhen Zhang, Anran Lin, Chun Wai Wong, Xiangyu Chu, Qi Dou, K. W. Samuel Au

    Abstract: This paper proposes an interactive navigation framework by using large language and vision-language models, allowing robots to navigate in environments with traversable obstacles. We utilize the large language model (GPT-3.5) and the open-set Vision-language Model (Grounding DINO) to create an action-aware costmap to perform effective path planning without fine-tuning. With the large models, we ca…

    Submitted 12 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted by 2024 IEEE International Conference on Robotics and Automation (ICRA), 7 pages, 8 figures

  24. arXiv:2310.07801  [pdf, other]

    cs.CV cs.AI stat.ME

    Trajectory-aware Principal Manifold Framework for Data Augmentation and Image Generation

    Authors: Elvis Han Cui, Bingbin Li, Yanan Li, Weng Kee Wong, Donghui Wang

    Abstract: Data augmentation for deep learning benefits model training, image transformation, medical imaging analysis and many other fields. Many existing methods generate new samples from a parametric distribution, like the Gaussian, with little attention to generating samples along the data manifold in either the input or feature space. In this paper, we verify that there are theoretical and practical advan…

    Submitted 30 July, 2023; originally announced October 2023.

    Comments: 20 figures

  25. arXiv:2310.06530  [pdf, other]

    cs.SE

    Refining Decompiled C Code with Large Language Models

    Authors: Wai Kin Wong, Huaijin Wang, Zongjie Li, Zhibo Liu, Shuai Wang, Qiyi Tang, Sen Nie, Shi Wu

    Abstract: A C decompiler converts an executable into source code. The recovered C source code, once re-compiled, is expected to produce an executable with the same functionality as the original executable. With over twenty years of development, C decompilers have been widely used in production to support reverse engineering applications. Despite the prosperous development of C decompilers, it is widely ackn…

    Submitted 28 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

  26. arXiv:2310.03874  [pdf, other]

    physics.med-ph cs.CL

    Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report

    Authors: Jason Holmes, Lian Zhang, Yuzhen Ding, Hongying Feng, Zhengliang Liu, Tianming Liu, William W. Wong, Sujay A. Vora, Jonathan B. Ashman, Wei Liu

    Abstract: Purpose: To introduce the concept of using large language models (LLMs) to re-label structure names in accordance with the American Association of Physicists in Medicine (AAPM) Task Group (TG)-263 standard, and to establish a benchmark for future studies to reference. Methods and Materials: The Generative Pre-trained Transformer (GPT)-4 application programming interface (API) was implemented as…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 20 pages, 5 figures, 1 table

  27. REB: Reducing Biases in Representation for Industrial Anomaly Detection

    Authors: Shuai Lyu, Dongmei Mo, Waikeung Wong

    Abstract: Existing representation-based methods usually conduct industrial anomaly detection in two stages: obtain feature representations with a pre-trained model and perform distance measures for anomaly detection. Among them, K-nearest neighbor (KNN) retrieval-based anomaly detection methods show promising results. However, the features are not fully exploited as these methods ignore domain bias of pre-t…

    Submitted 17 May, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 14 pages, 7 figures, 7 tables

  28. arXiv:2308.10875   

    cs.NE cs.AI cs.LG

    Metaheuristic Algorithms in Artificial Intelligence with Applications to Bioinformatics, Biostatistics, Ecology and, the Manufacturing Industries

    Authors: Elvis Han Cui, Zizhao Zhang, Culsome Junwen Chen, Weng Kee Wong

    Abstract: Nature-inspired metaheuristic algorithms are important components of artificial intelligence, and are increasingly used across disciplines to tackle various types of challenging optimization problems. We apply a newly proposed nature-inspired metaheuristic algorithm called competitive swarm optimizer with mutated agents (CSO-MA) and demonstrate its flexibility and out-performance relative to its c…

    Submitted 16 October, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Revision, unpublished manuscript

  29. arXiv:2308.08222  [pdf, other]

    cs.RO cs.LG

    HyperSNN: A new efficient and robust deep learning model for resource constrained control applications

    Authors: Zhanglu Yan, Shida Wang, Kaiwen Tang, Weng-Fai Wong

    Abstract: In light of the increasing adoption of edge computing in areas such as intelligent furniture, robotics, and smart homes, this paper introduces HyperSNN, an innovative method for control tasks that uses spiking neural networks (SNNs) in combination with hyperdimensional computing. HyperSNN substitutes expensive 32-bit floating point multiplications with 8-bit integer additions, resulting in reduced…

    Submitted 17 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

  30. arXiv:2307.02736  [pdf]

    physics.med-ph cs.CV

    An Uncertainty Aided Framework for Learning based Liver $T_{1\rho}$ Mapping and Analysis

    Authors: Chaoxing Huang, Vincent Wai Sun Wong, Queenie Chan, Winnie Chiu Wing Chu, Weitian Chen

    Abstract: Objective: Quantitative $T_{1\rho}$ imaging has potential for assessment of biochemical alterations of liver pathologies. Deep learning methods have been employed to accelerate quantitative $T_{1\rho}$ imaging. To employ artificial intelligence-based quantitative imaging methods in complicated clinical environments, it is valuable to estimate the uncertainty of the predicted $T_{1\rho}$ values to provide the con…

    Submitted 9 October, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  31. arXiv:2306.16186  [pdf, other]

    cs.CV cs.AI

    Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specifc Knowledge Injection

    Authors: Zhewei Chen, Wai Keung Wong, Zuofeng Zhong, Jinpiao Liao, Ying Qu

    Abstract: Fabric defect segmentation is integral to textile quality control. Despite this, the scarcity of high-quality annotated data and the diversity of fabric defects present significant challenges to the application of deep learning in this field. These factors limit the generalization and segmentation performance of existing models, impeding their ability to handle the complexity of diverse fabric typ…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 13 pages, 4 figures, 3 tables

    ACM Class: I.2.10; I.4.9; I.5.4

  32. arXiv:2305.12585  [pdf, other]

    cs.LG

    GeometricImageNet: Extending convolutional neural networks to vector and tensor images

    Authors: Wilson Gregory, David W. Hogg, Ben Blum-Smith, Maria Teresa Arias, Kaze W. K. Wong, Soledad Villar

    Abstract: Convolutional neural networks and their ilk have been very successful for many learning tasks involving images. These methods assume that the input is a scalar image representing the intensity in each pixel, possibly in multiple channels for color images. In natural-science domains however, image-like data sets might have vectors (velocity, say), tensors (polarization, say), pseudovectors (magneti…

    Submitted 21 May, 2023; originally announced May 2023.

  33. arXiv:2305.09594  [pdf, other]

    cs.CR cs.LG eess.SP

    HiNoVa: A Novel Open-Set Detection Method for Automating RF Device Authentication

    Authors: Luke Puppo, Weng-Keen Wong, Bechir Hamdaoui, Abdurrahman Elmaghbub

    Abstract: New capabilities in wireless network security have been enabled by deep learning, which leverages patterns in radio frequency (RF) data to identify and authenticate devices. Open-set detection is an area of deep learning that identifies samples captured from new devices during deployment that were not part of the training set. Past work in open-set detection has mostly been applied to independent…

    Submitted 16 May, 2023; originally announced May 2023.

  34. arXiv:2305.05187  [pdf, other]

    cs.NE cs.AI cs.AR

    DeepFire2: A Convolutional Spiking Neural Network Accelerator on FPGAs

    Authors: Myat Thu Linn Aung, Daniel Gerlinghoff, Chuping Qu, Liwei Yang, Tian Huang, Rick Siow Mong Goh, Tao Luo, Weng-Fai Wong

    Abstract: Brain-inspired spiking neural networks (SNNs) replace the multiply-accumulate operations of traditional neural networks by integrate-and-fire neurons, with the goal of achieving greater energy efficiency. Specialized hardware implementations of those neurons clearly have advantages over general-purpose devices in terms of power and performance, but exhibit poor scalability when it comes to acceler…

    Submitted 9 May, 2023; originally announced May 2023.

  35. arXiv:2305.00956  [pdf, other]

    cs.IT

    Non-Binary LDPC Code Design for Energy-Time Entanglement Quantum Key Distribution

    Authors: Debarnab Mitra, Lev Tauz, Murat Can Sarihan, Chee Wei Wong, Lara Dolecek

    Abstract: In energy-time entanglement Quantum Key Distribution (QKD), two users extract a shared secret key from the arrival times (discretized as symbols) of entangled photon pairs. In prior work, Zhou et al. proposed a multi-level coding (MLC) scheme that splits the observed symbols into bit layers and utilizes binary Low-Density Parity-Check (LDPC) codes for reconciliation of the symbols. While binary LD…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 5 pages, 4 figures, submitted to International Symposium on Topics in Coding

  36. arXiv:2304.11135  [pdf]

    cs.CV physics.med-ph

    Deep-Learning-based Fast and Accurate 3D CT Deformable Image Registration in Lung Cancer

    Authors: Yuzhen Ding, Hongying Feng, Yunze Yang, Jason Holmes, Zhengliang Liu, David Liu, William W. Wong, Nathan Y. Yu, Terence T. Sio, Steven E. Schild, Baoxin Li, Wei Liu

    Abstract: Purpose: In some proton therapy facilities, patient alignment relies on two 2D orthogonal kV images, taken at fixed, oblique angles, as no 3D on-the-bed imaging is available. The visibility of the tumor in kV images is limited since the patient's 3D anatomy is projected onto a 2D plane, especially when the tumor is behind high-density structures such as bones. This can lead to large patient setup…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 9 figures

  37. LightRW: FPGA Accelerated Graph Dynamic Random Walks

    Authors: Hongshi Tan, Xinyu Chen, Yao Chen, Bingsheng He, Weng-Fai Wong

    Abstract: Graph dynamic random walks (GDRWs) have recently emerged as a powerful paradigm for graph analytics and learning applications, including graph embedding and graph neural networks. Despite the fact that many existing studies optimize the performance of GDRWs on multi-core CPUs, massive random memory accesses and costly synchronizations cause severe resource underutilization, and the processing of G…

    Submitted 21 April, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted to SIGMOD 2023

  38. arXiv:2303.10774  [pdf, other]

    cs.LG cs.CV

    Cross-GAN Auditing: Unsupervised Identification of Attribute Level Similarities and Differences between Pretrained Generative Models

    Authors: Matthew L. Olson, Shusen Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Weng-Keen Wong

    Abstract: Generative Adversarial Networks (GANs) are notoriously difficult to train especially for complex distributions and with limited data. This has driven the need for tools to audit trained networks in human intelligible format, for example, to identify biases or ensure fairness. Existing GAN audit tools are restricted to coarse-grained, model-data comparisons based on summary statistics such as FID o…

    Submitted 2 May, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. Source code is available at https://github.com/mattolson93/cross_gan_auditing

  39. arXiv:2303.10672  [pdf, other]

    cs.AI cs.DC

    Going faster to see further: GPU-accelerated value iteration and simulation for perishable inventory control using JAX

    Authors: Joseph Farrington, Kezhi Li, Wai Keong Wong, Martin Utley

    Abstract: Value iteration can find the optimal replenishment policy for a perishable inventory problem, but is computationally demanding due to the large state spaces that are required to represent the age profile of stock. The parallel processing capabilities of modern GPUs can reduce the wall time required to run value iteration by updating many states simultaneously. The adoption of GPU-accelerated appro…

    Submitted 19 March, 2023; originally announced March 2023.

    MSC Class: 90C40 (Primary) 90C39; 90B05 (Secondary)

    ACM Class: I.2.8; I.6.3; J.3

  40. arXiv:2302.11669  [pdf, other]

    q-bio.BM cs.IT

    RNA secondary structures: from ab initio prediction to better compression, and back

    Authors: Evarista Onokpasa, Sebastian Wild, Prudence W. H. Wong

    Abstract: In this paper, we use the biological domain knowledge incorporated into stochastic models for ab initio RNA secondary-structure prediction to improve the state of the art in joint compression of RNA sequence and structure data (Liu et al., BMC Bioinformatics, 2008). Moreover, we show that, conversely, compression ratio can serve as a cheap and robust proxy for comparing the prediction quality of d…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: paper at Data Compression Conference 2023

  41. arXiv:2302.02506  [pdf]

    cs.LG cs.AI

    Generating Dispatching Rules for the Interrupting Swap-Allowed Blocking Job Shop Problem Using Graph Neural Network and Reinforcement Learning

    Authors: Vivian W. H. Wong, Sang Hun Kim, Junyoung Park, Jinkyoo Park, Kincho H. Law

    Abstract: The interrupting swap-allowed blocking job shop problem (ISBJSSP) is a complex scheduling problem that is able to model many manufacturing planning and logistics applications realistically by addressing both the lack of storage capacity and unforeseen production interruptions. Subjected to random disruptions due to machine malfunction or maintenance, industry production settings often choose to ad…

    Submitted 28 September, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: 14 pages, 10 figures. Supplementary Material not included

  42. arXiv:2301.12540  [pdf, other]

    stat.ML cs.LG

    Implicit Regularization for Group Sparsity

    Authors: Jiangyuan Li, Thanh V. Nguyen, Chinmay Hegde, Raymond K. W. Wong

    Abstract: We study the implicit regularization of gradient descent towards structured sparsity via a novel neural reparameterization, which we call a diagonally grouped linear neural network. We show the following intriguing property of our reparameterization: gradient descent over the squared regression loss, without any explicit regularization, biases towards solutions with a group sparsity structure. In…

    Submitted 29 January, 2023; originally announced January 2023.

    Comments: accepted by ICLR 2023

  43. arXiv:2301.12360  [pdf, other]

    cs.LG cs.CR eess.SP

    ADL-ID: Adversarial Disentanglement Learning for Wireless Device Fingerprinting Temporal Domain Adaptation

    Authors: Abdurrahman Elmaghbub, Bechir Hamdaoui, Weng-Keen Wong

    Abstract: As the journey of 5G standardization is coming to an end, academia and industry have already begun to consider the sixth-generation (6G) wireless networks, with an aim to meet the service demands for the next decade. Deep learning-based RF fingerprinting (DL-RFFP) has recently been recognized as a potential solution for enabling key wireless network applications and services, such as spectrum poli…

    Submitted 28 January, 2023; originally announced January 2023.

    Comments: The paper has been accepted at IEEE ICC'23 - MWN Symposium

  44. Efficient Hyperdimensional Computing

    Authors: Zhanglu Yan, Shida Wang, Kaiwen Tang, Weng-Fai Wong

    Abstract: Hyperdimensional computing (HDC) is a method to perform classification that uses binary vectors with high dimensions and the majority rule. This approach has the potential to be energy-efficient and hence deemed suitable for resource-limited platforms due to its simplicity and massive parallelism. However, in order to achieve high accuracy, HDC sometimes uses hypervectors with tens of thousands of…

    Submitted 12 October, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Journal ref: ECML PKDD 2023: Machine Learning and Knowledge Discovery in Databases: Research Track pp 141-155

  45. arXiv:2212.05925  [pdf, other]

    stat.ML cs.LG

    CausalEGM: a general causal inference framework by encoding generative modeling

    Authors: Qiao Liu, Zhongren Chen, Wing Hung Wong

    Abstract: Although understanding and characterizing causal effects have become essential in observational studies, it is challenging when the confounders are high-dimensional. In this article, we develop a general framework $\textit{CausalEGM}$ for estimating causal effects by encoding generative modeling, which can be applied in both binary and continuous treatment settings. Under the potential outcome fra…

    Submitted 16 March, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

  46. arXiv:2211.08410  [pdf, other]

    cs.NE cs.LG

    Low Latency Conversion of Artificial Neural Network Models to Rate-encoded Spiking Neural Networks

    Authors: Zhanglu Yan, Jun Zhou, Weng-Fai Wong

    Abstract: Spiking neural networks (SNNs) are well suited for resource-constrained applications as they do not need expensive multipliers. In a typical rate-encoded SNN, a series of binary spikes within a globally fixed time window is used to fire the neurons. The maximum number of spikes in this time window is also the latency of the network in performing a single inference, as well as determines the overal…

    Submitted 27 October, 2022; originally announced November 2022.

  47. arXiv:2211.05412  [pdf, other]

    cs.NE cs.AI cs.ET

    Desire Backpropagation: A Lightweight Training Algorithm for Multi-Layer Spiking Neural Networks based on Spike-Timing-Dependent Plasticity

    Authors: Daniel Gerlinghoff, Tao Luo, Rick Siow Mong Goh, Weng-Fai Wong

    Abstract: Spiking neural networks (SNNs) are a viable alternative to conventional artificial neural networks when resource efficiency and computational complexity are of importance. A major advantage of SNNs is their binary information transfer through spike trains which eliminates multiplication operations. The training of SNNs has, however, been a challenge, since neuron models are non-differentiable and…

    Submitted 17 October, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

  48. arXiv:2211.05276  [pdf, other]

    cs.AR cs.ET cs.LG

    PhotoFourier: A Photonic Joint Transform Correlator-Based Neural Network Accelerator

    Authors: Shurui Li, Hangbo Yang, Chee Wei Wong, Volker J. Sorger, Puneet Gupta

    Abstract: The last few years have seen a lot of work to address the challenge of low-latency and high-throughput convolutional neural network inference. Integrated photonics has the potential to dramatically accelerate neural networks because of its low-latency nature. Combined with the concept of Joint Transform Correlator (JTC), the computationally expensive convolution functions can be computed instantan…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 12 pages, 13 figures, accepted in HPCA 2023

  49. arXiv:2210.11923  [pdf, other]

    cs.CR eess.SY

    RollBack: A New Time-Agnostic Replay Attack Against the Automotive Remote Keyless Entry Systems

    Authors: Levente Csikor, Hoon Wei Lim, Jun Wen Wong, Soundarya Ramesh, Rohini Poolat Parameswarath, Mun Choon Chan

    Abstract: Today's RKE systems implement disposable rolling codes, making every key fob button press unique, effectively preventing simple replay attacks. However, a prior attack called RollJam was proven to break all rolling code-based systems in general. By a careful sequence of signal jamming, capturing, and replaying, an attacker can become aware of the subsequent valid unlock signal that has not been us…

    Submitted 14 September, 2022; originally announced October 2022.

    Comments: 24 pages, 5 figures Under submission to a journal

    Journal ref: ACM Transactions on Cyber-Physical Systems, 2024

  50. arXiv:2210.11711  [pdf, ps, other]

    cs.CL cs.AI

    Modelling Multi-relations for Convolutional-based Knowledge Graph Embedding

    Authors: Sirui Li, Kok Wai Wong, Dengya Zhu, Chun Che Fung

    Abstract: Representation learning of knowledge graphs aims to embed entities and relations into low-dimensional vectors. Most existing works only consider the direct relations or paths between an entity pair. It is considered that such approaches disconnect the semantic connection of multi-relations between an entity pair, and we propose a convolutional and multi-relational representation learning model, Co…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 26th International Conference KES2022