Zum Hauptinhalt springen

Showing 1–50 of 61 results for author: Lo, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.19651  [pdf, other

    cs.CV cs.LG cs.MM

    ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck

    Authors: Chia-Hao Kao, Cheng Chien, Yu-Jen Tseng, Yi-Hsin Chen, Alessandro Gnutti, Shao-Yuan Lo, Wen-Hsiao Peng, Riccardo Leonardi

    Abstract: This paper presents the first-ever study of adapting compressed image latents to suit the needs of downstream vision tasks that adopt Multimodal Large Language Models (MLLMs). MLLMs have extended the success of large language models to modalities (e.g. images) beyond text, but their billion scale hinders deployment on resource-constrained end devices. While cloud-hosted MLLMs could be available, t… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  2. arXiv:2407.13386  [pdf, other

    cs.CR

    Time Synchronization of TESLA-enabled GNSS Receivers

    Authors: Jason Anderson, Sherman Lo, Todd Walter

    Abstract: As TESLA-enabled GNSS for authenticated positioning reaches ubiquity, receivers must use an onboard, GNSS-independent clock and carefully constructed time synchronization algorithms to assert the authenticity afforded. This work provides the necessary checks and synchronization protocols needed in the broadcast-only GNSS context. We provide proof of security for each of our algorithms under a dela… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 16 pages, 15 figures

  3. arXiv:2407.10299  [pdf, other

    cs.CV

    Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models

    Authors: Yuchen Yang, Kwonjoon Lee, Behzad Dariush, Yinzhi Cao, Shao-Yuan Lo

    Abstract: Video Anomaly Detection (VAD) is crucial for applications such as security surveillance and autonomous driving. However, existing VAD methods provide little rationale behind detection, hindering public trust in real-world deployments. In this paper, we approach VAD with a reasoning framework. Although Large Language Models (LLMs) have shown revolutionary reasoning ability, we find that their direc… ▽ More

    Submitted 20 July, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: Accepted at European Conference on Computer Vision (ECCV) 2024

  4. arXiv:2405.20305  [pdf, other

    cs.CV

    Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models

    Authors: Himangi Mittal, Nakul Agarwal, Shao-Yuan Lo, Kwonjoon Lee

    Abstract: We introduce PlausiVL, a large video-language model for anticipating action sequences that are plausible in the real-world. While significant efforts have been made towards anticipating future actions, prior approaches do not take into account the aspect of plausibility in an action sequence. To address this limitation, we explore the generative capability of a large video-language model in our wo… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  5. arXiv:2405.19413  [pdf, other

    cs.CV cs.AI

    VisTA-SR: Improving the Accuracy and Resolution of Low-Cost Thermal Imaging Cameras for Agriculture

    Authors: Heesup Yun, Sassoum Lo, Christine H. Diepenbrock, Brian N. Bailey, J. Mason Earles

    Abstract: Thermal cameras are an important tool for agricultural research because they allow for non-invasive measurement of plant temperature, which relates to important photochemical, hydraulic, and agronomic traits. Utilizing low-cost thermal cameras can lower the barrier to introducing thermal imaging in agricultural research and production. This paper presents an approach to improve the temperature acc… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.11708  [pdf, other

    cs.LG cs.CV

    Adaptive Batch Normalization Networks for Adversarial Robustness

    Authors: Shao-Yuan Lo, Vishal M. Patel

    Abstract: Deep networks are vulnerable to adversarial examples. Adversarial Training (AT) has been a standard foundation of modern adversarial defense approaches due to its remarkable effectiveness. However, AT is extremely time-consuming, refraining it from wide deployment in practical applications. In this paper, we aim at a non-AT defense: How to design a defense method that gets rid of AT but is still r… ▽ More

    Submitted 26 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted at IEEE International Conference on Advanced Video and Signal-based Surveillance (AVSS) 2024

  7. arXiv:2405.10467  [pdf, other

    cs.AI cs.SE

    Agent Design Pattern Catalogue: A Collection of Architectural Patterns for Foundation Model based Agents

    Authors: Yue Liu, Sin Kit Lo, Qinghua Lu, Liming Zhu, Dehai Zhao, Xiwei Xu, Stefan Harrer, Jon Whittle

    Abstract: Foundation model-enabled generative artificial intelligence facilitates the development and implementation of agents, which can leverage distinguished reasoning and language processing capabilities to takes a proactive, autonomous role to pursue users' goals. Nevertheless, there is a lack of systematic knowledge to guide practitioners in designing the agents considering challenges of goal-seeking… ▽ More

    Submitted 24 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  8. arXiv:2404.09290  [pdf, other

    cs.CV eess.IV

    RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion

    Authors: Kyle Shih-Huang Lo, Jörg Peters, Eric Spellman

    Abstract: Accurate completion and denoising of roof height maps are crucial to reconstructing high-quality 3D buildings. Repairing sparse points can enhance low-cost sensor use and reduce UAV flight overlap. RoofDiffusion is a new end-to-end self-supervised diffusion technique for robustly completing, in particular difficult, roof height maps. RoofDiffusion leverages widely-available curated footprints and… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  9. arXiv:2404.05583  [pdf, other

    cs.CV

    Towards More General Video-based Deepfake Detection through Facial Feature Guided Adaptation for Foundation Model

    Authors: Yue-Hua Han, Tai-Ming Huang, Shu-Tzu Lo, Po-Han Huang, Kai-Lung Hua, Jun-Cheng Chen

    Abstract: With the rise of deep learning, generative models have enabled the creation of highly realistic synthetic images, presenting challenges due to their potential misuse. While research in Deepfake detection has grown rapidly in response, many detection methods struggle with unseen Deepfakes generated by new synthesis techniques. To address this generalisation challenge, we propose a novel Deepfake de… ▽ More

    Submitted 5 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  10. arXiv:2402.07104  [pdf

    cs.HC

    The Aleph & Other Metaphors for Image Generation

    Authors: Gonzalo Ramos, Rick Barraza, Victor Dibia, Sharon Lo

    Abstract: In this position paper, we reflect on fictional stories dealing with the infinite and how they connect with the current, fast-evolving field of image generation models. We draw attention to how some of these literary constructs can serve as powerful metaphors for guiding human-centered design and technical thinking in the space of these emerging technologies and the experiences we build around the… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  11. arXiv:2306.08056  [pdf, other

    cs.CR cs.AI cs.SE

    Distributed Trust Through the Lens of Software Architecture

    Authors: Sin Kit Lo, Yue Liu, Guangsheng Yu, Qinghua Lu, Xiwei Xu, Liming Zhu

    Abstract: Distributed trust is a nebulous concept that has evolved from different perspectives in recent years. While one can attribute its current prominence to blockchain and cryptocurrency, the distributed trust concept has been cultivating progress in federated learning, trustworthy and responsible AI in an ecosystem setting, data sharing, privacy issues across organizational boundaries, and zero trust… ▽ More

    Submitted 25 May, 2023; originally announced June 2023.

  12. arXiv:2305.12292  [pdf, other

    cs.LG math.OC stat.ML

    Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet

    Abstract: Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible. Unfortunately, existing methods for matrix completion are heuristics that, while highly scalable and often identifying high-quality solutions, do not possess any optimality guarantees. We reexamine matrix completion with an optimality-oriented eye. We… ▽ More

    Submitted 26 January, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Updated version with new numerics showcasing relaxation for rank k>1

  13. arXiv:2303.14361  [pdf, other

    cs.CV

    Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free Domain Adaptation for Video Semantic Segmentation

    Authors: Shao-Yuan Lo, Poojan Oza, Sumanth Chennupati, Alejandro Galindo, Vishal M. Patel

    Abstract: Unsupervised Domain Adaptation (UDA) of semantic segmentation transfers labeled source knowledge to an unlabeled target domain by relying on accessing both the source and target data. However, the access to source data is often restricted or infeasible in real-world scenarios. Under the source data restrictive circumstances, UDA is less practical. To address this, recent works have explored soluti… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: Accepted at IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023

  14. arXiv:2302.13172  [pdf

    eess.IV cs.CV

    Deep Learning-based Multi-Organ CT Segmentation with Adversarial Data Augmentation

    Authors: Shaoyan Pan, Shao-Yuan Lo, Min Huang, Chaoqiong Ma, Jacob Wynne, Tonghe Wang, Tian Liu, Xiaofeng Yang

    Abstract: In this work, we propose an adversarial attack-based data augmentation method to improve the deep-learning-based segmentation algorithm for the delineation of Organs-At-Risk (OAR) in abdominal Computed Tomography (CT) to facilitate radiation therapy. We introduce Adversarial Feature Attack for Medical Image (AFA-MI) augmentation, which forces the segmentation network to learn out-of-distribution s… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted at SPIE Medical Imaging 2023

  15. arXiv:2211.08105  [pdf, other

    math.CO cs.DM

    Few hamiltonian cycles in graphs with one or two vertex degrees

    Authors: Jan Goedgebeur, Jorik Jooken, On-Hei Solomon Lo, Ben Seamone, Carol T. Zamfirescu

    Abstract: We fully disprove a conjecture of Haythorpe on the minimum number of hamiltonian cycles in regular hamiltonian graphs, thereby extending a result of Zamfirescu, as well as correct and complement Haythorpe's computational enumerative results from [Experim. Math. 27 (2018) 426-430]. Thereafter, we use the Lovász Local Lemma to extend Thomassen's independent dominating set method. Regarding the limit… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  16. arXiv:2208.00160  [pdf, other

    cs.CV

    Learning Feature Decomposition for Domain Adaptive Monocular Depth Estimation

    Authors: Shao-Yuan Lo, Wei Wang, Jim Thomas, Jingjing Zheng, Vishal M. Patel, Cheng-Hao Kuo

    Abstract: Monocular depth estimation (MDE) has attracted intense study due to its low cost and critical functions for robotic tasks such as localization, mapping and obstacle detection. Supervised approaches have led to great success with the advance of deep learning, but they rely on large quantities of ground-truth depth annotations that are expensive to acquire. Unsupervised domain adaptation (UDA) trans… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: Accepted at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2022

  17. arXiv:2207.05138  [pdf, other

    eess.SY cs.AI eess.SP

    Towards Personalized Healthcare in Cardiac Population: The Development of a Wearable ECG Monitoring System, an ECG Lossy Compression Schema, and a ResNet-Based AF Detector

    Authors: Wei-Ying Yi, Peng-Fei Liu, Sheung-Lai Lo, Ya-Fen Chan, Yu Zhou, Yee Leung, Kam-Sang Woo, Alex Pui-Wai Lee, Jia-Min Chen, Kwong-Sak Leung

    Abstract: Cardiovascular diseases (CVDs) are the number one cause of death worldwide. While there is growing evidence that the atrial fibrillation (AF) has strong associations with various CVDs, this heart arrhythmia is usually diagnosed using electrocardiography (ECG) which is a risk-free, non-intrusive, and cost-efficient tool. Continuously and remotely monitoring the subjects' ECG information unlocks the… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  18. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  19. arXiv:2204.13291  [pdf, other

    cs.LG cs.SE

    Decision Models for Selecting Federated Learning Architecture Patterns

    Authors: Sin Kit Lo, Qinghua Lu, Hye-Young Paik, Liming Zhu

    Abstract: Federated machine learning is growing fast in academia and industries as a solution to solve data hungriness and privacy issues in machine learning. Being a widely distributed system, federated machine learning requires various system design thinking. To better design a federated machine learning system, researchers have introduced multiple patterns and tactics that cover various system design asp… ▽ More

    Submitted 27 April, 2023; v1 submitted 28 April, 2022; originally announced April 2022.

  20. arXiv:2202.09300  [pdf, other

    cs.CV cs.LG

    Exploring Adversarially Robust Training for Unsupervised Domain Adaptation

    Authors: Shao-Yuan Lo, Vishal M. Patel

    Abstract: Unsupervised Domain Adaptation (UDA) methods aim to transfer knowledge from a labeled source domain to an unlabeled target domain. UDA has been extensively studied in the computer vision literature. Deep networks have been shown to be vulnerable to adversarial attacks. However, very little focus is devoted to improving the adversarial robustness of deep UDA models, causing serious concerns about m… ▽ More

    Submitted 4 October, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: Accepted at Asian Conference on Computer Vision (ACCV) 2022

  21. ADAM Challenge: Detecting Age-related Macular Degeneration from Fundus Images

    Authors: Huihui Fang, Fei Li, Huazhu Fu, Xu Sun, Xingxing Cao, Fengbin Lin, Jaemin Son, Sunho Kim, Gwenole Quellec, Sarah Matta, Sharath M Shankaranarayana, Yi-Ting Chen, Chuen-heng Wang, Nisarg A. Shah, Chia-Yen Lee, Chih-Chung Hsu, Hai Xie, Baiying Lei, Ujjwal Baid, Shubham Innani, Kang Dang, Wenxiu Shi, Ravi Kamble, Nitin Singhal, Ching-Wei Wang , et al. (6 additional authors not shown)

    Abstract: Age-related macular degeneration (AMD) is the leading cause of visual impairment among elderly in the world. Early detection of AMD is of great importance, as the vision loss caused by this disease is irreversible and permanent. Color fundus photography is the most cost-effective imaging modality to screen for retinal disorders. Cutting edge deep learning based algorithms have been recently develo… ▽ More

    Submitted 6 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: 31 pages, 17 figures

  22. arXiv:2112.02997  [pdf, other

    cs.CL cs.LG

    Language Semantics Interpretation with an Interaction-based Recurrent Neural Networks

    Authors: Shaw-Hwa Lo, Yiqiao Yin

    Abstract: Text classification is a fundamental language task in Natural Language Processing. A variety of sequential models is capable making good predictions yet there is lack of connection between language semantics and prediction results. This paper proposes a novel influence score (I-score), a greedy search algorithm called Backward Dropping Algorithm (BDA), and a novel feature engineering technique cal… ▽ More

    Submitted 1 November, 2021; originally announced December 2021.

  23. arXiv:2111.04096  [pdf, other

    cs.RO cs.CV

    Online Mutual Adaptation of Deep Depth Prediction and Visual SLAM

    Authors: Shing Yan Loo, Moein Shakeri, Sai Hong Tang, Syamsiah Mashohor, Hong Zhang

    Abstract: The ability of accurate depth prediction by a convolutional neural network (CNN) is a major challenge for its wide use in practical visual simultaneous localization and mapping (SLAM) applications, such as enhanced camera tracking and dense mapping. This paper is set out to answer the following question: Can we tune a depth prediction CNN with the help of a visual SLAM algorithm even if the CNN is… ▽ More

    Submitted 1 February, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: 11 pages, 6 figures

  24. arXiv:2108.11168  [pdf, other

    cs.CV

    Adversarially Robust One-class Novelty Detection

    Authors: Shao-Yuan Lo, Poojan Oza, Vishal M. Patel

    Abstract: One-class novelty detectors are trained with examples of a particular class and are tasked with identifying whether a query example belongs to the same known class. Most recent advances adopt a deep auto-encoder style architecture to compute novelty scores for detecting novel class data. Deep networks have shown to be vulnerable to adversarial attacks, yet little focus is devoted to studying the a… ▽ More

    Submitted 3 October, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), 2022

  25. arXiv:2108.06912  [pdf, other

    cs.LG cs.AI

    Blockchain-based Trustworthy Federated Learning Architecture

    Authors: Sin Kit Lo, Yue Liu, Qinghua Lu, Chen Wang, Xiwei Xu, Hye-Young Paik, Liming Zhu

    Abstract: Federated learning is an emerging privacy-preserving AI technique where clients (i.e., organisations or devices) train models locally and formulate a global model based on the local model updates without transferring local data externally. However, federated learning systems struggle to achieve trustworthiness and embody responsible AI principles. In particular, federated learning systems face acc… ▽ More

    Submitted 28 October, 2021; v1 submitted 16 August, 2021; originally announced August 2021.

  26. arXiv:2106.11570  [pdf, other

    cs.LG cs.DC cs.SE

    FLRA: A Reference Architecture for Federated Learning Systems

    Authors: Sin Kit Lo, Qinghua Lu, Hye-Young Paik, Liming Zhu

    Abstract: Federated learning is an emerging machine learning paradigm that enables multiple devices to train models locally and formulate a global model, without sharing the clients' local data. A federated learning system can be viewed as a large-scale distributed system, involving different components and stakeholders with diverse requirements and constraints. Hence, developing a federated learning system… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: Accepted by ECSA 2021

  27. An Interaction-based Convolutional Neural Network (ICNN) Towards Better Understanding of COVID-19 X-ray Images

    Authors: Shaw-Hwa Lo, Yiqiao Yin

    Abstract: The field of Explainable Artificial Intelligence (XAI) aims to build explainable and interpretable machine learning (or deep learning) methods without sacrificing prediction performance. Convolutional Neural Networks (CNNs) have been successful in making predictions, especially in image classification. However, these famous deep learning models use tens of millions of parameters based on a large n… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

    Report number: 337

    Journal ref: Algorithms 2021

  28. arXiv:2104.12672  [pdf, other

    cs.LG cs.CV stat.AP

    A Novel Interaction-based Methodology Towards Explainable AI with Better Understanding of Pneumonia Chest X-ray Images

    Authors: Shaw-Hwa Lo, Yiqiao Yin

    Abstract: In the field of eXplainable AI (XAI), robust "blackbox" algorithms such as Convolutional Neural Networks (CNNs) are known for making high prediction performance. However, the ability to explain and interpret these algorithms still require innovation in the understanding of influential and, more importantly, explainable features that directly or indirectly impact the performance of predictivity. A… ▽ More

    Submitted 15 June, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Journal ref: Algorithms 2021, 14(11), 337

  29. arXiv:2103.05927  [pdf

    cs.CV cs.AI cs.CY

    Deep Sensing of Urban Waterlogging

    Authors: Shi-Wei Lo, Jyh-Horng Wu, Jo-Yu Chang, Chien-Hao Tseng, Meng-Wei Lin, Fang-Pang Lin

    Abstract: In the monsoon season, sudden flood events occur frequently in urban areas, which hamper the social and economic activities and may threaten the infrastructure and lives. The use of an efficient large-scale waterlogging sensing and information system can provide valuable real-time disaster information to facilitate disaster management and enhance awareness of the general public to alleviate losses… ▽ More

    Submitted 15 August, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: 19 pages, 14 figures, under submitting and patenting

    Report number: revise-2021-05-25

  30. arXiv:2102.05212  [pdf, other

    cs.CV cs.RO

    Polarimetric Monocular Dense Mapping Using Relative Deep Depth Prior

    Authors: Moein Shakeri, Shing Yan Loo, Hong Zhang

    Abstract: This paper is concerned with polarimetric dense map reconstruction based on a polarization camera with the help of relative depth information as a prior. In general, polarization imaging is able to reveal information about surface normal such as azimuth and zenith angles, which can support the development of solutions to the problem of dense reconstruction, especially in texture-poor regions. Howe… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 9 pages, 9 figure

  31. arXiv:2101.09451  [pdf, other

    cs.CV cs.CR cs.LG eess.IV

    Error Diffusion Halftoning Against Adversarial Examples

    Authors: Shao-Yuan Lo, Vishal M. Patel

    Abstract: Adversarial examples contain carefully crafted perturbations that can fool deep neural networks (DNNs) into making wrong predictions. Enhancing the adversarial robustness of DNNs has gained considerable interest in recent years. Although image transformation-based defenses were widely considered at an earlier time, most of them have been defeated by adaptive attacks. In this paper, we propose a ne… ▽ More

    Submitted 24 July, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

    Comments: Accepted at IEEE International Conference on Image Processing (ICIP) 2021

  32. arXiv:2101.02373  [pdf, other

    cs.LG cs.DC cs.SE

    Architectural Patterns for the Design of Federated Learning Systems

    Authors: Sin Kit Lo, Qinghua Lu, Liming Zhu, Hye-young Paik, Xiwei Xu, Chen Wang

    Abstract: Federated learning has received fast-growing interests from academia and industry to tackle the challenges of data hungriness and privacy in machine learning. A federated learning system can be viewed as a large-scale distributed system with different components and stakeholders as numerous client devices participate in federated learning. Designing a federated learning system requires software sy… ▽ More

    Submitted 18 June, 2021; v1 submitted 7 January, 2021; originally announced January 2021.

    Comments: Resubmitted after minor revision to Elsevier's Journal of Systems and Software, Special issue on Software Architecture and Artificial Intelligence

  33. arXiv:2012.04262  [pdf, other

    cs.CV cs.LG eess.IV

    Overcomplete Representations Against Adversarial Videos

    Authors: Shao-Yuan Lo, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Adversarial robustness of deep neural networks is an extensively studied problem in the literature and various methods have been proposed to defend against adversarial images. However, only a handful of defense methods have been developed for defending against attacked videos. In this paper, we propose a novel Over-and-Under complete restoration network for Defending against adversarial videos (OU… ▽ More

    Submitted 14 June, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted at IEEE International Conference on Image Processing (ICIP) 2021

  34. arXiv:2009.10401  [pdf, other

    cs.DC cs.LG

    Dynamic Fusion based Federated Learning for COVID-19 Detection

    Authors: Weishan Zhang, Tao Zhou, Qinghua Lu, Xiao Wang, Chunsheng Zhu, Haoyun Sun, Zhipeng Wang, Sin Kit Lo, Fei-Yue Wang

    Abstract: Medical diagnostic image analysis (e.g., CT scan or X-Ray) using machine learning is an efficient and accurate way to detect COVID-19 infections. However, sharing diagnostic images across medical institutions is usually not allowed due to the concern of patients' privacy. This causes the issue of insufficient datasets for training the image classification model. Federated learning is an emerging p… ▽ More

    Submitted 25 October, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

  35. arXiv:2009.08058  [pdf, other

    cs.LG cs.CV stat.ML

    MultAV: Multiplicative Adversarial Videos

    Authors: Shao-Yuan Lo, Vishal M. Patel

    Abstract: The majority of adversarial machine learning research focuses on additive attacks, which add adversarial perturbation to input data. On the other hand, unlike image recognition problems, only a handful of attack approaches have been explored in the video domain. In this paper, we propose a novel attack method against video recognition models, Multiplicative Adversarial Videos (MultAV), which impos… ▽ More

    Submitted 10 October, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted at IEEE International Conference on Advanced Video and Signal-based Surveillance (AVSS) 2021

  36. arXiv:2009.05244  [pdf, other

    cs.LG cs.CV stat.ML

    Defending Against Multiple and Unforeseen Adversarial Videos

    Authors: Shao-Yuan Lo, Vishal M. Patel

    Abstract: Adversarial robustness of deep neural networks has been actively investigated. However, most existing defense approaches are limited to a specific type of adversarial perturbations. Specifically, they often fail to offer resistance to multiple attack types simultaneously, i.e., they lack multi-perturbation robustness. Furthermore, compared to image recognition problems, the adversarial robustness… ▽ More

    Submitted 14 December, 2021; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: Accepted in IEEE Transactions on Image Processing (TIP)

  37. arXiv:2009.02643  [pdf, other

    cs.DC cs.CR cs.SE

    Blockchain-based Federated Learning for Device Failure Detection in Industrial IoT

    Authors: Weishan Zhang, Qinghua Lu, Qiuyu Yu, Zhaotong Li, Yue Liu, Sin Kit Lo, Shiping Chen, Xiwei Xu, Liming Zhu

    Abstract: Device failure detection is one of most essential problems in industrial internet of things (IIoT). However, in conventional IIoT device failure detection, client devices need to upload raw data to the central server for model training, which might lead to disclosure of sensitive business data. Therefore, in this paper, to ensure client data privacy, we propose a blockchain-based federated learnin… ▽ More

    Submitted 18 October, 2020; v1 submitted 6 September, 2020; originally announced September 2020.

    Comments: accepted by IEEE Internet of Things Journal

  38. arXiv:2007.11354  [pdf, other

    cs.SE cs.DC cs.LG

    A Systematic Literature Review on Federated Machine Learning: From A Software Engineering Perspective

    Authors: Sin Kit Lo, Qinghua Lu, Chen Wang, Hye-Young Paik, Liming Zhu

    Abstract: Federated learning is an emerging machine learning paradigm where clients train models locally and formulate a global model based on the local model updates. To identify the state-of-the-art in federated learning and explore how to develop federated learning systems, we perform a systematic literature review from a software engineering perspective, based on 231 primary studies. Our data synthesis… ▽ More

    Submitted 28 May, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: Published on ACM Computing Survey. Latest version available here: https://dl.acm.org/doi/10.1145/3450288

    Journal ref: ACM.CSUR.54.95 (2021) 1-39

  39. arXiv:2006.04047  [pdf, ps, other

    cs.CV cs.RO

    DeepRelativeFusion: Dense Monocular SLAM using Single-Image Relative Depth Prediction

    Authors: Shing Yan Loo, Syamsiah Mashohor, Sai Hong Tang, Hong Zhang

    Abstract: In this paper, we propose a dense monocular SLAM system, named DeepRelativeFusion, that is capable to recover a globally consistent 3D structure. To this end, we use a visual SLAM algorithm to reliably recover the camera poses and semi-dense depth maps of the keyframes, and then use relative depth prediction to densify the semi-dense depth maps and refine the keyframe pose-graph. To improve the se… ▽ More

    Submitted 9 July, 2021; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: Accepted to be published in the Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021)

  40. arXiv:1907.10015  [pdf

    cs.CV

    Exploring Semantic Segmentation on the DCT Representation

    Authors: Shao-Yuan Lo, Hsueh-Ming Hang

    Abstract: Typical convolutional networks are trained and conducted on RGB images. However, images are often compressed for memory savings and efficient transmission in real-world applications. In this paper, we explore methods for performing semantic segmentation on the discrete cosine transform (DCT) representation defined by the JPEG standard. We first rearrange the DCT coefficients to form a preferred in… ▽ More

    Submitted 29 December, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: Accepted in ACM International Conference on Multimedia in Asia (MMAsia) 2019

  41. arXiv:1907.09438  [pdf

    cs.CV

    Multi-Class Lane Semantic Segmentation using Efficient Convolutional Networks

    Authors: Shao-Yuan Lo, Hsueh-Ming Hang, Sheng-Wei Chan, Jing-Jhih Lin

    Abstract: Lane detection plays an important role in a self-driving vehicle. Several studies leverage a semantic segmentation network to extract robust lane features, but few of them can distinguish different types of lanes. In this paper, we focus on the problem of multi-class lane semantic segmentation. Based on the observation that the lane is a small-size and narrow-width object in a road scene image, we… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: Accepted in IEEE International Workshop on Multimedia Signal Processing (MMSP) 2019

  42. arXiv:1905.07542  [pdf, other

    cs.CV cs.AI cs.RO

    Semi-Supervised Monocular Depth Estimation with Left-Right Consistency Using Deep Neural Network

    Authors: Ali Jahani Amiri, Shing Yan Loo, Hong Zhang

    Abstract: There has been tremendous research progress in estimating the depth of a scene from a monocular camera image. Existing methods for single-image depth prediction are exclusively based on deep neural networks, and their training can be unsupervised using stereo image pairs, supervised using LiDAR point clouds, or semi-supervised using both stereo and LiDAR. In general, semi-supervised training is pr… ▽ More

    Submitted 18 May, 2019; originally announced May 2019.

    Comments: Submitted to IROS2019

  43. arXiv:1902.06484  [pdf, other

    math.CO cs.DM

    Find Subtrees of Specified Weight and Cycles of Specified Length in Linear Time

    Authors: On-Hei Solomon Lo

    Abstract: We apply the Euler tour technique to find subtrees of specified weight as follows. Let $k, g, N_1, N_2 \in \mathbb{N}$ such that $1 \leq k \leq N_2$, $g + h > 2$ and $2k - 4g - h + 3 \leq N_2 \leq 2k + g + h - 2$, where $h := 2N_1 - N_2$. Let $T$ be a tree of $N_1$ vertices and let $c : V(T) \rightarrow \mathbb{N}$ be vertex weights such that $c(T) := \sum_{v \in V(T)} c(v) = N_2$ and… ▽ More

    Submitted 12 October, 2019; v1 submitted 18 February, 2019; originally announced February 2019.

  44. arXiv:1810.03865  [pdf, other

    math.CO cs.DM

    Compact Cactus Representations of all Non-Trivial Min-Cuts

    Authors: On-Hei Solomon Lo, Jens M. Schmidt, Mikkel Thorup

    Abstract: Recently, Kawarabayashi and Thorup presented the first deterministic edge-connectivity recognition algorithm in near-linear time. A crucial step in their algorithm uses the existence of vertex subsets of a simple graph $G$ on $n$ vertices whose contractions leave a multigraph with $\tilde{O}(n/δ)$ vertices and $\tilde{O}(n)$ edges that preserves all non-trivial min-cuts of $G$, where $δ$ is the mi… ▽ More

    Submitted 28 October, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: 12 pages, 3 figures

  45. arXiv:1810.01011  [pdf, ps, other

    cs.CV

    CNN-SVO: Improving the Mapping in Semi-Direct Visual Odometry Using Single-Image Depth Prediction

    Authors: Shing Yan Loo, Ali Jahani Amiri, Syamsiah Mashohor, Sai Hong Tang, Hong Zhang

    Abstract: Reliable feature correspondence between frames is a critical step in visual odometry (VO) and visual simultaneous localization and mapping (V-SLAM) algorithms. In comparison with existing VO and V-SLAM algorithms, semi-direct visual odometry (SVO) has two main advantages that lead to state-of-the-art frame rate camera motion estimation: direct pixel correspondence and efficient implementation of p… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

    Comments: 6 pages, 5 figures, submitted to ICRA 2019

  46. arXiv:1809.09077  [pdf

    cs.CV

    Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation

    Authors: Shang-Wei Hung, Shao-Yuan Lo, Hsueh-Ming Hang

    Abstract: Semantic segmentation has made encouraging progress due to the success of deep convolutional networks in recent years. Meanwhile, depth sensors become prevalent nowadays, so depth maps can be acquired more easily. However, there are few studies that focus on the RGB-D semantic segmentation task. Exploiting the depth information effectiveness to improve performance is a challenge. In this paper, we… ▽ More

    Submitted 19 May, 2019; v1 submitted 24 September, 2018; originally announced September 2018.

    Comments: Accepted in IEEE International Conference on Image Processing (ICIP) 2019

  47. arXiv:1809.06323  [pdf

    cs.CV

    Efficient Dense Modules of Asymmetric Convolution for Real-Time Semantic Segmentation

    Authors: Shao-Yuan Lo, Hsueh-Ming Hang, Sheng-Wei Chan, Jing-Jhih Lin

    Abstract: Real-time semantic segmentation plays an important role in practical applications such as self-driving and robots. Most semantic segmentation research focuses on improving estimation accuracy with little consideration on efficiency. Several previous studies that emphasize high-speed inference often fail to produce high-accuracy segmentation results. In this paper, we propose a novel convolutional… ▽ More

    Submitted 28 December, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

    Comments: Accepted in ACM International Conference on Multimedia in Asia (MMAsia) 2019 [Best Paper Award]

  48. arXiv:1809.03994  [pdf

    cs.CV

    Efficient Road Lane Marking Detection with Deep Learning

    Authors: Ping-Rong Chen, Shao-Yuan Lo, Hsueh-Ming Hang, Sheng-Wei Chan, Jing-Jhih Lin

    Abstract: Lane mark detection is an important element in the road scene analysis for Advanced Driver Assistant System (ADAS). Limited by the onboard computing power, it is still a challenge to reduce system complexity and maintain high accuracy at the same time. In this paper, we propose a Lane Marking Detector (LMD) using a deep convolutional neural network to extract robust lane marking features. To impro… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: Accepted at International Conference on Digital Signal Processing (DSP) 2018

  49. arXiv:1808.01280  [pdf

    cs.NE cs.LG cs.SI stat.ML

    Geared Rotationally Identical and Invariant Convolutional Neural Network Systems

    Authors: ShihChung B. Lo, Ph. D., Matthew T. Freedman, M. D., Seong K. Mun, Ph. D., Heang-Ping Chan, Ph. D

    Abstract: Theorems and techniques to form different types of transformationally invariant processing and to produce the same output quantitatively based on either transformationally invariant operators or symmetric operations have recently been introduced by the authors. In this study, we further propose to compose a geared rotationally identical CNN system (GRI-CNN) with a small step angle by connecting ne… ▽ More

    Submitted 10 August, 2018; v1 submitted 2 August, 2018; originally announced August 2018.

    Comments: 14 pages, 6 figures, 8 tables

  50. arXiv:1807.11156  [pdf

    cs.CV cs.LG stat.ML

    Transformationally Identical and Invariant Convolutional Neural Networks by Combining Symmetric Operations or Input Vectors

    Authors: ShihChung B. Lo, Matthew T. Freedman, Seong K. Mun

    Abstract: Transformationally invariant processors constructed by transformed input vectors or operators have been suggested and applied to many applications. In this study, transformationally identical processing based on combining results of all sub-processes with corresponding transformations at one of the processing steps or at the beginning step were found to be equivalent for a given condition. This pr… ▽ More

    Submitted 20 August, 2018; v1 submitted 29 July, 2018; originally announced July 2018.

    Comments: 9 pages, 3 figures