Zum Hauptinhalt springen

Showing 1–50 of 71 results for author: Ng, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11871  [pdf, other

    cs.CL cs.AI

    MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models

    Authors: Lionel Z. Wang, Yiming Ma, Renfei Gao, Beichen Guo, Zhuoran Li, Han Zhu, Wenqi Fan, Zexin Lu, Ka Chung Ng

    Abstract: The advent of large language models (LLMs) has revolutionized online content creation, making it much easier to generate high-quality fake news. This misuse threatens the integrity of our digital environment and ethical standards. Therefore, understanding the motivations and mechanisms behind LLM-generated fake news is crucial. In this study, we analyze the creation of fake news from a social psyc… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  2. arXiv:2408.02709  [pdf, other

    cs.SE cs.AI

    Enhancing Medical Learning and Reasoning Systems: A Boxology-Based Comparative Analysis of Design Patterns

    Authors: Chi Him Ng

    Abstract: This study analyzes hybrid AI systems' design patterns and their effectiveness in clinical decision-making using the boxology framework. It categorizes and copares various architectures combining machine learning and rule-based reasoning to provide insights into their structural foundations and healthcare applications. Addressing two main questions, how to categorize these systems againts establis… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  3. arXiv:2407.11773  [pdf, other

    cs.CL

    Educational Personalized Learning Path Planning with Large Language Models

    Authors: Chee Ng, Yuen Fung

    Abstract: Educational Personalized Learning Path Planning (PLPP) aims to tailor learning experiences to individual learners' needs, enhancing learning efficiency and engagement. Despite its potential, traditional PLPP systems often lack adaptability, interactivity, and transparency. This paper proposes a novel approach integrating Large Language Models (LLMs) with prompt engineering to address these challen… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 6 pages

  4. arXiv:2406.13434  [pdf, other

    cs.RO

    Tactile Aware Dynamic Obstacle Avoidance in Crowded Environment with Deep Reinforcement Learning

    Authors: Yung Chuen Ng, Qi Wen, Lim, Chun Ye Tan, Zhen Hao Gan, Meng Yee, Chuah

    Abstract: Mobile robots operating in crowded environments require the ability to navigate among humans and surrounding obstacles efficiently while adhering to safety standards and socially compliant mannerisms. This scale of the robot navigation problem may be classified as both a local path planning and trajectory optimization problem. This work presents an array of force sensors that act as a tactile laye… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2405.11622  [pdf, other

    cs.CL cs.LG

    Continuous Predictive Modeling of Clinical Notes and ICD Codes in Patient Health Records

    Authors: Mireia Hernandez Caralt, Clarence Boon Liang Ng, Marek Rei

    Abstract: Electronic Health Records (EHR) serve as a valuable source of patient information, offering insights into medical histories, treatments, and outcomes. Previous research has developed systems for detecting applicable ICD codes that should be assigned while writing a given EHR document, mainly focusing on discharge summaries written at the end of a hospital stay. In this work, we investigate the pot… ▽ More

    Submitted 5 July, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    ACM Class: I.2.7; J.3

  6. arXiv:2405.04165  [pdf, other

    cs.CL

    LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection

    Authors: Jasraj Singh, Fang Liu, Hong Xu, Bee Chin Ng, Wei Zhang

    Abstract: Nowadays, Information spreads at an unprecedented pace in social media and discerning truth from misinformation and fake news has become an acute societal challenge. Machine learning (ML) models have been employed to identify fake news but are far from perfect with challenging problems like limited accuracy, interpretability, and generalizability. In this paper, we enhance ML-based solutions with… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 7 pages

  7. arXiv:2405.01842  [pdf, ps, other

    cs.CL

    SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore

    Authors: Ri Chi Ng, Nirmalendu Prakash, Ming Shan Hee, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

    Abstract: To address the limitations of current hate speech detection models, we introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic and cultural context of Singapore and Southeast Asia. It extends the functional testing approach of HateCheck and MHC, employing large language models for translation and paraphrasing into Singapore's main languages, and refining these with native ann… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  8. arXiv:2404.14135  [pdf, other

    cs.CV

    Text in the Dark: Extremely Low-Light Text Image Enhancement

    Authors: Che-Tsung Lin, Chun Chet Ng, Zhi Qin Tan, Wan Jun Nah, Xinyu Wang, Jie Long Kew, Pohao Hsu, Shang Hong Lai, Chee Seng Chan, Christopher Zach

    Abstract: Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text t… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: The first two authors contributed equally to this work

  9. arXiv:2404.06224  [pdf, other

    cs.CL cs.AI cs.LG

    Low-Cost Generation and Evaluation of Dictionary Example Sentences

    Authors: Bill Cai, Clarence Boon Liang Ng, Daniel Tan, Shelvia Hotama

    Abstract: Dictionary example sentences play an important role in illustrating word definitions and usage, but manually creating quality sentences is challenging. Prior works have demonstrated that language models can be trained to generate example sentences. However, they relied on costly customized models and word sense datasets for generation and evaluation of their work. Rapid advancements in foundationa… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  10. arXiv:2312.11560  [pdf, other

    cs.LG cs.AI cs.NE

    Learning from Emergence: A Study on Proactively Inhibiting the Monosemantic Neurons of Artificial Neural Networks

    Authors: Jiachuan Wang, Shimin Di, Lei Chen, Charles Wang Wai Ng

    Abstract: Recently, emergence has received widespread attention from the research community along with the success of large-scale models. Different from the literature, we hypothesize a key factor that promotes the performance during the increase of scale: the reduction of monosemantic neurons that can only form one-to-one correlations with specific features. Monosemantic neurons tend to be sparser and have… ▽ More

    Submitted 19 June, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: 16 pages, 5 figures, KDD2024

  11. arXiv:2311.15530  [pdf, other

    cs.LG cs.AI physics.ao-ph

    SSIN: Self-Supervised Learning for Rainfall Spatial Interpolation

    Authors: Jia Li, Yanyan Shen, Lei Chen, Charles Wang Wai NG

    Abstract: The acquisition of accurate rainfall distribution in space is an important task in hydrological analysis and natural disaster pre-warning. However, it is impossible to install rain gauges on every corner. Spatial interpolation is a common way to infer rainfall distribution based on available raingauge data. However, the existing works rely on some unrealistic pre-settings to capture spatial correl… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: SIGMOD 2023 Data-intensive Applications (DIA) Track; Code is available at https://github.com/jlidw/SSIN

  12. arXiv:2302.12666  [pdf, other

    cs.LG cs.AI cs.CL

    Modelling Temporal Document Sequences for Clinical ICD Coding

    Authors: Clarence Boon Liang Ng, Diogo Santos, Marek Rei

    Abstract: Past studies on the ICD coding problem focus on predicting clinical codes primarily based on the discharge summary. This covers only a small fraction of the notes generated during each hospital stay and leaves potential for improving performance by analysing all the available clinical notes. We propose a hierarchical transformer architecture that uses text across the entire sequence of clinical no… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  13. A deep-learning search for technosignatures of 820 nearby stars

    Authors: Peter Xiangyuan Ma, Cherry Ng, Leandro Rizk, Steve Croft, Andrew P. V. Siemion, Bryan Brzycki, Daniel Czech, Jamie Drew, Vishal Gajjar, John Hoang, Howard Isaacson, Matt Lebofsky, David MacMahon, Imke de Pater, Danny C. Price, Sofia Z. Sheikh, S. Pete Worden

    Abstract: The goal of the Search for Extraterrestrial Intelligence (SETI) is to quantify the prevalence of technological life beyond Earth via their "technosignatures". One theorized technosignature is narrowband Doppler drifting radio signals. The principal challenge in conducting SETI in the radio domain is developing a generalized technique to reject human radio frequency interference (RFI). Here, we pre… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: 10 pages of main paper followed by 16 pages of methods; 17 figures total and 7 tables; published in Nature Astronomy

  14. arXiv:2212.00581  [pdf

    eess.SY cs.AI

    An enhanced simulation-based multi-objective optimization approach with knowledge discovery for reconfigurable manufacturing systems

    Authors: Carlos Alberto Barrera-Diaz, Amir Nourmohammdi, Henrik Smedberg, Tehseen Aslam, Amos H. C. Ng

    Abstract: In today's uncertain and competitive market, where enterprises are subjected to increasingly shortened product life-cycles and frequent volume changes, reconfigurable manufacturing systems (RMS) applications play a significant role in the manufacturing industry's success. Despite the advantages offered by RMS, achieving a high-efficiency degree constitutes a challenging task for stakeholders and d… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  15. arXiv:2211.15322  [pdf, other

    cs.LG stat.ML

    Transductive Kernels for Gaussian Processes on Graphs

    Authors: Yin-Cong Zhi, Felix L. Opolka, Yin Cheng Ng, Pietro Liò, Xiaowen Dong

    Abstract: Kernels on graphs have had limited options for node-level problems. To address this, we present a novel, generalized kernel for graphs with node feature data for semi-supervised learning. The kernel is derived from a regularization framework by treating the graph and feature data as two Hilbert spaces. We also show how numerous kernel-based models on graphs are instances of our design. A kernel de… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  16. arXiv:2211.03057  [pdf, other

    cs.NI

    Towards Green Metaverse Networking Technologies, Advancements and Future Directions

    Authors: Siyue Zhang, Wei Yang Bryan Lim, Wei Chong Ng, Zehui Xiong, Dusit Niyato, Xuemin Sherman Shen, Chunyan Miao

    Abstract: As the Metaverse is iteratively being defined, its potential to unleash the next wave of digital disruption and create real-life value becomes increasingly clear. With distinctive features of immersive experience, simultaneous interactivity, and user agency, the Metaverse has the capability to transform all walks of life. However, the enabling technologies of the Metaverse, i.e., digital twin, art… ▽ More

    Submitted 13 April, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

  17. arXiv:2209.09508  [pdf, other

    cs.RO

    Real-time Digital Double Framework to Predict Collapsible Terrains for Legged Robots

    Authors: Garen Haddeler, Hari P. Palanivelu, Yung Chuen Ng, Fabien Colonnier, Albertus H. Adiwahono, Zhibin Li, Chee-Meng Chew, Meng Yee, Chuah

    Abstract: Inspired by the digital twinning systems, a novel real-time digital double framework is developed to enhance robot perception of the terrain conditions. Based on the very same physical model and motion control, this work exploits the use of such simulated digital double synchronized with a real robot to capture and extract discrepancy information between the two systems, which provides high dimens… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Preprint version. Accepted June 2022

  18. arXiv:2208.14661  [pdf, other

    cs.GT

    Stochastic Resource Allocation for Semantic Communication-aided Virtual Transportation Networks in the Metaverse

    Authors: Wei Chong Ng, Hongyang Du, Wei Yang Bryan Lim, Zehui Xiong, Dusit Niyato, Chunyan Miao

    Abstract: The physical-virtual world synchronization to develop the Metaverse will require a massive transmission and exchange of data. In this paper, we introduce semantic communication for the development of virtual transportation networks in the Metaverse. Leveraging the perception capabilities of edge devices, virtual service providers (VSPs) can subscribe to their preferred edge devices to receive the… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: 6 pages, 5 figures and 3 tables

  19. arXiv:2204.03724  [pdf, other

    cs.NI cs.LG eess.SP

    A Kernel Method to Nonlinear Location Estimation with RSS-based Fingerprint

    Authors: Pai Chet Ng, Petros Spachos, James She, Konstantinos N. Plataniotis

    Abstract: This paper presents a nonlinear location estimation to infer the position of a user holding a smartphone. We consider a large location with $M$ number of grid points, each grid point is labeled with a unique fingerprint consisting of the received signal strength (RSS) values measured from $N$ number of Bluetooth Low Energy (BLE) beacons. Given the fingerprint observed by the smartphone, the user's… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

  20. arXiv:2204.03195  [pdf, other

    cs.RO

    3D Perception based Imitation Learning under Limited Demonstration for Laparoscope Control in Robotic Surgery

    Authors: Bin Li, Ruofeng Wei, Jiaqi Xu, Bo Lu, Chi-Hang Yee, Chi-Fai Ng, Pheng-Ann Heng, Qi Dou, Yun-Hui Liu

    Abstract: Automatic laparoscope motion control is fundamentally important for surgeons to efficiently perform operations. However, its traditional control methods based on tool tracking without considering information hidden in surgical scenes are not intelligent enough, while the latest supervised imitation learning (IL)-based methods require expensive sensor data and suffer from distribution mismatch issu… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: 7 pages, 7 figures, 2022 IEEE International Conference on Robotics and Automation (ICRA)

  21. arXiv:2204.00630  [pdf, other

    eess.IV cs.CV

    Extremely Low-light Image Enhancement with Scene Text Restoration

    Authors: Pohao Hsu, Che-Tsung Lin, Chun Chet Ng, Jie-Long Kew, Mei Yih Tan, Shang-Hong Lai, Chee Seng Chan, Christopher Zach

    Abstract: Deep learning-based methods have made impressive progress in enhancing extremely low-light images - the image quality of the reconstructed images has generally improved. However, we found out that most of these methods could not sufficiently recover the image details, for instance, the texts in the scene. In this paper, a novel image enhancement framework is proposed to precisely restore the scene… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

  22. arXiv:2203.15405  [pdf, other

    eess.AS cs.SD

    Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations

    Authors: Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee

    Abstract: This paper presents a macroscopic approach to automatic detection of speech sound disorder (SSD) in child speech. Typically, SSD is manifested by persistent articulation and phonological errors on specific phonemes in the language. The disorder can be detected by focally analyzing the phonemes or the words elicited by the child subject. In the present study, instead of attempting to detect individ… ▽ More

    Submitted 29 June, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted to Interspeech 2022

  23. arXiv:2203.05471  [pdf, other

    cs.NI cs.CY

    A Full Dive into Realizing the Edge-enabled Metaverse: Visions, Enabling Technologies,and Challenges

    Authors: Minrui Xu, Wei Chong Ng, Wei Yang Bryan Lim, Jiawen Kang, Zehui Xiong, Dusit Niyato, Qiang Yang, Xuemin Sherman Shen, Chunyan Miao

    Abstract: Dubbed "the successor to the mobile Internet", the concept of the Metaverse has grown in popularity. While there exist lite versions of the Metaverse today, they are still far from realizing the full vision of an immersive, embodied, and interoperable Metaverse. Without addressing the issues of implementation from the communication and networking, as well as computation perspectives, the Metaverse… ▽ More

    Submitted 20 August, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

  24. arXiv:2202.11697  [pdf, other

    cs.IT

    Stochastic Coded Offloading Scheme for Unmanned Aerial Vehicle-Assisted Edge Computing

    Authors: Wei Chong Ng, Wei Yang Bryan Lim, Zehui Xiong, Dusit Niyato, Chunyan Miao, Zhu Han, Dong In Kim

    Abstract: Unmanned aerial vehicles (UAVs) have gained wide research interests due to their technological advancement and high mobility. The UAVs are equipped with increasingly advanced capabilities to run computationally intensive applications enabled by machine learning techniques. However, because of both energy and computation constraints, the UAVs face issues hovering in the sky while performing computa… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Accepted by IEEE Internet of Things Journal. 20 pages, 18 figures. arXiv admin note: text overlap with arXiv:2110.14873

  25. arXiv:2110.14873  [pdf, other

    cs.GT

    Optimal Stochastic Coded Computation Offloading in Unmanned Aerial Vehicles Network

    Authors: Wei Chong Ng, Wei Yang Bryan Lim, Jer Shyuan Ng, Suttinee Sawadsitang, Zehui Xiong, Dusit Niyato

    Abstract: Today, modern unmanned aerial vehicles (UAVs) are equipped with increasingly advanced capabilities that can run applications enabled by machine learning techniques, which require computationally intensive operations such as matrix multiplications. Due to computation constraints, the UAVs can offload their computation tasks to edge servers. To mitigate stragglers, coded distributed computing (CDC)… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: To be published in IEEE Global Communications Conference

  26. arXiv:2110.14325  [pdf, other

    cs.GT

    Unified Resource Allocation Framework for the Edge Intelligence-Enabled Metaverse

    Authors: Wei Chong Ng, Wei Yang Bryan Lim, Jer Shyuan Ng, Zehui Xiong, Dusit Niyato, Chunyan Miao

    Abstract: Dubbed as the next-generation Internet, the metaverse is a virtual world that allows users to interact with each other or objects in real-time using their avatars. The metaverse is envisioned to support novel ecosystems of service provision in an immersive environment brought about by an intersection of the virtual and physical worlds. The native AI systems in metaverse will personalized user expe… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: 6 pages, 10 figures

  27. arXiv:2108.02008  [pdf, other

    cs.CR cs.LG cs.NI

    Personal Devices for Contact Tracing: Smartphones and Wearables to Fight Covid-19

    Authors: Pai Chet Ng, Petros Spachos, Stefano Gregori, Konstantinos Plataniotis

    Abstract: Digital contact tracing has emerged as a viable tool supplementing manual contact tracing. To date, more than 100 contact tracing applications have been published to slow down the spread of highly contagious Covid-19. Despite subtle variabilities among these applications, all of them achieve contact tracing by manipulating the following three components: a) use a personal device to identify the us… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: Accepted at the IEEE Communications Magazine

  28. arXiv:2107.05279  [pdf, other

    cs.CV

    ICDAR 2021 Competition on Integrated Circuit Text Spotting and Aesthetic Assessment

    Authors: Chun Chet Ng, Akmalul Khairi Bin Nazaruddin, Yeong Khang Lee, Xinyu Wang, Yuliang Liu, Chee Seng Chan, Lianwen Jin, Yipeng Sun, Lixin Fan

    Abstract: With hundreds of thousands of electronic chip components are being manufactured every day, chip manufacturers have seen an increasing demand in seeking a more efficient and effective way of inspecting the quality of printed texts on chip components. The major problem that deters this area of research is the lacking of realistic text on chips datasets to act as a strong foundation. Hence, a text on… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: Technical report of ICDAR 2021 Competition on Integrated Circuit Text Spotting and Aesthetic Assessment

    Journal ref: International Conference on Document Analysis and Recognition (ICDAR) 2021

  29. arXiv:2107.00229  [pdf, other

    cs.CV cs.AI

    E-DSSR: Efficient Dynamic Surgical Scene Reconstruction with Transformer-based Stereoscopic Depth Perception

    Authors: Yonghao Long, Zhaoshuo Li, Chi Hang Yee, Chi Fai Ng, Russell H. Taylor, Mathias Unberath, Qi Dou

    Abstract: Reconstructing the scene of robotic surgery from the stereo endoscopic video is an important and promising topic in surgical data science, which potentially supports many applications such as surgical visual perception, robotic surgery education and intra-operative context awareness. However, current methods are mostly restricted to reconstructing static anatomy assuming no tissue deformation, too… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: Accepted to MICCAI 2021

  30. arXiv:2106.08536  [pdf, other

    eess.AS cs.SD

    Detection of Consonant Errors in Disordered Speech Based on Consonant-vowel Segment Embedding

    Authors: Si-Ioi Ng, Cymie Wing-Yee Ng, Jingyu Li, Tan Lee

    Abstract: Speech sound disorder (SSD) refers to a type of developmental disorder in young children who encounter persistent difficulties in producing certain speech sounds at the expected age. Consonant errors are the major indicator of SSD in clinical assessment. Previous studies on automatic assessment of SSD revealed that detection of speech errors concerning short and transitory consonants is less satis… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted to INTERSPEECH 2021

  31. arXiv:2106.00795  [pdf

    cs.IT eess.SP

    Classification of MIMO Equalizers

    Authors: Wing Chau Ng, Chuandong Li

    Abstract: In this theoretical work, the DSP-perceived channel in optical coherent communications is first simplified, based on which we categorize linear MIMO equalizers into four classes according to their reference locations. The entire channel inverse can be represented by a complex conjugate-dependent system, coinciding with the widely linear equalization theory. Suboptimally removing FO dynamics, relat… ▽ More

    Submitted 7 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: This work was submitted to ECOC 2021. This theoretical paper also explains the principle of the experimental demonstration in Joint Transmitter and Receiver IQ Differential Phase Calibration using a single 4x8 MIMO Equalizer, Proc. Advanced Photonics Congress 2021 (SPPCom), SpTh1D.4

  32. arXiv:2103.12988  [pdf, other

    cs.CV

    One to Many: Adaptive Instrument Segmentation via Meta Learning and Dynamic Online Adaptation in Robotic Surgical Video

    Authors: Zixu Zhao, Yueming Jin, Bo Lu, Chi-Fai Ng, Qi Dou, Yun-Hui Liu, Pheng-Ann Heng

    Abstract: Surgical instrument segmentation in robot-assisted surgery (RAS) - especially that using learning-based models - relies on the assumption that training and testing videos are sampled from the same domain. However, it is impractical and expensive to collect and annotate sufficient data from every new domain. To greatly increase the label efficiency, we explore a new problem, i.e., adaptive instrume… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted by ICRA 2021

  33. arXiv:2101.12149  [pdf, other

    physics.comp-ph cs.DC physics.acc-ph

    Porting WarpX to GPU-accelerated platforms

    Authors: A. Myers, A. Almgren, L. D. Amorim, J. Bell, L. Fedeli, L. Ge, K. Gott, D. P. Grote, M. Hogan, A. Huebl, R. Jambunathan, R. Lehe, C. Ng, M. Rowan, O. Shapoval, M. Thévenet, J. -L. Vay, H. Vincenti, E. Yang, N. Zaïm, W. Zhang, Y. Zhao, E. Zoni

    Abstract: WarpX is a general purpose electromagnetic particle-in-cell code that was originally designed to run on many-core CPU architectures. We describe the strategy followed to allow WarpX to use the GPU-accelerated nodes on OLCF's Summit supercomputer, a strategy we believe will extend to the upcoming machines Frontier and Aurora. We summarize the challenges encountered, lessons learned, and give curren… ▽ More

    Submitted 2 September, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 11 pages, 5 figures, accepted by Parallel Computing. Minor revisions, results unchanged

    Journal ref: Parallel Computing, Volume 108, 2021, 102833

  34. arXiv:2012.14309  [pdf, other

    q-bio.PE cond-mat.soft cs.CL physics.bio-ph

    General Mechanism of Evolution Shared by Proteins and Words

    Authors: Li-Min Wang, Hsing-Yi Lai, Sun-Ting Tsai, Chen Siang Ng, Shan-Jyun Wu, Meng-Xue Tsai, Yi-Ching Su, Daw-Wei Wang, Tzay-Ming Hong

    Abstract: Complex systems, such as life and languages, are governed by principles of evolution. The analogy and comparison between biology and linguistics\cite{alphafold2, RoseTTAFold, lang_virus, cell language, faculty1, language of gene, Protein linguistics, dictionary, Grammar of pro_dom, complexity, genomics_nlp, InterPro, language modeling, Protein language modeling} provide a computational foundation… ▽ More

    Submitted 16 December, 2022; v1 submitted 28 December, 2020; originally announced December 2020.

  35. arXiv:2008.03188  [pdf, other

    eess.AS cs.SD

    CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment

    Authors: Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang, Tan Lee, Kathy Yuet-Sheung Lee, Michael Chi-Fai Tong

    Abstract: This paper describes the design and development of CUCHILD, a large-scale Cantonese corpus of child speech. The corpus contains spoken words collected from 1,986 child speakers aged from 3 to 6 years old. The speech materials include 130 words of 1 to 4 syllables in length. The speakers cover both typically developing (TD) children and children with speech disorder. The intended use of the corpus… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: Accepted to INTERSPEECH 2020, Shanghai, China

  36. arXiv:2007.04399  [pdf, other

    cs.CR cs.HC cs.LG cs.NI

    Epidemic Exposure Notification with Smartwatch: A Proximity-Based Privacy-Preserving Approach

    Authors: Pai Chet Ng, Petros Spachos, Stefano Gregori, Konstantinos Plataniotis

    Abstract: Businesses planning for the post-pandemic world are looking for innovative ways to protect the health and welfare of their employees and customers. Wireless technologies can play a key role in assisting contact tracing to quickly halt a local infection outbreak and prevent further spread. In this work, we present a wearable proximity and exposure notification solution based on a smartwatch that al… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

  37. arXiv:2006.07361  [pdf, other

    cs.LG eess.SP stat.ML

    Gaussian Processes on Graphs via Spectral Kernel Learning

    Authors: Yin-Cong Zhi, Yin Cheng Ng, Xiaowen Dong

    Abstract: We propose a graph spectrum-based Gaussian process for prediction of signals defined on nodes of the graph. The model is designed to capture various graph signal structures through a highly adaptive kernel that incorporates a flexible polynomial function in the graph spectral domain. Unlike most existing approaches, we propose to learn such a spectral kernel, where the polynomial setup enables lea… ▽ More

    Submitted 28 October, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 13 pages, 5 Figures

  38. arXiv:2005.13754  [pdf, other

    cs.LG cs.CR cs.HC cs.NI

    COVID-19 and Your Smartphone: BLE-based Smart Contact Tracing

    Authors: Pai Chet Ng, Petros Spachos, Konstantinos Plataniotis

    Abstract: Contact tracing is of paramount importance when it comes to preventing the spreading of infectious diseases. Contact tracing is usually performed manually by authorized personnel. Manual contact tracing is an inefficient, error-prone, time-consuming process of limited utility to the population at large as those in close contact with infected individuals are informed hours, if not days, later. This… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

  39. arXiv:2005.02780  [pdf, other

    cs.SI cs.CL

    A Large-scale Industrial and Professional Occupation Dataset

    Authors: Junhua Liu, Yung Chuen Ng, Kwan Hui Lim

    Abstract: There has been growing interest in utilizing occupational data mining and analysis. In today's job market, occupational data mining and analysis is growing in importance as it enables companies to predict employee turnover, model career trajectories, screen through resumes and perform other human resource tasks. A key requirement to facilitate these tasks is the need for an occupation-related data… ▽ More

    Submitted 25 April, 2020; originally announced May 2020.

  40. arXiv:2002.10215  [pdf, other

    cs.CV

    On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering

    Authors: Xinyu Wang, Yuliang Liu, Chunhua Shen, Chun Chet Ng, Canjie Luo, Lianwen Jin, Chee Seng Chan, Anton van den Hengel, Liangwei Wang

    Abstract: Visual Question Answering (VQA) methods have made incredible progress, but suffer from a failure to generalize. This is visible in the fact that they are vulnerable to learning coincidental correlations in the data rather than deeper relations between image content and ideas expressed in language. We present a dataset that takes a step towards addressing this problem in that it contains questions… ▽ More

    Submitted 25 February, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted to Proc. IEEE Conf. Computer Vision and Pattern Recognition 2020

  41. arXiv:1910.10495  [pdf, other

    cs.CL cs.IR cs.LG

    IPOD: An Industrial and Professional Occupations Dataset and its Applications to Occupational Data Mining and Analysis

    Authors: Junhua Liu, Yung Chuen Ng, Kristin L. Wood, Kwan Hui Lim

    Abstract: Occupational data mining and analysis is an important task in understanding today's industry and job market. Various machine learning techniques are proposed and gradually deployed to improve companies' operations for upstream tasks, such as employee churn prediction, career trajectory modelling and automated interview. Job titles analysis and embedding, as the fundamental building blocks, are cru… ▽ More

    Submitted 26 April, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

  42. arXiv:1909.07741  [pdf, other

    cs.CV cs.LG cs.MM

    ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling -- RRC-LSVT

    Authors: Yipeng Sun, Zihan Ni, Chee-Kheng Chng, Yuliang Liu, Canjie Luo, Chun Chet Ng, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

    Abstract: Robust text reading from street view images provides valuable information for various applications. Performance improvement of existing methods in such a challenging scenario heavily relies on the amount of fully annotated training data, which is costly and in-efficient to obtain. To scale up the amount of training data while keeping the labeling procedure cost-effective, this competition introduc… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: ICDAR 2019 Robust Reading Challenge in IAPR International Conference on Document Analysis and Recognition (ICDAR)

  43. arXiv:1909.07145  [pdf, other

    cs.CV

    ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)

    Authors: Chee-Kheng Chng, Yuliang Liu, Yipeng Sun, Chun Chet Ng, Canjie Luo, Zihan Ni, ChuanMing Fang, Shuaitao Zhang, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

    Abstract: This paper reports the ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT) that consists of three major challenges: i) scene text detection, ii) scene text recognition, and iii) scene text spotting. A total of 78 submissions from 46 unique teams/individuals were received for this competition. The top performing score of each challenge is as follows: i) T1 - 82.65%, ii) T2.1 - 74.… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: Technical report of ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT) Competition

  44. A Study of an Agile Methodology with Scrum Approach to the Filipino Company-Sponsored I.T. Capstone Program

    Authors: Giuseppe C. Ng

    Abstract: Purpose - The research aims to show the relevance of company client sponsored student projects in the University of Asia and the Pacific Information Technology (UA&P IT) Capstone Program through the use ofan Agile Methodology with Scrum Approach. Method - The modified program is employed on two batches with content analysis and survey results as benchmarks. Results - Surveys at the end of the spri… ▽ More

    Submitted 3 February, 2019; originally announced February 2019.

    Journal ref: International Journal of Computing Sciences Research (ISSN print: 2546-0552; ISSN online: 2546-115X) Vol. 2, No. 2, 2018

  45. arXiv:1811.08933  [pdf, other

    cs.DC

    Analyzing Machine Learning Workloads Using a Detailed GPU Simulator

    Authors: Jonathan Lew, Deval Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, Tor Aamodt

    Abstract: Most deep neural networks deployed today are trained using GPUs via high-level frameworks such as TensorFlow and PyTorch. This paper describes changes we made to the GPGPU-Sim simulator to enable it to run PyTorch by running PTX kernels included in NVIDIA's cuDNN library. We use the resulting modified simulator, which has been made available publicly with this paper, to study some simple deep lear… ▽ More

    Submitted 26 January, 2019; v1 submitted 18 November, 2018; originally announced November 2018.

    Comments: Source code available at: https://github.com/gpgpu-sim/gpgpu-sim_distribution/tree/dev

  46. arXiv:1809.05210  [pdf, other

    cs.CV

    A Time Series Graph Cut Image Segmentation Scheme for Liver Tumors

    Authors: Laramie Paxton, Yufeng Cao, Kevin R. Vixie, Yuan Wang, Brian Hobbs, Chaan Ng

    Abstract: Tumor detection in biomedical imaging is a time-consuming process for medical professionals and is not without errors. Thus in recent decades, researchers have developed algorithmic techniques for image processing using a wide variety of mathematical methods, such as statistical modeling, variational techniques, and machine learning. In this paper, we propose a semi-automatic method for liver segm… ▽ More

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: Image processing; image analysis; medical imaging

  47. arXiv:1809.04379  [pdf, other

    cs.LG cs.SI stat.ML

    Bayesian Semi-supervised Learning with Graph Gaussian Processes

    Authors: Yin Cheng Ng, Nicolo Colombo, Ricardo Silva

    Abstract: We propose a data-efficient Gaussian process-based Bayesian approach to the semi-supervised learning problem on graphs. The proposed model shows extremely competitive performance when compared to the state-of-the-art graph neural networks on semi-supervised learning benchmark experiments, and outperforms the neural networks in active learning experiments where labels are scarce. Furthermore, the m… ▽ More

    Submitted 12 October, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: To appear in NIPS 2018 Fixed an error in Figure 2. The previous arxiv version contains two identical sub-figures

  48. arXiv:1801.09029  [pdf, other

    cs.IT

    Adaptive Hybrid Beamforming with Massive Phased Arrays in Macro-Cellular Networks

    Authors: Shahram Shahsavari, S. Amir Hosseini, Chris Ng, Elza Erkip

    Abstract: Hybrid beamforming via large antenna arrays has shown a great potential for increasing data rate in cellular networks by delivering multiple data streams simultaneously. In this paper, several beamforming design algorithms are proposed based on the long-term channel information for macro-cellular environments where the base station is equipped with a massive phased array under per-antenna power co… ▽ More

    Submitted 3 February, 2018; v1 submitted 26 January, 2018; originally announced January 2018.

  49. arXiv:1710.04008  [pdf, other

    stat.ML cs.SI

    A Dynamic Edge Exchangeable Model for Sparse Temporal Networks

    Authors: Yin Cheng Ng, Ricardo Silva

    Abstract: We propose a dynamic edge exchangeable network model that can capture sparse connections observed in real temporal networks, in contrast to existing models which are dense. The model achieved superior link prediction accuracy on multiple data sets when compared to a dynamic variant of the blockmodel, and is able to extract interpretable time-varying community structures from the data. In addition… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

  50. arXiv:1612.05038  [pdf, other

    cs.CV

    Objective Micro-Facial Movement Detection Using FACS-Based Regions and Baseline Evaluation

    Authors: Adrian K. Davison, Cliff Lansley, Choon Ching Ng, Kevin Tan, Moi Hoon Yap

    Abstract: Micro-facial expressions are regarded as an important human behavioural event that can highlight emotional deception. Spotting these movements is difficult for humans and machines, however research into using computer vision to detect subtle facial expressions is growing in popularity. This paper proposes an individualised baseline micro-movement detection method using 3D Histogram of Oriented Gra… ▽ More

    Submitted 15 December, 2016; originally announced December 2016.