Zum Hauptinhalt springen

Showing 1–50 of 61 results for author: Ju, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10608  [pdf, other

    cs.CL cs.AI

    Promoting Equality in Large Language Models: Identifying and Mitigating the Implicit Bias based on Bayesian Theory

    Authors: Yongxin Deng, Xihe Qiu, Xiaoyu Tan, Jing Pan, Chen Jue, Zhijun Fang, Yinghui Xu, Wei Chu, Yuan Qi

    Abstract: Large language models (LLMs) are trained on extensive text corpora, which inevitably include biased information. Although techniques such as Affective Alignment can mitigate some negative impacts of these biases, existing prompt-based attack methods can still extract these biases from the model's weights. Moreover, these biases frequently appear subtly when LLMs are prompted to perform identical t… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  2. arXiv:2407.11717  [pdf, other

    cs.CV

    Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

    Authors: Chen Ju, Haicheng Wang, Haozhe Cheng, Xu Chen, Zhonghua Zhai, Weilin Huang, Jinsong Lan, Shuai Xiao, Bo Zheng

    Abstract: Vision-Language Large Models (VLMs) recently become primary backbone of AI, due to the impressive performance. However, their expensive computation costs, i.e., throughput and delay, impede potentials in the real-world scenarios. To achieve acceleration for VLMs, most existing methods focus on the model perspective: pruning, distillation, quantization, but completely overlook the data-perspective… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: ECCV 2024. The first two authors share the same contribution. arXiv admin note: substantial text overlap with arXiv:2312.07408

  3. arXiv:2404.14890  [pdf, other

    cs.CV

    DENOISER: Rethinking the Robustness for Open-Vocabulary Action Recognition

    Authors: Haozhe Cheng, Cheng Ju, Haicheng Wang, Jinxiang Liu, Mengting Chen, Qiang Hu, Xiaoyun Zhang, Yanfeng Wang

    Abstract: As one of the fundamental video tasks in computer vision, Open-Vocabulary Action Recognition (OVAR) recently gains increasing attention, with the development of vision-language pre-trainings. To enable generalization of arbitrary classes, existing methods treat class labels as text descriptions, then formulate OVAR as evaluating embedding similarity between visual samples and textual classes. Howe… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  4. arXiv:2403.15082   

    cs.CV

    Cell Variational Information Bottleneck Network

    Authors: Zhonghua Zhai, Chen Ju, Jinsong Lan, Shuai Xiao

    Abstract: In this work, we propose Cell Variational Information Bottleneck Network (cellVIB), a convolutional neural network using information bottleneck mechanism, which can be combined with the latest feedforward network architecture in an end-to-end training method. Our Cell Variational Information Bottleneck Network is constructed by stacking VIB cells, which generate feature maps with uncertainty. As l… ▽ More

    Submitted 29 March, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: Found errors in the article, therefore postponing publication for now

  5. arXiv:2403.12965  [pdf, other

    cs.CV

    Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment

    Authors: Mengting Chen, Xi Chen, Zhonghua Zhai, Chen Ju, Xuewen Hong, Jinsong Lan, Shuai Xiao

    Abstract: This paper introduces a novel framework for virtual try-on, termed Wear-Any-Way. Different from previous methods, Wear-Any-Way is a customizable solution. Besides generating high-fidelity results, our method supports users to precisely manipulate the wearing style. To achieve this goal, we first construct a strong pipeline for standard virtual try-on, supporting single/multiple garment try-on and… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project Page: https://mengtingchen.github.io/wear-any-way-page/

  6. arXiv:2403.11074  [pdf, other

    cs.CV cs.AI cs.MM cs.SD eess.AS

    Audio-Visual Segmentation via Unlabeled Frame Exploitation

    Authors: Jinxiang Liu, Yikun Liu, Fei Zhang, Chen Ju, Ya Zhang, Yanfeng Wang

    Abstract: Audio-visual segmentation (AVS) aims to segment the sounding objects in video frames. Although great progress has been witnessed, we experimentally reveal that current methods reach marginal performance gain within the use of the unlabeled frames, leading to the underutilization issue. To fully explore the potential of the unlabeled frames for AVS, we explicitly divide them into two categories bas… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  7. Heterogeneity-aware Cross-school Electives Recommendation: a Hybrid Federated Approach

    Authors: Chengyi Ju, Jiannong Cao, Yu Yang, Zhen-Qun Yang, Ho Man Lee

    Abstract: In the era of modern education, addressing cross-school learner diversity is crucial, especially in personalized recommender systems for elective course selection. However, privacy concerns often limit cross-school data sharing, which hinders existing methods' ability to model sparse data and address heterogeneity effectively, ultimately leading to suboptimal recommendations. In response, we propo… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Journal ref: 2023 IEEE International Conference on Data Mining Workshops (ICDMW)

  8. Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models

    Authors: Chenyang Gao, Brecht Desplanques, Chelsea J. -T. Ju, Aman Chadha, Andreas Stolcke

    Abstract: Automated speaker identification (SID) is a crucial step for the personalization of a wide range of speech-enabled services. Typical SID systems use a symmetric enrollment-verification framework with a single model to derive embeddings both offline for voice profiles extracted from enrollment utterances, and online from runtime utterances. Due to the distinct circumstances of enrollment and runtim… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  9. arXiv:2312.07408  [pdf, other

    cs.CV

    Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models

    Authors: Chen Ju, Haicheng Wang, Zeqian Li, Xu Chen, Zhonghua Zhai, Weilin Huang, Shuai Xiao

    Abstract: Vision-Language Large Models (VLMs) have become primary backbone of AI, due to the impressive performance. However, their expensive computation costs, i.e., throughput and delay, impede potentials in real-world scenarios. To achieve acceleration for VLMs, most existing methods focus on the model perspective: pruning, distillation, quantification, but completely overlook the data-perspective redund… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  10. arXiv:2312.00078  [pdf, other

    cs.IR

    Enhancing Cross-domain Click-Through Rate Prediction via Explicit Feature Augmentation

    Authors: Xu Chen, Zida Cheng, Jiangchao Yao, Chen Ju, Weilin Huang, Jinsong Lan, Xiaoyi Zeng, Shuai Xiao

    Abstract: Cross-domain CTR (CDCTR) prediction is an important research topic that studies how to leverage meaningful data from a related domain to help CTR prediction in target domain. Most existing CDCTR works design implicit ways to transfer knowledge across domains such as parameter-sharing that regularizes the model training in target domain. More effectively, recent researchers propose explicit techniq… ▽ More

    Submitted 18 February, 2024; v1 submitted 29 November, 2023; originally announced December 2023.

    Comments: accepted by WWW 2024. arXiv admin note: substantial text overlap with arXiv:2305.03953

  11. arXiv:2311.07126  [pdf, other

    cs.LG

    How to Do Machine Learning with Small Data? -- A Review from an Industrial Perspective

    Authors: Ivan Kraljevski, Yong Chul Ju, Dmitrij Ivanov, Constanze Tschöpe, Matthias Wolff

    Abstract: Artificial intelligence experienced a technological breakthrough in science, industry, and everyday life in the recent few decades. The advancements can be credited to the ever-increasing availability and miniaturization of computational resources that resulted in exponential data growth. However, because of the insufficient amount of data in some cases, employing machine learning in solving compl… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  12. arXiv:2309.06419  [pdf, other

    cs.CL

    Radiology-Llama2: Best-in-Class Large Language Model for Radiology

    Authors: Zhengliang Liu, Yiwei Li, Peng Shu, Aoxiao Zhong, Longtao Yang, Chao Ju, Zihao Wu, Chong Ma, Jie Luo, Cheng Chen, Sekeun Kim, Jiang Hu, Haixing Dai, Lin Zhao, Dajiang Zhu, Jun Liu, Wei Liu, Dinggang Shen, Tianming Liu, Quanzheng Li, Xiang Li

    Abstract: This paper introduces Radiology-Llama2, a large language model specialized for radiology through a process known as instruction tuning. Radiology-Llama2 is based on the Llama2 architecture and further trained on a large dataset of radiology reports to generate coherent and clinically useful impressions from radiological findings. Quantitative evaluations using ROUGE metrics on the MIMIC-CXR and Op… ▽ More

    Submitted 29 August, 2023; originally announced September 2023.

  13. arXiv:2309.00096  [pdf, other

    cs.CV cs.AI

    AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation

    Authors: Chaofan Ma, Yuhuan Yang, Chen Ju, Fei Zhang, Ya Zhang, Yanfeng Wang

    Abstract: Open-vocabulary semantic segmentation is a challenging task that requires segmenting novel object categories at inference time. Recent studies have explored vision-language pre-training to handle this task, but suffer from unrealistic assumptions in practical scenarios, i.e., low-quality textual category names. For example, this paradigm assumes that new textual categories will be accurately and c… ▽ More

    Submitted 5 January, 2024; v1 submitted 31 August, 2023; originally announced September 2023.

    Comments: Accepted to NeurIPS 2023

  14. arXiv:2307.13236  [pdf, other

    cs.SD cs.CV cs.LG cs.MM eess.AS

    Audio-aware Query-enhanced Transformer for Audio-Visual Segmentation

    Authors: Jinxiang Liu, Chen Ju, Chaofan Ma, Yanfeng Wang, Yu Wang, Ya Zhang

    Abstract: The goal of the audio-visual segmentation (AVS) task is to segment the sounding objects in the video frames using audio cues. However, current fusion-based methods have the performance limitations due to the small receptive field of convolution and inadequate fusion of audio-visual features. To overcome these issues, we propose a novel \textbf{Au}dio-aware query-enhanced \textbf{TR}ansformer (AuTR… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.11019

  15. arXiv:2307.03307  [pdf, other

    cs.DC cs.DM math.OC

    Efficient parallel implementation of the multiplicative weight update method for graph-based linear programs

    Authors: Caleb Ju, Serif Yesil, Mengyuan Sun, Chandra Chekuri, Edgar Solomonik

    Abstract: Positive linear programs (LPs) model many graph and operations research problems. One can solve for a $(1+ε)$-approximation for positive LPs, for any selected $ε$, in polylogarithmic depth and near-linear work via variations of the multiplicative weight update (MWU) method. Despite extensive theoretical work on these algorithms through the decades, their empirical performance is not well understoo… ▽ More

    Submitted 12 February, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Updates to funding and small revisions

    MSC Class: 68W10; 90C06; 90C05; 90C35 ACM Class: F.2.1; G.2.2

  16. arXiv:2307.02003  [pdf, other

    cs.CV

    Multi-Modal Prototypes for Open-World Semantic Segmentation

    Authors: Yuhuan Yang, Chaofan Ma, Chen Ju, Fei Zhang, Jiangchao Yao, Ya Zhang, Yanfeng Wang

    Abstract: In semantic segmentation, generalizing a visual system to both seen categories and novel categories at inference time has always been practically valuable yet challenging. To enable such functionality, existing methods mainly rely on either providing several support demonstrations from the visual aspect or characterizing the informative clues from the textual aspect (e.g., the class names). Nevert… ▽ More

    Submitted 11 July, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: accepted in IJCV

  17. arXiv:2306.08666  [pdf, other

    cs.CL cs.AI

    Radiology-GPT: A Large Language Model for Radiology

    Authors: Zhengliang Liu, Aoxiao Zhong, Yiwei Li, Longtao Yang, Chao Ju, Zihao Wu, Chong Ma, Peng Shu, Cheng Chen, Sekeun Kim, Haixing Dai, Lin Zhao, Lichao Sun, Dajiang Zhu, Jun Liu, Wei Liu, Dinggang Shen, Xiang Li, Quanzheng Li, Tianming Liu

    Abstract: We introduce Radiology-GPT, a large language model for radiology. Using an instruction tuning approach on an extensive dataset of radiology domain knowledge, Radiology-GPT demonstrates superior performance compared to general language models such as StableLM, Dolly and LLaMA. It exhibits significant versatility in radiological diagnosis, research, and communication. This work serves as a catalyst… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

  18. arXiv:2305.11019  [pdf, other

    cs.CV cs.AI cs.MM

    Annotation-free Audio-Visual Segmentation

    Authors: Jinxiang Liu, Yu Wang, Chen Ju, Chaofan Ma, Ya Zhang, Weidi Xie

    Abstract: The objective of Audio-Visual Segmentation (AVS) is to localise the sounding objects within visual scenes by accurately predicting pixel-wise segmentation masks. To tackle the task, it involves a comprehensive consideration of both the data and model aspects. In this paper, first, we initiate a novel pipeline for generating artificial data for the AVS task without extra manual annotations. We leve… ▽ More

    Submitted 7 October, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Camera-ready version for WACV 2024; project page is https://jinxiang-liu.github.io/anno-free-AVS/

  19. arXiv:2305.03972  [pdf, other

    cs.IR

    Category-Oriented Representation Learning for Image to Multi-Modal Retrieval

    Authors: Zida Cheng, Chen Ju, Shuai Xiao, Xu Chen, Zhonghua Zhai, Xiaoyi Zeng, Weilin Huang, Junchi Yan

    Abstract: The rise of multi-modal search requests from users has highlighted the importance of multi-modal retrieval (i.e. image-to-text or text-to-image retrieval), yet the more complex task of image-to-multi-modal retrieval, crucial for many industry applications, remains under-explored. To address this gap and promote further research, we introduce and define the concept of Image-to-Multi-Modal Retrieval… ▽ More

    Submitted 9 June, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

  20. arXiv:2303.11732  [pdf, other

    cs.CV

    Multi-modal Prompting for Low-Shot Temporal Action Localization

    Authors: Chen Ju, Zeqian Li, Peisen Zhao, Ya Zhang, Xiaopeng Zhang, Qi Tian, Yanfeng Wang, Weidi Xie

    Abstract: In this paper, we consider the problem of temporal action localization under low-shot (zero-shot & few-shot) scenario, with the goal of detecting and classifying the action instances from arbitrary categories within some untrimmed videos, even not seen at training time. We adopt a Transformer-based two-stage action localization architecture with class-agnostic action proposal, followed by open-voc… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  21. arXiv:2303.09813  [pdf, other

    cs.CV cs.AI

    DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery

    Authors: Chaofan Ma, Yuhuan Yang, Chen Ju, Fei Zhang, Jinxiang Liu, Yu Wang, Ya Zhang, Yanfeng Wang

    Abstract: Learning from a large corpus of data, pre-trained models have achieved impressive progress nowadays. As popular generative pre-training, diffusion models capture both low-level visual knowledge and high-level semantic relations. In this paper, we propose to exploit such knowledgeable diffusion models for mainstream discriminative tasks, i.e., unsupervised object discovery: saliency segmentation an… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

  22. arXiv:2302.09850  [pdf, other

    cs.CV

    Constraint and Union for Partially-Supervised Temporal Sentence Grounding

    Authors: Chen Ju, Haicheng Wang, Jinxiang Liu, Chaofan Ma, Ya Zhang, Peisen Zhao, Jianlong Chang, Qi Tian

    Abstract: Temporal sentence grounding aims to detect the event timestamps described by the natural language query from given untrimmed videos. The existing fully-supervised setting achieves great performance but requires expensive annotation costs; while the weakly-supervised setting adopts cheap labels but performs poorly. To pursue high performance with less annotation cost, this paper introduces an inter… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  23. arXiv:2212.09335  [pdf, other

    cs.CV

    Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization

    Authors: Chen Ju, Kunhao Zheng, Jinxiang Liu, Peisen Zhao, Ya Zhang, Jianlong Chang, Yanfeng Wang, Qi Tian

    Abstract: Weakly-supervised temporal action localization (WTAL) learns to detect and classify action instances with only category labels. Most methods widely adopt the off-the-shelf Classification-Based Pre-training (CBP) to generate video features for action localization. However, the different optimization objectives between classification and localization, make temporally localized results suffer from th… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: The first two authors share the same contribution

  24. arXiv:2211.02641  [pdf, ps, other

    eess.SP cs.AI cs.LG

    Graph Neural Networks on SPD Manifolds for Motor Imagery Classification: A Perspective from the Time-Frequency Analysis

    Authors: Ce Ju, Cuntai Guan

    Abstract: The motor imagery (MI) classification has been a prominent research topic in brain-computer interfaces based on electroencephalography (EEG). Over the past few decades, the performance of MI-EEG classifiers has seen gradual enhancement. In this study, we amplify the geometric deep learning-based MI-EEG classifiers from the perspective of time-frequency analysis, introducing a new architecture call… ▽ More

    Submitted 20 August, 2023; v1 submitted 25 October, 2022; originally announced November 2022.

    Comments: 15 pages, 5 figures, 6 Tables; This work has been accepted by the IEEE Transactions on Neural Networks and Learning Systems, 2023. Copyright will be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.0

  25. Adversarial Reweighting for Speaker Verification Fairness

    Authors: Minho Jin, Chelsea J. -T. Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, Andreas Stolcke

    Abstract: We address performance fairness for speaker verification using the adversarial reweighting (ARW) method. ARW is reformulated for speaker verification with metric learning, and shown to improve results across different subgroups of gender and nationality, without requiring annotation of subgroups in the training data. An adversarial network learns a weight for each training sample in the batch so t… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Journal ref: Proc. Interspeech, Sept. 2022, pp. 4800-4804

  26. arXiv:2207.03638  [pdf, other

    cs.CV cs.LG

    A Support Vector Model of Pruning Trees Evaluation Based on OTSU Algorithm

    Authors: Yuefei Chen, Xinli Zheng, Chunhua Ju, Fuguang Bao

    Abstract: The tree pruning process is the key to promoting fruits' growth and improving their productions due to effects on the photosynthesis efficiency of fruits and nutrition transportation in branches. Currently, pruning is still highly dependent on human labor. The workers' experience will strongly affect the robustness of the performance of the tree pruning. Thus, it is a challenge for workers and far… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  27. arXiv:2206.12772  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation

    Authors: Jinxiang Liu, Chen Ju, Weidi Xie, Ya Zhang

    Abstract: We present a simple yet effective self-supervised framework for audio-visual representation learning, to localize the sound source in videos. To understand what enables to learn useful representations, we systematically investigate the effects of data augmentations, and reveal that (1) composition of data augmentations plays a critical role, i.e. explicitly encouraging the audio-visual representat… ▽ More

    Submitted 15 August, 2022; v1 submitted 25 June, 2022; originally announced June 2022.

    Comments: Camera-ready Version for ACMMM 2022, Project page is https://jinxiang-liu.github.io/SSL-TIE/

  28. arXiv:2205.13038  [pdf, other

    cs.LG cs.AI

    Improving Subgraph Representation Learning via Multi-View Augmentation

    Authors: Yili Shen, Xiao Liu, Cheng-Wei Ju, Jiaxu Yan, Jun Yi, Zhou Lin, Hui Guan

    Abstract: Subgraph representation learning based on Graph Neural Network (GNN) has exhibited broad applications in scientific advancements, such as predictions of molecular structure-property relationships and collective cellular function. In particular, graph augmentation techniques have shown promising results in improving graph-based and node-based classification tasks. Still, they have rarely been explo… ▽ More

    Submitted 13 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  29. arXiv:2202.02472  [pdf, ps, other

    eess.SP cs.CV cs.LG eess.IV

    Tensor-CSPNet: A Novel Geometric Deep Learning Framework for Motor Imagery Classification

    Authors: Ce Ju, Cuntai Guan

    Abstract: Deep learning (DL) has been widely investigated in a vast majority of applications in electroencephalography (EEG)-based brain-computer interfaces (BCIs), especially for motor imagery (MI) classification in the past five years. The mainstream DL methodology for the MI-EEG classification exploits the temporospatial patterns of EEG signals using convolutional neural networks (CNNs), which have remar… ▽ More

    Submitted 23 September, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: 15 pages, 10 figures, 12 tables; This work has been accepted by the IEEE Transactions on Neural Networks and Learning Systems. Copyright will be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.0

  30. arXiv:2201.05745  [pdf, other

    cs.LG cs.AI eess.SP

    Deep Optimal Transport for Domain Adaptation on SPD Manifolds

    Authors: Ce Ju, Cuntai Guan

    Abstract: The machine learning community has shown increasing interest in addressing the domain adaptation problem on symmetric positive definite (SPD) manifolds. This interest is primarily driven by the complexities of neuroimaging data generated from brain signals, which often exhibit shifts in data distribution across recording sessions. These neuroimaging data, represented by signal covariance matrices,… ▽ More

    Submitted 3 June, 2024; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.0

  31. arXiv:2112.04478  [pdf, other

    cs.CV cs.CL

    Prompting Visual-Language Models for Efficient Video Understanding

    Authors: Chen Ju, Tengda Han, Kunhao Zheng, Ya Zhang, Weidi Xie

    Abstract: Image-based visual-language (I-VL) pre-training has shown great success for learning joint visual-textual representations from large-scale web data, revealing remarkable ability for zero-shot generalisation. This paper presents a simple but strong baseline to efficiently adapt the pre-trained I-VL model, and exploit its powerful ability for resource-hungry video understanding tasks, with minimal t… ▽ More

    Submitted 15 July, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: ECCV 2022. Project page: https://ju-chen.github.io/efficient-prompt/

  32. arXiv:2108.06808  [pdf, other

    cs.LG

    Implicit Regularization of Bregman Proximal Point Algorithm and Mirror Descent on Separable Data

    Authors: Yan Li, Caleb Ju, Ethan X. Fang, Tuo Zhao

    Abstract: Bregman proximal point algorithm (BPPA) has witnessed emerging machine learning applications, yet its theoretical understanding has been largely unexplored. We study the computational properties of BPPA through learning linear classifiers with separable data, and demonstrate provable algorithmic regularization of BPPA. For any BPPA instantiated with a fixed Bregman divergence, we provide a lower b… ▽ More

    Submitted 24 August, 2023; v1 submitted 15 August, 2021; originally announced August 2021.

  33. arXiv:2107.09834  [pdf, other

    cs.DC math.NA

    Communication lower bounds for nested bilinear algorithms via rank expansion of Kronecker products

    Authors: Caleb Ju, Yifan Zhang, Edgar Solomonik

    Abstract: We develop lower bounds on communication in the memory hierarchy or between processors for nested bilinear algorithms, such as Strassen's algorithm for matrix multiplication. We build on a previous framework that establishes communication lower bounds by use of the rank expansion, or the minimum rank of any fixed size subset of columns of a matrix, for each of the three matrices encoding a bilinea… ▽ More

    Submitted 28 September, 2023; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 47 pages, 6 figures, 1 table. Revisions to paper for submission

    MSC Class: 15A03; 65F99; 65Y05; 68Q11

  34. arXiv:2106.12608  [pdf, other

    cs.CL q-bio.QM

    Clinical Named Entity Recognition using Contextualized Token Representations

    Authors: Yichao Zhou, Chelsea Ju, J. Harry Caufield, Kevin Shih, Calvin Chen, Yizhou Sun, Kai-Wei Chang, Peipei Ping, Wei Wang

    Abstract: The clinical named entity recognition (CNER) task seeks to locate and classify clinical terminologies into predefined categories, such as diagnostic procedure, disease disorder, severity, medication, medication dosage, and sign symptom. CNER facilitates the study of side-effect on medications including identification of novel phenomena and human-focused information extraction. Existing approaches… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: 1 figure, 6 tables

  35. arXiv:2106.10169  [pdf, other

    cs.LG cs.CL cs.SD eess.AS

    Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition

    Authors: Ruirui Li, Chelsea J. -T. Ju, Zeya Chen, Hongda Mao, Oguz Elibol, Andreas Stolcke

    Abstract: By implicitly recognizing a user based on his/her speech input, speaker identification enables many downstream applications, such as personalized system behavior and expedited shopping checkouts. Based on whether the speech content is constrained or not, both text-dependent (TD) and text-independent (TI) speaker recognition models may be used. We wish to combine the advantages of both types of mod… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

  36. arXiv:2104.02357  [pdf, other

    cs.CV cs.AI

    Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization

    Authors: Chen Ju, Peisen Zhao, Siheng Chen, Ya Zhang, Xiaoyun Zhang, Qi Tian

    Abstract: Weakly-supervised temporal action localization aims to localize actions in untrimmed videos with only video-level action category labels. Most of previous methods ignore the incompleteness issue of Class Activation Sequences (CAS), suffering from trivial localization results. To solve this issue, we introduce an adaptive mutual supervision framework (AMS) with two branches, where the base branch a… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  37. arXiv:2103.09173  [pdf, other

    cs.AI

    Ternary Hashing

    Authors: Chang Liu, Lixin Fan, Kam Woh Ng, Yilun Jin, Ce Ju, Tianyu Zhang, Chee Seng Chan, Qiang Yang

    Abstract: This paper proposes a novel ternary hash encoding for learning to hash methods, which provides a principled more efficient coding scheme with performances better than those of the state-of-the-art binary hashing counterparts. Two kinds of axiomatic ternary logic, Kleene logic and Łukasiewicz logic are adopted to calculate the Ternary Hamming Distance (THD) for both the learning/encoding and testin… ▽ More

    Submitted 19 March, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

  38. arXiv:2103.04283  [pdf, ps, other

    q-bio.MN cs.LG q-bio.BM q-bio.GN

    Bio-JOIE: Joint Representation Learning of Biological Knowledge Bases

    Authors: Junheng Hao, Chelsea Ju, Muhao Chen, Yizhou Sun, Carlo Zaniolo, Wei Wang

    Abstract: The widespread of Coronavirus has led to a worldwide pandemic with a high mortality rate. Currently, the knowledge accumulated from different studies about this virus is very limited. Leveraging a wide-range of biological knowledge, such as gene ontology and protein-protein interaction (PPI) networks from other closely related species presents a vital approach to infer the molecular impact of a ne… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: ACM BCB 2020, Best Student Paper

    Journal ref: In Procs of the 11th ACM BCB, pp. 1-10. 2020

  39. arXiv:2012.08236  [pdf, other

    cs.CV

    Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

    Authors: Chen Ju, Peisen Zhao, Ya Zhang, Yanfeng Wang, Qi Tian

    Abstract: Point-Level temporal action localization (PTAL) aims to localize actions in untrimmed videos with only one timestamp annotation for each action instance. Existing methods adopt the frame-level prediction paradigm to learn from the sparse single-frame labels. However, such a framework inevitably suffers from a large solution space. This paper attempts to explore the proposal-based prediction paradi… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

  40. arXiv:2011.13538  [pdf, other

    cs.LG

    Rethinking Uncertainty in Deep Learning: Whether and How it Improves Robustness

    Authors: Yilun Jin, Lixin Fan, Kam Woh Ng, Ce Ju, Qiang Yang

    Abstract: Deep neural networks (DNNs) are known to be prone to adversarial attacks, for which many remedies are proposed. While adversarial training (AT) is regarded as the most robust defense, it suffers from poor performance both on clean examples and under other types of attacks, e.g. attacks with larger perturbations. Meanwhile, regularizers that encourage uncertain outputs, such as entropy maximization… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

  41. arXiv:2011.03682  [pdf, other

    cs.SD eess.AS

    Non-local convolutional neural networks (nlcnn) for speaker recognition

    Authors: Haici Yang, Hongda Mao, Ruirui Li, Chelsea J. T. Ju, Oguz Elibol

    Abstract: Speaker recognition is the process of identifying a speaker based on the voice. The technology has attracted more attention with the recent increase in popularity of smart voice assistants, such as Amazon Alexa. In the past few years, various convolutional neural network (CNN) based speaker recognition algorithms have been proposed and achieved satisfactory performance. However, convolutional oper… ▽ More

    Submitted 19 May, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

  42. arXiv:2008.06853  [pdf, other

    cs.LG math.NA stat.ML

    Survey: Geometric Foundations of Data Reduction

    Authors: Ce Ju

    Abstract: This survey is written in summer, 2016. The purpose of this survey is to briefly introduce nonlinear dimensionality reduction (NLDR) in data reduction. The first two NLDR were respectively published in Science in 2000 in which they solve the similar reduction problem of high-dimensional data endowed with the intrinsic nonlinear structure. The intrinsic nonlinear structure is always interpreted as… ▽ More

    Submitted 20 March, 2022; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: 78 pages, Suvery

    ACM Class: I.2

  43. arXiv:2007.01587  [pdf, other

    cs.CR cs.LG stat.ML

    Privacy Threats Against Federated Matrix Factorization

    Authors: Dashan Gao, Ben Tan, Ce Ju, Vincent W. Zheng, Qiang Yang

    Abstract: Matrix Factorization has been very successful in practical recommendation applications and e-commerce. Due to data shortage and stringent regulations, it can be hard to collect sufficient data to build performant recommender systems for a single company. Federated learning provides the possibility to bridge the data silos and build machine learning models without compromising privacy and security.… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: 6 pages, 2 figures, 1 table, Accepted for Workshop on Federated Learning for Data Privacy and Confidentiality in Conjunction with IJCAI 2020 (FL-IJCAI'20)

  44. arXiv:2006.11601  [pdf, other

    cs.LG cs.CR cs.DC stat.ML

    Rethinking Privacy Preserving Deep Learning: How to Evaluate and Thwart Privacy Attacks

    Authors: Lixin Fan, Kam Woh Ng, Ce Ju, Tianyu Zhang, Chang Liu, Chee Seng Chan, Qiang Yang

    Abstract: This paper investigates capabilities of Privacy-Preserving Deep Learning (PPDL) mechanisms against various forms of privacy attacks. First, we propose to quantitatively measure the trade-off between model accuracy and privacy losses incurred by reconstruction, tracing and membership attacks. Second, we formulate reconstruction attacks as solving a noisy system of linear equations, and prove that a… ▽ More

    Submitted 23 June, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: under review, 36 pages (updated Eq. 3 and Fig. 8)

  45. arXiv:2006.10517  [pdf, other

    cs.LG cs.CR

    Privacy-Preserving Technology to Help Millions of People: Federated Prediction Model for Stroke Prevention

    Authors: Ce Ju, Ruihui Zhao, Jichao Sun, Xiguang Wei, Bo Zhao, Yang Liu, Hongshan Li, Tianjian Chen, Xinwei Zhang, Dashan Gao, Ben Tan, Han Yu, Chuning He, Yuan Jin

    Abstract: Prevention of stroke with its associated risk factors has been one of the public health priorities worldwide. Emerging artificial intelligence technology is being increasingly adopted to predict stroke. Because of privacy concerns, patient data are stored in distributed electronic health record (EHR) databases, voluminous clinical datasets, which prevent patient data from being aggregated and rest… ▽ More

    Submitted 14 December, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 4 pages, 3 figures, 1 table, Accepted for Workshop on Federated Learning for Data Privacy and Confidentiality in Conjunction with IJCAI 2020 (FL-IJCAI'20)

    ACM Class: I.2.2

  46. Federated Transfer Learning for EEG Signal Classification

    Authors: Ce Ju, Dashan Gao, Ravikiran Mane, Ben Tan, Yang Liu, Cuntai Guan

    Abstract: The success of deep learning (DL) methods in the Brain-Computer Interfaces (BCI) field for classification of electroencephalographic (EEG) recordings has been restricted by the lack of large datasets. Privacy concerns associated with EEG signals limit the possibility of constructing a large EEG-BCI dataset by the conglomeration of multiple small ones for jointly training machine learning models. H… ▽ More

    Submitted 25 January, 2021; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: 6 pages, 2 figures, Accepted for IEEE Engineering in Medicine and Biology Society (EMBC) 2020 GitHub: https://github.com/DashanGao/Federated-Transfer-Leraning-for-EEG

    ACM Class: I.5.4

    Journal ref: 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Montreal, QC, Canada, 2020, pp. 3040-3045

  47. arXiv:2002.08602  [pdf, other

    cs.RO eess.SY

    A Hybrid Systems-based Hierarchical Control Architecture for Heterogeneous Field Robot Teams

    Authors: Chanyoung Ju, Hyoung Il Son

    Abstract: Field robot systems have recently been applied to a wide range of research fields. Making such systems more automated, advanced, and activated requires cooperation among heterogeneous robots. Classic control theory is inefficient in managing large-scale complex dynamic systems. Therefore, the supervisory control theory based on discrete event system needs to be introduced to overcome this limitati… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: 23pages, 19 figures, submitted for publication

  48. arXiv:2002.07630  [pdf

    math.OC cs.LG eess.SY

    Extending iLQR method with control delay

    Authors: Cheng Ju, Yan Qin, Chunjiang Fu

    Abstract: Iterative linear quadradic regulator(iLQR) has become a benchmark method to deal with nonlinear stochastic optimal control problem. However, it does not apply to delay system. In this paper, we extend the iLQR theory and prove new theorem in case of input signal with fixed delay. Which could be beneficial for machine learning or optimal control application to real time robot or human assistive dev… ▽ More

    Submitted 15 February, 2020; originally announced February 2020.

  49. arXiv:2002.07358  [pdf, other

    cs.CV

    Bottom-Up Temporal Action Localization with Mutual Regularization

    Authors: Peisen Zhao, Lingxi Xie, Chen Ju, Ya Zhang, Yanfeng Wang, Qi Tian

    Abstract: Recently, temporal action localization (TAL), i.e., finding specific action segments in untrimmed videos, has attracted increasing attentions of the computer vision community. State-of-the-art solutions for TAL involves evaluating the frame-level probabilities of three action-indicating phases, i.e. starting, continuing, and ending; and then post-processing these predictions for the final localiza… ▽ More

    Submitted 25 February, 2021; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: Accepted by ECCV2020

  50. arXiv:1910.13367  [pdf, other

    math.NA cs.DM cs.DS

    Derivation and Analysis of Fast Bilinear Algorithms for Convolution

    Authors: Caleb Ju, Edgar Solomonik

    Abstract: The prevalence of convolution in applications within signal processing, deep neural networks, and numerical solvers has motivated the development of numerous fast convolution algorithms. In many of these problems, convolution is performed on terabytes or petabytes of data, so even constant factors of improvement can significantly reduce the computation time. We leverage the formalism of bilinear a… ▽ More

    Submitted 2 July, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: 34 pages, 1 figure, 5 tables

    MSC Class: 65F99; 68W01