Zum Hauptinhalt springen

Showing 1–50 of 54 results for author: Su, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08961  [pdf

    eess.IV cs.CV

    Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT

    Authors: Jie Zheng, Ru Wen, Haiqin Hu, Lina Wei, Kui Su, Wei Chen, Chen Liu, Jun Wang

    Abstract: Existing Masked Image Modeling (MIM) depends on a spatial patch-based masking-reconstruction strategy to perceive objects'features from unlabeled images, which may face two limitations when applied to chest CT: 1) inefficient feature learning due to complex anatomical details presented in CT images, and 2) suboptimal knowledge transfer owing to input disparity between upstream and downstream model… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2405.20071  [pdf

    physics.med-ph cs.LG

    A Staged Approach using Machine Learning and Uncertainty Quantification to Predict the Risk of Hip Fracture

    Authors: Anjum Shaik, Kristoffer Larsen, Nancy E. Lane, Chen Zhao, Kuan-Jui Su, Joyce H. Keyak, Qing Tian, Qiuying Sha, Hui Shen, Hong-Wen Deng, Weihua Zhou

    Abstract: Despite advancements in medical care, hip fractures impose a significant burden on individuals and healthcare systems. This paper focuses on the prediction of hip fracture risk in older and middle-aged adults, where falls and compromised bone quality are predominant factors. We propose a novel staged model that combines advanced imaging and clinical data to improve predictive performance. By using… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 29 pages, 5 figures, 6 tables

  3. arXiv:2404.09622  [pdf, other

    cs.RO cs.AI

    DIDLM:A Comprehensive Multi-Sensor Dataset with Infrared Cameras, Depth Cameras, LiDAR, and 4D Millimeter-Wave Radar in Challenging Scenarios for 3D Mapping

    Authors: WeiSheng Gong, Chen He, KaiJie Su, QingYong Li

    Abstract: This study presents a comprehensive multi-sensor dataset designed for 3D mapping in challenging indoor and outdoor environments. The dataset comprises data from infrared cameras, depth cameras, LiDAR, and 4D millimeter-wave radar, facilitating exploration of advanced perception and mapping techniques. Integration of diverse sensor data enhances perceptual capabilities in extreme conditions such as… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2401.04934  [pdf, ps, other

    cs.MA cs.AI cs.LG

    Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey

    Authors: Jiechuan Jiang, Kefan Su, Zongqing Lu

    Abstract: Cooperative multi-agent reinforcement learning is a powerful tool to solve many real-world cooperative tasks, but restrictions of real-world applications may require training the agents in a fully decentralized manner. Due to the lack of information about other agents, it is challenging to derive algorithms that can converge to the optimal joint policy in a fully decentralized setting. Thus, this… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: The first two authors contribute equally with an alphabetic order

  5. arXiv:2312.07623  [pdf

    cs.CV

    Supervised Contrastive Learning for Fine-grained Chromosome Recognition

    Authors: Ruijia Chang, Suncheng Xiang, Chengyu Zhou, Kui Su, Dahong Qian, Jun Wang

    Abstract: Chromosome recognition is an essential task in karyotyping, which plays a vital role in birth defect diagnosis and biomedical research. However, existing classification methods face significant challenges due to the inter-class similarity and intra-class variation of chromosomes. To address this issue, we propose a supervised contrastive learning strategy that is tailored to train model-agnostic d… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  6. arXiv:2310.07990  [pdf

    q-bio.GN cs.IR cs.LG stat.AP

    Multi-View Variational Autoencoder for Missing Value Imputation in Untargeted Metabolomics

    Authors: Chen Zhao, Kuan-Jui Su, Chong Wu, Xuewei Cao, Qiuying Sha, Wu Li, Zhe Luo, Tian Qin, Chuan Qiu, Lan Juan Zhao, Anqi Liu, Lindong Jiang, Xiao Zhang, Hui Shen, Weihua Zhou, Hong-Wen Deng

    Abstract: Background: Missing data is a common challenge in mass spectrometry-based metabolomics, which can lead to biased and incomplete analyses. The integration of whole-genome sequencing (WGS) data with metabolomics data has emerged as a promising approach to enhance the accuracy of data imputation in metabolomics studies. Method: In this study, we propose a novel method that leverages the information f… ▽ More

    Submitted 12 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 19 pages, 3 figures

  7. arXiv:2309.04960  [pdf, other

    eess.IV cs.CV

    SdCT-GAN: Reconstructing CT from Biplanar X-Rays with Self-driven Generative Adversarial Networks

    Authors: Shuangqin Cheng, Qingliang Chen, Qiyi Zhang, Ming Li, Yamuhanmode Alike, Kaile Su, Pengcheng Wen

    Abstract: Computed Tomography (CT) is a medical imaging modality that can generate more informative 3D images than 2D X-rays. However, this advantage comes at the expense of more radiation exposure, higher costs, and longer acquisition time. Hence, the reconstruction of 3D CT images using a limited number of 2D X-rays has gained significant importance as an economical alternative. Nevertheless, existing met… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  8. YOLIC: An Efficient Method for Object Localization and Classification on Edge Devices

    Authors: Kai Su, Yoichi Tomioka, Qiangfu Zhao, Yong Liu

    Abstract: In the realm of Tiny AI, we introduce ``You Only Look at Interested Cells" (YOLIC), an efficient method for object localization and classification on edge devices. Through seamlessly blending the strengths of semantic segmentation and object detection, YOLIC offers superior computational efficiency and precision. By adopting Cells of Interest for classification instead of individual pixels, YOLIC… ▽ More

    Submitted 30 July, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Journal ref: Image and Vision Computing 147C (2024) 105095

  9. arXiv:2305.13871  [pdf, other

    cs.LG

    Improving Heterogeneous Model Reuse by Density Estimation

    Authors: Anke Tang, Yong Luo, Han Hu, Fengxiang He, Kehua Su, Bo Du, Yixin Chen, Dacheng Tao

    Abstract: This paper studies multiparty learning, aiming to learn a model using the private data of different participants. Model reuse is a promising solution for multiparty learning, assuming that a local model has been trained for each party. Considering the potential sample selection bias among different parties, some heterogeneous model reuse approaches have been developed. However, although pre-traine… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 9 pages, 5 figues. Accepted by IJCAI 2023

  10. arXiv:2305.06594  [pdf, other

    cs.SD cs.CV cs.LG cs.MM eess.AS

    V2Meow: Meowing to the Visual Beat via Video-to-Music Generation

    Authors: Kun Su, Judith Yue Li, Qingqing Huang, Dima Kuzmin, Joonseok Lee, Chris Donahue, Fei Sha, Aren Jansen, Yu Wang, Mauro Verzetti, Timo I. Denk

    Abstract: Video-to-music generation demands both a temporally localized high-quality listening experience and globally aligned video-acoustic signatures. While recent music generation models excel at the former through advanced audio codecs, the exploration of video-acoustic signatures has been confined to specific visual scenarios. In contrast, our research confronts the challenge of learning globally alig… ▽ More

    Submitted 22 February, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: accepted at AAAI 2024, music samples available at https://tinyurl.com/v2meow

  11. arXiv:2303.16897  [pdf, other

    cs.CV cs.LG cs.SD eess.AS

    Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos

    Authors: Kun Su, Kaizhi Qian, Eli Shlizerman, Antonio Torralba, Chuang Gan

    Abstract: Modeling sounds emitted from physical object interactions is critical for immersive perceptual experiences in real and virtual worlds. Traditional methods of impact sound synthesis use physics simulation to obtain a set of physics parameters that could represent and synthesize the sound. However, they require fine details of both the object geometries and impact locations, which are rarely availab… ▽ More

    Submitted 8 July, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. Project page: https://sukun1045.github.io/video-physics-sound-diffusion/

  12. arXiv:2303.02915  [pdf

    cs.CL cs.AI

    GlobalNER: Incorporating Non-local Information into Named Entity Recognition

    Authors: Chiao-Wei Hsu, Keh-Yih Su

    Abstract: Nowadays, many Natural Language Processing (NLP) tasks see the demand for incorporating knowledge external to the local information to further improve the performance. However, there is little related work on Named Entity Recognition (NER), which is one of the foundations of NLP. Specifically, no studies were conducted on the query generation and re-ranking for retrieving the related information f… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 13 pages, 5 figures

  13. arXiv:2302.07450  [pdf, other

    cs.LG cs.CR

    FedABC: Targeting Fair Competition in Personalized Federated Learning

    Authors: Dui Wang, Li Shen, Yong Luo, Han Hu, Kehua Su, Yonggang Wen, Dacheng Tao

    Abstract: Federated learning aims to collaboratively train models without accessing their client's local private data. The data may be Non-IID for different clients and thus resulting in poor performance. Recently, personalized federated learning (PFL) has achieved great success in handling Non-IID data by enforcing regularization in local optimization or improving the model aggregation scheme on the server… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 9 pages,5 figures

    Journal ref: AAAI2023

  14. arXiv:2212.07855  [pdf, other

    cs.CV

    QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query

    Authors: Yabo Xiao, Kai Su, Xiaojuan Wang, Dongdong Yu, Lei Jin, Mingshu He, Zehuan Yuan

    Abstract: We propose a sparse end-to-end multi-person pose regression framework, termed QueryPose, which can directly predict multi-person keypoint sequences from the input image. The existing end-to-end methods rely on dense representations to preserve the spatial detail and structure for precise keypoint localization. However, the dense paradigm introduces complex and redundant post-processes during infer… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Published on NeurIPS 2022

  15. arXiv:2211.03032  [pdf, other

    cs.LG

    Decentralized Policy Optimization

    Authors: Kefan Su, Zongqing Lu

    Abstract: The study of decentralized learning or independent learning in cooperative multi-agent reinforcement learning has a history of decades. Recently empirical studies show that independent PPO (IPPO) can obtain good performance, close to or even better than the methods of centralized training with decentralized execution, in several benchmarks. However, decentralized actor-critic with convergence guar… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

    Comments: 14 pages

  16. arXiv:2210.09084  [pdf, other

    cs.LG cs.CV

    Multi-Agent Automated Machine Learning

    Authors: Zhaozhi Wang, Kefan Su, Jian Zhang, Huizhu Jia, Qixiang Ye, Xiaodong Xie, Zongqing Lu

    Abstract: In this paper, we propose multi-agent automated machine learning (MA2ML) with the aim to effectively handle joint optimization of modules in automated machine learning (AutoML). MA2ML takes each machine learning module, such as data augmentation (AUG), neural architecture search (NAS), or hyper-parameters (HPO), as an agent and the final performance as the reward, to formulate a multi-agent reinfo… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  17. arXiv:2210.04014  [pdf, other

    cs.CV

    AdaptivePose++: A Powerful Single-Stage Network for Multi-Person Pose Regression

    Authors: Yabo Xiao, Xiaojuan Wang, Dongdong Yu, Kai Su, Lei Jin, Mei Song, Shuicheng Yan, Jian Zhao

    Abstract: Multi-person pose estimation generally follows top-down and bottom-up paradigms. Both of them use an extra stage ($\boldsymbol{e.g.,}$ human detection in top-down paradigm or grouping process in bottom-up paradigm) to build the relationship between the human instance and corresponding keypoints, thus leading to the high computation cost and redundant two-stage pipeline. To address the above issue,… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: Submit to IEEE TCSVT; 11 pages. arXiv admin note: text overlap with arXiv:2112.13635

  18. arXiv:2209.12713  [pdf, other

    cs.MA cs.LG

    Multi-Agent Sequential Decision-Making via Communication

    Authors: Ziluo Ding, Kefan Su, Weixin Hong, Liwen Zhu, Tiejun Huang, Zongqing Lu

    Abstract: Communication helps agents to obtain information about others so that better coordinated behavior can be learned. Some existing work communicates predicted future trajectory with others, hoping to get clues about what others would do for better coordination. However, circular dependencies sometimes can occur when agents are treated synchronously so it is hard to coordinate decision-making. In this… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 20 pages

  19. arXiv:2209.08244  [pdf, other

    cs.LG cs.MA

    MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning

    Authors: Kefan Su, Siyuan Zhou, Jiechuan Jiang, Chuang Gan, Xiangjun Wang, Zongqing Lu

    Abstract: Decentralized learning has shown great promise for cooperative multi-agent reinforcement learning (MARL). However, non-stationarity remains a significant challenge in fully decentralized learning. In the paper, we tackle the non-stationarity problem in the simplest and fundamental way and propose multi-agent alternate Q-learning (MA2QL), where agents take turns updating their Q-functions by Q-lear… ▽ More

    Submitted 7 February, 2023; v1 submitted 17 September, 2022; originally announced September 2022.

    Comments: 18 pages

  20. Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries

    Authors: Xiao Liu, Shiyu Zhao, Kai Su, Yukuo Cen, Jiezhong Qiu, Mengdi Zhang, Wei Wu, Yuxiao Dong, Jie Tang

    Abstract: Knowledge graph (KG) embeddings have been a mainstream approach for reasoning over incomplete KGs. However, limited by their inherently shallow and static architectures, they can hardly deal with the rising focus on complex logical queries, which comprise logical operators, imputed edges, multiple source entities, and unknown intermediate entities. In this work, we present the Knowledge Graph Tran… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: kgTransformer; Accepted to KDD 2022

  21. arXiv:2206.01369  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Incremental Learning Meets Transfer Learning: Application to Multi-site Prostate MRI Segmentation

    Authors: Chenyu You, Jinlin Xiang, Kun Su, Xiaoran Zhang, Siyuan Dong, John Onofrey, Lawrence Staib, James S. Duncan

    Abstract: Many medical datasets have recently been created for medical image segmentation tasks, and it is natural to question whether we can use them to sequentially train a single model that (1) performs better on all these datasets, and (2) generalizes well and transfers better to the unknown target site domain. Prior works have achieved this goal by jointly training one model on multi-site datasets, whi… ▽ More

    Submitted 30 July, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  22. arXiv:2110.00304  [pdf, other

    cs.LG cs.AI cs.MA

    Divergence-Regularized Multi-Agent Actor-Critic

    Authors: Kefan Su, Zongqing Lu

    Abstract: Entropy regularization is a popular method in reinforcement learning (RL). Although it has many advantages, it alters the RL objective of the original Markov Decision Process (MDP). Though divergence regularization has been proposed to settle this problem, it cannot be trivially applied to cooperative multi-agent reinforcement learning (MARL). In this paper, we investigate divergence regularizatio… ▽ More

    Submitted 21 June, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: ICML 2022, 24 pages, 10 figures

  23. arXiv:2109.06109  [pdf, other

    cs.CV

    Weakly Supervised Person Search with Region Siamese Networks

    Authors: Chuchu Han, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang, Yi Yang, Changhu Wang

    Abstract: Supervised learning is dominant in person search, but it requires elaborate labeling of bounding boxes and identities. Large-scale labeled training data is often difficult to collect, especially for person identities. A natural question is whether a good person search model can be trained without the need of identity supervision. In this paper, we present a weakly supervised setting where only bou… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted by ICCV 2021

  24. arXiv:2109.00373  [pdf, other

    cs.CV

    Memory Based Video Scene Parsing

    Authors: Zhenchao Jin, Dongdong Yu, Kai Su, Zehuan Yuan, Changhu Wang

    Abstract: Video scene parsing is a long-standing challenging task in computer vision, aiming to assign pre-defined semantic labels to pixels of all frames in a given video. Compared with image semantic segmentation, this task pays more attention on studying how to adopt the temporal information to obtain higher predictive accuracy. In this report, we introduce our solution for the 1st Video Scene Parsing in… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: technical report for "The 1st Video Scene Parsing in the Wild Challenge Workshop". arXiv admin note: text overlap with arXiv:2108.11819

  25. arXiv:2107.02893  [pdf

    cs.CL

    Answering Chinese Elementary School Social Study Multiple Choice Questions

    Authors: Daniel Lee, Chao-Chun Liang, Keh-Yih Su

    Abstract: We present a novel approach to answer the Chinese elementary school Social Study Multiple Choice questions. Although BERT has demonstrated excellent performance on Reading Comprehension tasks, it is found not good at handling some specific types of questions, such as Negation, All-of-the-above, and None-of-the-above. We thus propose a novel framework to cascade BERT with a Pre-Processor and an Ans… ▽ More

    Submitted 26 June, 2021; originally announced July 2021.

    Comments: TAAI-2020

  26. arXiv:2106.15772  [pdf

    cs.AI cs.CL

    A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers

    Authors: Shen-Yun Miao, Chao-Chun Liang, Keh-Yih Su

    Abstract: We present ASDiv (Academia Sinica Diverse MWP Dataset), a diverse (in terms of both language patterns and problem types) English math word problem (MWP) corpus for evaluating the capability of various MWP solvers. Existing MWP corpora for studying AI progress remain limited either in language usage patterns or in problem types. We thus present a new English MWP corpus with 2,305 MWPs that cover mo… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: ACL-2020

  27. arXiv:2106.00990  [pdf, other

    cs.AI

    Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving

    Authors: Shih-hung Tsai, Chao-Chun Liang, Hsin-Min Wang, Keh-Yih Su

    Abstract: With the recent advancements in deep learning, neural solvers have gained promising results in solving math word problems. However, these SOTA solvers only generate binary expression trees that contain basic arithmetic operators and do not explicitly use the math formulas. As a result, the expression trees they produce are lengthy and uninterpretable because they need to use multiple operators and… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: ACL2021

  28. arXiv:2012.03478  [pdf, other

    cs.SD cs.CV eess.AS

    Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements

    Authors: Kun Su, Xiulong Liu, Eli Shlizerman

    Abstract: We propose a novel system that takes as an input body movements of a musician playing a musical instrument and generates music in an unsupervised setting. Learning to generate multi-instrumental music from videos without labeling the instruments is a challenging problem. To achieve the transformation, we built a pipeline named 'Multi-instrumentalistNet' (MI Net). At its base, the pipeline learns a… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: Please see associated video at https://www.youtube.com/watch?v=yo5OZKBbBh4

  29. arXiv:2006.14348  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS eess.IV

    Audeo: Audio Generation for a Silent Performance Video

    Authors: Kun Su, Xiulong Liu, Eli Shlizerman

    Abstract: We present a novel system that gets as an input video frames of a musician playing the piano and generates the music for that video. Generation of music from visual cues is a challenging problem and it is not clear whether it is an attainable goal at all. Our main aim in this work is to explore the plausibility of such a transformation and to identify cues and components able to carry the associat… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Please see associated video at https://www.youtube.com/watch?v=8rS3VgjG7_c

    Journal ref: Advances in neural information processing 2020

  30. arXiv:2006.09874  [pdf

    cs.SI

    Cluster Diffusing Shuffles

    Authors: Kevin Su

    Abstract: Unbiased shuffling algorithms, such as the Fisher-Yates shuffle, are often used for shuffle play in media players. These algorithms treat all items being shuffled equally regardless of how similar the items are to each other. While this may be desirable for many applications, this is problematic for shuffle play due to the clustering illusion, which is the tendency for humans to erroneously consid… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    ACM Class: F.2.2

  31. arXiv:2006.07412  [pdf, other

    cs.LG cs.CV cs.RO q-bio.NC stat.ML

    BI-MAML: Balanced Incremental Approach for Meta Learning

    Authors: Yang Zheng, Jinlin Xiang, Kun Su, Eli Shlizerman

    Abstract: We present a novel Balanced Incremental Model Agnostic Meta Learning system (BI-MAML) for learning multiple tasks. Our method implements a meta-update rule to incrementally adapt its model to new tasks without forgetting old tasks. Such a capability is not possible in current state-of-the-art MAML approaches. These methods effectively adapt to new tasks, however, suffer from 'catastrophic forgetti… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: Please see associated video at: https://youtu.be/4qlb-iG5SFo

  32. arXiv:1911.12409  [pdf, other

    cs.CV cs.LG eess.IV

    PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition

    Authors: Kun Su, Xiulong Liu, Eli Shlizerman

    Abstract: We propose a novel system for unsupervised skeleton-based action recognition. Given inputs of body keypoints sequences obtained during various movements, our system associates the sequences with actions. Our system is based on an encoder-decoder recurrent neural network, where the encoder learns a separable feature representation within its hidden states formed by training the model to perform pre… ▽ More

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: See video at: https://www.youtube.com/watch?v=-dcCFUBRmwE

  33. arXiv:1911.07938  [pdf, ps, other

    cs.CV

    Towards Good Practices for Multi-Person Pose Estimation

    Authors: Dongdong Yu, Kai Su, Changhu Wang

    Abstract: Multi-Person Pose Estimation is an interesting yet challenging task in computer vision. In this paper, we conduct a series of refinements with the MSPN and PoseFix Networks, and empirically evaluate their impact on the final model performance through ablation studies. By taking all the refinements, we achieve 78.7 on the COCO test-dev dataset and 76.3 on the COCO test-challenge dataset.

    Submitted 27 October, 2019; originally announced November 2019.

  34. arXiv:1909.13583  [pdf, other

    cs.CV

    Towards Good Practices for Video Object Segmentation

    Authors: Dongdong Yu, Kai Su, Hengkai Guo, Jian Wang, Kaihui Zhou, Yuanyuan Huang, Minghui Dong, Jie Shao, Changhu Wang

    Abstract: Semi-supervised video object segmentation is an interesting yet challenging task in machine learning. In this work, we conduct a series of refinements with the propagation-based video object segmentation method and empirically evaluate their impact on the final model performance through ablation study. By taking all the refinements, we improve the space-time memory networks to achieve a Overall of… ▽ More

    Submitted 30 September, 2019; originally announced September 2019.

  35. arXiv:1905.12176  [pdf, other

    cs.LG eess.SP q-bio.NC stat.ML

    Clustering and Recognition of Spatiotemporal Features through Interpretable Embedding of Sequence to Sequence Recurrent Neural Networks

    Authors: Kun Su, Eli Shlizerman

    Abstract: Encoder-decoder recurrent neural network models (RNN Seq2Seq) have achieved great success in ubiquitous areas of computation and applications. It was shown to be successful in modeling data with both temporal and spatial dependencies for translation or prediction tasks. In this study, we propose an embedding approach to visualize and interpret the representation of data by these models. Furthermor… ▽ More

    Submitted 31 January, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  36. arXiv:1905.05355  [pdf, other

    cs.CV

    A Context-and-Spatial Aware Network for Multi-Person Pose Estimation

    Authors: Dongdong Yu, Kai Su, Xin Geng, Changhu Wang

    Abstract: Multi-person pose estimation is a fundamental yet challenging task in computer vision. Both rich context information and spatial information are required to precisely locate the keypoints for all persons in an image. In this paper, a novel Context-and-Spatial Aware Network (CSANet), which integrates both a Context Aware Path and Spatial Aware Path, is proposed to obtain effective features involvin… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

  37. arXiv:1905.03466  [pdf, other

    cs.CV

    Multi-Person Pose Estimation with Enhanced Channel-wise and Spatial Information

    Authors: Kai Su, Dongdong Yu, Zhenqi Xu, Xin Geng, Changhu Wang

    Abstract: Multi-person pose estimation is an important but challenging problem in computer vision. Although current approaches have achieved significant progress by fusing the multi-scale feature maps, they pay little attention to enhancing the channel-wise and spatial information of the feature maps. In this paper, we propose two novel modules to perform the enhancement of the information for the multi-per… ▽ More

    Submitted 9 May, 2019; originally announced May 2019.

    Comments: Accepted by CVPR 2019

  38. arXiv:1903.07025  [pdf

    cs.ET

    VeriSFQ - A Semi-formal Verification Framework and Benchmark for Single Flux Quantum Technology

    Authors: Alvin D. Wong, Kevin Su, Hang Sun, Arash Fayyazi, Massoud Pedram, Shahin Nazarian

    Abstract: In this paper, we propose a semi-formal verification framework for single-flux quantum (SFQ) circuits called VeriSFQ, using the Universal Verification Methodology (UVM) standard. The considered SFQ technology is superconducting digital electronic devices that operate at cryogenic temperatures with active circuit elements called the Josephson junction, which operate at high switching speeds and low… ▽ More

    Submitted 17 March, 2019; originally announced March 2019.

    Comments: 7 pages, 6 figures, 4 tables; submitted, accepted, and presented at ISQED 2019 (20th International Symposium on Quality Electronic Design) on March 7th, 2019 in Santa Clara, CA, USA

  39. arXiv:1808.09907  [pdf, other

    cs.LG stat.ML

    Dropout with Tabu Strategy for Regularizing Deep Neural Networks

    Authors: Zongjie Ma, Abdul Sattar, Jun Zhou, Qingliang Chen, Kaile Su

    Abstract: Dropout has proven to be an effective technique for regularization and preventing the co-adaptation of neurons in deep neural networks (DNN). It randomly drops units with a probability $p$ during the training stage of DNN. Dropout also provides a way of approximately combining exponentially many different neural network architectures efficiently. In this work, we add a diversification strategy int… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

  40. arXiv:1808.01308   

    cs.GR

    The Normal Map Based on Area-Preserving Parameterization

    Authors: Hui Zhao, Kehua Su, Ming Ma, Na Lei, Li Cui, Xianfeng Gu

    Abstract: In this paper, we present an approach to enhance and improve the current normal map rendering technique. Our algorithm is based on semi-discrete Optimal Mass Transportation (OMT) theory and has a solid theoretical base. The key difference from previous normal map method is that we preserve the local area when we unwrap a disk-like 3D surface onto 2D plane. Compared to the currently used techniques… ▽ More

    Submitted 21 April, 2020; v1 submitted 14 July, 2018; originally announced August 2018.

    Comments: we need update it

  41. arXiv:1804.08187  [pdf, ps, other

    cs.AI

    Advancing Tabu and Restart in Local Search for Maximum Weight Cliques

    Authors: Yi Fan, Nan Li, Chengqian Li, Zongjie Ma, Longin Jan Latecki, Kaile Su

    Abstract: The tabu and restart are two fundamental strategies for local search. In this paper, we improve the local search algorithms for solving the Maximum Weight Clique (MWC) problem by introducing new tabu and restart strategies. Both the tabu and restart strategies proposed are based on the notion of a local search scenario, which involves not only a candidate solution but also the tabu status and unlo… ▽ More

    Submitted 22 April, 2018; originally announced April 2018.

  42. arXiv:1803.06064  [pdf

    cs.AI cs.CL

    A Meaning-based Statistical English Math Word Problem Solver

    Authors: Chao-Chun Liang, Yu-Shiang Wong, Yi-Chung Lin, Keh-Yih Su

    Abstract: We introduce MeSys, a meaning-based approach, for solving English math word problems (MWPs) via understanding and reasoning in this paper. It first analyzes the text, transforms both body and question parts into their corresponding logic forms, and then performs inference on them. The associated context of each quantity is represented with proposed role-tags (e.g., nsubj, verb, etc.), which provid… ▽ More

    Submitted 5 July, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: Accepted as a long paper at NAACL HLT 2018

  43. Trainable back-propagated functional transfer matrices

    Authors: Cheng-Hao Cai, Yanyan Xu, Dengfeng Ke, Kaile Su, Jing Sun

    Abstract: Connections between nodes of fully connected neural networks are usually represented by weight matrices. In this article, functional transfer matrices are introduced as alternatives to the weight matrices: Instead of using real weights, a functional transfer matrix uses real functions with trainable parameters to represent connections between nodes. Multiple functional transfer matrices are then s… ▽ More

    Submitted 28 October, 2017; originally announced October 2017.

    Comments: 39 pages, 4 figures, submitted as a journal article

    Journal ref: Appl. Intell. (2018)

  44. arXiv:1710.05488  [pdf, other

    cs.LG stat.ML

    A Geometric View of Optimal Transportation and Generative Model

    Authors: Na Lei, Kehua Su, Li Cui, Shing-Tung Yau, David Xianfeng Gu

    Abstract: In this work, we show the intrinsic relations between optimal transportation and convex geometry, especially the variational approach to solve Alexandrov problem: constructing a convex polytope with prescribed face normals and volumes. This leads to a geometric interpretation to generative models, and leads to a novel framework for generative models. By using the optimal transportation view of GAN… ▽ More

    Submitted 18 December, 2017; v1 submitted 15 October, 2017; originally announced October 2017.

  45. Learning of Human-like Algebraic Reasoning Using Deep Feedforward Neural Networks

    Authors: Cheng-Hao Cai, Dengfeng Ke, Yanyan Xu, Kaile Su

    Abstract: There is a wide gap between symbolic reasoning and deep learning. In this research, we explore the possibility of using deep learning to improve symbolic reasoning. Briefly, in a reasoning system, a deep feedforward neural network is used to guide rewriting processes after learning from algebraic reasoning examples produced by humans. To enable the neural network to recognise patterns of algebraic… ▽ More

    Submitted 24 April, 2017; originally announced April 2017.

    Comments: 8 pages, 7 figures

    ACM Class: I.2.0; I.2.3; I.2.4; I.2.6; I.2.8; I.5.0; I.5.1; I.5.2; I.5.4; F.4.1

  46. arXiv:1605.07705  [pdf, ps, other

    cs.MM cs.NI

    Understanding Content Placement Strategies in Smartrouter-based Peer CDN for Video Streaming

    Authors: Ming Ma, Zhi Wang, Ke Su, Lifeng Sun

    Abstract: Recent years have witnessed a new video delivery paradigm: smartrouter-based peer video content delivery network, which is enabled by smartrouters deployed at users' homes. ChinaCache (one of the largest CDN providers in China) and Youku (a video provider using smartrouters to assist video delivery) announced their cooperation in 2015, to create a new paradigm of content delivery based on househol… ▽ More

    Submitted 25 May, 2016; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: arXiv admin note: text overlap with arXiv:1605.07704

  47. arXiv:1605.07704  [pdf, ps, other

    cs.MM cs.NI

    Understanding the Smartrouter-based Peer CDN for Video Streaming

    Authors: Ming Ma, Zhi Wang, Ke Su, Lifeng Sun

    Abstract: Recent years have witnessed a new video delivery paradigm: smartrouter-based video delivery network, which is enabled by smartrouters deployed at users' homes, together with the conventional video servers deployed in the datacenters. Recently, ChinaCache, a large content delivery network (CDN) provider, and Youku, a video service provider using smartrouters to assist video delivery, announced thei… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

  48. arXiv:1604.05086  [pdf, ps, other

    cs.AI cs.CC cs.LO cs.MA

    Normative Multiagent Systems: A Dynamic Generalization

    Authors: Xiaowei Huang, Ji Ruan, Qingliang Chen, Kaile Su

    Abstract: Social norms are powerful formalism in coordinating autonomous agents' behaviour to achieve certain objectives. In this paper, we propose a dynamic normative system to enable the reasoning of the changes of norms under different circumstances, which cannot be done in the existing static normative systems. We study two important problems (norm synthesis and norm recognition) related to the autonomy… ▽ More

    Submitted 18 April, 2016; originally announced April 2016.

    Comments: 26 pages. A conference version of this work is accepted by the 25th International Joint Conference on Artificial Intelligence (IJCAI-16)

    ACM Class: I.2.11; I.2.4

  49. arXiv:1410.2662  [pdf, ps, other

    cs.NI

    Evaluating Opportunistic Delivery of Large Content with TCP over WiFi in I2V Communication

    Authors: Shreyasee Mukherjee, Kai Su, Narayan B. Mandayam, K. K. Ramakrishnan, Dipankar Raychaudhuri, Ivan Seskar

    Abstract: With the increasing interest in connected vehicles, it is useful to evaluate the capability of delivering large content over a WiFi infrastructure to vehicles. The throughput achieved over WiFi channels can be highly variable and also rapidly degrades as the distance from the access point increases. While this behavior is well understood at the data link layer, the interactions across the various… ▽ More

    Submitted 9 October, 2014; originally announced October 2014.

  50. arXiv:1402.0584  [pdf

    cs.AI cs.DS

    NuMVC: An Efficient Local Search Algorithm for Minimum Vertex Cover

    Authors: Shaowei Cai, Kaile Su, Chuan Luo, Abdul Sattar

    Abstract: The Minimum Vertex Cover (MVC) problem is a prominent NP-hard combinatorial optimization problem of great importance in both theory and application. Local search has proved successful for this problem. However, there are two main drawbacks in state-of-the-art MVC local search algorithms. First, they select a pair of vertices to exchange simultaneously, which is time-consuming. Secondly, although u… ▽ More

    Submitted 3 February, 2014; originally announced February 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 46, pages 687-716, 2013