Zum Hauptinhalt springen

Showing 1–46 of 46 results for author: Si, Z

Searching in archive cs. Search in all archives.
.
  1. TWIN V2: Scaling Ultra-Long User Behavior Sequence Modeling for Enhanced CTR Prediction at Kuaishou

    Authors: Zihua Si, Lin Guan, ZhongXiang Sun, Xiaoxue Zang, Jing Lu, Yiqun Hui, Xingchao Cao, Zeyu Yang, Yichen Zheng, Dewei Leng, Kai Zheng, Chenbin Zhang, Yanan Niu, Yang Song, Kun Gai

    Abstract: The significance of modeling long-term user interests for CTR prediction tasks in large-scale recommendation systems is progressively gaining attention among researchers and practitioners. Existing work, such as SIM and TWIN, typically employs a two-stage approach to model long-term user behavior sequences for efficiency concerns. The first stage rapidly retrieves a subset of sequences related to… ▽ More

    Submitted 16 August, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted by CIKM 2024

  2. arXiv:2406.14913  [pdf, other

    physics.soc-ph cs.MA

    Cooperative bots exhibit nuanced effects on cooperation across strategic frameworks

    Authors: Zehua Si, Zhixue He, Chen Shen, Jun Tanimoto

    Abstract: The positive impact of cooperative bots on cooperation within evolutionary game theory is well documented; however, existing studies have predominantly used discrete strategic frameworks, focusing on deterministic actions with a fixed probability of one. This paper extends the investigation to continuous and mixed strategic approaches. Continuous strategies employ intermediate probabilities to con… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2405.18804  [pdf, other

    cs.RO

    Tilde: Teleoperation for Dexterous In-Hand Manipulation Learning with a DeltaHand

    Authors: Zilin Si, Kevin Lee Zhang, Zeynep Temel, Oliver Kroemer

    Abstract: Dexterous robotic manipulation remains a challenging domain due to its strict demands for precision and robustness on both hardware and software. While dexterous robotic hands have demonstrated remarkable capabilities in complex tasks, efficiently learning adaptive control policies for hands still presents a significant hurdle given the high dimensionalities of hands and tasks. To bridge this gap,… ▽ More

    Submitted 21 August, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2404.13146  [pdf, other

    cs.CR cs.CV

    DeepFake-O-Meter v2.0: An Open Platform for DeepFake Detection

    Authors: Yan Ju, Chengzhe Sun, Shan Jia, Shuwei Hou, Zhaofeng Si, Soumyya Kanti Datta, Lipeng Ke, Riky Zhou, Anita Nikolich, Siwei Lyu

    Abstract: Deepfakes, as AI-generated media, have increasingly threatened media integrity and personal privacy with realistic yet fake digital content. In this work, we introduce an open-source and user-friendly online platform, DeepFake-O-Meter v2.0, that integrates state-of-the-art methods for detecting Deepfake images, videos, and audio. Built upon DeepFake-O-Meter v1.0, we have made significant upgrades… ▽ More

    Submitted 27 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  5. arXiv:2404.09520  [pdf, other

    cs.IR

    UniSAR: Modeling User Transition Behaviors between Search and Recommendation

    Authors: Teng Shi, Zihua Si, Jun Xu, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Dewei Leng, Yanan Niu, Yang Song

    Abstract: Nowadays, many platforms provide users with both search and recommendation services as important tools for accessing information. The phenomenon has led to a correlation between user search and recommendation behaviors, providing an opportunity to model user interests in a fine-grained way. Existing approaches either model user search and recommendation behaviors separately or overlook the differe… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGIR 2024

  6. arXiv:2404.03267  [pdf, other

    cs.IR

    To Search or to Recommend: Predicting Open-App Motivation with Neural Hawkes Process

    Authors: Zhongxiang Sun, Zihua Si, Xiao Zhang, Xiaoxue Zang, Yang Song, Hongteng Xu, Jun Xu

    Abstract: Incorporating Search and Recommendation (S&R) services within a singular application is prevalent in online platforms, leading to a new task termed open-app motivation prediction, which aims to predict whether users initiate the application with the specific intent of information searching, or to explore recommended content for entertainment. Studies have shown that predicting users' motivation to… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGIR 2024

  7. arXiv:2403.17688  [pdf, other

    cs.IR

    Large Language Models Enhanced Collaborative Filtering

    Authors: Zhongxiang Sun, Zihua Si, Xiaoxue Zang, Kai Zheng, Yang Song, Xiao Zhang, Jun Xu

    Abstract: Recent advancements in Large Language Models (LLMs) have attracted considerable interest among researchers to leverage these models to enhance Recommender Systems (RSs). Existing work predominantly utilizes LLMs to generate knowledge-rich texts or utilizes LLM-derived embeddings as features to improve RSs. Although the extensive world knowledge embedded in LLMs generally benefits RSs, the applicat… ▽ More

    Submitted 23 July, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by CIKM 2024

  8. arXiv:2403.14174  [pdf, other

    cs.CV

    Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding

    Authors: Jingjing Hu, Dan Guo, Kun Li, Zhan Si, Xun Yang, Xiaojun Chang, Meng Wang

    Abstract: Inspired by the activity-silent and persistent activity mechanisms in human visual perception biology, we design a Unified Static and Dynamic Network (UniSDNet), to learn the semantic association between the video and text/audio queries in a cross-modal environment for efficient video grounding. For static modeling, we devise a novel residual structure (ResMLP) to boost the global comprehensive in… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  9. arXiv:2403.08716  [pdf, other

    cs.RO

    DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation

    Authors: Zilin Si, Gu Zhang, Qingwei Ben, Branden Romero, Zhou Xian, Chao Liu, Chuang Gan

    Abstract: We introduce DIFFTACTILE, a physics-based differentiable tactile simulation system designed to enhance robotic manipulation with dense and physically accurate tactile feedback. In contrast to prior tactile simulators which primarily focus on manipulating rigid bodies and often rely on simplified approximations to model stress and deformations of materials in contact, DIFFTACTILE emphasizes physics… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  10. Explicitly Integrating Judgment Prediction with Legal Document Retrieval: A Law-Guided Generative Approach

    Authors: Weicong Qin, Zelin Cao, Weijie Yu, Zihua Si, Sirui Chen, Jun Xu

    Abstract: Legal document retrieval and judgment prediction are crucial tasks in intelligent legal systems. In practice, determining whether two documents share the same judgments is essential for establishing their relevance in legal retrieval. However, existing legal retrieval studies either ignore the vital role of judgment prediction or rely on implicit training objectives, expecting a proper alignment o… ▽ More

    Submitted 15 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by SIGIR'2024

  11. arXiv:2312.08862  [pdf, other

    cs.IT eess.SP

    Semantics-Division Duplexing: A Novel Full-Duplex Paradigm

    Authors: Kai Niu, Zijian Liang, Chao Dong, Jincheng Dai, Zhongwei Si, Ping Zhang

    Abstract: In-band full-duplex (IBFD) is a theoretically effective solution to increase the overall throughput for the future wireless communications system by enabling transmission and reception over the same time-frequency resources. However, reliable source reconstruction remains a great challenge in the practical IBFD systems due to the non-ideal elimination of the self-interference and the inherent limi… ▽ More

    Submitted 1 August, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures, Accepted by IEEE Wireless Communications Magazine

  12. arXiv:2310.15329  [pdf, other

    cs.LG cs.AI

    Serverless Federated Learning with flwr-serverless

    Authors: Sanjeev V. Namjoshi, Reese Green, Krishi Sharma, Zhangzhang Si

    Abstract: Federated learning is becoming increasingly relevant and popular as we witness a surge in data collection and storage of personally identifiable information. Alongside these developments there have been many proposals from governments around the world to provide more protections for individuals' data and a heightened interest in data privacy measures. As deep learning continues to become more rele… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Technical report for an open source machine learning python package

  13. arXiv:2310.05266  [pdf, other

    cs.RO

    DELTAHANDS: A Synergistic Dexterous Hand Framework Based on Delta Robots

    Authors: Zilin Si, Kevin Zhang, Oliver Kroemer, F. Zeynep Temel

    Abstract: Dexterous robotic manipulation in unstructured environments can aid in everyday tasks such as cleaning and caretaking. Anthropomorphic robotic hands are highly dexterous and theoretically well-suited for working in human domains, but their complex designs and dynamics often make them difficult to control. By contrast, parallel-jaw grippers are easy to control and are used extensively in industrial… ▽ More

    Submitted 24 December, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  14. arXiv:2309.13375  [pdf, other

    cs.IR

    Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Learning

    Authors: Zihua Si, Zhongxiang Sun, Jiale Chen, Guozhang Chen, Xiaoxue Zang, Kai Zheng, Yang Song, Xiao Zhang, Jun Xu, Kun Gai

    Abstract: The retrieval phase is a vital component in recommendation systems, requiring the model to be effective and efficient. Recently, generative retrieval has become an emerging paradigm for document retrieval, showing notable performance. These methods enjoy merits like being end-to-end differentiable, suggesting their viability in recommendation. However, these methods fall short in efficiency and ef… ▽ More

    Submitted 7 July, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: 9 main pages

  15. KuaiSAR: A Unified Search And Recommendation Dataset

    Authors: Zhongxiang Sun, Zihua Si, Xiaoxue Zang, Dewei Leng, Yanan Niu, Yang Song, Xiao Zhang, Jun Xu

    Abstract: The confluence of Search and Recommendation (S&R) services is vital to online services, including e-commerce and video platforms. The integration of S&R modeling is a highly intuitive approach adopted by industry practitioners. However, there is a noticeable lack of research conducted in this area within academia, primarily due to the absence of publicly available datasets. Consequently, a substan… ▽ More

    Submitted 13 August, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: CIKM 2023 resource track

    Report number: 5407--5411

    Journal ref: CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management October 2023

  16. When Search Meets Recommendation: Learning Disentangled Search Representation for Recommendation

    Authors: Zihua Si, Zhongxiang Sun, Xiao Zhang, Jun Xu, Xiaoxue Zang, Yang Song, Kun Gai, Ji-Rong Wen

    Abstract: Modern online service providers such as online shopping platforms often provide both search and recommendation (S&R) services to meet different user needs. Rarely has there been any effective means of incorporating user behavior data from both S&R services. Most existing approaches either simply treat S&R behaviors separately, or jointly optimize them by aggregating data from both services, ignori… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accecpted by SIGIR 2023

  17. Uncovering ChatGPT's Capabilities in Recommender Systems

    Authors: Sunhao Dai, Ninglu Shao, Haiyuan Zhao, Weijie Yu, Zihua Si, Chen Xu, Zhongxiang Sun, Xiao Zhang, Jun Xu

    Abstract: The debut of ChatGPT has recently attracted the attention of the natural language processing (NLP) community and beyond. Existing studies have demonstrated that ChatGPT shows significant improvement in a range of downstream NLP tasks, but the capabilities and limitations of ChatGPT in terms of recommendations remain unclear. In this study, we aim to conduct an empirical analysis of ChatGPT's recom… ▽ More

    Submitted 24 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted by RecSys 2023

  18. arXiv:2303.14637  [pdf, other

    eess.SP cs.MM

    Improved Nonlinear Transform Source-Channel Coding to Catalyze Semantic Communications

    Authors: Sixian Wang, Jincheng Dai, Xiaoqi Qin, Zhongwei Si, Kai Niu, Ping Zhang

    Abstract: Recent deep learning methods have led to increased interest in solving high-efficiency end-to-end transmission problems. These methods, we call nonlinear transform source-channel coding (NTSCC), extract the semantic latent features of source signal, and learn entropy model to guide the joint source-channel coding with variable rate to transmit latent features over wireless channels. In this paper,… ▽ More

    Submitted 18 August, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  19. A Golden Decade of Polar Codes: From Basic Principle to 5G Applications

    Authors: Kai Niu, Ping Zhang, Jincheng Dai, Zhongwei Si, Chao Dong

    Abstract: After the pursuit of seventy years, the invention of polar codes indicates that we have found the first capacity-achieving coding with low complexity construction and decoding, which is the great breakthrough of the coding theory in the past two decades. In this survey, we retrospect the history of polar codes and summarize the advancement in the past ten years. First, the primary principle of cha… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 29 pages, 21 figures, Published in China Communications

    Journal ref: China Communications, vol.20, no. 2, pp. 94-121, 2023

  20. arXiv:2303.02858  [pdf, other

    cs.RO

    RobotSweater: Scalable, Generalizable, and Customizable Machine-Knitted Tactile Skins for Robots

    Authors: Zilin Si, Tianhong Catherine Yu, Katrene Morozov, James McCann, Wenzhen Yuan

    Abstract: Tactile sensing is essential for robots to perceive and react to the environment. However, it remains a challenge to make large-scale and flexible tactile skins on robots. Industrial machine knitting provides solutions to manufacture customizable fabrics. Along with functional yarns, it can produce highly customizable circuits that can be made into tactile skins for robots. In this work, we presen… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

  21. arXiv:2212.10006  [pdf, other

    cs.LG cs.CR

    Multi-head Uncertainty Inference for Adversarial Attack Detection

    Authors: Yuqi Yang, Songyun Yang, Jiyang Xie. Zhongwei Si, Kai Guo, Ke Zhang, Kongming Liang

    Abstract: Deep neural networks (DNNs) are sensitive and susceptible to tiny perturbation by adversarial attacks which causes erroneous predictions. Various methods, including adversarial defense and uncertainty inference (UI), have been developed in recent years to overcome the adversarial attacks. In this paper, we propose a multi-head uncertainty inference (MH-UI) framework for detecting adversarial attac… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  22. arXiv:2211.04339  [pdf, other

    cs.IT cs.LG eess.SP

    Toward Adaptive Semantic Communications: Efficient Data Transmission via Online Learned Nonlinear Transform Source-Channel Coding

    Authors: Jincheng Dai, Sixian Wang, Ke Yang, Kailin Tan, Xiaoqi Qin, Zhongwei Si, Kai Niu, Ping Zhang

    Abstract: The emerging field semantic communication is driving the research of end-to-end data transmission. By utilizing the powerful representation ability of deep learning models, learned data transmission schemes have exhibited superior performance than the established source and channel coding methods. While, so far, research efforts mainly concentrated on architecture and model improvements toward a s… ▽ More

    Submitted 24 May, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted by IEEE JSAC

  23. arXiv:2210.14210  [pdf, other

    cs.RO

    MidasTouch: Monte-Carlo inference over distributions across sliding touch

    Authors: Sudharshan Suresh, Zilin Si, Stuart Anderson, Michael Kaess, Mustafa Mukadam

    Abstract: We present MidasTouch, a tactile perception system for online global localization of a vision-based touch sensor sliding on an object surface. This framework takes in posed tactile images over time, and outputs an evolving distribution of sensor pose on the object's surface, without the need for visual priors. Our key insight is to estimate local surface geometry with tactile sensing, learn a comp… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted at CoRL 2022 (Oral). Project website: https://suddhu.github.io/midastouch-tactile/

  24. arXiv:2210.06719  [pdf, other

    cs.LG cs.AI

    Reward Imputation with Sketching for Contextual Batched Bandits

    Authors: Xiao Zhang, Ninglu Shao, Zihua Si, Jun Xu, Wenhan Wang, Hanjing Su, Ji-Rong Wen

    Abstract: Contextual batched bandit (CBB) is a setting where a batch of rewards is observed from the environment at the end of each episode, but the rewards of the non-executed actions are unobserved, resulting in partial-information feedback. Existing approaches for CBB often ignore the rewards of the non-executed actions, leading to underutilization of feedback information. In this paper, we propose an ef… ▽ More

    Submitted 7 October, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2023

    ACM Class: I.2.6

  25. arXiv:2208.02885  [pdf, other

    cs.RO

    Grasp Stability Prediction with Sim-to-Real Transfer from Tactile Sensing

    Authors: Zilin Si, Zirui Zhu, Arpit Agarwal, Stuart Anderson, Wenzhen Yuan

    Abstract: Robot simulation has been an essential tool for data-driven manipulation tasks. However, most existing simulation frameworks lack either efficient and accurate models of physical interactions with tactile sensors or realistic tactile simulation. This makes the sim-to-real transfer for tactile-based manipulation tasks still challenging. In this work, we integrate simulation of robot dynamics and vi… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

  26. arXiv:2208.02481  [pdf, ps, other

    cs.IT cs.AI cs.LG

    Communication Beyond Transmitting Bits: Semantics-Guided Source and Channel Coding

    Authors: Jincheng Dai, Ping Zhang, Kai Niu, Sixian Wang, Zhongwei Si, Xiaoqi Qin

    Abstract: Classical communication paradigms focus on accurately transmitting bits over a noisy channel, and Shannon theory provides a fundamental theoretical limit on the rate of reliable communications. In this approach, bits are treated equally, and the communication system is oblivious to what meaning these bits convey or how they would be used. Future communications towards intelligence and conciseness… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: IEEE Wireless Communications, text overlap with arXiv:2112.03093

  27. arXiv:2205.13129  [pdf, other

    cs.CV cs.IT

    Wireless Deep Video Semantic Transmission

    Authors: Sixian Wang, Jincheng Dai, Zijian Liang, Kai Niu, Zhongwei Si, Chao Dong, Xiaoqi Qin, Ping Zhang

    Abstract: In this paper, we design a new class of high-efficiency deep joint source-channel coding methods to achieve end-to-end video transmission over wireless channels. The proposed methods exploit nonlinear transform and conditional coding architecture to adaptively extract semantic features across video frames, and transmit semantic feature domain representations over wireless channels via deep joint s… ▽ More

    Submitted 2 November, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: published in IEEE JSAC

  28. arXiv:2205.13120  [pdf, ps, other

    cs.CV cs.IT

    Perceptual Learned Source-Channel Coding for High-Fidelity Image Semantic Transmission

    Authors: Jun Wang, Sixian Wang, Jincheng Dai, Zhongwei Si, Dekun Zhou, Kai Niu

    Abstract: As one novel approach to realize end-to-end wireless image semantic transmission, deep learning-based joint source-channel coding (deep JSCC) method is emerging in both deep learning and communication communities. However, current deep JSCC image transmission systems are typically optimized for traditional distortion metrics such as peak signal-to-noise ratio (PSNR) or multi-scale structural simil… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  29. arXiv:2205.03602  [pdf, other

    cs.CV cs.AI

    Automatic Block-wise Pruning with Auxiliary Gating Structures for Deep Convolutional Neural Networks

    Authors: Zhaofeng Si, Honggang Qi, Xiaoyu Song

    Abstract: Convolutional neural networks are prevailing in deep learning tasks. However, they suffer from massive cost issues when working on mobile devices. Network pruning is an effective method of model compression to handle such problems. This paper presents a novel structured network pruning method with auxiliary gating structures which assigns importance marks to blocks in backbone network as a criteri… ▽ More

    Submitted 7 May, 2022; originally announced May 2022.

    Comments: 7 pages, 7 figures, 2 tables

  30. arXiv:2204.02389  [pdf, other

    cs.CV cs.LG cs.RO cs.SD eess.AS

    ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer

    Authors: Ruohan Gao, Zilin Si, Yen-Yu Chang, Samuel Clarke, Jeannette Bohg, Li Fei-Fei, Wenzhen Yuan, Jiajun Wu

    Abstract: Objects play a crucial role in our everyday activities. Though multisensory object-centric learning has shown great potential lately, the modeling of objects in prior work is rather unrealistic. ObjectFolder 1.0 is a recent dataset that introduces 100 virtualized objects with visual, acoustic, and tactile sensory data. However, the dataset is small in scale and the multisensory data is of limited… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: In CVPR 2022. Gao, Si, and Chang contributed equally to this work. Project page: https://ai.stanford.edu/~rhgao/objectfolder2.0/

  31. arXiv:2203.06692  [pdf, other

    cs.IT

    Towards Semantic Communications: A Paradigm Shift

    Authors: Kai Niu, Jincheng Dai, Shengshi Yao, Sixian Wang, Zhongwei Si, Xiaoqi Qin, Ping Zhang

    Abstract: The last seventy years have witnessed the transition of communication from Shannon's theoretical concept to current high-efficient practical systems. Classical communication systems address the capability-deficiency issue mainly by module-stacking and technique-densification with ever-increasing complexity. In such a traditional viewpoint, classical source coding only uses explicit probabilistic m… ▽ More

    Submitted 30 March, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

  32. A Model-Agnostic Causal Learning Framework for Recommendation using Search Data

    Authors: Zihua Si, Xueran Han, Xiao Zhang, Jun Xu, Yue Yin, Yang Song, Ji-Rong Wen

    Abstract: Machine-learning based recommender systems(RSs) has become an effective means to help people automatically discover their interests. Existing models often represent the rich information for recommendation, such as items, users, and contexts, as embedding vectors and leverage them to predict users' feedback. In the view of causal analysis, the associations between these embedding vectors and users'… ▽ More

    Submitted 4 June, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: accepted by The Web Conference 2022

    ACM Class: H.3.3

  33. arXiv:2112.10961  [pdf, other

    cs.IT cs.CV cs.LG

    Nonlinear Transform Source-Channel Coding for Semantic Communications

    Authors: Jincheng Dai, Sixian Wang, Kailin Tan, Zhongwei Si, Xiaoqi Qin, Kai Niu, Ping Zhang

    Abstract: In this paper, we propose a class of high-efficiency deep joint source-channel coding methods that can closely adapt to the source distribution under the nonlinear transform, it can be collected under the name nonlinear transform source-channel coding (NTSCC). In the considered model, the transmitter first learns a nonlinear analysis transform to map the source data into latent space, then transmi… ▽ More

    Submitted 2 November, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: published in IEEE JSAC

  34. arXiv:2112.03093  [pdf, ps, other

    cs.IT

    Communication Beyond Transmitting Bits: Semantics-Guided Source and Channel Coding

    Authors: Jincheng Dai, Ping Zhang, Kai Niu, Sixian Wang, Zhongwei Si, Xiaoqi Qin

    Abstract: Classical communication paradigms focus on accurately transmitting bits over a noisy channel, and Shannon theory provides a fundamental theoretical limit on the rate of reliable communications. In this approach, bits are treated equally, and the communication system is oblivious to what meaning these bits convey or how they would be used. Future communications towards intelligence and conciseness… ▽ More

    Submitted 1 June, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  35. arXiv:2110.12224  [pdf, other

    cs.IT eess.SP

    Generalized Polarization Transform: A Novel Coded Transmission Paradigm

    Authors: Bolin Wu, Jincheng Dai, Kai Niu, Zhongwei Si, Ping Zhang, Sen Wang, Yifei Yuan, Chih-Lin I

    Abstract: For the upcoming 6G wireless networks, a new wave of applications and services will demand ultra-high data rates and reliability. To this end, future wireless systems are expected to pave the way for entirely new fundamental air interface technologies to attain a breakthrough in spectrum efficiency (SE). This article discusses a new paradigm, named generalized polarization transform (GPT), to achi… ▽ More

    Submitted 27 April, 2022; v1 submitted 23 October, 2021; originally announced October 2021.

  36. arXiv:2109.09884  [pdf, other

    cs.RO

    ShapeMap 3-D: Efficient shape mapping through dense touch and vision

    Authors: Sudharshan Suresh, Zilin Si, Joshua G. Mangelson, Wenzhen Yuan, Michael Kaess

    Abstract: Knowledge of 3-D object shape is of great importance to robot manipulation tasks, but may not be readily available in unstructured environments. While vision is often occluded during robot-object interaction, high-resolution tactile sensors can give a dense local perspective of the object. However, tactile sensors have limited sensing area and the shape representation must faithfully approximate n… ▽ More

    Submitted 10 March, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Camera-ready version for the 2022 IEEE International Conference on Robotics and Automation (ICRA 2022). Modified PDF title

  37. arXiv:2109.04027  [pdf, other

    cs.RO

    Taxim: An Example-based Simulation Model for GelSight Tactile Sensors

    Authors: Zilin Si, Wenzhen Yuan

    Abstract: Simulation is widely used in robotics for system verification and large-scale data collection. However, simulating sensors, including tactile sensors, has been a long-standing challenge. In this paper, we propose Taxim, a realistic and high-speed simulation model for a vision-based tactile sensor, GelSight. A GelSight sensor uses a piece of soft elastomer as the medium of contact and embeds optica… ▽ More

    Submitted 14 December, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

  38. arXiv:2108.00301  [pdf, other

    cs.RO

    Improving Grasp Stability with Rotation Measurement from Tactile Sensing

    Authors: Raj Kolamuri, Zilin Si, Yufan Zhang, Arpit Agarwal, Wenzhen Yuan

    Abstract: Rotational displacement about the grasping point is a common grasp failure when an object is grasped at a location away from its center of gravity. Tactile sensors with soft surfaces, such as GelSight sensors, can detect the rotation patterns on the contacting surfaces when the object rotates. In this work, we propose a model-based algorithm that detects those rotational patterns and measures rota… ▽ More

    Submitted 31 July, 2021; originally announced August 2021.

  39. arXiv:2102.05792  [pdf, other

    cs.IT eess.SP

    Rate-Splitting Multiple Access for Multigateway Multibeam Satellite Systems with Feeder Link Interference

    Authors: Zhi Wen Si, Longfei Yin, Bruno Clerckx

    Abstract: This paper studies the precoder design problem of achieving max-min fairness (MMF) amongst users in multigateway multibeam satellite communication systems with feeder link interference. We propose a beamforming strategy based on a newly introduced transmission scheme known as rate-splitting multiple access (RSMA). RSMA relies on multi-antenna rate-splitting at the transmitter and successive interf… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: Submitted for publication

  40. arXiv:2102.03828  [pdf, other

    cs.IT

    Learning to Decode Protograph LDPC Codes

    Authors: Jincheng Dai, Kailin Tan, Zhongwei Si, Kai Niu, Mingzhe Chen, H. Vincent Poor, Shuguang Cui

    Abstract: The recent development of deep learning methods provides a new approach to optimize the belief propagation (BP) decoding of linear codes. However, the limitation of existing works is that the scale of neural networks increases rapidly with the codelength, thus they can only support short to moderate codelengths. From the point view of practicality, we propose a high-performance neural min-sum (MS)… ▽ More

    Submitted 10 February, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: To appear in the IEEE JSAC Series on Machine Learning in Communications and Networks

  41. Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification

    Authors: Yifeng Ding, Shaoguo Wen, Jiyang Xie, Dongliang Chang, Zhanyu Ma, Zhongwei Si, Haibin Ling

    Abstract: Classifying the sub-categories of an object from the same super-category (e.g. bird species, car and aircraft models) in fine-grained visual classification (FGVC) highly relies on discriminative feature representation and accurate region localization. Existing approaches mainly focus on distilling information from high-level features. In this paper, however, we show that by integrating low-level i… ▽ More

    Submitted 9 February, 2020; originally announced February 2020.

  42. arXiv:1912.04056  [pdf, other

    cs.GT

    Maximal Information Propagation with Budgets

    Authors: Haomin Shi, Yao Zhang, Zilin Si, Letong Wang, Dengji Zhao

    Abstract: In this paper, we present an information propagation game on a network where the information is originated from a sponsor who is willing to pay a fixed total budget to the players who propagate the information. Our solution can be applied to real world situations such as advertising via social networks with limited budgets. The goal is to design a mechanism to distribute the budget such that all p… ▽ More

    Submitted 27 February, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

  43. arXiv:1705.11132  [pdf

    cs.OH

    The Role of Data Analysis in the Development of Intelligent Energy Networks

    Authors: Zhanyu Ma, Jiyang Xie, Hailong Li, Qie Sun, Zhongwei Si, Jianhua Zhang, Jun Guo

    Abstract: Data analysis plays an important role in the development of intelligent energy networks (IENs). This article reviews and discusses the application of data analysis methods for energy big data. The installation of smart energy meters has provided a huge volume of data at different time resolutions, suggesting data analysis is required for clustering, demand forecasting, energy generation optimizati… ▽ More

    Submitted 30 May, 2017; originally announced May 2017.

  44. Polar-Coded Non-Orthogonal Multiple Access

    Authors: Jincheng Dai, Kai Niu, Zhongwei Si, Chao Dong, Jiaru Lin

    Abstract: Non-orthogonal multiple access (NOMA) is one of the key techniques to address the high spectral efficiency and massive connectivity requirements for the fifth generation (5G) wireless system. To efficiently realize NOMA, we propose a joint design framework combining the polar coding and the NOMA transmission, which deeply mines the generalized polarization effect among the users. In this polar cod… ▽ More

    Submitted 16 May, 2017; originally announced May 2017.

    Comments: First version

  45. arXiv:1511.07236  [pdf, other

    cs.IT

    Does Gaussian Approximation Work Well for The Long-Length Polar Code Construction?

    Authors: Jincheng Dai, Kai Niu, Zhongwei Si, Chao Dong, Jiaru Lin

    Abstract: Gaussian approximation (GA) is widely used to construct polar codes. However when the code length is long, the subchannel selection inaccuracy due to the calculation error of conventional approximate GA (AGA), which uses a two-segment approximation function, results in a catastrophic performance loss. In this paper, new principles to design the GA approximation functions for polar codes are propos… ▽ More

    Submitted 15 March, 2017; v1 submitted 23 November, 2015; originally announced November 2015.

  46. arXiv:1102.5204  [pdf, ps, other

    cs.IT

    Bilayer LDPC Convolutional Codes for Half-Duplex Relay Channels

    Authors: Zhongwei Si, Ragnar Thobaben, Mikael Skoglund

    Abstract: In this paper we present regular bilayer LDPC convolutional codes for half-duplex relay channels. For the binary erasure relay channel, we prove that the proposed code construction achieves the capacities for the source-relay link and the source-destination link provided that the channel conditions are known when designing the code. Meanwhile, this code enables the highest transmission rate with d… ▽ More

    Submitted 25 February, 2011; originally announced February 2011.

    Comments: 5 pages, 5 figures, submitted to ISIT 2011