Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: He, E

.
  1. arXiv:2408.10188  [pdf, other

    cs.CV cs.CL

    LongVILA: Scaling Long-Context Visual Language Models for Long Videos

    Authors: Fuzhao Xue, Yukang Chen, Dacheng Li, Qinghao Hu, Ligeng Zhu, Xiuyu Li, Yunhao Fang, Haotian Tang, Shang Yang, Zhijian Liu, Ethan He, Hongxu Yin, Pavlo Molchanov, Jan Kautz, Linxi Fan, Yuke Zhu, Yao Lu, Song Han

    Abstract: Long-context capability is critical for multi-modal foundation models, especially for long video understanding. We introduce LongVILA, a full-stack solution for long-context visual-language models by co-designing the algorithm and system. For model training, we upgrade existing VLMs to support long video understanding by incorporating two additional stages, i.e., long context extension and long su… ▽ More

    Submitted 21 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: Code and models are available at https://github.com/NVlabs/VILA/blob/main/LongVILA.md

  2. arXiv:2408.10114   

    quant-ph cs.CC math.OA

    Topics in Algebra of Synchronous Games, Algebraic Graph Identities and Quantum NP-hardness Reductions

    Authors: Entong He

    Abstract: We review the correspondence between a synchronous game and its associated game algebra. We slightly develop the work of Helton et al.[HMPS17] by proposing results on algebraic and locally commuting graph identities. Based on the theoretical works on noncommutative Nullstellensätze [BWHK23], we build computational tools involving Gröbner basis method and semidefinite programming to check the exist… ▽ More

    Submitted 22 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: There is a problem of authorship among people involved in the research project, and we have yet reached an agreement. Meanwhile, we hope to further check the validity of the proving system

  3. PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization

    Authors: Yuyang Ye, Lu-An Tang, Haoyu Wang, Runlong Yu, Wenchao Yu, Erhu He, Haifeng Chen, Hui Xiong

    Abstract: Achieving carbon neutrality within industrial operations has become increasingly imperative for sustainable development. It is both a significant challenge and a key opportunity for operational optimization in industry 4.0. In recent years, Deep Reinforcement Learning (DRL) based methods offer promising enhancements for sequential optimization processes and can be used for reducing carbon emission… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  4. Quantum Ranging Enhanced TDoA Localization

    Authors: Entong He, Yuxiang Yang, Chenshu Wu

    Abstract: Localization is critical to numerous applications. The performance of classical localization protocols is limited by the specific form of distance information and suffer from considerable ranging errors. This paper foresees a new opportunity by utilizing the exceptional property of entangled quantum states to measure a linear combination of target-anchor distances. Specifically, we consider locali… ▽ More

    Submitted 25 April, 2024; originally announced July 2024.

    Journal ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  5. arXiv:2406.00497  [pdf, ps, other

    cs.SD cs.AI cs.CL eess.AS

    Recent Advances in End-to-End Simultaneous Speech Translation

    Authors: Xiaoqian Liu, Guoqiang Hu, Yangfan Du, Erfeng He, Yingfeng Luo, Chen Xu, Tong Xiao, Jingbo Zhu

    Abstract: Simultaneous speech translation (SimulST) is a demanding task that involves generating translations in real-time while continuously processing speech input. This paper offers a comprehensive overview of the recent developments in SimulST research, focusing on four major challenges. Firstly, the complexities associated with processing lengthy and continuous speech streams pose significant hurdles.… ▽ More

    Submitted 20 August, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted by IJCAI 2024

  6. arXiv:2405.17034  [pdf, other

    cs.LG cs.AI

    FUGNN: Harmonizing Fairness and Utility in Graph Neural Networks

    Authors: Renqiang Luo, Huafei Huang, Shuo Yu, Zhuoyang Han, Estrid He, Xiuzhen Zhang, Feng Xia

    Abstract: Fairness-aware Graph Neural Networks (GNNs) often face a challenging trade-off, where prioritizing fairness may require compromising utility. In this work, we re-examine fairness through the lens of spectral graph theory, aiming to reconcile fairness and utility within the framework of spectral graph learning. We explore the correlation between sensitive features and spectrum in GNNs, using theore… ▽ More

    Submitted 13 August, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted in SIGKDD 2024

  7. arXiv:2405.15865  [pdf, other

    cond-mat.str-el

    Metallic bonding in close packed structures: structural frustration from a hidden gauge symmetry

    Authors: Eric He, C. M. Wilson, R. Ganesh

    Abstract: Based on its simple valence electron configuration, we may expect lithium to have straightforward physical properties that are easily explained. However, solid lithium, when cooled below 77 K, develops a complex structure that has been debated for decades. A close parallel is found in sodium below 36 K where the crystal structure still remains unresolved. In this letter, we explore a possible driv… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 4 pages + supplement

  8. arXiv:2404.16895  [pdf, other

    cs.ET

    QuERLoc: Towards Next-Generation Localization with Quantum-Enhanced Ranging

    Authors: Entong He, Yuxiang Yang, Chenshu Wu

    Abstract: Remarkable advances have been achieved in localization techniques in past decades, rendering it one of the most important technologies indispensable to our daily lives. In this paper, we investigate a novel localization approach for future computing by presenting QuERLoc, the first study on localization using quantum-enhanced ranging. By fine-tuning the evolution of an entangled quantum probe, qua… ▽ More

    Submitted 4 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  9. arXiv:2402.13379  [pdf, other

    cs.LG cs.CY

    Referee-Meta-Learning for Fast Adaptation of Locational Fairness

    Authors: Weiye Chen, Yiqun Xie, Xiaowei Jia, Erhu He, Han Bao, Bang An, Xun Zhou

    Abstract: When dealing with data from distinct locations, machine learning algorithms tend to demonstrate an implicit preference of some locations over the others, which constitutes biases that sabotage the spatial fairness of the algorithm. This unfairness can easily introduce biases in subsequent decision-making given broad adoptions of learning-based solutions in practice. However, locational biases in A… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  10. arXiv:2401.01625  [pdf, other

    cs.SI cs.CY cs.LG

    SCALA: Sparsification-based Contrastive Learning for Anomaly Detection on Attributed Networks

    Authors: Enbo He, Yitong Hao, Yue Zhang, Guisheng Yin, Lina Yao

    Abstract: Anomaly detection on attributed networks aims to find the nodes whose behaviors are significantly different from other majority nodes. Generally, network data contains information about relationships between entities, and the anomaly is usually embodied in these relationships. Therefore, how to comprehensively model complex interaction patterns in networks is still a major focus. It can be observe… ▽ More

    Submitted 8 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 9 pages, 14 figures

  11. arXiv:2309.12234  [pdf, ps, other

    cs.CL eess.AS

    Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition

    Authors: Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang

    Abstract: In this study, we present synchronous bilingual Connectionist Temporal Classification (CTC), an innovative framework that leverages dual CTC to bridge the gaps of both modality and language in the speech translation (ST) task. Utilizing transcript and translation as concurrent objectives for CTC, our model bridges the gap between audio and text as well as between source and target languages. Build… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Submitted to ICASSP 2024

  12. arXiv:2302.08406  [pdf, other

    cs.LG stat.ML

    Entity Aware Modelling: A Survey

    Authors: Rahul Ghosh, Haoyu Yang, Ankush Khandelwal, Erhu He, Arvind Renganathan, Somya Sharma, Xiaowei Jia, Vipin Kumar

    Abstract: Personalized prediction of responses for individual entities caused by external drivers is vital across many disciplines. Recent machine learning (ML) advances have led to new state-of-the-art response prediction models. Models built at a population level often lead to sub-optimal performance in many personalized prediction settings due to heterogeneity in data across entities (tasks). In personal… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: Submitted to IJCAI, Survey Track

  13. arXiv:2302.03065  [pdf, other

    quant-ph cond-mat.mes-hall hep-th

    Bound states without potentials: localization at singularities

    Authors: Eric He, R. Ganesh

    Abstract: Bound state formation is a classic feature of quantum mechanics, where a particle localizes in the vicinity of an attractive potential. This is typically understood as the particle lowering its potential energy. In this article, we discuss a paradigm where bound states arise purely due to kinetic energy considerations. This phenomenon occurs in certain non-manifold spaces that consist of multiple… ▽ More

    Submitted 7 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 10 pages, 7 figures

    Journal ref: Phys. Rev. A 108, 022202 (2023)

  14. arXiv:2211.00567  [pdf, other

    cs.HC

    Analysis Without Data: Teaching Students to Tackle the VAST Challenge

    Authors: Edward W He, Daniel Tolessa, Ashley Suh, Remco Chang

    Abstract: The VAST Challenges have been shown to be an effective tool in visual analytics education, encouraging student learning while enforcing good visualization design and development practices. However, research has observed that students often struggle at identifying a good "starting point" when tackling the VAST Challenge. Consequently, students who could not identify a good starting point failed at… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: IEEE Workshop on Visualization Guidelines in Research, Design, and Education (VisGuides)

  15. arXiv:1802.00840  [pdf, other

    q-bio.NC cs.CL

    Preserved Structure Across Vector Space Representations

    Authors: Andrei Amatuni, Estelle He, Elika Bergelson

    Abstract: Certain concepts, words, and images are intuitively more similar than others (dog vs. cat, dog vs. spoon), though quantifying such similarity is notoriously difficult. Indeed, this kind of computation is likely a critical part of learning the category boundaries for words within a given language. Here, we use a set of 27 items (e.g. 'dog') that are highly common in infants' input, and use both ima… ▽ More

    Submitted 14 May, 2018; v1 submitted 2 February, 2018; originally announced February 2018.

    Comments: presented at CogSci 2018