Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Loh, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06576  [pdf, other

    cs.CL cs.AI cs.LG

    OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step

    Authors: Owen Dugan, Donato Manuel Jimenez Beneto, Charlotte Loh, Zhuo Chen, Rumen Dangovski, Marin Soljačić

    Abstract: Despite significant advancements in text generation and reasoning, Large Language Models (LLMs) still face challenges in accurately performing complex arithmetic operations. To achieve accurate calculations, language model systems often enable LLMs to generate code for arithmetic operations. However, this approach compromises speed and security and, if finetuning is involved, risks the language mo… ▽ More

    Submitted 29 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2406.00132  [pdf, other

    cs.LG quant-ph

    QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation

    Authors: Zhuo Chen, Rumen Dangovski, Charlotte Loh, Owen Dugan, Di Luo, Marin Soljačić

    Abstract: We propose Quantum-informed Tensor Adaptation (QuanTA), a novel, easy-to-implement, fine-tuning method with no inference overhead for large-scale pre-trained language models. By leveraging quantum-inspired methods derived from quantum circuit structures, QuanTA enables efficient high-rank fine-tuning, surpassing the limitations of Low-Rank Adaptation (LoRA)--low-rank approximation may fail for com… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  3. arXiv:2312.00111  [pdf, other

    cs.LG cond-mat.mtrl-sci

    Multimodal Learning for Materials

    Authors: Viggo Moro, Charlotte Loh, Rumen Dangovski, Ali Ghorashi, Andrew Ma, Zhuo Chen, Samuel Kim, Peter Y. Lu, Thomas Christensen, Marin Soljačić

    Abstract: Artificial intelligence is transforming computational materials science, improving the prediction of material properties, and accelerating the discovery of novel materials. Recently, publicly available material data repositories have grown rapidly. This growth encompasses not only more materials, but also a greater variety and quantity of their associated properties. Existing machine learning effo… ▽ More

    Submitted 12 April, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

    Comments: 11 pages, 4 figures

  4. arXiv:2311.17066  [pdf

    q-bio.QM cs.AI

    Cluster trajectory of SOFA score in predicting mortality in sepsis

    Authors: Yuhe Ke, Matilda Swee Sun Tang, Celestine Jia Ling Loh, Hairil Rizal Abdullah, Nicholas Brian Shannon

    Abstract: Objective: Sepsis is a life-threatening condition. Sequential Organ Failure Assessment (SOFA) score is commonly used to assess organ dysfunction and predict ICU mortality, but it is taken as a static measurement and fails to capture dynamic changes. This study aims to investigate the relationship between dynamic changes in SOFA scores over the first 72 hours of ICU admission and patient outcomes.… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 26 pages, 4 figures, 2 tables

  5. arXiv:2304.00601  [pdf, other

    cs.CV cs.LG

    Constructive Assimilation: Boosting Contrastive Learning Performance through View Generation Strategies

    Authors: Ligong Han, Seungwook Han, Shivchander Sudalairaj, Charlotte Loh, Rumen Dangovski, Fei Deng, Pulkit Agrawal, Dimitris Metaxas, Leonid Karlinsky, Tsui-Wei Weng, Akash Srivastava

    Abstract: Transformations based on domain expertise (expert transformations), such as random-resized-crop and color-jitter, have proven critical to the success of contrastive learning techniques such as SimCLR. Recently, several attempts have been made to replace such domain-specific, human-designed transformations with generated views that are learned. However for imagery data, so far none of these view-ge… ▽ More

    Submitted 8 April, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: Accepted at Generative Models for Computer Vision Workshop 2023

  6. arXiv:2303.02484  [pdf, other

    cs.LG cs.AI cs.CV

    Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries

    Authors: Charlotte Loh, Seungwook Han, Shivchander Sudalairaj, Rumen Dangovski, Kai Xu, Florian Wenzel, Marin Soljacic, Akash Srivastava

    Abstract: Deep ensembles (DE) have been successful in improving model performance by learning diverse members via the stochasticity of random initialization. While recent works have attempted to promote further diversity in DE via hyperparameters or regularizing loss functions, these methods primarily still rely on a stochastic approach to explore the hypothesis space. In this work, we present Multi-Symmetr… ▽ More

    Submitted 19 June, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: Camera Ready Revision. ICML 2023

  7. arXiv:2301.11756  [pdf, ps, other

    math.AC cs.CG math.AT

    A comment on the structure of graded modules over graded principal ideal domains in the context of persistent homology

    Authors: Clara Loeh

    Abstract: The literature in persistent homology often refers to a "structure theorem for finitely generated graded modules over a graded principal ideal domain". We clarify the nature of this structure theorem in this context.

    Submitted 5 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: 10 pages; v2: small improvements to exposition

  8. arXiv:2210.04783  [pdf, other

    cs.LG cs.CV physics.app-ph

    On the Importance of Calibration in Semi-supervised Learning

    Authors: Charlotte Loh, Rumen Dangovski, Shivchander Sudalairaj, Seungwook Han, Ligong Han, Leonid Karlinsky, Marin Soljacic, Akash Srivastava

    Abstract: State-of-the-art (SOTA) semi-supervised learning (SSL) methods have been highly successful in leveraging a mix of labeled and unlabeled data by combining techniques of consistency regularization and pseudo-labeling. During pseudo-labeling, the model's predictions on unlabeled data are used for training and thus, model calibration is important in mitigating confirmation bias. Yet, many SOTA methods… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 24 pages

  9. arXiv:2202.03159  [pdf, ps, other

    math.GR cs.LO math.GT math.LO

    $L^2$-Betti numbers and computability of reals

    Authors: Clara Loeh, Matthias Uschold

    Abstract: We study the computability degree of real numbers arising as $L^2$-Betti numbers or $L^2$-torsion of groups, parametrised over the Turing degree of the word problem.

    Submitted 7 March, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 33 pages; To appear in Computability; v2: clarified Theorem 1.5; v3: removed Section 9, minor corrections; v4: added Appendix B and Remark 1.5; Lean implementation available at https://gitlab.com/L2-comp/l2-comp-lean;

  10. arXiv:2111.00899  [pdf, other

    cs.CV cs.LG eess.IV physics.app-ph

    Equivariant Contrastive Learning

    Authors: Rumen Dangovski, Li Jing, Charlotte Loh, Seungwook Han, Akash Srivastava, Brian Cheung, Pulkit Agrawal, Marin Soljačić

    Abstract: In state-of-the-art self-supervised learning (SSL) pre-training produces semantically good representations by encouraging them to be invariant under meaningful transformations prescribed from human knowledge. In fact, the property of invariance is a trivial instance of a broader class called equivariance, which can be intuitively understood as the property that representations transform according… ▽ More

    Submitted 14 March, 2022; v1 submitted 28 October, 2021; originally announced November 2021.

    Comments: Camera Ready Revision. ICLR 2022. Discussion: https://openreview.net/forum?id=gKLAAfiytI Code: https://github.com/rdangovs/essl

  11. arXiv:2110.08406  [pdf, other

    cs.LG cond-mat.mtrl-sci physics.app-ph physics.optics

    Surrogate- and invariance-boosted contrastive learning for data-scarce applications in science

    Authors: Charlotte Loh, Thomas Christensen, Rumen Dangovski, Samuel Kim, Marin Soljacic

    Abstract: Deep learning techniques have been increasingly applied to the natural sciences, e.g., for property prediction and optimization or material discovery. A fundamental ingredient of such approaches is the vast quantity of labelled data needed to train the model; this poses severe challenges in data-scarce settings where obtaining labels requires substantial computational or labor resources. Here, we… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: 21 pages, 10 figures

  12. arXiv:2104.11667  [pdf, other

    cs.LG physics.app-ph physics.chem-ph physics.comp-ph physics.optics

    Deep Learning for Bayesian Optimization of Scientific Problems with High-Dimensional Structure

    Authors: Samuel Kim, Peter Y. Lu, Charlotte Loh, Jamie Smith, Jasper Snoek, Marin Soljačić

    Abstract: Bayesian optimization (BO) is a popular paradigm for global optimization of expensive black-box functions, but there are many domains where the function is not completely a black-box. The data may have some known structure (e.g. symmetries) and/or the data generation process may be a composite process that yields useful intermediate or auxiliary information in addition to the value of the optimiza… ▽ More

    Submitted 6 December, 2022; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: 32 pages, 16 figures; published in TMLR

    Journal ref: Transactions on Machine Learning Research (TMLR) September 2022

  13. arXiv:1908.10673  [pdf, other

    cs.NE

    A Search for the Underlying Equation Governing Similar Systems

    Authors: Changwei Loh, Daniel Schneegass, Pengwei Tian

    Abstract: We show a data-driven approach to discover the underlying structural form of the mathematical equation governing the dynamics of multiple but similar systems induced by the same mechanisms. This approach hinges on theories that we lay out involving arguments based on the nature of physical systems. In the same vein, we also introduce a metric to search for the best candidate equation using the dat… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

  14. arXiv:1812.02824  [pdf, ps, other

    stat.AP cs.LG stat.ML

    Structural Damage Detection and Localization with Unknown Post-Damage Feature Distribution Using Sequential Change-Point Detection Method

    Authors: Yizheng Liao, Anne S. Kiremidjian, Ram Rajagopal, Chin-Hsuing Loh

    Abstract: The high structural deficient rate poses serious risks to the operation of many bridges and buildings. To prevent critical damage and structural collapse, a quick structural health diagnosis tool is needed during normal operation or immediately after extreme events. In structural health monitoring (SHM), many existing works will have limited performance in the quick damage identification process b… ▽ More

    Submitted 14 November, 2018; originally announced December 2018.

    Comments: 20 pages

  15. Multiuser Communication through Power Talk in DC MicroGrids

    Authors: Marko Angjelichinoski, Cedomir Stefanovic, Petar Popovski, Hongpeng Liu, Poh Chiang Loh, Frede Blaabjerg

    Abstract: Power talk is a novel concept for communication among control units in MicroGrids (MGs), carried out without a dedicated modem, but by using power electronics that interface the common bus. The information is transmitted by modulating the parameters of the primary control, incurring subtle power deviations that can be detected by other units. In this paper, we develop power talk communication stra… ▽ More

    Submitted 21 July, 2015; originally announced July 2015.

    Comments: Multiuser extension of the power talk concept. Submitted to IEEE JSAC

  16. arXiv:1504.03016  [pdf, ps, other

    cs.IT

    Power Talk: How to Modulate Data over a DC Micro Grid Bus using Power Electronics

    Authors: Marko Angjelichinoski, Cedomir Stefanovic, Petar Popovski, Hongpeng Liu, Poh Chiang Loh, Frede Blaabjerg

    Abstract: We introduce a novel communication strategy for DC Micro Grids (MGs), termed power talk, in which the devices communicate by modulating the power levels in the DC bus. The information is transmitted by varying the parameters that the MG units use to control the level of the common bus voltage, while it is received by processing the bus measurements that units perform. This communication is challen… ▽ More

    Submitted 12 April, 2015; originally announced April 2015.

    Comments: IEEE GLOBECOM 2015