Zum Hauptinhalt springen

Showing 1–21 of 21 results for author: Wynter, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.17280  [pdf, other

    cs.AI cs.CL

    Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts

    Authors: Rhui Dih Lee, Laura Wynter, Raghu Kiran Ganti

    Abstract: We present a toolkit for creating low-cost Mixture-of-Domain-Experts (MOE) from trained models. The toolkit can be used for creating a mixture from models or from adapters. We perform extensive tests and offer guidance on defining the architecture of the resulting MOE using the toolkit. A public repository is available.

    Submitted 30 August, 2024; originally announced August 2024.

  2. arXiv:2407.09105  [pdf, other

    cs.LG cs.AI

    Enhancing Training Efficiency Using Packing with Flash Attention

    Authors: Achintya Kundu, Rhui Dih Lee, Laura Wynter, Raghu Kiran Ganti, Mayank Mishra

    Abstract: Padding is often used in tuning LLM models by adding special tokens to shorter training examples to match the length of the longest sequence in each batch. While this ensures uniformity for batch processing, it introduces inefficiencies by including irrelevant padding tokens in the computation and wastes GPU resources. Hugging Face SFT trainer has always offered the option to use packing to combin… ▽ More

    Submitted 23 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2404.01353  [pdf, other

    cs.LG cs.AI cs.CL

    Efficiently Distilling LLMs for Edge Applications

    Authors: Achintya Kundu, Fabian Lim, Aaron Chew, Laura Wynter, Penny Chong, Rhui Dih Lee

    Abstract: Supernet training of LLMs is of great interest in industrial applications as it confers the ability to produce a palette of smaller models at constant cost, regardless of the number of models (of different size / latency) produced. We propose a new method called Multistage Low-rank Fine-tuning of Super-transformers (MLFS) for parameter-efficient supernet training. We show that it is possible to ob… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted for publication in NAACL 2024 (Industry Track)

  4. arXiv:2303.15485  [pdf, other

    cs.LG cs.CV cs.NE

    Transfer-Once-For-All: AI Model Optimization for Edge

    Authors: Achintya Kundu, Laura Wynter, Rhui Dih Lee, Luis Angel Bathen

    Abstract: Weight-sharing neural architecture search aims to optimize a configurable neural network model (supernet) for a variety of deployment scenarios across many devices with different resource constraints. Existing approaches use evolutionary search to extract models of different sizes from a supernet trained on a very large data set, and then fine-tune the extracted models on the typically small, real… ▽ More

    Submitted 2 July, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  5. arXiv:2202.13436  [pdf, other

    cs.LG cs.AI

    Neural-Progressive Hedging: Enforcing Constraints in Reinforcement Learning with Stochastic Programming

    Authors: Supriyo Ghosh, Laura Wynter, Shiau Hong Lim, Duc Thien Nguyen

    Abstract: We propose a framework, called neural-progressive hedging (NP), that leverages stochastic programming during the online phase of executing a reinforcement learning (RL) policy. The goal is to ensure feasibility with respect to constraints and risk-based objectives such as conditional value-at-risk (CVaR) during the execution of the policy, using probabilistic models of the state transitions to gui… ▽ More

    Submitted 27 February, 2022; originally announced February 2022.

  6. arXiv:2110.07275  [pdf, other

    cs.LG cs.AI

    Order Constraints in Optimal Transport

    Authors: Fabian Lim, Laura Wynter, Shiau Hong Lim

    Abstract: Optimal transport is a framework for comparing measures whereby a cost is incurred for transporting one measure to another. Recent works have aimed to improve optimal transport plans through the introduction of various forms of structure. We introduce novel order constraints into the optimal transport formulation to allow for the incorporation of structure. We define an efficient method for obtain… ▽ More

    Submitted 28 June, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: To appear in Proceedings of ICML 2022. Main Paper + Supplementary

  7. arXiv:2102.09745  [pdf, other

    cs.LG

    Decentralized Deterministic Multi-Agent Reinforcement Learning

    Authors: Antoine Grosnit, Desmond Cai, Laura Wynter

    Abstract: [Zhang, ICML 2018] provided the first decentralized actor-critic algorithm for multi-agent reinforcement learning (MARL) that offers convergence guarantees. In that work, policies are stochastic and are defined on finite action spaces. We extend those results to offer a provably-convergent decentralized actor-critic algorithm for learning deterministic policies on continuous action spaces. Determi… ▽ More

    Submitted 19 February, 2021; originally announced February 2021.

  8. arXiv:2102.09361  [pdf, other

    cs.LG cs.AI

    Efficient Reinforcement Learning in Resource Allocation Problems Through Permutation Invariant Multi-task Learning

    Authors: Desmond Cai, Shiau Hong Lim, Laura Wynter

    Abstract: One of the main challenges in real-world reinforcement learning is to learn successfully from limited training samples. We show that in certain settings, the available data can be dramatically increased through a form of multi-task learning, by exploiting an invariance property in the tasks. We provide a theoretical performance bound for the gain in sample efficiency under this setting. This motiv… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

  9. arXiv:2101.06171  [pdf, other

    cs.LG cs.AI cs.DC

    Probabilistic Inference for Learning from Untrusted Sources

    Authors: Duc Thien Nguyen, Shiau Hoong Lim, Laura Wynter, Desmond Cai

    Abstract: Federated learning brings potential benefits of faster learning, better solutions, and a greater propensity to transfer when heterogeneous data from different parties increases diversity. However, because federated learning tasks tend to be large and complex, and training times non-negligible, it is important for the aggregation algorithm to be robust to non-IID data and corrupted parties. This ro… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

  10. arXiv:2009.06303  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Robustness and Personalization in Federated Learning: A Unified Approach via Regularization

    Authors: Achintya Kundu, Pengqian Yu, Laura Wynter, Shiau Hong Lim

    Abstract: We present a class of methods for robust, personalized federated learning, called Fed+, that unifies many federated learning algorithms. The principal advantage of this class of methods is to better accommodate the real-world characteristics found in federated training, such as the lack of IID data across parties, the need for robustness to outliers or stragglers, and the requirement to perform we… ▽ More

    Submitted 12 July, 2022; v1 submitted 14 September, 2020; originally announced September 2020.

    Comments: Accepted by IEEE EDGE 2022 (16 pages, 4 figures, 2 tables)

  11. arXiv:2007.10987  [pdf, other

    cs.LG cs.CR cs.DC

    IBM Federated Learning: an Enterprise Framework White Paper V0.1

    Authors: Heiko Ludwig, Nathalie Baracaldo, Gegi Thomas, Yi Zhou, Ali Anwar, Shashank Rajamoni, Yuya Ong, Jayaram Radhakrishnan, Ashish Verma, Mathieu Sinn, Mark Purcell, Ambrish Rawat, Tran Minh, Naoise Holohan, Supriyo Chakraborty, Shalisha Whitherspoon, Dean Steuer, Laura Wynter, Hifaz Hassan, Sean Laguna, Mikhail Yurochkin, Mayank Agarwal, Ebube Chuba, Annie Abay

    Abstract: Federated Learning (FL) is an approach to conduct machine learning without centralizing training data in a single place, for reasons of privacy, confidentiality or data volume. However, solving federated machine learning problems raises issues above and beyond those of centralized machine learning. These issues include setting up communication infrastructure between parties, coordinating the learn… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 17 pages

    ACM Class: I.2.6; I.2.11

  12. arXiv:2006.00778  [pdf, other

    cs.LG cs.AI stat.ML

    Variational Bayesian Inference for Crowdsourcing Predictions

    Authors: Desmond Cai, Duc Thien Nguyen, Shiau Hong Lim, Laura Wynter

    Abstract: Crowdsourcing has emerged as an effective means for performing a number of machine learning tasks such as annotation and labelling of images and other data sets. In most early settings of crowdsourcing, the task involved classification, that is assigning one of a discrete set of labels to each task. Recently, however, more complex tasks have been attempted including asking crowdsource workers to a… ▽ More

    Submitted 1 June, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 7 pages

  13. arXiv:2004.01387  [pdf, other

    cs.LG cs.AI

    A Deep Ensemble Multi-Agent Reinforcement Learning Approach for Air Traffic Control

    Authors: Supriyo Ghosh, Sean Laguna, Shiau Hong Lim, Laura Wynter, Hasan Poonawala

    Abstract: Air traffic control is an example of a highly challenging operational problem that is readily amenable to human expertise augmentation via decision support technologies. In this paper, we propose a new intelligent decision making framework that leverages multi-agent reinforcement learning (MARL) to dynamically suggest adjustments of aircraft speeds in real-time. The goal of the system is to enhanc… ▽ More

    Submitted 3 April, 2020; originally announced April 2020.

  14. arXiv:1906.03040  [pdf, other

    cs.CY cs.LG eess.SP eess.SY stat.ML

    FASTER: Fusion AnalyticS for public Transport Event Response

    Authors: Sebastien Blandin, Laura Wynter, Hasan Poonawala, Sean Laguna, Basile Dura

    Abstract: Increasing urban concentration raises operational challenges that can benefit from integrated monitoring and decision support. Such complex systems need to leverage the full stack of analytical methods, from state estimation using multi-sensor fusion for situational awareness, to prediction and computation of optimal responses. The FASTER platform that we describe in this work, deployed at nation… ▽ More

    Submitted 14 May, 2019; originally announced June 2019.

  15. arXiv:1904.10180  [pdf, other

    cs.CV

    High-frequency crowd insights for public safety and congestion control

    Authors: Karthik Nandakumar, Sebastien Blandin, Laura Wynter

    Abstract: We present results from several projects aimed at enabling the real-time understanding of crowds and their behaviour in the built environment. We make use of CCTV video cameras that are ubiquitous throughout the developed and developing world and as such are able to play the role of a reliable sensing mechanism. We outline the novel methods developed for our crowd insights engine, and illustrate e… ▽ More

    Submitted 23 April, 2019; originally announced April 2019.

  16. arXiv:1903.01045  [pdf, other

    cs.LG cs.CY cs.SI stat.ML

    Robust commuter movement inference from connected mobile devices

    Authors: Baoyang Song, Hasan Poonawala, Laura Wynter, Sebastien Blandin

    Abstract: The preponderance of connected devices provides unprecedented opportunities for fine-grained monitoring of the public infrastructure. However while classical models expect high quality application-specific data streams, the promise of the Internet of Things (IoT) is that of an abundance of disparate and noisy datasets from connected devices. In this context, we consider the problem of estimation o… ▽ More

    Submitted 3 March, 2019; originally announced March 2019.

    Comments: International Conference on Data Mining 2018

  17. arXiv:1902.10887  [pdf, other

    cs.CV cs.LG

    Towards Robust ResNet: A Small Step but A Giant Leap

    Authors: Jingfeng Zhang, Bo Han, Laura Wynter, Kian Hsiang Low, Mohan Kankanhalli

    Abstract: This paper presents a simple yet principled approach to boosting the robustness of the residual network (ResNet) that is motivated by the dynamical system perspective. Namely, a deep neural network can be interpreted using a partial differential equation, which naturally inspires us to characterize ResNet by an explicit Euler method. Our analytical studies reveal that the step factor h in the Eule… ▽ More

    Submitted 1 July, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

  18. arXiv:1812.05451  [pdf, other

    cs.CR cs.AI cs.LG stat.ML

    A Probabilistic Model of the Bitcoin Blockchain

    Authors: Marc Jourdan, Sebastien Blandin, Laura Wynter, Pralhad Deshpande

    Abstract: The Bitcoin transaction graph is a public data structure organized as transactions between addresses, each associated with a logical entity. In this work, we introduce a complete probabilistic model of the Bitcoin Blockchain. We first formulate a set of conditional dependencies induced by the Bitcoin protocol at the block level and derive a corresponding fully observed graphical model of a Bitcoin… ▽ More

    Submitted 6 November, 2018; originally announced December 2018.

  19. arXiv:1810.11956  [pdf, other

    cs.CR cs.LG

    Characterizing Entities in the Bitcoin Blockchain

    Authors: Marc Jourdan, Sebastien Blandin, Laura Wynter, Pralhad Deshpande

    Abstract: Bitcoin has created a new exchange paradigm within which financial transactions can be trusted without an intermediary. This premise of a free decentralized transactional network however requires, in its current implementation, unrestricted access to the ledger for peer-based transaction verification. A number of studies have shown that, in this pseudonymous context, identities can be leaked based… ▽ More

    Submitted 29 October, 2018; originally announced October 2018.

  20. arXiv:1809.10315  [pdf, other

    cs.LG cs.AI stat.ML

    Smooth Inter-layer Propagation of Stabilized Neural Networks for Classification

    Authors: Jingfeng Zhang, Laura Wynter

    Abstract: Recent work has studied the reasons for the remarkable performance of deep neural networks in image classification. We examine batch normalization on the one hand and the dynamical systems view of residual networks on the other hand. Our goal is in understanding the notions of stability and smoothness of the inter-layer propagation of ResNets so as to explain when they contribute to significantly… ▽ More

    Submitted 28 September, 2018; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: Revised Abstract

  21. arXiv:1703.00759  [pdf, other

    cs.CY

    Real-time public transport service-level monitoring using passive WiFi: a spectral clustering approach for train timetable estimation

    Authors: Baoyang Song, Laura Wynter

    Abstract: A new area in which passive WiFi analytics have promise for delivering value is the real-time monitoring of public transport systems. One example is determining the true (as opposed to the published) timetable of a public transport system in real-time. In most cases, there are no other publicly-available sources for this information. Yet, it is indispensable for the real-time monitoring of public… ▽ More

    Submitted 2 March, 2017; originally announced March 2017.