Showing 1–25 of 25 results for author: Yu, C L

Searching in archive cs.
  1. arXiv:2405.19327  [pdf, other]

    cs.CL cs.AI cs.LG

    MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

    Authors: Ge Zhang, Scott Qu, Jiaheng Liu, Chenchen Zhang, Chenghua Lin, Chou Leuang Yu, Danny Pan, Esther Cheng, Jie Liu, Qunshu Lin, Raven Yuan, Tuney Zheng, Wei Pang, Xinrun Du, Yiming Liang, Yinghao Ma, Yizhi Li, Ziyang Ma, Bill Lin, Emmanouil Benetos, Huan Yang, Junting Zhou, Kaijing Ma, Minghao Liu, Morry Niu, et al. (20 additional authors not shown)

    Abstract: Large Language Models (LLMs) have made great strides in recent years to achieve unprecedented performance across different tasks. However, due to commercial interest, the most competitive models like GPT, Gemini, and Claude have been gated behind proprietary interfaces without disclosing the training details. Recently, many institutions have open-sourced several strong LLMs like LLaMA-3, comparabl…

    Submitted 10 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: https://map-neo.github.io/

  2. arXiv:2405.15984  [pdf, other]

    cs.CL cs.AI

    Evaluating the Adversarial Robustness of Retrieval-Based In-Context Learning for Large Language Models

    Authors: Simon Chi Lok Yu, Jie He, Pasquale Minervini, Jeff Z. Pan

    Abstract: With the emergence of large language models, such as LLaMA and OpenAI GPT-3, In-Context Learning (ICL) gained significant attention due to its effectiveness and efficiency. However, ICL is very sensitive to the choice, order, and verbaliser used to encode the demonstrations in the prompt. Retrieval-Augmented ICL methods try to address this problem by leveraging retrievers to extract semantically r…

    Submitted 10 July, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: COLM 2024, 29 pages, 6 figures

  3. arXiv:2405.05119  [pdf, other]

    stat.ME cs.SI

    Combining Rollout Designs and Clustering for Causal Inference under Low-order Interference

    Authors: Mayleen Cortez-Rodriguez, Matthew Eichhorn, Christina Lee Yu

    Abstract: Estimating causal effects under interference is pertinent to many real-world settings. However, the true interference network may be unknown to the practitioner, precluding many existing techniques that leverage this information. A recent line of work with low-order potential outcomes models uses staggered rollout designs to obtain unbiased estimators that require no network information. However,…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 30 pages, 13 figures

    MSC Class: 62K99 (Primary); 62P30 (Secondary)

  4. arXiv:2404.03176  [pdf, other]

    cs.LG cs.IT

    Information-Theoretic Generalization Bounds for Deep Neural Networks

    Authors: Haiyun He, Christina Lee Yu, Ziv Goldfeld

    Abstract: Deep neural networks (DNNs) exhibit an exceptional capacity for generalization in practical applications. This work aims to capture the effect and benefits of depth for supervised learning via information-theoretic generalization bounds. We first derive two hierarchical bounds on the generalization error in terms of the Kullback-Leibler (KL) divergence or the 1-Wasserstein distance between the tra…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 25 pages, 5 figures

  5. Entry-Specific Bounds for Low-Rank Matrix Completion under Highly Non-Uniform Sampling

    Authors: Xumei Xi, Christina Lee Yu, Yudong Chen

    Abstract: Low-rank matrix completion concerns the problem of estimating unobserved entries in a matrix using a sparse set of observed entries. We consider the non-uniform setting where the observed entries are sampled with highly varying probabilities, potentially with different asymptotic scalings. We show that under structured sampling probabilities, it is often better and sometimes optimal to run estimat…

    Submitted 29 February, 2024; originally announced March 2024.

  6. arXiv:2402.17720  [pdf, other]

    cs.LG cs.DS cs.IT

    The SMART approach to instance-optimal online learning

    Authors: Siddhartha Banerjee, Alankrita Bhatt, Christina Lee Yu

    Abstract: We devise an online learning algorithm -- titled Switching via Monotone Adapted Regret Traces (SMART) -- that adapts to the data and achieves regret that is instance optimal, i.e., simultaneously competitive on every input sequence compared to the performance of the follow-the-leader (FTL) policy and the worst case guarantee of any other input policy. We show that the regret of the SMART policy on…

    Submitted 27 February, 2024; originally announced February 2024.

  7. arXiv:2312.15574  [pdf, other]

    math.ST cs.LG

    Clustered Switchback Experiments: Near-Optimal Rates Under Spatiotemporal Interference

    Authors: Su Jia, Nathan Kallus, Christina Lee Yu

    Abstract: We consider experimentation in the presence of non-stationarity, inter-unit (spatial) interference, and carry-over effects (temporal interference), where we wish to estimate the global average treatment effect (GATE), the difference between average outcomes having exposed all units at all times to treatment or to control. We suppose spatial interference is described by a graph, where a unit's outc…

    Submitted 23 June, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  8. arXiv:2305.15621  [pdf, ps, other]

    cs.LG

    Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure

    Authors: Xumei Xi, Christina Lee Yu, Yudong Chen

    Abstract: We consider offline Reinforcement Learning (RL), where the agent does not interact with the environment and must rely on offline data collected using a behavior policy. Previous works provide policy evaluation guarantees when the target policy to be evaluated is covered by the behavior policy, that is, state-action pairs visited by the target policy must also be visited by the behavior policy. We…

    Submitted 24 May, 2023; originally announced May 2023.

  9. arXiv:2210.11355  [pdf, other]

    econ.EM cs.LG stat.ME

    Network Synthetic Interventions: A Causal Framework for Panel Data Under Network Interference

    Authors: Anish Agarwal, Sarah H. Cen, Devavrat Shah, Christina Lee Yu

    Abstract: We propose a generalization of the synthetic controls and synthetic interventions methodology to incorporate network interference. We consider the estimation of unit-specific potential outcomes from panel data in the presence of spillover across units and unobserved confounding. Key to our approach is a novel latent factor model that takes into account network interference and generalizes the fact…

    Submitted 11 October, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 49 pages, 6 figures

  10. arXiv:2210.00025  [pdf, other]

    cs.LG stat.ML

    Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits

    Authors: Siddhartha Banerjee, Sean R. Sinclair, Milind Tambe, Lily Xu, Christina Lee Yu

    Abstract: How best to incorporate historical data to "warm start" bandit algorithms is an open question: naively initializing reward estimates using all historical samples can suffer from spurious data and imbalanced data coverage, leading to computational and storage issues, particularly salient in continuous action spaces. We propose Artificial Replay, a meta-algorithm for incorporating h…

    Submitted 26 January, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: 36 pages (14 pages main paper), 9 figures

  11. Exploiting Neighborhood Interference with Low Order Interactions under Unit Randomized Design

    Authors: Mayleen Cortez-Rodriguez, Matthew Eichhorn, Christina Lee Yu

    Abstract: Network interference, where the outcome of an individual is affected by the treatment assignment of those in their social network, is pervasive in real-world settings. However, it poses a challenge to estimating causal effects. We consider the task of estimating the total treatment effect (TTE), or the difference between the average outcomes of the population when everyone is treated versus when n…

    Submitted 5 February, 2024; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: 42 pages including citations and appendix, 2 figures (total of 12 subfigures)

    MSC Class: 62K99; 91D30; 60F05

    Journal ref: Journal of Causal Inference, vol. 11, no. 1, 2023, pp. 20220051

  12. arXiv:2206.03569  [pdf, other]

    cs.LG

    Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank Structure

    Authors: Tyler Sam, Yudong Chen, Christina Lee Yu

    Abstract: The practicality of reinforcement learning algorithms has been limited due to poor scaling with respect to the problem size, as the sample complexity of learning an $\epsilon$-optimal policy is $\tilde{\Omega}\left(|S||A|H^3 / \epsilon^2\right)$ over worst case instances of an MDP with state space $S$, action space $A$, and horizon $H$. We consider a class of MDPs for which the associated optimal $Q^*$ function is low…

    Submitted 9 June, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  13. arXiv:2205.14552  [pdf, other]

    stat.ME cs.SI

    Staggered Rollout Designs Enable Causal Inference Under Interference Without Network Knowledge

    Authors: Mayleen Cortez, Matthew Eichhorn, Christina Lee Yu

    Abstract: Randomized experiments are widely used to estimate causal effects across a variety of domains. However, classical causal inference approaches rely on critical independence assumptions that are violated by network interference, when the treatment of one individual influences the outcomes of others. All existing approaches require at least approximate knowledge of the network, which may be unavailab…

    Submitted 14 October, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: 28 pages, 6 figures, accepted to Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  14. Estimating Total Treatment Effect in Randomized Experiments with Unknown Network Structure

    Authors: Christina Lee Yu, Edoardo M Airoldi, Christian Borgs, Jennifer T Chayes

    Abstract: Randomized experiments are widely used to estimate the causal effects of a proposed treatment in many areas of science, from medicine and healthcare to the physical and biological sciences, from the social sciences to engineering, to public policy and to the technology industry at large. Here, we consider situations where classical methods for estimating the total treatment effect on a target popu…

    Submitted 24 September, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  15. arXiv:2204.07821  [pdf, other]

    math.ST cs.CG math.AT stat.ML

    Detection of Small Holes by the Scale-Invariant Robust Density-Aware Distance (RDAD) Filtration

    Authors: Chunyin Siu, Gennady Samorodnitsky, Christina Lee Yu, Andrey Yao

    Abstract: A novel topological-data-analytical (TDA) method is proposed to distinguish, from noise, small holes surrounded by high-density regions of a probability density function. The proposed method is robust against additive noise and outliers. Traditional TDA tools, like those based on the distance filtration, often struggle to distinguish small features from noise, because both have short persistences.…

    Submitted 30 March, 2024; v1 submitted 16 April, 2022; originally announced April 2022.

    Comments: 39 pages, 38 figs, J Appl. and Comput. Topology (2024). GitHub: [github.com/c-siu/RDAD]. Published version: [rdcu.be/dCXLa]. Diff of v2/3: added publication info, NO post-submission improvements (Cor2-3 rephrased and proven, setup of Sec4.1 explained, complexity computed in Sec6.1, Thm5 simplified, comparison with DTM in Sec1,8, streamlining), so no change in pdf. Diff of v1/2: more thms, more discussion on conformality, fewer egs

    MSC Class: 62R40; 55N31; 52R40; 68T09

  16. arXiv:2110.15843  [pdf, other]

    stat.ML cs.LG

    Adaptive Discretization in Online Reinforcement Learning

    Authors: Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu

    Abstract: Discretization based approaches to solving online reinforcement learning problems have been studied extensively in practice on applications ranging from resource allocation to cache management. Two major questions in designing discretization-based algorithms are how to create the discretization and when to refine it. While there have been several experimental results investigating heuristic soluti…

    Submitted 10 October, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

    Comments: 77 pages, 7 figures. arXiv admin note: text overlap with arXiv:2007.00717

    MSC Class: 68Q32; ACM Class: I.2.6

  17. arXiv:2110.13969  [pdf, other]

    stat.ML cs.LG

    Nonparametric Matrix Estimation with One-Sided Covariates

    Authors: Christina Lee Yu

    Abstract: Consider the task of matrix estimation in which a dataset $X \in \mathbb{R}^{n\times m}$ is observed with sparsity $p$, and we would like to estimate $\mathbb{E}[X]$, where $\mathbb{E}[X_{ui}] = f(α_u, β_i)$ for some Hölder smooth function $f$. We consider the setting where the row covariates $α$ are unobserved yet the column covariates $β$ are observed. We provide an algorithm and accompanying an…

    Submitted 26 October, 2021; originally announced October 2021.

  18. arXiv:2105.05308  [pdf, other]

    cs.GT eess.SY math.OC

    Sequential Fair Allocation: Achieving the Optimal Envy-Efficiency Tradeoff Curve

    Authors: Sean R. Sinclair, Gauri Jain, Siddhartha Banerjee, Christina Lee Yu

    Abstract: We consider the problem of dividing limited resources to individuals arriving over $T$ rounds. Each round has a random number of individuals arrive, and individuals can be characterized by their type (i.e. preferences over the different resources). A standard notion of 'fairness' in this setting is that an allocation simultaneously satisfy envy-freeness and efficiency. The former is an individual…

    Submitted 29 September, 2022; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: 42 pages, 5 figures

    MSC Class: 91B32

  19. arXiv:2011.14382  [pdf, other]

    cs.GT eess.SY math.OC

    Sequential Fair Allocation of Limited Resources under Stochastic Demands

    Authors: Sean R. Sinclair, Gauri Jain, Siddhartha Banerjee, Christina Lee Yu

    Abstract: We consider the problem of dividing limited resources between a set of agents arriving sequentially with unknown (stochastic) utilities. Our goal is to find a fair allocation - one that is simultaneously Pareto-efficient and envy-free. When all utilities are known upfront, the above desiderata are simultaneously achievable (and efficiently computable) for a large class of utility functions. In a s…

    Submitted 9 July, 2022; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: See arXiv:2105.05308 for an updated version. 36 pages, 6 figures

    MSC Class: 91B32

  20. arXiv:2007.00736  [pdf, other]

    stat.ML cs.LG math.NA

    Tensor Estimation with Nearly Linear Samples Given Weak Side Information

    Authors: Christina Lee Yu

    Abstract: Tensor completion exhibits an interesting computational-statistical gap in terms of the number of samples needed to perform tensor estimation. While there are only $Θ(tn)$ degrees of freedom in a $t$-order tensor with $n^t$ entries, the best known polynomial time algorithm requires $O(n^{t/2})$ samples in order to guarantee consistent estimation. In this paper, we show that weak side information i…

    Submitted 10 September, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

  21. arXiv:2007.00717  [pdf, other]

    cs.LG stat.ML

    Adaptive Discretization for Model-Based Reinforcement Learning

    Authors: Sean R. Sinclair, Tianyu Wang, Gauri Jain, Siddhartha Banerjee, Christina Lee Yu

    Abstract: We introduce the technique of adaptive discretization to design an efficient model-based episodic reinforcement learning algorithm in large (potentially continuous) state-action spaces. Our algorithm is based on optimistic one-step value iteration extended to maintain an adaptive discretization of the space. From a theoretical perspective we provide worst-case regret bounds for our algorithm which…

    Submitted 23 October, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: 50 pages, 7 figures

    MSC Class: 68Q32; ACM Class: I.2.6

  22. arXiv:1910.08151  [pdf, other]

    cs.LG stat.ML

    Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces

    Authors: Sean R. Sinclair, Siddhartha Banerjee, Christina Lee Yu

    Abstract: We present an efficient algorithm for model-free episodic reinforcement learning on large (potentially continuous) state-action spaces. Our algorithm is based on a novel $Q$-learning policy with adaptive data-driven discretization. The central idea is to maintain a finer partition of the state-action space in regions which are frequently visited in historical trajectories, and have higher payoff e…

    Submitted 31 October, 2019; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: 46 pages, 15 figures

    MSC Class: 68Q32; ACM Class: I.2.6

  23. arXiv:1908.01241  [pdf, other]

    cs.LG cs.DS stat.ML

    Robust Max Entrywise Error Bounds for Tensor Estimation from Sparse Observations via Similarity Based Collaborative Filtering

    Authors: Devavrat Shah, Christina Lee Yu

    Abstract: Consider the task of estimating a 3-order $n \times n \times n$ tensor from noisy observations of randomly chosen entries in the sparse regime. We introduce a similarity based collaborative filtering algorithm for estimating a tensor from sparse observations and argue that it achieves sample complexity that nearly matches the conjectured computationally efficient lower bound on the sample complexi…

    Submitted 17 January, 2023; v1 submitted 3 August, 2019; originally announced August 2019.

  24. arXiv:1908.01228  [pdf, other]

    cs.LG stat.ML

    Nonparametric Contextual Bandits in an Unknown Metric Space

    Authors: Nirandika Wanigasekara, Christina Lee Yu

    Abstract: Consider a nonparametric contextual multi-arm bandit problem where each arm $a \in [K]$ is associated to a nonparametric reward function $f_a: [0,1] \to \mathbb{R}$ mapping from contexts to the expected reward. Suppose that there is a large set of arms, yet there is a simple but unknown structure amongst the arm reward functions, e.g. finite types or smooth with respect to an unknown metric space.…

    Submitted 3 August, 2019; originally announced August 2019.

  25. arXiv:1411.2647  [pdf, other]

    cs.DS

    Asynchronous Approximation of a Single Component of the Solution to a Linear System

    Authors: Asuman Ozdaglar, Devavrat Shah, Christina Lee Yu

    Abstract: We present a distributed asynchronous algorithm for approximating a single component of the solution to a system of linear equations $Ax = b$, where $A$ is a positive definite real matrix, and $b \in \mathbb{R}^n$. This is equivalent to solving for $x_i$ in $x = Gx + z$ for some $G$ and $z$ such that the spectral radius of $G$ is less than 1. Our algorithm relies on the Neumann series characteriza…

    Submitted 21 January, 2019; v1 submitted 10 November, 2014; originally announced November 2014.

    Report number: MIT LIDS Report 3172