Showing 1–34 of 34 results for author: Rogers, R

Searching in archive cs.
  1. arXiv:2408.04424

    cs.LG

    Detection of Animal Movement from Weather Radar using Self-Supervised Learning

    Authors: Mubin Ul Haque, Joel Janek Dabrowski, Rebecca M. Rogers, Hazel Parry

    Abstract: Detecting flying animals (e.g., birds, bats, and insects) using weather radar helps gain insights into animal movement and migration patterns, aids in management efforts (such as biosecurity) and enhances our understanding of the ecosystem. The conventional approach to detecting animals in weather radar involves thresholding: defining and applying thresholds for the radar variables, based on expert…

    Submitted 8 August, 2024; originally announced August 2024.

  2. arXiv:2407.11733

    cs.CL

    How Are LLMs Mitigating Stereotyping Harms? Learning from Search Engine Studies

    Authors: Alina Leidinger, Richard Rogers

    Abstract: With the widespread availability of LLMs since the release of ChatGPT and increased public scrutiny, commercial model development appears to have focused their efforts on 'safety' training concerning legal liabilities at the expense of social impact evaluation. This mimics a similar trend which we could observe for search engine autocompletion some years prior. We draw on scholarship from NLP and…

    Submitted 1 August, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted at AAAI/ACM AI, Ethics, and Society

  3. arXiv:2403.05073

    cs.CR

    Private Count Release: A Simple and Scalable Approach for Private Data Analytics

    Authors: Ryan Rogers

    Abstract: We present a data analytics system that ensures accurate counts can be released with differential privacy and minimal onboarding effort while showing instances that outperform other approaches that require more onboarding effort. The primary difference between our proposal and existing approaches is that it does not rely on user contribution bounds over distinct elements, i.e. $\ell_0$-sensitivity…

    Submitted 8 March, 2024; originally announced March 2024.

  4. Demonstrative Evidence and the Use of Algorithms in Jury Trials

    Authors: Rachel Rogers, Susan VanderPlas

    Abstract: We investigate how the use of bullet comparison algorithms and demonstrative evidence may affect juror perceptions of reliability, credibility, and understanding of expert witnesses and presented evidence. The use of statistical methods in forensic science is motivated by a lack of scientific validity and error rate issues present in many forensic analysis methods. We explore what our study says a…

    Submitted 16 May, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  5. arXiv:2310.06725

    q-bio.BM cs.LG

    Growing ecosystem of deep learning methods for modeling protein–protein interactions

    Authors: Julia R. Rogers, Gergő Nikolényi, Mohammed AlQuraishi

    Abstract: Numerous cellular functions rely on protein–protein interactions. Efforts to comprehensively characterize them remain challenged however by the diversity of molecular recognition mechanisms employed within the proteome. Deep learning has emerged as a promising approach for tackling this problem by exploiting both experimental data and basic biophysical knowledge about protein inter…

    Submitted 6 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 19 pages, added model names to discussion

  6. arXiv:2309.09170

    cs.CR

    A Unifying Privacy Analysis Framework for Unknown Domain Algorithms in Differential Privacy

    Authors: Ryan Rogers

    Abstract: There are many existing differentially private algorithms for releasing histograms, i.e. counts with corresponding labels, in various settings. Our focus in this survey is to revisit some of the existing differentially private algorithms for releasing histograms over unknown domains, i.e. the labels of the counts that are to be released are not known beforehand. The main practical advantage of rel…

    Submitted 1 August, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

  7. arXiv:2306.13824

    cs.CR cs.DS cs.LG

    Adaptive Privacy Composition for Accuracy-first Mechanisms

    Authors: Ryan Rogers, Gennady Samorodnitsky, Zhiwei Steven Wu, Aaditya Ramdas

    Abstract: In many practical applications of differential privacy, practitioners seek to provide the best privacy guarantees subject to a target level of accuracy. A recent line of work by Ligett et al. '17 and Whitehouse et al. '22 has developed such accuracy-first mechanisms by leveraging the idea of noise reduction that adds correlated noise to the sufficient statistic in a private computation and produce…

    Submitted 5 December, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

  8. arXiv:2304.06929

    cs.CR

    Advancing Differential Privacy: Where We Are Now and Future Directions for Real-World Deployment

    Authors: Rachel Cummings, Damien Desfontaines, David Evans, Roxana Geambasu, Yangsibo Huang, Matthew Jagielski, Peter Kairouz, Gautam Kamath, Sewoong Oh, Olga Ohrimenko, Nicolas Papernot, Ryan Rogers, Milan Shen, Shuang Song, Weijie Su, Andreas Terzis, Abhradeep Thakurta, Sergei Vassilvitskii, Yu-Xiang Wang, Li Xiong, Sergey Yekhanin, Da Yu, Huanyu Zhang, Wanrong Zhang

    Abstract: In this article, we present a detailed review of current practices and state-of-the-art methodologies in the field of differential privacy (DP), with a focus on advancing DP's deployment in real-world applications. Key points and high-level contents of the article originated from the discussions at "Differential Privacy (DP): Challenges Towards the Next Frontier," a workshop held in July 20…

    Submitted 12 March, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

  9. arXiv:2206.07234

    cs.LG cs.CR cs.DS stat.ML

    Brownian Noise Reduction: Maximizing Privacy Subject to Accuracy Constraints

    Authors: Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas, Ryan Rogers

    Abstract: There is a disconnect between how researchers and practitioners handle privacy-utility tradeoffs. Researchers primarily operate from a privacy first perspective, setting strict privacy requirements and minimizing risk subject to these constraints. Practitioners often desire an accuracy first perspective, possibly satisfied with the greatest privacy they can get subject to obtaining sufficiently sm…

    Submitted 10 November, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 26 pages, 4 figures

  10. arXiv:2203.05481

    cs.LG cs.CR cs.DS stat.ML

    Fully Adaptive Composition in Differential Privacy

    Authors: Justin Whitehouse, Aaditya Ramdas, Ryan Rogers, Zhiwei Steven Wu

    Abstract: Composition is a key feature of differential privacy. Well-known advanced composition theorems allow one to query a private database quadratically more times than basic privacy composition would permit. However, these results require that the privacy parameters of all algorithms be fixed before interacting with the data. To address this, Rogers et al. introduced fully adaptive composition, wherein…

    Submitted 24 October, 2023; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: 23 pages, 3 figures

  11. arXiv:2103.16787

    cs.DS cs.CR

    Differentially Private Histograms under Continual Observation: Streaming Selection into the Unknown

    Authors: Adrian Rivera Cardoso, Ryan Rogers

    Abstract: We generalize the continuous observation privacy setting from Dwork et al. '10 and Chan et al. '11 by allowing each event in a stream to be a subset of some (possibly unknown) universe of items. We design differentially private (DP) algorithms for histograms in several settings, including top-$k$ selection, with privacy loss that scales with polylog$(T)$, where $T$ is the maximum length of the inp…

    Submitted 4 January, 2022; v1 submitted 30 March, 2021; originally announced March 2021.

  12. arXiv:2010.13981

    cs.CR

    A Members First Approach to Enabling LinkedIn's Labor Market Insights at Scale

    Authors: Ryan Rogers, Adrian Rivera Cardoso, Koray Mancuhan, Akash Kaura, Nikhil Gahlawat, Neha Jain, Paul Ko, Parvez Ahammad

    Abstract: We describe the privatization method used in reporting labor market insights from LinkedIn's Economic Graph, including the differentially private algorithms used to protect members' privacy. The reports show who are the top employers, as well as what are the top jobs and skills in a given country/region and industry. We hope this data will help governments and citizens track labor market trends du…

    Submitted 26 October, 2020; originally announced October 2020.

  13. arXiv:2004.07223

    cs.CR cs.LG

    Bounding, Concentrating, and Truncating: Unifying Privacy Loss Composition for Data Analytics

    Authors: Mark Cesar, Ryan Rogers

    Abstract: Differential privacy (DP) provides rigorous privacy guarantees on individuals' data while also allowing for accurate statistics to be conducted on the overall, sensitive dataset. To design a private system, first private algorithms must be designed that can quantify the privacy loss of each outcome that is released. However, private algorithms that inject noise into the computation are not suffici…

    Submitted 17 November, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

  14. arXiv:2002.05839

    cs.CR

    LinkedIn's Audience Engagements API: A Privacy Preserving Data Analytics System at Scale

    Authors: Ryan Rogers, Subbu Subramaniam, Sean Peng, David Durfee, Seunghyun Lee, Santosh Kumar Kancha, Shraddha Sahay, Parvez Ahammad

    Abstract: We present a privacy system that leverages differential privacy to protect LinkedIn members' data while also providing audience engagement insights to enable marketing analytics related applications. We detail the differentially private algorithms and other privacy safeguards used to provide results that can be used with existing real-time data analytics platforms, specifically with the open sourc…

    Submitted 16 November, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

  15. arXiv:1909.13830

    cs.CR cs.DS

    Optimal Differential Privacy Composition for Exponential Mechanisms and the Cost of Adaptivity

    Authors: Jinshuo Dong, David Durfee, Ryan Rogers

    Abstract: Composition is one of the most important properties of differential privacy (DP), as it allows algorithm designers to build complex private algorithms from DP primitives. We consider precise composition bounds of the overall privacy loss for exponential mechanisms, one of the fundamental classes of mechanisms in DP. We give explicit formulations of the optimal privacy loss for both the adaptive an…

    Submitted 24 June, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

  16. arXiv:1906.09231

    cs.LG math.ST stat.ML

    Guaranteed Validity for Empirical Approaches to Adaptive Data Analysis

    Authors: Ryan Rogers, Aaron Roth, Adam Smith, Nathan Srebro, Om Thakkar, Blake Woodworth

    Abstract: We design a general framework for answering adaptive statistical queries that focuses on providing explicit confidence intervals along with point estimates. Prior work in this area has either focused on providing tight confidence intervals for specific analyses, or providing general worst-case bounds for point estimates. Unfortunately, as we observe, these worst-case bounds are loose in many setti…

    Submitted 9 March, 2020; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: Accepted to appear in the proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

  17. arXiv:1905.04273

    cs.CR

    Practical Differentially Private Top-$k$ Selection with Pay-what-you-get Composition

    Authors: David Durfee, Ryan Rogers

    Abstract: We study the problem of top-$k$ selection over a large domain universe subject to user-level differential privacy. Typically, the exponential mechanism or report noisy max are the algorithms used to solve this problem. However, these algorithms require querying the database for the count of each domain element. We focus on the setting where the data domain is unknown, which is different than the s…

    Submitted 17 September, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

  18. arXiv:1904.08721

    cs.CL cs.CY cs.SI

    Societal Controversies in Wikipedia Articles

    Authors: Erik Borra, Andreas Kaltenbrunner, Michele Mauri, Esther Weltevrede, David Laniado, Richard Rogers, Paolo Ciuccarelli, Giovanni Magni, Tommaso Venturini

    Abstract: Collaborative content creation inevitably reaches situations where different points of view lead to conflict. We focus on Wikipedia, the free encyclopedia anyone may edit, where disputes about content in controversial articles often reflect larger societal debates. While Wikipedia has a public edit history and discussion section for every article, the substance of these sections is difficult to ph…

    Submitted 18 April, 2019; originally announced April 2019.

    Journal ref: the 33rd Annual ACM Conference, Apr 2015, Seoul, South Korea. pp.193-196

  19. arXiv:1812.00984

    stat.ML cs.LG

    Protection Against Reconstruction and Its Applications in Private Federated Learning

    Authors: Abhishek Bhowmick, John Duchi, Julien Freudiger, Gaurav Kapoor, Ryan Rogers

    Abstract: In large-scale statistical learning, data collection and model fitting are moving increasingly toward peripheral devices---phones, watches, fitness trackers---away from centralized data collection. Concomitant with this rise in decentralized data are increasing challenges of maintaining privacy while allowing enough information to fit accurate, useful statistical models. This motivates local notio…

    Submitted 3 June, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

  20. arXiv:1810.08054

    cs.DS

    Locally Private Mean Estimation: Z-test and Tight Confidence Intervals

    Authors: Marco Gaboardi, Ryan Rogers, Or Sheffet

    Abstract: This work provides tight upper- and lower-bounds for the problem of mean estimation under $ε$-differential privacy in the local model, when the input is composed of $n$ i.i.d. drawn samples from a normal distribution with variance $σ$. Our algorithms result in a $(1-β)$-confidence interval for the underlying distribution's mean $μ$ of length…

    Submitted 10 April, 2019; v1 submitted 18 October, 2018; originally announced October 2018.

  21. Towards Better Understanding Researcher Strategies in Cross-Lingual Event Analytics

    Authors: Simon Gottschalk, Viola Bernacchi, Richard Rogers, Elena Demidova

    Abstract: With an increasing amount of information on globally important events, there is a growing demand for efficient analytics of multilingual event-centric information. Such analytics is particularly challenging due to the large amount of content, the event dynamics and the language barrier. Although memory institutions increasingly collect event-centric Web content in different languages, very little…

    Submitted 21 September, 2018; originally announced September 2018.

    Comments: In Proceedings of the International Conference on Theory and Practice of Digital Libraries 2018

  22. Ongoing Events in Wikipedia: A Cross-lingual Case Study

    Authors: Simon Gottschalk, Elena Demidova, Viola Bernacchi, Richard Rogers

    Abstract: In order to effectively analyze information regarding ongoing events that impact local communities across language and country borders, researchers often need to perform multilingual data analysis. This analysis can be particularly challenging due to the rapidly evolving event-centric data and the language barrier. In this abstract we present preliminary results of a case study with the goal to be…

    Submitted 22 January, 2018; originally announced January 2018.

    Comments: Proceedings of the 2017 ACM on Web Science Conference

  23. arXiv:1709.07155

    math.ST cs.CR

    Local Private Hypothesis Testing: Chi-Square Tests

    Authors: Marco Gaboardi, Ryan Rogers

    Abstract: The local model for differential privacy is emerging as the reference model for practical applications collecting and sharing sensitive information while satisfying strong privacy guarantees. In the local model, there is no trusted entity which is allowed to have each individual's raw data as is assumed in the traditional curator model for differential privacy. So, individuals' data are usually pe…

    Submitted 8 March, 2018; v1 submitted 21 September, 2017; originally announced September 2017.

  24. arXiv:1702.07810

    cs.GT

    A Decomposition of Forecast Error in Prediction Markets

    Authors: Miroslav Dudík, Sébastien Lahaie, Ryan Rogers, Jennifer Wortman Vaughan

    Abstract: We analyze sources of error in prediction market forecasts in order to bound the difference between a security's price and the ground truth it estimates. We consider cost-function-based prediction markets in which an automated market maker adjusts security prices according to the history of trade. We decompose the forecasting error into three components: sampling error, arising because traders onl…

    Submitted 20 February, 2018; v1 submitted 24 February, 2017; originally announced February 2017.

    Journal ref: Advances in Neural Information Processing Systems 30 (NIPS 2017)

  25. arXiv:1610.07662

    math.ST cs.CR

    A New Class of Private Chi-Square Tests

    Authors: Daniel Kifer, Ryan Rogers

    Abstract: In this paper, we develop new test statistics for private hypothesis testing. These statistics are designed specifically so that their asymptotic distributions, after accounting for noise added for privacy concerns, match the asymptotics of the classical (non-private) chi-square tests for testing if the multinomial data parameters lie in lower dimensional manifolds (examples include goodness of fi…

    Submitted 24 October, 2016; originally announced October 2016.

  26. arXiv:1605.08294

    cs.CR

    Privacy Odometers and Filters: Pay-as-you-Go Composition

    Authors: Ryan Rogers, Aaron Roth, Jonathan Ullman, Salil Vadhan

    Abstract: In this paper we initiate the study of adaptive composition in differential privacy when the length of the composition, and the privacy parameters themselves can be chosen adaptively, as a function of the outcome of previously run analyses. This case is much more delicate than the setting covered by existing composition theorems, in which the algorithms themselves can be chosen adaptively, but the…

    Submitted 5 August, 2021; v1 submitted 26 May, 2016; originally announced May 2016.

  27. arXiv:1604.03924

    cs.LG

    Max-Information, Differential Privacy, and Post-Selection Hypothesis Testing

    Authors: Ryan Rogers, Aaron Roth, Adam Smith, Om Thakkar

    Abstract: In this paper, we initiate a principled study of how the generalization properties of approximate differential privacy can be used to perform adaptive hypothesis testing, while giving statistically valid $p$-value corrections. We do this by observing that the guarantees of algorithms with bounded approximate max-information are sufficient to correct the $p$-values of adaptively chosen hypotheses,…

    Submitted 9 September, 2016; v1 submitted 13 April, 2016; originally announced April 2016.

  28. arXiv:1602.03090

    math.ST cs.CR

    Differentially Private Chi-Squared Hypothesis Testing: Goodness of Fit and Independence Testing

    Authors: Marco Gaboardi, Hyun woo Lim, Ryan Rogers, Salil Vadhan

    Abstract: Hypothesis testing is a useful statistical tool in determining whether a given model should be rejected based on a sample from the population. Sample data may contain sensitive information about individuals, such as medical information. Thus it is important to design statistical tests that guarantee the privacy of subjects in the data. In this work, we study hypothesis testing subject to different…

    Submitted 2 June, 2016; v1 submitted 7 February, 2016; originally announced February 2016.

  29. arXiv:1512.02698

    cs.GT

    Robust Mediators in Large Games

    Authors: Michael Kearns, Mallesh M. Pai, Ryan Rogers, Aaron Roth, Jonathan Ullman

    Abstract: A mediator is a mechanism that can only suggest actions to players, as a function of all agents' reported types, in a given game of incomplete information. We study what is achievable by two kinds of mediators, "strong" and "weak." Players can choose to opt-out of using a strong mediator but cannot misrepresent their type if they opt-in. Such a mediator is "strong" because we can view it as having…

    Submitted 10 December, 2015; v1 submitted 8 December, 2015; originally announced December 2015.

    Comments: This work unifies and subsumes the two papers "Mechanism design in large games: incentives and privacy" ITCS'14 (arXiv:1207.4084) and "Asymptotically truthful equilibrium selection in large congestion games" EC '14 (arXiv:1311.2625)

  30. Do Prices Coordinate Markets?

    Authors: Justin Hsu, Jamie Morgenstern, Ryan Rogers, Aaron Roth, Rakesh Vohra

    Abstract: Walrasian equilibrium prices can be said to coordinate markets: They support a welfare optimal allocation in which each buyer is buying a bundle of goods that is individually most preferred. However, this clean story has two caveats. First, the prices alone are not sufficient to coordinate the market, and buyers may need to select among their most preferred bundles in a coordinated way to find a fea…

    Submitted 22 June, 2016; v1 submitted 3 November, 2015; originally announced November 2015.

  31. arXiv:1506.02162

    cs.DS cs.GT cs.LG

    Learning from Rational Behavior: Predicting Solutions to Unknown Linear Programs

    Authors: Shahin Jabbari, Ryan Rogers, Aaron Roth, Zhiwei Steven Wu

    Abstract: We define and study the problem of predicting the solution to a linear program (LP) given only partial information about its objective and constraints. This generalizes the problem of learning to predict the purchasing behavior of a rational agent who has an unknown objective function, that has been studied under the name "Learning from Revealed Preferences". We give mistake bound learning algorit…

    Submitted 26 October, 2016; v1 submitted 6 June, 2015; originally announced June 2015.

    Comments: The short version of this paper appears in the proceedings of NIPS-16

  32. arXiv:1502.04019

    cs.GT

    Inducing Approximately Optimal Flow Using Truthful Mediators

    Authors: Ryan Rogers, Aaron Roth, Jonathan Ullman, Zhiwei Steven Wu

    Abstract: We revisit a classic coordination problem from the perspective of mechanism design: how can we coordinate a social welfare maximizing flow in a network congestion game with selfish players? The classical approach, which computes tolls as a function of known demands, fails when the demands are unknown to the mechanism designer, and naively eliciting them does not necessarily yield a truthful mechan…

    Submitted 8 May, 2015; v1 submitted 13 February, 2015; originally announced February 2015.

    Comments: Version with latencies not normalized

  33. arXiv:1407.2641

    cs.GT

    Private Pareto Optimal Exchange

    Authors: Sampath Kannan, Jamie Morgenstern, Ryan Rogers, Aaron Roth

    Abstract: We consider the problem of implementing an individually rational, asymptotically Pareto optimal allocation in a barter-exchange economy where agents are endowed with goods and have preferences over the goods of others, but may not use money as a medium of exchange. Because one of the most important instantiations of such economies is kidney exchange -- where the "input" to the problem consists of s…

    Submitted 12 February, 2015; v1 submitted 9 July, 2014; originally announced July 2014.

  34. arXiv:1311.2625   

    cs.GT cs.CR cs.DS

    Asymptotically Truthful Equilibrium Selection in Large Congestion Games

    Authors: Ryan Rogers, Aaron Roth

    Abstract: Studying games in the complete information model makes them analytically tractable. However, large $n$ player interactions are more realistically modeled as games of incomplete information, where players may know little to nothing about the types of other players. Unfortunately, games in incomplete information settings lose many of the nice properties of complete information games: the quality of…

    Submitted 10 December, 2015; v1 submitted 11 November, 2013; originally announced November 2013.

    Comments: The conference version of this paper appeared in EC 2014. This manuscript has been merged and subsumed by the preprint "Robust Mediators in Large Games": http://arxiv.org/abs/1512.02698