Zum Hauptinhalt springen

Showing 51–69 of 69 results for author: Chi, E H

.
  1. arXiv:2006.13114  [pdf, other

    cs.LG stat.ML

    Fairness without Demographics through Adversarially Reweighted Learning

    Authors: Preethi Lahoti, Alex Beutel, Jilin Chen, Kang Lee, Flavien Prost, Nithum Thain, Xuezhi Wang, Ed H. Chi

    Abstract: Much of the previous machine learning (ML) fairness literature assumes that protected features such as race and sex are present in the dataset, and relies upon them to mitigate fairness concerns. However, in practice factors like privacy and regulation often preclude the collection of protected features, or their use for training or inference, severely limiting the applicability of traditional fai… ▽ More

    Submitted 3 November, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: To appear at 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  2. arXiv:2006.05067  [pdf, other

    cs.LG stat.ML

    Learning-to-Rank with Partitioned Preference: Fast Estimation for the Plackett-Luce Model

    Authors: Jiaqi Ma, Xinyang Yi, Weijing Tang, Zhe Zhao, Lichan Hong, Ed H. Chi, Qiaozhu Mei

    Abstract: We investigate the Plackett-Luce (PL) model based listwise learning-to-rank (LTR) on data with partitioned preference, where a set of items are sliced into ordered and disjoint partitions, but the ranking of items within a partition is unknown. Given $N$ items with $M$ partitions, calculating the likelihood of data with partitioned preference under the PL model has a time complexity of $O(N+S!)$,… ▽ More

    Submitted 25 February, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  3. arXiv:2003.07336  [pdf, ps, other

    cs.LG cs.PF stat.ML

    Developing a Recommendation Benchmark for MLPerf Training and Inference

    Authors: Carole-Jean Wu, Robin Burke, Ed H. Chi, Joseph Konstan, Julian McAuley, Yves Raimond, Hao Zhang

    Abstract: Deep learning-based recommendation models are used pervasively and broadly, for example, to recommend movies, products, or other information most relevant to users, in order to enhance the user experience. Among various application domains which have received significant industry and academia research attention, such as image classification, object detection, language and speech translation, the p… ▽ More

    Submitted 14 April, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

  4. arXiv:2002.08530  [pdf, other

    cs.IR

    Learning Multi-granular Quantized Embeddings for Large-Vocab Categorical Features in Recommender Systems

    Authors: Wang-Cheng Kang, Derek Zhiyuan Cheng, Ting Chen, Xinyang Yi, Dong Lin, Lichan Hong, Ed H. Chi

    Abstract: Recommender system models often represent various sparse features like users, items, and categorical features via embeddings. A standard approach is to map each unique feature value to an embedding vector. The size of the produced embedding table grows linearly with the size of the vocabulary. Therefore, a large vocabulary inevitably leads to a gigantic embedding table, creating two severe problem… ▽ More

    Submitted 24 August, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: longer version

  5. arXiv:2002.03532  [pdf, other

    cs.LG cs.AI stat.ML

    Understanding and Improving Knowledge Distillation

    Authors: Jiaxi Tang, Rakesh Shivanna, Zhe Zhao, Dong Lin, Anima Singh, Ed H. Chi, Sagar Jain

    Abstract: Knowledge Distillation (KD) is a model-agnostic technique to improve model quality while having a fixed capacity budget. It is a commonly used technique for model compression, where a larger capacity teacher model with better quality is used to train a more compact student model with better inference efficiency. Through distillation, one hopes to benefit from student's compactness, without sacrifi… ▽ More

    Submitted 28 February, 2021; v1 submitted 9 February, 2020; originally announced February 2020.

  6. arXiv:1911.01916  [pdf, other

    cs.LG stat.ML

    Practical Compositional Fairness: Understanding Fairness in Multi-Component Recommender Systems

    Authors: Xuezhi Wang, Nithum Thain, Anu Sinha, Flavien Prost, Ed H. Chi, Jilin Chen, Alex Beutel

    Abstract: How can we build recommender systems to take into account fairness? Real-world recommender systems are often composed of multiple models, built by multiple teams. However, most research on fairness focuses on improving fairness in a single model. Further, recent research on classification fairness has shown that combining multiple "fair" classifiers can still result in an "unfair" classification s… ▽ More

    Submitted 25 January, 2021; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: WSDM 2021

  7. arXiv:1910.11779  [pdf, other

    cs.LG stat.ML

    Toward a better trade-off between performance and fairness with kernel-based distribution matching

    Authors: Flavien Prost, Hai Qian, Qiuwen Chen, Ed H. Chi, Jilin Chen, Alex Beutel

    Abstract: As recent literature has demonstrated how classifiers often carry unintended biases toward some subgroups, deploying machine learned models to users demands careful consideration of the social consequences. How should we address this problem in a real-world system? How should we balance core performance and fairness metrics? In this paper, we introduce a MinDiff framework for regularizing classifi… ▽ More

    Submitted 25 October, 2019; originally announced October 2019.

  8. arXiv:1906.09688  [pdf, other

    cs.LG stat.ML

    Transfer of Machine Learning Fairness across Domains

    Authors: Candice Schumann, Xuezhi Wang, Alex Beutel, Jilin Chen, Hai Qian, Ed H. Chi

    Abstract: If our models are used in new or unexpected cases, do we know if they will make fair predictions? Previously, researchers developed ways to debias a model for a single problem domain. However, this is often not how models are trained and used in practice. For example, labels and demographics (sensitive attributes) are often hard to observe, resulting in auxiliary or synthetic data to be used for t… ▽ More

    Submitted 14 November, 2019; v1 submitted 23 June, 2019; originally announced June 2019.

  9. arXiv:1905.09414  [pdf, other

    cs.LG stat.ML

    Quantifying Long Range Dependence in Language and User Behavior to improve RNNs

    Authors: Francois Belletti, Minmin Chen, Ed H. Chi

    Abstract: Characterizing temporal dependence patterns is a critical step in understanding the statistical properties of sequential data. Long Range Dependence (LRD) --- referring to long-range correlations decaying as a power law rather than exponentially w.r.t. distance --- demands a different set of tools for modeling the underlying dynamics of the sequential data. While it has been widely conjectured tha… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

  10. arXiv:1903.00780  [pdf, other

    cs.CY cs.AI cs.IR cs.LG stat.ML

    Fairness in Recommendation Ranking through Pairwise Comparisons

    Authors: Alex Beutel, Jilin Chen, Tulsee Doshi, Hai Qian, Li Wei, Yi Wu, Lukasz Heldt, Zhe Zhao, Lichan Hong, Ed H. Chi, Cristos Goodrow

    Abstract: Recommender systems are one of the most pervasive applications of machine learning in industry, with many services using them to match users to products or information. As such it is important to ask: what are the possible fairness risks, how can we quantify them, and how should we address them? In this paper we offer a set of novel metrics for evaluating algorithmic fairness concerns in recommend… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

  11. arXiv:1902.09689  [pdf, other

    stat.ML cs.LG

    AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks

    Authors: Bo Chang, Minmin Chen, Eldad Haber, Ed H. Chi

    Abstract: Recurrent neural networks have gained widespread use in modeling sequential data. Learning long-term dependencies using these models remains difficult though, due to exploding or vanishing gradients. In this paper, we draw connections between recurrent networks and ordinary differential equations. A special form of recurrent networks called the AntisymmetricRNN is proposed under this theoretical f… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: Published as a conference paper at ICLR 2019

  12. arXiv:1902.08588  [pdf, other

    cs.LG cs.IR stat.ML

    Towards Neural Mixture Recommender for Long Range Dependent User Sequences

    Authors: Jiaxi Tang, Francois Belletti, Sagar Jain, Minmin Chen, Alex Beutel, Can Xu, Ed H. Chi

    Abstract: Understanding temporal dynamics has proved to be highly valuable for accurate recommendation. Sequential recommenders have been successful in modeling the dynamics of users and items over time. However, while different model architectures excel at capturing various temporal ranges or dynamics, distinct application contexts require adapting to diverse behaviors. In this paper we examine how to buil… ▽ More

    Submitted 22 February, 2019; originally announced February 2019.

    Comments: Accepted at WWW 2019

  13. arXiv:1901.08987  [pdf, other

    cs.LG stat.ML

    Dynamical Isometry and a Mean Field Theory of LSTMs and GRUs

    Authors: Dar Gilboa, Bo Chang, Minmin Chen, Greg Yang, Samuel S. Schoenholz, Ed H. Chi, Jeffrey Pennington

    Abstract: Training recurrent neural networks (RNNs) on long sequence tasks is plagued with difficulties arising from the exponential explosion or vanishing of signals as they propagate forward or backward through the network. Many techniques have been proposed to ameliorate these issues, including various algorithmic and architectural modifications. Two of the most successful RNN architectures, the LSTM and… ▽ More

    Submitted 23 May, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

  14. arXiv:1901.04562  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Putting Fairness Principles into Practice: Challenges, Metrics, and Improvements

    Authors: Alex Beutel, Jilin Chen, Tulsee Doshi, Hai Qian, Allison Woodruff, Christine Luu, Pierre Kreitmann, Jonathan Bischof, Ed H. Chi

    Abstract: As more researchers have become aware of and passionate about algorithmic fairness, there has been an explosion in papers laying out new metrics, suggesting algorithms to address issues, and calling attention to issues in existing applications of machine learning. This research has greatly expanded our understanding of the concerns and challenges in deploying machine learning, but there has been m… ▽ More

    Submitted 14 January, 2019; originally announced January 2019.

  15. arXiv:1809.10610  [pdf, other

    cs.LG stat.ML

    Counterfactual Fairness in Text Classification through Robustness

    Authors: Sahaj Garg, Vincent Perot, Nicole Limtiaco, Ankur Taly, Ed H. Chi, Alex Beutel

    Abstract: In this paper, we study counterfactual fairness in text classification, which asks the question: How would the prediction change if the sensitive attribute referenced in the example were different? Toxicity classifiers demonstrate a counterfactual fairness issue by predicting that "Some people are gay" is toxic while "Some people are straight" is nontoxic. We offer a metric, counterfactual token f… ▽ More

    Submitted 13 February, 2019; v1 submitted 27 September, 2018; originally announced September 2018.

  16. arXiv:1712.01208  [pdf, other

    cs.DB cs.DS cs.NE

    The Case for Learned Index Structures

    Authors: Tim Kraska, Alex Beutel, Ed H. Chi, Jeffrey Dean, Neoklis Polyzotis

    Abstract: Indexes are models: a B-Tree-Index can be seen as a model to map a key to the position of a record within a sorted array, a Hash-Index as a model to map a key to a position of a record within an unsorted array, and a BitMap-Index as a model to indicate if a data record exists or not. In this exploratory research paper, we start from this premise and posit that all existing index structures can be… ▽ More

    Submitted 30 April, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

  17. arXiv:1707.00075  [pdf, other

    cs.LG cs.CY

    Data Decisions and Theoretical Implications when Adversarially Learning Fair Representations

    Authors: Alex Beutel, Jilin Chen, Zhe Zhao, Ed H. Chi

    Abstract: How can we learn a classifier that is "fair" for a protected or sensitive group, when we do not know if the input to the classifier belongs to the protected group? How can we train such a classifier when data on the protected group is difficult to attain? In many settings, finding out the sensitive input attribute can be prohibitively expensive even during model training, and sometimes impossible… ▽ More

    Submitted 6 July, 2017; v1 submitted 30 June, 2017; originally announced July 2017.

    Comments: Presented as a poster at the 2017 Workshop on Fairness, Accountability, and Transparency in Machine Learning (FAT/ML 2017)

  18. arXiv:1204.3724  [pdf

    cs.SI cs.HC

    Who is Authoritative? Understanding Reputation Mechanisms in Quora

    Authors: Sharoda A. Paul, Lichan Hong, Ed H. Chi

    Abstract: As social Q&A sites gain popularity, it is important to understand how users judge the authoritativeness of users and content, build reputation, and identify and promote high quality content. We conducted a study of emerging social Q&A site Quora. First, we describe user activity on Quora by analyzing data across 60 question topics and 3917 users. Then we provide a rich understanding of issues of… ▽ More

    Submitted 17 April, 2012; originally announced April 2012.

    Comments: Presented at Collective Intelligence conference, 2012 (arXiv:1204.2991)

    Report number: CollectiveIntelligence/2012/36

  19. arXiv:0908.0595  [pdf, other

    cs.IR cs.HC

    Towards a Model of Understanding Social Search

    Authors: Brynn M. Evans, Ed H. Chi

    Abstract: Search engine researchers typically depict search as the solitary activity of an individual searcher. In contrast, results from our critical-incident survey of 150 users on Amazon's Mechanical Turk service suggest that social interactions play an important role throughout the search process. Our main contribution is that we have integrated models from previous work in sensemaking and information… ▽ More

    Submitted 5 August, 2009; originally announced August 2009.

    Comments: Presented at 1st Intl Workshop on Collaborative Information Seeking, 2008 (arXiv:0908.0583)

    Report number: JCDL2008CIRWS/2008/evachi ACM Class: H.3.3; H.5.2; H.5.3