Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Lees, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.11176  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    A New Generation of Perspective API: Efficient Multilingual Character-level Transformers

    Authors: Alyssa Lees, Vinh Q. Tran, Yi Tay, Jeffrey Sorensen, Jai Gupta, Donald Metzler, Lucy Vasserman

    Abstract: On the world wide web, toxic content detectors are a crucial line of defense against potentially hateful and offensive messages. As such, building highly effective classifiers that enable a safer internet is an important research area. Moreover, the web is a highly multilingual, cross-cultural community that develops its own lingo over time. As such, it is crucial to develop models that are effect… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  2. arXiv:2109.04912  [pdf, other

    cs.CL cs.AI cs.LG

    ReasonBERT: Pre-trained to Reason with Distant Supervision

    Authors: Xiang Deng, Yu Su, Alyssa Lees, You Wu, Cong Yu, Huan Sun

    Abstract: We present ReasonBert, a pre-training method that augments language models with the ability to reason over long-range relations and multiple, possibly hybrid contexts. Unlike existing pre-training methods that only harvest learning signals from local contexts of naturally occurring texts, we propose a generalized notion of distant supervision to automatically connect multiple pieces of text and ta… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP'2021. Our code and pre-trained models are available at https://github.com/sunlab-osu/ReasonBERT

  3. arXiv:2006.14806  [pdf, other

    cs.IR cs.CL

    TURL: Table Understanding through Representation Learning

    Authors: Xiang Deng, Huan Sun, Alyssa Lees, You Wu, Cong Yu

    Abstract: Relational tables on the Web store a vast amount of knowledge. Owing to the wealth of such tables, there has been tremendous progress on a variety of tasks in the area of table understanding. However, existing work generally relies on heavily-engineered task-specific features and model architectures. In this paper, we present TURL, a novel framework that introduces the pre-training/fine-tuning par… ▽ More

    Submitted 2 December, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: Accepted to VLDB 2021. Extended version with experiments added during revision. Our source code, benchmark, as well as pre-trained models will be available on https://github.com/sunlab-osu/TURL

  4. arXiv:1910.14120  [pdf, other

    cs.LG stat.ML

    What is Fair? Exploring Pareto-Efficiency for Fairness Constrained Classifiers

    Authors: Ananth Balashankar, Alyssa Lees, Chris Welty, Lakshminarayanan Subramanian

    Abstract: The potential for learned models to amplify existing societal biases has been broadly recognized. Fairness-aware classifier constraints, which apply equality metrics of performance across subgroups defined on sensitive attributes such as race and gender, seek to rectify inequity but can yield non-uniform degradation in performance for skewed datasets. In certain domains, imbalanced degradation of… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

  5. arXiv:1910.11452  [pdf, ps, other

    cs.LG cs.CY stat.ML

    Fairness Sample Complexity and the Case for Human Intervention

    Authors: Ananth Balashankar, Alyssa Lees

    Abstract: With the aim of building machine learning systems that incorporate standards of fairness and accountability, we explore explicit subgroup sample complexity bounds. The work is motivated by the observation that classifier predictions for real world datasets often demonstrate drastically different metrics, such as accuracy, when subdivided by specific sensitive variable subgroups. The reasons for th… ▽ More

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: Where is the Human? Bridging the Gap Between AI and HCI, CHI Workshop 2019