Search | arXiv e-print repository

doi 10.1145/3616855.3635736

Scaling Up LLM Reviews for Google Ads Content Moderation

Authors: Wei Qiao, Tushar Dogra, Otilia Stretcu, Yu-Han Lyu, Tiantian Fang, Dongjin Kwon, Chun-Ta Lu, Enming Luo, Yuan Wang, Chih-Chun Chia, Ariel Fuxman, Fangzhou Wang, Ranjay Krishna, Mehmet Tek

Abstract: Large language models (LLMs) are powerful tools for content moderation, but their inference costs and latency make them prohibitive for casual use on large datasets, such as the Google Ads repository. This study proposes a method for scaling up LLM reviews for content moderation in Google Ads. First, we use heuristics to select candidates via filtering and duplicate removal, and create clusters of… ▽ More Large language models (LLMs) are powerful tools for content moderation, but their inference costs and latency make them prohibitive for casual use on large datasets, such as the Google Ads repository. This study proposes a method for scaling up LLM reviews for content moderation in Google Ads. First, we use heuristics to select candidates via filtering and duplicate removal, and create clusters of ads for which we select one representative ad per cluster. We then use LLMs to review only the representative ads. Finally, we propagate the LLM decisions for the representative ads back to their clusters. This method reduces the number of reviews by more than 3 orders of magnitude while achieving a 2x recall compared to a baseline non-LLM model. The success of this approach is a strong function of the representations used in clustering and label propagation; we found that cross-modal similarity representations yield better results than uni-modal representations. △ Less

Submitted 7 February, 2024; originally announced February 2024.

arXiv:2310.06048 [pdf, ps, other]

Regularized Weyl double copy

Authors: Gokhan Alkac, Mehmet Kemal Gumus, Oguzhan Kasikci, Mehmet Ali Olpak, Mustafa Tek

Abstract: We propose a regularization procedure in the sourced Weyl double copy, a spinorial version of the classical double copy, such that it matches much more general results in the Kerr-Schild version. In the regularized Weyl double copy, the anti-de Sitter (AdS) and the Lifshitz black holes, which form the basis of the study of strongly coupled gauge theories at finite temperature through the AdS/CFT c… ▽ More We propose a regularization procedure in the sourced Weyl double copy, a spinorial version of the classical double copy, such that it matches much more general results in the Kerr-Schild version. In the regularized Weyl double copy, the anti-de Sitter (AdS) and the Lifshitz black holes, which form the basis of the study of strongly coupled gauge theories at finite temperature through the AdS/CFT correspondence and its non-relativistic generalization, become treatable. We believe that this might pave the way for finding out a relation between the classical double copy and holography. △ Less

Submitted 10 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

Comments: Version to appear in PRD

arXiv:2308.00620 [pdf, ps, other]

Scaling symmetry, Smarr relation, and the extended first law in lower-dimensional Lovelock gravity

Authors: Gokhan Alkac, Gokcen Deniz Ozen, Hikmet Ozsahin, Gun Suer, Mustafa Tek

Abstract: Recently, it was discovered that lower-dimensional versions of Lovelock gravity exist as scalar-tensor theories that are examples of Horndeski gravity. We study the thermodynamics of the static black hole solutions in these theories up to cubic order through Euclidean methods. Considering solutions with spherical, planar and hyperbolic event horizons ($k=+1, 0, -1$), we show that the universality… ▽ More Recently, it was discovered that lower-dimensional versions of Lovelock gravity exist as scalar-tensor theories that are examples of Horndeski gravity. We study the thermodynamics of the static black hole solutions in these theories up to cubic order through Euclidean methods. Considering solutions with spherical, planar and hyperbolic event horizons ($k=+1, 0, -1$), we show that the universality of the thermodynamics for planar black holes ($k=0$) and the extended 1st law that include the variation of the couplings together with their associated potentials hold also in lower dimensions. We find that in $D=4, 6$ where the 2nd- and the 3rd-order Lovelock Lagrangians are boundary terms respectively, the Smarr relation is modified since the entropy is not a homogenous function in these dimensions. We also present a derivation of the Smarr relation and its modified version based on the global scaling properties of the reduced action that is used to obtain the solutions consistently. Unlike the other hairy black hole solutions that have been analyzed before, despite the terms in the reduced action that break the scaling symmetry, the derivation still follows from a conserved Noether charge. △ Less

Submitted 14 April, 2024; v1 submitted 27 July, 2023; originally announced August 2023.

Comments: Version to appear in NPB

arXiv:2301.12993 [pdf, other]

Benchmarking Robustness to Adversarial Image Obfuscations

Authors: Florian Stimberg, Ayan Chakrabarti, Chun-Ta Lu, Hussein Hazimeh, Otilia Stretcu, Wei Qiao, Yintao Liu, Merve Kaya, Cyrus Rashtchian, Ariel Fuxman, Mehmet Tek, Sven Gowal

Abstract: Automated content filtering and moderation is an important tool that allows online platforms to build striving user communities that facilitate cooperation and prevent abuse. Unfortunately, resourceful actors try to bypass automated filters in a bid to post content that violate platform policies and codes of conduct. To reach this goal, these malicious actors may obfuscate policy violating images… ▽ More Automated content filtering and moderation is an important tool that allows online platforms to build striving user communities that facilitate cooperation and prevent abuse. Unfortunately, resourceful actors try to bypass automated filters in a bid to post content that violate platform policies and codes of conduct. To reach this goal, these malicious actors may obfuscate policy violating images (e.g. overlay harmful images by carefully selected benign images or visual patterns) to prevent machine learning models from reaching the correct decision. In this paper, we invite researchers to tackle this specific issue and present a new image benchmark. This benchmark, based on ImageNet, simulates the type of obfuscations created by malicious actors. It goes beyond ImageNet-$\textrm{C}$ and ImageNet-$\bar{\textrm{C}}$ by proposing general, drastic, adversarial modifications that preserve the original content intent. It aims to tackle a more common adversarial threat than the one considered by $\ell_p$-norm bounded adversaries. We evaluate 33 pretrained models on the benchmark and train models with different augmentations, architectures and training methods on subsets of the obfuscations to measure generalization. We hope this benchmark will encourage researchers to test their models and methods and try to find new approaches that are more robust to these obfuscations. △ Less

Submitted 29 November, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

ACM Class: I.2.10; I.4.0

arXiv:2103.06986 [pdf, ps, other]

doi 10.1007/JHEP05(2021)214

The Kerr-Schild Double Copy in Lifshitz Spacetime

Authors: Gokhan Alkac, Mehmet Kemal Gumus, Mustafa Tek

Abstract: The Kerr-Schild double copy is a map between exact solutions of general relativity and Maxwell's theory, where the nonlinear nature of general relativity is circumvented by considering solutions in the Kerr-Schild form. In this paper, we give a general formulation, where no simplifying assumption about the background metric is made, and show that the gauge theory source is affected by a curvature… ▽ More The Kerr-Schild double copy is a map between exact solutions of general relativity and Maxwell's theory, where the nonlinear nature of general relativity is circumvented by considering solutions in the Kerr-Schild form. In this paper, we give a general formulation, where no simplifying assumption about the background metric is made, and show that the gauge theory source is affected by a curvature term that characterizes the deviation of the background spacetime from a constant curvature spacetime. We demonstrate this effect explicitly by studying gravitational solutions with non-zero cosmological constant. We show that, when the background is flat, the constant charge density filling all space in the gauge theory that has been observed in previous works is a consequence of this curvature term. As an example of a solution with a curved background, we study the Lifshitz black hole with two different matter couplings. The curvature of the background, i.e., the Lifshitz spacetime, again yields a constant charge density; however, unlike the previous examples, it is canceled by the contribution from the matter fields. For one of the matter couplings, there remains no additional non-localized source term, providing an example for a non-vacuum gravity solution corresponding to a vacuum gauge theory solution in arbitrary dimensions. △ Less

Submitted 7 June, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

Comments: 20 pages + references, a reference regarding the KS form of the Lifshitz black holes added

arXiv:2005.08399 [pdf, other]

T-VSE: Transformer-Based Visual Semantic Embedding

Authors: Muhammet Bastan, Arnau Ramisa, Mehmet Tek

Abstract: Transformer models have recently achieved impressive performance on NLP tasks, owing to new algorithms for self-supervised pre-training on very large text corpora. In contrast, recent literature suggests that simple average word models outperform more complicated language models, e.g., RNNs and Transformers, on cross-modal image/text search tasks on standard benchmarks, like MS COCO. In this paper… ▽ More Transformer models have recently achieved impressive performance on NLP tasks, owing to new algorithms for self-supervised pre-training on very large text corpora. In contrast, recent literature suggests that simple average word models outperform more complicated language models, e.g., RNNs and Transformers, on cross-modal image/text search tasks on standard benchmarks, like MS COCO. In this paper, we show that dataset scale and training strategy are critical and demonstrate that transformer-based cross-modal embeddings outperform word average and RNN-based embeddings by a large margin, when trained on a large dataset of e-commerce product image-title pairs. △ Less

Submitted 17 May, 2020; originally announced May 2020.

Comments: To appear: CVPR 2020 Workshop on Computer Vision for Fashion, Art and Design (CVFAD 2020)

arXiv:1911.07440 [pdf, other]

Large Scale Open-Set Deep Logo Detection

Authors: Muhammet Bastan, Hao-Yu Wu, Tian Cao, Bhargava Kota, Mehmet Tek

Abstract: We present an open-set logo detection (OSLD) system, which can detect (localize and recognize) any number of unseen logo classes without re-training; it only requires a small set of canonical logo images for each logo class. We achieve this using a two-stage approach: (1) Generic logo detection to detect candidate logo regions in an image. (2) Logo matching for matching the detected logo regions t… ▽ More We present an open-set logo detection (OSLD) system, which can detect (localize and recognize) any number of unseen logo classes without re-training; it only requires a small set of canonical logo images for each logo class. We achieve this using a two-stage approach: (1) Generic logo detection to detect candidate logo regions in an image. (2) Logo matching for matching the detected logo regions to a set of canonical logo images to recognize them. We constructed an open-set logo detection dataset with 12.1k logo classes and released it for research purposes.We demonstrate the effectiveness of OSLD on our dataset and on the standard Flickr-32 logo dataset, outperforming the state-of-the-art open-set and closed-set logo detection methods by a large margin. OSLD is scalable to millions of logo classes. △ Less

Submitted 12 March, 2022; v1 submitted 18 November, 2019; originally announced November 2019.

Comments: Open Set Logo Detection (OSLD) dataset available at https://github.com/mubastan/osld

arXiv:1810.03504 [pdf, ps, other]

doi 10.1103/PhysRevD.98.104021

Bachian Gravity in Three Dimensions

Authors: Gokhan Alkac, Mustafa Tek, Bayram Tekin

Abstract: In three dimensions, there exist modifications of Einstein's gravity akin to the topologically massive gravity that describe massive gravitons about maximally symmetric backgrounds. These theories are built on the three-dimensional version of the Bach tensor (a curl of the Cotton-York tensor) and its higher derivative generalizations; and they are on-shell consistent without a Lagrangian descripti… ▽ More In three dimensions, there exist modifications of Einstein's gravity akin to the topologically massive gravity that describe massive gravitons about maximally symmetric backgrounds. These theories are built on the three-dimensional version of the Bach tensor (a curl of the Cotton-York tensor) and its higher derivative generalizations; and they are on-shell consistent without a Lagrangian description based on the metric tensor alone. We give a generic construction of these models, find the spectra and compute the conserved quantities for the Banados-Teitelboim-Zanelli black hole. △ Less

Submitted 19 November, 2018; v1 submitted 8 October, 2018; originally announced October 2018.

Comments: 17 pages, a note added on MMG

Journal ref: Phys. Rev. D 98, 104021 (2018)

Showing 1–8 of 8 results for author: Tek, M