Zum Hauptinhalt springen

Showing 1–23 of 23 results for author: Ivanov, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18074  [pdf, other

    cs.GT cs.LG cs.MA

    Principal-Agent Reinforcement Learning

    Authors: Dima Ivanov, Paul Dütting, Inbal Talgam-Cohen, Tonghan Wang, David C. Parkes

    Abstract: Contracts are the economic framework which allows a principal to delegate a task to an agent -- despite misaligned interests, and even without directly observing the agent's actions. In many modern reinforcement learning settings, self-interested agents learn to perform a multi-stage task delegated to them by a principal. We explore the significant potential of utilizing contracts to incentivize t… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  2. arXiv:2406.02077  [pdf, other

    eess.IV cs.AI cs.CV

    Multi-target stain normalization for histology slides

    Authors: Desislav Ivanov, Carlo Alberto Barbano, Marco Grangetto

    Abstract: Traditional staining normalization approaches, e.g. Macenko, typically rely on the choice of a single representative reference image, which may not adequately account for the diverse staining patterns of datasets collected in practical scenarios. In this study, we introduce a novel approach that leverages multiple reference images to enhance robustness against stain variation. Our method is parame… ▽ More

    Submitted 10 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    MSC Class: 68U10 ACM Class: I.4.0

  3. arXiv:2405.07748  [pdf, other

    cs.LG

    Neural Network Compression for Reinforcement Learning Tasks

    Authors: Dmitry A. Ivanov, Denis A. Larionov, Oleg V. Maslennikov, Vladimir V. Voevodin

    Abstract: In real applications of Reinforcement Learning (RL), such as robotics, low latency and energy efficient inference is very desired. The use of sparsity and pruning for optimizing Neural Network inference, and particularly to improve energy and latency efficiency, is a standard technique. In this work, we perform a systematic investigation of applying these optimization techniques for different RL a… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 14 pages, 6 figures

  4. arXiv:2404.14300  [pdf, ps, other

    cs.DM

    Linear Search for an Escaping Target with Unknown Speed

    Authors: Jared Coleman, Dmitry Ivanov, Evangelos Kranakis, Danny Krizanc, Oscar Morales-Ponce

    Abstract: We consider linear search for an escaping target whose speed and initial position are unknown to the searcher. A searcher (an autonomous mobile agent) is initially placed at the origin of the real line and can move with maximum speed $1$ in either direction along the line. An oblivious mobile target that is moving away from the origin with an unknown constant speed $v<1$ is initially placed by an… ▽ More

    Submitted 23 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  5. arXiv:2403.03751  [pdf, other

    cs.SE

    Trigram-Based Persistent IDE Indices with Quick Startup

    Authors: Zakhar Iakovlev, Alexey Chulkov, Nikita Golikov, Vyacheslav Lukianov, Nikita Zinoviev, Dmitry Ivanov, Vitaly Aksenov

    Abstract: One common way to speed up the find operation within a set of text files involves a trigram index. This structure is merely a map from a trigram (sequence consisting of three characters) to a set of files which contain it. When searching for a pattern, potential file locations are identified by intersecting the sets related to the trigrams in the pattern. Then, the search proceeds only in these fi… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  6. arXiv:2401.06514  [pdf, other

    cs.LG

    Personalized Reinforcement Learning with a Budget of Policies

    Authors: Dmitry Ivanov, Omer Ben-Porat

    Abstract: Personalization in machine learning (ML) tailors models' decisions to the individual characteristics of users. While this approach has seen success in areas like recommender systems, its expansion into high-stakes fields such as healthcare and autonomous driving is hindered by the extensive regulatory approval processes involved. To address this challenge, we propose a novel framework termed repre… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI 2024. Code: https://github.com/dimonenka/RL_policy_budget

  7. arXiv:2311.07126  [pdf, other

    cs.LG

    How to Do Machine Learning with Small Data? -- A Review from an Industrial Perspective

    Authors: Ivan Kraljevski, Yong Chul Ju, Dmitrij Ivanov, Constanze Tschöpe, Matthias Wolff

    Abstract: Artificial intelligence experienced a technological breakthrough in science, industry, and everyday life in the recent few decades. The advancements can be credited to the ever-increasing availability and miniaturization of computational resources that resulted in exponential data growth. However, because of the insufficient amount of data in some cases, employing machine learning in solving compl… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  8. arXiv:2307.02318  [pdf, other

    cs.LG cs.AI

    Deep Contract Design via Discontinuous Networks

    Authors: Tonghan Wang, Paul Dütting, Dmitry Ivanov, Inbal Talgam-Cohen, David C. Parkes

    Abstract: Contract design involves a principal who establishes contractual agreements about payments for outcomes that arise from the actions of an agent. In this paper, we initiate the study of deep learning for the automated design of optimal contracts. We introduce a novel representation: the Discontinuous ReLU (DeLU) network, which models the principal's utility as a discontinuous piecewise affine funct… ▽ More

    Submitted 27 October, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Journal ref: NeurIPS 2023

  9. arXiv:2306.08419  [pdf, other

    cs.MA cs.GT cs.LG

    Mediated Multi-Agent Reinforcement Learning

    Authors: Dmitry Ivanov, Ilya Zisman, Kirill Chernyshev

    Abstract: The majority of Multi-Agent Reinforcement Learning (MARL) literature equates the cooperation of self-interested agents in mixed environments to the problem of social welfare maximization, allowing agents to arbitrarily share rewards and private information. This results in agents that forgo their individual goals in favour of social good, which can potentially be exploited by selfish defectors. We… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Journal ref: AAMAS '23, Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems (May 2023) Pages 49-57

  10. arXiv:2305.10872  [pdf, other

    cs.DC

    Benchmark Framework with Skewed Workloads

    Authors: Vitaly Aksenov, Dmitry Ivanov, Ravil Galiev

    Abstract: In this work, we present a new benchmarking suite with new real-life inspired skewed workloads to test the performance of concurrent index data structures. We started this project to prepare workloads specifically for self-adjusting data structures, i.e., they handle more frequent requests faster, and, thus, should perform better than their standard counterparts. We looked over the commonly used s… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  11. arXiv:2210.11568  [pdf, ps, other

    quant-ph cond-mat.mes-hall cs.CC

    Polynomial computational complexity of matrix elements of finite-rank-generated single-particle operators in products of finite bosonic states

    Authors: Dmitri A. Ivanov

    Abstract: It is known that computing the permanent of the matrix $1+A$, where $A$ is a finite-rank matrix, requires a number of operations polynomial in the matrix size. Motivated by the boson-sampling proposal of restricted quantum computation, I extend this result to a generalization of the matrix permanent: an expectation value in a product of a large number of identical bosonic states with a bounded num… ▽ More

    Submitted 29 May, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 4 pages, introduction and conclusion expanded, minor style corrections

  12. arXiv:2205.13037  [pdf, other

    cs.NE

    Neuromorphic Artificial Intelligence Systems

    Authors: Dmitry Ivanov, Aleksandr Chezhegov, Andrey Grunin, Mikhail Kiselev, Denis Larionov

    Abstract: Modern AI systems, based on von Neumann architecture and classical neural networks, have a number of fundamental limitations in comparison with the brain. This article discusses such limitations and the ways they can be mitigated. Next, it presents an overview of currently available neuromorphic AI projects in which these limitations are overcame by bringing some brain features into the functionin… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  13. arXiv:2203.10905  [pdf, other

    cs.LG

    Self-Imitation Learning from Demonstrations

    Authors: Georgiy Pshikhachev, Dmitry Ivanov, Vladimir Egorov, Aleksei Shpilman

    Abstract: Despite the numerous breakthroughs achieved with Reinforcement Learning (RL), solving environments with sparse rewards remains a challenging task that requires sophisticated exploration. Learning from Demonstrations (LfD) remedies this issue by guiding the agent's exploration towards states experienced by an expert. Naturally, the benefits of this approach hinge on the quality of demonstrations, w… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  14. arXiv:2203.07206  [pdf, other

    cs.LG

    Improving State-of-the-Art in One-Class Classification by Leveraging Unlabeled Data

    Authors: Farid Bagirov, Dmitry Ivanov, Aleksei Shpilman

    Abstract: When dealing with binary classification of data with only one labeled class data scientists employ two main approaches, namely One-Class (OC) classification and Positive Unlabeled (PU) learning. The former only learns from labeled positive data, whereas the latter also utilizes unlabeled data to improve the overall performance. Since PU learning utilizes more data, we might be prone to think that… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  15. arXiv:2202.13110  [pdf, other

    cs.LG cs.GT

    Optimal-er Auctions through Attention

    Authors: Dmitry Ivanov, Iskander Safiulin, Igor Filippov, Ksenia Balabaeva

    Abstract: RegretNet is a recent breakthrough in the automated design of revenue-maximizing auctions. It combines the flexibility of deep learning with the regret-based approach to relax the Incentive Compatibility (IC) constraint (that participants prefer to bid truthfully) in order to approximate optimal auctions. We propose two independent improvements of RegretNet. The first is a neural architecture deno… ▽ More

    Submitted 31 October, 2022; v1 submitted 26 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022

  16. arXiv:2201.02571  [pdf, other

    cs.LG cs.NE

    Neural Network Optimization for Reinforcement Learning Tasks Using Sparse Computations

    Authors: Dmitry Ivanov, Mikhail Kiselev, Denis Larionov

    Abstract: This article proposes a sparse computation-based method for optimizing neural networks for reinforcement learning (RL) tasks. This method combines two ideas: neural network pruning and taking into account input data correlations; it makes it possible to update neuron states only when changes in them exceed a certain threshold. It significantly reduces the number of multiplications when running neu… ▽ More

    Submitted 7 April, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  17. arXiv:2112.08882  [pdf, other

    physics.geo-ph cs.NI

    BitTorrent is Apt for Geophysical Data Collection and Distribution

    Authors: K. I. Kholodkov, I. M. Aleshin, S. D. Ivanov

    Abstract: This article covers a nouveau idea of how to collect and handle geophysical data with a peer-to-peer network in near real-time. The text covers a brief introduction to the cause, the technology, and the particular case of collecting data from GNSS stations. We describe the proof-of-concept implementation that has been tested. The test was conducted with an experimental GNSS station and a data aggr… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 13 pages, 2 figures

    ACM Class: J.2; C.2.2

  18. arXiv:2103.16511  [pdf, other

    cs.AI cs.LG

    Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World

    Authors: Florian Laurent, Manuel Schneider, Christian Scheller, Jeremy Watson, Jiaoyang Li, Zhe Chen, Yi Zheng, Shao-Hung Chan, Konstantin Makhnev, Oleg Svidchenko, Vladimir Egorov, Dmitry Ivanov, Aleksei Shpilman, Evgenija Spirovska, Oliver Tanevski, Aleksandar Nikov, Ramon Grunder, David Galevski, Jakov Mitrovski, Guillaume Sartoretti, Zhiyao Luo, Mehul Damani, Nilabha Bhattacharya, Shivam Agarwal, Adrian Egli , et al. (2 additional authors not shown)

    Abstract: The Flatland competition aimed at finding novel approaches to solve the vehicle re-scheduling problem (VRSP). The VRSP is concerned with scheduling trips in traffic networks and the re-scheduling of vehicles when disruptions occur, for example the breakdown of a vehicle. While solving the VRSP in various settings has been an active area in operations research (OR) for decades, the ever-growing com… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: 28 pages, 8 figures

  19. arXiv:2102.12307  [pdf, other

    cs.LG cs.AI cs.MA

    Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments

    Authors: Dmitry Ivanov, Vladimir Egorov, Aleksei Shpilman

    Abstract: Recent reinforcement learning studies extensively explore the interplay between cooperative and competitive behaviour in mixed environments. Unlike cooperative environments where agents strive towards a common goal, mixed environments are notorious for the conflicts of selfish and social interests. As a consequence, purely rational agents often struggle to achieve and maintain cooperation. A preva… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: Short version of this paper is accepted to AAMAS 2021

  20. arXiv:1904.06069  [pdf, other

    quant-ph cond-mat.mes-hall cs.CC

    Complexity of full counting statistics of free quantum particles in product states

    Authors: Dmitri A. Ivanov, Leonid Gurvits

    Abstract: We study the computational complexity of quantum-mechanical expectation values of single-particle operators in bosonic and fermionic multi-particle product states. Such expectation values appear, in particular, in full-counting-statistics problems. Depending on the initial multi-particle product state, the expectation values may be either easy to compute (the required number of operations scales p… ▽ More

    Submitted 21 February, 2020; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: 8 pages, published version

    Journal ref: Phys. Rev. A 101, 012303 (2020)

  21. arXiv:1902.06965  [pdf, other

    cs.LG stat.ML

    DEDPUL: Difference-of-Estimated-Densities-based Positive-Unlabeled Learning

    Authors: Dmitry Ivanov

    Abstract: Positive-Unlabeled (PU) learning is an analog to supervised binary classification for the case when only the positive sample is clean, while the negative sample is contaminated with latent instances of positive class and hence can be considered as an unlabeled mixture. The objectives are to classify the unlabeled sample and train an unbiased PN classifier, which generally requires to identify the… ▽ More

    Submitted 7 June, 2020; v1 submitted 19 February, 2019; originally announced February 2019.

    Comments: Implementation of DEDPUL and experimental data are available at https://github.com/dimonenka/DEDPUL

  22. arXiv:1603.02724  [pdf, other

    cs.CC cond-mat.other quant-ph

    Computational complexity of exterior products and multi-particle amplitudes of non-interacting fermions in entangled states

    Authors: Dmitri A. Ivanov

    Abstract: Noninteracting bosons were proposed to be used for a demonstration of quantum-computing supremacy in a boson-sampling setup. A similar demonstration with fermions would require that the fermions are initially prepared in an entangled state. I suggest that pairwise entanglement of fermions would be sufficient for this purpose. Namely, it is shown that computing multi-particle scattering amplitudes… ▽ More

    Submitted 6 August, 2017; v1 submitted 8 March, 2016; originally announced March 2016.

    Comments: 5 pages, version accepted for publication

    Journal ref: Phys. Rev. A 96, 012322 (2017)

  23. arXiv:1008.0063  [pdf

    cs.NE

    Evolutionary Approach to Test Generation for Functional BIST

    Authors: Y. A. Skobtsov, D. E. Ivanov, V. Y. Skobtsov, R. Ubar, J. Raik

    Abstract: In the paper, an evolutionary approach to test generation for functional BIST is considered. The aim of the proposed scheme is to minimize the test data volume by allowing the device's microprogram to test its logic, providing an observation structure to the system, and generating appropriate test data for the given architecture. Two methods of deriving a deterministic test set at functional level… ▽ More

    Submitted 31 July, 2010; originally announced August 2010.

    Comments: 10 European Test Symposium. Informal Digest of Papers