Zum Hauptinhalt springen

Showing 1–27 of 27 results for author: Franz, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.07770  [pdf, other

    quant-ph cs.DB cs.LG

    Hype or Heuristic? Quantum Reinforcement Learning for Join Order Optimisation

    Authors: Maja Franz, Tobias Winker, Sven Groppe, Wolfgang Mauerer

    Abstract: Identifying optimal join orders (JOs) stands out as a key challenge in database research and engineering. Owing to the large search space, established classical methods rely on approximations and heuristics. Recent efforts have successfully explored reinforcement learning (RL) for JO. Likewise, quantum versions of RL have received considerable scientific attention. Yet, it is an open question if t… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2401.09450  [pdf, other

    cs.CY cs.AI cs.CV cs.HC

    Joining Forces for Pathology Diagnostics with AI Assistance: The EMPAIA Initiative

    Authors: Norman Zerbe, Lars Ole Schwen, Christian Geißler, Katja Wiesemann, Tom Bisson, Peter Boor, Rita Carvalho, Michael Franz, Christoph Jansen, Tim-Rasmus Kiehl, Björn Lindequist, Nora Charlotte Pohlan, Sarah Schmell, Klaus Strohmenger, Falk Zakrzewski, Markus Plass, Michael Takla, Tobias Küster, André Homeyer, Peter Hufnagl

    Abstract: Over the past decade, artificial intelligence (AI) methods in pathology have advanced substantially. However, integration into routine clinical practice has been slow due to numerous challenges, including technical and regulatory hurdles in translating research results into clinical diagnostic products and the lack of standardized interfaces. The open and vendor-neutral EMPAIA initiative addresses… ▽ More

    Submitted 16 April, 2024; v1 submitted 22 December, 2023; originally announced January 2024.

    Journal ref: Journal of Pathology Informatics 2024

  3. arXiv:2306.06207  [pdf, other

    physics.med-ph cs.CV

    Towards clinical translation of deep-learning based classification of DSA image sequences for stroke treatment

    Authors: Timo Baumgärtner, Benjamin J. Mittmann, Till Malzacher, Johannes Roßkopf, Michael Braun, Bernd Schmitz, Alfred M. Franz

    Abstract: In the event of stroke, a catheter-guided procedure (thrombectomy) is used to remove blood clots. Feasibility of machine learning based automatic classifications for thrombus detection on digital substraction angiography (DSA) sequences has been demonstrated. It was however not used live in the clinic, yet. We present an open-source tool for automatic thrombus classification and test it on three s… ▽ More

    Submitted 23 May, 2023; originally announced June 2023.

    Comments: This is the preprint version of the BVM paper accepted for publication in the conference proceedings of "Bildverarbeitung in der Medizin 2023"

  4. arXiv:2303.00807  [pdf, other

    cs.IR cs.CL

    UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

    Authors: Jon Saad-Falcon, Omar Khattab, Keshav Santhanam, Radu Florian, Martin Franz, Salim Roukos, Avirup Sil, Md Arafat Sultan, Christopher Potts

    Abstract: Many information retrieval tasks require large labeled datasets for fine-tuning. However, such datasets are often unavailable, and their utility for real-world applications can diminish quickly due to domain shifts. To address this challenge, we develop and motivate a method for using large language models (LLMs) to generate large numbers of synthetic queries cheaply. The method begins by generati… ▽ More

    Submitted 13 October, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Long Paper at Empirical Methods in Natural Language Processing (EMNLP) 2023

  5. arXiv:2301.09715  [pdf, other

    cs.CL cs.IR cs.LG

    PrimeQA: The Prime Repository for State-of-the-Art Multilingual Question Answering Research and Development

    Authors: Avirup Sil, Jaydeep Sen, Bhavani Iyer, Martin Franz, Kshitij Fadnis, Mihaela Bornea, Sara Rosenthal, Scott McCarley, Rong Zhang, Vishwajeet Kumar, Yulong Li, Md Arafat Sultan, Riyaz Bhat, Radu Florian, Salim Roukos

    Abstract: The field of Question Answering (QA) has made remarkable progress in recent years, thanks to the advent of large pre-trained language models, newer realistic benchmark datasets with leaderboards, and novel algorithms for key components such as retrievers and readers. In this paper, we introduce PRIMEQA: a one-stop and open-source QA repository with an aim to democratize QA re-search and facilitate… ▽ More

    Submitted 25 January, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

  6. arXiv:2212.01340  [pdf, other

    cs.IR cs.CL

    Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

    Authors: Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts

    Abstract: Neural information retrieval (IR) systems have progressed rapidly in recent years, in large part due to the release of publicly available benchmarking tasks. Unfortunately, some dimensions of this progress are illusory: the majority of the popular IR benchmarks today focus exclusively on downstream task accuracy and thus conceal the costs incurred by systems that trade away efficiency for quality.… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  7. Anonymization of Whole Slide Images in Histopathology for Research and Education

    Authors: Tom Bisson, Michael Franz, Isil Dogan O, Daniel Romberg, Christoph Jansen, Peter Hufnagl, Norman Zerbe

    Abstract: Objective: The exchange of health-related data is subject to regional laws and regulations, such as the General Data Protection Regulation (GDPR) in the EU or the Health Insurance Portability and Accountability Act (HIPAA) in the United States, resulting in non-trivial challenges for researchers and educators when working with these data. In pathology, the digitization of diagnostic tissue samples… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  8. arXiv:2209.02638  [pdf, other

    cs.PL cs.SE

    DFI: An Interprocedural Value-Flow Analysis Framework that Scales to Large Codebases

    Authors: Min-Yih Hsu, Felicitas Hetzelt, Michael Franz

    Abstract: Context- and flow-sensitive value-flow information is an important building block for many static analysis tools. Unfortunately, current approaches to compute value-flows do not scale to large codebases, due to high memory and runtime requirements. This paper proposes a new scalable approach to compute value-flows via graph reachability. To this end, we develop a new graph structure as an extensio… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  9. Entity-Conditioned Question Generation for Robust Attention Distribution in Neural Information Retrieval

    Authors: Revanth Gangi Reddy, Md Arafat Sultan, Martin Franz, Avirup Sil, Heng Ji

    Abstract: We show that supervised neural information retrieval (IR) models are prone to learning sparse attention patterns over passage tokens, which can result in key phrases including named entities receiving low attention weights, eventually leading to model under-performance. Using a novel targeted synthetic data generation method that identifies poorly attended entities and conditions the generation ep… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: Published at SIGIR 2022

  10. arXiv:2203.06289  [pdf, other

    quant-ph cs.SE

    Peel $\mid$ Pile? Cross-Framework Portability of Quantum Software

    Authors: Manuel Schönberger, Maja Franz, Stefanie Scherzinger, Wolfgang Mauerer

    Abstract: In recent years, various vendors have made quantum software frameworks available. Yet with vendor-specific frameworks, code portability seems at risk, especially in a field where hardware and software libraries have not yet reached a consolidated state, and even foundational aspects of the technologies are still in flux. Accordingly, the development of vendor-independent quantum programming langua… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Journal ref: QSA@ICSA2022

  11. Uncovering Instabilities in Variational-Quantum Deep Q-Networks

    Authors: Maja Franz, Lucas Wolf, Maniraman Periyasamy, Christian Ufrecht, Daniel D. Scherer, Axel Plinge, Christopher Mutschler, Wolfgang Mauerer

    Abstract: Deep Reinforcement Learning (RL) has considerably advanced over the past decade. At the same time, state-of-the-art RL algorithms require a large computational budget in terms of training time to converge. Recent work has started to approach this problem through the lens of quantum computing, which promises theoretical speed-ups for several traditionally hard tasks. In this work, we examine a clas… ▽ More

    Submitted 16 September, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: Authors Maja Franz, Lucas Wolf, Maniraman Periyasamy contributed equally (name order randomised). To be published in the Journal of The Franklin Institute

  12. arXiv:2201.04804  [pdf, other

    cs.PL

    A Highly Scalable, Hybrid, Cross-Platform Timing Analysis Framework Providing Accurate Differential Throughput Estimation via Instruction-Level Tracing

    Authors: Min-Yih Hsu, Felicitas Hetzelt, David Gens, Michael Maitland, Michael Franz

    Abstract: Estimating instruction-level throughput is critical for many applications: multimedia, low-latency networking, medical, automotive, avionic, and industrial control systems all rely on tightly calculable and accurate timing bounds of their software. Unfortunately, how long a program may run - or if it may indeed stop at all - cannot be answered in the general case. This is why state-of-the-art thro… ▽ More

    Submitted 16 May, 2023; v1 submitted 13 January, 2022; originally announced January 2022.

  13. arXiv:2112.08185  [pdf, other

    cs.CL cs.AI

    Learning Cross-Lingual IR from an English Retriever

    Authors: Yulong Li, Martin Franz, Md Arafat Sultan, Bhavani Iyer, Young-Suk Lee, Avirup Sil

    Abstract: We present DR.DECR (Dense Retrieval with Distillation-Enhanced Cross-Lingual Representation), a new cross-lingual information retrieval (CLIR) system trained using multi-stage knowledge distillation (KD). The teacher of DR.DECR relies on a highly effective but computationally expensive two-stage inference process consisting of query translation and monolingual IR, while the student, DR.DECR, execu… ▽ More

    Submitted 31 July, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Presented at NAACL 2022 main conference Code can be found at: https://github.com/primeqa/primeqa

  14. arXiv:2104.07800  [pdf, other

    cs.CL cs.AI cs.IR

    Towards Robust Neural Retrieval Models with Synthetic Pre-Training

    Authors: Revanth Gangi Reddy, Vikas Yadav, Md Arafat Sultan, Martin Franz, Vittorio Castelli, Heng Ji, Avirup Sil

    Abstract: Recent work has shown that commonly available machine reading comprehension (MRC) datasets can be used to train high-performance neural information retrieval (IR) systems. However, the evaluation of neural IR has so far been limited to standard supervised learning settings, where they have outperformed traditional term matching baselines. We conduct in-domain and out-of-domain evaluations of neura… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  15. arXiv:2011.02091  [pdf, other

    cs.CR

    dMVX: Secure and Efficient Multi-Variant Execution in a Distributed Setting

    Authors: Alexios Voulimeneas, Dokyung Song, Per Larsen, Michael Franz, Stijn Volckaert

    Abstract: Multi-variant execution (MVX) systems amplify the effectiveness of software diversity techniques. The key idea is to run multiple diversified program variants in lockstep while providing them with the same input and monitoring their run-time behavior for divergences. Thus, adversaries have to compromise all program variants simultaneously to mount an attack successfully. Recent work proposed distr… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

  16. arXiv:1912.04870  [pdf

    cs.CR

    V0LTpwn: Attacking x86 Processor Integrity from Software

    Authors: Zijo Kenjar, Tommaso Frassetto, David Gens, Michael Franz, Ahmad-Reza Sadeghi

    Abstract: Fault-injection attacks have been proven in the past to be a reliable way of bypassing hardware-based security measures, such as cryptographic hashes, privilege and access permission enforcement, and trusted execution environments. However, traditional fault-injection attacks require physical presence, and hence, were often considered out of scope in many real-world adversary settings. In this p… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

  17. arXiv:1911.02984  [pdf, other

    cs.CL cs.IR

    The TechQA Dataset

    Authors: Vittorio Castelli, Rishav Chakravarti, Saswati Dana, Anthony Ferritto, Radu Florian, Martin Franz, Dinesh Garg, Dinesh Khandelwal, Scott McCarley, Mike McCawley, Mohamed Nasr, Lin Pan, Cezar Pendus, John Pitrelli, Saurabh Pujar, Salim Roukos, Andrzej Sakrajda, Avirup Sil, Rosario Uceda-Sosa, Todd Ward, Rong Zhang

    Abstract: We introduce TechQA, a domain-adaptation question answering dataset for the technical support domain. The TechQA corpus highlights two real-world issues from the automated customer support domain. First, it contains actual questions posed by users on a technical forum, rather than questions generated specifically for a competition or a task. Second, it has a real-world size -- 600 training, 310 de… ▽ More

    Submitted 7 November, 2019; originally announced November 2019.

    Comments: Long version of conference paper to be submitted

  18. arXiv:1903.03643  [pdf, other

    cs.CR

    DMON: A Distributed Heterogeneous N-Variant System

    Authors: Alexios Voulimeneas, Dokyung Song, Fabian Parzefall, Yeoul Na, Per Larsen, Michael Franz, Stijn Volckaert

    Abstract: N-Variant Execution (NVX) systems utilize software diversity techniques for enhancing software security. The general idea is to run multiple different variants of the same program alongside each other while monitoring their run-time behavior. If the internal disparity between the running variants causes observable differences in response to malicious inputs, the monitor can detect such divergences… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

  19. arXiv:1807.08304  [pdf, other

    cs.CG cs.GR

    Deep Learning Parametrization for B-Spline Curve Approximation

    Authors: Pascal Laube, Matthias O. Franz, Georg Umlauf

    Abstract: In this paper we present a method using deep learning to compute parametrizations for B-spline curve approximation. Existing methods consider the computation of parametric values and a knot vector as separate problems. We propose to train interdependent deep neural networks to predict parametric values and knots. We show that it is possible to include B-spline curve approximation directly into the… ▽ More

    Submitted 22 July, 2018; originally announced July 2018.

    Comments: Accepted at 3DV 2018

  20. arXiv:1806.04355  [pdf, other

    cs.CR cs.PL

    SoK: Sanitizing for Security

    Authors: Dokyung Song, Julian Lettner, Prabhu Rajasekaran, Yeoul Na, Stijn Volckaert, Per Larsen, Michael Franz

    Abstract: The C and C++ programming languages are notoriously insecure yet remain indispensable. Developers therefore resort to a multi-pronged approach to find security issues before adversaries. These include manual, static, and dynamic program analysis. Dynamic bug finding tools --- henceforth "sanitizers" --- can find bugs that elude other types of analysis because they observe the actual execution of a… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

  21. arXiv:1712.03111  [pdf, other

    cs.CV

    Image Inpainting for High-Resolution Textures using CNN Texture Synthesis

    Authors: Pascal Laube, Michael Grunwald, Matthias O. Franz, Georg Umlauf

    Abstract: Deep neural networks have been successfully applied to problems such as image segmentation, image super-resolution, coloration and image inpainting. In this work we propose the use of convolutional neural networks (CNN) for image inpainting of large regions in high-resolution textures. Due to limited computational resources processing high-resolution images with neural networks is still an open pr… ▽ More

    Submitted 12 February, 2018; v1 submitted 8 December, 2017; originally announced December 2017.

    Comments: Submitted

  22. arXiv:1711.08108  [pdf, other

    cs.CR

    PartiSan: Fast and Flexible Sanitization via Run-time Partitioning

    Authors: Julian Lettner, Dokyung Song, Taemin Park, Stijn Volckaert, Per Larsen, Michael Franz

    Abstract: Sanitizers can detect security vulnerabilities in C/C++ code that elude static analysis. Current practice is to continuously fuzz and sanitize internal pre-release builds. Sanitization-enabled builds are rarely released publicly. This is in large part due to the high memory and processing requirements of sanitizers. We present PartiSan, a run-time partitioning technique that speeds up sanitizers… ▽ More

    Submitted 14 May, 2018; v1 submitted 21 November, 2017; originally announced November 2017.

  23. Adaptive mixed norm optical flow estimation

    Authors: Vania V. Estrela, Matthias O. Franz, Ricardo T. Lopes, G. P. De Araujo

    Abstract: The pel-recursive computation of 2-D optical flow has been extensively studied in computer vision to estimate motion from image sequences, but it still raises a wealth of issues, such as the treatment of outliers, motion discontinuities and occlusion. It relies on spatio-temporal brightness variations due to motion. Our proposed adaptive regularized approach deals with these issues within a common… ▽ More

    Submitted 3 November, 2016; originally announced November 2016.

    Comments: 8 pages, 4 figures. arXiv admin note: text overlap with arXiv:1403.7365

    Journal ref: Proc. SPIE 5960, Visual Communications and Image Processing 2005, 59603W, July 31, 2006, Beijing, China

  24. Control-Flow Integrity: Precision, Security, and Performance

    Authors: Nathan Burow, Scott A. Carr, Joseph Nash, Per Larsen, Michael Franz, Stefan Brunthaler, Mathias Payer

    Abstract: Memory corruption errors in C/C++ programs remain the most common source of security vulnerabilities in today's systems. Control-flow hijacking attacks exploit memory corruption vulnerabilities to divert program execution away from the intended control flow. Researchers have spent more than a decade studying and refining defenses based on Control-Flow Integrity (CFI), and this technique is now int… ▽ More

    Submitted 27 January, 2017; v1 submitted 12 February, 2016; originally announced February 2016.

    Comments: Version submitted to ACM CSUR 01/27/17

  25. repAIrC: A Tool for Ensuring Data Consistency by Means of Active Integrity Constraints

    Authors: Luís Cruz-Filipe, Michael Franz, Artavazd Hakhverdyan, Marta Ludovico, Isabel Nunes, Peter Schneider-Kamp

    Abstract: Consistency of knowledge repositories is of prime importance in organization management. Integrity constraints are a well-known vehicle for specifying data consistency requirements in knowledge bases; in particular, active integrity constraints go one step further, allowing the specification of preferred ways to overcome inconsistent situations in the context of database management. This paper des… ▽ More

    Submitted 14 October, 2015; originally announced October 2015.

    Comments: IMADA-preprint-cs

  26. arXiv:1409.7760  [pdf, other

    cs.CR cs.PL cs.SE eess.SY

    Similarity-based matching meets Malware Diversity

    Authors: Mathias Payer, Stephen Crane, Per Larsen, Stefan Brunthaler, Richard Wartell, Michael Franz

    Abstract: Similarity metrics, e.g., signatures as used by anti-virus products, are the dominant technique to detect if a given binary is malware. The underlying assumption of this approach is that all instances of a malware (or even malware family) will be similar to each other. Software diversification is a probabilistic technique that uses code and data randomization and expressiveness in the target ins… ▽ More

    Submitted 27 September, 2014; originally announced September 2014.

  27. arXiv:cs/9903014  [pdf, ps, other

    cs.OS cs.PL

    Perpetual Adaptation of Software to Hardware: An Extensible Architecture for Providing Code Optimization as a Central System Service

    Authors: Thomas Kistler, Michael Franz

    Abstract: We present an open architecture for just-in-time code generation and dynamic code optimization that is flexible, customizable, and extensible. While previous research has primarily investigated functional aspects of such a system, architectural aspects have so far remained unexplored. In this paper, we argue that these properties are important to generate optimal code for a variety of hardware a… ▽ More

    Submitted 22 March, 1999; originally announced March 1999.

    Comments: 22 pages

    Report number: ICS-TR-99-12 ACM Class: D.3.4