Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Hula, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.15418  [pdf, other

    cs.LG

    Understanding GNNs for Boolean Satisfiability through Approximation Algorithms

    Authors: Jan Hůla, David Mojžíšek, Mikoláš Janota

    Abstract: The paper deals with the interpretability of Graph Neural Networks in the context of Boolean Satisfiability. The goal is to demystify the internal workings of these models and provide insightful perspectives into their decision-making processes. This is done by uncovering connections to two approximation algorithms studied in the domain of Boolean Satisfiability: Belief Propagation and Semidefinit… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: CIKM 2024

  2. arXiv:2402.07757  [pdf, other

    cs.LG cs.AI

    Towards an Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation Model

    Authors: Mikail Khona, Maya Okawa, Jan Hula, Rahul Ramesh, Kento Nishi, Robert Dick, Ekdeep Singh Lubana, Hidenori Tanaka

    Abstract: Stepwise inference protocols, such as scratchpads and chain-of-thought, help language models solve complex problems by decomposing them into a sequence of simpler subproblems. Despite the significant gain in performance achieved via these protocols, the underlying mechanisms of stepwise inference have remained elusive. To address this, we propose to study autoregressive Transformer models on a syn… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  3. arXiv:2005.13243  [pdf, other

    cs.CV cs.LG eess.IV

    Poly-YOLO: higher speed, more precise detection and instance segmentation for YOLOv3

    Authors: Petr Hurtik, Vojtech Molek, Jan Hula, Marek Vajgl, Pavel Vlasanek, Tomas Nejezchleba

    Abstract: We present a new version of YOLO with better performance and extended with instance segmentation called Poly-YOLO. Poly-YOLO builds on the original ideas of YOLOv3 and removes two of its weaknesses: a large amount of rewritten labels and inefficient distribution of anchors. Poly-YOLO reduces the issues by aggregating features from a light SE-Darknet-53 backbone with a hypercolumn technique, using… ▽ More

    Submitted 29 May, 2020; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: 18 pages, 15 figures, submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence (under review), Source code is available at https://gitlab.com/irafm-ai/poly-yolo

    MSC Class: 68T45 ACM Class: I.2.10

  4. arXiv:1905.06782  [pdf, ps, other

    cs.SE cs.LG cs.SI stat.ML

    Identifying collaborators in large codebases

    Authors: Waren Long, Vadim Markovtsev, Hugo Mougard, Egor Bulychev, Jan Hula

    Abstract: The way developers collaborate inside and particularly across teams often escapes management's attention, despite a formal organization with designated teams being defined. Observability of the actual, organically formed engineering structure provides decision makers invaluable additional tools to manage their talent pool. To identify existing inter and intra-team interactions - and suggest releva… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: 4 pages; Workshop on Machine Learning for Software Engineering 2019

  5. arXiv:1812.10860  [pdf, other

    cs.CL

    Can You Tell Me How to Get Past Sesame Street? Sentence-Level Pretraining Beyond Language Modeling

    Authors: Alex Wang, Jan Hula, Patrick Xia, Raghavendra Pappagari, R. Thomas McCoy, Roma Patel, Najoung Kim, Ian Tenney, Yinghui Huang, Katherin Yu, Shuning Jin, Berlin Chen, Benjamin Van Durme, Edouard Grave, Ellie Pavlick, Samuel R. Bowman

    Abstract: Natural language understanding has recently seen a surge of progress with the use of sentence encoders like ELMo (Peters et al., 2018a) and BERT (Devlin et al., 2019) which are pretrained on variants of language modeling. We conduct the first large-scale systematic study of candidate pretraining tasks, comparing 19 different tasks both as alternatives and complements to language modeling. Our prim… ▽ More

    Submitted 22 July, 2019; v1 submitted 27 December, 2018; originally announced December 2018.

    Comments: ACL 2019. This paper supercedes "Looking for ELMo's Friends: Sentence-Level Pretraining Beyond Language Modeling", an earlier version of this work by the same authors