Showing 1–34 of 34 results for author: Dershowitz, N

  1. arXiv:2407.12026  [pdf, ps, other]

    cs.CL cs.AI

    The Pitfalls of Publishing in the Age of LLMs: Strange and Surprising Adventures with a High-Impact NLP Journal

    Authors: Rakesh M. Verma, Nachum Dershowitz

    Abstract: We show the fraught side of the academic publishing realm and illustrate it through a recent case study with an NLP journal.

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2406.16046  [pdf, ps, other]

    cs.LO

    Drag Rewriting

    Authors: Nachum Dershowitz, Jean-Pierre Jouannaud, Fernando Orejas

    Abstract: We present a new and powerful algebraic framework for graph rewriting, based on drags, a class of graphs enjoying a novel composition operator. Graphs are embellished with roots and sprouts, which can be wired together to form edges. Drags enjoy a rich algebraic structure with sums and products. Drag rewriting naturally extends graph rewriting, dag rewriting, and term rewriting models.

    Submitted 27 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  3. arXiv:2406.15692  [pdf, other]

    cs.CV

    Segmenting Dead Sea Scroll Fragments for a Scientific Image Set

    Authors: Bronson Brown-deVost, Berat Kurar-Barakat, Nachum Dershowitz

    Abstract: This paper presents a customized pipeline for segmenting manuscript fragments from images curated by the Israel Antiquities Authority (IAA). The images present challenges for standard segmentation methods due to the presence of the ruler, color, and plate number bars, as well as a black background that resembles the ink and varying backing substrates. The proposed pipeline, consisting of four step…

    Submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2402.01019  [pdf, other]

    cs.CL cs.CR cs.CY

    Domain-Independent Deception: A New Taxonomy and Linguistic Analysis

    Authors: Rakesh M. Verma, Nachum Dershowitz, Victor Zeng, Dainis Boumber, Xuting Liu

    Abstract: Internet-based economies and societies are drowning in deceptive attacks. These attacks take many forms, such as fake news, phishing, and job scams, which we call "domains of deception." Machine-learning and natural-language-processing researchers have been attempting to ameliorate this precarious situation by designing domain-specific detectors. Only a few recent works have considered domain-in…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 33 pages. arXiv admin note: text overlap with arXiv:2207.01738

  5. arXiv:2305.02170  [pdf, other]

    cs.CL

    A Statistical Exploration of Text Partition Into Constituents: The Case of the Priestly Source in the Books of Genesis and Exodus

    Authors: Gideon Yoffe, Axel Bühler, Nachum Dershowitz, Israel Finkelstein, Eli Piasetzky, Thomas Römer, Barak Sober

    Abstract: We present a pipeline for a statistical textual exploration, offering a stylometry-based explanation and statistical validation of a hypothesized partition of a text. Given a parameterization of the text, our pipeline: (1) detects literary features yielding the optimal overlap between the hypothesized and unsupervised partitions, (2) performs a hypothesis-testing analysis to quantify the statistic…

    Submitted 10 June, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

  6. Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material

    Authors: Shlomo Tannor, Nachum Dershowitz, Moshe Lavee

    Abstract: Midrash collections are complex rabbinic works that consist of text in multiple languages, which evolved through long processes of unstable oral and written transmission. Determining the origin of a given passage in such a compilation is not always straightforward and is often a matter of dispute among scholars, yet it is essential for scholars' understanding of the passage and its relationship to…

    Submitted 24 July, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Journal ref: Journal of Data Mining & Digital Humanities, NLP4DH (August 13, 2023) jdmdh:11375

  7. arXiv:2207.01738  [pdf, other]

    cs.CR cs.CY

    Domain-Independent Deception: Definition, Taxonomy and the Linguistic Cues Debate

    Authors: Rakesh M. Verma, Nachum Dershowitz, Victor Zeng, Xuting Liu

    Abstract: Internet-based economies and societies are drowning in deceptive attacks. These attacks take many forms, such as fake news, phishing, and job scams, which we call "domains of deception." Machine-learning and natural-language-processing researchers have been attempting to ameliorate this precarious situation by designing domain-specific detectors. Only a few recent works have considered domain-inde…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 16 pages, 2 figures

    ACM Class: K.6.5

  8. arXiv:2010.02665  [pdf, other]

    cs.CL

    Metaphor Interpretation Using Word Embeddings

    Authors: Kfir Bar, Nachum Dershowitz, Lena Dankin

    Abstract: We suggest a model for metaphor interpretation using word embeddings trained over a relatively large corpus. Our system handles nominal metaphors, like "time is money". It generates a ranked list of potential interpretations of given metaphors. Candidate meanings are drawn from collocations of the topic ("time") and vehicle ("money") components, automatically extracted from a dependency-parsed cor…

    Submitted 6 December, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Presented at 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing), 2018

  9. arXiv:2006.06516  [pdf, ps, other]

    math.CO cs.DM

    Between Broadway and the Hudson: A Bijection of Corridor Paths

    Authors: Nachum Dershowitz

    Abstract: We present a substantial generalization of the equinumeracy of grand Dyck paths and Dyck-path prefixes, constrained within a band. The number of constrained paths starting at level $i$ and ending in a window of size $2j+2$ is equal to the number starting at level $j$ and ending in a window of size $2i+2$ centered around the same point. A new encoding of lattice paths provides a bijective proof.

    Submitted 6 January, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Journal ref: Journal of Integer Sequences, vol. 24, art. 21.2.8 (2021)

  10. arXiv:2004.11405  [pdf, other]

    cs.CL

    Transliteration of Judeo-Arabic Texts into Arabic Script Using Recurrent Neural Networks

    Authors: Ori Terner, Kfir Bar, Nachum Dershowitz

    Abstract: We trained a model to automatically transliterate Judeo-Arabic texts into Arabic script, enabling Arabic readers to access those writings. We employ a recurrent neural network (RNN), combined with the connectionist temporal classification (CTC) loss to deal with unequal input/output lengths. This obligates adjustments in the training data to avoid input sequences that are shorter than their corres…

    Submitted 21 October, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: accepted for WANLP 2020

  11. arXiv:1911.09960  [pdf, other]

    cs.CV eess.IV

    Computational Ceramicology

    Authors: Barak Itkin, Lior Wolf, Nachum Dershowitz

    Abstract: Field archeologists are called upon to identify potsherds, for which purpose they rely on their experience and on reference works. We have developed two complementary machine-learning tools to propose identifications based on images captured on site. One method relies on the shape of the fracture outline of a sherd; the other is based on decorative features. For the outline-identification tool, a…

    Submitted 22 November, 2019; originally announced November 2019.

  12. arXiv:1909.07899  [pdf, other]

    cs.IR cs.DL

    Fast Search with Poor OCR

    Authors: Taivanbat Badamdorj, Adiel Ben-Shalom, Nachum Dershowitz, Lior Wolf

    Abstract: The indexing and searching of historical documents have garnered attention in recent years due to massive digitization efforts of important collections worldwide. Pure textual search in these corpora is a problem since optical character recognition (OCR) is infamous for performing poorly on such historical material, which often suffers from poor preservation. We propose a novel text-based method fo…

    Submitted 21 April, 2020; v1 submitted 17 September, 2019; originally announced September 2019.

  13. arXiv:1908.04329  [pdf, ps, other]

    cs.DM math.CO

    Nonleaf Patterns in Trees: Protected Nodes and Fine Numbers

    Authors: Nachum Dershowitz

    Abstract: A closed-form formula is derived for the number of occurrences of matches of a multiset of patterns among all ordered (plane-planted) trees with a given number of edges. A pattern looks like a tree, with internal nodes and leaves, but also contains components that match subtrees or sequences of subtrees. This result extends previous versatile tree-pattern enumeration formulae to incorporate compone…

    Submitted 12 August, 2019; originally announced August 2019.

    Journal ref: Journal of Integer Sequences, vol. 23, art. 20.7.7 (2020)

  14. arXiv:1904.07953  [pdf, other]

    cs.CL

    Semantic Characteristics of Schizophrenic Speech

    Authors: Kfir Bar, Vered Zilberstein, Ido Ziv, Heli Baram, Nachum Dershowitz, Samuel Itzikowitz, Eiran Vadim Harel

    Abstract: Natural language processing tools are used to automatically detect disturbances in transcribed speech of schizophrenia inpatients who speak Hebrew. We measure topic mutation over time and show that controls maintain more cohesive speech than inpatients. We also examine differences in how inpatients and controls use adjectives and adverbs to describe content words and show that the ones used by con…

    Submitted 16 April, 2019; originally announced April 2019.

    Journal ref: CLPsych at NAACL 2019

  15. arXiv:1612.04076  [pdf, ps, other]

    cs.DM

    Touchard's Drunkard

    Authors: Nachum Dershowitz

    Abstract: Based on Touchard's identity, a simple derivation is given for the enumeration of the N/S/E/W walks that remain on the north side of the origin.

    Submitted 13 December, 2016; originally announced December 2016.

    Journal ref: Journal of Integer Sequences, Vol. 20 (2017), Article 17.1.5

  16. arXiv:1609.08389  [pdf]

    cs.CL cs.CY

    A Hackathon for Classical Tibetan

    Authors: Orna Almogi, Lena Dankin, Nachum Dershowitz, Lior Wolf

    Abstract: We describe the course of a hackathon dedicated to the development of linguistic tools for Tibetan Buddhist studies. Over a period of five days, a group of seventeen scholars, scientists, and students developed and compared algorithms for intertextual alignment and text classification, along with some basic language tools, including a stemmer and word segmenter.

    Submitted 31 December, 2018; v1 submitted 27 September, 2016; originally announced September 2016.

  17. arXiv:1608.08740  [pdf, ps, other]

    cs.DM

    1700 Forests

    Authors: Nachum Dershowitz

    Abstract: Since ordered trees and Dyck paths are equinumerous, so are ordered forests and grand-Dyck paths that start with an upwards step.

    Submitted 31 August, 2016; originally announced August 2016.

    Comments: For OEIS entry

  18. arXiv:1606.01148  [pdf, ps, other]

    cs.LO

    Tripartite Unions

    Authors: Nachum Dershowitz

    Abstract: This note provides conditions under which the union of three well-founded binary relations is also well-founded.

    Submitted 3 June, 2016; originally announced June 2016.

    Journal ref: IJCAR, Oxford, Springer, Lecture Notes in Computer Science, vol. 10900, pp. 117-133 (2018)

  19. arXiv:1604.06225  [pdf]

    cs.IR cs.CL

    OCR Error Correction Using Character Correction and Feature-Based Word Classification

    Authors: Ido Kissos, Nachum Dershowitz

    Abstract: This paper explores the use of a learned classifier for post-OCR text correction. Experiments with the Arabic language show that this approach, which integrates a weighted confusion matrix and a shallow language model, improves the vast majority of segmentation and recognition errors, the most frequent types of error on our dataset.

    Submitted 21 April, 2016; originally announced April 2016.

    Comments: Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS2016), Santorini, Greece, April 11-14, 2016

    Journal ref: Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS 2016), Santorini, Greece, pp. 198-203 (2016)

  20. arXiv:1604.04295  [pdf, ps, other]

    cs.LO

    Axiomatizing Analog Algorithms

    Authors: Olivier Bournez, Nachum Dershowitz, Pierre Néron

    Abstract: We propose a formalization of generic algorithms that includes analog algorithms. This is achieved by reformulating and extending the framework of abstract state machines to include continuous-time models of computation. We prove that every hybrid algorithm satisfying some reasonable postulates may be expressed precisely by a program in a simple and expressive language.

    Submitted 25 April, 2016; v1 submitted 14 April, 2016; originally announced April 2016.

    Journal ref: Computability in Europe 2016: Pursuit of the Universal (CiE), Paris, France, Lecture Notes in Computer Science, vol. 9709, Springer-Verlag, Switzerland, pp. 215-224 (2016)

  21. Cellular Automata are Generic

    Authors: Nachum Dershowitz, Evgenia Falkovich

    Abstract: Any algorithm (in the sense of Gurevich's abstract-state-machine axiomatization of classical algorithms) operating over any arbitrary unordered domain can be simulated by a dynamic cellular automaton, that is, by a pattern-directed cellular automaton with unconstrained topology and with the power to create new cells. The advantage is that the latter is closer to physical reality. The overhead of…

    Submitted 12 April, 2015; originally announced April 2015.

    Comments: In Proceedings DCM 2014, arXiv:1504.01927

    Journal ref: EPTCS 179, 2015, pp. 17-32

  22. The Generic Model of Computation

    Authors: Nachum Dershowitz

    Abstract: Over the past two decades, Yuri Gurevich and his colleagues have formulated axiomatic foundations for the notion of algorithm, be it classical, interactive, or parallel, and formalized them in the new generic framework of abstract state machines. This approach has recently been extended to suggest a formalization of the notion of effective computation over arbitrary countable domains. The central…

    Submitted 30 July, 2012; originally announced August 2012.

    Comments: In Proceedings DCM 2011, arXiv:1207.6821

    ACM Class: F.1.1; F.3.1

    Journal ref: EPTCS 88, 2012, pp. 59-71

  23. A Formalization and Proof of the Extended Church-Turing Thesis -Extended Abstract-

    Authors: Nachum Dershowitz, Evgenia Falkovich

    Abstract: We prove the Extended Church-Turing Thesis: Every effective algorithm can be efficiently simulated by a Turing machine. This is accomplished by emulating an effective algorithm via an abstract state machine, and simulating such an abstract state machine by a random access machine, representing data as a minimal term graph.

    Submitted 30 July, 2012; originally announced July 2012.

    Comments: In Proceedings DCM 2011, arXiv:1207.6821

    ACM Class: F.1.1; F.3.1

    Journal ref: EPTCS 88, 2012, pp. 72-78

  24. arXiv:1110.5867  [pdf, ps, other]

    cs.LO

    From Total Assignment Enumeration to Modern SAT Solver

    Authors: Nachum Dershowitz, Alexander Nadel

    Abstract: A new framework for presenting and analyzing the functionality of a modern DLL-based SAT solver is proposed. Our approach exploits the inherent relation between backtracking and resolution. We show how to derive the algorithm of a modern SAT solver from DLL step-by-step. We analyze the inference power of Boolean Constraint Propagation, Non-Chronological Backtracking and 1UIP-based Conflict-Directe…

    Submitted 26 October, 2011; originally announced October 2011.

  25. arXiv:0811.0811  [pdf, ps, other]

    cs.GL cs.DS cs.LO

    When are two algorithms the same?

    Authors: Andreas Blass, Nachum Dershowitz, Yuri Gurevich

    Abstract: People usually regard algorithms as more abstract than the programs that implement them. The natural way to formalize this idea is that algorithms are equivalence classes of programs with respect to a suitable equivalence relation. We argue that no such equivalence relation exists.

    Submitted 5 November, 2008; originally announced November 2008.

    Journal ref: Bulletin of Symbolic Logic, vol. 15, no. 2, pp. 145-168, 2009

  26. Space-Efficient Bounded Model Checking

    Authors: Jacob Katz, Ziyad Hanna, Nachum Dershowitz

    Abstract: Current algorithms for bounded model checking use SAT methods for checking satisfiability of Boolean formulae. These methods suffer from the potential memory explosion problem. Methods based on the validity of Quantified Boolean Formulae (QBF) allow an exponentially more succinct representation of formulae to be checked, because no "unrolling" of the transition relation is required. These method…

    Submitted 25 October, 2007; originally announced October 2007.

    Comments: Submitted on behalf of EDAA (http://www.edaa.com/)

    Journal ref: In Design, Automation and Test in Europe - DATE'05, Munich, Germany (2005)

  27. arXiv:0707.4255  [pdf, ps, other]

    cs.CC cs.LO

    Complexity of Propositional Proofs under a Promise

    Authors: Nachum Dershowitz, Iddo Tzameret

    Abstract: We study, within the framework of propositional proof complexity, the problem of certifying unsatisfiability of CNF formulas under the promise that any satisfiable formula has many satisfying assignments, where "many" stands for an explicitly specified function $\Lambda$ in the number of variables $n$. To this end, we develop propositional proof systems under different measures of promises (t…

    Submitted 28 July, 2007; originally announced July 2007.

    Comments: 32 pages; a preliminary version appeared in the Proceedings of ICALP'07

    ACM Class: F.2.2; F.4.1

    Journal ref: ACM Transactions on Computational Logic, 11(3):1-29, 2010

  28. arXiv:cs/0610054  [pdf, ps, other]

    cs.LO cs.DM

    Enumeration Problems Related to Ground Horn Theories

    Authors: Nachum Dershowitz, Mitchell A. Harris, Guan-Shieng Huang

    Abstract: We investigate the enumeration of varieties of boolean theories related to Horn clauses. We describe a number of combinatorial equivalences among different characterizations and calculate the number of different theories in $n$ variables for slightly different characterizations. The method of counting is via counting models using a satisfiability checker.

    Submitted 4 February, 2008; v1 submitted 10 October, 2006; originally announced October 2006.

    Comments: 4 pages, before journal submission

    ACM Class: F.4.1

  29. arXiv:cs/0605085  [pdf, ps, other]

    cs.LO

    A Scalable Algorithm for Minimal Unsatisfiable Core Extraction

    Authors: Nachum Dershowitz, Ziyad Hanna, Alexander Nadel

    Abstract: We propose a new algorithm for minimal unsatisfiable core extraction, based on a deeper exploration of resolution-refutation properties. We provide experimental results on formal verification benchmarks confirming that our algorithm finds smaller cores than suboptimal algorithms; and that it runs faster than those algorithms that guarantee minimality of the core.

    Submitted 28 May, 2006; v1 submitted 19 May, 2006; originally announced May 2006.

    Journal ref: Proceedings of the 9th International Conference Theory and Applications of Satisfiability Testing (SAT 2006), Lecture Notes in Computer Science, volume 4121, Springer-Verlag, Berlin, pp. 36-41

  30. arXiv:cs/0510069  [pdf, ps, other]

    cs.LO

    Comparing Computational Power

    Authors: Udi Boker, Nachum Dershowitz

    Abstract: It is common practice to compare the computational power of different models of computation. For example, the recursive functions are strictly more powerful than the primitive recursive functions, because the latter are a proper subset of the former (which includes Ackermann's function). Side-by-side with this "containment" method of measuring power, it is standard to use an approach based on "s…

    Submitted 23 October, 2005; originally announced October 2005.

    Comments: To appear in Logic Journal of the IGPL in 2006

    Journal ref: Logic Journal of the IGPL, vol. 14, no. 5, pp. 633-648, 2006

  31. Abstract Canonical Inference

    Authors: Maria Paola Bonacina, Nachum Dershowitz

    Abstract: An abstract framework of canonical inference is used to explore how different proof orderings induce different variants of saturation and completeness. Notions like completion, paramodulation, saturation, redundancy elimination, and rewrite-system reduction are connected to proof orderings. Fairness of deductive mechanisms is defined in terms of proof orderings, distinguishing between (ordinary)…

    Submitted 13 September, 2006; v1 submitted 17 June, 2004; originally announced June 2004.

    Comments: 28 pages, no figures, to appear in ACM Trans. on Computational Logic

    Report number: RR 18/2004

    Journal ref: ACM Transactions on Computational Logic, 8(1):180-208, January 2007

  32. arXiv:cs/0304017  [pdf, ps, other]

    cs.LO

    Ground Canonicity

    Authors: Nachum Dershowitz

    Abstract: We explore how different proof orderings induce different notions of saturation. We relate completion, paramodulation, saturation, redundancy elimination, and rewrite system reduction to proof orderings.

    Submitted 10 April, 2003; originally announced April 2003.

    ACM Class: F.4.1

  33. arXiv:cs/0012008  [pdf, ps, other]

    cs.PL

    A General Framework for Automatic Termination Analysis of Logic Programs

    Authors: Nachum Dershowitz, Naomi Lindenstrauss, Yehoshua Sagiv, Alexander Serebrenik

    Abstract: This paper describes a general framework for automatic termination analysis of logic programs, where we understand by "termination" the finiteness of the LD-tree constructed for the program and a given query. A general property of mappings from a certain subset of the branches of an infinite LD-tree into a finite set is proved. From this result several termination theorems are derived, by usi…

    Submitted 13 December, 2000; originally announced December 2000.

    ACM Class: D.1.6

    Journal ref: Applicable Algebra in Engineering, Communication and Computing, vol. 12, no. 1/2, pp. 117-156, 2001

  34. arXiv:cs/0011036  [pdf, ps, other]

    cs.PL

    Automatic Termination Analysis of Programs Containing Arithmetic Predicates

    Authors: Nachum Dershowitz, Naomi Lindenstrauss, Yehoshua Sagiv, Alexander Serebrenik

    Abstract: For logic programs with arithmetic predicates, showing termination is not easy, since the usual order for the integers is not well-founded. A new method, easily incorporated in the TermiLog system for automatic termination analysis, is presented for showing termination in this case. The method consists of the following steps: First, a finite abstract domain for representing the range of intege…

    Submitted 23 November, 2000; originally announced November 2000.

    Comments: Appeared also in Electronic Notes in Computer Science vol. 30

    ACM Class: D.1.6; D.2.4