Zum Hauptinhalt springen

Showing 1–19 of 19 results for author: Goldberg, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.14360  [pdf

    cs.CL cs.LG

    Is ChatGPT a game changer for geocoding -- a benchmark for geocoding address parsing techniques

    Authors: Zhengcong Yin, Diya Li, Daniel W. Goldberg

    Abstract: The remarkable success of GPT models across various tasks, including toponymy recognition motivates us to assess the performance of the GPT-3 model in the geocoding address parsing task. To ensure that the evaluation more accurately mirrors performance in real-world scenarios with diverse user input qualities and resolve the pressing need for a 'gold standard' evaluation dataset for geocoding syst… ▽ More

    Submitted 15 December, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

  2. arXiv:2301.13631  [pdf

    cs.CL cs.AI

    TopoBERT: Plug and Play Toponym Recognition Module Harnessing Fine-tuned BERT

    Authors: Bing Zhou, Lei Zou, Yingjie Hu, Yi Qiang, Daniel Goldberg

    Abstract: Extracting precise geographical information from textual contents is crucial in a plethora of applications. For example, during hazardous events, a robust and unbiased toponym extraction framework can provide an avenue to tie the location concerned to the topic discussed by news media posts and pinpoint humanitarian help requests or damage reports from social media. Early studies have leveraged ru… ▽ More

    Submitted 3 February, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: 8 Pages, 6 figures

  3. arXiv:2012.00058  [pdf

    cs.LG cs.DB

    PMLB v1.0: An open source dataset collection for benchmarking machine learning methods

    Authors: Joseph D. Romano, Trang T. Le, William La Cava, John T. Gregg, Daniel J. Goldberg, Natasha L. Ray, Praneel Chakraborty, Daniel Himmelstein, Weixuan Fu, Jason H. Moore

    Abstract: Motivation: Novel machine learning and statistical modeling studies rely on standardized comparisons to existing methods using well-studied benchmark datasets. Few tools exist that provide rapid access to many of these datasets through a standardized, user-friendly interface that integrates well with popular data science workflows. Results: This release of PMLB provides the largest collection of… ▽ More

    Submitted 6 April, 2021; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: 4 pages, 1 figure. *: These authors contributed equally

    ACM Class: H.2.8

  4. arXiv:1807.02227  [pdf, ps, other

    math.PR cs.DS math.OC q-fin.CP q-fin.MF

    Polynomial time algorithm for optimal stopping with fixed accuracy

    Authors: David A. Goldberg, Yilun Chen

    Abstract: The problem of high-dimensional path-dependent optimal stopping (OS) is important to multiple academic communities and applications. Modern OS tasks often have a large number of decision epochs, and complicated non-Markovian dynamics, making them especially challenging. Standard approaches, often relying on ADP, duality, deep learning and other heuristics, have shown strong empirical performance,… ▽ More

    Submitted 14 May, 2024; v1 submitted 5 July, 2018; originally announced July 2018.

  5. arXiv:1301.1762  [pdf, ps, other

    math.PR cs.CC cs.DM math-ph math.CO

    Second-order Markov random fields for independent sets on the infinite Cayley tree

    Authors: David A. Goldberg

    Abstract: Recently, there has been significant interest in understanding the properties of Markov random fields (M.r.f.) defined on the independent sets of sparse graphs. When these M.r.f. are restricted to pairwise interactions (i.e. hardcore model), much progress has been made. However, considerably less is known in the presence of higher-order interactions, which arise e.g. in the analysis of independent… ▽ More

    Submitted 18 June, 2015; v1 submitted 9 January, 2013; originally announced January 2013.

  6. arXiv:1001.5454  [pdf, other

    cond-mat.stat-mech cond-mat.dis-nn cs.IT math.PR

    Non-Equilibrium Statistical Physics of Currents in Queuing Networks

    Authors: Vladimir Y. Chernyak, Michael Chertkov, David A. Goldberg, Konstantin Turitsyn

    Abstract: We consider a stable open queuing network as a steady non-equilibrium system of interacting particles. The network is completely specified by its underlying graphical structure, type of interaction at each node, and the Markovian transition rates between nodes. For such systems, we ask the question ``What is the most likely way for large currents to accumulate over time in a network ?'', where tim… ▽ More

    Submitted 19 June, 2010; v1 submitted 29 January, 2010; originally announced January 2010.

    Comments: 26 pages, 5 figures

    Report number: LA-UR 10-00419

  7. arXiv:0912.0338  [pdf, ps, other

    math.PR cs.DC math.OC

    Correlation Decay in Random Decision Networks

    Authors: David Gamarnik, David Goldberg, Theophane Weber

    Abstract: We consider a decision network on an undirected graph in which each node corresponds to a decision variable, and each node and edge of the graph is associated with a reward function whose value depends only on the variables of the corresponding nodes. The goal is to construct a decision vector which maximizes the total reward. This decision problem encompasses a variety of models, including maxi… ▽ More

    Submitted 2 December, 2009; originally announced December 2009.

  8. arXiv:0807.1277  [pdf, ps, other

    cs.DM cs.DS

    Randomized greedy algorithms for independent sets and matchings in regular graphs: Exact results and finite girth corrections

    Authors: David Gamarnik, David Goldberg

    Abstract: We derive new results for the performance of a simple greedy algorithm for finding large independent sets and matchings in constant degree regular graphs. We show that for $r$-regular graphs with $n$ nodes and girth at least $g$, the algorithm finds an independent set of expected cardinality $f(r)n - O\big(\frac{(r-1)^{\frac{g}{2}}}{\frac{g}{2}!} n\big)$, where $f(r)$ is a function which we expl… ▽ More

    Submitted 8 July, 2008; originally announced July 2008.

    Comments: 24 pages

    ACM Class: F.2.2; G.1.6; G.2.1; G.2.2; G.3

  9. arXiv:0801.3113  [pdf, ps, other

    cs.NE cs.AI

    iBOA: The Incremental Bayesian Optimization Algorithm

    Authors: Martin Pelikan, Kumara Sastry, David E. Goldberg

    Abstract: This paper proposes the incremental Bayesian optimization algorithm (iBOA), which modifies standard BOA by removing the population of solutions and using incremental updates of the Bayesian network. iBOA is shown to be able to learn and exploit unrestricted Bayesian networks using incremental techniques for updating both the structure as well as the parameters of the probabilistic model. This re… ▽ More

    Submitted 20 January, 2008; originally announced January 2008.

    Comments: Also available at the MEDAL web site, http://medal.cs.umsl.edu/

    Report number: MEDAL Report No. 2008002 ACM Class: I.2.6; I.2.8; G.1.6

    Journal ref: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2008), ACM Press, 455-462

  10. arXiv:cs/0502057  [pdf, ps, other

    cs.NE cs.AI

    Decomposable Problems, Niching, and Scalability of Multiobjective Estimation of Distribution Algorithms

    Authors: Kumara Sastry, Martin Pelikan, David E. Goldberg

    Abstract: The paper analyzes the scalability of multiobjective estimation of distribution algorithms (MOEDAs) on a class of boundedly-difficult additively-separable multiobjective optimization problems. The paper illustrates that even if the linkage is correctly identified, massive multimodality of the search problems can easily overwhelm the nicher and lead to exponential scale-up. Facetwise models are s… ▽ More

    Submitted 12 February, 2005; originally announced February 2005.

    Comments: Submitted to Genetic and Evolutionary Computation Conference, GECCO-2005

    Report number: IlliGAL Report No. 2005004

  11. arXiv:cs/0502034  [pdf, ps, other

    cs.NE cs.AI

    Multiobjective hBOA, Clustering, and Scalability

    Authors: Martin Pelikan, Kumara Sastry, David E. Goldberg

    Abstract: This paper describes a scalable algorithm for solving multiobjective decomposable problems by combining the hierarchical Bayesian optimization algorithm (hBOA) with the nondominated sorting genetic algorithm (NSGA-II) and clustering in the objective space. It is first argued that for good scalability, clustering or some other form of niching in the objective space is necessary and the size of ea… ▽ More

    Submitted 7 February, 2005; originally announced February 2005.

    Comments: Also IlliGAL Report No. 2005005 (http://www-illigal.ge.uiuc.edu/). Submitted to GECCO-2005

    Report number: IlliGAL Report No. 2005005 ACM Class: I.2.8; I.2.6; G.1.6; I.5.3

  12. arXiv:cs/0502023  [pdf, ps, other

    cs.NE cs.AI

    Sub-structural Niching in Estimation of Distribution Algorithms

    Authors: K. Sastry, H. A. Abbass, D. E. Goldberg, D. D. Johnson

    Abstract: We propose a sub-structural niching method that fully exploits the problem decomposition capability of linkage-learning methods such as the estimation of distribution algorithms and concentrate on maintaining diversity at the sub-structural level. The proposed method consists of three key components: (1) Problem decomposition and sub-structure identification, (2) sub-structure fitness estimation… ▽ More

    Submitted 3 February, 2005; originally announced February 2005.

    Report number: IlliGAL Report No. 2005002

  13. arXiv:cs/0502022  [pdf, ps, other

    cs.NE cs.AI

    Sub-Structural Niching in Non-Stationary Environments

    Authors: K. Sastry, H. A. Abbass, D. E. Goldberg

    Abstract: Niching enables a genetic algorithm (GA) to maintain diversity in a population. It is particularly useful when the problem has multiple optima where the aim is to find all or as many as possible of these optima. When the fitness landscape of a problem changes overtime, the problem is called non--stationary, dynamic or time--variant problem. In these problems, niching can maintain useful solution… ▽ More

    Submitted 3 February, 2005; originally announced February 2005.

    Comments: Final version published in 2005 Australian Artificial Intelligence Conference, pp. 873--885

    Report number: IlliGAL Report No. 2004035

  14. arXiv:cs/0502021  [pdf, ps, other

    cs.NE cs.AI

    Oiling the Wheels of Change: The Role of Adaptive Automatic Problem Decomposition in Non--Stationary Environments

    Authors: H. A. Abbass, K. Sastry, D. E. Goldberg

    Abstract: Genetic algorithms (GAs) that solve hard problems quickly, reliably and accurately are called competent GAs. When the fitness landscape of a problem changes overtime, the problem is called non--stationary, dynamic or time--variant problem. This paper investigates the use of competent GAs for optimizing non--stationary optimization problems. More specifically, we use an information theoretic appr… ▽ More

    Submitted 3 February, 2005; originally announced February 2005.

  15. arXiv:cs/0502020  [pdf, ps, other

    cs.AI cs.NE

    Population Sizing for Genetic Programming Based Upon Decision Making

    Authors: K. Sastry, U. -M. O'Reilly, D. E. Goldberg

    Abstract: This paper derives a population sizing relationship for genetic programming (GP). Following the population-sizing derivation for genetic algorithms in Goldberg, Deb, and Clark (1992), it considers building block decision making as a key facet. The analysis yields a GP-unique relationship because it has to account for bloat and for the fact that GP solutions often use subsolution multiple times.… ▽ More

    Submitted 3 February, 2005; originally announced February 2005.

    Comments: Final version published in O'Reilly, U.-M., et al. (2004). Genetic Programming Theory and Practice II. Boston, MA: Kluwer Academic Publishers. 49--66

    Report number: IlliGAL Report No. 2004028

  16. Efficiency Enhancement of Genetic Algorithms via Building-Block-Wise Fitness Estimation

    Authors: Kumara Sastry, Martin Pelikan, David E. Goldberg

    Abstract: This paper studies fitness inheritance as an efficiency enhancement technique for a class of competent genetic algorithms called estimation distribution algorithms. Probabilistic models of important sub-solutions are developed to estimate the fitness of a proportion of individuals in the population, thereby avoiding computationally expensive function evaluations. The effect of fitness inheritanc… ▽ More

    Submitted 18 May, 2004; originally announced May 2004.

    Comments: IEEE International Conference on Evolutionary Computation (CEC-2004)

    Report number: IlliGAL Report No. 2004010 ACM Class: G.1.6; G.3; I.2.6; I.2.8

  17. arXiv:cs/0405064  [pdf, ps, other

    cs.NE

    Designing Competent Mutation Operators via Probabilistic Model Building of Neighborhoods

    Authors: Kumara Sastry, David E. Goldberg

    Abstract: This paper presents a competent selectomutative genetic algorithm (GA), that adapts linkage and solves hard problems quickly, reliably, and accurately. A probabilistic model building process is used to automatically identify key building blocks (BBs) of the search problem. The mutation operator uses the probabilistic model of linkage groups to find the best among competing building blocks. The c… ▽ More

    Submitted 18 May, 2004; originally announced May 2004.

    Comments: Genetic and Evolutionary Computation Conference (GECCO-2004)

    Report number: IlliGAL Report No. 2004006 ACM Class: G.1.6; G.3; I.2.6; I.2.8

  18. arXiv:cs/0405063  [pdf, ps, other

    cs.NE

    Let's Get Ready to Rumble: Crossover Versus Mutation Head to Head

    Authors: Kumara Sastry, David E. Goldberg

    Abstract: This paper analyzes the relative advantages between crossover and mutation on a class of deterministic and stochastic additively separable problems. This study assumes that the recombination and mutation operators have the knowledge of the building blocks (BBs) and effectively exchange or search among competing BBs. Facetwise models of convergence time and population sizing have been used to det… ▽ More

    Submitted 18 May, 2004; originally announced May 2004.

    Comments: Genetic and Evolutionary Computation Conference (GECCO-2004)

    Report number: IlliGAL Report No. 2004005 ACM Class: G.1.6; G.3; I.2.6; I.2.8

  19. arXiv:cs/0405062  [pdf, ps, other

    cs.NE

    Efficiency Enhancement of Probabilistic Model Building Genetic Algorithms

    Authors: Kumara Sastry, David E. Goldberg, Martin Pelikan

    Abstract: This paper presents two different efficiency-enhancement techniques for probabilistic model building genetic algorithms. The first technique proposes the use of a mutation operator which performs local search in the sub-solution neighborhood identified through the probabilistic model. The second technique proposes building and using an internal probabilistic model of the fitness along with the p… ▽ More

    Submitted 18 May, 2004; originally announced May 2004.

    Comments: Optimization by Building and Using Probabilistic Models. Workshop at the 2004 Genetic and Evolutionary Computation Conference

    Report number: IlliGAL Report No. 2004020 ACM Class: G.1.6; G.3; I.2.6; I.2.8