Showing 1–50 of 53 results for author: Kurtz, M

Searching in archive cs.
  1. arXiv:2405.03594 [pdf, other]

    cs.CL cs.AI

    Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment

    Authors: Abhinav Agarwalla, Abhay Gupta, Alexandre Marques, Shubhra Pandit, Michael Goin, Eldar Kurtic, Kevin Leong, Tuan Nguyen, Mahmoud Salem, Dan Alistarh, Sean Lie, Mark Kurtz

    Abstract: Large language models (LLMs) have revolutionized Natural Language Processing (NLP), but their size creates computational bottlenecks. We introduce a novel approach to create accurate, sparse foundational versions of performant LLMs that achieve full accuracy recovery for fine-tuning tasks at up to 70% sparsity. We achieve this for the LLaMA-2 7B model by combining the SparseGPT one-shot pruning me…

    Submitted 6 May, 2024; originally announced May 2024.

  2. arXiv:2404.17438 [pdf, other]

    cs.RO cs.AI cs.MA

    Real-World Deployment of a Hierarchical Uncertainty-Aware Collaborative Multiagent Planning System

    Authors: Martina Stadler Kurtz, Samuel Prentice, Yasmin Veys, Long Quang, Carlos Nieto-Granda, Michael Novitzky, Ethan Stump, Nicholas Roy

    Abstract: We would like to enable a collaborative multiagent team to navigate at long length scales and under uncertainty in real-world environments. In practice, planning complexity scales with the number of agents in the team, with the length scale of the environment, and with environmental uncertainty. Enabling tractable planning requires developing abstract models that can represent complex, high-qualit…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024

  3. arXiv:2312.14211 [pdf, ps, other]

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33rd annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  4. arXiv:2312.08579 [pdf, other]

    cs.CL astro-ph.IM cs.LG

    Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach

    Authors: Golnaz Shapurian, Michael J Kurtz, Alberto Accomazzi

    Abstract: The automatic identification of planetary feature names in astronomy publications presents numerous challenges. These features include craters, defined as roughly circular depressions resulting from impact or volcanic activity; dorsas, which are elongate raised structures or wrinkle ridges; and lacus, small irregular patches of dark, smooth material on the Moon, referred to as "lake" (Planetary Na…

    Submitted 17 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

  5. arXiv:2303.17612 [pdf, other]

    cs.CL cs.AI cs.LG

    oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes

    Authors: Daniel Campos, Alexandre Marques, Mark Kurtz, ChengXiang Zhai

    Abstract: In this paper, we introduce the range of oBERTa language models, an easy-to-use set of language models which allows Natural Language Processing (NLP) practitioners to obtain between 3.8 and 24.3 times faster models without expertise in model compression. Specifically, oBERTa extends existing work on pruning, knowledge distillation, and quantization and leverages frozen embeddings improves distilla…

    Submitted 6 June, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: SustaiNLP2023 @ ACL 2023, 9 pages, 2 figures, 45 tables

  6. arXiv:2212.00744 [pdf, ps, other]

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first…

    Submitted 29 November, 2022; originally announced December 2022.

  7. arXiv:2205.12452 [pdf, other]

    cs.CL cs.AI

    Sparse*BERT: Sparse Models Generalize To New tasks and Domains

    Authors: Daniel Campos, Alexandre Marques, Tuan Nguyen, Mark Kurtz, ChengXiang Zhai

    Abstract: Large Language Models have become the core architecture upon which most modern natural language processing (NLP) systems build. These models can consistently deliver impressive accuracy and robustness across tasks and domains, but their high computational overhead can make inference difficult and expensive. To make using these models less costly, recent work has explored leveraging structured and…

    Submitted 5 April, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: Presented at Sparsity in Neural Networks Workshop at ICML 2022, 6 pages, 2 figures, 4 tables

  8. arXiv:2203.07259 [pdf, other]

    cs.CL cs.LG

    The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models

    Authors: Eldar Kurtic, Daniel Campos, Tuan Nguyen, Elias Frantar, Mark Kurtz, Benjamin Fineran, Michael Goin, Dan Alistarh

    Abstract: Transformer-based language models have become a key building block for natural language processing. While these models are extremely accurate, they can be too large and computationally intensive to run on standard deployments. A variety of compression methods, including distillation, quantization, structured and unstructured pruning are known to decrease model size and increase inference speed, wi…

    Submitted 17 October, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted to EMNLP 2022

  9. arXiv:2202.00777 [pdf, ps, other]

    cs.HC astro-ph.IM

    Web accessibility trends and implementation in dynamic web applications

    Authors: Timothy W. Hostetler, Shinyi Chen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Donna M. Thompson, Roman Chyla, Golnaz Shapurian, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Stephen McDonald, Felix Grezes

    Abstract: The NASA Astrophysics Data System (ADS), a critical research service for the astrophysics community, strives to provide the most accessible and inclusive environment for the discovery and exploration of the astronomical literature. Part of this goal involves creating a digital platform that can accommodate everybody, including those with disabilities that would benefit from alternative ways to pre…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted to ADASS XXXI (2021)

  10. arXiv:2112.00590 [pdf, ps, other]

    cs.CL astro-ph.IM

    Building astroBERT, a language model for Astronomy & Astrophysics

    Authors: Felix Grezes, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Shinyi Chen, Chris Tanner, Pavlos Protopapas

    Abstract: The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and…

    Submitted 1 December, 2021; originally announced December 2021.

  11. arXiv:2111.13445 [pdf, other]

    cs.CV cs.AI cs.LG

    How Well Do Sparse Imagenet Models Transfer?

    Authors: Eugenia Iofinova, Alexandra Peste, Mark Kurtz, Dan Alistarh

    Abstract: Transfer learning is a classic paradigm by which models pretrained on large "upstream" datasets are adapted to yield good results on "downstream" specialized datasets. Generally, more accurate models on the "upstream" dataset tend to provide better transfer accuracy "downstream". In this work, we perform an in-depth investigation of this phenomenon in the context of convolutional neural networks (…

    Submitted 21 April, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: Accepted to CVPR'22. This version: 25 pages, 9 figures (including appendix). **Includes extended upstream training results, which are not present in the CVPR version.**

  12. arXiv:2010.01418 [pdf]

    cs.DL astro-ph.IM physics.soc-ph

    Second Order Operators in the NASA Astrophysics Data System

    Authors: Michael J. Kurtz, Roman Chyla

    Abstract: Second Order Operators (SOOs) are database functions which form secondary queries based on attributes of the objects returned in an initial query; they can provide powerful methods to investigate complex, multipartite information graphs. The NASA Astrophysics Data System (ADS) has implemented four SOOs, reviews, useful, trending, and similar which use the citations, references, downloads, and abst…

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: ADS Bibcode:2020BAAS...52b0207K, author's version

    Journal ref: Bulletin of the American Astronomical Society, Vol. 52, No. 2, id. 0207 2020

  13. arXiv:2009.14323 [pdf]

    astro-ph.IM cs.DL

    Enabling Synergy: Improving the Information Infrastructure for Planetary Science

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin A. Henneken

    Abstract: In this whitepaper we advocate that the Planetary Science (PS) community build a discipline-specific digital library, in collaboration with the existing astronomy digital library, ADS. We suggest that the PS data archives increase their level of curation to allow for direct linking between the archival data and the derived journal articles. And we suggest that a new component of the PS information…

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: 8 pages, submitted to the Planetary Science and Astrobiology Decadal Survey 2023-2032

  14. arXiv:2009.05048 [pdf, ps, other]

    cs.SE astro-ph.IM

    Agile methodologies in teams with highly creative and autonomous members

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi

    Abstract: The Agile manifesto encourages us to value individuals and interactions over processes and tools, while Scrum, the most adopted Agile development methodology, is essentially based on roles, events, artifacts, and the rules that bind them together (i.e., processes). Moreover, it is generally proclaimed that whenever a Scrum project does not succeed, the reason is because Scrum was not implemented c…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: To appear in the proceedings of the 29th annual international Astronomical Data Analysis Software & Systems (ADASS XXIX)

  15. arXiv:1901.05463 [pdf, ps, other]

    astro-ph.IM cs.DL

    Fundamentals of effective cloud management for the new NASA Astrophysics Data System

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi, Nathan Rapport

    Abstract: The new NASA Astrophysics Data System (ADS) is designed with a service-oriented architecture (SOA) that consists of multiple customized Apache Solr search engine instances plus a collection of microservices, containerized using Docker, and deployed in Amazon Web Services (AWS). For complex systems, like the ADS, this loosely coupled architecture can lead to a more scalable, reliable and resilient s…

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: To appear in the proceedings of the 28th annual international Astronomical Data Analysis Software & Systems (ADASS XXVIII)

  16. arXiv:1803.03598 [pdf]

    astro-ph.IM cs.DL physics.soc-ph

    Merging the Astrophysics and Planetary Science Information Systems

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin A. Henneken

    Abstract: Conceptually exoplanet research has one foot in the discipline of Astrophysics and the other foot in Planetary Science. Research strategies for exoplanets will require efficient access to data and information from both realms. Astrophysics has a sophisticated, well integrated, distributed information system with archives and data centers which are interlinked with the technical literature via the…

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: Whitepaper submitted to the Committee on an Exoplanet Science Strategy

  17. arXiv:1801.00815 [pdf]

    cs.AI astro-ph.IM physics.soc-ph

    Advice from the Oracle: Really Intelligent Information Retrieval

    Authors: Michael J. Kurtz

    Abstract: What is "intelligent" information retrieval? Essentially this is asking what is intelligence. In this article I will attempt to show some of the aspects of human intelligence, as related to information retrieval. I will do this by the device of a semi-imaginary Oracle. Every Observatory has an oracle, someone who is a distinguished scientist, has great administrative responsibilities, acts as ment…

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: Author copy; published 25 years ago at the beginning of the Astrophysics Data System; 2018 keywords added

    Journal ref: In: Heck A., Murtagh F. (eds) Intelligent Information Retrieval: The Case of Astronomy and Related Space Sciences. Astrophysics and Space Science Library, vol 182. Springer, Dordrecht (1993)

  18. arXiv:1712.06704 [pdf, ps, other]

    stat.ML cs.CL cs.IR

    Multilingual Topic Models

    Authors: Kriste Krstovski, Michael J. Kurtz, David A. Smith, Alberto Accomazzi

    Abstract: Scientific publications have evolved several features for mitigating vocabulary mismatch when indexing, retrieving, and computing similarity between articles. These mitigation strategies range from simply focusing on high-value article sections, such as titles and abstracts, to assigning keywords, often from controlled vocabularies, either manually or through automatic annotation. Various document…

    Submitted 18 December, 2017; originally announced December 2017.

    Comments: 18 pages, 9 figures

  19. New ADS Functionality for the Curator

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Steven McDonald, Taylor J. Shaulis, Sergi Blanco-Cuaresma, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton

    Abstract: In this paper we provide an update concerning the operations of the NASA Astrophysics Data System (ADS), its services and user interface, and the content currently indexed in its database. As the primary information system used by researchers in Astronomy, the ADS aims to provide a comprehensive index of all scholarly resources appearing in the literature. With the current effort in our community…

    Submitted 23 October, 2017; originally announced October 2017.

    Comments: Submitted to the Proceedings of Library and Information Services in Astronomy VIII, Strasbourg, France

  20. arXiv:1707.09955 [pdf]

    physics.soc-ph astro-ph.IM cs.DL

    Comparing People with Bibliometrics

    Authors: Michael J. Kurtz

    Abstract: Bibliometric indicators, citation counts and/or download counts are increasingly being used to inform personnel decisions such as hiring or promotions. These statistics are very often misused. Here we provide a guide to the factors which should be considered when using these so-called quantitative measures to evaluate people. Rules of thumb are given for when to begin to use bibliometric measures whe…

    Submitted 31 July, 2017; originally announced July 2017.

    Comments: to appear in Proceedings of Library and Information Science in Astronomy VIII (LISA-8)

  21. arXiv:1706.02153 [pdf]

    cs.DL astro-ph.IM cs.CY cs.IR physics.soc-ph

    Usage Bibliometrics as a Tool to Measure Research Activity

    Authors: Edwin A. Henneken, Michael J. Kurtz

    Abstract: Measures for research activity and impact have become an integral ingredient in the assessment of a wide range of entities (individual researchers, organizations, instruments, regions, disciplines). Traditional bibliometric indicators, like publication and citation based indicators, provide an essential part of this picture, but cannot describe the complete picture. Since reading scholarly publica…

    Submitted 7 June, 2017; originally announced June 2017.

    Comments: 25 pages, 11 figures, accepted for publication in Handbook of Quantitative Science and Technology Research, Springer

  22. arXiv:1601.07858 [pdf, ps, other]

    astro-ph.IM cs.DL

    Aggregation and Linking of Observational Metadata in the ADS

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Alexandra Holachek, Jonathan Elliott

    Abstract: We discuss current efforts behind the curation of observing proposals, archive bibliographies, and data links in the NASA Astrophysics Data System (ADS). The primary data in the ADS is the bibliographic content from scholarly articles in Astronomy and Physics, which ADS aggregates from publishers, arXiv and conference proceeding sites. This core bibliographic information is then further enriched b…

    Submitted 28 January, 2016; originally announced January 2016.

    Comments: 4 pages, Proceedings of the ADASS XXV conference

  23. arXiv:1601.01611 [pdf, other]

    cs.IR

    Automatic Construction of Evaluation Sets and Evaluation of Document Similarity Models in Large Scholarly Retrieval Systems

    Authors: Kriste Krstovski, David A. Smith, Michael J. Kurtz

    Abstract: Retrieval systems for scholarly literature offer the ability for the scientific community to search, explore and download scholarly articles across various scientific disciplines. Mostly used by the experts in the particular field, these systems contain user community logs including information on user specific downloaded articles. In this paper we present a novel approach for automatically evalua…

    Submitted 7 January, 2016; originally announced January 2016.

  24. arXiv:1510.09099 [pdf]

    physics.soc-ph astro-ph.IM cs.DL

    Measuring Metrics - A forty year longitudinal cross-validation of citations, downloads, and peer review in Astrophysics

    Authors: Michael J. Kurtz, Edwin A. Henneken

    Abstract: Citation measures, and newer altmetric measures such as downloads are now commonly used to inform personnel decisions. How well do or can these measures measure or predict the past, current or future scholarly performance of an individual? Using data from the Smithsonian/NASA Astrophysics Data System we analyze the publication, citation, download, and distinction histories of a cohort of 922 indiv…

    Submitted 30 October, 2015; originally announced October 2015.

    Comments: Author's version of manuscript accepted for publication in the Journal of the Association for Information Science and Technology (JASIST); 35 pages 16 figures

  25. arXiv:1503.05881 [pdf, other]

    cs.DL

    ADS 2.0: new architecture, API and services

    Authors: Roman Chyla, Alberto Accomazzi, Alexandra Holachek, Carolyn S. Grant, Jonathan Elliott, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray, Vladimir Sudilovsky

    Abstract: The ADS platform is undergoing the biggest rewrite of its 20-year history. While several components have been added to its architecture over the past couple of years, this talk will concentrate on the underpinnings of ADS's search layer and its API. To illustrate the design of the components in the new system, we will show how the new ADS user interface is built exclusively on top of the API using…

    Submitted 19 March, 2015; originally announced March 2015.

    Comments: ADASS Conference 2014

  26. arXiv:1503.04194 [pdf, other]

    astro-ph.IM cs.DL

    ADS: The Next Generation Search Platform

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Roman Chyla, James Luker, Carolyn S. Grant, Donna M. Thompson, Alexandra Holachek, Rahul Dave, Stephen S. Murray

    Abstract: Four years after the last LISA meeting, the NASA Astrophysics Data System (ADS) finds itself in the middle of major changes to the infrastructure and contents of its database. In this paper we highlight a number of features of great importance to librarians and discuss the additional functionality that we are currently developing. Starting in 2011, the ADS started to systematically collect, parse…

    Submitted 13 March, 2015; originally announced March 2015.

    Comments: Submitted to Library and Information Services in Astronomy VII, Naples, Italy

  27. arXiv:1406.4542 [pdf, ps, other]

    cs.DL astro-ph.IM

    Computing and Using Metrics in the ADS

    Authors: Edwin A. Henneken, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Donna Thompson, Jay Luker, Roman Chyla, Alexandra Holachek, Stephen S. Murray

    Abstract: Finding measures for research impact, be it for individuals, institutions, instruments or projects, has gained a lot of popularity. More papers than ever are being written on new impact measures, and problems with existing measures are being pointed out on a regular basis. Funding agencies require impact statistics in their reports, job candidates incorporate them in their resumes, and publication…

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: to appear in proceedings of LISA VII conference, Naples, Italy

  28. arXiv:1209.2124 [pdf, other]

    astro-ph.IM cs.DL physics.soc-ph

    A measure of total research impact independent of time and discipline

    Authors: Alberto Pepe, Michael J. Kurtz

    Abstract: Authorship and citation practices evolve with time and differ by academic discipline. As such, indicators of research productivity based on citation records are naturally subject to historical and disciplinary effects. We observe these effects on a corpus of astronomer career data constructed from a database of refereed publications. We employ a simple mechanism to measure research output using au…

    Submitted 10 September, 2012; originally announced September 2012.

    Comments: 14 pages, 5 figures. PLoS ONE, in press

  29. arXiv:1209.1318 [pdf]

    cs.IR astro-ph.IM cs.DL physics.soc-ph

    Finding and Recommending Scholarly Articles

    Authors: Michael J. Kurtz, Edwin A. Henneken

    Abstract: The rate at which scholarly literature is being produced has been increasing at approximately 3.5 percent per year for decades. This means that during a typical 40 year career the amount of new literature produced each year increases by a factor of four. The methods scholars use to discover relevant literature must change. Just like everybody else involved in information discovery, scholars are co…

    Submitted 6 September, 2012; originally announced September 2012.

    Comments: 14 pages, part of the forthcoming MIT book "Bibliometrics and Beyond: Metrics-Based Evaluation of Scholarly Research" edited by Blaise Cronin and Cassidy R. Sugimoto

  30. arXiv:1209.0125 [pdf, other]

    cs.DL cs.LG stat.ML

    A History of Cluster Analysis Using the Classification Society's Bibliography Over Four Decades

    Authors: Fionn Murtagh, Michael J. Kurtz

    Abstract: The Classification Literature Automated Search Service, an annual bibliography based on citation of one or more of a set of around 80 book or journal publications, ran from 1972 to 2012. We analyze here the years 1994 to 2011. The Classification Society's Service, as it was termed, has been produced by the Classification Society. In earlier decades it was distributed as a diskette or CD with the J…

    Submitted 16 August, 2013; v1 submitted 1 September, 2012; originally announced September 2012.

    Comments: 23 pages, 9 figures

    MSC Class: 62H30 ACM Class: I.5.3; H.3.3

  31. arXiv:1106.5644 [pdf, ps, other]

    astro-ph.IM cs.DL

    The ADS in the Information Age - Impact on Discovery

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi

    Abstract: The SAO/NASA Astrophysics Data System (ADS) grew up with and has been riding the waves of the Information Age, closely monitoring and anticipating the needs of its end-users. By now, all professional astronomers are using the ADS on a daily basis, and a substantial fraction have been using it for their entire professional career. In addition to being an indispensable tool for professional scientis…

    Submitted 28 June, 2011; originally announced June 2011.

    Comments: 10 pages, 5 figures, to appear in "Organizations, People and Strategies in Astronomy (OPSA)", volume 8

  32. arXiv:1102.2891 [pdf]

    cs.DL astro-ph.IM cs.IR physics.soc-ph

    Usage Bibliometrics

    Authors: Michael J. Kurtz, Johan Bollen

    Abstract: Scholarly usage data provides unique opportunities to address the known shortcomings of citation analysis. However, the collection, processing and analysis of usage data remains an area of active research. This article provides a review of the state-of-the-art in usage-based informetric, i.e. the use of usage data to study the scholarly process.

    Submitted 14 February, 2011; originally announced February 2011.

    Comments: Publisher's PDF (by permission). Publisher web site: books.infotoday.com/asist/arist44.shtml

    Journal ref: Annual Review of Information Science and Technology, vol 44, p. 3-64 (2010)

  33. arXiv:1008.0826 [pdf, ps, other]

    physics.soc-ph astro-ph.IM cs.DL cs.IR

    The Emerging Scholarly Brain

    Authors: Michael J. Kurtz

    Abstract: It is now a commonplace observation that human society is becoming a coherent super-organism, and that the information infrastructure forms its emerging brain. Perhaps, as the underlying technologies are likely to become billions of times more powerful than those we have today, we could say that we are now building the lizard brain for the future organism.

    Submitted 4 August, 2010; originally announced August 2010.

    Comments: to appear in Future Professional Communication in Astronomy-II (FPCA-II) editors A. Heck and A. Accomazzi

  34. Finding Your Literature Match -- A Recommender System

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi, Carolyn Grant, Donna Thompson, Elizabeth Bohlen, Giovanni Di Milia, Jay Luker, Stephen S. Murray

    Abstract: The universe of potentially interesting, searchable literature is expanding continuously. Besides the normal expansion, there is an additional influx of literature because of interdisciplinary boundaries becoming more and more diffuse. Hence, the need for accurate, efficient and intelligent search tools is bigger than ever. Even with a sophisticated search engine, looking for information can still…

    Submitted 13 May, 2010; originally announced May 2010.

    Comments: Contribution to the proceedings of the colloquium Future Professional Communication in Astronomy II, 13-14 April 2010, Cambridge, Massachusetts. 11 pages, 4 figures.

  35. arXiv:0912.5235 [pdf, ps, other]

    astro-ph.IM cs.DL cs.IR physics.soc-ph

    Using Multipartite Graphs for Recommendation and Discovery

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin Henneken, Giovanni Di Milia, Carolyn S. Grant

    Abstract: The Smithsonian/NASA Astrophysics Data System exists at the nexus of a dense system of interacting and interlinked information networks. The syntactic and the semantic content of this multipartite graph structure can be combined to provide very specific research recommendations to the scientist/user.

    Submitted 30 December, 2009; originally announced December 2009.

    Comments: To appear in ADASS XIX, ASP Conf Proc

  36. arXiv:0909.4789 [pdf]

    cs.DL physics.soc-ph

    The Bibliometric Properties of Article Readership Information

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Markus Demleitner, Stephen S. Murray, Nathalie Martimbeau, Barbara Elwell

    Abstract: The NASA Astrophysics Data System (ADS), along with astronomy's journals and data centers (a collaboration dubbed URANIA), has developed a distributed on-line digital library which has become the dominant means by which astronomers search, access and read their technical literature. Digital libraries such as the NASA Astrophysics Data System permit the easy accumulation of a new type of bibliome…

    Submitted 25 September, 2009; originally announced September 2009.

    Comments: ADS bibcode: 2005JASIS..56..111K This is the second paper (the first is Worldwide Use and Impact of the NASA Astrophysics Data System Digital Library) from the original article The NASA Astrophysics Data System: Sociology, Bibliometrics, and Impact, which went on-line in the summer of 2003

    Journal ref: The Journal of the American Society for Information Science and Technology, Vol. 56, p. 111 (2005)

  37. arXiv:0909.4786 [pdf]

    cs.DL physics.soc-ph

    Worldwide Use and Impact of the NASA Astrophysics Data System Digital Library

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Markus Demleitner, Stephen S. Murray

    Abstract: By combining data from the text, citation, and reference databases with data from the ADS readership logs we have been able to create Second Order Bibliometric Operators, a customizable class of collaborative filters which permits substantially improved accuracy in literature queries. Using the ADS usage logs along with membership statistics from the International Astronomical Union and data o…

    Submitted 25 September, 2009; originally announced September 2009.

    Comments: ADS bibcode: 2005JASIS..56...36K This is a portion (The bibliometric properties of article readership information is the other part) of the article: The NASA Astrophysics Data System: Sociology, bibliometrics and impact, which went on-line in the summer of 2003

    Journal ref: The Journal of the American Society for Information Science and Technology, Vol. 56, p. 36 (2005)

  38. arXiv:0903.3228  [pdf]

    astro-ph.IM cs.DL

    The Smithsonian/NASA Astrophysics Data System (ADS) Decennial Report

    Authors: Michael J. Kurtz, Alberto Accomazzi, Stephen S. Murray

    Abstract: Eight years after the ADS first appeared the last decadal survey wrote: "NASA's initiative for the Astrophysics Data System has vastly increased the accessibility of the scientific literature for astronomers. NASA deserves credit for this valuable initiative and is urged to continue it." Here we summarize some of the changes concerning the ADS which have occurred in the past ten years, and we de…

    Submitted 18 March, 2009; originally announced March 2009.

    Comments: 6 pages, whitepaper submitted to the National Research Council Astronomy and Astrophysics Decadal Survey

  39. Use of Astronomical Literature - A Report on Usage Patterns

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: In this paper we present a number of metrics for usage of the SAO/NASA Astrophysics Data System (ADS). Since the ADS is used by the entire astronomical community, these are indicative of how the astronomical literature is used. We will show how the use of the ADS has changed both quantitatively and qualitatively. We will also show that different types of users access the system in different ways…

    Submitted 3 October, 2008; v1 submitted 1 August, 2008; originally announced August 2008.

    Comments: 12 pages, 8 figures, 2 tables. Accepted by Journal of Informetrics

  40. arXiv:0709.0896  [pdf]

    cs.DL cs.CY

    Open Access does not increase citations for research articles from The Astrophysical Journal

    Authors: Michael J. Kurtz, Edwin A. Henneken

    Abstract: We demonstrate conclusively that there is no "Open Access Advantage" for papers from the Astrophysical Journal. The two to one citation advantage enjoyed by papers deposited in the arXiv e-print server is due entirely to the nature and timing of the deposited papers. This may have implications for other disciplines.

    Submitted 6 September, 2007; originally announced September 2007.

  41. arXiv:cs/0701035  [pdf, ps, other]

    cs.DL astro-ph

    Finding Astronomical Communities Through Co-readership Analysis

    Authors: Edwin A. Henneken, Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Whenever a large group of people are engaged in an activity, communities will form. The nature of these communities depends on the relationship considered. In the group of people who regularly use scholarly literature, a relationship like "person i and person j have cited the same paper" might reveal communities of people working in a particular field. On this poster, we will investigate the r…

    Submitted 5 January, 2007; originally announced January 2007.

    Comments: poster presented at the 209th AAS Meeting, 7 pages, 4 figures

  42. arXiv:cs/0610030  [pdf, ps, other]

    cs.DL cs.HC

    Paper to Screen: Processing Historical Scans in the ADS

    Authors: Donna M. Thompson, Alberto Accomazzi, Guenther Eichhorn, Carolyn Grant, Edwin Henneken, Michael J. Kurtz, Elizabeth Bohlen, Stephen S. Murray

    Abstract: The NASA Astrophysics Data System in conjunction with the Wolbach Library at the Harvard-Smithsonian Center for Astrophysics is working on a project to microfilm historical observatory publications. The microfilm is then scanned for inclusion in the ADS. The ADS currently contains over 700,000 scanned pages of volumes of historical literature. Many of these volumes lack clear pagination or other…

    Submitted 5 October, 2006; originally announced October 2006.

    Comments: 4 pages; submitted to the proceedings of Library and Information Services in Astronomy; to be published in the ASP Conference Series

  43. arXiv:cs/0610029  [pdf, ps, other]

    cs.DL cs.DB

    Data in the ADS -- Understanding How to Use it Better

    Authors: Carolyn S. Grant, Alberto Accomazzi, Donna Thompson, Edwin Henneken, Guenther Eichhorn, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Smithsonian/NASA ADS Abstract Service contains a wealth of data for astronomers and librarians alike, yet the vast majority of usage consists of rudimentary searches. Hints on how to obtain more focused search results by using more of the various capabilities of the ADS are presented, including searching by affiliation. We also discuss the classification of articles by content and by referee…

    Submitted 5 October, 2006; originally announced October 2006.

    Comments: 4 pages; submitted to the proceedings of the Library and Information Services in Astronomy V; to be published by ASP Conference Proceedings

  44. arXiv:cs/0610011  [pdf, ps, other]

    cs.DL astro-ph cs.DB cs.IR

    Creation and use of Citations in the ADS

    Authors: Alberto Accomazzi, Gunther Eichhorn, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Markus Demleitner, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: With over 20 million records, the ADS citation database is regularly used by researchers and librarians to measure the scientific impact of individuals, groups, and institutions. In addition to the traditional sources of citations, the ADS has recently added references extracted from the arXiv e-prints on a nightly basis. We review the procedures used to harvest and identify the reference data u…

    Submitted 3 October, 2006; originally announced October 2006.

    Comments: 9 pages; to be published in the proceedings of the conference "Library and Information Services V," June 2006, Cambridge, MA, USA

  45. arXiv:cs/0610008  [pdf, ps, other]

    cs.DL astro-ph cs.DB

    Connectivity in the Astronomy Digital Library

    Authors: Günther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Astrophysics Data System (ADS) provides an extensive system of links between the literature and other on-line information. Recently, the journals of the American Astronomical Society (AAS) and a group of NASA data centers have collaborated to provide more links between on-line data obtained by space missions and the on-line journals. Authors can now specify which data sets they have used in…

    Submitted 2 October, 2006; originally announced October 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  46. arXiv:cs/0610007  [pdf, ps, other]

    cs.DL astro-ph cs.DB

    Full Text Searching in the Astrophysics Data System

    Authors: Günther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Smithsonian/NASA Astrophysics Data System (ADS) provides a search system for the astronomy and physics scholarly literature. All major and many smaller astronomy journals that were published on paper have been scanned back to volume 1 and are available through the ADS free of charge. All scanned pages have been converted to text and can be searched through the ADS Full Text Search System. In…

    Submitted 5 October, 2006; v1 submitted 2 October, 2006; originally announced October 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  47. E-prints and Journal Articles in Astronomy: a Productive Co-existence

    Authors: Edwin A. Henneken, Michael J. Kurtz, Simeon Warner, Paul Ginsparg, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Are the e-prints (electronic preprints) from the arXiv repository being used instead of the journal articles? In this paper we show that the e-prints have not undermined the usage of journal papers in the astrophysics community. As soon as the journal article is published, the astronomical community prefers to read the journal article and the use of e-prints through the NASA Astrophysics Data Sy…

    Submitted 22 September, 2006; originally announced September 2006.

    Comments: 8 pages, 4 figures, submitted to Learned Publishing

    Journal ref: Learn.Publ.20:16-22,2007

  48. arXiv:astro-ph/0609794  [pdf, ps, other]

    astro-ph cs.DL

    The Future of Technical Libraries

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Edwin Henneken, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Technical libraries are currently experiencing very rapid change. In the near future their mission will change, their physical nature will change, and the skills of their employees will change. While some will not be able to make these changes, and will fail, others will lead us into a new era.

    Submitted 28 September, 2006; originally announced September 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  49. arXiv:cs/0608027  [pdf, ps, other]

    cs.DL astro-ph

    myADS-arXiv - a Tailor-Made, Open Access, Virtual Journal

    Authors: E. Henneken, M. J. Kurtz, G. Eichhorn, A. Accomazzi, C. S. Grant, D. Thompson, E. Bohlen, S. S. Murray

    Abstract: The myADS-arXiv service provides the scientific community with a one stop shop for staying up-to-date with a researcher's field of interest. The service provides a powerful and unique filter on the enormous amount of bibliographic information added to the ADS on a daily basis. It also provides a complete view with the most relevant papers available in the subscriber's field of interest. With thi…

    Submitted 4 August, 2006; originally announced August 2006.

    Comments: 4 pages, 2 figures, poster paper to appear in the proceedings of the LISA V conference

  50. arXiv:cs/0604061  [pdf]

    cs.DL astro-ph

    Effect of E-printing on Citation Rates in Astronomy and Physics

    Authors: Edwin A. Henneken, Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Donna Thompson, Stephen S. Murray

    Abstract: In this report we examine the change in citation behavior since the introduction of the arXiv e-print repository (Ginsparg, 2001). It has been observed that papers that initially appear as arXiv e-prints get cited more than papers that do not (Lawrence, 2001; Brody et al., 2004; Schwarz & Kennicutt, 2004; Kurtz et al., 2005a, Metcalfe, 2005). Using the citation statistics from the NASA-Smithsoni…

    Submitted 5 June, 2006; v1 submitted 13 April, 2006; originally announced April 2006.

    Comments: Submitted to the Journal of Electronic Publishing. 11 pages with 5 figures