-
From Data Dump to Digestible Chunks: Automated Segmentation and Summarization of Provenance Logs for Communication
Authors:
Jeremy E. Block,
Donald Honeycutt,
Brett Benda,
Benjamin Rheault,
Eric D. Ragan
Abstract:
Communicating one's sensemaking during a complex analysis session to explain thought processes is hard, yet most intelligence occurs in collaborative settings. Team members require a deeper understanding of the work being completed by their peers and subordinates, but little research has fully articulated best practices for analytic provenance consumers. This work proposes an automatic summarizati…
▽ More
Communicating one's sensemaking during a complex analysis session to explain thought processes is hard, yet most intelligence occurs in collaborative settings. Team members require a deeper understanding of the work being completed by their peers and subordinates, but little research has fully articulated best practices for analytic provenance consumers. This work proposes an automatic summarization technique that separates an analysis session and summarizes interaction provenance as textual blurbs to allow for meta-analysis of work done. Focusing on the domain of intelligence analysis, we demonstrate our segmentation technique using five datasets, including both publicly available and classified interaction logs. We shared our demonstration with a notoriously inaccessible population of expert reviewers with experience as United States Department of Defense analysts. Our findings indicate that the proposed pipeline effectively generates cards that display key events from interaction logs, facilitating the sharing of analysis progress. Yet, we also hear that there is a need for more prominent justifications and pattern elicitation controls to communicate analysis summaries more effectively. The expert review highlights the potential of automated approaches in addressing the challenges of provenance information in complex domains. We'd like to emphasize the need for further research into provenance communication in other domains.
A free copy of this paper and all supplemental materials are available at https://osf.io/j4bxt
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
ODIN: Improved Narrowband Ly$α$ Emitter Selection Techniques for $z$ = 2.4, 3.1, and 4.5
Authors:
Nicole M. Firestone,
Eric Gawiser,
Vandana Ramakrishnan,
Kyoung-Soo Lee,
Francisco Valdes,
Changbom Park,
Yujin Yang,
Robin Ciardullo,
María Celeste Artale,
Barbara Benda,
Adam Broussard,
Lana Eid,
Rameen Farooq,
Caryl Gronwall,
Lucia Guaita,
Stephen Gwyn,
Ho Seong Hwang,
Sang Hyeok Im,
Woong-Seob Jeong,
Shreya Karthikeyan,
Dustin Lang,
Byeongha Moon,
Nelson Padilla,
Marcin Sawicki,
Eunsuk Seo
, et al. (3 additional authors not shown)
Abstract:
Lyman-Alpha Emitting galaxies (LAEs) are typically young, low-mass, star-forming galaxies with little extinction from interstellar dust. Their low dust attenuation allows their Ly$α$ emission to shine brightly in spectroscopic and photometric observations, providing an observational window into the high-redshift universe. Narrowband surveys reveal large, uniform samples of LAEs at specific redshif…
▽ More
Lyman-Alpha Emitting galaxies (LAEs) are typically young, low-mass, star-forming galaxies with little extinction from interstellar dust. Their low dust attenuation allows their Ly$α$ emission to shine brightly in spectroscopic and photometric observations, providing an observational window into the high-redshift universe. Narrowband surveys reveal large, uniform samples of LAEs at specific redshifts that probe large scale structure and the temporal evolution of galaxy properties. The One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) utilizes three custom-made narrowband filters on the Dark Energy Camera (DECam) to discover LAEs at three equally spaced periods in cosmological history. In this paper, we introduce the hybrid-weighted double-broadband continuum estimation technique, which yields improved estimation of Ly$α$ equivalent widths. Using this method, we discover 6339, 6056, and 4225 LAE candidates at $z =$ 2.4, 3.1, and 4.5 in the extended COSMOS field ($\sim$9 deg$^2$). We find that [O II] emitters are a minimal contaminant in our LAE samples, but that interloping Green Pea-like [O III] emitters are important for our redshift 4.5 sample. We introduce an innovative method for identifying [O II] and [O III] emitters via a combination of narrowband excess and galaxy colors, enabling their study as separate classes of objects. We present scaled median stacked SEDs for each galaxy sample, revealing the overall success of our selection methods. We also calculate rest-frame Ly$α$ equivalent widths for our LAE samples and find that the EW distributions are best fit by exponential functions with scale lengths of $w_0$ = 55 $\pm$ 1, 65 $\pm$ 1, and 62 $\pm$ 1 Angstroms, respectively.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
The One-hundred-deg^2 DECam Imaging in Narrowbands (ODIN): Survey Design and Science Goals
Authors:
Kyoung-Soo Lee,
Eric Gawiser,
Changbom Park,
Yujin Yang,
Francisco Valdes,
Dustin Lang,
Vandana Ramakrishnan,
Byeongha Moon,
Nicole Firestone,
Stephen Appleby,
Maria Celeste Artale,
Moira Andrews,
Franz E. Bauer,
Barbara Benda,
Adam Broussard,
Yi-Kuan Chiang,
Robin Ciardullo,
Arjun Dey,
Rameen Farooq,
Caryl Gronwall,
Lucia Guaita,
Yun Huang,
Ho Seong Hwang,
Sanghyeok Im,
Woong-Seob Jeong
, et al. (17 additional authors not shown)
Abstract:
We describe the survey design and science goals for ODIN (One-hundred-deg^2 DECam Imaging in Narrowbands), a NOIRLab survey using the Dark Energy Camera (DECam) to obtain deep (AB~25.7) narrow-band images over an unprecedented area of sky. The three custom-built narrow-band filters, N419, N501, and N673, have central wavelengths of 419, 501, and 673 nm and respective full-widthat-half-maxima of 7.…
▽ More
We describe the survey design and science goals for ODIN (One-hundred-deg^2 DECam Imaging in Narrowbands), a NOIRLab survey using the Dark Energy Camera (DECam) to obtain deep (AB~25.7) narrow-band images over an unprecedented area of sky. The three custom-built narrow-band filters, N419, N501, and N673, have central wavelengths of 419, 501, and 673 nm and respective full-widthat-half-maxima of 7.2, 7.4, and 9.8 nm, corresponding to Lya at z=2.4, 3.1, and 4.5 and cosmic times of 2.8, 2.1, and 1.4 Gyr, respectively. When combined with even deeper, public broad-band data from Hyper Suprime-Cam, DECam, and in the future, LSST, the ODIN narrow-band images will enable the selection of over 100,000 Lya-emitting (LAE) galaxies at these epochs. ODIN-selected LAEs will identify protoclusters as galaxy overdensities, and the deep narrow-band images enable detection of highly extended Lya blobs (LABs). Primary science goals include measuring the clustering strength and dark matter halo connection of LAEs, LABs, and protoclusters, and their respective relationship to filaments in the cosmic web. The three epochs allow the redshift evolution of these properties to be determined during the period known as Cosmic Noon, where star formation was at its peak. The two narrow-band filter wavelengths are designed to enable interloper rejection and further scientific studies by revealing [O II] and [O III] at z=0.34, Lya and He II 1640 at z=3.1, and Lyman continuum plus Lya at z=4.5. Ancillary science includes similar studies of the lower-redshift emission-line galaxy samples and investigations of nearby star-forming galaxies resolved into numerous [O III] and [S II] emitting regions.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.
-
HETDEX Public Source Catalog 1: 220K Sources Including Over 50K Lyman Alpha Emitters from an Untargeted Wide-area Spectroscopic Survey
Authors:
Erin Mentuch Cooper,
Karl Gebhardt,
Dustin Davis,
Daniel J. Farrow,
Chenxu Liu,
Gregory Zeimann,
Robin Ciardullo,
John J. Feldmeier,
Niv Drory,
Donghui Jeong,
Barbara Benda,
William P. Bowman,
Michael Boylan-Kolchin,
Oscar A. Chavez Ortiz,
Maya H. Debski,
Mona Dentler,
Maximilian Fabricius,
Rameen Farooq,
Steven L. Finkelstein,
Eric Gawiser,
Caryl Gronwall,
Gary J. Hill,
Ulrich Hopp,
Lindsay R. House,
Steven Janowiecki
, et al. (21 additional authors not shown)
Abstract:
We present the first publicly released catalog of sources obtained from the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX). HETDEX is an integral field spectroscopic survey designed to measure the Hubble expansion parameter and angular diameter distance at 1.88<z<3.52 by using the spatial distribution of more than a million Ly-alpha-emitting galaxies over a total target area of 540 deg^2.…
▽ More
We present the first publicly released catalog of sources obtained from the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX). HETDEX is an integral field spectroscopic survey designed to measure the Hubble expansion parameter and angular diameter distance at 1.88<z<3.52 by using the spatial distribution of more than a million Ly-alpha-emitting galaxies over a total target area of 540 deg^2. The catalog comes from contiguous fiber spectra coverage of 25 deg^2 of sky from January 2017 through June 2020, where object detection is performed through two complementary detection methods: one designed to search for line emission and the other a search for continuum emission. The HETDEX public release catalog is dominated by emission-line galaxies and includes 51,863 Lyα-emitting galaxy (LAE) identifications and 123,891 OII-emitting galaxies at z<0.5. Also included in the catalog are 37,916 stars, 5274 low-redshift (z<0.5) galaxies without emission lines, and 4976 active galactic nuclei. The catalog provides sky coordinates, redshifts, line identifications, classification information, line fluxes, OII and Ly-alpha line luminosities where applicable, and spectra for all identified sources processed by the HETDEX detection pipeline. Extensive testing demonstrates that HETDEX redshifts agree to within deltaz < 0.02, 96.1% of the time to those in external spectroscopic catalogs. We measure the photometric counterpart fraction in deep ancillary Hyper Suprime-Cam imaging and find that only 55.5% of the LAE sample has an r-band continuum counterpart down to a limiting magnitude of r~26.2 mag (AB) indicating that an LAE search of similar sensitivity with photometric pre-selection would miss nearly half of the HETDEX LAE catalog sample. Data access and details about the catalog can be found online at http://hetdex.org/.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.