-
Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation
Authors:
Cheng-Yi Li,
Kao-Jung Chang,
Cheng-Fu Yang,
Hsin-Yu Wu,
Wenting Chen,
Hritik Bansal,
Ling Chen,
Yi-Ping Yang,
Yu-Chun Chen,
Shih-Pin Chen,
Jiing-Feng Lirng,
Kai-Wei Chang,
Shih-Hwa Chiou
Abstract:
Multi-modal large language models (MLLMs) have been given free rein to explore exciting medical applications with a primary focus on radiology report generation. Nevertheless, the preliminary success in 2D radiology captioning does not reflect the real-world diagnostic challenge of volumetric 3D anatomy. To mitigate three crucial limitations in the existing literature, namely (1) data complexity, (2) model capacity, and (3) evaluation metric fidelity, we collected a 3D-BrainCT dataset of 18,885 text-scan pairs and applied clinical visual instruction tuning (CVIT) to train BrainGPT models to generate radiology-adherent 3D brain CT reports. Statistically, our BrainGPT scored BLEU-1 = 44.35, BLEU-4 = 20.38, METEOR = 30.13, ROUGE-L = 47.6, and CIDEr-R = 211.77 during internal testing and demonstrated an accuracy of 0.91 in captioning midline shifts on the external validation CQ500 dataset. On further inspection of the captioned reports, we found that the traditional metrics appeared to measure only surface text similarity and failed to gauge the information density relevant to diagnosis. To close this gap, we proposed a novel Feature-Oriented Radiology Task Evaluation (FORTE) to estimate the report's clinical relevance (lesion features and landmarks). Notably, the BrainGPT model scored an average FORTE F1-score of 0.71 (degree=0.661; landmark=0.706; feature=0.693; impression=0.779). To demonstrate that BrainGPT models are objectively ready to generate human-like radiology reports, we conducted a Turing test that enrolled 11 physician evaluators, and around 74% of the BrainGPT-generated captions were indistinguishable from those written by humans. Our work embodies a holistic framework that showcases first-hand experience of curating a 3D brain CT dataset, fine-tuning anatomy-sensible language models, and proposing robust radiology evaluation metrics.
Submitted 2 July, 2024;
originally announced July 2024.
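As a quick arithmetic check on the BrainGPT entry above, the average FORTE F1-score of 0.71 can be reproduced from the four per-category scores quoted in the abstract. This is only a sketch: it assumes the reported average is an unweighted mean over the four categories, which the abstract does not state explicitly, and the variable names are illustrative.

```python
# FORTE F1-scores per category, as quoted in the abstract above.
forte_f1 = {
    "degree": 0.661,
    "landmark": 0.706,
    "feature": 0.693,
    "impression": 0.779,
}

# Assumption: the reported average is an unweighted mean of the four categories.
average_f1 = sum(forte_f1.values()) / len(forte_f1)

# Print the mean rounded to two decimals for comparison with the abstract.
print(f"Average FORTE F1: {average_f1:.2f}")
```

Under that assumption the rounded mean agrees with the 0.71 stated in the abstract.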
-
The Supersonic Project: Early Star Formation with the Streaming Velocity
Authors:
William Lake,
Claire E. Williams,
Smadar Naoz,
Federico Marinacci,
Blakesley Burkhart,
Mark Vogelsberger,
Naoki Yoshida,
Gen Chiaki,
Avi Chen,
Yeou S. Chiou
Abstract:
At high redshifts ($z\gtrsim12$), the relative velocity between baryons and dark matter (the so-called streaming velocity) significantly affects star formation in low-mass objects. Streaming substantially reduces the abundance of low-mass gas objects while simultaneously allowing for the formation of supersonically-induced gas objects (SIGOs) and their associated star clusters outside of dark matter halos. Here, we present a study of the population-level effects of streaming on star formation within both halos and SIGOs in a set of simulations with and without streaming. Notably, we find that streaming actually enhances star formation within individual halos of all masses at redshifts between $z=12$ and $z=20$. This is demonstrated both as an increased star formation rate per object as well as an enhancement of the Kennicutt-Schmidt relation for objects with streaming. We find that our simulations are consistent with some observations at high redshift, but on a population level, they continue to under-predict star formation relative to the majority of observations. Notably, our simulations do not include feedback, and so can be taken as an upper limit on the star formation rate, exacerbating these differences. However, simulations of overdense regions (both with and without streaming) agree with observations, suggesting a strategy for extracting information about the overdensity and streaming velocity in a given survey volume in future observations.
Submitted 2 August, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
Learning to Retrieve for Job Matching
Authors:
Jianqiang Shen,
Yuchin Juan,
Shaobo Zhang,
Ping Liu,
Wen Pu,
Sriram Vasudevan,
Qingquan Song,
Fedor Borisyuk,
Kay Qianqi Shen,
Haichao Wei,
Yunxiang Ren,
Yeou S. Chiou,
Sicong Kuang,
Yuan Yin,
Ben Zheng,
Muchen Wu,
Shaghayegh Gharghabi,
Xiaoqing Wang,
Huichao Xue,
Qi Guo,
Daniel Hewlett,
Luke Simon,
Liangjie Hong,
Wenjing Zhang
Abstract:
Web-scale search systems typically tackle the scalability challenge with a two-step paradigm: retrieval and ranking. The retrieval step, also known as candidate selection, often involves extracting standardized entities, creating an inverted index, and performing term matching for retrieval. Such traditional methods require manual and time-consuming development of query models. In this paper, we discuss applying learning-to-retrieve technology to enhance LinkedIn's job search and recommendation systems. In the realm of promoted jobs, the key objective is to improve the quality of applicants, thereby delivering value to recruiter customers. To achieve this, we leverage confirmed hire data to construct a graph that evaluates a seeker's qualification for a job, and utilize learned links for retrieval. Our learned model is easy to explain, debug, and adjust. On the other hand, the focus for organic jobs is to optimize seeker engagement. We accomplished this by training embeddings for personalized retrieval, fortified by a set of rules derived from the categorization of member feedback. In addition to a solution based on a conventional inverted index, we developed an on-GPU solution capable of supporting both KNN and term matching efficiently.
Submitted 20 February, 2024;
originally announced February 2024.
-
The Supersonic Project: Lighting up the faint end of the JWST UV luminosity function
Authors:
Claire E. Williams,
William Lake,
Smadar Naoz,
Blakesley Burkhart,
Tommaso Treu,
Federico Marinacci,
Yurina Nakazato,
Mark Vogelsberger,
Naoki Yoshida,
Gen Chiaki,
Yeou S. Chiou,
Avi Chen
Abstract:
The James Webb Space Telescope (JWST) is capable of probing extremely early eras of our Universe when the supersonic relative motions between dark matter and baryonic overdensities modulate structure formation ($z \gtrsim 10$). We study low-mass galaxy formation including this "stream velocity" using high resolution AREPO hydrodynamics simulations, and present theoretical predictions of the UV luminosity function (UVLF) and galaxy stellar mass function (GSMF) down to extremely faint and low mass galaxies ($M_{UV} \gtrsim -15$, $10^4 M_\odot \leq M_* \leq 10^8 M_\odot$). We show that, although the stream velocity suppresses early star formation overall, it induces a short period of rapid star formation in some larger dwarfs, leading to an enhancement in the faint-end of the UVLF at $z=12$. We demonstrate that JWST observations are close to this enhanced regime, and propose that the UVLF may constitute an important probe of the stream velocity at high redshift for JWST and future observatories.
Submitted 15 December, 2023; v1 submitted 5 October, 2023;
originally announced October 2023.
-
The Supersonic Project: Star Formation in Early Star Clusters without Dark Matter
Authors:
William Lake,
Smadar Naoz,
Federico Marinacci,
Blakesley Burkhart,
Mark Vogelsberger,
Claire E. Williams,
Yeou S. Chiou,
Gen Chiaki,
Yurina Nakazato,
Naoki Yoshida
Abstract:
The formation mechanism of globular clusters (GCs) has long been debated by astronomers. It was recently proposed that Supersonically Induced Gas Objects (SIGOs), which formed in the early Universe due to the supersonic relative motion of baryons and dark matter at recombination, could be the progenitors of early globular clusters. In order to become GCs, SIGOs must form stars relatively efficiently despite forming outside of dark matter halos. We investigate the potential for star formation in SIGOs using cosmological hydrodynamic simulations, including the aforementioned relative motions of baryons and dark matter, molecular hydrogen cooling in primordial gas clouds, and explicit star formation. We find that SIGOs do form stars and that the nascent star clusters formed through this process are accreted by dark matter halos on short timescales (a few hundreds of Myr). Thus, SIGOs may be found as intact substructures within these halos, analogous to many present-day GCs. From this result, we conclude that SIGOs are capable of forming star clusters with similar properties to globular clusters in the early Universe and we discuss their detectability by upcoming JWST surveys.
Submitted 18 September, 2023; v1 submitted 1 June, 2023;
originally announced June 2023.
-
The Supersonic Project: The eccentricity and rotational support of SIGOs and DM GHOSts
Authors:
Claire E. Williams,
Smadar Naoz,
William Lake,
Yeou S. Chiou,
Blakesley Burkhart,
Federico Marinacci,
Mark Vogelsberger,
Gen Chiaki,
Yurina Nakazato,
Naoki Yoshida
Abstract:
A supersonic relative velocity between dark matter (DM) and baryons (the stream velocity) at the time of recombination induces the formation of low mass objects with anomalous properties in the early Universe. We widen the scope of the 'Supersonic Project' paper series to include objects we term Dark Matter + Gas Halos Offset by Streaming (DM GHOSts): diffuse, DM-enriched structures formed because of a physical offset between the centers of mass of DM and baryonic overdensities. We present an updated numerical investigation of DM GHOSts and Supersonically Induced Gas Objects (SIGOs), including the effects of molecular cooling, in high resolution hydrodynamic simulations using the AREPO code. Supplemented by an analytical understanding of their ellipsoidal gravitational potentials, we study the population-level properties of these objects, characterizing their morphology, spin, radial mass, and velocity distributions in comparison to classical structures in non-streaming regions. The stream velocity causes deviations from sphericity in both the gas and DM components and lends greater rotational support to the gas. Low mass ($\lesssim 10^{5.5}$ M$_\odot$) objects in regions of streaming demonstrate core-like rotation and mass profiles. Anomalies in the rotation and morphology of DM GHOSts could represent an early Universe analogue to observed ultra-faint dwarf galaxies with variations in DM content and unusual rotation curves.
Submitted 13 February, 2023; v1 submitted 3 November, 2022;
originally announced November 2022.
-
Development and Clinical Evaluation of an AI Support Tool for Improving Telemedicine Photo Quality
Authors:
Kailas Vodrahalli,
Justin Ko,
Albert S. Chiou,
Roberto Novoa,
Abubakar Abid,
Michelle Phung,
Kiana Yekrang,
Paige Petrone,
James Zou,
Roxana Daneshjou
Abstract:
Telemedicine utilization was accelerated during the COVID-19 pandemic, and skin conditions were a common use case. However, the quality of photographs sent by patients remains a major limitation. To address this issue, we developed TrueImage 2.0, an artificial intelligence (AI) model for assessing patient photo quality for telemedicine and providing real-time feedback to patients for photo quality improvement. TrueImage 2.0 was trained on 1700 telemedicine images annotated by clinicians for photo quality. On a retrospective dataset of 357 telemedicine images, TrueImage 2.0 effectively identified poor quality images (receiver operating characteristic area under the curve (ROC-AUC) = 0.78) and the reason for poor quality (Blurry ROC-AUC = 0.84, Lighting issues ROC-AUC = 0.70). The performance is consistent across age, gender, and skin tone. Next, we assessed whether patient-TrueImage 2.0 interaction led to an improvement in submitted photo quality through a prospective clinical pilot study with 98 patients. TrueImage 2.0 reduced the number of patients with a poor-quality image by 68.0%.
Submitted 12 September, 2022;
originally announced September 2022.
-
The Supersonic Project: The Early Evolutionary Path of SIGOs
Authors:
William Lake,
Smadar Naoz,
Blakesley Burkhart,
Federico Marinacci,
Mark Vogelsberger,
Gen Chiaki,
Yeou S. Chiou,
Naoki Yoshida,
Yurina Nakazato,
Claire E. Williams
Abstract:
Supersonically Induced Gas Objects (SIGOs) are a class of early Universe objects that have gained attention as a potential formation route for globular clusters. SIGOs have only recently begun to be studied in the context of molecular hydrogen cooling, which is key to characterizing their structure and evolution. Studying the population-level properties of SIGOs with molecular cooling is important for understanding their potential for collapse and star formation, and central for addressing whether SIGOs can survive to the present epoch. Here, we investigate the evolution of SIGOs before they form stars, using a combination of numerical and analytical analysis. For example, we study various timescales important to the evolution of SIGOs at a population level in the presence of molecular cooling. Revising the previous formulation for the critical density of collapse for SIGOs allows us to show that their prolateness tends to act as an inhibiting factor to collapse. We find that simulated SIGOs are limited by artificial two-body relaxation effects that tend to disperse them, an effect of their limited resolution. We expect that SIGOs in nature will be longer-lived compared to our simulations. Further, the fall-back timescale on which SIGOs fall into nearby dark matter halos, potentially producing a globular-cluster-like system, is frequently longer than their cooling timescale and the collapse timescale on which they shrink through gravity. Therefore, some SIGOs have time to cool and collapse outside of halos despite initially failing to exceed the critical density, even without considering metal line cooling. From this analysis we conclude that SIGOs should form stars outside of halos in non-negligible stream velocity patches in the Universe.
Submitted 9 January, 2023; v1 submitted 11 August, 2022;
originally announced August 2022.
-
$\mathrm{H_2}$ cooling and gravitational collapse of supersonically induced gas objects
Authors:
Yurina Nakazato,
Gen Chiaki,
Naoki Yoshida,
Smadar Naoz,
William Lake,
Yeou S. Chiou
Abstract:
We study the formation and gravitational collapse of supersonically induced gas objects (SIGOs) in the early universe. We run cosmological hydrodynamics simulations of SIGOs, including relative streaming motions between baryons and dark matter. Our simulations also follow nonequilibrium chemistry and molecular hydrogen cooling in primordial gas clouds. A number of SIGOs are formed in the run with fast-streaming motions of 2 times the rms of the cosmological velocity fluctuations. We identify a particular gas cloud that condenses by H$_2$ cooling without being hosted by a dark matter halo. The SIGO remains outside the virial radius of its closest halo, and it becomes Jeans unstable when the central gas-particle density reaches $\sim 100~{\rm cm}^{-3}$ with a temperature of $\sim 200$ K. The corresponding Jeans mass is $\sim 10^5 M_{\odot}$, and thus the formation of primordial stars or a star cluster is expected in the SIGO.
Submitted 2 March, 2022; v1 submitted 19 November, 2021;
originally announced November 2021.
-
Regression Modeling for Recurrent Events Using R Package reReg
Authors:
Sy Han Chiou,
Gongjun Xu,
Jun Yan,
Chiung-Yu Huang
Abstract:
Recurrent event analyses have found a wide range of applications in biomedicine, public health, and engineering, among others, where study subjects may experience a sequence of events of interest during follow-up. The R package reReg (Chiou and Huang 2021) offers a comprehensive collection of practical and easy-to-use tools for regression analysis of recurrent events, possibly with the presence of an informative terminal event. The regression framework is a general scale-change model which encompasses the popular Cox-type model, the accelerated rate model, and the accelerated mean model as special cases. Informative censoring is accommodated through a subject-specific frailty without the need for parametric specification. Different regression models are allowed for the recurrent event process and the terminal event. Also included are visualization and simulation tools.
Submitted 20 August, 2022; v1 submitted 23 April, 2021;
originally announced April 2021.
-
The Supersonic Project: SIGOs, a Proposed Progenitor to Globular Clusters, and their Connections to Gravitational Wave Anisotropies
Authors:
William Lake,
Smadar Naoz,
Yeou S. Chiou,
Blakesley Burkhart,
Federico Marinacci,
Mark Vogelsberger,
Kyle Kremer
Abstract:
Supersonically Induced Gas Objects (SIGOs) are structures with little to no dark matter component predicted to exist in regions of the Universe with large relative velocities between baryons and dark matter at the time of recombination. They have been suggested to be the progenitors of present-day globular clusters. Using simulations, SIGOs have been studied on small scales (around 2 Mpc), where these relative velocities are coherent. However, it is challenging to study SIGOs using simulations on large scales due to the varying relative velocities at scales larger than a few Mpc. Here, we study SIGO abundances semi-analytically: using perturbation theory, we predict the number density of SIGOs analytically, and compare these results to small-box numerical simulations. We use the agreement between the numerical and analytic calculations to extrapolate the large-scale variation of SIGO abundances over different stream velocities. As a result, we predict similar large-scale variations of objects with high gas densities before reionization that could possibly be observed by JWST. If indeed SIGOs are progenitors of globular clusters, then we expect a similar variation of globular cluster abundances over large scales. Significantly, we find that the expected number density of SIGOs is consistent with observed globular cluster number densities. As a proof-of-concept, and because globular clusters were proposed to be natural formation sites for gravitational wave sources from binary black hole (BBH) mergers, we show that SIGOs should imprint an anisotropy on the gravitational wave signal on the sky, consistent with SIGOs' distribution.
Submitted 24 August, 2021; v1 submitted 22 April, 2021;
originally announced April 2021.
-
Dynamic Risk Prediction Triggered by Intermediate Events Using Survival Tree Ensembles
Authors:
Yifei Sun,
Sy Han Chiou,
Colin O. Wu,
Meghan McGarry,
Chiung-Yu Huang
Abstract:
With the availability of massive amounts of data from electronic health records and registry databases, incorporating time-varying patient information to improve risk prediction has attracted great attention. To exploit the growing amount of predictor information over time, we develop a unified framework for landmark prediction using survival tree ensembles, where an updated prediction can be performed when new information becomes available. Compared to conventional landmark prediction with fixed landmark times, our methods allow the landmark times to be subject-specific and triggered by an intermediate clinical event. Moreover, the nonparametric approach circumvents the thorny issue of model incompatibility at different landmark times. In our framework, both the longitudinal predictors and the event time outcome are subject to right censoring, and thus existing tree-based approaches cannot be directly applied. To tackle the analytical challenges, we propose a risk-set-based ensemble procedure by averaging martingale estimating equations from individual trees. Extensive simulation studies are conducted to evaluate the performance of our methods. The methods are applied to the Cystic Fibrosis Patient Registry (CFFPR) data to perform dynamic prediction of lung disease in cystic fibrosis patients and to identify important prognosis factors.
Submitted 25 August, 2022; v1 submitted 13 November, 2020;
originally announced November 2020.
-
The Supersonic Project: To cool or not to cool Supersonically Induced Gas Objects (SIGOs)?
Authors:
Yeou S. Chiou,
Smadar Naoz,
Blakesley Burkhart,
Federico Marinacci,
Mark Vogelsberger
Abstract:
Supersonically Induced Gas Objects (SIGOs) primarily form in the early Universe, outside of dark matter halos, due to the presence of a relative stream velocity between baryons and dark matter. These structures may be the progenitors of globular clusters. Since SIGOs are made out of pristine gas, we investigate the effect of atomic cooling on their properties. We run a suite of simulations using the moving-mesh code AREPO, with and without baryon-dark matter relative velocity and with and without the effects of atomic cooling. We show that SIGOs' density, temperature, and prolateness are determined by gravitational interactions rather than cooling. The cold gas fraction in SIGOs is much higher than that of dark matter halos. Specifically, we show that SIGOs' characteristically low temperature and extremely high gas density forge a nurturing ground for the earliest star formation sites.
Submitted 8 November, 2020; v1 submitted 6 August, 2020;
originally announced August 2020.
-
The Supersonic Project: Shining Light on SIGOs - a New Formation Channel for Globular Clusters
Authors:
Yeou S. Chiou,
Smadar Naoz,
Blakesley Burkhart,
Federico Marinacci,
Mark Vogelsberger
Abstract:
Supersonically induced gas objects (SIGOs) with little to no dark matter component are predicted to exist in patches of the Universe with non-negligible relative velocity between baryons and the dark matter at the time of recombination. Using AREPO hydrodynamic simulations we find that the gas densities inside these objects are high enough to allow stars to form. An estimate of the luminosity of the first star clusters formed within these SIGOs suggests that they may be observed at high redshift using future HST and JWST observations. Furthermore, our simulations indicate that SIGOs lie in a distinct place in the luminosity-radius parameter space, which can be used observationally to distinguish SIGOs from dark-matter hosting gas systems. Finally, as a proof-of-concept, we model star formation before reionization and evolve these systems to current times. We find that SIGOs occupy a similar part of the magnitude-radius parameter space as globular clusters. These results suggest that SIGOs may be linked with present-day metal-poor local globular clusters. Since the relative velocity between the baryons and dark matter is coherent over a few Mpc scales, we predict that if this is the dominant mechanism for the formation of globular clusters, their abundance should vary significantly over these scales.
Submitted 24 May, 2019; v1 submitted 18 April, 2019;
originally announced April 2019.
-
ROC-Guided Survival Trees and Ensembles
Authors:
Yifei Sun,
Sy Han Chiou,
Mei-Cheng Wang
Abstract:
Tree-based methods are popular nonparametric tools in studying time-to-event outcomes. In this article, we introduce a novel framework for survival trees and ensembles, where the trees partition the dynamic survivor population and can handle time-dependent covariates. Using the idea of randomized tests, we develop generalized time-dependent Receiver Operating Characteristic (ROC) curves for evaluating the performance of survival trees. The tree-building algorithm is guided by decision-theoretic criteria based on ROC, specifically targeting prediction accuracy. To address the instability issue of a single tree, we propose a novel ensemble procedure based on averaging martingale estimating equations, which is different from existing methods that average the predicted survival or cumulative hazard functions from individual trees. Extensive simulation studies are conducted to examine the performance of the proposed methods. We apply the methods to a study on AIDS for illustration.
Submitted 13 January, 2020; v1 submitted 14 September, 2018;
originally announced September 2018.
-
The Supersonic Project: rotational effects of supersonic motions on the first structures in the Universe
Authors:
Yeou S. Chiou,
Smadar Naoz,
Federico Marinacci,
Mark Vogelsberger
Abstract:
We introduce the "Supersonic Project," aimed at investigating the effects of the supersonic relative velocity between dark matter (DM) and baryons at high redshift using a combination of analytical calculations and cosmological simulations. In this paper, we study the effect of this stream velocity on the angular momentum of the first structures in the early Universe using simulations. We focus on DM haloes and their gas component as well as the recently predicted supersonically-induced gas objects (SIGOs) that arise as a result of the stream velocity phase shift. We find that the spin parameter of the gas component in these first haloes is increased with the stream velocity. Moreover, we find that when the stream velocity is taken into account, the angular momentum vectors of the DM component and the gas component are typically misaligned and this misalignment angle has a nearly isotropic distribution. The spin parameter value of the gas component is higher than in the no stream velocity case, which even in the absence of cooling, may result in more prolate objects. We also generalize the spin parameter to the SIGOs and find that they typically have a larger spin parameter with respect to their dark matter counterparts and that there is no correlation of the spin parameter and the prolateness of such structures. We speculate that SIGOs may be observed as very low luminosity objects in the early Universe and may serve as potential progenitors of Little Blue Dot-like systems.
Submitted 13 September, 2018;
originally announced September 2018.
-
Fast Accelerated Failure Time Modeling for Case-Cohort Data
Authors:
Steven Chiou,
Sangwook Kang,
Jun Yan
Abstract:
Semiparametric accelerated failure time (AFT) models directly relate the predicted failure times to covariates and are a useful alternative to models that work on the hazard function or the survival function. For case-cohort data, much less development has been done with AFT models. In addition to the missing covariates outside of the sub-cohort in controls, the challenges of AFT model inference with the full cohort remain. The regression parameter estimator is hard to compute because the most widely used rank-based estimating equations are not smooth. Further, its variance depends on the unspecified error distribution, and most methods rely on a computationally intensive bootstrap to estimate it. We propose fast rank-based inference procedures for AFT models, applying recent methodological advances to the context of case-cohort data. Parameters are estimated with an induced smoothing approach that smooths the estimating functions and facilitates the numerical solution. Variance estimators are obtained through efficient resampling methods for nonsmooth estimating functions that avoid a full-blown bootstrap. Simulation studies suggest that the recommended procedure provides fast and valid inferences among several competing procedures. Application to a tumor study demonstrates the utility of the proposed method in routine data analysis.
Submitted 1 June, 2012;
originally announced June 2012.
-
Semiparametric Multivariate Accelerated Failure Time Model with Generalized Estimating Equations
Authors:
Steven Chiou,
Junghi Kim,
Jun Yan
Abstract:
The semiparametric accelerated failure time model is not as widely used as the Cox relative risk model mainly due to computational difficulties. Recent developments in least squares estimation and induced smoothing estimating equations provide promising tools to make the accelerated failure time models more attractive in practice. For semiparametric multivariate accelerated failure time models, we propose a generalized estimating equation approach to account for the multivariate dependence through working correlation structures. The marginal error distributions can be either identical as in sequential event settings or different as in parallel event settings. Some regression coefficients can be shared across margins as needed. The initial estimator is a rank-based estimator with Gehan's weight, but obtained from an induced smoothing approach with computational ease. The resulting estimator is consistent and asymptotically normal, with a variance estimated through a multiplier resampling method. In a simulation study, our estimator was up to three times as efficient as the initial estimator, especially with stronger multivariate dependence and heavier censoring. Two real examples demonstrate the utility of the proposed method.
Submitted 1 April, 2012;
originally announced April 2012.