Showing 1–41 of 41 results for author: Zhao, H

Searching in archive q-bio.
  1. arXiv:2408.09896  [pdf, other]

    cs.LG physics.chem-ph q-bio.BM

    Instruction-Based Molecular Graph Generation with Unified Text-Graph Diffusion Model

    Authors: Yuran Xiang, Haiteng Zhao, Chang Ma, Zhi-Hong Deng

    Abstract: Recent advancements in computational chemistry have increasingly focused on synthesizing molecules based on textual instructions. Integrating graph generation with these instructions is complex, leading most current methods to use molecular sequences with pre-trained large language models. In response to this challenge, we propose a novel framework, named…

    Submitted 19 August, 2024; originally announced August 2024.

  2. arXiv:2407.14020  [pdf, other]

    q-bio.NC cs.LG

    NeuroBind: Towards Unified Multimodal Representations for Neural Signals

    Authors: Fengyu Yang, Chao Feng, Daniel Wang, Tianye Wang, Ziyao Zeng, Zhiyang Xu, Hyoungseob Park, Pengliang Ji, Hanbin Zhao, Yuanning Li, Alex Wong

    Abstract: Understanding neural activity and information representation is crucial for advancing knowledge of brain function and cognition. Neural activity, measured through techniques like electrophysiology and neuroimaging, reflects various aspects of information processing. Recent advances in deep neural networks offer new approaches to analyzing these signals using pre-trained models. However, challenges…

    Submitted 19 July, 2024; originally announced July 2024.

  3. arXiv:2407.07930  [pdf]

    q-bio.BM cs.LG

    Token-Mol 1.0: Tokenized drug design with large language model

    Authors: Jike Wang, Rui Qin, Mingyang Wang, Meijing Fang, Yangyang Zhang, Yuchen Zhu, Qun Su, Qiaolin Gou, Chao Shen, Odin Zhang, Zhenxing Wu, Dejun Jiang, Xujun Zhang, Huifeng Zhao, Xiaozhe Wan, Zhourui Wu, Liwei Liu, Yu Kang, Chang-Yu Hsieh, Tingjun Hou

    Abstract: Significant interests have recently risen in leveraging sequence-based large language models (LLMs) for drug design. However, most current applications of LLMs in drug discovery lack the ability to comprehend three-dimensional (3D) structures, thereby limiting their effectiveness in tasks that explicitly involve molecular conformations. In this study, we introduced Token-Mol, a token-only 3D drug…

    Submitted 19 August, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

  4. arXiv:2406.15534  [pdf, other]

    cs.LG cs.AI cs.CL q-bio.QM

    Geneverse: A collection of Open-source Multimodal Large Language Models for Genomic and Proteomic Research

    Authors: Tianyu Liu, Yijia Xiao, Xiao Luo, Hua Xu, W. Jim Zheng, Hongyu Zhao

    Abstract: The applications of large language models (LLMs) are promising for biomedical and healthcare research. Despite the availability of open-source LLMs trained using a wide range of biomedical data, current research on the applications of LLMs to genomics and proteomics is still limited. To fill this gap, we propose a collection of finetuned LLMs and multimodal LLMs (MLLMs), known as Geneverse, for th…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 8 pages

  5. arXiv:2405.06642  [pdf, other]

    q-bio.BM cs.AI cs.LG

    PPFlow: Target-aware Peptide Design with Torsional Flow Matching

    Authors: Haitao Lin, Odin Zhang, Huifeng Zhao, Dejun Jiang, Lirong Wu, Zicheng Liu, Yufei Huang, Stan Z. Li

    Abstract: Therapeutic peptides have proven to have great pharmaceutical value and potential in recent decades. However, methods of AI-assisted peptide drug discovery are not fully explored. To fill the gap, we propose a target-aware peptide design method called PPFlow, based on conditional flow matching on torus manifolds, to model the internal geometries of torsion angles for the peptide structure…

    Submitted 16 June, 2024; v1 submitted 5 March, 2024; originally announced May 2024.

    Comments: 18 pages

  6. arXiv:2404.19230  [pdf]

    q-bio.BM cs.AI

    Deep Lead Optimization: Leveraging Generative AI for Structural Modification

    Authors: Odin Zhang, Haitao Lin, Hui Zhang, Huifeng Zhao, Yufei Huang, Yuansheng Huang, Dejun Jiang, Chang-yu Hsieh, Peichen Pan, Tingjun Hou

    Abstract: The idea of using deep-learning-based molecular generation to accelerate discovery of drug candidates has attracted extraordinary attention, and many deep generative models have been developed for automated drug design, termed molecular generation. In general, molecular generation encompasses two main strategies: de novo design, which generates novel molecular structures from scratch, and lead opt…

    Submitted 29 April, 2024; originally announced April 2024.

  7. arXiv:2404.00014  [pdf]

    physics.chem-ph cs.AI q-bio.BM

    Deep Geometry Handling and Fragment-wise Molecular 3D Graph Generation

    Authors: Odin Zhang, Yufei Huang, Shichen Cheng, Mengyao Yu, Xujun Zhang, Haitao Lin, Yundian Zeng, Mingyang Wang, Zhenxing Wu, Huifeng Zhao, Zaixi Zhang, Chenqing Hua, Yu Kang, Sunliang Cui, Peichen Pan, Chang-Yu Hsieh, Tingjun Hou

    Abstract: Most earlier 3D structure-based molecular generation approaches follow an atom-wise paradigm, incrementally adding atoms to a partially built molecular fragment within protein pockets. These methods, while effective in designing tightly bound ligands, often overlook other essential properties such as synthesizability. The fragment-wise generation paradigm offers a promising solution. However, a co…

    Submitted 15 March, 2024; originally announced April 2024.

  8. arXiv:2312.17670  [pdf, other]

    cs.CV cs.LG q-bio.QM q-bio.TO

    Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA

    Authors: Kaiyuan Yang, Fabio Musio, Yihui Ma, Norman Juchler, Johannes C. Paetzold, Rami Al-Maskari, Luciano Höher, Hongwei Bran Li, Ibrahim Ethem Hamamci, Anjany Sekuboyina, Suprosanna Shit, Houjing Huang, Chinmay Prabhakar, Ezequiel de la Rosa, Diana Waldmannstetter, Florian Kofler, Fernando Navarro, Martin Menten, Ivan Ezhov, Daniel Rueckert, Iris Vos, Ynte Ruigrok, Birgitta Velthuis, Hugo Kuijf, Julien Hämmerli, et al. (59 additional authors not shown)

    Abstract: The Circle of Willis (CoW) is an important network of arteries connecting major circulations of the brain. Its vascular architecture is believed to affect the risk, severity, and clinical outcome of serious neuro-vascular diseases. However, characterizing the highly variable CoW anatomy is still a manual and time-consuming expert task. The CoW is usually imaged by two angiographic imaging modaliti…

    Submitted 29 April, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 24 pages, 11 figures, 9 tables. Summary Paper for the MICCAI TopCoW 2023 Challenge

  9. arXiv:2310.17112  [pdf, other]

    physics.soc-ph q-bio.PE

    Modeling and Analysis of the Epidemic-Behavior Co-evolution Dynamics with User Irrationality

    Authors: Wenxiang Dong, H. Vicky Zhao

    Abstract: During a public health crisis like COVID-19, individuals' adoption of protective behaviors, such as self-isolation and wearing masks, can significantly impact the spread of the disease. In the meanwhile, the spread of the disease can also influence individuals' behavioral choices. Moreover, when facing uncertain losses, individuals' decisions tend to be irrational. Therefore, it is critical to stu…

    Submitted 25 October, 2023; originally announced October 2023.

  10. arXiv:2310.02275  [pdf, other]

    cs.LG q-bio.GN

    MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data

    Authors: Tianyu Liu, Yuge Wang, Rex Ying, Hongyu Zhao

    Abstract: Discovering genes with similar functions across diverse biomedical contexts poses a significant challenge in gene representation learning due to data heterogeneity. In this study, we resolve this problem by introducing a novel model called Multimodal Similarity Learning Graph Neural Network, which combines Multimodal Machine Learning and Deep Graph Neural Networks to learn gene representations fro…

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  11. arXiv:2309.15397  [pdf, ps, other]

    q-bio.NC cond-mat.dis-nn

    Short-Term Postsynaptic Plasticity Facilitates Predictive Tracking in Continuous Attractors

    Authors: Huilin Zhao, Sungchil Yang, Chi Chung Alan Fung

    Abstract: The N-methyl-D-aspartate receptor (NMDAR) is a crucial component of synaptic transmission, and its dysfunction is implicated in many neurological diseases and psychiatric conditions. NMDAR-based short-term postsynaptic plasticity (STPP) is a newly discovered postsynaptic response facilitation mechanism. Our group has suggested that long-lasting glutamate binding of NMDAR allows input information t…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 29 pages, 9 figures

  12. arXiv:2309.08616  [pdf]

    q-bio.QM cs.CE stat.AP

    Introduction of accelerated BOIN design and facilitation of its application

    Authors: Masahiro Kojima, Wu Wende, Henry Zhao

    Abstract: Purpose: During discussions at the Data Science Roundtable meeting in Japan, there were instances where the adoption of the BOIN design was declined, attributed to the extension of study duration and increased sample size in comparison to the 3+3 design. We introduce an accelerated BOIN design aimed at completing a clinical phase I trial at a pace comparable to the 3+3 design. Additionally, we int…

    Submitted 31 August, 2023; originally announced September 2023.

  13. arXiv:2308.02172  [pdf]

    q-bio.BM

    Delete: Deep Lead Optimization Enveloped in Protein Pocket through Unified Deleting Strategies and a Structure-aware Network

    Authors: Haotian Zhang, Huifeng Zhao, Xujun Zhang, Qun Su, Hongyan Du, Chao Shen, Zhe Wang, Dan Li, Peichen Pan, Guangyong Chen, Yu Kang, Chang-yu Hsieh, Tingjun Hou

    Abstract: Drug discovery is a highly complicated process, and it is unfeasible to fully commit it to the recently developed molecular generation methods. Deep learning-based lead optimization takes expert knowledge as a starting point, learning from numerous historical cases about how to modify the structure for better drug-forming properties. However, compared with the more established de novo generation s…

    Submitted 4 August, 2023; originally announced August 2023.

  14. arXiv:2306.13089  [pdf, other]

    cs.LG cs.CL q-bio.BM

    GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning

    Authors: Haiteng Zhao, Shengchao Liu, Chang Ma, Hannan Xu, Jie Fu, Zhi-Hong Deng, Lingpeng Kong, Qi Liu

    Abstract: Molecule property prediction has gained significant attention in recent years. The main bottleneck is the label insufficiency caused by expensive lab experiments. In order to alleviate this issue and to better leverage textual knowledge for tasks, this study investigates the feasibility of employing natural language instructions to accomplish molecule-related tasks in a zero-shot setting. We disco…

    Submitted 22 October, 2023; v1 submitted 28 May, 2023; originally announced June 2023.

  15. arXiv:2306.07812  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Automated 3D Pre-Training for Molecular Property Prediction

    Authors: Xu Wang, Huan Zhao, Weiwei Tu, Quanming Yao

    Abstract: Molecular property prediction is an important problem in drug discovery and materials science. As geometric structures have been demonstrated necessary for molecular property prediction, 3D information has been combined with various graph learning methods to boost prediction performance. However, obtaining the geometric structure of molecules is not feasible in many real-world applications due to…

    Submitted 2 July, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  16. arXiv:2304.10494  [pdf]

    q-bio.BM cs.AI cs.LG

    Infinite Physical Monkey: Do Deep Learning Methods Really Perform Better in Conformation Generation?

    Authors: Haotian Zhang, Jintu Zhang, Huifeng Zhao, Dejun Jiang, Yafeng Deng

    Abstract: Conformation Generation is a fundamental problem in drug discovery and cheminformatics. And organic molecule conformation generation, particularly in vacuum and protein pocket environments, is most relevant to drug design. Recently, with the development of geometric neural networks, the data-driven schemes have been successfully applied in this field, both for molecular conformation generation (in…

    Submitted 7 March, 2023; originally announced April 2023.

  17. arXiv:2302.12563  [pdf, other]

    q-bio.BM cs.LG

    Retrieved Sequence Augmentation for Protein Representation Learning

    Authors: Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Lu, Qi Liu, Lingpeng Kong

    Abstract: Protein language models have excelled in a variety of tasks, ranging from structure prediction to protein engineering. However, proteins are highly diverse in functions and structures, and current state-of-the-art models including the latest version of AlphaFold rely on Multiple Sequence Alignments (MSA) to feed in the evolutionary knowledge. Despite their success, heavy computational overheads, a…

    Submitted 24 February, 2023; originally announced February 2023.

  18. arXiv:2302.11662  [pdf]

    q-bio.GN

    eQTL Studies: from Bulk Tissues to Single Cells

    Authors: Jingfei Zhang, Hongyu Zhao

    Abstract: An expression quantitative trait locus (eQTL) is a chromosomal region where genetic variants are associated with the expression levels of certain genes that can be both nearby or distant. The identifications of eQTLs for different tissues, cell types, and contexts have led to better understanding of the dynamic regulations of gene expressions and implications of functional genes and variants for c…

    Submitted 22 February, 2023; originally announced February 2023.

  19. arXiv:2210.08749  [pdf, other]

    cs.LG q-bio.BM

    A Transformer-based Generative Model for De Novo Molecular Design

    Authors: Wenlu Wang, Ye Wang, Honggang Zhao, Simone Sciabola

    Abstract: In the scope of drug discovery, the molecular design aims to identify novel compounds from the chemical space where the potential drug-like molecules are estimated to be in the order of 10^60 - 10^100. Since this search task is computationally intractable due to the unbounded search space, deep learning draws a lot of attention as a new way of generating unseen molecules. As we seek compounds with…

    Submitted 22 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

  20. arXiv:2201.01855  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Graph Neural Networks for Double-Strand DNA Breaks Prediction

    Authors: XU Wang, Huan Zhao, Weiwei TU, Hao Li, Yu Sun, Xiaochen Bo

    Abstract: Double-strand DNA breaks (DSBs) are a form of DNA damage that can cause abnormal chromosomal rearrangements. Recent technologies based on high-throughput experiments have obvious high costs and technical challenges. Therefore, we design a graph neural network based method to predict DSBs (GraphDSB), using DNA sequence features and chromosome structure information. In order to improve the expression…

    Submitted 4 January, 2022; originally announced January 2022.

  21. arXiv:2111.15411  [pdf]

    q-bio.GN

    EndHiC: assemble large contigs into chromosomal-level scaffolds using the Hi-C links from contig ends

    Authors: Sen Wang, Hengchao Wang, Fan Jiang, Anqi Wang, Hangwei Liu, Hanbo Zhao, Boyuan Yang, Dong Xu, Yan Zhang, Wei Fan

    Abstract: Motivation: The application of PacBio HiFi and ultra-long ONT reads have achieved huge progress in the contig-level assembly, but it is still challenging to assemble large contigs into chromosomes with available Hi-C scaffolding software, which all compute the contact value between contigs using the Hi-C links from the whole contig regions. As the Hi-C links of two adjacent contigs concentrate onl…

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: 25 pages, 1 figure, 6 supplemental figures, and 6 supplemental Tables

  22. arXiv:2101.03784  [pdf]

    q-bio.QM q-bio.MN

    Estimate Metabolite Taxonomy and Structure with a Fragment-Centered Database and Fragment Network

    Authors: Hansen Zhao, Xu Zhao, Huan Yao, Jiaxin Feng, Sichun Zhang, Xinrong Zhang

    Abstract: Metabolite structure identification has become the major bottleneck of the mass spectrometry based metabolomics research. Till now, a number of mass spectra databases and search algorithms have been developed to address this issue. However, two critical problems still exist: the low chemical component record coverage in databases and significant MS/MS spectra variations related to experiment equipme…

    Submitted 11 January, 2021; originally announced January 2021.

  23. arXiv:2011.10955  [pdf, other]

    q-bio.NC math.NA q-bio.QM stat.ML

    Autonomous learning of nonlocal stochastic neuron dynamics

    Authors: Tyler E. Maltba, Hongli Zhao, Daniel M. Tartakovsky

    Abstract: Neuronal dynamics is driven by externally imposed or internally generated random excitations/noise, and is often described by systems of random or stochastic ordinary differential equations. Such systems admit a distribution of solutions, which is (partially) characterized by the single-time joint probability density function (PDF) of system states. It can be used to calculate such information-the…

    Submitted 7 September, 2021; v1 submitted 22 November, 2020; originally announced November 2020.

    Comments: 28 pages, 12 figures, First author: Tyler E. Maltba, Corresponding author: Daniel M. Tartakovsky

    Journal ref: Cogn Neurodyn 16, 683-705 (2022)

  24. arXiv:2007.09524  [pdf, ps, other]

    stat.ML cs.LG math.OC q-bio.GN

    A Manifold Proximal Linear Method for Sparse Spectral Clustering with Application to Single-Cell RNA Sequencing Data Analysis

    Authors: Zhongruo Wang, Bingyuan Liu, Shixiang Chen, Shiqian Ma, Lingzhou Xue, Hongyu Zhao

    Abstract: Spectral clustering is one of the fundamental unsupervised learning methods widely used in data analysis. Sparse spectral clustering (SSC) imposes sparsity to the spectral clustering and it improves the interpretability of the model. This paper considers a widely adopted model for SSC, which can be formulated as an optimization problem over the Stiefel manifold with nonsmooth and nonconvex objecti…

    Submitted 30 October, 2020; v1 submitted 18 July, 2020; originally announced July 2020.

  25. arXiv:2006.08355  [pdf, other]

    physics.soc-ph q-bio.PE

    A Two-Phase Dynamic Contagion Model for COVID-19

    Authors: Zezhun Chen, Angelos Dassios, Valerie Kuan, Jia Wei Lim, Yan Qu, Budhi Surya, Hongbiao Zhao

    Abstract: In this paper, we propose a continuous-time stochastic intensity model, namely, two-phase dynamic contagion process (2P-DCP), for modelling the epidemic contagion of COVID-19 and investigating the lockdown effect based on the dynamic contagion model introduced by Dassios and Zhao (2011). It allows randomness to the infectivity of individuals rather than a constant reproduction number as assumed by…

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 29 pages, 14 figures

    MSC Class: 60G55(Primary); 60J75(Secondary)

  26. arXiv:2005.05549  [pdf, other]

    q-bio.PE econ.GN math.DS

    Staggered Release Policies for COVID-19 Control: Costs and Benefits of Sequentially Relaxing Restrictions by Age

    Authors: Henry Zhao, Zhilan Feng, Carlos Castillo-Chavez, Simon A. Levin

    Abstract: Strong social distancing restrictions have been crucial to controlling the COVID-19 outbreak thus far, and the next question is when and how to relax these restrictions. A sequential timing of relaxing restrictions across groups is explored in order to identify policies that simultaneously reduce health risks and economic stagnation relative to current policies. The goal will be to mitigate health…

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: 22 pages (including Appendix), 10 figures

  27. arXiv:1902.03429  [pdf]

    q-bio.BM cs.LG

    Clustering Bioactive Molecules in 3D Chemical Space with Unsupervised Deep Learning

    Authors: Chu Qin, Ying Tan, Shang Ying Chen, Xian Zeng, Xingxing Qi, Tian Jin, Huan Shi, Yiwei Wan, Yu Chen, Jingfeng Li, Weidong He, Yali Wang, Peng Zhang, Feng Zhu, Hongping Zhao, Yuyang Jiang, Yuzong Chen

    Abstract: Unsupervised clustering has broad applications in data stratification, pattern investigation and new discovery beyond existing knowledge. In particular, clustering of bioactive molecules facilitates chemical space mapping, structure-activity studies, and drug discovery. These tasks, conventionally conducted by similarity-based methods, are complicated by data complexity and diversity. We explored…

    Submitted 9 February, 2019; originally announced February 2019.

  28. arXiv:1803.06393  [pdf, other]

    stat.AP q-bio.TO

    Phylogeny-based tumor subclone identification using a Bayesian feature allocation model

    Authors: Li Zeng, Joshua L. Warren, Hongyu Zhao

    Abstract: Tumor cells acquire different genetic alterations during the course of evolution in cancer patients. As a result of competition and selection, only a few subgroups of cells with distinct genotypes survive. These subgroups of cells are often referred to as subclones. In recent years, many statistical and computational methods have been developed to identify tumor subclones, leading to biologically…

    Submitted 16 March, 2018; originally announced March 2018.

    Comments: 35 pages, 11 figures

  29. arXiv:1803.03910  [pdf, other]

    stat.ML cs.LG q-bio.QM

    A pathway-based kernel boosting method for sample classification using genomic data

    Authors: Li Zeng, Zhaolong Yu, Hongyu Zhao

    Abstract: The analysis of cancer genomic data has long suffered "the curse of dimensionality". Sample sizes for most cancer genomic studies are a few hundreds at most while there are tens of thousands of genomic features studied. Various methods have been proposed to leverage prior biological knowledge, such as pathways, to more effectively analyze cancer genomic data. Most of the methods focus on testing m…

    Submitted 11 March, 2018; originally announced March 2018.

  30. arXiv:1712.04386  [pdf, other]

    q-bio.PE cs.AI cs.CE cs.SI physics.soc-ph

    Hawkes Processes for Invasive Species Modeling and Management

    Authors: Amrita Gupta, Mehrdad Farajtabar, Bistra Dilkina, Hongyuan Zha

    Abstract: The spread of invasive species to new areas threatens the stability of ecosystems and causes major economic losses in agriculture and forestry. We propose a novel approach to minimizing the spread of an invasive species given a limited intervention budget. We first model invasive species propagation using Hawkes processes, and then derive closed-form expressions for characterizing the effect of an…

    Submitted 12 December, 2017; originally announced December 2017.

  31. arXiv:1710.04173  [pdf, other]

    q-bio.NC

    Structural Stability of Lexical Semantic Spaces: Nouns in Chinese and French

    Authors: Sabine Ploux, Rui Wang, ZhengFeng Zhong, Hai Zhao, Yang Xin, Bao-Liang Lu

    Abstract: Many studies in the neurosciences have dealt with the semantic processing of words or categories, but few have looked into the semantic organization of the lexicon thought as a system. The present study was designed to try to move towards this goal, using both electrophysiological and corpus-based data, and to compare two languages from different families: French and Mandarin Chinese. We conduct…

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 17 pages, 4 figures

  32. arXiv:1701.02287  [pdf]

    q-bio.BM cond-mat.soft

    Sedimentation of Reversibly Interacting Macromolecules with Changes in Fluorescence Quantum Yield

    Authors: Sumit K. Chaturvedi, Huaying Zhao, Peter Schuck

    Abstract: Sedimentation velocity analytical ultracentrifugation with fluorescence detection has emerged as a powerful method for the study of interacting systems of macromolecules. It combines picomolar sensitivity with high hydrodynamic resolution, and can be carried out with photoswitchable fluorophores for multi-component discrimination, to determine the stoichiometry, affinity, and shape of macromolecul…

    Submitted 21 February, 2017; v1 submitted 9 January, 2017; originally announced January 2017.

    Comments: 22 pages, 5 figures

  33. 3D-Printing for Analytical Ultracentrifugation

    Authors: Abhiksha Desai, Jonathan Krynitsky, Thomas J. Pohida, Huaying Zhao, Peter Schuck

    Abstract: Analytical ultracentrifugation (AUC) is a classical technique of physical biochemistry providing information on size, shape, and interactions of macromolecules from the analysis of their migration in centrifugal fields while free in solution. A key mechanical element in AUC is the centerpiece, a component of the sample cell assembly that is mounted between the optical windows to allow imaging and…

    Submitted 23 February, 2016; originally announced February 2016.

    Comments: 25 pages, 6 figures

    Journal ref: PLoS One. 2016; 11(8):e0155201 PMID: 27525659

  34. arXiv:1407.8382  [pdf, ps, other]

    stat.AP q-bio.PE

    Detection boundary and Higher Criticism approach for rare and weak genetic effects

    Authors: Zheyang Wu, Yiming Sun, Shiquan He, Judy Cho, Hongyu Zhao, Jiashun Jin

    Abstract: Genome-wide association studies (GWAS) have identified many genetic factors underlying complex human traits. However, these factors have explained only a small fraction of these traits' genetic heritability. It is argued that many more genetic factors remain undiscovered. These genetic factors likely are weakly associated at the population level and sparsely distributed across the genome. In this…

    Submitted 31 July, 2014; originally announced July 2014.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS724 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS724

    Journal ref: Annals of Applied Statistics 2014, Vol. 8, No. 2, 824-851

  35. arXiv:1310.7249  [pdf, other]

    q-bio.GN

    A Spatial Simulation Approach to Account for Protein Structure When Identifying Non-Random Somatic Mutations

    Authors: Gregory Ryslik, Yuwei Cheng, Kei-Hoi Cheung, Robert Bjornson, Daniel Zelterman, Yorgo Modis, Hongyu Zhao

    Abstract: Background: Current research suggests that a small set of "driver" mutations are responsible for tumorigenesis while a larger body of "passenger" mutations occurs in the tumor but does not progress the disease. Due to recent pharmacological successes in treating cancers caused by driver mutations, a variety of methodologies that attempt to identify such mutations have been developed. Based on t…

    Submitted 28 October, 2013; v1 submitted 27 October, 2013; originally announced October 2013.

    Comments: 16 pages, 8 Figures, 4 Tables

  36. arXiv:1309.5337  [pdf, other]

    q-bio.GN q-bio.QM stat.AP

    Change Point Analysis of Histone Modifications Reveals Epigenetic Blocks Linking to Physical Domains

    Authors: Mengjie Chen, Haifan Lin, Hongyu Zhao

    Abstract: Histone modification is a vital epigenetic mechanism for transcriptional control in eukaryotes. High-throughput techniques have enabled whole-genome analysis of histone modifications in recent years. However, most studies assume one combination of histone modification invariantly translates to one transcriptional output regardless of local chromatin environment. In this study we hypothesize that,…

    Submitted 9 May, 2014; v1 submitted 20 September, 2013; originally announced September 2013.

    Comments: 23 pages, 6 figures

  37. arXiv:1307.8229  [pdf, other]

    stat.ML math.ST q-bio.QM stat.AP

    Posterior Contraction Rates of the Phylogenetic Indian Buffet Processes

    Authors: Mengjie Chen, Chao Gao, Hongyu Zhao

    Abstract: By expressing prior distributions as general stochastic processes, nonparametric Bayesian methods provide a flexible way to incorporate prior knowledge and constrain the latent structure in statistical inference. The Indian buffet process (IBP) is such an example that can be used to define a prior distribution on infinite binary features, where the exchangeability among subjects is assumed. The ph…

    Submitted 19 May, 2015; v1 submitted 31 July, 2013; originally announced July 2013.

  38. arXiv:1304.7417  [pdf, other]

    q-bio.GN q-bio.QM stat.AP

    Improving genetic risk prediction by leveraging pleiotropy

    Authors: Cong Li, Can Yang, Joel Gelernter, Hongyu Zhao

    Abstract: An important task of human genetics studies is to accurately predict disease risks in individuals based on genetic markers, which allows for identifying individuals at high disease risks, and facilitating their disease treatment and prevention. Although hundreds of genome-wide association studies (GWAS) have been conducted on many complex human traits in recent years, there has been only limited s…

    Submitted 19 August, 2013; v1 submitted 27 April, 2013; originally announced April 2013.

  39. arXiv:1303.5889  [pdf, other]

    q-bio.GN

    A Graph Theoretic Approach to Utilizing Protein Structure to Identify Non-Random Somatic Mutations

    Authors: Gregory Ryslik, Yuwei Cheng, Kei-Hoi Cheung, Yorgo Modis, Hongyu Zhao

    Abstract: Background: It is well known that the development of cancer is caused by the accumulation of somatic mutations within the genome. For oncogenes specifically, current research suggests that there is a small set of "driver" mutations that are primarily responsible for tumorigenesis. Further, due to some recent pharmacological successes in treating these driver mutations and their resulting tumors, a…

    Submitted 12 July, 2013; v1 submitted 23 March, 2013; originally announced March 2013.

    Comments: 25 pages, 8 figures, 3 Tables

  40. Utilizing Protein Structure to Identify Non-Random Somatic Mutations

    Authors: Gregory Ryslik, Yuwei Cheng, Kei-Hoi Cheung, Yorgo Modis, Hongyu Zhao

    Abstract: Motivation: Human cancer is caused by the accumulation of somatic mutations in tumor suppressors and oncogenes within the genome. In the case of oncogenes, recent theory suggests that there are only a few key "driver" mutations responsible for tumorigenesis. As there have been significant pharmacological successes in developing drugs that treat cancers that carry these driver mutations, several me…

    Submitted 27 February, 2013; originally announced February 2013.

  41. arXiv:0810.1968  [pdf, other]

    physics.flu-dyn q-bio.TO

    A Simulation of Blood Cells in Branching Capillaries

    Authors: Amir H. G. Isfahani, Hong Zhao, Jonathan B. Freund

    Abstract: The multi-cellular hydrodynamic interactions play a critical role in the phenomenology of blood flow in the microcirculation. A fast algorithm has been developed to simulate large numbers of cells modeled as elastic thin membranes. For red blood cells, which are the dominant component in blood, the membrane has strong resistance to surface dilatation but is flexible in bending. Our numerical met…

    Submitted 10 October, 2008; originally announced October 2008.