Hierarchical Dependency Constrained Tree Augmented Naive Bayes Classifiers for Hierarchical Feature Spaces

Abstract: The Tree Augmented Naive Bayes (TAN) classifier is a type of probabilistic graphical model that constructs a single-parent dependency tree to estimate the distribution of the data. In this work, we propose two novel Hierarchical dependency-based Tree Augmented Naive Bayes algorithms, i.e. Hie-TAN and Hie-TAN-Lite. Both methods exploit the pre-defined parent-child (generalisation-specialisation) relationships between features as a type of constraint to learn the tree representation of dependencies among features, whilst the latter further eliminates the hierarchical redundancy during the classifier learning stage. The experimental results showed that Hie-TAN successfully obtained better predictive performance than several other hierarchical dependency constrained classification algorithms, and its predictive performance was further improved by eliminating the hierarchical redundancy, as suggested by the higher accuracy obtained by Hie-TAN-Lite.

Submitted 8 February, 2022; originally announced February 2022.

arXiv:2112.05045 [pdf, other]

Multi-Kink Quantile Regression for Longitudinal Data with Application to the Progesterone Data Analysis

Authors: Chuang Wan, Wei Zhong, Wenyang Zhang, Changliang Zou

Abstract: Motivated by investigating the relationship between progesterone and the days in a menstrual cycle in a longitudinal study, we propose a multi-kink quantile regression model for longitudinal data analysis. It relaxes the linearity condition and assumes different regression forms in different regions of the domain of the threshold covariate. In this paper, we first propose a multi-kink quantile regression for longitudinal data. Two estimation procedures are proposed to estimate the regression coefficients and the kink point locations: one is a computationally efficient profile estimator under the working independence framework, while the other accounts for the within-subject correlations by using the unbiased generalized estimation equation approach. The selection consistency of the number of kink points and the asymptotic normality of the two proposed estimators are established. Secondly, we construct a rank score test based on partial subgradients for the existence of a kink effect in longitudinal studies. Both the null distribution and the local alternative distribution of the test statistic have been derived. Simulation studies show that the proposed methods have excellent finite sample performance. In the application to the longitudinal progesterone data, we identify two kink points in the progesterone curves over different quantiles and observe that the progesterone level remains stable before the day of ovulation, then increases quickly in the five to six days after ovulation, and then becomes stable again or even drops slightly.

Submitted 9 December, 2021; originally announced December 2021.

Comments: 22 pages; 3 figures

arXiv:2109.00539 [pdf, other]

Spatially and Robustly Hybrid Mixture Regression Model for Inference of Spatial Dependence

Authors: Wennan Chang, Pengtao Dang, Changlin Wan, Xiaoyu Lu, Yue Fang, Tong Zhao, Yong Zang, Bo Li, Chi Zhang, Sha Cao

Abstract: In this paper, we propose a Spatial Robust Mixture Regression model to investigate the relationship between a response variable and a set of explanatory variables over the spatial domain, assuming that the relationships may exhibit complex spatially dynamic patterns that cannot be captured by constant regression coefficients. Our method integrates the robust finite mixture Gaussian regression model with spatial constraints, to simultaneously handle the spatial nonstationarity, local homogeneity, and outlier contaminations. Compared with existing spatial regression models, our proposed model assumes the existence of a few distinct regression models that are estimated based on observations that exhibit similar response-predictor relationships. As such, the proposed model not only accounts for nonstationarity in the spatial trend, but also clusters observations into a few distinct and homogeneous groups. This provides an advantage in interpretation, with a few stationary sub-processes identified that capture the predominant relationships between response and predictor variables. Moreover, the proposed method incorporates robust procedures to handle contaminations from both regression outliers and spatial outliers. By doing so, we robustly segment the spatial domain into distinct local regions with similar regression coefficients, and sporadic locations that are purely outliers. A rigorous statistical hypothesis testing procedure has been designed to test the significance of such segmentation. Experimental results on many synthetic and real-world datasets demonstrate the robustness, accuracy, and effectiveness of our proposed method, compared with other robust finite mixture regression, spatial regression and spatial segmentation methods.

Submitted 28 September, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

Comments: Accepted by ICDM IEEE 2021

arXiv:2009.02305 [pdf, other]

Composite Estimation for Quantile Regression Kink Models with Longitudinal Data

Authors: Chuang Wan

Abstract: The kink model is developed to analyze data where the regression function is two-stage linear but intersects at an unknown threshold. In quantile regression with longitudinal data, previous work assumed that the unknown threshold parameters or kink points are heterogeneous across different quantiles. However, the location where the kink effect happens tends to be the same across different quantiles, especially in a region of neighboring quantile levels. Ignoring such homogeneity information may lead to efficiency loss for estimation. In view of this, we propose a composite estimator for the common kink point by absorbing information from multiple quantiles. In addition, we also develop a sup-likelihood-ratio test to check the kink effect at a given quantile level. A test-inversion confidence interval for the common kink point is also developed based on the quantile rank score test. The simulation study shows that the proposed composite kink estimator is more competitive than the least squares estimator and the single quantile estimator. We illustrate the practical value of this work through the analysis of a body mass index and blood pressure data set.

Submitted 4 September, 2020; originally announced September 2020.

arXiv:2008.06635 [pdf, other]

Orthogonalized SGD and Nested Architectures for Anytime Neural Networks

Authors: Chengcheng Wan, Henry Hoffmann, Shan Lu, Michael Maire

Abstract: We propose a novel variant of SGD customized for training network architectures that support anytime behavior: such networks produce a series of increasingly accurate outputs over time. Efficient architectural designs for these networks focus on re-using internal state; subnetworks must produce representations relevant for both immediate prediction and refinement by subsequent network stages. We consider traditional branched networks as well as a new class of recursively nested networks. Our new optimizer, Orthogonalized SGD, dynamically re-balances task-specific gradients when training a multitask network. In the context of anytime architectures, this optimizer projects gradients from later outputs onto a parameter subspace that does not interfere with those from earlier outputs. Experiments demonstrate that training with Orthogonalized SGD significantly improves generalization accuracy of anytime networks.

Submitted 14 August, 2020; originally announced August 2020.

Comments: ICML 2020
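
The gradient projection mentioned in the Orthogonalized SGD abstract above can be illustrated with a minimal sketch. This is not the paper's optimizer: it assumes just two gradient vectors and removes from the later output's gradient its component along the earlier output's gradient. The function names `dot` and `project_out` are hypothetical and chosen only for this illustration.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def project_out(grad_later, grad_earlier):
    """Return grad_later with its component along grad_earlier removed.

    Assumption for this sketch: a single earlier gradient spans the
    subspace that must not be disturbed.
    """
    denom = dot(grad_earlier, grad_earlier)
    if denom == 0.0:
        # Nothing to protect: the earlier gradient is zero.
        return list(grad_later)
    scale = dot(grad_later, grad_earlier) / denom
    return [g - scale * e for g, e in zip(grad_later, grad_earlier)]


g_early = [1.0, 0.0]
g_late = [1.0, 1.0]
g_late_projected = project_out(g_late, g_early)
# The projected gradient is orthogonal to the earlier one.
print(g_late_projected, dot(g_late_projected, g_early))
```

After projection, a step along the later gradient leaves the first-order change in the earlier objective at zero, which is the non-interference idea the abstract describes.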

arXiv:2007.15821 [pdf, other]

Geometric All-Way Boolean Tensor Decomposition

Authors: Changlin Wan, Wennan Chang, Tong Zhao, Sha Cao, Chi Zhang

Abstract: Boolean tensors have been broadly utilized in representing high dimensional logical data collected on spatial, temporal and/or other relational domains. Boolean Tensor Decomposition (BTD) factorizes a binary tensor into the Boolean sum of multiple rank-1 tensors, which is an NP-hard problem. Existing BTD methods have been limited by their high computational cost in applications to large scale or higher order tensors. In this work, we presented a computationally efficient BTD algorithm, namely Geometric Expansion for all-order Tensor Factorization (GETF), that sequentially identifies the rank-1 basis components for a tensor from a geometric perspective. We conducted rigorous theoretical analysis on the validity as well as algorithmic efficiency of GETF in decomposing all-order tensors. Experiments on both synthetic and real-world data demonstrated that GETF has significantly improved performance in reconstruction accuracy and extraction of latent structures, and is an order of magnitude faster than other state-of-the-art methods.

Submitted 26 October, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

Comments: NeurIPS 2020

arXiv:2007.15816 [pdf, other]

Denoising individual bias for a fairer binary submatrix detection

Authors: Changlin Wan, Wennan Chang, Tong Zhao, Sha Cao, Chi Zhang

Abstract: Low rank representation of a binary matrix is powerful in disentangling sparse individual-attribute associations, and has been widely applied. Existing binary matrix factorization (BMF) or co-clustering (CC) methods often assume i.i.d. background noise. However, this assumption could be easily violated in real data, where heterogeneous row- or column-wise probability of binary entries results in disparate element-wise background distribution, and undermines the rationale of existing methods. We propose a binary data denoising framework, namely BIND, which optimizes the detection of true patterns by estimating the row- or column-wise mixture distribution of patterns and disparate background, and eliminating the binary attributes that are more likely from the background. BIND is supported by thoroughly derived mathematical properties of the row- and column-wise mixture distributions. Our experiments on synthetic and real-world data demonstrated that BIND effectively removes background noise and drastically increases the fairness and accuracy of state-of-the-art BMF and CC methods.

Submitted 9 August, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

Comments: Accepted at CIKM 2020

arXiv:2007.09720 [pdf, ps, other]

doi 10.1093/bib/bbaa291

Supervised clustering of high dimensional data using regularized mixture modeling

Authors: Wennan Chang, Changlin Wan, Yong Zang, Chi Zhang, Sha Cao

Abstract: Identifying relationships between molecular variations and their clinical presentations has been challenged by the heterogeneous causes of a disease. It is imperative to unveil the relationship between the high dimensional molecular manifestations and the clinical presentations, while taking into account the possible heterogeneity of the study subjects. We proposed a novel supervised clustering algorithm using a penalized mixture regression model, called CSMR, to deal with the challenges in studying the heterogeneous relationships between high dimensional molecular features and a phenotype. The algorithm was adapted from the classification expectation maximization algorithm, which offers a novel supervised solution to the clustering problem, with substantial improvement on both the computational efficiency and biological interpretability. Experimental evaluation on simulated benchmark datasets demonstrated that CSMR can accurately identify the subspaces on which subsets of features are explanatory to the response variables, and that it outperformed the baseline methods. Application of CSMR on a drug sensitivity dataset again demonstrated the superior performance of CSMR over the others, where CSMR is powerful in recapitulating the distinct subgroups hidden in the pool of cell lines with regard to their coping mechanisms to different drugs. CSMR represents a big data analysis tool with the potential to resolve the complexity of translating the clinical manifestations of the disease to the real causes underpinning it. We believe that it will bring new understanding to the molecular basis of a disease, and could be of special relevance in the growing field of personalized medicine.

Submitted 19 July, 2020; originally announced July 2020.

arXiv:2006.09977 [pdf, other]

A novel sentence embedding based topic detection method for micro-blog

Authors: Cong Wan, Shan Jiang, Cuirong Wang, Cong Wang, Changming Xu, Xianxia Chen, Ying Yuan

Abstract: Topic detection is a challenging task, especially without knowing the exact number of topics. In this paper, we present a novel approach based on a neural network to detect topics in a micro-blogging dataset. We use an unsupervised neural sentence embedding model to map the blogs to an embedding space. Our model is a weighted power mean word embedding model, and the weights are calculated by an attention mechanism. Experimental results show that our embedding method performs better than baselines in sentence clustering. In addition, we propose an improved clustering algorithm referred to as relationship-aware DBSCAN (RADBSCAN). It can discover topics from a micro-blogging dataset, and the number of topics depends on the characteristics of the dataset itself. Moreover, in order to solve the problem of parameter sensitivity, we take the blog forwarding relationship as a bridge between two independent clusters. Finally, we validate our approach on a dataset from Sina micro-blog. The results show that we can detect all the topics successfully and extract keywords in each topic.

Submitted 10 June, 2020; originally announced June 2020.

arXiv:2006.07924 [pdf, other]

Estimation and Inference for Multi-Kink Quantile Regression

Authors: Wei Zhong, Chuang Wan, Wenyang Zhang

Abstract: The Multi-Kink Quantile Regression (MKQR) model is an important tool for analyzing data with heterogeneous conditional distributions, especially when quantiles of the response variable are of interest, due to its robustness to outliers and heavy-tailed errors in the response. It assumes different linear quantile regression forms in different regions of the domain of the threshold covariate that are still continuous at the kink points. In this paper, we investigate parameter estimation, kink point detection and statistical inference in MKQR models. We propose an iterative segmented quantile regression algorithm for estimating both the regression coefficients and the locations of kink points. The proposed algorithm is much more computationally efficient than the grid search algorithm and not sensitive to the selection of initial values. Asymptotic properties, such as selection consistency of the number of kink points and asymptotic normality of the estimators of both regression coefficients and kink effects, are established to justify the proposed method theoretically. A score test, based on partial subgradients, is developed to verify whether the kink effects exist or not. Test-inversion confidence intervals for kink location parameters are also constructed. Intensive simulation studies show that the proposed methods work very well when the sample size is finite. Finally, we apply the MKQR models together with the proposed methods to the dataset about secondary industrial structure of China and the dataset about triceps skinfold thickness of Gambian females, which leads to some very interesting findings. A new R package MultiKink is developed to implement the proposed methods.

Submitted 14 June, 2020; originally announced June 2020.

Comments: 39 pages, 4 figures
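
The piecewise-linear but continuous regression form described in the MKQR abstract above can be sketched as follows. This is only an illustration under an assumed parameterization (a base slope plus a slope change at each kink point), paired with the standard quantile check loss; it is not the API of the MultiKink R package, and the names `check_loss` and `multi_kink_line` are hypothetical.

```python
def check_loss(residual, tau):
    """Standard quantile check loss: rho_tau(u) = u * (tau - 1{u < 0})."""
    indicator = 1.0 if residual < 0 else 0.0
    return residual * (tau - indicator)


def multi_kink_line(x, intercept, slope, kinks, slope_changes):
    """Piecewise-linear function of x that stays continuous at each kink.

    Assumed form for this sketch:
        intercept + slope * x + sum_k delta_k * max(x - t_k, 0)
    """
    value = intercept + slope * x
    for t, delta in zip(kinks, slope_changes):
        value += delta * max(x - t, 0.0)
    return value


# Two kinks at x = 1 and x = 2: the slope goes 1 -> 2 -> 0.
kinks = [1.0, 2.0]
slope_changes = [1.0, -2.0]
for x in (0.5, 1.0, 1.5, 2.0, 3.0):
    print(x, multi_kink_line(x, 0.0, 1.0, kinks, slope_changes))
```

Because each hinge term `max(x - t, 0)` is zero at its own kink point, the fitted line changes slope there without jumping, which matches the continuity requirement stated in the abstract.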

arXiv:2003.05731 [pdf, other]

SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection

Authors: Yue Zhao, Xiyang Hu, Cheng Cheng, Cong Wang, Changlin Wan, Wen Wang, Jianing Yang, Haoping Bai, Zheng Li, Cao Xiao, Yunlong Wang, Zhi Qiao, Jimeng Sun, Leman Akoglu

Abstract: Outlier detection (OD) is a key machine learning (ML) task for identifying abnormal objects from general samples with numerous high-stakes applications including fraud detection and intrusion detection. Due to the lack of ground truth labels, practitioners often have to build a large number of unsupervised, heterogeneous models (i.e., different algorithms with varying hyperparameters) for further combination and analysis, rather than relying on a single model. How can we accelerate the training and the scoring of newly arriving samples by outlyingness (referred to as prediction throughout the paper) with a large number of unsupervised, heterogeneous OD models? In this study, we propose a modular acceleration system, called SUOD, to address this question. The proposed system focuses on three complementary acceleration aspects (data reduction for high-dimensional data, approximation for costly models, and taskload imbalance optimization for distributed environments), while maintaining performance accuracy. Extensive experiments on more than 20 benchmark datasets demonstrate SUOD's effectiveness in heterogeneous OD acceleration, along with a real-world deployment case on fraudulent claim analysis at IQVIA, a leading healthcare firm. We open-source SUOD for reproducibility and accessibility.

Submitted 4 March, 2021; v1 submitted 10 March, 2020; originally announced March 2020.

Comments: Proceedings of the 4th Conference on Machine Learning and Systems (MLSys). The code is available at http://github.com/yzhao062/SUOD. arXiv admin note: text overlap with arXiv:2002.03222

arXiv:1909.03991 [pdf, other]

Fast And Efficient Boolean Matrix Factorization By Geometric Segmentation

Authors: Changlin Wan, Wennan Chang, Tong Zhao, Mengya Li, Sha Cao, Chi Zhang

Abstract: Boolean matrices have been used to represent digital information in many fields, including bank transactions, crime records, natural language processing, protein-protein interaction, etc. Boolean matrix factorization (BMF) aims to find an approximation of a binary matrix as the Boolean product of two low rank Boolean matrices, which could generate a vast amount of information about the patterns of relationships between the features and samples. Inspired by binary matrix permutation theories and geometric segmentation, we developed a fast and efficient BMF approach called MEBF (Median Expansion for Boolean Factorization). Overall, MEBF adopted a heuristic approach to locate binary patterns presented as submatrices that are dense in 1's. At each iteration, MEBF permutes the rows and columns such that the permuted matrix is approximately Upper Triangular-Like (UTL) with the so-called Simultaneous Consecutive-ones Property (SC1P). The largest submatrix dense in 1's would lie in the upper triangular area of the permuted matrix, and its location was determined based on a geometric segmentation of a triangle. We compared MEBF with other state-of-the-art approaches on data scenarios with different sparsity and noise levels. MEBF demonstrated superior performance, with lower reconstruction error and higher computational efficiency, as well as more accurate sparse patterns than popular methods such as ASSO, PANDA and MP. We demonstrated the application of MEBF on both binary and non-binary data sets, and revealed its further potential in knowledge retrieval and data denoising.

Submitted 10 February, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

Comments: Accepted at AAAI 2020
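
To make the objective in the MEBF abstract above concrete, the sketch below computes the Boolean product of two binary matrices and counts mismatched entries against a target matrix. It illustrates only the general BMF objective (Boolean product and reconstruction error), not the MEBF heuristic itself; `boolean_product` and `reconstruction_error` are hypothetical helper names.

```python
def boolean_product(a, b):
    """Boolean matrix product: c[i][j] = OR over k of (a[i][k] AND b[k][j])."""
    inner = len(b)
    cols = len(b[0])
    return [
        [int(any(row[k] and b[k][j] for k in range(inner))) for j in range(cols)]
        for row in a
    ]


def reconstruction_error(x, a, b):
    """Number of entries where x differs from the Boolean product of a and b."""
    approx = boolean_product(a, b)
    return sum(
        xi != yi
        for row_x, row_y in zip(x, approx)
        for xi, yi in zip(row_x, row_y)
    )


# A 3x3 binary matrix covered exactly by two overlapping rank-1 patterns.
x = [[1, 1, 0],
     [1, 1, 1],
     [0, 1, 1]]
a = [[1, 0],
     [1, 1],
     [0, 1]]
b = [[1, 1, 0],
     [0, 1, 1]]
print(boolean_product(a, b), reconstruction_error(x, a, b))
```

Note that the overlap at the centre entry stays 1 under Boolean arithmetic (1 OR 1 = 1), whereas an ordinary matrix product would give 2 there.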

arXiv:1908.07483 [pdf, other]

doi 10.1145/3447516

Sensor-Based Estimation of Dim Light Melatonin Onset (DLMO) Using Features of Two Time Scales

Authors: Cheng Wan, Andrew W. McHill, Elizabeth Klerman, Akane Sano

Abstract: Circadian rhythms influence multiple essential biological activities including sleep, performance, and mood. The dim light melatonin onset (DLMO) is the gold standard for measuring human circadian phase (i.e., timing). The collection of DLMO is expensive and time-consuming since multiple saliva or blood samples are required overnight in special conditions, and the samples must then be assayed for melatonin. Recently, several computational approaches have been designed for estimating DLMO. These methods collect daily sampled data (e.g., sleep onset/offset times) or frequently sampled data (e.g., light exposure/skin temperature/physical activity collected every minute) to train learning models for estimating DLMO. One limitation of these studies is that they only leverage data from one time scale. We propose a two-step framework for estimating DLMO using data from both time scales. The first step summarizes data from before the current day, while the second step combines this summary with frequently sampled data of the current day. We evaluate three moving average models that input sleep timing data as the first step and use recurrent neural network models as the second step. The results using data from 207 undergraduates show that our two-step model with two time-scale features has statistically significantly lower root-mean-square errors than models that use either daily sampled data or frequently sampled data.

Submitted 1 March, 2022; v1 submitted 20 August, 2019; originally announced August 2019.

Comments: 16 pages, 6 figures, 4 tables, ACM Transactions on Computing for Healthcare

arXiv:1904.07998 [pdf, other]

SynC: A Unified Framework for Generating Synthetic Population with Gaussian Copula

Authors: Colin Wan, Zheng Li, Alicia Guo, Yue Zhao

Abstract: Synthetic population generation is the process of combining multiple socioeconomic and demographic datasets from different sources and/or granularity levels, and downscaling them to an individual level. Although it is a fundamental step for many data science tasks, an efficient and standard framework is absent. In this study, we propose a multi-stage framework called SynC (Synthetic Population via Gaussian Copula) to fill the gap. SynC first removes potential outliers in the data and then fits the filtered data with a Gaussian copula model to correctly capture dependencies and marginal distributions of sampled survey data. Finally, SynC leverages predictive models to merge datasets into one and then scales them accordingly to match the marginal constraints. We make four key contributions in this work: 1) propose a novel framework for generating individual level data from aggregated data sources by combining state-of-the-art machine learning and statistical techniques, 2) demonstrate, through two real-world datasets, its value as a feature engineering tool as well as an alternative to data collection in situations where gathering is difficult, 3) release an easy-to-use framework implementation for reproducibility, and 4) ensure the methodology is scalable at the production level and can easily incorporate new data.

Submitted 10 November, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

arXiv:1702.03613 [pdf]

A Multi-model Combination Approach for Probabilistic Wind Power Forecasting

Authors: You Lin, Ming Yang, Can Wan, Jianhui Wang, Yonghua Song

Abstract: Short-term probabilistic wind power forecasting can provide critical quantified uncertainty information of wind generation for power system operation and control. Owing to the complicated characteristics of wind power prediction error, it would be difficult to develop a universal forecasting model that dominates other alternative models. Therefore, a novel multi-model combination (MMC) approach for short-term probabilistic wind generation forecasting is proposed in this paper to exploit the advantages of different forecasting models. The proposed approach can combine different forecasting models that provide different kinds of probability density functions to improve the probabilistic forecast accuracy. Three probabilistic forecasting models based on sparse Bayesian learning, kernel density estimation and beta distribution fitting are used to form the combined model. The parameters of the MMC model are solved based on a Bayesian framework. Numerical tests illustrate the effectiveness of the proposed MMC approach.

Submitted 12 February, 2017; originally announced February 2017.

arXiv:1211.2945 [pdf]

The application of a perceptron model to classify an individual's response to a proposed loading dose regimen of Warfarin

Authors: Cen Wan, Irina V. Biktasheva, Steven Lane

Abstract: The dose regimen of Warfarin is separated into two phases. Firstly a loading dose is given, which is designed to bring the International Normalisation Ratio (INR) to within therapeutic range. Then a stable maintenance dose is given to maintain the INR within therapeutic range. In the United Kingdom (UK) the loading dose is usually given as three individual daily doses, the standard loading dose being 10mg on days one and two and 5mg on day three, which can be varied at the discretion of the clinician. However, due to the large inter-individual variation in the response to Warfarin therapy, it is difficult to identify which patients will reach the narrow therapeutic window for target INR, and which will be above or below the therapeutic window. The aim of this research was to develop a methodology using a neural networks classification algorithm and data mining techniques to predict, for a given loading dose and patient characteristics, whether the patient is more likely to achieve target INR or more likely to be above or below therapeutic range. Multilayer perceptron (MLP) and 10-fold stratified cross validation algorithms were used to determine an artificial neural network to classify patients' response to their initial Warfarin loading dose. The resulting neural network model correctly classifies an individual's response to their Warfarin loading dose over 80% of the time. As well as taking into account the initial loading dose, the final model also includes demographic, genetic and a number of other potential confounding factors. With this model clinicians can predetermine whether a given loading regimen, along with specific patient characteristics, will achieve a therapeutic response for a particular patient. The loading dose regimen can thus be tailored to meet the individual needs of the patient, reducing the risk of adverse drug reactions associated with Warfarin.

Submitted 13 November, 2012; originally announced November 2012.

Comments: 12 pages, 5 figures, 1 table

MSC Class: 68T05; 92C50

ACM Class: I.2.1; I.5.1; I.5.2; J.3

Showing 1–16 of 16 results for author: Wan, C