Lung cancer is one of the most invasive cancers affecting over a million of the population. Non-small cell lung cancer (NSCLC) constitutes up to 85% of all lung cancer cases, and therefore, it is essential to identify predictive biomarkers of NSCLC for therapeutic purposes. Here we use a network theoretical approach to investigate the complex behavior of the NSCLC gene-regulatory interactions. We have used eight NSCLC microarray datasets GSE19188, GSE118370, GSE10072, GSE101929, GSE7670, GSE33532, GSE31547, and GSE31210 and meta-analyzed them to find differentially expressed genes (DEGs) and further constructed a protein-protein interaction (PPI) network. We analyzed its topological properties and identified significant modules of the PPI network using cytoscape network analyzer and MCODE plug-in. From the PPI network, top ten genes of each of the six topological properties like closeness centrality, maximal clique centrality (MCC), Maximum Neighborhood Component (MNC), radiality, EPC (Edge Percolated Component) and bottleneck were considered for key regulator identification. We further compared them with top ten hub genes (those with the highest degrees) to find key regulator (KR) genes. We found that two genes, CDK1 and HSP90AA1, were common in the analysis suggesting a significant regulatory role of CDK1 and HSP90AA1 in non-small cell lung cancer. Our study using a network theoretical approach, as a summary, suggests CDK1 and HSP90AA1 as key regulator genes in complex NSCLC network.
Keywords: differentially expressed genes; key regulator; non-small cell lung cancer; protein–protein interaction.