Non-small-cell lung cancer (NSCLC) is the most common cause of premature death among the malignant diseases worldwide. The current staging criteria do not fully capture the complexity of this disease. Molecular biology techniques, particularly gene expression microarrays, proteomics, and next-generation sequencing, have recently been developed to facilitate effectively its molecular classification. The underlying etiology, pathogenesis, therapeutics, and prognosis of NSCLC based on an improved molecular classification scheme may promote individualized treatment and improve clinical outcomes. This review focuses on the molecular classification of NSCLC based on gene expression microarray technology reported during the past decade, as well as their applications for improving the diagnosis, staging and treatment of NSCLC, including the discovery of prognostic markers or potential therapeutic targets. We highlight some of the recent studies that may refine the identification of NSCLC subtypes using novel techniques such as epigenetics, proteomics, or deep sequencing.