Splicing-site recognition of rice (Oryza sativa L.) DNA sequences by support vector machines

J Zhejiang Univ Sci. 2003 Sep-Oct;4(5):573-7. doi: 10.1631/jzus.2003.0573.

Abstract

Motivation: It was found that high accuracy splicing-site recognition of rice (Oryza sativa L.) DNA sequence is especially difficult. We described a new method for the splicing-site recognition of rice DNA sequences.

Method: Based on the intron in eukaryotic organisms conforming to the principle of GT-AG, we used support vector machines (SVM) to predict the splicing sites. By machine learning, we built a model and used it to test the effect of the test data set of true and pseudo splicing sites.

Results: The prediction accuracy we obtained was 87.53% at the true 5' end splicing site and 87.37% at the true 3' end splicing sites. The results suggested that the SVM approach could achieve higher accuracy than the previous approaches.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • DNA / genetics*
  • Genetic Vectors
  • Introns
  • Models, Theoretical
  • Oryza / genetics*
  • RNA Splicing*

Substances

  • DNA