标题：A Convolutional Neural Network-Based Approach to Identify the Origins of Replication in Saccharomyces Cerevisiae∗
作者：Wu, Feng ;Yang, Runtao ;Chen, Jingui ;Zhang, Chengjin
作者机构：[Wu, Feng ;Yang, Runtao ;Chen, Jingui ;Zhang, Chengjin ] Shandong University, School of Mechanical, Electrical and Information Engineering, Weihai; 26 更多
会议名称：39th Chinese Control Conference, CCC 2020
会议日期：July 27, 2020 - July 29, 2020
来源：Chinese Control Conference, CCC
摘要：DNA replication is key to the inheritance of genetic information. Accurate, efficient and rapid identification of the origins of replication (ORIs) is crucial for understanding the mechanism of DNA replication. Especially for eukaryotes, each of their gene sequences contains multiple ORIs for more efficient replication. Although there are many predictors designed to identify eukaryotes' ORIs, many of them are only targeted to the gene sequences with a fixed length. In addition, the prediction accuracies are not satisfying, which still has great room to be improved. In view of the limitations in this field, a convolutional neural network-based approach is developed in this study to identify ORIs with different lengths in Saccharomyces cerevisiae (S. cerevisiae). As combining this study with the field of Natural Language Processing (NLP), trinucleotide feature vectors are constructed by Word2vec to represent each trinucleotide so as to the subsequent ORIs identification using Text-Convolutional Neural Network. As a result, the overall success rate of 88.3% was achieved which proved the effeciency of the proposed method to identify ORIs with any length.
© 2020 Technical Committee on Control Theory, Chinese Association of Automation.