In Silico Identification, Analysis, and Prediction Algorithm for Plant Gene Cluster

Himanshu Singh; C. Vineeth; Bhupender Thakur; Atul Kumar Upadhyay; Vikas Kaushik

oa In Silico Identification, Analysis, and Prediction Algorithm for Plant Gene Cluster

image of In Silico Identification, Analysis, and Prediction Algorithm for Plant Gene Cluster

Authors: Himanshu Singh¹, C. Vineeth², Bhupender Thakur³, Atul Kumar Upadhyay⁴, Vikas Kaushik⁵
View Affiliations Hide Affiliations

¹ School of Bioengineering and Biosciences, Lovely Professional University, Punjab, India ² School of Bioengineering and Biosciences, Lovely Professional University, Punjab, India ³ School of Bioengineering and Biosciences, Lovely Professional University, Punjab, India ⁴ Department of Biotechnology, Thapar University, Punjab, India ⁵ School of Bioengineering and Biosciences, Lovely Professional University, Punjab, India
Source: Global Emerging Innovation Summit (GEIS-2021) , pp 237-244
Publication Date: November 2021
Language: English

The concept/phenomenon of operons, which are organized genes that work in a coordinated way in microbes, is well established. Recent developments in genetics, biochemistry, and bioinformatics have unraveled similar gene arrangements in plants. Here we aim to develop an algorithm/tool which would help us detect and identify biosynthetic gene clusters (BGCs) from any input plant genome. Through this tool, we intend to match or supersede the performance of pre-existing sting tools for BGC prediction, like the popular plantiSMASH. The predictions models were developed using the machine learning tool WEKA using the physicochemical properties as data set to classify between terpene synthases and non-terpene synthases. A set of ten physicochemical properties were selected and their values were predicted for each of the 159 proteins (terpene synthases and non-terpene synthases) Employing the random forest and SMO classifiers, we were able to obtain significantly promising accuracy of over 90 percent with 66 percent percentage split testing. Accurate prediction of BGCs in the plants, especially the major food crops like rice, wheat, and corn revolutionize farming and nutrition for the better.

Hardbound ISBN: 9781681089027

Ebook ISBN: 9781681089010

Book DOI: https://doi.org/10.2174/97816810890101210101

From This Site

/content/books/9781681089010.chapter-27

dcterms_subject,pub_keyword

-contentType:Journal -contentType:Figure -contentType:Table -contentType:SupplementaryData

10

5

/content/books/9781681089010.chapter-27

dcterms_subject,pub_keyword

-contentType:Journal -contentType:Figure -contentType:Table -contentType:SupplementaryData

10

5

Chapter

content/books/9781681089010

Book

false

en

oa In Silico Identification, Analysis, and Prediction Algorithm for Plant Gene Cluster

From This Site