Neeraj Nagpal, Neera Munjal, Sayan Chatterjee
University School of Biotechnology, Guru Gobind Singh Indraprastha University, New Delhi, India.
Growing possibilities of biotechnology for genome sequencing lead to generation of sequences for millions of genes. However, function of majority of these genes is unknown, and can be determined experimentally only for a few of them. Therefore, a large part of proteomes is represented by hypothetical proteins (HP), i.e. proteins predicted from nucleic acid sequences only and protein sequences with unknown function. The usual scenario involving hypothetical protein is in gene identification during genome analysis. When the bioinformatics tool used for the gene identification finds a large open reading frame without an analog in the protein database, it returns “hypothetical protein” as an annotation remark. Read More …