学科主题基础医学
In silico cloning of C17orf32, a novel human gene and verification of its coding region by RT-PCR
Zhang, DL; Ding, PG; Ling, LJ; Chen, RS; Ma, DL
关键词C17orf32 LOC124919 XM_058865 XP_058865 bioinformatics in silico cloning RT-PCR human genome annotation
刊名PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS
2002-08-01
29期:4页:543-549
收录类别SCI
文章类型Article
WOS标题词Science & Technology
类目[WOS]Biochemistry & Molecular Biology ; Biophysics
研究领域[WOS]Biochemistry & Molecular Biology ; Biophysics
关键词[WOS]SEQUENCE ; IDENTIFICATION ; PROTEINS ; GENOME ; CDNAS
英文摘要

A novel human gene encoding a protein of 208 amino acids is identified and characterized, which has been offered by HGNC with symbol of C17orf32 and name of chromosome 17 open reading frame 32. The full-length cDNA of 1679 bp for C17orf32 was cloned through a blast search of public databases following the identification of 1 119 bp cDNA obtained by EST assembly with full robotization of SiClone software (created by Chen RS and Ling LJ, and will be released on their website) in ShenWei IV-type supercomputer. Structurally, C17orf32 has one calcitonin / CGRP / IAPP family signature from amino acid 16 to 169, one dihydroorotase signature from amino acid 43 to 117, one tyrosine kinase phosphorylation site from amino acid 68 to 75, and one bipartite nuclear localization signal from amino acid 28 to 45. These motifs. imply the potential biological importance of this gene. Genomic organization analyses show that C17orf32 gene is comprised of six exons, in the size ranging from 43 to 1 101 bp, and five introns, in the size ranging from 163 to 1 124 bp, and spanning 4.61 kb. All of the exon/intron boundaries are consistent with the GT/AG rule, and consensuses surrounding the splice boundaries are found as well. The C17orf32 gene is located on accession NT - 010808.7 in the human chromosome 17, and is only linked with LOC124919, a hypothetical human gene of 889 bp mRNA encoding hypothetical protein XP - 058865 of 260 amino acids supported by XM - 058865. The sequence of LOC124919 has not been verified experimentally. Furthermore, the full-length ORF of 627 bp cDNA from 31 to 654 bp by RT-PCR from the single-stranded human gastric adenocarcinoma MGC803 cell line are cloned and sequenced, which is fully identical with that of the in silico cloning determined by the nucleotide sequencing. Thus,, in silico cloning of C17orf31 gene with GenBank accession number of AY074907 and TPA: BKO00260 is identified solely by bioinformatics analyses. The full-length cDNA sequence of 1 679 bp exhibits very good overall homology to that of LOC123722 of 899 bp mRNA, with matching percentage of 99 % in 78 % of total window and 57 % in 57 % of total window over the full-length nucleotide and protein, respectively. However, the base G in the No. 401 position of LOC123722 cDNA is a redundant insert, which causes a reading frame shift in the translation of an alternative protein. The insert G of LOC123722 is not supported by the experimental clone, and is fully rejected by human EST alignment, and is shown as a redundance by genomic GT/AG organization analysis. C17orf32 gene has 9 putative promoters with possibility of 58 % similar to 97 %, two TATAs, a stop codon in the upstream of ORF, two PolyA signals and a PolyA tail in the downstream of OFF, and accords with Kozak rule around the translation start of the ORF. Based on the above results, it can be concluded that a complete novel human gene is obtained. The full-length gene sequence exhibits little overall homology to any known protein at either the nucleotide or the amino acid level. The two related proteins, with 31 % (in 29 % of total window) and 18 % ( in 18 % of total window) identity over the full-length protein, respectively, are hypothetical caenorhabditis elegans protein F09E5. 11. p of 221 amino acids and polyphosphate kinase [the filamentous nitrogen-fixing cyanobacterium Anabaena sp. strain PCC 71201 of 736 amino acids. Taken together, by combining bioinformatics analyses with experimental verification, a novel human gene C17orf32 is successfully cloned, verified by a series of theoretical and experimental evidence.

The strategy will be helpful in discovering more novel human genes, even in correcting errors appeared in NCBI GENOME ANNOTATION PROJECT REFSEQs, such as LOC124919, a model reference sequence predicted from NCBI contig NT - 010808 by automated computational analysis using gene prediction method. Therefore, human genome coding region annotated by computer should be used with caution.

语种中文
WOS记录号WOS:000177560000011
引用统计
被引频次:3[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
版本出版稿
条目标识符http://ir.bjmu.edu.cn/handle/400002259/55624
专题北京大学基础医学院_北京大学人类疾病基因研究中心
作者单位1.Peking Univ, Ctr Human Dis Genom, China Natl Ctr Human Genome Res, Beijing 100083, Peoples R China
2.Chinese Acad Sci, Inst Biophys, Beijing 100101, Peoples R China
推荐引用方式
GB/T 7714
Zhang, DL,Ding, PG,Ling, LJ,et al. In silico cloning of C17orf32, a novel human gene and verification of its coding region by RT-PCR[J]. PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS,2002,29(4):543-549.
APA Zhang, DL,Ding, PG,Ling, LJ,Chen, RS,&Ma, DL.(2002).In silico cloning of C17orf32, a novel human gene and verification of its coding region by RT-PCR.PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS,29(4),543-549.
MLA Zhang, DL,et al."In silico cloning of C17orf32, a novel human gene and verification of its coding region by RT-PCR".PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS 29.4(2002):543-549.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
In silico cloning of(2186KB)期刊论文出版稿开放获取CC BY-NC-SA浏览 请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Zhang, DL]的文章
[Ding, PG]的文章
[Ling, LJ]的文章
百度学术
百度学术中相似的文章
[Zhang, DL]的文章
[Ding, PG]的文章
[Ling, LJ]的文章
必应学术
必应学术中相似的文章
[Zhang, DL]的文章
[Ding, PG]的文章
[Ling, LJ]的文章
相关权益政策
暂无数据
收藏/分享
文件名: In silico cloning of C17orf32, a novel human gene and verification of its coding region by RT-PCR.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。