PMID- 15629035 OWN - NLM STAT- MEDLINE DCOM- 20050211 LR - 20240326 IS - 1672-0229 (Print) IS - 2210-3244 (Electronic) IS - 1672-0229 (Linking) VI - 1 IP - 3 DP - 2003 Aug TI - Genome organization of the SARS-CoV. PG - 226-35 AB - Annotation of the genome sequence of the SARS-CoV (severe acute respiratory syndrome-associated coronavirus) is indispensable to understand its evolution and pathogenesis. We have performed a full annotation of the SARS-CoV genome sequences by using annotation programs publicly available or developed by ourselves. Totally, 21 open reading frames (ORFs) of genes or putative uncharacterized proteins (PUPs) were predicted. Seven PUPs had not been reported previously, and two of them were predicted to contain transmembrane regions. Eight ORFs partially overlapped with or embedded into those of known genes, revealing that the SARS-CoV genome is a small and compact one with overlapped coding regions. The most striking discovery is that an ORF locates on the minus strand. We have also annotated non-coding regions and identified the transcription regulating sequences (TRS) in the intergenic regions. The analysis of TRS supports the minus strand extending transcription mechanism of coronavirus. The SNP analysis of different isolates reveals that mutations of the sequences do not affect the prediction results of ORFs. FAU - Xu, Jing AU - Xu J AD - Beijing Genomics Institute, Chinese Academy of Sciences, Beijing 101300, China. FAU - Hu, Jianfei AU - Hu J FAU - Wang, Jing AU - Wang J FAU - Han, Yujun AU - Han Y FAU - Hu, Yongwu AU - Hu Y FAU - Wen, Jie AU - Wen J FAU - Li, Yan AU - Li Y FAU - Ji, Jia AU - Ji J FAU - Ye, Jia AU - Ye J FAU - Zhang, Zizhang AU - Zhang Z FAU - Wei, Wei AU - Wei W FAU - Li, Songgang AU - Li S FAU - Wang, Jun AU - Wang J FAU - Wang, Jian AU - Wang J FAU - Yu, Jun AU - Yu J FAU - Yang, Huanming AU - Yang H LA - eng PT - Journal Article PT - Research Support, Non-U.S. Gov't PL - England TA - Genomics Proteomics Bioinformatics JT - Genomics, proteomics & bioinformatics JID - 101197608 SB - IM MH - Amino Acid Substitution MH - Base Composition MH - Base Sequence MH - Computational Biology/methods MH - *Genome, Viral MH - Isoelectric Point MH - Models, Genetic MH - Molecular Sequence Data MH - Molecular Weight MH - Open Reading Frames MH - Severe acute respiratory syndrome-related coronavirus/*genetics MH - Sequence Analysis MH - Transcription, Genetic PMC - PMC5172239 EDAT- 2005/01/05 09:00 MHDA- 2005/02/12 09:00 PMCR- 2016/11/28 CRDT- 2005/01/05 09:00 PHST- 2005/01/05 09:00 [pubmed] PHST- 2005/02/12 09:00 [medline] PHST- 2005/01/05 09:00 [entrez] PHST- 2016/11/28 00:00 [pmc-release] AID - S1672-0229(03)01028-3 [pii] AID - 10.1016/s1672-0229(03)01028-3 [doi] PST - ppublish SO - Genomics Proteomics Bioinformatics. 2003 Aug;1(3):226-35. doi: 10.1016/s1672-0229(03)01028-3.