ORF3d

Gene found in SARS-CoV-2
ORF3d
Identifiers
OrganismSARS-CoV-2
SymbolORF3d
UniProtP0DTG0
Search for
StructuresSwiss-model
DomainsInterPro

ORF3d is a gene found in SARS-CoV-2 (the virus that causes COVID-19) and at least one closely related coronavirus found in pangolins, though it is not found in other closely related viruses within the Sarbecovirus subgenus. It is 57 codons long and encodes a novel 57 amino acid residue protein of unknown function.[1] At least two isoforms have been described, of which the shorter 33-residue form, ORF3d-2, may be more highly expressed, or even the only form expressed.[1][2] It is reported to be antigenic and antibodies to the ORF3d protein occur in patients recovered from COVID-19.[3] There is no homolog in the genome of the otherwise closely related SARS-CoV (which causes the disease SARS).[1][4]

Nomenclature

There has been significant confusion in the scientific literature around the nomenclature used for the accessory proteins of SARS-CoV-2, especially several overlapping genes with ORF3a.[4] Many scientific papers have referred to ORF3d and its protein product as ORF3b, due to confusion caused by differences in the length of ORF3b in SARS-CoV (about 155 codons) and SARS-CoV-2 (only 22 codons).[4] Exacerbating the confusion, both the 57-codon protein product[5] and the 22-codon protein product[6] have been described to have similar effects as interferon antagonists.[4]

The recommended nomenclature for SARS-CoV-2 uses the term ORF3b for the 22-codon gene homologous to the 5' end of ORF3b in SARS-CoV, and uses the term ORF3d for the 57-codon gene.[4]

Comparative genomics

ORF3d is an overlapping gene whose open reading frame overlaps both ORF3a and ORF3c in the SARS-CoV-2 genome. This potentially represents a rare example of all three possible reading frames of the same sequence region encoding functional proteins.[1][4] ORF3d is not present in SARS-CoV or other related coronaviruses, except for a coronavirus found in pangolins.[1] SARS-CoV-2 genome sequences have been extensively sampled throughout the COVID-19 pandemic, and examples of SARS-CoV-2 variants with truncations in ORF3d due to the introduction of a stop codon have been identified with relatively high prevalence.[1][7]

Bioinformatics analysis of the ORF3d region suggests that the sequence of the predicted protein product is not well conserved and raises the possibility that the gene does not encode a functional protein, despite experimental evidence of protein expression.[8]

Expression

The ORF3d protein has two isoforms, one 57 amino acid residues long and one 33 residues long, the latter of which is known as ORF3d-2.[1] There is experimental evidence from studies such as ribosome profiling for expression of at least ORF3d-2, without clear evidence for the full-length ORF3d.[1][2]

Function

The function of the ORF3d protein is not known, and it is possible that the gene does not code for a protein with any functional role in the viral life cycle.[8] When expressed under experimental conditions in cell culture, the ORF3d protein appears to be an interferon antagonist.[5]

Robust antibody responses to peptides from ORF3d have been reported in patients recovered from COVID-19.[3]

References

  1. ^ a b c d e f g h Nelson CW, Ardern Z, Goldberg TL, Meng C, Kuo CH, Ludwig C, et al. (October 2020). "Dynamically evolving novel overlapping gene as a factor in the SARS-CoV-2 pandemic". eLife. 9: e59633. doi:10.7554/eLife.59633. PMC 7655111. PMID 33001029.
  2. ^ a b Finkel Y, Mizrahi O, Nachshon A, Weingarten-Gabbay S, Morgenstern D, Yahalom-Ronen Y, et al. (January 2021). "The coding capacity of SARS-CoV-2" (PDF). Nature. 589 (7840): 125–130. Bibcode:2021Natur.589..125F. doi:10.1038/s41586-020-2739-1. PMID 32906143. S2CID 218582461.
  3. ^ a b Hachim A, Kavian N, Cohen CA, Chin AW, Chu DK, Mok CK, et al. (October 2020). "ORF8 and ORF3b antibodies are accurate serological markers of early and late SARS-CoV-2 infection". Nature Immunology. 21 (10): 1293–1301. doi:10.1038/s41590-020-0773-7. PMID 32807944. S2CID 221136730.
  4. ^ a b c d e f Jungreis I, Nelson CW, Ardern Z, Finkel Y, Krogan NJ, Sato K, et al. (June 2021). "Conflicting and ambiguous names of overlapping ORFs in the SARS-CoV-2 genome: A homology-based resolution". Virology. 558: 145–151. doi:10.1016/j.virol.2021.02.013. hdl:1721.1/130363. PMC 7967279. PMID 33774510.
  5. ^ a b Lu R, Zhao X, Li J, Niu P, Yang B, Wu H, et al. (February 2020). "Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding". Lancet. 395 (10224): 565–574. doi:10.1016/S0140-6736(20)30251-8. PMC 7159086. PMID 32007145.
  6. ^ Konno Y, Kimura I, Uriu K, Fukushi M, Irie T, Koyanagi Y, et al. (September 2020). "SARS-CoV-2 ORF3b Is a Potent Interferon Antagonist Whose Activity Is Increased by a Naturally Occurring Elongation Variant". Cell Reports. 32 (12): 108185. doi:10.1016/j.celrep.2020.108185. PMC 7473339. PMID 32941788.
  7. ^ Lam JY, Yuen CK, Ip JD, Wong WM, To KK, Yuen KY, Kok KH (December 2020). "Loss of orf3b in the circulating SARS-CoV-2 strains". Emerging Microbes & Infections. 9 (1): 2685–2696. doi:10.1080/22221751.2020.1852892. PMC 7782295. PMID 33205709.
  8. ^ a b Jungreis I, Sealfon R, Kellis M (May 2021). "SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes". Nature Communications. 12 (1): 2642. Bibcode:2021NatCo..12.2642J. doi:10.1038/s41467-021-22905-7. hdl:1721.1/130581. PMC 8113528. PMID 33976134.
  • v
  • t
  • e
Coronavirus genomes
Viral structural protein
Viral nonstructural protein
(expressed from ORF1ab)
  • nonstructural protein 1
  • nonstructural protein 2
  • papain-like protease (nsp3)
  • nonstructural protein 4
  • 3C-like protease (nsp5)
  • nonstructural protein 6
  • nonstructural protein 7
  • nonstructural protein 8
  • nonstructural protein 9
  • nonstructural protein 10
  • nonstructural protein 11
  • nonstructural protein 12
  • nonstructural protein 13
  • nonstructural protein 14
  • nonstructural protein 15
  • nonstructural protein 16
Viral accessory proteinRNA
  • v
  • t
  • e
DNA
linear ds-DNA
(Duplodnaviria,
Varidnaviria)
Herpes simplex
VSPs:
capsid:
VNPs:
Vaccinia
VNPs:
  • B13R
Adenoviridae
VNPs:
circular ds-DNA
(Duplodnaviria,
Varidnaviria?)
Epstein–Barr
VSPs:
VNPs:
ncRNA:
Baculoviridae
VNPs:
other
(Riboviria,
Monodnaviria)
Polyomaviridae
(SV40, MPyV, MCPyV, HaPyV)
(non-enveloped circular ds-DNA)
VSPs:
capsid:
VNPs:
oncoprotein:
Hepatitis B
(circular partially ds-DNA)
VSPs:
VNPs:
RNA
ds-RNA
(Riboviria)
Rotavirus
(Duplornaviricota)
VNPs:
  • NSP1
  • NSP2
  • NSP3
  • NSP4
  • NSP5
  • NSP6
Rhinov., Polio, Hep A,
etc. (Pisuviricota)
VNPs:
ss-RNA
positive-sense
(Riboviria)
Hepatitis C
(Kitrinoviricota)
VSPs:
VNPs:
SARS-CoV-2
(Pisuviricota)
VSPs:
VNPs:
ss-RNA
negative-sense
(Negarnaviricota)
Influenza virus
VSPs:
capsid:
glycoprotein:
VNPs:
Parainfluenza
VSPs:
glycoprotein:
Mumps
VSPs:
glycoprotein:
Measles
VSPs:
glycoprotein:
RSV
VSPs:
glycoprotein:
Zaire ebolavirus
VSPs:
capsid:
Indiana vesiculovirus
VSPs:
capsid:
RT
Structure and genome of HIV
VSPs:
VRAPs:
Multiple
Fusion protein
oncoprotein: