DEHA2D03322g


similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130W CEG1 Alpha (guanylyltransferase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CET1, an RNA 5'-triphosphatase) involved in adding the 5' cap to mRNA

Genomic environment map

Element type: CDS
Element length: 1377 nucleotides, on sense strand of Deha2D: 289373..290749.
Other names:
DEHA-CDS2658.1
DEHA-IPF10451
DEHA0D04026g
Coding sequence: 459 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R1787 GL3R1787.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein DEHA2D03322p  


similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130W CEG1 mRNA guanylyltransferase

Protein domain map

Protein length: 458 amino acids
Protein family: GL3R1787
Database cross references:
InterPro: IPR001339
InterPro: IPR012340
InterPro: IPR013846
InterPro: IPR017075
UniProtKB/Swiss-Prot: Q6BT58

Computed results for DEHA2D03322p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  

Nucleotide sequence

>DEHA2D03322g.nt
TGTAAGTTAACTAATGAAGATTCGATGAAATACCATATCACTTGTGCCCAAGATACCCCA
AATTTTAAATTAGGGTTCAAGCTATGCTCTCAAAGACTCAATGACAAAGACAATAAACTT
GTCAAGGTCGGTGATGAATACGGTAAATTGCAACCTATTTTACTATGCCCCATGCATACA
ACTTCTTCAATAAAGATCCATAGTATGAGAACTTTGGGTAATAGAGTTTATGGTATTAGT
AGAGATGAGCTTAAACCTATTATTCAAATCTACTTGGAAGATCTAGTAAAAAGCAATAAT
ACAAGTAATAAGTTAACTGGACCTCAGTTGAAGTCTAATATTTATATTAAAATGATAAAA
GCATATGAGGAACAAGAGAAGAATCAGAAACTGAAAAATAATTATTTGAAATTGTTAAAT
CTGTCGAATATTATAAATGTGAAGCCTAAAGACGTCAAATCTTGTGGAAGATGTAAAACG
ACTGCTTCTCCTATGTGGTGGATACAAAAAGATGGTAATCAAAGCAATTTTAATGAAATG
CGTGATTCTCCGAACCAAAAGTTTCTTTGTCAAACTTGTTATCATATAAAAGATGAGGAT
AAAGAATCAACTCCAGAACGTGAAAGCCAGCCCTTATTTGATTTACTTGATGAACCATTG
GATGGAGAATTGTATGGATTACAAAATAATGAAGATAAATTATCGGATATTTATCCAAAA
GATACGAAACACGAACTGGAGCTGGTTACCTCTTCTGAAATTACAAGGTCAAAAATATCA
ATAGGTGATATATTAAGTTAAACATTAAATATCTTCCCTATATTTAATACATTTACTCTA
TGGTGGTTAGTCATATATATTATTTGCATCTATACTAAGCATGTAGTTCAAAAAATGTCG
TGAATGGGGTGTTATAAAAATGTCGTTCAGTTATGTTTTTTGAAAATGTTCATTTCATTA
AAGGCATTCTAATACTACTTTGATATATATACCCGTCAAGATGATTCAATTGGAAGAACG
AGATATGCCTGAGATACCAGGTACAATTCTCGATAGAAATGAAACACAAGAATTAAGATT
AATGGTGGCTGATTTGTTAGGTAGAAGAAATCCATCATTCCCTGGAGCTCAGCCAATTTC
ATTTGAAAGGTATCATTTGAATGATACGTTAATGAATAAGGATTACTATGTATGTGAGAA
GTCTGATGGATTGCGTTGTTTATTATTTATAATTAATCATCCCGAAAGAGGTGAAGGGGT
CTTTCTAATAACCAGAGAAAATGATTATTACTACATTCCCAACATCCATTTCCCGTTAAC
AAATAATGAAGAAAAAGGTAAAACATATCACCATGGAACTTTACTAGATGGAGAATTAGT
TCTTGAAACAAAAAATGTCCCAGAGCCTGTATTGAGGTTTTGCATATTTGACGCATTGGC
TATTAATGGGAAAGATATAACCAAGAGACACTTACCTAAGAGACTTGGCTATATTACAGA
ACAGGTAATGAAACCATTTGACAATTTTAAACGTAAAAACCCTGAAATTGTCAACGCTCC
TGATTTTCCTTTCAAAGTAAGCTTTAAATTAATGACATCCTCTTATCACGCTGATGATGT
CTTATCTAAAATGGACCAGTTATTTCACGAATCTGATGGCTTGATTTTCACTTGCGCTGA
GACGCCATATGTGTTTGGTACTGATAGCACCTTATTAAAGTGGAAACCAGCACATGAAAA
TACTGTCGATTACAAAATGGAAATGATATTTAAGAAGTTTCAAGACCCTGATTTAGACCC
AAGAGATCCGGACTCTACATACACAGACTATGATTCTAAGCCAGAACTAATCAAGTTAAG
GGTTTGGAAGGGTGGCGCAGATTATGAGGACTTTACCAAATTATCCTTAGAAAATGAAGA
TTGGGAAAAATTAAAGAACTTGAGACAACCATTGCAAGGAAGGATTGTTGAATGCCGTAA
GAAACTTTCAGACCCTGGATTTTGGGAGATGTTACGATTTAGGAATGATAAGAGTAATGG
AAATCATATTTCTGTTGTTGATAAAATTTTACATAGTATTCAAGATGGTGTGAGTGAAGA
AGAATTGATCGAAGCATGTCCTAAGATTGGTAAAGCATGGAAGAAAAGGATCTATGAAAA
ATCACAAGGTAGCAGATCACTGTATAGCGAAACCGGAAGATCACATCCTGAGCCTAACAG
AGAAGATGAGCCTGCACTGAAACGTACTAAGATTGACATGGAAGAGCCCGAGCCTAATGG
GTTTGGTGGGAGTACAACAAATCATCAAAATGCTTCTAAGCAGAACTCTGGGGAATTTCA
GGATATTCCGACCTATGAAGATAGTGACGATGAATGAACTTGAGGAAATACATAATGTAC
TATAGAAAAATTGCTGCAAAGACAGTTTGTAACATTAACAAAATTCAACATAAACCTAAA
TAGAAAACTGTACCAATAGAAAATTTAAAGAGCTGTTTCCTCATCATAGGCGGGGGGGAC
TATATCGAATTTATTAGCTGAGATCATATCCATGACAATATTGTCGTTCCTTTTCTGCAT
ATCAGAATAGCAAGGTGGTGATTCGGATGCAATTTCTGAATCATGACTTAACAACTTTAT
AGGTGCTGTAAAAGAAAGTTCGATTTTCTTGAATCTT

Coding sequence

>DEHA2D03322g.cds
ATGATTCAATTGGAAGAACGAGATATGCCTGAGATACCAGGTACAATTCTCGATAGAAAT
GAAACACAAGAATTAAGATTAATGGTGGCTGATTTGTTAGGTAGAAGAAATCCATCATTC
CCTGGAGCTCAGCCAATTTCATTTGAAAGGTATCATTTGAATGATACGTTAATGAATAAG
GATTACTATGTATGTGAGAAGTCTGATGGATTGCGTTGTTTATTATTTATAATTAATCAT
CCCGAAAGAGGTGAAGGGGTCTTTCTAATAACCAGAGAAAATGATTATTACTACATTCCC
AACATCCATTTCCCGTTAACAAATAATGAAGAAAAAGGTAAAACATATCACCATGGAACT
TTACTAGATGGAGAATTAGTTCTTGAAACAAAAAATGTCCCAGAGCCTGTATTGAGGTTT
TGCATATTTGACGCATTGGCTATTAATGGGAAAGATATAACCAAGAGACACTTACCTAAG
AGACTTGGCTATATTACAGAACAGGTAATGAAACCATTTGACAATTTTAAACGTAAAAAC
CCTGAAATTGTCAACGCTCCTGATTTTCCTTTCAAAGTAAGCTTTAAATTAATGACATCC
TCTTATCACGCTGATGATGTCTTATCTAAAATGGACCAGTTATTTCACGAATCTGATGGC
TTGATTTTCACTTGCGCTGAGACGCCATATGTGTTTGGTACTGATAGCACCTTATTAAAG
TGGAAACCAGCACATGAAAATACTGTCGATTACAAAATGGAAATGATATTTAAGAAGTTT
CAAGACCCTGATTTAGACCCAAGAGATCCGGACTCTACATACACAGACTATGATTCTAAG
CCAGAACTAATCAAGTTAAGGGTTTGGAAGGGTGGCGCAGATTATGAGGACTTTACCAAA
TTATCCTTAGAAAATGAAGATTGGGAAAAATTAAAGAACTTGAGACAACCATTGCAAGGA
AGGATTGTTGAATGCCGTAAGAAACTTTCAGACCCTGGATTTTGGGAGATGTTACGATTT
AGGAATGATAAGAGTAATGGAAATCATATTTCTGTTGTTGATAAAATTTTACATAGTATT
CAAGATGGTGTGAGTGAAGAAGAATTGATCGAAGCATGTCCTAAGATTGGTAAAGCATGG
AAGAAAAGGATCTATGAAAAATCACAAGGTAGCAGATCACTGTATAGCGAAACCGGAAGA
TCACATCCTGAGCCTAACAGAGAAGATGAGCCTGCACTGAAACGTACTAAGATTGACATG
GAAGAGCCCGAGCCTAATGGGTTTGGTGGGAGTACAACAAATCATCAAAATGCTTCTAAG
CAGAACTCTGGGGAATTTCAGGATATTCCGACCTATGAAGATAGTGACGATGAATGA

Predicted translation product

>DEHA2D03322g.aa
MIQLEERDMPEIPGTILDRNETQELRLMVADLLGRRNPSFPGAQPISFERYHLNDTLMNK
DYYVCEKSDGLRCLLFIINHPERGEGVFLITRENDYYYIPNIHFPLTNNEEKGKTYHHGT
LLDGELVLETKNVPEPVLRFCIFDALAINGKDITKRHLPKRLGYITEQVMKPFDNFKRKN
PEIVNAPDFPFKVSFKLMTSSYHADDVLSKMDQLFHESDGLIFTCAETPYVFGTDSTLLK
WKPAHENTVDYKMEMIFKKFQDPDLDPRDPDSTYTDYDSKPELIKLRVWKGGADYEDFTK
LSLENEDWEKLKNLRQPLQGRIVECRKKLSDPGFWEMLRFRNDKSNGNHISVVDKILHSI
QDGVSEEELIEACPKIGKAWKKRIYEKSQGSRSLYSETGRSHPEPNREDEPALKRTKIDM
EEPEPNGFGGSTTNHQNASKQNSGEFQDIPTYEDSDDE*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
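As a quick check of the rule above, the lengths reported for this element can be reproduced from its coordinates. This is only a sketch: it assumes the coordinates are 1-based and inclusive (which is consistent with the stated element length) and that the element has no introns, so the element length equals the coding sequence length.

```python
# Values copied from this page; variable names are illustrative only.
start, end = 289373, 290749            # Deha2D coordinates, assumed 1-based inclusive

element_length = end - start + 1       # length in nucleotides
codons, remainder = divmod(element_length, 3)
protein_length = codons - 1            # the stop codon is counted but not translated

print(element_length, codons, protein_length)
```

With the values above, this gives 1377 nucleotides, 459 codons and 458 amino acids, matching the figures listed for DEHA2D03322g and DEHA2D03322p.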

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of the protein-coding potential of the DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
Color | Nucleotide sequence and Coding sequence | Predicted translation product
RED | start and stop codons | Initial methionine and sequence end
BLUE | coding sequence | protein sequence
grey | non-coding sequence (upstream, downstream or intron) |
grey | donor and acceptor splicing sites |