YALI0E02904g


similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130w CEG1 Alpha (guanylyltransferase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CET1, an RNA 5'-triphophatase) involved in adding the 5' cap to mRNA

Genomic environment map

Element type: CDS
Element length: 1176 nucleotides,
on sense strand of
Yali0E: 318087..319262.
Other names:
YALI-CDS3337.1
YALI-IPF2453
Coding sequence: 392 codons.
Database cross references:
EMBL: CR382131
GeneID: 2912122
HOGENOM: Q6C783

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R1787 GL3R1787.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0E02904p  


similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130w CEG1 mRNA guanylyltransferase (mRNA capping enzyme alpha subunit)

Protein domain map

Protein length: 391 amino acids
Protein family: GL3R1787
Database cross references:
Gene3D: G3DSA:2.40.50.140
InterPro: IPR001339
InterPro: IPR012340
InterPro: IPR013846
InterPro: IPR017075
KEGG: yli:YALI0E02904g
PIRSF: PIRSF036959
Pfam: PF01331
Pfam: PF03919
RefSeq: XP_503479.1
UniProtKB/Swiss-Prot: Q6C783
UniprotKB: MCE1_YARLI

Computed results for YALI0E02904p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  

Nucleotide sequence

>YALI0E02904g.nt
TCTTAGCAGCAGCGGATGCCGTAGAAGCGAGCTTGCTGGGCGCGCCGGTGTCGATGACAA
TATCAATGTCAGAGTCGTCGGATTCGTATTCAGAATCGGAATCATCTTCCGGTTCTTGTT
GCTTCGTGGTCTGCGAGTGTGGTTCCTCATCATCCTCAGCTCCCTCGGGCTCAACTCGGG
CCCGCTTTACCGCAGGCTCCTCGGTGCCGTAGAGAAAGTCGTCGTCGTCATCCATCTTTG
ATGTCAGATGGGGGGATGGTGATGTTGCCGTGGTGGGTGTCGTTTGGCGTAGTGCTTTGC
AGAAACCTTTCTTTAGGTTTGATGGTGGCCCAATACGAGGAGAGAATGTGGCGAAATGGG
GAGTTTTGAGGGTATATGTATGAGAAATTTGAGATTTCTGAACCAAATAAATTTACAGTC
ATATTATCTCTTTAAAAAAGCATCTCTAGACTGCTAAAGACGTATTAGCGTCCCTCAAAC
ATGGCTGACTAACACAACTCTATTTGGCCACACCAAACACTGCTGATGTATGGCGGCGGT
GGTGATATGACTGCTTCATTGCTGCTGGTTAAACAACATACAAGTATGTACTCGTACTTA
CAAATAACGTGAAGAAGTCACCTGTGCTAGCCCATCAAAACAGAAATGTTTTACAATAGT
TTGGAGAATATCGGTACAAGTATTTCGAACAGACGCTTTAGTAAGACATTGGATACTCCT
ATAAGGAACTCATGTCGTTTCTCGTTCTACTTCGGTGCGAAAACTTTCAGAAGGACACTG
CAGAGGAAGACATTCATACCTGTACCTTCATGAAATCATCCCCCCTACAGTACAAAGAAT
CGAGCCCCTTTCTACTGTAGTTCAGAAGCAATAGGGTTATGAAGGACCTTAACGAAGAGC
TAAATACCACCACAATACCACACTCCATTCACCACAACCACCTGCATCAGGACGGCATTA
GGTTCAGCCATCTCAAATTTCACACCCCACCACCATCATCATGTCTGGAATCGTCCCCGA
AATTCCAGGCGAACAAGCGCCCCCAGATGCTGCTCACCAGCTCAAGGTGGATGTTGCGCG
CCTCTTGCAAAAACCCAAGCTCAACTTTCCTGGTGCTCAGCCCGTCTCATTCGCCAGAAA
ACACATTGAAGAGGAGCTCTTCAAGCGAGATTATTATGTGTGCGAGAAGAGTGACGGTCT
TCGATGTCTCATGTACGTGACATGGGAAAATAACCCCGACACTGGCCCCCAGCAGGTCAC
GTATCTGATCACACGAAACAACGAGTTCTTCTTCATTCCCATGGTCCACTTCCCTTCGAA
TGATGGTAAGCCATTGCAAGACACAATTGTCGACGGAGAGCTCGTGCTCACAAAGGCCGA
ACCCCGATCCCTGCATTTCCTCATGTTTGACTGTCTTGCCTGCAACAAGATTCTGCTTAC
AGGCCGACCTTTGGACAAGAGATTGGGGTACCTCAATGCTGCCATCTCGCATCCTCTGAA
AGAGTACCTGCACAAGAACCCCGAAGTTGCCCGCGACTTTCCCTTCAGTGTGCGCGTCAA
GGATATGCAGTTTGCTTACAATGTGATGAACGTCTTTGCCAGCTTCCCGCACCTTCCGCA
CATCACAGATGGTCTCATCTTCACCTGTAGGGACCACCCTTACGTCTCTGGAACAGACGA
GCGAATTCTCAAGTGGAAAAAACAAGACGAAAACTCGGTAGACTTTCTGATGACCATGAA
GTTTCCTATTTTCGAGGACACAAATGGTGAGAGTTGGACAGACTACGACGCAAAACCCGA
AATCACACTCCTGGTGTGGACAGGACGGGACGGAAGCCGACCTTACGGCGAGCTGTATCT
CACTGACGAGGAATGGGATAACTTGAAGGCGCTGGAGGAGCCTCTGGAAGAGCGGGTCGT
GGAATGTATTAAGGATGATAAGAAGCGATGGAGATATCTTAGATTCCGGGATGACAAGAC
GAATGCCAACTACATCACAACTGTCGAGAAGGTCATTGACTCTATTGATGATCCCGTCTC
AGAGAAGAATCTGTTAGATGCCGCTCCCAAAATTAAGGAGCTGTGGAAGGAACGAAATCG
ACGCCCACGCGACGAAGATAGAAAGCGAGTAGGAGGGGACGACCATGATCACGGTGCCAA
AAGAGCTCGACAGTAAAGCCTCAACCCAGATCAAGTAAATTATATATTGTAATATTAGCA
CACATACTTCAAAACCATATGGGAAAGGACCACAAGTAGACACTACAGTATGTACATACA
TACAGTACACGACAAAAAGTTATGGAGAGTGATGGGAATAAATAAAAAAGCAAGCAAAAA
AGGTTTCTCCGATACGGGGAATCGAACCCCGGTCTCCACGGTTCTCAAGATGAGAGCGTG
ATGTGATAGCCCCTACACTATATCGGAAGTTGAAAATCAGCCCGACGTGGGGCTTGAACC
CACAGCCTTGAGATCA

Coding sequence

>YALI0E02904g.cds
ATGTCTGGAATCGTCCCCGAAATTCCAGGCGAACAAGCGCCCCCAGATGCTGCTCACCAG
CTCAAGGTGGATGTTGCGCGCCTCTTGCAAAAACCCAAGCTCAACTTTCCTGGTGCTCAG
CCCGTCTCATTCGCCAGAAAACACATTGAAGAGGAGCTCTTCAAGCGAGATTATTATGTG
TGCGAGAAGAGTGACGGTCTTCGATGTCTCATGTACGTGACATGGGAAAATAACCCCGAC
ACTGGCCCCCAGCAGGTCACGTATCTGATCACACGAAACAACGAGTTCTTCTTCATTCCC
ATGGTCCACTTCCCTTCGAATGATGGTAAGCCATTGCAAGACACAATTGTCGACGGAGAG
CTCGTGCTCACAAAGGCCGAACCCCGATCCCTGCATTTCCTCATGTTTGACTGTCTTGCC
TGCAACAAGATTCTGCTTACAGGCCGACCTTTGGACAAGAGATTGGGGTACCTCAATGCT
GCCATCTCGCATCCTCTGAAAGAGTACCTGCACAAGAACCCCGAAGTTGCCCGCGACTTT
CCCTTCAGTGTGCGCGTCAAGGATATGCAGTTTGCTTACAATGTGATGAACGTCTTTGCC
AGCTTCCCGCACCTTCCGCACATCACAGATGGTCTCATCTTCACCTGTAGGGACCACCCT
TACGTCTCTGGAACAGACGAGCGAATTCTCAAGTGGAAAAAACAAGACGAAAACTCGGTA
GACTTTCTGATGACCATGAAGTTTCCTATTTTCGAGGACACAAATGGTGAGAGTTGGACA
GACTACGACGCAAAACCCGAAATCACACTCCTGGTGTGGACAGGACGGGACGGAAGCCGA
CCTTACGGCGAGCTGTATCTCACTGACGAGGAATGGGATAACTTGAAGGCGCTGGAGGAG
CCTCTGGAAGAGCGGGTCGTGGAATGTATTAAGGATGATAAGAAGCGATGGAGATATCTT
AGATTCCGGGATGACAAGACGAATGCCAACTACATCACAACTGTCGAGAAGGTCATTGAC
TCTATTGATGATCCCGTCTCAGAGAAGAATCTGTTAGATGCCGCTCCCAAAATTAAGGAG
CTGTGGAAGGAACGAAATCGACGCCCACGCGACGAAGATAGAAAGCGAGTAGGAGGGGAC
GACCATGATCACGGTGCCAAAAGAGCTCGACAGTAA

Predicted translation product

>YALI0E02904g.aa
MSGIVPEIPGEQAPPDAAHQLKVDVARLLQKPKLNFPGAQPVSFARKHIEEELFKRDYYV
CEKSDGLRCLMYVTWENNPDTGPQQVTYLITRNNEFFFIPMVHFPSNDGKPLQDTIVDGE
LVLTKAEPRSLHFLMFDCLACNKILLTGRPLDKRLGYLNAAISHPLKEYLHKNPEVARDF
PFSVRVKDMQFAYNVMNVFASFPHLPHITDGLIFTCRDHPYVSGTDERILKWKKQDENSV
DFLMTMKFPIFEDTNGESWTDYDAKPEITLLVWTGRDGSRPYGELYLTDEEWDNLKALEE
PLEERVVECIKDDKKRWRYLRFRDDKTNANYITTVEKVIDSIDDPVSEKNLLDAAPKIKE
LWKERNRRPRDEDRKRVGGDDHDHGAKRARQ*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites