YALI0E02904g
similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130w CEG1 Alpha (guanylyltransferase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CET1, an RNA 5'-triphophatase) involved in adding the 5' cap to mRNA
Element type: CDS
Element length: 1176 nucleotides,
on sense strand of
Yali0E: 318087..319262.
Other names:
YALI-CDS3337.1
YALI-IPF2453
Coding sequence: 392 codons.
Element length: 1176 nucleotides,
on sense strand of
Yali0E: 318087..319262.
Other names:
YALI-CDS3337.1
YALI-IPF2453
Coding sequence: 392 codons.
Database cross references:
EMBL: CR382131
GeneID: 2912122
HOGENOM: Q6C783
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR382131
GeneID: 2912122
HOGENOM: Q6C783
Homologs and Orthologs
Homologs in protein families: GL3R1787 GL3R1787.N1Orthologs: strict determination not possible; homologs must be refined manually
Protein YALI0E02904p 
similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130w CEG1 mRNA guanylyltransferase (mRNA capping enzyme alpha subunit)
Protein domain map
Database cross references:
Gene3D: G3DSA:2.40.50.140
InterPro: IPR001339
InterPro: IPR012340
InterPro: IPR013846
InterPro: IPR017075
KEGG: yli:YALI0E02904g
PIRSF: PIRSF036959
Pfam: PF01331
Pfam: PF03919
RefSeq: XP_503479.1
UniProtKB/Swiss-Prot: Q6C783
UniprotKB: MCE1_YARLI
Gene3D: G3DSA:2.40.50.140
InterPro: IPR001339
InterPro: IPR012340
InterPro: IPR013846
InterPro: IPR017075
KEGG: yli:YALI0E02904g
PIRSF: PIRSF036959
Pfam: PF01331
Pfam: PF03919
RefSeq: XP_503479.1
UniProtKB/Swiss-Prot: Q6C783
UniprotKB: MCE1_YARLI
Sequence data 
Nucleotide sequence
>YALI0E02904g.nt TCTTAGCAGCAGCGGATGCCGTAGAAGCGAGCTTGCTGGGCGCGCCGGTGTCGATGACAA TATCAATGTCAGAGTCGTCGGATTCGTATTCAGAATCGGAATCATCTTCCGGTTCTTGTT GCTTCGTGGTCTGCGAGTGTGGTTCCTCATCATCCTCAGCTCCCTCGGGCTCAACTCGGG CCCGCTTTACCGCAGGCTCCTCGGTGCCGTAGAGAAAGTCGTCGTCGTCATCCATCTTTG ATGTCAGATGGGGGGATGGTGATGTTGCCGTGGTGGGTGTCGTTTGGCGTAGTGCTTTGC AGAAACCTTTCTTTAGGTTTGATGGTGGCCCAATACGAGGAGAGAATGTGGCGAAATGGG GAGTTTTGAGGGTATATGTATGAGAAATTTGAGATTTCTGAACCAAATAAATTTACAGTC ATATTATCTCTTTAAAAAAGCATCTCTAGACTGCTAAAGACGTATTAGCGTCCCTCAAAC ATGGCTGACTAACACAACTCTATTTGGCCACACCAAACACTGCTGATGTATGGCGGCGGT GGTGATATGACTGCTTCATTGCTGCTGGTTAAACAACATACAAGTATGTACTCGTACTTA CAAATAACGTGAAGAAGTCACCTGTGCTAGCCCATCAAAACAGAAATGTTTTACAATAGT TTGGAGAATATCGGTACAAGTATTTCGAACAGACGCTTTAGTAAGACATTGGATACTCCT ATAAGGAACTCATGTCGTTTCTCGTTCTACTTCGGTGCGAAAACTTTCAGAAGGACACTG CAGAGGAAGACATTCATACCTGTACCTTCATGAAATCATCCCCCCTACAGTACAAAGAAT CGAGCCCCTTTCTACTGTAGTTCAGAAGCAATAGGGTTATGAAGGACCTTAACGAAGAGC TAAATACCACCACAATACCACACTCCATTCACCACAACCACCTGCATCAGGACGGCATTA GGTTCAGCCATCTCAAATTTCACACCCCACCACCATCATCATGTCTGGAATCGTCCCCGA AATTCCAGGCGAACAAGCGCCCCCAGATGCTGCTCACCAGCTCAAGGTGGATGTTGCGCG CCTCTTGCAAAAACCCAAGCTCAACTTTCCTGGTGCTCAGCCCGTCTCATTCGCCAGAAA ACACATTGAAGAGGAGCTCTTCAAGCGAGATTATTATGTGTGCGAGAAGAGTGACGGTCT TCGATGTCTCATGTACGTGACATGGGAAAATAACCCCGACACTGGCCCCCAGCAGGTCAC GTATCTGATCACACGAAACAACGAGTTCTTCTTCATTCCCATGGTCCACTTCCCTTCGAA TGATGGTAAGCCATTGCAAGACACAATTGTCGACGGAGAGCTCGTGCTCACAAAGGCCGA ACCCCGATCCCTGCATTTCCTCATGTTTGACTGTCTTGCCTGCAACAAGATTCTGCTTAC AGGCCGACCTTTGGACAAGAGATTGGGGTACCTCAATGCTGCCATCTCGCATCCTCTGAA AGAGTACCTGCACAAGAACCCCGAAGTTGCCCGCGACTTTCCCTTCAGTGTGCGCGTCAA GGATATGCAGTTTGCTTACAATGTGATGAACGTCTTTGCCAGCTTCCCGCACCTTCCGCA CATCACAGATGGTCTCATCTTCACCTGTAGGGACCACCCTTACGTCTCTGGAACAGACGA GCGAATTCTCAAGTGGAAAAAACAAGACGAAAACTCGGTAGACTTTCTGATGACCATGAA GTTTCCTATTTTCGAGGACACAAATGGTGAGAGTTGGACAGACTACGACGCAAAACCCGA AATCACACTCCTGGTGTGGACAGGACGGGACGGAAGCCGACCTTACGGCGAGCTGTATCT CACTGACGAGGAATGGGATAACTTGAAGGCGCTGGAGGAGCCTCTGGAAGAGCGGGTCGT GGAATGTATTAAGGATGATAAGAAGCGATGGAGATATCTTAGATTCCGGGATGACAAGAC GAATGCCAACTACATCACAACTGTCGAGAAGGTCATTGACTCTATTGATGATCCCGTCTC AGAGAAGAATCTGTTAGATGCCGCTCCCAAAATTAAGGAGCTGTGGAAGGAACGAAATCG ACGCCCACGCGACGAAGATAGAAAGCGAGTAGGAGGGGACGACCATGATCACGGTGCCAA AAGAGCTCGACAGTAAAGCCTCAACCCAGATCAAGTAAATTATATATTGTAATATTAGCA CACATACTTCAAAACCATATGGGAAAGGACCACAAGTAGACACTACAGTATGTACATACA TACAGTACACGACAAAAAGTTATGGAGAGTGATGGGAATAAATAAAAAAGCAAGCAAAAA AGGTTTCTCCGATACGGGGAATCGAACCCCGGTCTCCACGGTTCTCAAGATGAGAGCGTG ATGTGATAGCCCCTACACTATATCGGAAGTTGAAAATCAGCCCGACGTGGGGCTTGAACC CACAGCCTTGAGATCA
Coding sequence
>YALI0E02904g.cds ATGTCTGGAATCGTCCCCGAAATTCCAGGCGAACAAGCGCCCCCAGATGCTGCTCACCAG CTCAAGGTGGATGTTGCGCGCCTCTTGCAAAAACCCAAGCTCAACTTTCCTGGTGCTCAG CCCGTCTCATTCGCCAGAAAACACATTGAAGAGGAGCTCTTCAAGCGAGATTATTATGTG TGCGAGAAGAGTGACGGTCTTCGATGTCTCATGTACGTGACATGGGAAAATAACCCCGAC ACTGGCCCCCAGCAGGTCACGTATCTGATCACACGAAACAACGAGTTCTTCTTCATTCCC ATGGTCCACTTCCCTTCGAATGATGGTAAGCCATTGCAAGACACAATTGTCGACGGAGAG CTCGTGCTCACAAAGGCCGAACCCCGATCCCTGCATTTCCTCATGTTTGACTGTCTTGCC TGCAACAAGATTCTGCTTACAGGCCGACCTTTGGACAAGAGATTGGGGTACCTCAATGCT GCCATCTCGCATCCTCTGAAAGAGTACCTGCACAAGAACCCCGAAGTTGCCCGCGACTTT CCCTTCAGTGTGCGCGTCAAGGATATGCAGTTTGCTTACAATGTGATGAACGTCTTTGCC AGCTTCCCGCACCTTCCGCACATCACAGATGGTCTCATCTTCACCTGTAGGGACCACCCT TACGTCTCTGGAACAGACGAGCGAATTCTCAAGTGGAAAAAACAAGACGAAAACTCGGTA GACTTTCTGATGACCATGAAGTTTCCTATTTTCGAGGACACAAATGGTGAGAGTTGGACA GACTACGACGCAAAACCCGAAATCACACTCCTGGTGTGGACAGGACGGGACGGAAGCCGA CCTTACGGCGAGCTGTATCTCACTGACGAGGAATGGGATAACTTGAAGGCGCTGGAGGAG CCTCTGGAAGAGCGGGTCGTGGAATGTATTAAGGATGATAAGAAGCGATGGAGATATCTT AGATTCCGGGATGACAAGACGAATGCCAACTACATCACAACTGTCGAGAAGGTCATTGAC TCTATTGATGATCCCGTCTCAGAGAAGAATCTGTTAGATGCCGCTCCCAAAATTAAGGAG CTGTGGAAGGAACGAAATCGACGCCCACGCGACGAAGATAGAAAGCGAGTAGGAGGGGAC GACCATGATCACGGTGCCAAAAGAGCTCGACAGTAA
Predicted translation product
>YALI0E02904g.aa MSGIVPEIPGEQAPPDAAHQLKVDVARLLQKPKLNFPGAQPVSFARKHIEEELFKRDYYV CEKSDGLRCLMYVTWENNPDTGPQQVTYLITRNNEFFFIPMVHFPSNDGKPLQDTIVDGE LVLTKAEPRSLHFLMFDCLACNKILLTGRPLDKRLGYLNAAISHPLKEYLHKNPEVARDF PFSVRVKDMQFAYNVMNVFASFPHLPHITDGLIFTCRDHPYVSGTDERILKWKKQDENSV DFLMTMKFPIFEDTNGESWTDYDAKPEITLLVWTGRDGSRPYGELYLTDEEWDNLKALEE PLEERVVECIKDDKKRWRYLRFRDDKTNANYITTVEKVIDSIDDPVSEKNLLDAAPKIKE LWKERNRRPRDEDRKRVGGDDHDHGAKRARQ*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/YALI/YALI0E02904g