SAKL0A04114g
highly similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130W CEG1 Alpha (guanylyltransferase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CET1, an RNA 5'-triphophatase) involved in adding the 5' cap to mRNA
Element type: CDS
Element length: 1398 nucleotides,
on sense strand of
Sakl0A: 386331..387728.
Other names:
SAKL-ORF15333
Coding sequence: 466 codons.
Element length: 1398 nucleotides,
on sense strand of
Sakl0A: 386331..387728.
Other names:
SAKL-ORF15333
Coding sequence: 466 codons.
Homologs and Orthologs
Homologs in protein families: GL3R1787 GL3R1787.F1 GL3R1787.N1Orthologs by synteny: ZYRO0E07150g KLTH0H03850g KLLA0B02200g ERGO0F02992g
Protein SAKL0A04114p 
highly similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130W CEG1 mRNA guanylyltransferase (mRNA capping enzyme), alpha subunit
Protein domain map
Sequence data 
>SAKL0A04114g.nt CGTAATATTTGAGTGTGTATCTATCAAATAATTGGTCAAAATAGAACGTGTTTGCTTTTG TAATATATTCATGAGCAGAATCAGTAGAGAAGTCTAACTCAGATCTGTGGAGCACTGTGC ACTTACCTCTATAAGATATTAAAGGACAAACATCTGTATGCAAAGAGGCGTAAATTAACC TTGGATCTACATTGTGCACATTATCGCGAATATCACGGGATCTATAGTACCAGTTCATCT TTAAATTAAAATATTTTGCCGGAAACACAGCTGTAACCTTCCTCGAATTTTCAATAATCA TATGATACTCAGGTTTGCAAACAAACTCCATAATTCTACCAATGTAATAAGGTTCACCAG GAGGTTCGGATATCATGTAAATATGGTCGTCTTTTCTCAGTACCGTAGCAGTTTCATGAA GTAAAGTTTGCTCCTTGATGTTAACCATGGCACCATCTAGTTCTAATACGTTAGAAAATC TGCTGTTCTTTCGAAATGATGGTGGTAGAGAAGGTATGAAATTCCAAGTTGTGGCCTTAT CGTTCAAAAAACCTTGATATTTGAACCCATTTCTCTTTTTGTTTGAGCCACTTCCACTTT TGCTTTTACCCTTGGTTTTTTCTAATTGTTTAATCTTCTTAGCTAGTTCTGCATCTGCCC TTTTCTCATTGTAGTTGATCTTCTTCATCGTAGTACGTCTTTTGGCCAAAAAAAGTGGAG TGTCATCGGAGCTCATCGCGAATTTTCTTGAAAATTGATTAATAATTCCCTTCAATATCA ATCAGACTATCCCGAATCCTCAGATTTGATCAATCTTTTGCCAACTTGTTAGTTTTTATT TTTATTTTTATTTTTTTTTTTTTTTTTTGGATTCTGAGAGAGTGTCTTAACATCGCAAAA AGATTTTCGTGAGGAAAATCAAACTAAAAGGTTGATACAGAACTCATTGAATGTTTATTG GTTCAATTAAGCTTGAAACAACGTGAAGAGGGATATAATCATGGATCATAATCGTATTTC CCCAGAGATACCGGGTATTAGGCAGCCAGCAAATGTAACTAGTGACATTCGGATGTTGAT TTGTAAACTATTGAACTCCCCAAAACCAGCAAAAACTTTCCCAGGTTCACAACCAGTCTC GTTCCAACATAGCGACATTGAAGAGAAACTACTGCAACAAGATTACTATGTCTGTGAGAA GACAGATGGGTTACGTGCTTTGATGCTTATTATAATTAACCCAGTTACGAAAGAGCAGGG CTGTTTTATGATTGATAGAGAGAATAACTACTACTTAGTGAATGGATTCAGGTTTCCCCG CCTACCAAAGCAAAATCGAAAGGAATTACTTGAAACGTTCCAAGACGGTACTTTAGTTGA TGGAGAACTAGTCATTCAAACCAATTCTATTACCAAAATAAGGGAAATGCGTTACTTGAT GTTTGATTGTTTGGCCATTAATGGCCGTTCCCTTCTGCAATCTCCCACCAGCTCTCGTTT GGCCCATTTATGCAAGGATTTTTTCCGCCCATATTATGACCTACGCTCACTGTACCCTGA TCACTGCACCAATTTTCCTTTCAAAATATCTGTGAAGCTTATGCACTTTAGCTACGATTT GGTCAAAGTTGCCAGTACATTAGATAAATTGCCACACGTTTCTGATGGCTTAATTTTTAC CCCAGTCACAACTCCATATTACGTTGGCGGAAAGGATTCTTTCTTGCTGAAATGGAAGCC TGAGCAAGAAAATACTGTAGATTTCAAAATGATTTTGGACATCCCGGTGGTTGAAGACGA ATCTCTACCCAAAAACGACTCGAATAGGTTTTATTACAATTATGATGTCAAACCGTCTTT TCACCTATATGTTTGGCAGGGTGGTGCTGACGTCAATAATAGATTGCACGACTTTGAACA ACCATTTTCTAAAAAGGAGCTGGAAATTTTGGACAGAACCTATAAAAAATTTGCAGAATT AGAAATAGGTGACGAGCAATGGAACCTGCTGAAAAATTTGGAGGAACCGTTGAATGGTAG AATTGTGGAGTGCTCAAAAGATCAAGAGACCGGTGTTTGGAAAATGTTACGTTTTCGGGA TGACAAATTGAATGGTAATCACATCTCTGTTGTTCAGAAGGTGTTAGAAAGTATTAGCGA TTCTGTGAAGTTAGAAGATTTGGAGGAAGTTGTTGATAAAATAAAGAGTAACTGGAGTGT CAGGCAAGCAGAAAAGAAGAGGGGATTTGAAAGTATTAAGGGAGCTGCACCTCCTTCGAT TCCTAGACAAAAGCAACAAACTACCGAGGTAGCGGAACAACCAAAATATGTTGACGACGA AGATTGGTCTGATGAAAGTGATGAGGACGCTGTTGACTTAAAAAAGCAGAAACTGTAGAT ATGATAAGATTTGTACCGTAGATGTAGATGATACATATTCATTTCTTTTAAGGATAGTAT TTACACGTATGGAAGAGAAATATAAAAACTTGAAGGCTTTATGTACAGTAGGAGACTGTC TTAAGAGTATTGTAGAATAATACTTTTCAACAGTTCTCTTGGGTTACCGTTACCACTTAG ACAGTACTTCTCATCGACTAACTGCTCAAGAGTTTTTTCTTGAGCATCTCTATCTAAGAC AATCTCAGACTTCAAGTAGAATTCTAACAATTGCTTTACTTCTTCTTTGTCTAATTTT
>SAKL0A04114g.cds ATGGATCATAATCGTATTTCCCCAGAGATACCGGGTATTAGGCAGCCAGCAAATGTAACT AGTGACATTCGGATGTTGATTTGTAAACTATTGAACTCCCCAAAACCAGCAAAAACTTTC CCAGGTTCACAACCAGTCTCGTTCCAACATAGCGACATTGAAGAGAAACTACTGCAACAA GATTACTATGTCTGTGAGAAGACAGATGGGTTACGTGCTTTGATGCTTATTATAATTAAC CCAGTTACGAAAGAGCAGGGCTGTTTTATGATTGATAGAGAGAATAACTACTACTTAGTG AATGGATTCAGGTTTCCCCGCCTACCAAAGCAAAATCGAAAGGAATTACTTGAAACGTTC CAAGACGGTACTTTAGTTGATGGAGAACTAGTCATTCAAACCAATTCTATTACCAAAATA AGGGAAATGCGTTACTTGATGTTTGATTGTTTGGCCATTAATGGCCGTTCCCTTCTGCAA TCTCCCACCAGCTCTCGTTTGGCCCATTTATGCAAGGATTTTTTCCGCCCATATTATGAC CTACGCTCACTGTACCCTGATCACTGCACCAATTTTCCTTTCAAAATATCTGTGAAGCTT ATGCACTTTAGCTACGATTTGGTCAAAGTTGCCAGTACATTAGATAAATTGCCACACGTT TCTGATGGCTTAATTTTTACCCCAGTCACAACTCCATATTACGTTGGCGGAAAGGATTCT TTCTTGCTGAAATGGAAGCCTGAGCAAGAAAATACTGTAGATTTCAAAATGATTTTGGAC ATCCCGGTGGTTGAAGACGAATCTCTACCCAAAAACGACTCGAATAGGTTTTATTACAAT TATGATGTCAAACCGTCTTTTCACCTATATGTTTGGCAGGGTGGTGCTGACGTCAATAAT AGATTGCACGACTTTGAACAACCATTTTCTAAAAAGGAGCTGGAAATTTTGGACAGAACC TATAAAAAATTTGCAGAATTAGAAATAGGTGACGAGCAATGGAACCTGCTGAAAAATTTG GAGGAACCGTTGAATGGTAGAATTGTGGAGTGCTCAAAAGATCAAGAGACCGGTGTTTGG AAAATGTTACGTTTTCGGGATGACAAATTGAATGGTAATCACATCTCTGTTGTTCAGAAG GTGTTAGAAAGTATTAGCGATTCTGTGAAGTTAGAAGATTTGGAGGAAGTTGTTGATAAA ATAAAGAGTAACTGGAGTGTCAGGCAAGCAGAAAAGAAGAGGGGATTTGAAAGTATTAAG GGAGCTGCACCTCCTTCGATTCCTAGACAAAAGCAACAAACTACCGAGGTAGCGGAACAA CCAAAATATGTTGACGACGAAGATTGGTCTGATGAAAGTGATGAGGACGCTGTTGACTTA AAAAAGCAGAAACTGTAG
>SAKL0A04114g.aa MDHNRISPEIPGIRQPANVTSDIRMLICKLLNSPKPAKTFPGSQPVSFQHSDIEEKLLQQ DYYVCEKTDGLRALMLIIINPVTKEQGCFMIDRENNYYLVNGFRFPRLPKQNRKELLETF QDGTLVDGELVIQTNSITKIREMRYLMFDCLAINGRSLLQSPTSSRLAHLCKDFFRPYYD LRSLYPDHCTNFPFKISVKLMHFSYDLVKVASTLDKLPHVSDGLIFTPVTTPYYVGGKDS FLLKWKPEQENTVDFKMILDIPVVEDESLPKNDSNRFYYNYDVKPSFHLYVWQGGADVNN RLHDFEQPFSKKELEILDRTYKKFAELEIGDEQWNLLKNLEEPLNGRIVECSKDQETGVW KMLRFRDDKLNGNHISVVQKVLESISDSVKLEDLEEVVDKIKSNWSVRQAEKKRGFESIK GAAPPSIPRQKQQTTEVAEQPKYVDDEDWSDESDEDAVDLKKQKL*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/SAKL/SAKL0A04114g