SACE0P01254g
Beta (RNA 5'-triphosphatase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CEG1, a guanylyltransferase) involved in adding the 5' cap to mRNA; the mammalian enzyme is a single bifunctional polypeptide
Element type: CDS
Element length: 1650 nucleotides, on sense strand of Sace0P: 118382..120031.
Other names:
CES5
CET1
YPL228W
Coding sequence: 550 codons.
Database cross references:
ArrayExpress: O13297
CYGD: YPL228w
EMBL: AB008799
EMBL: BAA23522.1
EMBL: CAA64259.1
EMBL: CAA97943.1
EMBL: CAA97944.1
EMBL: X94561
EMBL: Z73583
EMBL: Z73584
Ensembl: YPL228W
GenomeReviews: U00094_GR
HOGENOM: O13297
Homologs and Orthologs
Homologs in protein families: GL3C0218 GL3C0218.N2
Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
DIP: DIP:2299N
GermOnline: YPL228W
IntAct: O13297
InterPro: IPR004206
LinkHub: O13297
PDB: 1D8H
PDB: 1D8I
PDBsum: 1D8H
PDBsum: 1D8I
PIR: S61706
PeptideAtlas: O13297
Pfam: PF02940
SGD: S000006149
UniProtKB/Swiss-Prot: O13297
UniProtKB: CET1_YEAST
Gene Ontology terms 
| GO:0042802 | identical protein binding |
| GO:0031533 | mRNA capping enzyme complex |
| GO:0006370 | mRNA capping |
| GO:0004651 | polynucleotide 5'-phosphatase activity |
Sequence data 
Nucleotide sequence
>SACE0P01254g.nt GTTGTCCCAGATATAGACCACTCCTCCATAATATACAAGAACAACATCTGCAAATCTTTC AAAGATGACTTATTTTTCTGTCCAAGATCTTTACTTTCTCTCGAAGAACAACAAGCATGC GAGAAAATGGATAGGCTGACCGCTGAACAAATGTCATTGTATCATCAGAACACGCAATCC AGTTCTAATCCTGGTTCTATGTCTTCTTCACCTCCAAATTCTGCTTCTTCTATATTCAAC TCTAGGCCGAAGTTCAATCCTTATACATCTCAAAGTTTTAATCCTTTGGAAAGTGTTCAA GAATGATCGCGATATGGACAAACACGTTGTTTTGATTTCTTTTTTGCATATTCCTCATTT GAACAAATCTTGTGTCCATGTTATGTTACGTTATGTTATATTATGTCAAGCCGTTTCATT TCCTACTTATTTTTAATTATCGGTTTCTTCTATTCATGTAGTTCACTATTCAATTAGTAT ATGTAGATAAAAGGAGAAGGGTTTTATCTGCAAACAGACGCAAGACGTATGTACTAAGCA CTAAATACTCATATAACCATATTCTTTCAAAAATACATGCCATTTTTCTGTTTCATAATA ATCCTGGAGGCTACAAGTATGGTGATATAATAAATCTTGAACCTTGTTTTACTGGAACTC CTAAAGTCAAGCCATACATAATGCCAGAAACAGGACCTGCGTAGATAACATATAACATGT ACCAGTCATGATTACCCAACGTGCCGTTACAAATGATATATAGACCTACATTTTAGTTAG TTTCGTCGTAAGTTATGGTAATAAAGGAAAAGTCCTCTAATGAATCTATAATATAGGTTC ACACAAATAAGAGTGATAATTAAAAGCGTATTCGACACTGAAAGATCTGCTGGGAATACT ATACTGATATTTCCAAAATATCCCTTATAAATTGAATCTGGAATAGCACTTCTTTTTTTA AAAAACCTTGAATTGTTGGTAGCATTTCTATCCTCCCACTATGAGTTACACTGACAACCC TCCTCAAACAAAAAGAGCTTTATCGTTAGACGATCTGGTGAATCACGATGAAAATGAAAA GGTTAAATTACAAAAATTAAGTGAGGCGGCTAATGGCAGCAGACCTTTTGCCGAAAATTT AGAATCTGATATAAATCAAACGGAAACGGGCCAAGCTGCTCCGATTGACAATTACAAGGA GAGTACTGGTCATGGCTCGCACTCACAAAAACCTAAATCACGCAAGTCATCTAATGATGA TGAAGAAACCGATACGGATGACGAAATGGGTGCAAGTGGAGAAATTAATTTTGATTCAGA AATGGACTTTGACTATGATAAACAACATAGAAATTTACTATCCAACGGATCACCTCCTAT GAATGATGGTAGTGATGCCAATGCGAAGTTAGAAAAGCCTTCTGATGATTCAATTCATCA GAATAGCAAGAGTGATGAAGAACAGAGAATACCGAAACAAGGTAATGAAGGGAACATTGC CAGCAACTATATAACCCAAGTACCTCTGCAAAAGCAGAAGCAAACTGAGAAGAAGATAGC GGGAAATGCAGTAGGAAGCGTGGTCAAGAAGGAAGAAGAAGCGAATGCAGCTGTAGATAA TATTTTTGAAGAGAAAGCTACTTTACAATCAAAAAAGAATAATATCAAGAGAGATTTGGA GGTTCTGAATGAAATATCTGCGTCTTCCAAGCCCAGTAAGTACAGGAATGTTCCAATTTG GGCACAAAAATGGAAACCTACTATCAAAGCTCTTCAAAGTATAAATGTGAAAGATCTCAA AATTGACCCATCTTTTTTAAACATTATTCCCGATGATGACTTAACAAAGTCAGTACAGGA CTGGGTTTATGCTACAATATACTCAATTGCTCCTGAACTAAGATCCTTCATTGAGTTAGA 
AATGAAATTTGGTGTTATTATTGATGCGAAAGGCCCAGATCGTGTAAATCCACCAGTTTC TTCACAATGTGTTTTCACTGAGCTTGATGCCCATCTAACGCCTAATATTGATGCATCTTT GTTCAAAGAGTTGAGCAAATATATTCGTGGTATTAGCGAAGTCACTGAAAATACAGGTAA ATTCAGTATTATTGAATCCCAGACAAGAGATTCCGTCTATAGAGTCGGACTATCCACGCA AAGACCAAGGTTTTTGAGAATGAGTACAGATATTAAGACTGGGAGGGTAGGACAATTTAT AGAGAAAAGACATGTAGCCCAACTACTATTATATTCACCAAAAGATAGTTACGACGTTAA AATCTCCCTAAACTTGGAATTACCTGTACCTGACAACGATCCGCCAGAAAAATATAAATC TCAAAGCCCAATTAGTGAAAGGACGAAAGACCGTGTTAGTTACATTCATAATGATTCCTG TACCAGAATTGATATTACAAAAGTCGAAAATCATAACCAAAATTCAAAAAGTAGACAATC AGAGACCACTCACGAAGTGGAACTAGAAATCAACACGCCTGCACTGTTAAACGCCTTTGA TAACATAACGAACGATAGTAAAGAATATGCATCTCTTATTAGAACGTTTCTGAATAATGG TACAATTATTAGAAGAAAGTTATCGTCTTTATCATATGAAATTTTTGAAGGTTCAAAGAA AGTCATGTAATATTTGAATCATTTCAAAAAAAAATAAGCAAATGCCCTTGAGCGAGAAAT TTTTTTTGTTTACTCAAATGCTGTTATGAAAGCTTATGTAATAATAATAATAATGATAAT AATAACATAAAAATAATTACTCTAACATTTCTTATTATCTCTATATATCCCTAATAAATA GGCCATTCTTATAATAACCAAGTCTTTCGCCATTTTGATACTATCAATAGCTAAAGCCAT CTTAGAGCCATCAACCTCATGCCAGGAAATTGGTATCTCCTCAATTTGGATTCTTTTTCT GATGGCTAAG
Coding sequence
>SACE0P01254g.cds ATGAGTTACACTGACAACCCTCCTCAAACAAAAAGAGCTTTATCGTTAGACGATCTGGTG AATCACGATGAAAATGAAAAGGTTAAATTACAAAAATTAAGTGAGGCGGCTAATGGCAGC AGACCTTTTGCCGAAAATTTAGAATCTGATATAAATCAAACGGAAACGGGCCAAGCTGCT CCGATTGACAATTACAAGGAGAGTACTGGTCATGGCTCGCACTCACAAAAACCTAAATCA CGCAAGTCATCTAATGATGATGAAGAAACCGATACGGATGACGAAATGGGTGCAAGTGGA GAAATTAATTTTGATTCAGAAATGGACTTTGACTATGATAAACAACATAGAAATTTACTA TCCAACGGATCACCTCCTATGAATGATGGTAGTGATGCCAATGCGAAGTTAGAAAAGCCT TCTGATGATTCAATTCATCAGAATAGCAAGAGTGATGAAGAACAGAGAATACCGAAACAA GGTAATGAAGGGAACATTGCCAGCAACTATATAACCCAAGTACCTCTGCAAAAGCAGAAG CAAACTGAGAAGAAGATAGCGGGAAATGCAGTAGGAAGCGTGGTCAAGAAGGAAGAAGAA GCGAATGCAGCTGTAGATAATATTTTTGAAGAGAAAGCTACTTTACAATCAAAAAAGAAT AATATCAAGAGAGATTTGGAGGTTCTGAATGAAATATCTGCGTCTTCCAAGCCCAGTAAG TACAGGAATGTTCCAATTTGGGCACAAAAATGGAAACCTACTATCAAAGCTCTTCAAAGT ATAAATGTGAAAGATCTCAAAATTGACCCATCTTTTTTAAACATTATTCCCGATGATGAC TTAACAAAGTCAGTACAGGACTGGGTTTATGCTACAATATACTCAATTGCTCCTGAACTA AGATCCTTCATTGAGTTAGAAATGAAATTTGGTGTTATTATTGATGCGAAAGGCCCAGAT CGTGTAAATCCACCAGTTTCTTCACAATGTGTTTTCACTGAGCTTGATGCCCATCTAACG CCTAATATTGATGCATCTTTGTTCAAAGAGTTGAGCAAATATATTCGTGGTATTAGCGAA GTCACTGAAAATACAGGTAAATTCAGTATTATTGAATCCCAGACAAGAGATTCCGTCTAT AGAGTCGGACTATCCACGCAAAGACCAAGGTTTTTGAGAATGAGTACAGATATTAAGACT GGGAGGGTAGGACAATTTATAGAGAAAAGACATGTAGCCCAACTACTATTATATTCACCA AAAGATAGTTACGACGTTAAAATCTCCCTAAACTTGGAATTACCTGTACCTGACAACGAT CCGCCAGAAAAATATAAATCTCAAAGCCCAATTAGTGAAAGGACGAAAGACCGTGTTAGT TACATTCATAATGATTCCTGTACCAGAATTGATATTACAAAAGTCGAAAATCATAACCAA AATTCAAAAAGTAGACAATCAGAGACCACTCACGAAGTGGAACTAGAAATCAACACGCCT GCACTGTTAAACGCCTTTGATAACATAACGAACGATAGTAAAGAATATGCATCTCTTATT AGAACGTTTCTGAATAATGGTACAATTATTAGAAGAAAGTTATCGTCTTTATCATATGAA ATTTTTGAAGGTTCAAAGAAAGTCATGTAA
Predicted translation product
>SACE0P01254g.aa MSYTDNPPQTKRALSLDDLVNHDENEKVKLQKLSEAANGSRPFAENLESDINQTETGQAA PIDNYKESTGHGSHSQKPKSRKSSNDDEETDTDDEMGASGEINFDSEMDFDYDKQHRNLL SNGSPPMNDGSDANAKLEKPSDDSIHQNSKSDEEQRIPKQGNEGNIASNYITQVPLQKQK QTEKKIAGNAVGSVVKKEEEANAAVDNIFEEKATLQSKKNNIKRDLEVLNEISASSKPSK YRNVPIWAQKWKPTIKALQSINVKDLKIDPSFLNIIPDDDLTKSVQDWVYATIYSIAPEL RSFIELEMKFGVIIDAKGPDRVNPPVSSQCVFTELDAHLTPNIDASLFKELSKYIRGISE VTENTGKFSIIESQTRDSVYRVGLSTQRPRFLRMSTDIKTGRVGQFIEKRHVAQLLLYSP KDSYDVKISLNLELPVPDNDPPEKYKSQSPISERTKDRVSYIHNDSCTRIDITKVENHNQ NSKSRQSETTHEVELEINTPALLNAFDNITNDSKEYASLIRTFLNNGTIIRRKLSSLSYE IFEGSKKVM*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
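The relation between the coordinates, the codon count and the protein length can be checked with a few lines of arithmetic. The sketch below is only illustrative: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the element is assumed to contain no intron (so that the element length equals the coding length, as the values on this page suggest).

```python
# Values copied from this page; the helper names are illustrative only.
START, END = 118382, 120031   # assumed 1-based, inclusive coordinates on Sace0P
CODONS = 550                  # "Coding sequence: 550 codons"


def element_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(codons: int) -> int:
    """Residues in the product: every codon except the final stop codon."""
    return codons - 1


nt = element_length(START, END)
# Element length in nucleotides, codon count, and protein length in residues.
print(nt, nt // 3, protein_length(CODONS))  # prints: 1650 550 549
```

The codon count includes the stop codon, so the translation product shown on this page has one residue fewer than the number of codons.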
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily so that different protein families have different colors in the map.
Protein domain map
Domains are extracted from Swiss-Prot files. Click on the symbol of a domain to extract the domain sequence.
GeneMark image and list
The GeneMark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.
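The analysis window described above can be derived from the element coordinates as sketched here. This is a minimal sketch under stated assumptions: the function names are hypothetical, the coordinates are taken as 1-based and inclusive, the element lies on the forward strand, and the window is not clipped by the ends of the chromosome. The numbering convention behind the frame label is not stated on this page, so the sketch does not attempt to derive the frame number.

```python
# Flank sizes as stated in the legend; all names below are illustrative only.
UPSTREAM, DOWNSTREAM = 1000, 300


def analysis_window(start: int, end: int,
                    upstream: int = UPSTREAM,
                    downstream: int = DOWNSTREAM) -> tuple[int, int]:
    """Window bounds in 1-based inclusive genomic coordinates (forward strand)."""
    return start - upstream, end + downstream


def orf_start_in_window(upstream: int = UPSTREAM) -> int:
    """1-based position of the first ORF nucleotide inside the window."""
    return upstream + 1


w_start, w_end = analysis_window(118382, 120031)
window_length = w_end - w_start + 1
print(w_start, w_end, window_length, orf_start_in_window())
# prints: 117382 120331 2950 1001
```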
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/SACE/SACE0P01254p