SACE0P01254g


Beta (RNA 5'-triphosphatase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CEG1, a guanylyltransferase) involved in adding the 5' cap to mRNA; the mammalian enzyme is a single bifunctional polypeptide

Genomic environment map

Element type: CDS
Element length: 1650 nucleotides,
on sense strand of
Sace0P: 118382..120031.
Other names:
CES5
CET1
YPL228W
Coding sequence: 550 codons.
Database cross references:
ArrayExpress: O13297
CYGD: YPL228w
EMBL: AB008799
EMBL: BAA23522.1
EMBL: CAA64259.1
EMBL: CAA97943.1
EMBL: CAA97944.1
EMBL: X94561
EMBL: Z73583
EMBL: Z73584
Ensembl: YPL228W
GenomeReviews: U00094_GR
HOGENOM: O13297

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0218 GL3C0218.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein SACE0P01254p  


Protein domain map

Protein length: 549 amino acids
Protein family: GL3C0218
Database cross references:
DIP: DIP:2299N
GermOnline: YPL228W
IntAct: O13297
InterPro: IPR004206
LinkHub: O13297
PDB: 1D8H
PDB: 1D8I
PDBsum: 1D8H
PDBsum: 1D8I
PIR: S61706
PeptideAtlas: O13297
Pfam: PF02940
SGD: S000006149
UniProtKB/Swiss-Prot: O13297
UniprotKB: CET1_YEAST

Computed results for SACE0P01254p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0042802 identical protein binding
GO:0031533 mRNA capping enzyme complex
GO:0006370 mRNA capping
GO:0004651 polynucleotide 5'-phosphatase activity

Sequence data  


Nucleotide sequence    

>SACE0P01254g.nt
GTTGTCCCAGATATAGACCACTCCTCCATAATATACAAGAACAACATCTGCAAATCTTTC
AAAGATGACTTATTTTTCTGTCCAAGATCTTTACTTTCTCTCGAAGAACAACAAGCATGC
GAGAAAATGGATAGGCTGACCGCTGAACAAATGTCATTGTATCATCAGAACACGCAATCC
AGTTCTAATCCTGGTTCTATGTCTTCTTCACCTCCAAATTCTGCTTCTTCTATATTCAAC
TCTAGGCCGAAGTTCAATCCTTATACATCTCAAAGTTTTAATCCTTTGGAAAGTGTTCAA
GAATGATCGCGATATGGACAAACACGTTGTTTTGATTTCTTTTTTGCATATTCCTCATTT
GAACAAATCTTGTGTCCATGTTATGTTACGTTATGTTATATTATGTCAAGCCGTTTCATT
TCCTACTTATTTTTAATTATCGGTTTCTTCTATTCATGTAGTTCACTATTCAATTAGTAT
ATGTAGATAAAAGGAGAAGGGTTTTATCTGCAAACAGACGCAAGACGTATGTACTAAGCA
CTAAATACTCATATAACCATATTCTTTCAAAAATACATGCCATTTTTCTGTTTCATAATA
ATCCTGGAGGCTACAAGTATGGTGATATAATAAATCTTGAACCTTGTTTTACTGGAACTC
CTAAAGTCAAGCCATACATAATGCCAGAAACAGGACCTGCGTAGATAACATATAACATGT
ACCAGTCATGATTACCCAACGTGCCGTTACAAATGATATATAGACCTACATTTTAGTTAG
TTTCGTCGTAAGTTATGGTAATAAAGGAAAAGTCCTCTAATGAATCTATAATATAGGTTC
ACACAAATAAGAGTGATAATTAAAAGCGTATTCGACACTGAAAGATCTGCTGGGAATACT
ATACTGATATTTCCAAAATATCCCTTATAAATTGAATCTGGAATAGCACTTCTTTTTTTA
AAAAACCTTGAATTGTTGGTAGCATTTCTATCCTCCCACTATGAGTTACACTGACAACCC
TCCTCAAACAAAAAGAGCTTTATCGTTAGACGATCTGGTGAATCACGATGAAAATGAAAA
GGTTAAATTACAAAAATTAAGTGAGGCGGCTAATGGCAGCAGACCTTTTGCCGAAAATTT
AGAATCTGATATAAATCAAACGGAAACGGGCCAAGCTGCTCCGATTGACAATTACAAGGA
GAGTACTGGTCATGGCTCGCACTCACAAAAACCTAAATCACGCAAGTCATCTAATGATGA
TGAAGAAACCGATACGGATGACGAAATGGGTGCAAGTGGAGAAATTAATTTTGATTCAGA
AATGGACTTTGACTATGATAAACAACATAGAAATTTACTATCCAACGGATCACCTCCTAT
GAATGATGGTAGTGATGCCAATGCGAAGTTAGAAAAGCCTTCTGATGATTCAATTCATCA
GAATAGCAAGAGTGATGAAGAACAGAGAATACCGAAACAAGGTAATGAAGGGAACATTGC
CAGCAACTATATAACCCAAGTACCTCTGCAAAAGCAGAAGCAAACTGAGAAGAAGATAGC
GGGAAATGCAGTAGGAAGCGTGGTCAAGAAGGAAGAAGAAGCGAATGCAGCTGTAGATAA
TATTTTTGAAGAGAAAGCTACTTTACAATCAAAAAAGAATAATATCAAGAGAGATTTGGA
GGTTCTGAATGAAATATCTGCGTCTTCCAAGCCCAGTAAGTACAGGAATGTTCCAATTTG
GGCACAAAAATGGAAACCTACTATCAAAGCTCTTCAAAGTATAAATGTGAAAGATCTCAA
AATTGACCCATCTTTTTTAAACATTATTCCCGATGATGACTTAACAAAGTCAGTACAGGA
CTGGGTTTATGCTACAATATACTCAATTGCTCCTGAACTAAGATCCTTCATTGAGTTAGA
AATGAAATTTGGTGTTATTATTGATGCGAAAGGCCCAGATCGTGTAAATCCACCAGTTTC
TTCACAATGTGTTTTCACTGAGCTTGATGCCCATCTAACGCCTAATATTGATGCATCTTT
GTTCAAAGAGTTGAGCAAATATATTCGTGGTATTAGCGAAGTCACTGAAAATACAGGTAA
ATTCAGTATTATTGAATCCCAGACAAGAGATTCCGTCTATAGAGTCGGACTATCCACGCA
AAGACCAAGGTTTTTGAGAATGAGTACAGATATTAAGACTGGGAGGGTAGGACAATTTAT
AGAGAAAAGACATGTAGCCCAACTACTATTATATTCACCAAAAGATAGTTACGACGTTAA
AATCTCCCTAAACTTGGAATTACCTGTACCTGACAACGATCCGCCAGAAAAATATAAATC
TCAAAGCCCAATTAGTGAAAGGACGAAAGACCGTGTTAGTTACATTCATAATGATTCCTG
TACCAGAATTGATATTACAAAAGTCGAAAATCATAACCAAAATTCAAAAAGTAGACAATC
AGAGACCACTCACGAAGTGGAACTAGAAATCAACACGCCTGCACTGTTAAACGCCTTTGA
TAACATAACGAACGATAGTAAAGAATATGCATCTCTTATTAGAACGTTTCTGAATAATGG
TACAATTATTAGAAGAAAGTTATCGTCTTTATCATATGAAATTTTTGAAGGTTCAAAGAA
AGTCATGTAATATTTGAATCATTTCAAAAAAAAATAAGCAAATGCCCTTGAGCGAGAAAT
TTTTTTTGTTTACTCAAATGCTGTTATGAAAGCTTATGTAATAATAATAATAATGATAAT
AATAACATAAAAATAATTACTCTAACATTTCTTATTATCTCTATATATCCCTAATAAATA
GGCCATTCTTATAATAACCAAGTCTTTCGCCATTTTGATACTATCAATAGCTAAAGCCAT
CTTAGAGCCATCAACCTCATGCCAGGAAATTGGTATCTCCTCAATTTGGATTCTTTTTCT
GATGGCTAAG

Coding sequence    

>SACE0P01254g.cds
ATGAGTTACACTGACAACCCTCCTCAAACAAAAAGAGCTTTATCGTTAGACGATCTGGTG
AATCACGATGAAAATGAAAAGGTTAAATTACAAAAATTAAGTGAGGCGGCTAATGGCAGC
AGACCTTTTGCCGAAAATTTAGAATCTGATATAAATCAAACGGAAACGGGCCAAGCTGCT
CCGATTGACAATTACAAGGAGAGTACTGGTCATGGCTCGCACTCACAAAAACCTAAATCA
CGCAAGTCATCTAATGATGATGAAGAAACCGATACGGATGACGAAATGGGTGCAAGTGGA
GAAATTAATTTTGATTCAGAAATGGACTTTGACTATGATAAACAACATAGAAATTTACTA
TCCAACGGATCACCTCCTATGAATGATGGTAGTGATGCCAATGCGAAGTTAGAAAAGCCT
TCTGATGATTCAATTCATCAGAATAGCAAGAGTGATGAAGAACAGAGAATACCGAAACAA
GGTAATGAAGGGAACATTGCCAGCAACTATATAACCCAAGTACCTCTGCAAAAGCAGAAG
CAAACTGAGAAGAAGATAGCGGGAAATGCAGTAGGAAGCGTGGTCAAGAAGGAAGAAGAA
GCGAATGCAGCTGTAGATAATATTTTTGAAGAGAAAGCTACTTTACAATCAAAAAAGAAT
AATATCAAGAGAGATTTGGAGGTTCTGAATGAAATATCTGCGTCTTCCAAGCCCAGTAAG
TACAGGAATGTTCCAATTTGGGCACAAAAATGGAAACCTACTATCAAAGCTCTTCAAAGT
ATAAATGTGAAAGATCTCAAAATTGACCCATCTTTTTTAAACATTATTCCCGATGATGAC
TTAACAAAGTCAGTACAGGACTGGGTTTATGCTACAATATACTCAATTGCTCCTGAACTA
AGATCCTTCATTGAGTTAGAAATGAAATTTGGTGTTATTATTGATGCGAAAGGCCCAGAT
CGTGTAAATCCACCAGTTTCTTCACAATGTGTTTTCACTGAGCTTGATGCCCATCTAACG
CCTAATATTGATGCATCTTTGTTCAAAGAGTTGAGCAAATATATTCGTGGTATTAGCGAA
GTCACTGAAAATACAGGTAAATTCAGTATTATTGAATCCCAGACAAGAGATTCCGTCTAT
AGAGTCGGACTATCCACGCAAAGACCAAGGTTTTTGAGAATGAGTACAGATATTAAGACT
GGGAGGGTAGGACAATTTATAGAGAAAAGACATGTAGCCCAACTACTATTATATTCACCA
AAAGATAGTTACGACGTTAAAATCTCCCTAAACTTGGAATTACCTGTACCTGACAACGAT
CCGCCAGAAAAATATAAATCTCAAAGCCCAATTAGTGAAAGGACGAAAGACCGTGTTAGT
TACATTCATAATGATTCCTGTACCAGAATTGATATTACAAAAGTCGAAAATCATAACCAA
AATTCAAAAAGTAGACAATCAGAGACCACTCACGAAGTGGAACTAGAAATCAACACGCCT
GCACTGTTAAACGCCTTTGATAACATAACGAACGATAGTAAAGAATATGCATCTCTTATT
AGAACGTTTCTGAATAATGGTACAATTATTAGAAGAAAGTTATCGTCTTTATCATATGAA
ATTTTTGAAGGTTCAAAGAAAGTCATGTAA

Predicted translation product    

>SACE0P01254g.aa
MSYTDNPPQTKRALSLDDLVNHDENEKVKLQKLSEAANGSRPFAENLESDINQTETGQAA
PIDNYKESTGHGSHSQKPKSRKSSNDDEETDTDDEMGASGEINFDSEMDFDYDKQHRNLL
SNGSPPMNDGSDANAKLEKPSDDSIHQNSKSDEEQRIPKQGNEGNIASNYITQVPLQKQK
QTEKKIAGNAVGSVVKKEEEANAAVDNIFEEKATLQSKKNNIKRDLEVLNEISASSKPSK
YRNVPIWAQKWKPTIKALQSINVKDLKIDPSFLNIIPDDDLTKSVQDWVYATIYSIAPEL
RSFIELEMKFGVIIDAKGPDRVNPPVSSQCVFTELDAHLTPNIDASLFKELSKYIRGISE
VTENTGKFSIIESQTRDSVYRVGLSTQRPRFLRMSTDIKTGRVGQFIEKRHVAQLLLYSP
KDSYDVKISLNLELPVPDNDPPEKYKSQSPISERTKDRVSYIHNDSCTRIDITKVENHNQ
NSKSRQSETTHEVELEINTPALLNAFDNITNDSKEYASLIRTFLNNGTIIRRKLSSLSYE
IFEGSKKVM*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites