SACE0G02948g


Alpha (guanylyltransferase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CET1, an RNA 5'-triphophatase) involved in adding the 5' cap to mRNA; the mammalian enzyme is a single bifunctional polypeptide

Genomic environment map

Element type: CDS
Element length: 1380 nucleotides,
on sense strand of
Sace0G: 266150..267529.
Other names:
CEG1
YGL130W
Coding sequence: 460 codons.
Database cross references:
ArrayExpress: Q01159
CYGD: YGL130w
EMBL: BAA01103.1
EMBL: CAA60705.1
EMBL: CAA96839.1
EMBL: D10263
EMBL: X87252
EMBL: Z72652
Ensembl: YGL130W
GeneID: 852747
GenomeReviews: Y13135_GR
HOGENOM: Q01159
NMPDR: fig|4932.3.peg.2489

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R1787 GL3R1787.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein SACE0G02948p  


Protein domain map

Protein length: 459 amino acids
Protein family: GL3R1787
Database cross references:
DIP: DIP:2298N
Gene3D: G3DSA:2.40.50.140
GermOnline: YGL130W
HSSP: 1P16
HSSP: P78587
IntAct: Q01159
InterPro: IPR001339
InterPro: IPR012340
InterPro: IPR013846
InterPro: IPR017075
KEGG: sce:YGL130W
LinkHub: Q01159
PIR: S59731
PIRSF: PIRSF036959
PeptideAtlas: Q01159
Pfam: PF01331
Pfam: PF03919
RefSeq: NP_011385.1
SGD: S000003098
UniProtKB/Swiss-Prot: Q01159
UniprotKB: MCE1_YEAST

Computed results for SACE0G02948p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0045944 positive regulation of transcription from RNA polymerase II promoter
GO:0031533 mRNA capping enzyme complex
GO:0006370 mRNA capping
GO:0005515 protein binding
GO:0004484 mRNA guanylyltransferase activity

Sequence data  


Nucleotide sequence    

>SACE0G02948g.nt
TGGGTCGTATAATGCATTCCTTCTCATTAGGAAGAACATCAAAAACTTCGTCTTTATGAA
ATATACTACATTTTCCCCTGTATGATGAAATAGGACATATGTCCTGGTGTAATGATGCAT
ACACCAACCTTGGGTTAAAAGTATTTACGTGCTCTTGAATATCCCTAGGACGATAAAACC
AATTCATTCTAACTTGAAAAAATTTAGCAGGGAAGACCGATGTTGTCTTTATAGCTTCAT
GAATAGTGTTAGAAAATTCTGGTTTACTAACAAAATTTACAACTCTGCCAACATAATAAG
GCTCGCCAGCAGGCTCTGATATCATATAAATAGTGTCGTTCGCAGATAAAAGGACAGATT
CTGTATTGAAAAGAGACATCTTTTTCAAGTCAATCATGGCATCATCTAGATCAAGTATGT
TAGAGAATCTGCTATTCTTTCTGAACGAAGGGGGTAATGTAGGGATAAAGTTCCAGCAAG
TATTCTTATCATGAAGGAATTTTTGGTACTTGAATTTCTCGTTTCTACTAGCTGGATTGC
TTGTCCTTTTACAATTCCTGGGAGTTTGGCTATTGCTACCCTTGGTCTTAGCACCACTTT
TTTCCGCTATTTGTATTTGTTGAACTAATTTAGTATCTATTTCCATTTCATTATAATTCA
CGTTTTTAGCAGCCTCTCTTCTTCTAGGTAATTGGAAATCTTCTTCTTTAGGCATAGCGT
GACCAGCTGAAAGGCAGATGATCTCAATATGTCCCGATAGGCCAATTTTGCCACCAGATA
GTTCGATGGCATTTTATTTTTAGTCCTTTTGAAAGCAATATCATAAAATATATATAGTTC
TCCATGATGTTCGGGTCAGTCGCTCCGAAGCGTAACCTAGTATAATAAATAGTTCATTGC
AGAAAATAACGAAAGAAATGGTGGAATACGATCTGTTATATCTAAACTAAAGCTAACTAA
CGGAATAAGCAAATACGAATCGACCGCTAATTTAACAAATATGGTTTTAGCAATGGAAAG
TAGAGTGGCACCGGAAATTCCTGGGCTCATTCAACCTGGGAATGTCACGCAAGACTTGAA
GATGATGGTCTGTAAATTATTGAATTCCCCAAAACCTACGAAAACATTCCCTGGTTCCCA
GCCTGTGTCCTTTCAGCATTCTGATGTGGAAGAGAAGCTGCTTGCGCATGATTACTACGT
TTGTGAGAAAACAGATGGTCTGCGGGTGTTGATGTTTATAGTGATAAATCCTGTGACGGG
TGAGCAAGGATGCTTTATGATTGATAGGGAAAATAACTATTATCTGGTTAATGGATTTAG
GTTTCCCAGATTACCCCAAAAGAAGAAAGAAGAGCTGCTAGAGACTCTTCAAGATGGCAC
CTTATTAGATGGTGAACTTGTCATACAAACTAACCCAATGACAAAATTACAAGAGTTGCG
TTATTTAATGTTCGATTGTCTTGCTATCAATGGTAGATGTCTCACACAATCACCAACAAG
TTCTAGACTAGCCCACCTTGGAAAAGAATTTTTTAAACCATACTTCGATTTAAGAGCAGC
GTACCCTAATCGTTGTACTACTTTTCCGTTCAAAATTTCCATGAAACATATGGATTTCAG
TTACCAATTAGTAAAAGTTGCTAAAAGTTTAGATAAACTACCACATCTTTCTGATGGTCT
GATATTTACTCCTGTGAAGGCACCTTACACTGCCGGCGGAAAAGATTCATTGTTATTAAA
ATGGAAGCCAGAACAAGAAAACACCGTGGACTTCAAATTGATTTTAGATATCCCAATGGT
GGAGGATCCTTCTTTGCCTAAAGATGATCGGAACAGGTGGTATTACAATTATGACGTTAA
GCCAGTTTTCAGCTTATATGTCTGGCAAGGCGGAGCTGATGTCAATTCACGTTTAAAACA
TTTCGACCAGCCTTTCGATAGGAAGGAATTTGAAATATTAGAAAGAACATACAGAAAATT
TGCAGAGTTGAGCGTTTCAGATGAGGAATGGCAAAATTTGAAGAACCTAGAACAGCCATT
AAATGGTAGAATAGTAGAGTGCGCAAAAAACCAAGAGACTGGGGCGTGGGAAATGTTAAG
ATTCAGGGATGATAAGTTAAATGGTAATCATACATCGGTGGTCCAGAAAGTTTTGGAGAG
TATCAACGATTCAGTTTCATTGGAGGACCTCGAGGAAATTGTTGGTGATATTAAAAGGTG
CTGGGACGAGAGAAGAGCAAATATGGCTGGTGGTAGTGGGAGACCACTACCGTCTCAAAG
TCAAAATGCGACATTATCTACCTCTAAGCCAGTCCATTCACAGCCCCCAAGTAATGATAA
GGAGCCAAAATATGTAGACGAGGATGATTGGTCGGATTAGGCTAAGAAGTGTTCGTATCG
ACATATATCGGTGTAAGAATTTTGTGTATGAATAAGATAGAAAAGCAAAAGCAATTGCTT
TTGCTAAATTAATTCAGTTAGTTATCTATTTATATGTACATGAGAAATGGGAGAATTTTG
AGAAGCGAGGGTGAAAGTGTTATATACGCATATACATGTATATATATGCGCATGCTTATC
ACCTATGGGAAAGGACTAAACTTTTCAATAATTCTCTTGGGTTACCGTTACCACTTAGGA
AATATTTTTCGTCAATCAAGTTCTCCCACTTTTTTCCTGT

Coding sequence    

>SACE0G02948g.cds
ATGGTTTTAGCAATGGAAAGTAGAGTGGCACCGGAAATTCCTGGGCTCATTCAACCTGGG
AATGTCACGCAAGACTTGAAGATGATGGTCTGTAAATTATTGAATTCCCCAAAACCTACG
AAAACATTCCCTGGTTCCCAGCCTGTGTCCTTTCAGCATTCTGATGTGGAAGAGAAGCTG
CTTGCGCATGATTACTACGTTTGTGAGAAAACAGATGGTCTGCGGGTGTTGATGTTTATA
GTGATAAATCCTGTGACGGGTGAGCAAGGATGCTTTATGATTGATAGGGAAAATAACTAT
TATCTGGTTAATGGATTTAGGTTTCCCAGATTACCCCAAAAGAAGAAAGAAGAGCTGCTA
GAGACTCTTCAAGATGGCACCTTATTAGATGGTGAACTTGTCATACAAACTAACCCAATG
ACAAAATTACAAGAGTTGCGTTATTTAATGTTCGATTGTCTTGCTATCAATGGTAGATGT
CTCACACAATCACCAACAAGTTCTAGACTAGCCCACCTTGGAAAAGAATTTTTTAAACCA
TACTTCGATTTAAGAGCAGCGTACCCTAATCGTTGTACTACTTTTCCGTTCAAAATTTCC
ATGAAACATATGGATTTCAGTTACCAATTAGTAAAAGTTGCTAAAAGTTTAGATAAACTA
CCACATCTTTCTGATGGTCTGATATTTACTCCTGTGAAGGCACCTTACACTGCCGGCGGA
AAAGATTCATTGTTATTAAAATGGAAGCCAGAACAAGAAAACACCGTGGACTTCAAATTG
ATTTTAGATATCCCAATGGTGGAGGATCCTTCTTTGCCTAAAGATGATCGGAACAGGTGG
TATTACAATTATGACGTTAAGCCAGTTTTCAGCTTATATGTCTGGCAAGGCGGAGCTGAT
GTCAATTCACGTTTAAAACATTTCGACCAGCCTTTCGATAGGAAGGAATTTGAAATATTA
GAAAGAACATACAGAAAATTTGCAGAGTTGAGCGTTTCAGATGAGGAATGGCAAAATTTG
AAGAACCTAGAACAGCCATTAAATGGTAGAATAGTAGAGTGCGCAAAAAACCAAGAGACT
GGGGCGTGGGAAATGTTAAGATTCAGGGATGATAAGTTAAATGGTAATCATACATCGGTG
GTCCAGAAAGTTTTGGAGAGTATCAACGATTCAGTTTCATTGGAGGACCTCGAGGAAATT
GTTGGTGATATTAAAAGGTGCTGGGACGAGAGAAGAGCAAATATGGCTGGTGGTAGTGGG
AGACCACTACCGTCTCAAAGTCAAAATGCGACATTATCTACCTCTAAGCCAGTCCATTCA
CAGCCCCCAAGTAATGATAAGGAGCCAAAATATGTAGACGAGGATGATTGGTCGGATTAG


Predicted translation product    

>SACE0G02948g.aa
MVLAMESRVAPEIPGLIQPGNVTQDLKMMVCKLLNSPKPTKTFPGSQPVSFQHSDVEEKL
LAHDYYVCEKTDGLRVLMFIVINPVTGEQGCFMIDRENNYYLVNGFRFPRLPQKKKEELL
ETLQDGTLLDGELVIQTNPMTKLQELRYLMFDCLAINGRCLTQSPTSSRLAHLGKEFFKP
YFDLRAAYPNRCTTFPFKISMKHMDFSYQLVKVAKSLDKLPHLSDGLIFTPVKAPYTAGG
KDSLLLKWKPEQENTVDFKLILDIPMVEDPSLPKDDRNRWYYNYDVKPVFSLYVWQGGAD
VNSRLKHFDQPFDRKEFEILERTYRKFAELSVSDEEWQNLKNLEQPLNGRIVECAKNQET
GAWEMLRFRDDKLNGNHTSVVQKVLESINDSVSLEDLEEIVGDIKRCWDERRANMAGGSG
RPLPSQSQNATLSTSKPVHSQPPSNDKEPKYVDEDDWSD*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites