CAGL0E06050g
similar to uniprot|O13297 Saccharomyces cerevisiae YPL228w CET1 (ohnolog of YMR180C) Beta (RNA 5'-triphosphatase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CEG1, a guanylyltransferase) involved in adding the 5' cap to mRNA
Element type: CDS
Element length: 1809 nucleotides,
on anti-sense strand of
Cagl0E: complement(599197..601005).
Other names:
CAGL-CDS1497.1
CAGL-IPF6019
Coding sequence: 603 codons.
Element length: 1809 nucleotides,
on anti-sense strand of
Cagl0E: complement(599197..601005).
Other names:
CAGL-CDS1497.1
CAGL-IPF6019
Coding sequence: 603 codons.
Database cross references:
EMBL: CR380951
GeneID: 2887417
HOGENOM: Q6FUZ2
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380951
GeneID: 2887417
HOGENOM: Q6FUZ2
Homologs and Orthologs
Homologs in protein families: GL3C0218 GL3C0218.N2Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
InterPro: IPR004206
KEGG: cgr:CAGL0E06050g
Pfam: PF02940
RefSeq: XP_445952.1
SMR: Q6FUZ2
UniProtKB/Swiss-Prot: Q6FUZ2
UniprotKB: CET1_CANGA
InterPro: IPR004206
KEGG: cgr:CAGL0E06050g
Pfam: PF02940
RefSeq: XP_445952.1
SMR: Q6FUZ2
UniProtKB/Swiss-Prot: Q6FUZ2
UniprotKB: CET1_CANGA
Sequence data 
Nucleotide sequence
>CAGL0E06050g.nt TTTTCTTTCATCTCATTTTATCTTTTTGTTTATCTCGTTGCAAAACTAAAATTGGATTTG TCAAATGTTTATTGTTGAAGTGTTCAACCCATAAGCAATTTGCCTCCTCCTGATTTCATT ACCATCCATTCCTTTCATGTCACAATTATGTTTAATATCCATGTCACGTTTTAATGTTCA TATACATACAATAAAAACATCTATTATTGCTGGTTCAAATACAATTTTTTTTTTTCATAT CTACAGATTTTACATTTTCATTACTTAAGGCTCACTCATTTCATATCTATTTATATAGAC AAGTACTCATGGGTTACACTAAATTATGTTTCCCCTTCATTTTATTAAATACTATTTAGA GATTCATAAACAAATAAACATTTCTTCCAATTATACAGTTACTATCTGTCTTCCAGAAAT CGTGATAACTACGTGGCGTCATCTCTCTTCGGTTGTTGAAACTATTCCAGAAGCAAGTTT TATCAATTCTTTCAGCTCAACATATTTAATATTTTGGTATTTAATCACGTCAGGAGATAA TAAAGAAATCAGAACCAAAAAAACAAACTATACAGAAGTTCATAAAAGAGCAACTAGATA GGAGCATCAAATATTATGTGATTTGGATGTATGCACAGTTCATTCCCAGTTGCTGTTATA CCAGTATAAACTGATACTTGCGCCTAGCATAGTATAACCACATGTTATATCACTCCTTAC GCCATACTAATATGGCATTTCACTCGGCAAAAAATCTTTCTGAAAAAAATAGGTCAAATT ACAACGATGCCTAATAGTTAAGCAAACCGTTCCAGTGAGAGATTTTACTATAGCTAGTGT TTACCGGAATAACTAGTGCAGTCATATTTATACCTTTTCAATTGTGAAGTACAGTGATTG AATTTGGAATCAGTGATTGTTTTTATACCGAGTACTTGTAAAACAAAGCTGAAAGTTAGG TAAAGAAAGGCAGGCATCATACAGTTTTTGGAGCTTCTAGATGTCTGAACACCACTCGAA GAGAGCTTTGTCTCTGGATGACCTTGTCAACCACGATGAAAATGATAAGTCAAAGCTGCA AAAACTAGCCGACAACGAAAGCAGTGTACGATCAGATGACAACAGACCTGGTGCTATTGA AAACATTGTGAACGGTAATAACAGCAACAGTGACCTGAACTCTAATGGTGTCATTGAAGA AGACACTGATACAGATGATGATGTTGGTGGGGAGTTCACTTTTGATAATGGCATAACATT TGACTATGACAAACAAGATAGATTCTCTCCAGAGAAAAAACGAATTCAGGCGAGAAAGAA AGATACCTCTAAAACAACACCAAGCATATCAAATGAATCACCCAGCAATTCAAAAGAATC CTCAGTCCCTGTTGATCCTCTTTCCAGCAATATAAGTGCTACAGATAGAAAGGATTCGTC AGAGGAGAAACCTGACTTGACTGGACCAGAATTAGTAAAAGAACCTGATACCAACGAATA CAAGAGACCATCTATCCAGTCCATAACCAATGCGGAAGATACTACCTATAACGACCATAA AGCAGCTGGAATGGAAAAAACATCGAACAAGCATAGCTTACCAAATATTCTTTCCGACAG TATCGATGAAACTGTCACGGAGGAACATAAACCAAAAACTGAAACTGAACAAACAATAAC AGAATATCAACAAGAAAATAAACAGAAAGATAACGTAAACGAAAGCAATTCAGAAGAAAC CCACGATATTAAAAATGATAATATGAACCAAGTCGAAAAAATATTCCAAGAAAAGACATC TACACTTTCTAAAAAGAATAGTGTAAAAAAGGATCTGGAACTATTAAACGAAATATCTGC ATCTTCCAAGCCAAATAAATATAAAAACACACCAATTTGGGCTCAAAAATGGAAACCTAC AGTTAAAGCTCTACAAAACATCGACACAAATGACTTCAAAATTGACAACTCCATTTTAGA CATTATACCAGATGATGATTTAACAAAGTCAGTTCAGGATTGGGTTTATGCAACTTTATA TTCGATTGACCCAGATTTGAGACCGTTTATTGAGCTGGAAATGAAATTTGGAGTTTTATT AGAATCAAAAAGCCCAGATAGAGTTAATCCACCAGTTTCCTCACAGGCAGTATATACTGA TATGGATGCTCACCTTACACCAAATGTAGACGAGACTGTGTTCAAAGAACTCAGTAAATA TATCCAAAGTCTCAGTGAAATCACAGAAAATGCTGGTAAATTTAACGTGATTGAAGCACA AACTAAGGATGCAGTCTATAGAGTCGGTACATCTACCCAAAGACCGAGATTTTTAAGAAT GAGTTCAGATGTAAAAACAGGTAGAATTGGTGCATTTATCGAGAAGAGACACATATCTCA ATTATTAATCTACTCTCCAAAGGATAGTTATGATGTTAAGTTATCAATAAACTTAGAACT TCCGGTTCCTGAAAACGATCCACCAGAGAAGTATCAACATCAAACACCTGTTAGTGAAAG AACAAAAGAGAGAGTTAGTTATATTCATAATGACTCTTGTACAAGGTTCGATATCACAAA AGTTCAAAATCATAATAAAGGCATAAAATCAAATGACGTTGAAATAACACACGAAATCGA ATTAGAAATAAACACACCCGCTTTAATCAAAGCTTTTGACAATATAATGACGGACAGCAA AGAATACGCTACATTGATTAGGACGTTCCTCAATAATGGAACTATTGTTCGCAGAAAACT ATCATCACTTTCATATGAAATATTTGAAGGTCAAAAAAAGATACAATAAATGGATATACT GCATTCGGTTATTGTAATTGTATTATTTTACAGTTTATGTATTAATATAATCTATGTATG AATTTATGGATTTACCACCATTAATATATTTACATAGTGATATTTGAAATATCCATTCTG ATATGTGGTAAAACACTATTGGGTTCAGCGGCTTTCCAAGTTTATATTCAACATTGTTTA TAAGGATTATATATACCCAGAATATATGCCATTCTAATAACAACCAAATCGATAGCCATT TTGATGCTATCTAGCGCAAGGGCCATTTTAGAACCATCAACTTCATGCC
Coding sequence
>CAGL0E06050g.cds ATGTCTGAACACCACTCGAAGAGAGCTTTGTCTCTGGATGACCTTGTCAACCACGATGAA AATGATAAGTCAAAGCTGCAAAAACTAGCCGACAACGAAAGCAGTGTACGATCAGATGAC AACAGACCTGGTGCTATTGAAAACATTGTGAACGGTAATAACAGCAACAGTGACCTGAAC TCTAATGGTGTCATTGAAGAAGACACTGATACAGATGATGATGTTGGTGGGGAGTTCACT TTTGATAATGGCATAACATTTGACTATGACAAACAAGATAGATTCTCTCCAGAGAAAAAA CGAATTCAGGCGAGAAAGAAAGATACCTCTAAAACAACACCAAGCATATCAAATGAATCA CCCAGCAATTCAAAAGAATCCTCAGTCCCTGTTGATCCTCTTTCCAGCAATATAAGTGCT ACAGATAGAAAGGATTCGTCAGAGGAGAAACCTGACTTGACTGGACCAGAATTAGTAAAA GAACCTGATACCAACGAATACAAGAGACCATCTATCCAGTCCATAACCAATGCGGAAGAT ACTACCTATAACGACCATAAAGCAGCTGGAATGGAAAAAACATCGAACAAGCATAGCTTA CCAAATATTCTTTCCGACAGTATCGATGAAACTGTCACGGAGGAACATAAACCAAAAACT GAAACTGAACAAACAATAACAGAATATCAACAAGAAAATAAACAGAAAGATAACGTAAAC GAAAGCAATTCAGAAGAAACCCACGATATTAAAAATGATAATATGAACCAAGTCGAAAAA ATATTCCAAGAAAAGACATCTACACTTTCTAAAAAGAATAGTGTAAAAAAGGATCTGGAA CTATTAAACGAAATATCTGCATCTTCCAAGCCAAATAAATATAAAAACACACCAATTTGG GCTCAAAAATGGAAACCTACAGTTAAAGCTCTACAAAACATCGACACAAATGACTTCAAA ATTGACAACTCCATTTTAGACATTATACCAGATGATGATTTAACAAAGTCAGTTCAGGAT TGGGTTTATGCAACTTTATATTCGATTGACCCAGATTTGAGACCGTTTATTGAGCTGGAA ATGAAATTTGGAGTTTTATTAGAATCAAAAAGCCCAGATAGAGTTAATCCACCAGTTTCC TCACAGGCAGTATATACTGATATGGATGCTCACCTTACACCAAATGTAGACGAGACTGTG TTCAAAGAACTCAGTAAATATATCCAAAGTCTCAGTGAAATCACAGAAAATGCTGGTAAA TTTAACGTGATTGAAGCACAAACTAAGGATGCAGTCTATAGAGTCGGTACATCTACCCAA AGACCGAGATTTTTAAGAATGAGTTCAGATGTAAAAACAGGTAGAATTGGTGCATTTATC GAGAAGAGACACATATCTCAATTATTAATCTACTCTCCAAAGGATAGTTATGATGTTAAG TTATCAATAAACTTAGAACTTCCGGTTCCTGAAAACGATCCACCAGAGAAGTATCAACAT CAAACACCTGTTAGTGAAAGAACAAAAGAGAGAGTTAGTTATATTCATAATGACTCTTGT ACAAGGTTCGATATCACAAAAGTTCAAAATCATAATAAAGGCATAAAATCAAATGACGTT GAAATAACACACGAAATCGAATTAGAAATAAACACACCCGCTTTAATCAAAGCTTTTGAC AATATAATGACGGACAGCAAAGAATACGCTACATTGATTAGGACGTTCCTCAATAATGGA ACTATTGTTCGCAGAAAACTATCATCACTTTCATATGAAATATTTGAAGGTCAAAAAAAG ATACAATAA
Predicted translation product
>CAGL0E06050g.aa MSEHHSKRALSLDDLVNHDENDKSKLQKLADNESSVRSDDNRPGAIENIVNGNNSNSDLN SNGVIEEDTDTDDDVGGEFTFDNGITFDYDKQDRFSPEKKRIQARKKDTSKTTPSISNES PSNSKESSVPVDPLSSNISATDRKDSSEEKPDLTGPELVKEPDTNEYKRPSIQSITNAED TTYNDHKAAGMEKTSNKHSLPNILSDSIDETVTEEHKPKTETEQTITEYQQENKQKDNVN ESNSEETHDIKNDNMNQVEKIFQEKTSTLSKKNSVKKDLELLNEISASSKPNKYKNTPIW AQKWKPTVKALQNIDTNDFKIDNSILDIIPDDDLTKSVQDWVYATLYSIDPDLRPFIELE MKFGVLLESKSPDRVNPPVSSQAVYTDMDAHLTPNVDETVFKELSKYIQSLSEITENAGK FNVIEAQTKDAVYRVGTSTQRPRFLRMSSDVKTGRIGAFIEKRHISQLLIYSPKDSYDVK LSINLELPVPENDPPEKYQHQTPVSERTKERVSYIHNDSCTRFDITKVQNHNKGIKSNDV EITHEIELEINTPALIKAFDNIMTDSKEYATLIRTFLNNGTIVRRKLSSLSYEIFEGQKK IQ*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0E06050p