CAGL0E06050g


similar to uniprot|O13297 Saccharomyces cerevisiae YPL228w CET1 (ohnolog of YMR180C) Beta (RNA 5'-triphosphatase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CEG1, a guanylyltransferase) involved in adding the 5' cap to mRNA

Genomic environment map

Element type: CDS
Element length: 1809 nucleotides,
on anti-sense strand of
Cagl0E: complement(599197..601005).
Other names:
CAGL-CDS1497.1
CAGL-IPF6019
Coding sequence: 603 codons.
Database cross references:
EMBL: CR380951
GeneID: 2887417
HOGENOM: Q6FUZ2

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0218 GL3C0218.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0E06050p  


similar to uniprot|O13297 Saccharomyces cerevisiae YPL228w CET1

Protein domain map

Protein length: 602 amino acids
Protein family: GL3C0218
Database cross references:
InterPro: IPR004206
KEGG: cgr:CAGL0E06050g
Pfam: PF02940
RefSeq: XP_445952.1
SMR: Q6FUZ2
UniProtKB/Swiss-Prot: Q6FUZ2
UniprotKB: CET1_CANGA

Computed results for CAGL0E06050p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  

Nucleotide sequence

>CAGL0E06050g.nt
TTTTCTTTCATCTCATTTTATCTTTTTGTTTATCTCGTTGCAAAACTAAAATTGGATTTG
TCAAATGTTTATTGTTGAAGTGTTCAACCCATAAGCAATTTGCCTCCTCCTGATTTCATT
ACCATCCATTCCTTTCATGTCACAATTATGTTTAATATCCATGTCACGTTTTAATGTTCA
TATACATACAATAAAAACATCTATTATTGCTGGTTCAAATACAATTTTTTTTTTTCATAT
CTACAGATTTTACATTTTCATTACTTAAGGCTCACTCATTTCATATCTATTTATATAGAC
AAGTACTCATGGGTTACACTAAATTATGTTTCCCCTTCATTTTATTAAATACTATTTAGA
GATTCATAAACAAATAAACATTTCTTCCAATTATACAGTTACTATCTGTCTTCCAGAAAT
CGTGATAACTACGTGGCGTCATCTCTCTTCGGTTGTTGAAACTATTCCAGAAGCAAGTTT
TATCAATTCTTTCAGCTCAACATATTTAATATTTTGGTATTTAATCACGTCAGGAGATAA
TAAAGAAATCAGAACCAAAAAAACAAACTATACAGAAGTTCATAAAAGAGCAACTAGATA
GGAGCATCAAATATTATGTGATTTGGATGTATGCACAGTTCATTCCCAGTTGCTGTTATA
CCAGTATAAACTGATACTTGCGCCTAGCATAGTATAACCACATGTTATATCACTCCTTAC
GCCATACTAATATGGCATTTCACTCGGCAAAAAATCTTTCTGAAAAAAATAGGTCAAATT
ACAACGATGCCTAATAGTTAAGCAAACCGTTCCAGTGAGAGATTTTACTATAGCTAGTGT
TTACCGGAATAACTAGTGCAGTCATATTTATACCTTTTCAATTGTGAAGTACAGTGATTG
AATTTGGAATCAGTGATTGTTTTTATACCGAGTACTTGTAAAACAAAGCTGAAAGTTAGG
TAAAGAAAGGCAGGCATCATACAGTTTTTGGAGCTTCTAGATGTCTGAACACCACTCGAA
GAGAGCTTTGTCTCTGGATGACCTTGTCAACCACGATGAAAATGATAAGTCAAAGCTGCA
AAAACTAGCCGACAACGAAAGCAGTGTACGATCAGATGACAACAGACCTGGTGCTATTGA
AAACATTGTGAACGGTAATAACAGCAACAGTGACCTGAACTCTAATGGTGTCATTGAAGA
AGACACTGATACAGATGATGATGTTGGTGGGGAGTTCACTTTTGATAATGGCATAACATT
TGACTATGACAAACAAGATAGATTCTCTCCAGAGAAAAAACGAATTCAGGCGAGAAAGAA
AGATACCTCTAAAACAACACCAAGCATATCAAATGAATCACCCAGCAATTCAAAAGAATC
CTCAGTCCCTGTTGATCCTCTTTCCAGCAATATAAGTGCTACAGATAGAAAGGATTCGTC
AGAGGAGAAACCTGACTTGACTGGACCAGAATTAGTAAAAGAACCTGATACCAACGAATA
CAAGAGACCATCTATCCAGTCCATAACCAATGCGGAAGATACTACCTATAACGACCATAA
AGCAGCTGGAATGGAAAAAACATCGAACAAGCATAGCTTACCAAATATTCTTTCCGACAG
TATCGATGAAACTGTCACGGAGGAACATAAACCAAAAACTGAAACTGAACAAACAATAAC
AGAATATCAACAAGAAAATAAACAGAAAGATAACGTAAACGAAAGCAATTCAGAAGAAAC
CCACGATATTAAAAATGATAATATGAACCAAGTCGAAAAAATATTCCAAGAAAAGACATC
TACACTTTCTAAAAAGAATAGTGTAAAAAAGGATCTGGAACTATTAAACGAAATATCTGC
ATCTTCCAAGCCAAATAAATATAAAAACACACCAATTTGGGCTCAAAAATGGAAACCTAC
AGTTAAAGCTCTACAAAACATCGACACAAATGACTTCAAAATTGACAACTCCATTTTAGA
CATTATACCAGATGATGATTTAACAAAGTCAGTTCAGGATTGGGTTTATGCAACTTTATA
TTCGATTGACCCAGATTTGAGACCGTTTATTGAGCTGGAAATGAAATTTGGAGTTTTATT
AGAATCAAAAAGCCCAGATAGAGTTAATCCACCAGTTTCCTCACAGGCAGTATATACTGA
TATGGATGCTCACCTTACACCAAATGTAGACGAGACTGTGTTCAAAGAACTCAGTAAATA
TATCCAAAGTCTCAGTGAAATCACAGAAAATGCTGGTAAATTTAACGTGATTGAAGCACA
AACTAAGGATGCAGTCTATAGAGTCGGTACATCTACCCAAAGACCGAGATTTTTAAGAAT
GAGTTCAGATGTAAAAACAGGTAGAATTGGTGCATTTATCGAGAAGAGACACATATCTCA
ATTATTAATCTACTCTCCAAAGGATAGTTATGATGTTAAGTTATCAATAAACTTAGAACT
TCCGGTTCCTGAAAACGATCCACCAGAGAAGTATCAACATCAAACACCTGTTAGTGAAAG
AACAAAAGAGAGAGTTAGTTATATTCATAATGACTCTTGTACAAGGTTCGATATCACAAA
AGTTCAAAATCATAATAAAGGCATAAAATCAAATGACGTTGAAATAACACACGAAATCGA
ATTAGAAATAAACACACCCGCTTTAATCAAAGCTTTTGACAATATAATGACGGACAGCAA
AGAATACGCTACATTGATTAGGACGTTCCTCAATAATGGAACTATTGTTCGCAGAAAACT
ATCATCACTTTCATATGAAATATTTGAAGGTCAAAAAAAGATACAATAAATGGATATACT
GCATTCGGTTATTGTAATTGTATTATTTTACAGTTTATGTATTAATATAATCTATGTATG
AATTTATGGATTTACCACCATTAATATATTTACATAGTGATATTTGAAATATCCATTCTG
ATATGTGGTAAAACACTATTGGGTTCAGCGGCTTTCCAAGTTTATATTCAACATTGTTTA
TAAGGATTATATATACCCAGAATATATGCCATTCTAATAACAACCAAATCGATAGCCATT
TTGATGCTATCTAGCGCAAGGGCCATTTTAGAACCATCAACTTCATGCC

Coding sequence

>CAGL0E06050g.cds
ATGTCTGAACACCACTCGAAGAGAGCTTTGTCTCTGGATGACCTTGTCAACCACGATGAA
AATGATAAGTCAAAGCTGCAAAAACTAGCCGACAACGAAAGCAGTGTACGATCAGATGAC
AACAGACCTGGTGCTATTGAAAACATTGTGAACGGTAATAACAGCAACAGTGACCTGAAC
TCTAATGGTGTCATTGAAGAAGACACTGATACAGATGATGATGTTGGTGGGGAGTTCACT
TTTGATAATGGCATAACATTTGACTATGACAAACAAGATAGATTCTCTCCAGAGAAAAAA
CGAATTCAGGCGAGAAAGAAAGATACCTCTAAAACAACACCAAGCATATCAAATGAATCA
CCCAGCAATTCAAAAGAATCCTCAGTCCCTGTTGATCCTCTTTCCAGCAATATAAGTGCT
ACAGATAGAAAGGATTCGTCAGAGGAGAAACCTGACTTGACTGGACCAGAATTAGTAAAA
GAACCTGATACCAACGAATACAAGAGACCATCTATCCAGTCCATAACCAATGCGGAAGAT
ACTACCTATAACGACCATAAAGCAGCTGGAATGGAAAAAACATCGAACAAGCATAGCTTA
CCAAATATTCTTTCCGACAGTATCGATGAAACTGTCACGGAGGAACATAAACCAAAAACT
GAAACTGAACAAACAATAACAGAATATCAACAAGAAAATAAACAGAAAGATAACGTAAAC
GAAAGCAATTCAGAAGAAACCCACGATATTAAAAATGATAATATGAACCAAGTCGAAAAA
ATATTCCAAGAAAAGACATCTACACTTTCTAAAAAGAATAGTGTAAAAAAGGATCTGGAA
CTATTAAACGAAATATCTGCATCTTCCAAGCCAAATAAATATAAAAACACACCAATTTGG
GCTCAAAAATGGAAACCTACAGTTAAAGCTCTACAAAACATCGACACAAATGACTTCAAA
ATTGACAACTCCATTTTAGACATTATACCAGATGATGATTTAACAAAGTCAGTTCAGGAT
TGGGTTTATGCAACTTTATATTCGATTGACCCAGATTTGAGACCGTTTATTGAGCTGGAA
ATGAAATTTGGAGTTTTATTAGAATCAAAAAGCCCAGATAGAGTTAATCCACCAGTTTCC
TCACAGGCAGTATATACTGATATGGATGCTCACCTTACACCAAATGTAGACGAGACTGTG
TTCAAAGAACTCAGTAAATATATCCAAAGTCTCAGTGAAATCACAGAAAATGCTGGTAAA
TTTAACGTGATTGAAGCACAAACTAAGGATGCAGTCTATAGAGTCGGTACATCTACCCAA
AGACCGAGATTTTTAAGAATGAGTTCAGATGTAAAAACAGGTAGAATTGGTGCATTTATC
GAGAAGAGACACATATCTCAATTATTAATCTACTCTCCAAAGGATAGTTATGATGTTAAG
TTATCAATAAACTTAGAACTTCCGGTTCCTGAAAACGATCCACCAGAGAAGTATCAACAT
CAAACACCTGTTAGTGAAAGAACAAAAGAGAGAGTTAGTTATATTCATAATGACTCTTGT
ACAAGGTTCGATATCACAAAAGTTCAAAATCATAATAAAGGCATAAAATCAAATGACGTT
GAAATAACACACGAAATCGAATTAGAAATAAACACACCCGCTTTAATCAAAGCTTTTGAC
AATATAATGACGGACAGCAAAGAATACGCTACATTGATTAGGACGTTCCTCAATAATGGA
ACTATTGTTCGCAGAAAACTATCATCACTTTCATATGAAATATTTGAAGGTCAAAAAAAG
ATACAATAA

Predicted translation product

>CAGL0E06050g.aa
MSEHHSKRALSLDDLVNHDENDKSKLQKLADNESSVRSDDNRPGAIENIVNGNNSNSDLN
SNGVIEEDTDTDDDVGGEFTFDNGITFDYDKQDRFSPEKKRIQARKKDTSKTTPSISNES
PSNSKESSVPVDPLSSNISATDRKDSSEEKPDLTGPELVKEPDTNEYKRPSIQSITNAED
TTYNDHKAAGMEKTSNKHSLPNILSDSIDETVTEEHKPKTETEQTITEYQQENKQKDNVN
ESNSEETHDIKNDNMNQVEKIFQEKTSTLSKKNSVKKDLELLNEISASSKPNKYKNTPIW
AQKWKPTVKALQNIDTNDFKIDNSILDIIPDDDLTKSVQDWVYATLYSIDPDLRPFIELE
MKFGVLLESKSPDRVNPPVSSQAVYTDMDAHLTPNVDETVFKELSKYIQSLSEITENAGK
FNVIEAQTKDAVYRVGTSTQRPRFLRMSSDVKTGRIGAFIEKRHISQLLIYSPKDSYDVK
LSINLELPVPENDPPEKYQHQTPVSERTKERVSYIHNDSCTRFDITKVQNHNKGIKSNDV
EITHEIELEINTPALIKAFDNIMTDSKEYATLIRTFLNNGTIVRRKLSSLSYEIFEGQKK
IQ*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites