CAGL0I09570g


highly similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130w mRNA guanylyltransferase

Genomic environment map

Element type: CDS
Element length: 1350 nucleotides, on anti-sense strand of Cagl0I: complement(914062..915411).
Other names:
CAGL-CDS2405.1
CAGL-IPF7736
Coding sequence: 450 codons.
Database cross references:
EMBL: CR380955
GeneID: 2888936
GenomeReviews: CR380955_GR
HOGENOM: HBG330574

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3R1787
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0I09570p  


highly similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130w mRNA guanylyltransferase; RecName: Full=mRNA-capping enzyme subunit alpha; AltName: Full=GTP--RNA guanylyltransferase; Short=GTase; AltName: Full=mRNA guanylyltransferase; EC=2.7.7.50;

Protein domain map

Protein length: 449 amino acids
Protein family: GL3R1787
Database cross references:
Gene3D: G3DSA:2.40.50.140
InterPro: IPR001339
InterPro: IPR012340
InterPro: IPR013846
InterPro: IPR016027
InterPro: IPR017075
KEGG: cgr:CAGL0I09570g
PIRSF: PIRSF036959
Pfam: PF01331
Pfam: PF03919
RefSeq: XP_447663.1
UniProtKB/Swiss-Prot: Q6FQ31
UniProtKB: MCE1_CANGA

Phylogeny  

PhylomeDB:CAGL0I09570g

Computed results for CAGL0I09570p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>CAGL0I09570g.nt
ATGGATGCTAGAATATCGCCAGAGATTCCGGGGATCATACAGCCCGGGAATGTTACCCAG
GACTTGAAGATGCTCGTCTGCAAGCTTCTGAACTCCCCTAAGCCCTCGAAGACGTTCCCT
GGTTCGCAACCTGTGTCCTTCCAACACTCGGATATCCGGGACAAACTGGTCGCCCAGGAC
TACTATGTGTGTGAGAAGACCGACGGGCTGCGTGTGCTGATGCTGGTGGTTGTGAACCCG
ATCACCGGTGAGCAAGGGTGCTTCATGATAGACCGTGAGAACAACTACTACTTGGTCAAC
GGGTTCCACTTCCCAAGGTTGCCGCAGAAGAAGAAAGAGGAGCTGCTGGAGACGTCCCAG
AATGGTACGCTGATAGACGGTGAGCTGGTCATTCAGAAGAACCCCATGACGAAGCTGCAG
GAGTTGCGGTACCTGATGTTCGACTGCTTGGCGATTAACGGCAGGTCCCTAGTACAATCC
CCAACCAGTTCCAGATTGGCGCACCTTGGGAAGGAGTTCTACAAGCCATACTACGACCTG
AGATCGATATACCCCGACAAATGTGCGACTTTCCCTTTCAAGCTCTCCATGAAGCACATG
GACTTCAGTTACTCCCTGGTGAAGGTGGCCAACAGTCTCGACAAGCTGCCCCACTTGTCA
GACGGTCTCATCTTTACACCCGTGAGAACGCCATACGCAGTGGGTGGTAAGGACTCCCTG
CTTCTGAAATGGAAGCCAGAACAAGAGAACTCAGTGGACTTTAAGCTAATATTGGAAATA
CCGATGACAGAGGACAATTCGGTCGCAAAGAAGGACCCTAGGAGATGGTATTATAACTAC
GATGCCAAACCAACGTTTGCACTGTATGTGTGGCAAGGTGGCTCCGATGTCAACACAAAG
CTACAAAACTTCGAACAACCATTCGACAAGAGAGAAATGCAAGTGCTGGAGAAGACATAC
AAGAGGTTCGCAGAATTATCGATTTCTGACGAGCAATGGCAGGAGCTGAAGAACTTGGAA
GAACCATTAAACGGAAGAATCGTGGAGTGTACTAAGGACCCAGAGACTGGAAGCTGGACT
ATGCTTCGATTCAGAGATGACAAGCTAAATGGTAATCACACCTCTGTGGTTCAAAAAGTT
CTGGAAAGTATCAGTGACTCAGTGACAATTGATGACCTTGCTGAGTCCGTACCGGAAATA
AAATCAGCATGGGATGAAAGAAAGAATGGCGCGTACCAACATCATAGTAGTTCCTTTTCA
GAGAGTAGGCAGCAACCTAAAGCCGAACCAGTTGCGGAGAAAAAGCAAACAGAACCAAAA
TATGTCGATGACGATGATTGGTCTGACTAA

Coding sequence    

>CAGL0I09570g.cds
ATGGATGCTAGAATATCGCCAGAGATTCCGGGGATCATACAGCCCGGGAATGTTACCCAG
GACTTGAAGATGCTCGTCTGCAAGCTTCTGAACTCCCCTAAGCCCTCGAAGACGTTCCCT
GGTTCGCAACCTGTGTCCTTCCAACACTCGGATATCCGGGACAAACTGGTCGCCCAGGAC
TACTATGTGTGTGAGAAGACCGACGGGCTGCGTGTGCTGATGCTGGTGGTTGTGAACCCG
ATCACCGGTGAGCAAGGGTGCTTCATGATAGACCGTGAGAACAACTACTACTTGGTCAAC
GGGTTCCACTTCCCAAGGTTGCCGCAGAAGAAGAAAGAGGAGCTGCTGGAGACGTCCCAG
AATGGTACGCTGATAGACGGTGAGCTGGTCATTCAGAAGAACCCCATGACGAAGCTGCAG
GAGTTGCGGTACCTGATGTTCGACTGCTTGGCGATTAACGGCAGGTCCCTAGTACAATCC
CCAACCAGTTCCAGATTGGCGCACCTTGGGAAGGAGTTCTACAAGCCATACTACGACCTG
AGATCGATATACCCCGACAAATGTGCGACTTTCCCTTTCAAGCTCTCCATGAAGCACATG
GACTTCAGTTACTCCCTGGTGAAGGTGGCCAACAGTCTCGACAAGCTGCCCCACTTGTCA
GACGGTCTCATCTTTACACCCGTGAGAACGCCATACGCAGTGGGTGGTAAGGACTCCCTG
CTTCTGAAATGGAAGCCAGAACAAGAGAACTCAGTGGACTTTAAGCTAATATTGGAAATA
CCGATGACAGAGGACAATTCGGTCGCAAAGAAGGACCCTAGGAGATGGTATTATAACTAC
GATGCCAAACCAACGTTTGCACTGTATGTGTGGCAAGGTGGCTCCGATGTCAACACAAAG
CTACAAAACTTCGAACAACCATTCGACAAGAGAGAAATGCAAGTGCTGGAGAAGACATAC
AAGAGGTTCGCAGAATTATCGATTTCTGACGAGCAATGGCAGGAGCTGAAGAACTTGGAA
GAACCATTAAACGGAAGAATCGTGGAGTGTACTAAGGACCCAGAGACTGGAAGCTGGACT
ATGCTTCGATTCAGAGATGACAAGCTAAATGGTAATCACACCTCTGTGGTTCAAAAAGTT
CTGGAAAGTATCAGTGACTCAGTGACAATTGATGACCTTGCTGAGTCCGTACCGGAAATA
AAATCAGCATGGGATGAAAGAAAGAATGGCGCGTACCAACATCATAGTAGTTCCTTTTCA
GAGAGTAGGCAGCAACCTAAAGCCGAACCAGTTGCGGAGAAAAAGCAAACAGAACCAAAA
TATGTCGATGACGATGATTGGTCTGACTAA

Predicted translation product    

>CAGL0I09570g.aa
MDARISPEIPGIIQPGNVTQDLKMLVCKLLNSPKPSKTFPGSQPVSFQHSDIRDKLVAQD
YYVCEKTDGLRVLMLVVVNPITGEQGCFMIDRENNYYLVNGFHFPRLPQKKKEELLETSQ
NGTLIDGELVIQKNPMTKLQELRYLMFDCLAINGRSLVQSPTSSRLAHLGKEFYKPYYDL
RSIYPDKCATFPFKLSMKHMDFSYSLVKVANSLDKLPHLSDGLIFTPVRTPYAVGGKDSL
LLKWKPEQENSVDFKLILEIPMTEDNSVAKKDPRRWYYNYDAKPTFALYVWQGGSDVNTK
LQNFEQPFDKREMQVLEKTYKRFAELSISDEQWQELKNLEEPLNGRIVECTKDPETGSWT
MLRFRDDKLNGNHTSVVQKVLESISDSVTIDDLAESVPEIKSAWDERKNGAYQHHSSSFS
ESRQQPKAEPVAEKKQTEPKYVDDDDWSD*
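
The relationship between the coding sequence and the predicted translation product can be spot-checked with a short script. This is a minimal sketch, not the pipeline used to produce this record: it assumes the standard genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are illustrative. Only the first 30 nucleotides and the last line of the coding sequence above are used.

```python
# Standard genetic code (NCBI translation table 1); an assumption of this sketch.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string codon by codon (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Fragments copied from the coding sequence above.
cds_start = "ATGGATGCTAGAATATCGCCAGAGATTCCG"  # first 30 nt
cds_end = "TATGTCGATGACGATGATTGGTCTGACTAA"    # last line

print(translate(cds_start))  # MDARISPEIP
print(translate(cds_end))    # YVDDDDWSD*
```

Both results agree with the start and the end of the predicted translation product listed above, including the trailing `*` for the TAA stop codon.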




Legend and notes  


Lengths
The length of a coding sequence, in codons, includes the stop codon, so it is one unit longer than the protein length.
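
The length figures quoted for this element follow from its coordinates. The sketch below reproduces the arithmetic using only values from this record; the variable names are illustrative, and the coordinates are taken to be 1-based and inclusive, as the 1350-nucleotide element length implies.

```python
# Coordinates from the record: Cagl0I: complement(914062..915411)
start, end = 914062, 915411

# Inclusive coordinates, so add 1 to the difference.
nucleotides = end - start + 1

# A coding sequence is read in triplets; the remainder should be zero.
codons, remainder = divmod(nucleotides, 3)

# The stop codon is counted as a codon but encodes no amino acid.
protein_length = codons - 1

print(nucleotides, codons, remainder, protein_length)  # 1350 450 0 449
```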

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
The Genemark computation of the protein-coding potential of the DNA was run from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears in frame #3.

Sequences
Color   Nucleotide sequence and Coding sequence                 Predicted translation product
RED     start and stop codons                                   Initial methionine and sequence end
BLUE    coding sequence                                         protein sequence
grey    non-coding sequence (upstream, downstream or intron)
grey    donor and acceptor splicing sites