KLLA0B02200g


highly similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130W CEG1 Alpha (guanylyltransferase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CET1, an RNA 5'-triphosphatase) involved in adding the 5' cap to mRNA

Genomic environment map

Element type: CDS
Element length: 1401 nucleotides, on sense strand of Klla0B: 196341..197741.
Other names: KLLA-ORF9718
Coding sequence: 467 codons.
Database cross references:
EMBL: CR382122
GeneID: 2897021
HOGENOM: Q6CWR0

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R1787 GL3R1787.F1 GL3R1787.N1
Orthologs by synteny: ZYRO0E07150g SAKL0A04114g KLTH0H03850g ERGO0F02992g

Protein KLLA0B02200p  


highly similar to uniprot|Q01159 Saccharomyces cerevisiae YGL130W CEG1 mRNA guanylyltransferase (mRNA capping enzyme), alpha subunit

Protein domain map

Protein length: 466 amino acids
Protein family: GL3R1787
Database cross references:
Gene3D: G3DSA:2.40.50.140
InterPro: IPR001339
InterPro: IPR012340
InterPro: IPR013846
InterPro: IPR017075
KEGG: kla:KLLA0B02200g
PIRSF: PIRSF036959
Pfam: PF01331
Pfam: PF03919
RefSeq: XP_451629.1
UniProtKB/Swiss-Prot: Q6CWR0
UniProtKB: MCE1_KLULA

Computed results for KLLA0B02200p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>KLLA0B02200g.nt
TTGTTCTTTCCTGGATATCTCTGGGTCTATAGTACCAATTCATTTTTAACTGAAAATAAA
TCGCAGGGAAACTAGTTGCCAAGTGTAATGAACGCGATATGAGTGAACGAAACTCCTTTT
TCGGCACAAACTCAACTATCCTACCAATATAGTAGGGTTCTCCAGGTGGTTCTGATACCA
TAAATATGTGCTCATCTTTTTTCAAGACAGCCGATCCGTCAGATAGTAGCAATTGCTTAG
AAACGTCAACGGTCATCTCATCGATATCCAACATGGTACTGAACCTTGCGTATTTCTTGA
ATGAGTTAGGTAGAGAAGGAATGAAATTCCACGGCGTGTTCTTGTCTTGCAAGAACGACT
GATACTTGAAACTACTCTTCTGTGACTTCTTTCCATTGTTCGCCTTCTTCTGAATGCCAT
TCTTCTTTATAGTTTCACCCTTTTCAGACTTCTGTTTCGAAGAGTTCGCCTTCTCTAATT
GTTTAATCTTCTTCGCCAACGCAATGTCGGCTTCCTTCTCATTATAACTCACATTCTTGG
TCGCAGTTCGCCTACGACTTGTGTACTCATTGCCATTTGAAATTATTATCTCATCTGGCG
AATTTTGTCCTGAGACGACAGTCATCACTAGCCGGAAGTAAGCTGATGCAAATTGCTGTT
AAATATCACAAGTGAATATCAATTCTTCAAATGCAACACTTTTGCTAGCAACTCCGGATG
TATCCTGTATTCCCTTTTGCACTATTTCTCTCTTGGTATGTTGTTGCTCCTCCTCATATT
GAATCCATATTCATATCCTGCATATAGCGTTTTTATATACATCTTTCGATATTGAAAATA
ATTGGAAAGCATTTTACAATTACTGAACCAGATGAAAATACGTAATAAGGTAAACAGCAA
TAGATTAAGGCAGCCTGAACTTAATGAACCAAGGCAATTGATATTGCATACGTGTGTTTT
GACTGTCCTTTTTTTTGCCTAAGTTTATAACTTAATTGCCATGGACAATAACAGAGTTGC
ACCAGAGATTCCTGGTCTAAGACAACCCGGACAAATTACTAATGATATTCGAATGTTGAT
GTGTAAGCTACTCAATTCAGCGAAACCCGCTAATACCTTTCCGGGCTCTCAACCTGTTTC
TTTCCATCTGGCTGATATTGAGGAGAAGTTACTTGCGCAAGACTATTACGTATGTGAAAA
GACAGATGGATTGCGTGCGTTAATGTTGATAATGGTGAATCCAGTTACAAAAGAGCAAGG
GTGTTTTATGATTGACCGTGAAAACAACTACTATATGGTCAATGGATTTAGATTTCCATG
TTTGCCCCGTGCCAATAAAAAAGAACTCCTAGAGACTTTGCAGGACGGTACGTTAATTGA
TGGTGAACTTGTCATGCAAACTAATCCAGTGACCAAGTTGAAGGAACTGAGGTACTTGAT
GTTCGACTGCCTTGCGGTAAATGGTCGTTCGTTAGTTCAATCGCCTACAAGTTCCCGTTT
GGCTCATCTTGGCAAAGAGTTTTTCAAACCTTATTACGATTTAAGATCGTATTTTCCAGA
TAGATGTTCCACTTTCCCATTCAAAATTTCAATGAAACATATGAATTTCAGTTATGATTT
GGCAAAAGTTGCAAAGACCTTGGATTCACTTCCACACGTATCTGATGGTCTAATTTTCAC
TCCAGTACAAGCGGCATATCATATTGGTGGTAAAGATAGCTACCTTTTGAAATGGAAACC
AGAAGTAGAAAATACTGTGGATTTCAAACTGATCATTGAACCTCCTGTAGTAGAAGACAA
ATCTTTACCTAAAAGCGATAAAAACAGGTTTTACTACAATTACGATGTTAAGCCACTTTT
CCATTTGTACGTTTGGCAAGGTGGTAATGACGTGAACAATAGAATACAGGACTTCGAACA
ACCCTTTACTAAGAGTGATTTGGAGCTTTTGGAGAGAACCTATAGAAAATTTGCAGAAAT
AGAAATCGACGACAAACAATGGAATGAGCTCAAGGCCATGGAGGAGCCATTGAACGGAAG
AATTGTAGAATGTTCCAAAGATCAGGAATCAGGTGCATGGAAACTACTTAGGTTCAGAGA
CGATAAACTCAATGGTAATCACGTATCAGTCGTTCAAAAAGTCTTGGAAAGTATCGGTGA
TTCTGTTTCATTGGATGATCTAGAACAAGTGGTAGATGAAATGAGATCTCGCTGGAAAGA
ACGTGAACAAGGACTGAAAAATGCACAAAAACAATTCAACCATCAGGCTTCTGCGAGATC
GTCGCTGTCGCAGCAACATTCAACCGAACCAGAGCAGTCGCAAGATCAACCAAAATATGT
AGACGATGACGATGATAATTGGTCTGATGACGAGCCAGACACAAAAAGACAGAAGATTTA
AGAATCTTGCGTCAAGTTAAAAGCATAACATATACACGAGCATATGTGCTCATATTATAT
TTCTTTCCTGTATTTACATGGGTTTTTTTGTAATATATAATATGTACAATTAGAATGGAT
ATAGTGTAATACTCTTGAGCAATTCCCTTGGATTACCATTACCACTTAAGAAGTACTTTT
CTTCGGTAAGCTGTTCGAAGGTTTTGTTCTGTAAATCTTTGTCAAGTACGATACCCGCCT
TTTGGTAGAACTCCATCAATTTGGCAACTTCATCTTTATTAAGTTTGGTAACTTCGAACT
C

Coding sequence    

>KLLA0B02200g.cds
ATGGACAATAACAGAGTTGCACCAGAGATTCCTGGTCTAAGACAACCCGGACAAATTACT
AATGATATTCGAATGTTGATGTGTAAGCTACTCAATTCAGCGAAACCCGCTAATACCTTT
CCGGGCTCTCAACCTGTTTCTTTCCATCTGGCTGATATTGAGGAGAAGTTACTTGCGCAA
GACTATTACGTATGTGAAAAGACAGATGGATTGCGTGCGTTAATGTTGATAATGGTGAAT
CCAGTTACAAAAGAGCAAGGGTGTTTTATGATTGACCGTGAAAACAACTACTATATGGTC
AATGGATTTAGATTTCCATGTTTGCCCCGTGCCAATAAAAAAGAACTCCTAGAGACTTTG
CAGGACGGTACGTTAATTGATGGTGAACTTGTCATGCAAACTAATCCAGTGACCAAGTTG
AAGGAACTGAGGTACTTGATGTTCGACTGCCTTGCGGTAAATGGTCGTTCGTTAGTTCAA
TCGCCTACAAGTTCCCGTTTGGCTCATCTTGGCAAAGAGTTTTTCAAACCTTATTACGAT
TTAAGATCGTATTTTCCAGATAGATGTTCCACTTTCCCATTCAAAATTTCAATGAAACAT
ATGAATTTCAGTTATGATTTGGCAAAAGTTGCAAAGACCTTGGATTCACTTCCACACGTA
TCTGATGGTCTAATTTTCACTCCAGTACAAGCGGCATATCATATTGGTGGTAAAGATAGC
TACCTTTTGAAATGGAAACCAGAAGTAGAAAATACTGTGGATTTCAAACTGATCATTGAA
CCTCCTGTAGTAGAAGACAAATCTTTACCTAAAAGCGATAAAAACAGGTTTTACTACAAT
TACGATGTTAAGCCACTTTTCCATTTGTACGTTTGGCAAGGTGGTAATGACGTGAACAAT
AGAATACAGGACTTCGAACAACCCTTTACTAAGAGTGATTTGGAGCTTTTGGAGAGAACC
TATAGAAAATTTGCAGAAATAGAAATCGACGACAAACAATGGAATGAGCTCAAGGCCATG
GAGGAGCCATTGAACGGAAGAATTGTAGAATGTTCCAAAGATCAGGAATCAGGTGCATGG
AAACTACTTAGGTTCAGAGACGATAAACTCAATGGTAATCACGTATCAGTCGTTCAAAAA
GTCTTGGAAAGTATCGGTGATTCTGTTTCATTGGATGATCTAGAACAAGTGGTAGATGAA
ATGAGATCTCGCTGGAAAGAACGTGAACAAGGACTGAAAAATGCACAAAAACAATTCAAC
CATCAGGCTTCTGCGAGATCGTCGCTGTCGCAGCAACATTCAACCGAACCAGAGCAGTCG
CAAGATCAACCAAAATATGTAGACGATGACGATGATAATTGGTCTGATGACGAGCCAGAC
ACAAAAAGACAGAAGATTTAA

Predicted translation product    

>KLLA0B02200g.aa
MDNNRVAPEIPGLRQPGQITNDIRMLMCKLLNSAKPANTFPGSQPVSFHLADIEEKLLAQ
DYYVCEKTDGLRALMLIMVNPVTKEQGCFMIDRENNYYMVNGFRFPCLPRANKKELLETL
QDGTLIDGELVMQTNPVTKLKELRYLMFDCLAVNGRSLVQSPTSSRLAHLGKEFFKPYYD
LRSYFPDRCSTFPFKISMKHMNFSYDLAKVAKTLDSLPHVSDGLIFTPVQAAYHIGGKDS
YLLKWKPEVENTVDFKLIIEPPVVEDKSLPKSDKNRFYYNYDVKPLFHLYVWQGGNDVNN
RIQDFEQPFTKSDLELLERTYRKFAEIEIDDKQWNELKAMEEPLNGRIVECSKDQESGAW
KLLRFRDDKLNGNHVSVVQKVLESIGDSVSLDDLEQVVDEMRSRWKEREQGLKNAQKQFN
HQASARSSLSQQHSTEPEQSQDQPKYVDDDDDNWSDDEPDTKRQKI*
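
The coding sequence and the predicted translation product can be cross-checked against each other. The following is a minimal Python sketch that translates a coding sequence codon by codon. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and the function name `translate` is illustrative only; it is not part of any tool used to build this page.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: this table is appropriate for the gene shown on this page.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; stop codons are written as '*'."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the KLLA0B02200g.cds record above.
first_line = "ATGGACAATAACAGAGTTGCACCAGAGATTCCTGGTCTAAGACAACCCGGACAAATTACT"
last_line = "ACAAAAAGACAGAAGATTTAA"

print(translate(first_line))  # MDNNRVAPEIPGLRQPGQIT
print(translate(last_line))   # TKRQKI*
```

Both outputs agree with the start and the end of the KLLA0B02200g.aa record, including the trailing `*` for the stop codon.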




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
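
The arithmetic behind this note can be sketched in a few lines of Python. The helper name `cds_lengths` is hypothetical, and the sketch assumes that the coordinates are 1-based and inclusive and that the element is a single uninterrupted coding sequence ending in a stop codon.

```python
from typing import Tuple


def cds_lengths(start: int, end: int) -> Tuple[int, int, int]:
    """Return (nucleotides, codons, residues) for 1-based inclusive coordinates."""
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = nucleotides // 3   # codon count includes the stop codon
    residues = codons - 1       # the stop codon encodes no amino acid
    return nucleotides, codons, residues


# Coordinates of KLLA0B02200g on Klla0B, as listed on this page.
print(cds_lengths(196341, 197741))  # (1401, 467, 466)
```

The result matches the element length (1401 nucleotides), the coding sequence length (467 codons) and the protein length (466 amino acids) reported above.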

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
The Genemark computation of the protein-coding potential of DNA was run on a window extending from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
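
A window of this kind can be split back into its three parts with plain string slicing. The sketch below is illustrative only: `split_window` is a hypothetical helper, not part of Genemark, and it assumes the flank sizes stated in this note (1000 nucleotides upstream, 300 nucleotides downstream). For the element on this page such a window would span 1000 + 1401 + 300 = 2701 nucleotides.

```python
from typing import Tuple


def split_window(window: str, upstream: int = 1000,
                 downstream: int = 300) -> Tuple[str, str, str]:
    """Split a flanked sequence into (upstream flank, ORF, downstream flank)."""
    if len(window) < upstream + downstream:
        raise ValueError("window is shorter than the two flanks combined")
    orf_end = len(window) - downstream
    return window[:upstream], window[upstream:orf_end], window[orf_end:]


# Toy example with short flanks so the result is easy to read.
up, orf, down = split_window("aa" + "ATGTAA" + "c", upstream=2, downstream=1)
print(up, orf, down)  # aa ATGTAA c
```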

Sequences
Color | Nucleotide sequence and Coding sequence | Predicted translation product
RED | start and stop codons | Initial methionine and sequence end
BLUE | coding sequence | protein sequence
grey | non-coding sequence (upstream, downstream or intron) |
grey | donor and acceptor splicing sites |