KLLA0C16049g


similar to uniprot|O13297 Saccharomyces cerevisiae YPL228W CET1 (ohnolog of YMR180C) Beta (RNA 5'-triphosphatase) subunit of the mRNA capping enzyme, a heterodimer (the other subunit is CEG1, a guanylyltransferase) involved in adding the 5' cap to mRNA

Genomic environment map

Element type: CDS
Element length: 1671 nucleotides,
on sense strand of
Klla0C: 1399865..1401535.
Other names:
KLLA-ORF7130
Coding sequence: 557 codons.
Database cross references:
EMBL: CR382123
GeneID: 2892078
HOGENOM: Q6CT22

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0218 GL3C0218.F1 GL3C0218.N2
Orthologs by synteny: ZYRO0E07700g SAKL0A03520g KLTH0H03234g ERGO0F02398g

Protein KLLA0C16049p  


similar to uniprot|O13297 Saccharomyces cerevisiae YPL228W CET1 Interacts with Ceg1p the mRNA capping enzyme alpha subunit removes gamma-phosphate from triphosphate- terminated RNA mRNA capping enzyme beta subunit RNA 5'- triphosphatase

Protein domain map

Protein length: 556 amino acids
Protein family: GL3C0218
Database cross references:
InterPro: IPR004206
KEGG: kla:KLLA0C16049g
Pfam: PF02940
RefSeq: XP_452917.1
SMR: Q6CT22
UniProtKB/Swiss-Prot: Q6CT22
UniprotKB: CET1_KLULA

Computed results for KLLA0C16049p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>KLLA0C16049g.nt
GACCAAATGGACAAAACAAAGTAAAGATATCCAGAATAACACTTTAAACAATGTCTAATA
CGACACCATGGCAGTATACAAGTACGGGGTCTCCGCAGACTCCAAGTGCAGTGAGTGCGA
ATTTATTTGGAAATAGTATGAGTCTAAGTTCAACAAGCAGTACAAGCAGTTCAAGCGTTG
GGAATAATAACAAGAGTAGTATGTCATCTGGCCAGAATATTGGCATGCTGGGACCGGCTC
CTGGTATGCTGAATATGAATATGAACATACCTATGGGTGTTGATTCTACGTCCAGGCTGG
GATCTGGAGCAGGAACTGGAGTCGGATCTTTGATGTACAGTTTACCTCAGGAATCGCTTC
TCAGTAAGAATATGAGCAAATCGTTCGAGGACGATTTATTCTTTTGTCCAAGGTCATTAT
TATCTAAGGAAGAGCTTTCGCAGTGCTCGCAGATAGATTTGTTGTACAACCAACACCAGC
AACAAATACATCAGAACCATCATCAACAACTGCCCCTACCAGTTCCTTTACAGCAACAAC
AGAACCACCAGCAACAGCAACAGAGACCACGTTTCAATCCATATACTTCCCAAAGCTTCA
ACCCTGGTGCAGTTTGAATTCTCGGGCTCCTTTGCAGCAGCAGTTCCTCGGCTGCTGCCT
GTGAGATACCCCCCACATACCGGGGATGTTGGTTTAGTCGGATGATTATATGAAATGCAA
GTCATGTGACTTGAATTTCTTCTTGTTACCCGGTCCCATGGACAATCCCTGGCATGAAGT
AGCACTTTTCCACAATTTATCAACGTACATCATTATAGATACCAATGCATGTATGTATTT
ATGTACGTGTGTTCACATGTATATGAACGTCAGTTATATACTTTATATTACCCCCCATTT
AATCAAGTTAGTTTATTTCGGTTTATTCAGTGTTCAAGCTAAGTAGGTGCAAAATCAGGC
TGCCAATTCAATTCACAGTTTAAAGCCAACTCCACTAATAATGAAGCCTAGCAGAGGATT
GTCACTCACCGATTTGGTCAACCATGATGACGCTCCGCCCTTGAATAATGGAGATAACAA
TGTAAAACAAGAAGAAGTGCAAGCTGAGAATCTCGCTACAAATCCAGCTTCTGTTTTGGC
TCCAGGCCCAGTCATTGTTGATACTTTACCACCAGTAGAAGCTCCTGTTAGTACTTCCGA
TACTGGTAATACTAGTCATACTGGAGCAGCACCTCAGACAGCAGTAACAGCTGAGAGTGA
TGAGACTGATACGGACGATGAACCGGGGGAAATTGTGTTTGAAAATACTAAATTTAGGTT
CGATGACGAAGAACAACAACCACAGAAAGATAAACTGGTAAAGGGCACTTCTTCGGAGAA
GAAGGACAAGCAAGTAAACGCCGCAACTAAGGAGATTCAGCTGGACTCAAAACCCGTCAA
AGAGCAATCTCCAAAAACAAAAGATGAAGGAGATATTCCCGTAGAGGATGCGGATAAAGC
TGATAATAAAGTGGAAACGCAAAAAAAGGAAAACGGGATCAAACAAGAAGTCGAACCAGT
GGAACAAGATGGTAAGAAATCAGTGGAACCTGCCAAGCAATCCAAAGAGGATTCGAAAAA
GGAGAAGGACATCTTTCAGCAAAAGACAAGTAATGCTTCTGTGAAGAACAACATCAAGAA
GGACTTGAAGATCCTAAGTGAACTCTCTTCCTCTTCGCTACCTAAAAGATACAATGTTCC
GCCTATCTGGGCAAGAAAATGGAAACCTACTGTCAAGGCTTTACAAGCCATTGATTCTTC
AAATCTTAAACTTGACGACTCTATTTTAGGATTTATTCCAGAAGATGACTTGACAAAATC
CGTCCAAGATTGGATTTATGCTACGTTGATTGCAGTAGAACCGGAACTAAGGCAATTCAT
TGAAGTGGAAATGAAATACGGTCTTATCATTGATCCATCGACCTCTAATCGTGTTAACCC
TCCGGTGTCCTCTCAATGCGTTTTCACAGATCTTGACTCCACAATGAAGCCGGATGTTGA
TGAAAGAGTTTTTGATGAGTTCAACAGATACATAAAGAACTTATCTGAATTAAATGAAAA
TATGGGAAAGTTTAATATAATAGATTCTCATGCCTCAGACTTGAGCTATAGAGTAAGAAC
TCATACGGAAAGGCCGAAGTTTTTGAGAATGACAAGAGATGTTAACACCGGAAGAATTGC
ACAATTTATTGAAAAACGGAAAATTTCACAGATTTTATTGTACTCCCCAAAGGATAGTTA
TGATACCAAGATTTCAATCAGTTTGGAACTGCCTGTACCCGAAAATGATCCACCTGAAAA
GTACAAGAACCATACTCCAACAGGTCATCGTTTAAAGAAACGTACCAGTTACATCCATAA
TGACTCTTGTACCAGATTTGATATCACTAGGGTGGAAAATAAACCTATTAGAGTCAATAA
CAAAAATGAAAAAGAACCTGAATCTGATACAACGTACGAAGTTGAACTGGAAATCAATAC
ACCTGCCCTCTTAAATGCTTTTGATAACATTCAACATGATAGTAAAGAATATGCAGCTAT
CGTGAGAACCTTTTTGAATAACGGTACCATTGTAAGAAGGAAGCTTTCATCTTTATCCTA
TGACATTTACAAAGGTTCGAATAAGCTTTGAATCCCATAATCACCTACTATGTTACTTTC
AGATATTCTATAGTTCTATCTATCTCATACCAGTATAAAAGCTTGGGTTGTACTCAATGT
TTCATGAAAGTAATCTGATAGAATATATCCTTACTTGCGCTTTATCTTCGCGCAAATTCT
ATGAAGCTTATTTATCGCTATATATGCCTAGTATATATGCTAACCTTATGATGACTAAAT
CCTTGGCCATGTTAATACTGTCACGAGCCAAATCCATTTTGGAACCGTCAACTTCATGCC
ATGAGATCGGTAGCTCATTGATTGGAATATT

Coding sequence    

>KLLA0C16049g.cds
ATGAAGCCTAGCAGAGGATTGTCACTCACCGATTTGGTCAACCATGATGACGCTCCGCCC
TTGAATAATGGAGATAACAATGTAAAACAAGAAGAAGTGCAAGCTGAGAATCTCGCTACA
AATCCAGCTTCTGTTTTGGCTCCAGGCCCAGTCATTGTTGATACTTTACCACCAGTAGAA
GCTCCTGTTAGTACTTCCGATACTGGTAATACTAGTCATACTGGAGCAGCACCTCAGACA
GCAGTAACAGCTGAGAGTGATGAGACTGATACGGACGATGAACCGGGGGAAATTGTGTTT
GAAAATACTAAATTTAGGTTCGATGACGAAGAACAACAACCACAGAAAGATAAACTGGTA
AAGGGCACTTCTTCGGAGAAGAAGGACAAGCAAGTAAACGCCGCAACTAAGGAGATTCAG
CTGGACTCAAAACCCGTCAAAGAGCAATCTCCAAAAACAAAAGATGAAGGAGATATTCCC
GTAGAGGATGCGGATAAAGCTGATAATAAAGTGGAAACGCAAAAAAAGGAAAACGGGATC
AAACAAGAAGTCGAACCAGTGGAACAAGATGGTAAGAAATCAGTGGAACCTGCCAAGCAA
TCCAAAGAGGATTCGAAAAAGGAGAAGGACATCTTTCAGCAAAAGACAAGTAATGCTTCT
GTGAAGAACAACATCAAGAAGGACTTGAAGATCCTAAGTGAACTCTCTTCCTCTTCGCTA
CCTAAAAGATACAATGTTCCGCCTATCTGGGCAAGAAAATGGAAACCTACTGTCAAGGCT
TTACAAGCCATTGATTCTTCAAATCTTAAACTTGACGACTCTATTTTAGGATTTATTCCA
GAAGATGACTTGACAAAATCCGTCCAAGATTGGATTTATGCTACGTTGATTGCAGTAGAA
CCGGAACTAAGGCAATTCATTGAAGTGGAAATGAAATACGGTCTTATCATTGATCCATCG
ACCTCTAATCGTGTTAACCCTCCGGTGTCCTCTCAATGCGTTTTCACAGATCTTGACTCC
ACAATGAAGCCGGATGTTGATGAAAGAGTTTTTGATGAGTTCAACAGATACATAAAGAAC
TTATCTGAATTAAATGAAAATATGGGAAAGTTTAATATAATAGATTCTCATGCCTCAGAC
TTGAGCTATAGAGTAAGAACTCATACGGAAAGGCCGAAGTTTTTGAGAATGACAAGAGAT
GTTAACACCGGAAGAATTGCACAATTTATTGAAAAACGGAAAATTTCACAGATTTTATTG
TACTCCCCAAAGGATAGTTATGATACCAAGATTTCAATCAGTTTGGAACTGCCTGTACCC
GAAAATGATCCACCTGAAAAGTACAAGAACCATACTCCAACAGGTCATCGTTTAAAGAAA
CGTACCAGTTACATCCATAATGACTCTTGTACCAGATTTGATATCACTAGGGTGGAAAAT
AAACCTATTAGAGTCAATAACAAAAATGAAAAAGAACCTGAATCTGATACAACGTACGAA
GTTGAACTGGAAATCAATACACCTGCCCTCTTAAATGCTTTTGATAACATTCAACATGAT
AGTAAAGAATATGCAGCTATCGTGAGAACCTTTTTGAATAACGGTACCATTGTAAGAAGG
AAGCTTTCATCTTTATCCTATGACATTTACAAAGGTTCGAATAAGCTTTGA

Predicted translation product    

>KLLA0C16049g.aa
MKPSRGLSLTDLVNHDDAPPLNNGDNNVKQEEVQAENLATNPASVLAPGPVIVDTLPPVE
APVSTSDTGNTSHTGAAPQTAVTAESDETDTDDEPGEIVFENTKFRFDDEEQQPQKDKLV
KGTSSEKKDKQVNAATKEIQLDSKPVKEQSPKTKDEGDIPVEDADKADNKVETQKKENGI
KQEVEPVEQDGKKSVEPAKQSKEDSKKEKDIFQQKTSNASVKNNIKKDLKILSELSSSSL
PKRYNVPPIWARKWKPTVKALQAIDSSNLKLDDSILGFIPEDDLTKSVQDWIYATLIAVE
PELRQFIEVEMKYGLIIDPSTSNRVNPPVSSQCVFTDLDSTMKPDVDERVFDEFNRYIKN
LSELNENMGKFNIIDSHASDLSYRVRTHTERPKFLRMTRDVNTGRIAQFIEKRKISQILL
YSPKDSYDTKISISLELPVPENDPPEKYKNHTPTGHRLKKRTSYIHNDSCTRFDITRVEN
KPIRVNNKNEKEPESDTTYEVELEINTPALLNAFDNIQHDSKEYAAIVRTFLNNGTIVRR
KLSSLSYDIYKGSNKL*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites