KLTH0E12276g
weakly similar to uniprot|Q02770 Saccharomyces cerevisiae YPL064C CWC27 Component of a complex containing Cef1p, putatively involved in pre-mRNA splicing
Element type: CDS
Element length: 978 nucleotides,
on sense strand of
Klth0E: 1091241..1092218.
Other names:
KLTH-ORF8560
Coding sequence: 326 codons.
Element length: 978 nucleotides,
on sense strand of
Klth0E: 1091241..1092218.
Other names:
KLTH-ORF8560
Coding sequence: 326 codons.
Homologs and Orthologs
Homologs in protein families: GL3R3437 GL3R3437.F1 GL3R3437.N1Orthologs by synteny: ZYRO0F09152g SAKL0H10318g ERGO0D10384g
Protein domain map
Sequence data 
>KLTH0E12276g.nt ACAAATCCTTGATGTGAACGCCGTCCTTGTACAGGCAGTGCTCCTTGAAAAGGTTGTAGG TGATAAACGCGTGGATGGGGTCAAGCTTCTCCGCGACCTTGTCAGAGTACATCATGTAGT TAGAGGAAAAAAGAACAATTTCGTAGTATTGCGAGAGGTAGCCCAAGAAGTAGTCAACGC CAGGCCTCTTGGCAGTTCTCCAACCATGCTCCTTGGTCCACTCGGAGTGCACCAGCAAAT CTTCCAAGCTGAGAACTAGAGTTAGAGGTCTCTGGTATGGAGGTGGTGGAGGGGGAGGGA GCAAGTCTGGGAAAGGAGGTTCCTGGAAATATGAGAAAATAGAGTTAAAACGCAGCTTGA AGCGCTGGTACATGAGACCTGGGGTGTACCCATTGTCAGCCCCCTTTTTCAGCTCCTCTG GCTCGTCATCGTCCCAATCTCTGGCCATATATGCGCCAGCGCCAGCCAATCCAGCGAAAG ATGCAATGTAAAACCAATTAGCATATCTCTCTCTTTTAATGTCCGCCGACGTTTGTCTCT TTCTTTTGCGTTTGCCGCCTGCCTCGCTAGTCTGAGGTTCTTCAGTTTCTGCGCCTGCGC CCTGCTCCTTCTTGGAACCATCAGCGTTGGGCTCTTCGGTTTCAACGCCAGCACGGGCCA GCATGTCCTCCGTTAGAATGGATTTCGCCTTCTCCCCCGGCTTCTTTGCGTCAAGGACAC TTTTGGCGCTGGAAAACGGCCGTATCAATGAAGATCTGGCAACAGAAGCATTTCTCAGCT GTAATCTAACATTTGTGCGAACCGAATGCCTCAAAATTGTCAGCATTGTCTTGCTTACTT GACGCTCGCACCGCTTGAACTTACAGACAGAACTTAGAATACTGATAGGATTTCCACTAT AAAGTCTTGAAAAAATTTTCGAATCATACCAGATGATGTAGATCTTTAGTCGCAGAGCAT ATCGCACGGAGGTACACCTTTGAGGCTAGATATCTTCACCATGTCAGGTGGACTTGAACC TAGTACGACTGCAAGATGCACCATCCTAACGTCGGAAGGCATCTTGAATGTTGAGTTGTG GGCGAAGGAGTTTCCTAAAACAACGCGCCGTTTTCTGGAGAATTGCATTAAGGGTCTGTA CGATGGTGTCCCCTTTATCGAGAAGCCAGGCGGAACAATAATCTTAACTGGAGAGATTGG ATGCGATCCTGTAGGTAGCGTTGAGAGCAACACTAGAGTGAGGTTCGATCGTAGGGGCCT CTTGGCGAGCTTCCCTAGCTCGAAAGGTGCATTTGCTATAACGTTGTGTGATAACTCCCA CTTGGAGGGTAAGGCAACGGTTTTTGGGAAGCTCGTAGATAGCACTTACTACAGTGTTCT CAGGATCTGTGGCAAGGAGCTGAAGGCTGACAGTGACGAGTTTTTATACCCTGCATGGGT CAAAAGTATTAGTGTAGAAGAGCCTTACTTCAAGGACCTAATCGGTCACGAAGCAAGCGC GCCAAAAACAATAAGCGATGCGGTTAAGCCCCAACGGAAGAGACCGGTAAAAAAGCGCGT GCGTTTGGAGTACGAAGAAGAAGGCGATGATCAAGAAGACGCCCTTTCAAACATCAAAAT CAGGGCGGCACACGACCTTTTGAATGATAAACGGCTTGTCAGGGAGACGCCGCCCTTGGG AAACTTGGGCTCTGCCGAATCCAGACTGCCTCAAAAATCATCCAAAAATACAGCCTCCTC CAACGTTGAACATGAAGCGTCGCCCAGCGATGCTAAAAAAGTAGGGCACAGCCCAGACGA CGGTTCTGGTTCTGGTTCTCAGAAGCCGCTTGACGTTACCCAGGCGGAACCTGAAGGCCA TGATCTAACGCAAAGAGAGCGGGAGACACTAAAACTCCTGGAGCAATTCAAAAAAACGTC TTCCAAAAACAAGCAATTTGCGTCACACCAACTTAATTTCGGCGGTCAAGCTCCTTAGTC CCGTTGTATCTAGCGATTTACTCCAGCAACGTATAGAAGCTTTTATAGGCGGTATCTACG CTGAAAAGCAACTCGCTTGCTTCATCTTCTGACAGCTTTTCTGCCGCTCTCATCTTGTTG ATCTTCACAATCCACTCCACCAGGTCCCTTCTATTATCGAAGTCTTGCGAAGTTACTCTG TTTATGCTCAGCAAAAGCTCGGACATGAGGGGATGTAACTGGTCTTTCGCGCGGTAATTG AGCTTCAGCGCGTCCATCACTGTGATGAAGTTTCCTGTTGCCTCTGCTACGGCCTTCC
>KLTH0E12276g.cds ATGTCAGGTGGACTTGAACCTAGTACGACTGCAAGATGCACCATCCTAACGTCGGAAGGC ATCTTGAATGTTGAGTTGTGGGCGAAGGAGTTTCCTAAAACAACGCGCCGTTTTCTGGAG AATTGCATTAAGGGTCTGTACGATGGTGTCCCCTTTATCGAGAAGCCAGGCGGAACAATA ATCTTAACTGGAGAGATTGGATGCGATCCTGTAGGTAGCGTTGAGAGCAACACTAGAGTG AGGTTCGATCGTAGGGGCCTCTTGGCGAGCTTCCCTAGCTCGAAAGGTGCATTTGCTATA ACGTTGTGTGATAACTCCCACTTGGAGGGTAAGGCAACGGTTTTTGGGAAGCTCGTAGAT AGCACTTACTACAGTGTTCTCAGGATCTGTGGCAAGGAGCTGAAGGCTGACAGTGACGAG TTTTTATACCCTGCATGGGTCAAAAGTATTAGTGTAGAAGAGCCTTACTTCAAGGACCTA ATCGGTCACGAAGCAAGCGCGCCAAAAACAATAAGCGATGCGGTTAAGCCCCAACGGAAG AGACCGGTAAAAAAGCGCGTGCGTTTGGAGTACGAAGAAGAAGGCGATGATCAAGAAGAC GCCCTTTCAAACATCAAAATCAGGGCGGCACACGACCTTTTGAATGATAAACGGCTTGTC AGGGAGACGCCGCCCTTGGGAAACTTGGGCTCTGCCGAATCCAGACTGCCTCAAAAATCA TCCAAAAATACAGCCTCCTCCAACGTTGAACATGAAGCGTCGCCCAGCGATGCTAAAAAA GTAGGGCACAGCCCAGACGACGGTTCTGGTTCTGGTTCTCAGAAGCCGCTTGACGTTACC CAGGCGGAACCTGAAGGCCATGATCTAACGCAAAGAGAGCGGGAGACACTAAAACTCCTG GAGCAATTCAAAAAAACGTCTTCCAAAAACAAGCAATTTGCGTCACACCAACTTAATTTC GGCGGTCAAGCTCCTTAG
>KLTH0E12276g.aa MSGGLEPSTTARCTILTSEGILNVELWAKEFPKTTRRFLENCIKGLYDGVPFIEKPGGTI ILTGEIGCDPVGSVESNTRVRFDRRGLLASFPSSKGAFAITLCDNSHLEGKATVFGKLVD STYYSVLRICGKELKADSDEFLYPAWVKSISVEEPYFKDLIGHEASAPKTISDAVKPQRK RPVKKRVRLEYEEEGDDQEDALSNIKIRAAHDLLNDKRLVRETPPLGNLGSAESRLPQKS SKNTASSNVEHEASPSDAKKVGHSPDDGSGSGSQKPLDVTQAEPEGHDLTQRERETLKLL EQFKKTSSKNKQFASHQLNFGGQAP*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/KLTH/KLTH0E12276p