KLTH0E10538g
similar to uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway
Element type: CDS
Element length: 1692 nucleotides, on sense strand of Klth0E: 948937..950628.
Other names: KLTH-ORF8812
Coding sequence: 564 codons.
Homologs and Orthologs
Homologs in protein families: GL3C0100 GL3C0100.F2 GL3C0100.N2
Orthologs by synteny: ZYRO0A10032g SAKL0H12364g KLLA0C15059g ERGO0B02486g
Protein KLTH0E10538p 
similar to uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway; an N-terminal signal sequence is required for localization to the mitochondrial matrix; expression is regulated by Hap2p-Hap3p
Protein domain map
Sequence data 
>KLTH0E10538g.nt CTGGAGGCCGCCGGTATCTCGCGCGACGCCACACGCGCCTTACAGTACTACCACCGCTGC GCGACGCTGTGCTCCTATCCGCTGGCGCAGTGGCGCCTGGGCCAGTGCTACGAGTTCGGC CAACTGAATCTACCAGTGGCTGCCAACAAGTCTATAGCCTGGTACGCGAAAGCCGCGATG GCTCAACCGCGCGGCAACCCTATGGCTATGATGGCTCTGAGCGGCTGGTACCTCACAGGC GCTACAGGCGTGCTAAAGCCCAACAATCGCGAAGCATACAACTGGGCGCGCAAAGCCTGC CAGGCTGCGGACGGCAAACTCGCTCGAGCAGAATATGCATTGGCTTTTTACTCCGAGAAT GGCATCGGCTGTACGCCGAGCCTGGCGGAAGCGCGCGAACACTATGAGCGCGCAGCAGCA GCAGGCCACGCGAAGGCCCGGGACCGCCTGCGAGCCGGCGACCTGTAGCACACGGGGGTG GGGTTGACGGACCCGCAATTTTAAAAACAATAATTCACAGCGACCTGTCTCGCGGACGAG AGGAATCCAGATCCCGAAGAGACGGCCACCGAGACGTCAGCTGGCCTCCAGTCGCTCAGT AGCCTAGCATATGCGAATAACGTCGACCCACCAAATACAAAGCCACGTGACACACCACTA CAAGGTCACATGCTCACACAACTCATCACCTAAGCGATGTAACTGGCGTCATGTGATGAT GTCCGTGCGTGCGACGCGATATGCACCGCTCCTTCCCAAAAAAAACAATCGCCAATCAAA TTTCACGACGGTTTCTTTTGAAGGAAACCTAGAAATTTTTCAATGGGCCGAATATATAAT CTAGGGAATGACCGAGATGGTCATCGGAGCTGCGCGTCAGAGAACGAATCTCAAAGACCA GACAGTTGTCAGCCTGTGAAGGAGTAAGATATCCCCAAACACTGGTTTATACCAACAAGA TTTGAAGCGAGCTGAAACGAGGGCAGAACTGTATTAAACAATGGATTCTATCGCACGGCA GTCAGCAAAATTATGCCCGTTCGTGAGCAGGGCCACGTCTTCCTTGAGGAACGTGCAGAC CTTGCAGCAGGGAAACATTAGCGCAATGGCGCAGCGCTGCCCATTTATGGGCCAGGCGAT GCAGAAGGCGCACTTGACTACTTCTGCCGCAGCGGGCGCTGCAGCGGCCGCACCCGCGGC GCAGGAGGCTTCTCGCGCTGCTGGGAAGGATCAGAGCTCAGTGATGGCGGATCGGGCAAC GCAGGAGGCCAAATTTGACTACGAGGGCCTTTTTGAGCAGGATCTCCAGAAGAAGCGGCT TGACAAGTCCTACCGGTTCTTCAACAATATCAACCGGCTCGCCAAGGAGTTCCCTATGGC TCACCGCCAGCAGGAGGAGGACAAGGTCACTGTTTGGTGCTCTAACGACTATCTTGCGCT GTCCAAAAACCAGCAGGTGGTGGACGTCATGAAGCGCACCCTCGACAAGTACGGGGCAGG CGCGGGCGGTACCCGTAACATTGCTGGCCATAACCAGCACGCTTTGAGACTGGAGGCGGA GGTCGCGGCTCTCCACAAGAAGGAGGGCGCACTCGTGTTTTCCTCGTGTTTCGTGGCGAA CGATGCCGTCATCTCACTTCTCGGCCAAAAGCTCAAGGACCTCGTGATCTTCTCAGACGA GCTCAACCACGCTTCAATGATCGTGGGCATCAAGCATGCCTCGACCACCAAGCACATCTT CAAGCACAACGACTTGGAGCATCTTGAGGAGCTCTTGGCGATGTACCCCAAGTCCACCCC CAAGCTCATTGCCTTTGAATCTGTTTACTCCATGTCCGGGTCTGTCGCCGACATCAACAA GATTTGCGATCTGGCCGAGAAGTATGGCGCTCTTACCTTCCTAGATGAGGTTCACGCTGT 
GGGCCTCTACGGCCCTCATGGTGCTGGTGTTGCCGAGCACTGTGACTTCGAGTCTCATCG TTCTTCCGGTATTGCCTCACCCGCCCACCAGACAGTTATGGACCGTGTTGATATGATCAC TGGTACTCTCGGTAAGTCTTTTGGTACTGTCGGTGGCTACGTCGCCGGCTCTCTAAGATT GATTGACTGGCTTAGATCTTACGCCCCAGGCTTTATTTTTACGACTTCATTACCTCCTTC TGTCATGGCGGGTGCTGCCGAAGCTATCCGTTACCAACGCTCTCACTTGGACCTAAGACA GGCCCAGCAGAGACACACTGCTTACGTCAAGCAAGGATTGGCCGACTTGGACATCCCTGT AATTCCTAACCCATCGCACATTGTTCCTGTGTTGGTCGGAAACCCCGACCTTGCTAGACA GGCCTCCGAAATCTTGATGGACAAGCACCGCATCTACGTGCAGGCTATTAACTTCCCAAC AGTCGCTCGGGGAACGGAGAGATTGAGAATCACTCCTACCCCAGGCCACACCGACGATCT GTCAGACATTCTCTTGGACGCAGTCGATGATGTTTTCAACACTTTGCAGCTACCCCGTGT CAAGGACTGGAAGAGGCAGGGTGGTTTGTTGGGCGTTGGCCAGTCAGACTACATGGCTGA GCCCAATTTGTGGACTGAGGACCAGCTCCAGCTGAGCAACGATGATCTACACCCCAACGT TAGAGAGCCAATTATCGATCAGCTTGAAGTGTCTTCTGGGATTCGGTACTAGAAAAAGAG ATTATTTATGAGCTTTTCCCGTCCTTTTCTATTTATCCTTTCACGCGCAGCCCGCTTTGA TACTCTACAACAAAGCTCAAAAAGAGTGCTTAGTATTAGAGCTATATTGAGCCTGAAGGT CCTCGCGAAACCACTGCTCATTGTGACAATAACAACCAGCATCGATTTGTTTTTTATAAT CAACACGTTGCAGCCTTGTTTATATAAAAATGCCTAAGATATTCTTATTTTAAGGAGTAA TGCTATAAAGCACATGCCAATATTTCACCGTAGACATTTCGAATATAAAAAT
>KLTH0E10538g.cds ATGGATTCTATCGCACGGCAGTCAGCAAAATTATGCCCGTTCGTGAGCAGGGCCACGTCT TCCTTGAGGAACGTGCAGACCTTGCAGCAGGGAAACATTAGCGCAATGGCGCAGCGCTGC CCATTTATGGGCCAGGCGATGCAGAAGGCGCACTTGACTACTTCTGCCGCAGCGGGCGCT GCAGCGGCCGCACCCGCGGCGCAGGAGGCTTCTCGCGCTGCTGGGAAGGATCAGAGCTCA GTGATGGCGGATCGGGCAACGCAGGAGGCCAAATTTGACTACGAGGGCCTTTTTGAGCAG GATCTCCAGAAGAAGCGGCTTGACAAGTCCTACCGGTTCTTCAACAATATCAACCGGCTC GCCAAGGAGTTCCCTATGGCTCACCGCCAGCAGGAGGAGGACAAGGTCACTGTTTGGTGC TCTAACGACTATCTTGCGCTGTCCAAAAACCAGCAGGTGGTGGACGTCATGAAGCGCACC CTCGACAAGTACGGGGCAGGCGCGGGCGGTACCCGTAACATTGCTGGCCATAACCAGCAC GCTTTGAGACTGGAGGCGGAGGTCGCGGCTCTCCACAAGAAGGAGGGCGCACTCGTGTTT TCCTCGTGTTTCGTGGCGAACGATGCCGTCATCTCACTTCTCGGCCAAAAGCTCAAGGAC CTCGTGATCTTCTCAGACGAGCTCAACCACGCTTCAATGATCGTGGGCATCAAGCATGCC TCGACCACCAAGCACATCTTCAAGCACAACGACTTGGAGCATCTTGAGGAGCTCTTGGCG ATGTACCCCAAGTCCACCCCCAAGCTCATTGCCTTTGAATCTGTTTACTCCATGTCCGGG TCTGTCGCCGACATCAACAAGATTTGCGATCTGGCCGAGAAGTATGGCGCTCTTACCTTC CTAGATGAGGTTCACGCTGTGGGCCTCTACGGCCCTCATGGTGCTGGTGTTGCCGAGCAC TGTGACTTCGAGTCTCATCGTTCTTCCGGTATTGCCTCACCCGCCCACCAGACAGTTATG GACCGTGTTGATATGATCACTGGTACTCTCGGTAAGTCTTTTGGTACTGTCGGTGGCTAC GTCGCCGGCTCTCTAAGATTGATTGACTGGCTTAGATCTTACGCCCCAGGCTTTATTTTT ACGACTTCATTACCTCCTTCTGTCATGGCGGGTGCTGCCGAAGCTATCCGTTACCAACGC TCTCACTTGGACCTAAGACAGGCCCAGCAGAGACACACTGCTTACGTCAAGCAAGGATTG GCCGACTTGGACATCCCTGTAATTCCTAACCCATCGCACATTGTTCCTGTGTTGGTCGGA AACCCCGACCTTGCTAGACAGGCCTCCGAAATCTTGATGGACAAGCACCGCATCTACGTG CAGGCTATTAACTTCCCAACAGTCGCTCGGGGAACGGAGAGATTGAGAATCACTCCTACC CCAGGCCACACCGACGATCTGTCAGACATTCTCTTGGACGCAGTCGATGATGTTTTCAAC ACTTTGCAGCTACCCCGTGTCAAGGACTGGAAGAGGCAGGGTGGTTTGTTGGGCGTTGGC CAGTCAGACTACATGGCTGAGCCCAATTTGTGGACTGAGGACCAGCTCCAGCTGAGCAAC GATGATCTACACCCCAACGTTAGAGAGCCAATTATCGATCAGCTTGAAGTGTCTTCTGGG ATTCGGTACTAG
>KLTH0E10538g.aa MDSIARQSAKLCPFVSRATSSLRNVQTLQQGNISAMAQRCPFMGQAMQKAHLTTSAAAGA AAAAPAAQEASRAAGKDQSSVMADRATQEAKFDYEGLFEQDLQKKRLDKSYRFFNNINRL AKEFPMAHRQQEEDKVTVWCSNDYLALSKNQQVVDVMKRTLDKYGAGAGGTRNIAGHNQH ALRLEAEVAALHKKEGALVFSSCFVANDAVISLLGQKLKDLVIFSDELNHASMIVGIKHA STTKHIFKHNDLEHLEELLAMYPKSTPKLIAFESVYSMSGSVADINKICDLAEKYGALTF LDEVHAVGLYGPHGAGVAEHCDFESHRSSGIASPAHQTVMDRVDMITGTLGKSFGTVGGY VAGSLRLIDWLRSYAPGFIFTTSLPPSVMAGAAEAIRYQRSHLDLRQAQQRHTAYVKQGL ADLDIPVIPNPSHIVPVLVGNPDLARQASEILMDKHRIYVQAINFPTVARGTERLRITPT PGHTDDLSDILLDAVDDVFNTLQLPRVKDWKRQGGLLGVGQSDYMAEPNLWTEDQLQLSN DDLHPNVREPIIDQLEVSSGIRY*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
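As a quick check of this convention, the arithmetic can be sketched for this entry. The coordinates are taken from the element description above; the variable names are illustrative only and are not part of the database.

```python
# Coordinates of KLTH0E10538g on Klth0E, as listed in this entry
# (1-based, inclusive). Variable names are illustrative.
start, end = 948937, 950628

# An inclusive interval contains end - start + 1 nucleotides.
element_length = end - start + 1

# A coding sequence is read in triplets, so its length must divide by 3.
assert element_length % 3 == 0

# The codon count includes the stop codon ...
codons = element_length // 3

# ... so the protein is one residue shorter than the codon count.
protein_length = codons - 1

print(element_length, codons, protein_length)  # 1692 564 563
```

The result agrees with the figures given for this element: 1692 nucleotides, 564 codons, and a translation of 563 residues followed by the stop symbol `*`.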
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of the protein-coding potential of DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the protein-coding potential of the open reading frame appears on frame #3.
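The analysed window can be derived from the element coordinates. The following is a minimal sketch, assuming a sense-strand element and 1-based inclusive coordinates; `genemark_window` is a hypothetical helper, not a Genemark or Génolevures function, and it does not clamp the window at the chromosome end because the chromosome length is not given here.

```python
# Flank sizes stated in the note above.
UPSTREAM, DOWNSTREAM = 1000, 300

# Coordinates of the open reading frame from this entry (1-based, inclusive).
orf_start, orf_end = 948937, 950628


def genemark_window(orf_start, orf_end, upstream=UPSTREAM, downstream=DOWNSTREAM):
    """Return (window_start, window_end) for a sense-strand element.

    Hypothetical helper: the start is clamped to position 1; the end is
    not clamped because the chromosome length is unknown here.
    """
    return max(1, orf_start - upstream), orf_end + downstream


window_start, window_end = genemark_window(orf_start, orf_end)
window_length = window_end - window_start + 1

# 1-based position of the first ORF nucleotide inside the window.
orf_position_in_window = orf_start - window_start + 1

print(window_start, window_end, window_length, orf_position_in_window)
# 947937 950928 2992 1001
```

Under these assumptions the window covers 2992 nucleotides and the start codon begins at position 1001 of the window.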
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/KLTH/KLTH0E10538p