KLTH0E10538g


similar to uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway

Genomic environment map

Element type: CDS
Element length: 1692 nucleotides,
on sense strand of
Klth0E: 948937..950628.
Other names:
KLTH-ORF8812
Coding sequence: 564 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0100 GL3C0100.F2 GL3C0100.N2
Orthologs by synteny: ZYRO0A10032g SAKL0H12364g KLLA0C15059g ERGO0B02486g

Protein KLTH0E10538p  


similar to uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5-aminolevulinate synthase catalyzes the first step in the heme biosynthetic pathway an N-terminal signal sequence is required for localization to the mitochondrial matrix expression is regulated by Hap2p- Hap3p

Protein domain map

Protein length: 563 amino acids
Protein family: GL3C0100

Computed results for KLTH0E10538p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>KLTH0E10538g.nt
CTGGAGGCCGCCGGTATCTCGCGCGACGCCACACGCGCCTTACAGTACTACCACCGCTGC
GCGACGCTGTGCTCCTATCCGCTGGCGCAGTGGCGCCTGGGCCAGTGCTACGAGTTCGGC
CAACTGAATCTACCAGTGGCTGCCAACAAGTCTATAGCCTGGTACGCGAAAGCCGCGATG
GCTCAACCGCGCGGCAACCCTATGGCTATGATGGCTCTGAGCGGCTGGTACCTCACAGGC
GCTACAGGCGTGCTAAAGCCCAACAATCGCGAAGCATACAACTGGGCGCGCAAAGCCTGC
CAGGCTGCGGACGGCAAACTCGCTCGAGCAGAATATGCATTGGCTTTTTACTCCGAGAAT
GGCATCGGCTGTACGCCGAGCCTGGCGGAAGCGCGCGAACACTATGAGCGCGCAGCAGCA
GCAGGCCACGCGAAGGCCCGGGACCGCCTGCGAGCCGGCGACCTGTAGCACACGGGGGTG
GGGTTGACGGACCCGCAATTTTAAAAACAATAATTCACAGCGACCTGTCTCGCGGACGAG
AGGAATCCAGATCCCGAAGAGACGGCCACCGAGACGTCAGCTGGCCTCCAGTCGCTCAGT
AGCCTAGCATATGCGAATAACGTCGACCCACCAAATACAAAGCCACGTGACACACCACTA
CAAGGTCACATGCTCACACAACTCATCACCTAAGCGATGTAACTGGCGTCATGTGATGAT
GTCCGTGCGTGCGACGCGATATGCACCGCTCCTTCCCAAAAAAAACAATCGCCAATCAAA
TTTCACGACGGTTTCTTTTGAAGGAAACCTAGAAATTTTTCAATGGGCCGAATATATAAT
CTAGGGAATGACCGAGATGGTCATCGGAGCTGCGCGTCAGAGAACGAATCTCAAAGACCA
GACAGTTGTCAGCCTGTGAAGGAGTAAGATATCCCCAAACACTGGTTTATACCAACAAGA
TTTGAAGCGAGCTGAAACGAGGGCAGAACTGTATTAAACAATGGATTCTATCGCACGGCA
GTCAGCAAAATTATGCCCGTTCGTGAGCAGGGCCACGTCTTCCTTGAGGAACGTGCAGAC
CTTGCAGCAGGGAAACATTAGCGCAATGGCGCAGCGCTGCCCATTTATGGGCCAGGCGAT
GCAGAAGGCGCACTTGACTACTTCTGCCGCAGCGGGCGCTGCAGCGGCCGCACCCGCGGC
GCAGGAGGCTTCTCGCGCTGCTGGGAAGGATCAGAGCTCAGTGATGGCGGATCGGGCAAC
GCAGGAGGCCAAATTTGACTACGAGGGCCTTTTTGAGCAGGATCTCCAGAAGAAGCGGCT
TGACAAGTCCTACCGGTTCTTCAACAATATCAACCGGCTCGCCAAGGAGTTCCCTATGGC
TCACCGCCAGCAGGAGGAGGACAAGGTCACTGTTTGGTGCTCTAACGACTATCTTGCGCT
GTCCAAAAACCAGCAGGTGGTGGACGTCATGAAGCGCACCCTCGACAAGTACGGGGCAGG
CGCGGGCGGTACCCGTAACATTGCTGGCCATAACCAGCACGCTTTGAGACTGGAGGCGGA
GGTCGCGGCTCTCCACAAGAAGGAGGGCGCACTCGTGTTTTCCTCGTGTTTCGTGGCGAA
CGATGCCGTCATCTCACTTCTCGGCCAAAAGCTCAAGGACCTCGTGATCTTCTCAGACGA
GCTCAACCACGCTTCAATGATCGTGGGCATCAAGCATGCCTCGACCACCAAGCACATCTT
CAAGCACAACGACTTGGAGCATCTTGAGGAGCTCTTGGCGATGTACCCCAAGTCCACCCC
CAAGCTCATTGCCTTTGAATCTGTTTACTCCATGTCCGGGTCTGTCGCCGACATCAACAA
GATTTGCGATCTGGCCGAGAAGTATGGCGCTCTTACCTTCCTAGATGAGGTTCACGCTGT
GGGCCTCTACGGCCCTCATGGTGCTGGTGTTGCCGAGCACTGTGACTTCGAGTCTCATCG
TTCTTCCGGTATTGCCTCACCCGCCCACCAGACAGTTATGGACCGTGTTGATATGATCAC
TGGTACTCTCGGTAAGTCTTTTGGTACTGTCGGTGGCTACGTCGCCGGCTCTCTAAGATT
GATTGACTGGCTTAGATCTTACGCCCCAGGCTTTATTTTTACGACTTCATTACCTCCTTC
TGTCATGGCGGGTGCTGCCGAAGCTATCCGTTACCAACGCTCTCACTTGGACCTAAGACA
GGCCCAGCAGAGACACACTGCTTACGTCAAGCAAGGATTGGCCGACTTGGACATCCCTGT
AATTCCTAACCCATCGCACATTGTTCCTGTGTTGGTCGGAAACCCCGACCTTGCTAGACA
GGCCTCCGAAATCTTGATGGACAAGCACCGCATCTACGTGCAGGCTATTAACTTCCCAAC
AGTCGCTCGGGGAACGGAGAGATTGAGAATCACTCCTACCCCAGGCCACACCGACGATCT
GTCAGACATTCTCTTGGACGCAGTCGATGATGTTTTCAACACTTTGCAGCTACCCCGTGT
CAAGGACTGGAAGAGGCAGGGTGGTTTGTTGGGCGTTGGCCAGTCAGACTACATGGCTGA
GCCCAATTTGTGGACTGAGGACCAGCTCCAGCTGAGCAACGATGATCTACACCCCAACGT
TAGAGAGCCAATTATCGATCAGCTTGAAGTGTCTTCTGGGATTCGGTACTAGAAAAAGAG
ATTATTTATGAGCTTTTCCCGTCCTTTTCTATTTATCCTTTCACGCGCAGCCCGCTTTGA
TACTCTACAACAAAGCTCAAAAAGAGTGCTTAGTATTAGAGCTATATTGAGCCTGAAGGT
CCTCGCGAAACCACTGCTCATTGTGACAATAACAACCAGCATCGATTTGTTTTTTATAAT
CAACACGTTGCAGCCTTGTTTATATAAAAATGCCTAAGATATTCTTATTTTAAGGAGTAA
TGCTATAAAGCACATGCCAATATTTCACCGTAGACATTTCGAATATAAAAAT

Coding sequence    

>KLTH0E10538g.cds
ATGGATTCTATCGCACGGCAGTCAGCAAAATTATGCCCGTTCGTGAGCAGGGCCACGTCT
TCCTTGAGGAACGTGCAGACCTTGCAGCAGGGAAACATTAGCGCAATGGCGCAGCGCTGC
CCATTTATGGGCCAGGCGATGCAGAAGGCGCACTTGACTACTTCTGCCGCAGCGGGCGCT
GCAGCGGCCGCACCCGCGGCGCAGGAGGCTTCTCGCGCTGCTGGGAAGGATCAGAGCTCA
GTGATGGCGGATCGGGCAACGCAGGAGGCCAAATTTGACTACGAGGGCCTTTTTGAGCAG
GATCTCCAGAAGAAGCGGCTTGACAAGTCCTACCGGTTCTTCAACAATATCAACCGGCTC
GCCAAGGAGTTCCCTATGGCTCACCGCCAGCAGGAGGAGGACAAGGTCACTGTTTGGTGC
TCTAACGACTATCTTGCGCTGTCCAAAAACCAGCAGGTGGTGGACGTCATGAAGCGCACC
CTCGACAAGTACGGGGCAGGCGCGGGCGGTACCCGTAACATTGCTGGCCATAACCAGCAC
GCTTTGAGACTGGAGGCGGAGGTCGCGGCTCTCCACAAGAAGGAGGGCGCACTCGTGTTT
TCCTCGTGTTTCGTGGCGAACGATGCCGTCATCTCACTTCTCGGCCAAAAGCTCAAGGAC
CTCGTGATCTTCTCAGACGAGCTCAACCACGCTTCAATGATCGTGGGCATCAAGCATGCC
TCGACCACCAAGCACATCTTCAAGCACAACGACTTGGAGCATCTTGAGGAGCTCTTGGCG
ATGTACCCCAAGTCCACCCCCAAGCTCATTGCCTTTGAATCTGTTTACTCCATGTCCGGG
TCTGTCGCCGACATCAACAAGATTTGCGATCTGGCCGAGAAGTATGGCGCTCTTACCTTC
CTAGATGAGGTTCACGCTGTGGGCCTCTACGGCCCTCATGGTGCTGGTGTTGCCGAGCAC
TGTGACTTCGAGTCTCATCGTTCTTCCGGTATTGCCTCACCCGCCCACCAGACAGTTATG
GACCGTGTTGATATGATCACTGGTACTCTCGGTAAGTCTTTTGGTACTGTCGGTGGCTAC
GTCGCCGGCTCTCTAAGATTGATTGACTGGCTTAGATCTTACGCCCCAGGCTTTATTTTT
ACGACTTCATTACCTCCTTCTGTCATGGCGGGTGCTGCCGAAGCTATCCGTTACCAACGC
TCTCACTTGGACCTAAGACAGGCCCAGCAGAGACACACTGCTTACGTCAAGCAAGGATTG
GCCGACTTGGACATCCCTGTAATTCCTAACCCATCGCACATTGTTCCTGTGTTGGTCGGA
AACCCCGACCTTGCTAGACAGGCCTCCGAAATCTTGATGGACAAGCACCGCATCTACGTG
CAGGCTATTAACTTCCCAACAGTCGCTCGGGGAACGGAGAGATTGAGAATCACTCCTACC
CCAGGCCACACCGACGATCTGTCAGACATTCTCTTGGACGCAGTCGATGATGTTTTCAAC
ACTTTGCAGCTACCCCGTGTCAAGGACTGGAAGAGGCAGGGTGGTTTGTTGGGCGTTGGC
CAGTCAGACTACATGGCTGAGCCCAATTTGTGGACTGAGGACCAGCTCCAGCTGAGCAAC
GATGATCTACACCCCAACGTTAGAGAGCCAATTATCGATCAGCTTGAAGTGTCTTCTGGG
ATTCGGTACTAG

Predicted translation product    

>KLTH0E10538g.aa
MDSIARQSAKLCPFVSRATSSLRNVQTLQQGNISAMAQRCPFMGQAMQKAHLTTSAAAGA
AAAAPAAQEASRAAGKDQSSVMADRATQEAKFDYEGLFEQDLQKKRLDKSYRFFNNINRL
AKEFPMAHRQQEEDKVTVWCSNDYLALSKNQQVVDVMKRTLDKYGAGAGGTRNIAGHNQH
ALRLEAEVAALHKKEGALVFSSCFVANDAVISLLGQKLKDLVIFSDELNHASMIVGIKHA
STTKHIFKHNDLEHLEELLAMYPKSTPKLIAFESVYSMSGSVADINKICDLAEKYGALTF
LDEVHAVGLYGPHGAGVAEHCDFESHRSSGIASPAHQTVMDRVDMITGTLGKSFGTVGGY
VAGSLRLIDWLRSYAPGFIFTTSLPPSVMAGAAEAIRYQRSHLDLRQAQQRHTAYVKQGL
ADLDIPVIPNPSHIVPVLVGNPDLARQASEILMDKHRIYVQAINFPTVARGTERLRITPT
PGHTDDLSDILLDAVDDVFNTLQLPRVKDWKRQGGLLGVGQSDYMAEPNLWTEDQLQLSN
DDLHPNVREPIIDQLEVSSGIRY*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites