SAKL0H12364g
similar to uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway
Element type: CDS
Element length: 1665 nucleotides, on anti-sense strand of Sakl0H: complement(1053913..1055577).
Other names: SAKL-ORF1434
Coding sequence: 555 codons.
Homologs and Orthologs
Homologs in protein families: GL3C0100 GL3C0100.F2 GL3C0100.N2
Orthologs by synteny: ZYRO0A10032g KLTH0E10538g KLLA0C15059g ERGO0B02486g
Protein SAKL0H12364p 
highly similar to gnl|GLV|KLLA0C15059g Kluyveromyces lactis KLLA0C15059g and similar to YDR232W uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1, 5-aminolevulinate synthase; catalyzes the first step in the heme biosynthetic pathway; an N-terminal signal sequence is required for localization to the mitochondrial matrix; expression is regulated by Hap2p-Hap3p
Protein domain map
Sequence data 
>SAKL0H12364g.nt CTACGAATACCAGCGAGGTACAGACCCTAGCTTAACGAAAGAGATGTGCTTACAGAAAGC AATGTATTATTACGAAAGAGGTGCTGATAAATGTCATGACGTTTCTTGTATGTATAAATT GGGTATGTTTTACCTACAAGGTATCGCAACTCCACATGACCCTAAAAATGCTATTGAATG GTTCCTAAGGGCTTCGAATGAGGATCCTCAGTATAAAATTAAGAAAGAAGATATATCACC ACAGGCGTTGTACGAATTAGGGAAGATCTATGAGTTTGACTCGTTATCCAACGAGTTGAG AGAGCTGTTACAAAGATCGGGAATTACGCGATCCTCAGAAAAGGCGTTGTACTATTATCA TCGTTGTGCTACCAAATGCTCATATCCATTGGCACAATGGCGACTAGGTCATTGTTACGA GTTTGGTGAGCTAAGTCTACCCATTATTGCCAGCAAATCTATTGCGTGGTACGCCAAAGC TGCAAGGGCCAAACCTAAAGGAAACCCGATGGCGATGATGGCTTTGAGTGGATGGTATCT TACAGGTGCTGCTGGTGTTCTACAGCCGAACGACCAAGAGGCATTCAGCTGGGCTTTAAG GAGCTGCCAGTCCACAGAAGGTAAGTTTGCCCGCGCAGAGTATGCGTTAGCTTTCTTCTA CGAGCGCGGTATTGGCTGCACAAAGGATCCAAGCAAGGCTCTGGAACATTATAAATGTGC TGCTCAAATGGGGCATCCAAAGGCTCAAGACAGGTATCGCGAACTGAATAGCTCGTAAGT AACACCTAACATTAGCTTTAACGTCGTTCTTATTGGCTCAGGCAACTGTCCACGTGATCA CATACTTGATCACACCTAAAAATTTTTCAGAAAATTTCTTTTTTCCTATTGGATGGGAAG AAAAAAATAAATATATATATGTATGTGAGTAGTGGAGAGAAGACTCACCTCCCCTCTTTA CAGGAATAGAACCGTCGAAAAATAGACAGACGAAGAAGGGATGGAATCTATTGCACGCCA ATCTGCTAAGTTGTGTCCTTTTGTTCACTCTGCTGCGTCATCTTTGCAAAGCGTCAAGGC ATTGCAGCACGCTAATTTGCCTGCCATGGCTAAGAAATGCCCTTTCATGGGTAAAGCCAT GCAAACTGCTAGCTATGCAACCTCCACCTCTACCGCTCCGGTTGCTCAGCCTGTTTCATC CACCAAGGAATCTGCTTTTGTTGATCATGCTACCCAAGAGTCATCTTTTGATTACGATGG GATGTTTGAACATGAATTAAATAAAAAAAGAGCAGATAAATCTTACAGATTTTTTAACAA CATTAATCGTTTGGCTAAGGAATTTCCGCTAGCACATCGTCAGTTAGAGAATGATAAGGT TACTGTCTGGTGCTCTAACGATTACCTGGTTTTATCTAAGAATCAGCAGGTGGTTGATGT TATGAAAAAGACATTGGACAAGTATGGCGCTGGTGCTGGTGGTACCAGAAATATTGCTGG CCACAACAAGCACACCATGAATCTGGAAGCTGAGATTGCTGCACTACACAAAAAGGAAGG TGCGCTGGTTTTTTCTTCCTGTTTTGTCGCTAACGATGCTGTCATTTCTCTTTTAGGCCA GAAAATCAAGGACTTGGTCATTTTTTCTGATGAGTTGAACCATGCCTCTATGATCGTTGG TATCAAACATGCCTCCACCACCAAACATATCTTCAAGCACAACAACTTGGAACAATTGGA AGAGATGCTGGCTATGTACCCAAAATCTACTCCAAAATTGATCGCTTTTGAGTCTGTCTA CTCCATGTCCGGTTCTGTTGCCGATATTGAAAAGATTTGTGACTTGGCCGAGAAGTATGG TGCTTTGACCTTCTTAGACGAGGTTCACGCAGTTGGCCTGTATGGTCCTCATGGTGCAGG 
TGTTGCGGAACACTGTGACTTCGAAGACCATCGTCAAGCTGGTATTGCTTCTCCAAACAC TCGGACAGTTATGGATCGTGTAGATATGATAACCGGTACCCTAGGTAAATCTTTCGGTAC GGTTGGTGGTTATGTTGCCGCTTCTTTAAACTTGATCGACTGGCTTAGATCATACTCTCC AGGTTTCATCTTTACTACTTCTTTGCCTCCAGCTGTTATGGCCGGCAGTGCTGAAGCTAT TAGATACCAACGTTCTCATTTGAACTTGAGACAGGATCAACAAAAACATACTGCATATGT CAAGAATGGTTTACATGATCTTGGCATTCCTGTCATACCAAATCCATCTCACATTGTCCC CGTGTTGATTGGTAATCCAGACTTAGCTAAGCAAGCCTCTGATATTTTGATGGATAAACA TCGTATTTATGTTCAGGCTATTAACTTCCCAACGGTAGCAAGAGGCACTGAAAGATTGAG AATCACTCCAACACCGGGACACACTAACGATTTGAGTGATATTCTGTTGGAAGCCGTTGA TGATGTTTTTAACACTTTACAATTACCAAGAGTTAAGGATTGGGAAATGCAAGGTGGTTT GTTAGGAGTTGGTCAACCTGATTATGTCCCTGAACCAAACTTGTGGACCGAAGAACAATT GTCTTTGACCAACGAGGATCTACATCCTAACGTTAAAGAGCCAATTATTGACCAACTAGA GGTTTCTTCTGGTATTAAGTGGTAATCACACGCAGGTTCTTGATAAGTCTATTTATTCGT GATACACATGTCTTTTTGCTCTCTCCCTGTAATATAATATAATGAATTGGAACAGAGAAT TTTTGAAAAAGAAAAAGAAGAAAAAAAAAATGGAAAACACACACAAAAAATGATTTTTTT AAAAAAAAAAAATGAATTTCACTGCCGTAGAGCAACTGTTATGCGAGTGCGGTTTTTATT TTTAAACCTATCCTGGAGGGAGGGTTTTTTTTTTTTCCTTCACTCAAATGTTTAAGTTTG CACGAAGTAAATAATATTCGGAATG
>SAKL0H12364g.cds ATGGAATCTATTGCACGCCAATCTGCTAAGTTGTGTCCTTTTGTTCACTCTGCTGCGTCA TCTTTGCAAAGCGTCAAGGCATTGCAGCACGCTAATTTGCCTGCCATGGCTAAGAAATGC CCTTTCATGGGTAAAGCCATGCAAACTGCTAGCTATGCAACCTCCACCTCTACCGCTCCG GTTGCTCAGCCTGTTTCATCCACCAAGGAATCTGCTTTTGTTGATCATGCTACCCAAGAG TCATCTTTTGATTACGATGGGATGTTTGAACATGAATTAAATAAAAAAAGAGCAGATAAA TCTTACAGATTTTTTAACAACATTAATCGTTTGGCTAAGGAATTTCCGCTAGCACATCGT CAGTTAGAGAATGATAAGGTTACTGTCTGGTGCTCTAACGATTACCTGGTTTTATCTAAG AATCAGCAGGTGGTTGATGTTATGAAAAAGACATTGGACAAGTATGGCGCTGGTGCTGGT GGTACCAGAAATATTGCTGGCCACAACAAGCACACCATGAATCTGGAAGCTGAGATTGCT GCACTACACAAAAAGGAAGGTGCGCTGGTTTTTTCTTCCTGTTTTGTCGCTAACGATGCT GTCATTTCTCTTTTAGGCCAGAAAATCAAGGACTTGGTCATTTTTTCTGATGAGTTGAAC CATGCCTCTATGATCGTTGGTATCAAACATGCCTCCACCACCAAACATATCTTCAAGCAC AACAACTTGGAACAATTGGAAGAGATGCTGGCTATGTACCCAAAATCTACTCCAAAATTG ATCGCTTTTGAGTCTGTCTACTCCATGTCCGGTTCTGTTGCCGATATTGAAAAGATTTGT GACTTGGCCGAGAAGTATGGTGCTTTGACCTTCTTAGACGAGGTTCACGCAGTTGGCCTG TATGGTCCTCATGGTGCAGGTGTTGCGGAACACTGTGACTTCGAAGACCATCGTCAAGCT GGTATTGCTTCTCCAAACACTCGGACAGTTATGGATCGTGTAGATATGATAACCGGTACC CTAGGTAAATCTTTCGGTACGGTTGGTGGTTATGTTGCCGCTTCTTTAAACTTGATCGAC TGGCTTAGATCATACTCTCCAGGTTTCATCTTTACTACTTCTTTGCCTCCAGCTGTTATG GCCGGCAGTGCTGAAGCTATTAGATACCAACGTTCTCATTTGAACTTGAGACAGGATCAA CAAAAACATACTGCATATGTCAAGAATGGTTTACATGATCTTGGCATTCCTGTCATACCA AATCCATCTCACATTGTCCCCGTGTTGATTGGTAATCCAGACTTAGCTAAGCAAGCCTCT GATATTTTGATGGATAAACATCGTATTTATGTTCAGGCTATTAACTTCCCAACGGTAGCA AGAGGCACTGAAAGATTGAGAATCACTCCAACACCGGGACACACTAACGATTTGAGTGAT ATTCTGTTGGAAGCCGTTGATGATGTTTTTAACACTTTACAATTACCAAGAGTTAAGGAT TGGGAAATGCAAGGTGGTTTGTTAGGAGTTGGTCAACCTGATTATGTCCCTGAACCAAAC TTGTGGACCGAAGAACAATTGTCTTTGACCAACGAGGATCTACATCCTAACGTTAAAGAG CCAATTATTGACCAACTAGAGGTTTCTTCTGGTATTAAGTGGTAA
>SAKL0H12364g.aa MESIARQSAKLCPFVHSAASSLQSVKALQHANLPAMAKKCPFMGKAMQTASYATSTSTAP VAQPVSSTKESAFVDHATQESSFDYDGMFEHELNKKRADKSYRFFNNINRLAKEFPLAHR QLENDKVTVWCSNDYLVLSKNQQVVDVMKKTLDKYGAGAGGTRNIAGHNKHTMNLEAEIA ALHKKEGALVFSSCFVANDAVISLLGQKIKDLVIFSDELNHASMIVGIKHASTTKHIFKH NNLEQLEEMLAMYPKSTPKLIAFESVYSMSGSVADIEKICDLAEKYGALTFLDEVHAVGL YGPHGAGVAEHCDFEDHRQAGIASPNTRTVMDRVDMITGTLGKSFGTVGGYVAASLNLID WLRSYSPGFIFTTSLPPAVMAGSAEAIRYQRSHLNLRQDQQKHTAYVKNGLHDLGIPVIP NPSHIVPVLIGNPDLAKQASDILMDKHRIYVQAINFPTVARGTERLRITPTPGHTNDLSD ILLEAVDDVFNTLQLPRVKDWEMQGGLLGVGQPDYVPEPNLWTEEQLSLTNEDLHPNVKE PIIDQLEVSSGIKW*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
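The length rule above can be checked against this record's own numbers. The following is a minimal sketch, assuming the `complement(start..end)` coordinates are 1-based and inclusive (which is consistent with the listed element length); the helper name `feature_length` is hypothetical and not part of the database.

```python
import re

def feature_length(location: str) -> int:
    """Length in nucleotides of a simple 'start..end' or
    'complement(start..end)' location (hypothetical helper)."""
    match = re.fullmatch(r"(?:complement\()?(\d+)\.\.(\d+)\)?", location)
    if match is None:
        raise ValueError(f"unsupported location: {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

nucleotides = feature_length("complement(1053913..1055577)")
codons, remainder = divmod(nucleotides, 3)
# The stop codon is counted as a codon but is not translated.
protein_length = codons - 1

print(nucleotides, codons, remainder, protein_length)
```

With the values from this entry, the element spans 1665 nucleotides, which divide evenly into 555 codons, giving a 554-residue protein once the stop codon is excluded.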
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarily chosen so that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of the protein-coding potential of DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
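As an illustration of the window described above, the sketch below computes the forward-strand coordinates of a region extending 1000 nucleotides upstream and 300 nucleotides downstream of a feature. It assumes 1-based inclusive coordinates and that, for an anti-sense feature, "upstream" lies at higher chromosome coordinates; the function name `analysis_window` is hypothetical, and the sketch does not clamp to chromosome ends.

```python
def analysis_window(start: int, end: int, strand: str,
                    upstream: int = 1000, downstream: int = 300):
    """Return (window_start, window_end) in forward-strand coordinates.

    Hypothetical helper: 'upstream' and 'downstream' are relative to the
    direction of transcription, so they swap sides on the '-' strand.
    """
    if strand == "+":
        return start - upstream, end + downstream
    if strand == "-":
        return start - downstream, end + upstream
    raise ValueError("strand must be '+' or '-'")

# Values taken from this entry: complement(1053913..1055577).
win_start, win_end = analysis_window(1053913, 1055577, "-")
window_length = win_end - win_start + 1

# Within the window, read in the direction of transcription, the ORF
# starts right after the upstream flank (0-based offset 1000).
orf_offset = 1000
orf_end_offset = orf_offset + (1055577 - 1053913 + 1)

print(win_start, win_end, window_length, orf_offset, orf_end_offset)
```

Under these assumptions the window is 1000 + 1665 + 300 = 2965 nucleotides long, with the open reading frame occupying offsets 1000 to 2664 (0-based, inclusive) of the window.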
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/SAKL/SAKL0H12364p