SAKL0H12364g


similar to uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway

Genomic environment map

Element type: CDS
Element length: 1665 nucleotides,
on anti-sense strand of
Sakl0H: complement(1053913..1055577).
Other names:
SAKL-ORF1434
Coding sequence: 555 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0100 GL3C0100.F2 GL3C0100.N2
Orthologs by synteny: ZYRO0A10032g KLTH0E10538g KLLA0C15059g ERGO0B02486g

Protein SAKL0H12364p  


highly similar to gnl|GLV|KLLA0C15059g Kluyveromyces lactis KLLA0C15059g and similar to YDR232W uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5- aminolevulinate synthase catalyzes the first step in the heme biosynthetic pathway an N-terminal signal sequence is required for localization to the mitochondrial matrix expression is regulated by Hap2p-Hap3p

Protein domain map

Protein length: 554 amino acids
Protein family: GL3C0100

Computed results for SAKL0H12364p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>SAKL0H12364g.nt
CTACGAATACCAGCGAGGTACAGACCCTAGCTTAACGAAAGAGATGTGCTTACAGAAAGC
AATGTATTATTACGAAAGAGGTGCTGATAAATGTCATGACGTTTCTTGTATGTATAAATT
GGGTATGTTTTACCTACAAGGTATCGCAACTCCACATGACCCTAAAAATGCTATTGAATG
GTTCCTAAGGGCTTCGAATGAGGATCCTCAGTATAAAATTAAGAAAGAAGATATATCACC
ACAGGCGTTGTACGAATTAGGGAAGATCTATGAGTTTGACTCGTTATCCAACGAGTTGAG
AGAGCTGTTACAAAGATCGGGAATTACGCGATCCTCAGAAAAGGCGTTGTACTATTATCA
TCGTTGTGCTACCAAATGCTCATATCCATTGGCACAATGGCGACTAGGTCATTGTTACGA
GTTTGGTGAGCTAAGTCTACCCATTATTGCCAGCAAATCTATTGCGTGGTACGCCAAAGC
TGCAAGGGCCAAACCTAAAGGAAACCCGATGGCGATGATGGCTTTGAGTGGATGGTATCT
TACAGGTGCTGCTGGTGTTCTACAGCCGAACGACCAAGAGGCATTCAGCTGGGCTTTAAG
GAGCTGCCAGTCCACAGAAGGTAAGTTTGCCCGCGCAGAGTATGCGTTAGCTTTCTTCTA
CGAGCGCGGTATTGGCTGCACAAAGGATCCAAGCAAGGCTCTGGAACATTATAAATGTGC
TGCTCAAATGGGGCATCCAAAGGCTCAAGACAGGTATCGCGAACTGAATAGCTCGTAAGT
AACACCTAACATTAGCTTTAACGTCGTTCTTATTGGCTCAGGCAACTGTCCACGTGATCA
CATACTTGATCACACCTAAAAATTTTTCAGAAAATTTCTTTTTTCCTATTGGATGGGAAG
AAAAAAATAAATATATATATGTATGTGAGTAGTGGAGAGAAGACTCACCTCCCCTCTTTA
CAGGAATAGAACCGTCGAAAAATAGACAGACGAAGAAGGGATGGAATCTATTGCACGCCA
ATCTGCTAAGTTGTGTCCTTTTGTTCACTCTGCTGCGTCATCTTTGCAAAGCGTCAAGGC
ATTGCAGCACGCTAATTTGCCTGCCATGGCTAAGAAATGCCCTTTCATGGGTAAAGCCAT
GCAAACTGCTAGCTATGCAACCTCCACCTCTACCGCTCCGGTTGCTCAGCCTGTTTCATC
CACCAAGGAATCTGCTTTTGTTGATCATGCTACCCAAGAGTCATCTTTTGATTACGATGG
GATGTTTGAACATGAATTAAATAAAAAAAGAGCAGATAAATCTTACAGATTTTTTAACAA
CATTAATCGTTTGGCTAAGGAATTTCCGCTAGCACATCGTCAGTTAGAGAATGATAAGGT
TACTGTCTGGTGCTCTAACGATTACCTGGTTTTATCTAAGAATCAGCAGGTGGTTGATGT
TATGAAAAAGACATTGGACAAGTATGGCGCTGGTGCTGGTGGTACCAGAAATATTGCTGG
CCACAACAAGCACACCATGAATCTGGAAGCTGAGATTGCTGCACTACACAAAAAGGAAGG
TGCGCTGGTTTTTTCTTCCTGTTTTGTCGCTAACGATGCTGTCATTTCTCTTTTAGGCCA
GAAAATCAAGGACTTGGTCATTTTTTCTGATGAGTTGAACCATGCCTCTATGATCGTTGG
TATCAAACATGCCTCCACCACCAAACATATCTTCAAGCACAACAACTTGGAACAATTGGA
AGAGATGCTGGCTATGTACCCAAAATCTACTCCAAAATTGATCGCTTTTGAGTCTGTCTA
CTCCATGTCCGGTTCTGTTGCCGATATTGAAAAGATTTGTGACTTGGCCGAGAAGTATGG
TGCTTTGACCTTCTTAGACGAGGTTCACGCAGTTGGCCTGTATGGTCCTCATGGTGCAGG
TGTTGCGGAACACTGTGACTTCGAAGACCATCGTCAAGCTGGTATTGCTTCTCCAAACAC
TCGGACAGTTATGGATCGTGTAGATATGATAACCGGTACCCTAGGTAAATCTTTCGGTAC
GGTTGGTGGTTATGTTGCCGCTTCTTTAAACTTGATCGACTGGCTTAGATCATACTCTCC
AGGTTTCATCTTTACTACTTCTTTGCCTCCAGCTGTTATGGCCGGCAGTGCTGAAGCTAT
TAGATACCAACGTTCTCATTTGAACTTGAGACAGGATCAACAAAAACATACTGCATATGT
CAAGAATGGTTTACATGATCTTGGCATTCCTGTCATACCAAATCCATCTCACATTGTCCC
CGTGTTGATTGGTAATCCAGACTTAGCTAAGCAAGCCTCTGATATTTTGATGGATAAACA
TCGTATTTATGTTCAGGCTATTAACTTCCCAACGGTAGCAAGAGGCACTGAAAGATTGAG
AATCACTCCAACACCGGGACACACTAACGATTTGAGTGATATTCTGTTGGAAGCCGTTGA
TGATGTTTTTAACACTTTACAATTACCAAGAGTTAAGGATTGGGAAATGCAAGGTGGTTT
GTTAGGAGTTGGTCAACCTGATTATGTCCCTGAACCAAACTTGTGGACCGAAGAACAATT
GTCTTTGACCAACGAGGATCTACATCCTAACGTTAAAGAGCCAATTATTGACCAACTAGA
GGTTTCTTCTGGTATTAAGTGGTAATCACACGCAGGTTCTTGATAAGTCTATTTATTCGT
GATACACATGTCTTTTTGCTCTCTCCCTGTAATATAATATAATGAATTGGAACAGAGAAT
TTTTGAAAAAGAAAAAGAAGAAAAAAAAAATGGAAAACACACACAAAAAATGATTTTTTT
AAAAAAAAAAAATGAATTTCACTGCCGTAGAGCAACTGTTATGCGAGTGCGGTTTTTATT
TTTAAACCTATCCTGGAGGGAGGGTTTTTTTTTTTTCCTTCACTCAAATGTTTAAGTTTG
CACGAAGTAAATAATATTCGGAATG

Coding sequence    

>SAKL0H12364g.cds
ATGGAATCTATTGCACGCCAATCTGCTAAGTTGTGTCCTTTTGTTCACTCTGCTGCGTCA
TCTTTGCAAAGCGTCAAGGCATTGCAGCACGCTAATTTGCCTGCCATGGCTAAGAAATGC
CCTTTCATGGGTAAAGCCATGCAAACTGCTAGCTATGCAACCTCCACCTCTACCGCTCCG
GTTGCTCAGCCTGTTTCATCCACCAAGGAATCTGCTTTTGTTGATCATGCTACCCAAGAG
TCATCTTTTGATTACGATGGGATGTTTGAACATGAATTAAATAAAAAAAGAGCAGATAAA
TCTTACAGATTTTTTAACAACATTAATCGTTTGGCTAAGGAATTTCCGCTAGCACATCGT
CAGTTAGAGAATGATAAGGTTACTGTCTGGTGCTCTAACGATTACCTGGTTTTATCTAAG
AATCAGCAGGTGGTTGATGTTATGAAAAAGACATTGGACAAGTATGGCGCTGGTGCTGGT
GGTACCAGAAATATTGCTGGCCACAACAAGCACACCATGAATCTGGAAGCTGAGATTGCT
GCACTACACAAAAAGGAAGGTGCGCTGGTTTTTTCTTCCTGTTTTGTCGCTAACGATGCT
GTCATTTCTCTTTTAGGCCAGAAAATCAAGGACTTGGTCATTTTTTCTGATGAGTTGAAC
CATGCCTCTATGATCGTTGGTATCAAACATGCCTCCACCACCAAACATATCTTCAAGCAC
AACAACTTGGAACAATTGGAAGAGATGCTGGCTATGTACCCAAAATCTACTCCAAAATTG
ATCGCTTTTGAGTCTGTCTACTCCATGTCCGGTTCTGTTGCCGATATTGAAAAGATTTGT
GACTTGGCCGAGAAGTATGGTGCTTTGACCTTCTTAGACGAGGTTCACGCAGTTGGCCTG
TATGGTCCTCATGGTGCAGGTGTTGCGGAACACTGTGACTTCGAAGACCATCGTCAAGCT
GGTATTGCTTCTCCAAACACTCGGACAGTTATGGATCGTGTAGATATGATAACCGGTACC
CTAGGTAAATCTTTCGGTACGGTTGGTGGTTATGTTGCCGCTTCTTTAAACTTGATCGAC
TGGCTTAGATCATACTCTCCAGGTTTCATCTTTACTACTTCTTTGCCTCCAGCTGTTATG
GCCGGCAGTGCTGAAGCTATTAGATACCAACGTTCTCATTTGAACTTGAGACAGGATCAA
CAAAAACATACTGCATATGTCAAGAATGGTTTACATGATCTTGGCATTCCTGTCATACCA
AATCCATCTCACATTGTCCCCGTGTTGATTGGTAATCCAGACTTAGCTAAGCAAGCCTCT
GATATTTTGATGGATAAACATCGTATTTATGTTCAGGCTATTAACTTCCCAACGGTAGCA
AGAGGCACTGAAAGATTGAGAATCACTCCAACACCGGGACACACTAACGATTTGAGTGAT
ATTCTGTTGGAAGCCGTTGATGATGTTTTTAACACTTTACAATTACCAAGAGTTAAGGAT
TGGGAAATGCAAGGTGGTTTGTTAGGAGTTGGTCAACCTGATTATGTCCCTGAACCAAAC
TTGTGGACCGAAGAACAATTGTCTTTGACCAACGAGGATCTACATCCTAACGTTAAAGAG
CCAATTATTGACCAACTAGAGGTTTCTTCTGGTATTAAGTGGTAA

Predicted translation product    

>SAKL0H12364g.aa
MESIARQSAKLCPFVHSAASSLQSVKALQHANLPAMAKKCPFMGKAMQTASYATSTSTAP
VAQPVSSTKESAFVDHATQESSFDYDGMFEHELNKKRADKSYRFFNNINRLAKEFPLAHR
QLENDKVTVWCSNDYLVLSKNQQVVDVMKKTLDKYGAGAGGTRNIAGHNKHTMNLEAEIA
ALHKKEGALVFSSCFVANDAVISLLGQKIKDLVIFSDELNHASMIVGIKHASTTKHIFKH
NNLEQLEEMLAMYPKSTPKLIAFESVYSMSGSVADIEKICDLAEKYGALTFLDEVHAVGL
YGPHGAGVAEHCDFEDHRQAGIASPNTRTVMDRVDMITGTLGKSFGTVGGYVAASLNLID
WLRSYSPGFIFTTSLPPAVMAGSAEAIRYQRSHLNLRQDQQKHTAYVKNGLHDLGIPVIP
NPSHIVPVLIGNPDLAKQASDILMDKHRIYVQAINFPTVARGTERLRITPTPGHTNDLSD
ILLEAVDDVFNTLQLPRVKDWEMQGGLLGVGQPDYVPEPNLWTEEQLSLTNEDLHPNVKE
PIIDQLEVSSGIKW*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites