ZYRO0A10032g
similar to uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway
Element type: CDS
Element length: 1674 nucleotides,
on sense strand of
Zyro0A: 809241..810914.
Other names:
ZYRO-ORF8840
Coding sequence: 558 codons.
Element length: 1674 nucleotides,
on sense strand of
Zyro0A: 809241..810914.
Other names:
ZYRO-ORF8840
Coding sequence: 558 codons.
Homologs and Orthologs
Homologs in protein families: GL3C0100 GL3C0100.F2 GL3C0100.N2Orthologs by synteny: SAKL0H12364g KLTH0E10538g KLLA0C15059g ERGO0B02486g
Protein ZYRO0A10032p 
highly similar to gnl|GLV|KLLA0C15059g Kluyveromyces lactis KLLA0C15059g and similar to YDR232W uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5- aminolevulinate synthase catalyzes the first step in the heme biosynthetic pathway an N-terminal signal sequence is required for localization to the mitochondrial matrix expression is regulated by Hap2p-Hap3p
Protein domain map
Sequence data 
>ZYRO0A10032g.nt TTTACGGCTTATCGGGCCAACAGGATCCATTAGAGGCGCTACAGTGGTTTAAAAGGGCCA GCGAAGGTGGTAAGTCTACGCAGGCATTGTACGAGCTGGGCAAAATATACGAATTCACCT CTTTACCACCTCAAGTACAATCGATTTTAACCAGGTACAATATCAATAGAGATCCCGCTA CAGCACTCAAGTATTTCCACAAATGTGCCATGGAGTACGATTATCCGCTGGCACAATGGA AATTGGGGTACTGCTACGAGTTTGGTGAATTACAACTACCTGTAATTGCGAAGAAATCTA TAGCGTGGTACGCAAAGGCGGCCCTCGCGCAGCCAAAGGGTAACCCAATGGCAATGTTAG CACTAAGCGGATGGTATTTGACAGGTGCGCCAGACGTGCTAAAACCCAACGATAAAGAGG CATTCAAATGGGCTTCAAGAGCTTGCGAAGCAAGTGATGGTAAATTAAGTCGAGCGGAAT ACGCATTGGCTTGTTATCATGAGAATGGAATTGGATGTCATAGAAATTTAACAGAGGCAA GAATACATTTCGAGACTGCTACAAGACTGGGTCACATTAAGGCAAGAGATAGACTTGAAA AGGGATTCTAAGCGCTTAATGAGTAACCTTCGAGGTATTGATTGGACAATAGTCACATGG ATTAGGATCATATGACCGTTATTTTTTCACTTTTCTTTTTTCCGAGCTTTTTTTTCTTTT TCCACGTTGTCATCTTTTCTTTTTGTTTTGTCATCGGAAGTTACCTGGGAAAAAAATAAG AAGAGTTCCAAACTTTTCAATGACCTTCAAGTGTATTAAAGGACAAAGGAATAAGGAGTA TTTCTGGAGGAAGGGCGATCTGATCTAGACTCATTCAGAATCATTTGGAGTGCCTTCATT TTTCGGTTTTCAATTATCTTACCGTCTGAACATTTCTGGGCTTATCCAGCAAATTTTGGT TCTCTTGATTATTTATTAGCAAACTTCCATATCAAAGAAAATGGAGTCTATTGCACGTCA ATCGGTCAAGATGTGCCCATTCATGCAGCGTGCTGGTTCTGCGAAAGGTTTGAAAGCTTT GCATAATTCTAATTTGCCAGCAGCAGCTCGTCAATGTCCTATTGTAGGTCATGCTATGAA AAGAACTTATGCTACTGCTACCGGCGCACCCAGAGAAGCATCAGCTAGTAATGCTAAAGC TGCTAGAAATGCAAATGAAGTAGTTGCTAATCACGCGACTCAGGAATCCACTTTTGACTA CAACGGTCTTTTCGAAAGTGAAATTGACAAGAAAAGAGTTGACAAATCTTATCGTTTTTT CAACAATATCAACAGATTGGCTAAAGAATTTCCTAAGGCCCATCGTGACGTGGAAGAGGA TAAGGTTACCGTATGGTGCTCCAATGACTATTTGGCATTATCCAAAAATCAACAAGTCGT TGATGCTATGAAAAAGACTTTGGACAGGTATGGTGCCGGTGCCGGTGGTACCAGAAACAT TGCTGGTCATAACAGACACGCACTAAGATTGGAAGCCGAAATTGCGGCTTTGCACAAGAA GGAGGGTGCCCTTGTCTTTTCATCTTGCTATGTGGCTAACGATGCGGTTCTCTCATTGTT GGGACAAAAGGTCAAGGATTTGGTTATCTTTAGTGATGAACTAAACCACGCTTCTATGAT TATGGGTATGAAGCATGCTCAAACAACAAAACACATCTTCAGACATAACAACTACGCCCA TTTGGAAGAATTATTGCAGATGTACCCTAAATCAACTCCCAAGTTAATTGCTTTTGAATC TGTCTATTCAATGGCAGGTTCCGTTGCTGATATCGATAAAATCTGTAGTTTAGCTGAGAA ATACGGTGCACTAACATTTTTGGATGAAGTTCACGCCGTTGGTCTCTACGGCCCTCATGG TGCAGGTGTTGCCGAACATTGCGATTTTGAACCTCATCGTGTTACTGGTGTTGAGAGTGT CCCAGGTAACTCTACTGTAATGGACCGTGTTGATATGATTACCGGAACTTTAGGTAAATC ATTTGGTACTGTTGGAGGATACGTGGCAGCCTCTCAGAGATTAGTTGACTGGTTAAGATC ATATGCCCCAGGCTTTATCTTCACCACCTCGTTACCACCTTCAGTGATGGCAGGTGCTAC CGAAGCTATTAGATACCAACGTTCTCATTTGGAACTAAGACAATTACAACAATTACATAC CGGTTACGTGAAGCAAGGGATGAAGGAGCTAGGAATTCCAGTTATACCCAATCCTTCTCA CATCGTGCCTGTTTTAGTTGGTAATCCTGATCTAGCGAAGCAAGCTTCGGATATGTTGAT GGATAAACACAAAATTTATGTTCAAGCAATTAACTTCCCTACAGTTTCAAGAGGTACTGA AAGGTTACGTATTACTCCAACTCCTGGTCACACTGACGATTTGTCAGAGATTCTTTTGAA TGCAGTTGATGATGTCTTTAAGGAATTGCAATTGCCTCGTGTTAGAGATTGGGAGTCTCA AGGTGGTATTCTTGGTGTCGGTGAGCCTGGATTCGTCGAACAATCAAATCTTTGGACTGA GGAACAATTAAGTTTGACTAACGACCATTTGAATCCAAATGTCCTTGACCCAATCTTAGA TCATTTGGAGGTATCGAGTGGTATTAAAGTTTAAATTTACATTCAGTATGAGTACGAGCT GTAAATACTCTTGAAAATATGATATGGTATATATATATATATATTTATTCACTCTACATG AGAAGAGAAGCAAATGAAAAGATGTTTTTATTTATTTTCAAACAGTAAGTAGTAGAGCAG AATAAGAGGTTGCAGTATAAATGTATTATACTGAATTATACTAGCATTAGTATTTACTTG ATAGTTATAATATAAAGATGACAACGGTTCCTTAATACCGTGCTTCAGTCAATGATAACA AAAAAAAAAAAAAGGGGTGGAGGAGGTTAATTTG
>ZYRO0A10032g.cds ATGGAGTCTATTGCACGTCAATCGGTCAAGATGTGCCCATTCATGCAGCGTGCTGGTTCT GCGAAAGGTTTGAAAGCTTTGCATAATTCTAATTTGCCAGCAGCAGCTCGTCAATGTCCT ATTGTAGGTCATGCTATGAAAAGAACTTATGCTACTGCTACCGGCGCACCCAGAGAAGCA TCAGCTAGTAATGCTAAAGCTGCTAGAAATGCAAATGAAGTAGTTGCTAATCACGCGACT CAGGAATCCACTTTTGACTACAACGGTCTTTTCGAAAGTGAAATTGACAAGAAAAGAGTT GACAAATCTTATCGTTTTTTCAACAATATCAACAGATTGGCTAAAGAATTTCCTAAGGCC CATCGTGACGTGGAAGAGGATAAGGTTACCGTATGGTGCTCCAATGACTATTTGGCATTA TCCAAAAATCAACAAGTCGTTGATGCTATGAAAAAGACTTTGGACAGGTATGGTGCCGGT GCCGGTGGTACCAGAAACATTGCTGGTCATAACAGACACGCACTAAGATTGGAAGCCGAA ATTGCGGCTTTGCACAAGAAGGAGGGTGCCCTTGTCTTTTCATCTTGCTATGTGGCTAAC GATGCGGTTCTCTCATTGTTGGGACAAAAGGTCAAGGATTTGGTTATCTTTAGTGATGAA CTAAACCACGCTTCTATGATTATGGGTATGAAGCATGCTCAAACAACAAAACACATCTTC AGACATAACAACTACGCCCATTTGGAAGAATTATTGCAGATGTACCCTAAATCAACTCCC AAGTTAATTGCTTTTGAATCTGTCTATTCAATGGCAGGTTCCGTTGCTGATATCGATAAA ATCTGTAGTTTAGCTGAGAAATACGGTGCACTAACATTTTTGGATGAAGTTCACGCCGTT GGTCTCTACGGCCCTCATGGTGCAGGTGTTGCCGAACATTGCGATTTTGAACCTCATCGT GTTACTGGTGTTGAGAGTGTCCCAGGTAACTCTACTGTAATGGACCGTGTTGATATGATT ACCGGAACTTTAGGTAAATCATTTGGTACTGTTGGAGGATACGTGGCAGCCTCTCAGAGA TTAGTTGACTGGTTAAGATCATATGCCCCAGGCTTTATCTTCACCACCTCGTTACCACCT TCAGTGATGGCAGGTGCTACCGAAGCTATTAGATACCAACGTTCTCATTTGGAACTAAGA CAATTACAACAATTACATACCGGTTACGTGAAGCAAGGGATGAAGGAGCTAGGAATTCCA GTTATACCCAATCCTTCTCACATCGTGCCTGTTTTAGTTGGTAATCCTGATCTAGCGAAG CAAGCTTCGGATATGTTGATGGATAAACACAAAATTTATGTTCAAGCAATTAACTTCCCT ACAGTTTCAAGAGGTACTGAAAGGTTACGTATTACTCCAACTCCTGGTCACACTGACGAT TTGTCAGAGATTCTTTTGAATGCAGTTGATGATGTCTTTAAGGAATTGCAATTGCCTCGT GTTAGAGATTGGGAGTCTCAAGGTGGTATTCTTGGTGTCGGTGAGCCTGGATTCGTCGAA CAATCAAATCTTTGGACTGAGGAACAATTAAGTTTGACTAACGACCATTTGAATCCAAAT GTCCTTGACCCAATCTTAGATCATTTGGAGGTATCGAGTGGTATTAAAGTTTAA
>ZYRO0A10032g.aa MESIARQSVKMCPFMQRAGSAKGLKALHNSNLPAAARQCPIVGHAMKRTYATATGAPREA SASNAKAARNANEVVANHATQESTFDYNGLFESEIDKKRVDKSYRFFNNINRLAKEFPKA HRDVEEDKVTVWCSNDYLALSKNQQVVDAMKKTLDRYGAGAGGTRNIAGHNRHALRLEAE IAALHKKEGALVFSSCYVANDAVLSLLGQKVKDLVIFSDELNHASMIMGMKHAQTTKHIF RHNNYAHLEELLQMYPKSTPKLIAFESVYSMAGSVADIDKICSLAEKYGALTFLDEVHAV GLYGPHGAGVAEHCDFEPHRVTGVESVPGNSTVMDRVDMITGTLGKSFGTVGGYVAASQR LVDWLRSYAPGFIFTTSLPPSVMAGATEAIRYQRSHLELRQLQQLHTGYVKQGMKELGIP VIPNPSHIVPVLVGNPDLAKQASDMLMDKHKIYVQAINFPTVSRGTERLRITPTPGHTDD LSEILLNAVDDVFKELQLPRVRDWESQGGILGVGEPGFVEQSNLWTEEQLSLTNDHLNPN VLDPILDHLEVSSGIKV*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/ZYRO/ZYRO0A10032p