ZYRO0A10032g


similar to uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway

Genomic environment map

Element type: CDS
Element length: 1674 nucleotides,
on sense strand of
Zyro0A: 809241..810914.
Other names:
ZYRO-ORF8840
Coding sequence: 558 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0100 GL3C0100.F2 GL3C0100.N2
Orthologs by synteny: SAKL0H12364g KLTH0E10538g KLLA0C15059g ERGO0B02486g

Protein ZYRO0A10032p  


highly similar to gnl|GLV|KLLA0C15059g Kluyveromyces lactis KLLA0C15059g and similar to YDR232W uniprot|P09950 Saccharomyces cerevisiae YDR232W HEM1 5- aminolevulinate synthase catalyzes the first step in the heme biosynthetic pathway an N-terminal signal sequence is required for localization to the mitochondrial matrix expression is regulated by Hap2p-Hap3p

Protein domain map

Protein length: 557 amino acids
Protein family: GL3C0100

Computed results for ZYRO0A10032p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>ZYRO0A10032g.nt
TTTACGGCTTATCGGGCCAACAGGATCCATTAGAGGCGCTACAGTGGTTTAAAAGGGCCA
GCGAAGGTGGTAAGTCTACGCAGGCATTGTACGAGCTGGGCAAAATATACGAATTCACCT
CTTTACCACCTCAAGTACAATCGATTTTAACCAGGTACAATATCAATAGAGATCCCGCTA
CAGCACTCAAGTATTTCCACAAATGTGCCATGGAGTACGATTATCCGCTGGCACAATGGA
AATTGGGGTACTGCTACGAGTTTGGTGAATTACAACTACCTGTAATTGCGAAGAAATCTA
TAGCGTGGTACGCAAAGGCGGCCCTCGCGCAGCCAAAGGGTAACCCAATGGCAATGTTAG
CACTAAGCGGATGGTATTTGACAGGTGCGCCAGACGTGCTAAAACCCAACGATAAAGAGG
CATTCAAATGGGCTTCAAGAGCTTGCGAAGCAAGTGATGGTAAATTAAGTCGAGCGGAAT
ACGCATTGGCTTGTTATCATGAGAATGGAATTGGATGTCATAGAAATTTAACAGAGGCAA
GAATACATTTCGAGACTGCTACAAGACTGGGTCACATTAAGGCAAGAGATAGACTTGAAA
AGGGATTCTAAGCGCTTAATGAGTAACCTTCGAGGTATTGATTGGACAATAGTCACATGG
ATTAGGATCATATGACCGTTATTTTTTCACTTTTCTTTTTTCCGAGCTTTTTTTTCTTTT
TCCACGTTGTCATCTTTTCTTTTTGTTTTGTCATCGGAAGTTACCTGGGAAAAAAATAAG
AAGAGTTCCAAACTTTTCAATGACCTTCAAGTGTATTAAAGGACAAAGGAATAAGGAGTA
TTTCTGGAGGAAGGGCGATCTGATCTAGACTCATTCAGAATCATTTGGAGTGCCTTCATT
TTTCGGTTTTCAATTATCTTACCGTCTGAACATTTCTGGGCTTATCCAGCAAATTTTGGT
TCTCTTGATTATTTATTAGCAAACTTCCATATCAAAGAAAATGGAGTCTATTGCACGTCA
ATCGGTCAAGATGTGCCCATTCATGCAGCGTGCTGGTTCTGCGAAAGGTTTGAAAGCTTT
GCATAATTCTAATTTGCCAGCAGCAGCTCGTCAATGTCCTATTGTAGGTCATGCTATGAA
AAGAACTTATGCTACTGCTACCGGCGCACCCAGAGAAGCATCAGCTAGTAATGCTAAAGC
TGCTAGAAATGCAAATGAAGTAGTTGCTAATCACGCGACTCAGGAATCCACTTTTGACTA
CAACGGTCTTTTCGAAAGTGAAATTGACAAGAAAAGAGTTGACAAATCTTATCGTTTTTT
CAACAATATCAACAGATTGGCTAAAGAATTTCCTAAGGCCCATCGTGACGTGGAAGAGGA
TAAGGTTACCGTATGGTGCTCCAATGACTATTTGGCATTATCCAAAAATCAACAAGTCGT
TGATGCTATGAAAAAGACTTTGGACAGGTATGGTGCCGGTGCCGGTGGTACCAGAAACAT
TGCTGGTCATAACAGACACGCACTAAGATTGGAAGCCGAAATTGCGGCTTTGCACAAGAA
GGAGGGTGCCCTTGTCTTTTCATCTTGCTATGTGGCTAACGATGCGGTTCTCTCATTGTT
GGGACAAAAGGTCAAGGATTTGGTTATCTTTAGTGATGAACTAAACCACGCTTCTATGAT
TATGGGTATGAAGCATGCTCAAACAACAAAACACATCTTCAGACATAACAACTACGCCCA
TTTGGAAGAATTATTGCAGATGTACCCTAAATCAACTCCCAAGTTAATTGCTTTTGAATC
TGTCTATTCAATGGCAGGTTCCGTTGCTGATATCGATAAAATCTGTAGTTTAGCTGAGAA
ATACGGTGCACTAACATTTTTGGATGAAGTTCACGCCGTTGGTCTCTACGGCCCTCATGG
TGCAGGTGTTGCCGAACATTGCGATTTTGAACCTCATCGTGTTACTGGTGTTGAGAGTGT
CCCAGGTAACTCTACTGTAATGGACCGTGTTGATATGATTACCGGAACTTTAGGTAAATC
ATTTGGTACTGTTGGAGGATACGTGGCAGCCTCTCAGAGATTAGTTGACTGGTTAAGATC
ATATGCCCCAGGCTTTATCTTCACCACCTCGTTACCACCTTCAGTGATGGCAGGTGCTAC
CGAAGCTATTAGATACCAACGTTCTCATTTGGAACTAAGACAATTACAACAATTACATAC
CGGTTACGTGAAGCAAGGGATGAAGGAGCTAGGAATTCCAGTTATACCCAATCCTTCTCA
CATCGTGCCTGTTTTAGTTGGTAATCCTGATCTAGCGAAGCAAGCTTCGGATATGTTGAT
GGATAAACACAAAATTTATGTTCAAGCAATTAACTTCCCTACAGTTTCAAGAGGTACTGA
AAGGTTACGTATTACTCCAACTCCTGGTCACACTGACGATTTGTCAGAGATTCTTTTGAA
TGCAGTTGATGATGTCTTTAAGGAATTGCAATTGCCTCGTGTTAGAGATTGGGAGTCTCA
AGGTGGTATTCTTGGTGTCGGTGAGCCTGGATTCGTCGAACAATCAAATCTTTGGACTGA
GGAACAATTAAGTTTGACTAACGACCATTTGAATCCAAATGTCCTTGACCCAATCTTAGA
TCATTTGGAGGTATCGAGTGGTATTAAAGTTTAAATTTACATTCAGTATGAGTACGAGCT
GTAAATACTCTTGAAAATATGATATGGTATATATATATATATATTTATTCACTCTACATG
AGAAGAGAAGCAAATGAAAAGATGTTTTTATTTATTTTCAAACAGTAAGTAGTAGAGCAG
AATAAGAGGTTGCAGTATAAATGTATTATACTGAATTATACTAGCATTAGTATTTACTTG
ATAGTTATAATATAAAGATGACAACGGTTCCTTAATACCGTGCTTCAGTCAATGATAACA
AAAAAAAAAAAAAGGGGTGGAGGAGGTTAATTTG

Coding sequence    

>ZYRO0A10032g.cds
ATGGAGTCTATTGCACGTCAATCGGTCAAGATGTGCCCATTCATGCAGCGTGCTGGTTCT
GCGAAAGGTTTGAAAGCTTTGCATAATTCTAATTTGCCAGCAGCAGCTCGTCAATGTCCT
ATTGTAGGTCATGCTATGAAAAGAACTTATGCTACTGCTACCGGCGCACCCAGAGAAGCA
TCAGCTAGTAATGCTAAAGCTGCTAGAAATGCAAATGAAGTAGTTGCTAATCACGCGACT
CAGGAATCCACTTTTGACTACAACGGTCTTTTCGAAAGTGAAATTGACAAGAAAAGAGTT
GACAAATCTTATCGTTTTTTCAACAATATCAACAGATTGGCTAAAGAATTTCCTAAGGCC
CATCGTGACGTGGAAGAGGATAAGGTTACCGTATGGTGCTCCAATGACTATTTGGCATTA
TCCAAAAATCAACAAGTCGTTGATGCTATGAAAAAGACTTTGGACAGGTATGGTGCCGGT
GCCGGTGGTACCAGAAACATTGCTGGTCATAACAGACACGCACTAAGATTGGAAGCCGAA
ATTGCGGCTTTGCACAAGAAGGAGGGTGCCCTTGTCTTTTCATCTTGCTATGTGGCTAAC
GATGCGGTTCTCTCATTGTTGGGACAAAAGGTCAAGGATTTGGTTATCTTTAGTGATGAA
CTAAACCACGCTTCTATGATTATGGGTATGAAGCATGCTCAAACAACAAAACACATCTTC
AGACATAACAACTACGCCCATTTGGAAGAATTATTGCAGATGTACCCTAAATCAACTCCC
AAGTTAATTGCTTTTGAATCTGTCTATTCAATGGCAGGTTCCGTTGCTGATATCGATAAA
ATCTGTAGTTTAGCTGAGAAATACGGTGCACTAACATTTTTGGATGAAGTTCACGCCGTT
GGTCTCTACGGCCCTCATGGTGCAGGTGTTGCCGAACATTGCGATTTTGAACCTCATCGT
GTTACTGGTGTTGAGAGTGTCCCAGGTAACTCTACTGTAATGGACCGTGTTGATATGATT
ACCGGAACTTTAGGTAAATCATTTGGTACTGTTGGAGGATACGTGGCAGCCTCTCAGAGA
TTAGTTGACTGGTTAAGATCATATGCCCCAGGCTTTATCTTCACCACCTCGTTACCACCT
TCAGTGATGGCAGGTGCTACCGAAGCTATTAGATACCAACGTTCTCATTTGGAACTAAGA
CAATTACAACAATTACATACCGGTTACGTGAAGCAAGGGATGAAGGAGCTAGGAATTCCA
GTTATACCCAATCCTTCTCACATCGTGCCTGTTTTAGTTGGTAATCCTGATCTAGCGAAG
CAAGCTTCGGATATGTTGATGGATAAACACAAAATTTATGTTCAAGCAATTAACTTCCCT
ACAGTTTCAAGAGGTACTGAAAGGTTACGTATTACTCCAACTCCTGGTCACACTGACGAT
TTGTCAGAGATTCTTTTGAATGCAGTTGATGATGTCTTTAAGGAATTGCAATTGCCTCGT
GTTAGAGATTGGGAGTCTCAAGGTGGTATTCTTGGTGTCGGTGAGCCTGGATTCGTCGAA
CAATCAAATCTTTGGACTGAGGAACAATTAAGTTTGACTAACGACCATTTGAATCCAAAT
GTCCTTGACCCAATCTTAGATCATTTGGAGGTATCGAGTGGTATTAAAGTTTAA

Predicted translation product    

>ZYRO0A10032g.aa
MESIARQSVKMCPFMQRAGSAKGLKALHNSNLPAAARQCPIVGHAMKRTYATATGAPREA
SASNAKAARNANEVVANHATQESTFDYNGLFESEIDKKRVDKSYRFFNNINRLAKEFPKA
HRDVEEDKVTVWCSNDYLALSKNQQVVDAMKKTLDRYGAGAGGTRNIAGHNRHALRLEAE
IAALHKKEGALVFSSCYVANDAVLSLLGQKVKDLVIFSDELNHASMIMGMKHAQTTKHIF
RHNNYAHLEELLQMYPKSTPKLIAFESVYSMAGSVADIDKICSLAEKYGALTFLDEVHAV
GLYGPHGAGVAEHCDFEPHRVTGVESVPGNSTVMDRVDMITGTLGKSFGTVGGYVAASQR
LVDWLRSYAPGFIFTTSLPPSVMAGATEAIRYQRSHLELRQLQQLHTGYVKQGMKELGIP
VIPNPSHIVPVLVGNPDLAKQASDMLMDKHKIYVQAINFPTVSRGTERLRITPTPGHTDD
LSEILLNAVDDVFKELQLPRVRDWESQGGILGVGEPGFVEQSNLWTEEQLSLTNDHLNPN
VLDPILDHLEVSSGIKV*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites