SACE0D10252g


5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway; an N-terminal signal sequence is required for localization to the mitochondrial matrix; expression is regulated by Hap2p-Hap3p

Genomic environment map

Element type: CDS
Element length: 1647 nucleotides,
on sense strand of
Sace0D: 927448..929094.
Other names:
CYD1
HEM1
YDR232W
Coding sequence: 549 codons.
Database cross references:
ArrayExpress: P09950
CYGD: YDR232w
EMBL: AAA34668.1
EMBL: CAA88511.1
EMBL: J03556
EMBL: M26329
EMBL: Z48612
Ensembl: YDR232W
GeneID: 851818
GenomeReviews: Z71256_GR
HOGENOM: P09950
NMPDR: fig|4932.3.peg.1275

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0100 GL3C0100.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein SACE0D10252p  


Protein domain map

Protein length: 548 amino acids
Protein family: GL3C0100
Database cross references:
Gene3D: G3DSA:3.40.640.10
GermOnline: YDR232W
HSSP: 1FC4
HSSP: P07912
IntAct: P09950
InterPro: IPR001917
InterPro: IPR004839
InterPro: IPR010961
InterPro: IPR015421
KEGG: sce:YDR232W
LinkHub: P09950
PIR: A24870
PROSITE: PS00599
PeptideAtlas: P09950
Pfam: PF00155
RefSeq: NP_010518.1
SGD: S000002640
TIGRFAMs: TIGR01821
UniProtKB/Swiss-Prot: P09950
UniprotKB: HEM1_YEAST

Computed results for SACE0D10252p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0045449 regulation of transcription
GO:0006783 heme biosynthetic process
GO:0005759 mitochondrial matrix
GO:0003870 5-aminolevulinate synthase activity

Sequence data  


Nucleotide sequence    

>SACE0D10252g.nt
CACGGTTTCCTTTGCCAATTGTGCGATCTGGAAACTTTTTTGTCTCTTTAGCCTACATTG
CTCCCATCCGACAATCGAGCCCAGTATCAGAGAGCTCATCGCCCAATTCGTGGCCTTAGT
CGGGCTCTTATGGTATATAAATATAATAGATCCCATCAAGAACATGCTGGAAAACCCGAG
CATGCCCGCGTCTCTGAAGCACGGGATTGAAGTCAGCTTCTGCAAACTAAAATCACTCCA
TCTGATCGAGTCCCAAGCCTCTTTCAAAGTAGCCCGTTCTTGCTTCTTGGCCAACTGAGA
ATTGGACAAGTCATCGGCAAACTTGGGTGGAGTGTCTTCTAAGAGGATCTTTTGACCCCG
CGAGTAATTTGTCAGCACGCGATCACCATCTGCTTTACCCTGCGGTTGTTGCTGCTGCTT
TTGATCTTCTGTCTGATTTGACCACGGCCACCAACGCATCTTTATATTAACTGTTGAGCT
GCTCGTTACGCACTGCTGTGGACTTCACTACAATGCTTTACCGAGCAGTGGAGTTTGATA
TAGTGGTGGTACCCCCGACGGTGCTATAATAAGCTTGCCTGCACGGCTTACCCTACCGCT
CGGAAAGGCCGCCTTCGTCGCTCATTGGTCTGCGGCCGCGGGCGCTTTTTGGTCATTGTT
CATATATGAAGAAAGAGCAGGAACGGAAATTTTTCATCTTGCCTTGCCCATTGTTCTGAT
CCATGCGTTGTTAGAAATACAACGTTTGCATCATACCAAACTTCTCCATTCCGTCAGCTG
ATACTCTATTCGGTTGTGTGTTGCACCCTGCTCTGCTGTCTCTCAACCGTTCTTCCTTTA
CACGCCTTCCCTTCTCAGCTCGCGTTTCTTTTTTTTTTATCCCACTCTTTTCTTTTTTCT
TTCCTATATATTGCCCATATAAGTTTGGTTGGAAGGAAAACTAATAGAGCTAGTTGTTGT
CCCTCAATAATCATAACAGTACTTAGGTTTTTTTTTCAGTATGCAACGCTCCATTTTTGC
GAGGTTCGGTAACTCCTCTGCCGCTGTTTCCACACTGAATAGGCTGTCCACGACAGCCGC
ACCACATGCGAAAAATGGCTATGCCACCGCTACTGGTGCTGGTGCCGCTGCTGCCACTGC
CACAGCGTCATCAACACATGCAGCAGCAGCAGCAGCCGCTGCTGCCAACCATTCCACCCA
GGAGTCGGGTTTCGATTACGAAGGCCTGATAGATTCCGAACTGCAGAAGAAAAGACTTGA
CAAATCGTACAGATATTTCAACAATATCAACCGATTGGCCAAGGAGTTCCCCCTAGCTCA
TCGCCAGAGAGAGGCGGACAAGGTCACCGTTTGGTGTTCCAACGACTATTTAGCACTTTC
CAAGCACCCTGAGGTATTGGACGCCATGCATAAAACTATCGACAAGTATGGTTGTGGTGC
CGGTGGTACAAGAAACATTGCTGGCCATAACATCCCCACTTTGAATCTGGAAGCCGAATT
GGCCACTTTACACAAGAAGGAAGGTGCCTTAGTTTTTTCGTCATGTTACGTAGCCAACGA
TGCCGTCTTATCCCTACTGGGTCAAAAGATGAAGGACTTGGTGATTTTCTCCGACGAACT
CAACCATGCGTCCATGATTGTCGGTATTAAGCATGCTAACGTAAAAAAACACATTTTCAA
ACATAATGACTTGAACGAATTGGAACAACTGCTCCAGTCATACCCCAAATCCGTTCCTAA
ACTAATTGCTTTCGAATCAGTATATTCTATGGCCGGTTCAGTGGCCGACATAGAAAAAAT
TTGCGACTTGGCCGACAAATACGGTGCTTTGACCTTCTTGGATGAAGTACATGCGGTCGG
CCTGTACGGCCCTCACGGTGCAGGTGTTGCAGAACATTGTGATTTTGAAAGTCACCGTGC
AAGTGGTATTGCTACCCCAAAGACCAATGACAAGGGCGGCGCGAAGACTGTGATGGACCG
TGTCGACATGATCACCGGCACTTTAGGTAAGTCTTTCGGTAGCGTAGGTGGCTACGTCGC
AGCCTCTAGGAAATTGATCGATTGGTTCAGATCGTTTGCACCTGGTTTCATTTTCACCAC
GACTTTACCACCTTCAGTTATGGCAGGCGCTACCGCAGCAATTAGATACCAACGTTGCCA
CATCGACCTAAGAACCTCGCAACAGAAACATACCATGTACGTAAAGAAAGCTTTCCATGA
GTTGGGCATTCCAGTTATTCCAAATCCTTCTCATATCGTCCCAGTGTTGATTGGTAATGC
TGATTTGGCTAAGCAAGCTTCTGACATCTTAATCAATAAGCATCAAATCTACGTACAAGC
TATCAACTTCCCTACGGTTGCTCGCGGTACCGAAAGATTGAGAATTACCCCAACGCCAGG
TCACACCAACGATTTATCTGACATCTTAATCAATGCAGTTGATGATGTGTTCAATGAGCT
ACAGTTACCACGTGTCAGAGACTGGGAAAGCCAAGGTGGCTTATTGGGTGTTGGAGAGAG
CGGATTTGTGGAAGAGTCTAACTTATGGACATCAAGCCAACTATCTTTAACTAATGACGA
CTTGAACCCTAATGTTAGAGACCCCATCGTTAAACAACTAGAGGTTTCTAGTGGTATCAA
GCAGTAAAACAACCAATATATGCATGGGCTGAGATAGAGGTACAAGGAATTTGTAAATCA
GTAAAAAAAAAAATTAACAGTTTTTTTTTTTCATTTTTTTTTTTTATTCTTATTTATGTA
TGATACTTTATTATTATTTCTCTTAATTATTTATTTATTTAACTAACACGATGAGCACTT
TTAACTGCAATGGTTAAACTGTAGCAATGTTGGTAAAAAAGCAGGGAAAGTTCAAAAATA
ATTTATGTATTTTTCCTCGGGGGTACAGAAAGTAAAGAGAGAGAAACGTATGAGTATATA
TAGGCTA

Coding sequence    

>SACE0D10252g.cds
ATGCAACGCTCCATTTTTGCGAGGTTCGGTAACTCCTCTGCCGCTGTTTCCACACTGAAT
AGGCTGTCCACGACAGCCGCACCACATGCGAAAAATGGCTATGCCACCGCTACTGGTGCT
GGTGCCGCTGCTGCCACTGCCACAGCGTCATCAACACATGCAGCAGCAGCAGCAGCCGCT
GCTGCCAACCATTCCACCCAGGAGTCGGGTTTCGATTACGAAGGCCTGATAGATTCCGAA
CTGCAGAAGAAAAGACTTGACAAATCGTACAGATATTTCAACAATATCAACCGATTGGCC
AAGGAGTTCCCCCTAGCTCATCGCCAGAGAGAGGCGGACAAGGTCACCGTTTGGTGTTCC
AACGACTATTTAGCACTTTCCAAGCACCCTGAGGTATTGGACGCCATGCATAAAACTATC
GACAAGTATGGTTGTGGTGCCGGTGGTACAAGAAACATTGCTGGCCATAACATCCCCACT
TTGAATCTGGAAGCCGAATTGGCCACTTTACACAAGAAGGAAGGTGCCTTAGTTTTTTCG
TCATGTTACGTAGCCAACGATGCCGTCTTATCCCTACTGGGTCAAAAGATGAAGGACTTG
GTGATTTTCTCCGACGAACTCAACCATGCGTCCATGATTGTCGGTATTAAGCATGCTAAC
GTAAAAAAACACATTTTCAAACATAATGACTTGAACGAATTGGAACAACTGCTCCAGTCA
TACCCCAAATCCGTTCCTAAACTAATTGCTTTCGAATCAGTATATTCTATGGCCGGTTCA
GTGGCCGACATAGAAAAAATTTGCGACTTGGCCGACAAATACGGTGCTTTGACCTTCTTG
GATGAAGTACATGCGGTCGGCCTGTACGGCCCTCACGGTGCAGGTGTTGCAGAACATTGT
GATTTTGAAAGTCACCGTGCAAGTGGTATTGCTACCCCAAAGACCAATGACAAGGGCGGC
GCGAAGACTGTGATGGACCGTGTCGACATGATCACCGGCACTTTAGGTAAGTCTTTCGGT
AGCGTAGGTGGCTACGTCGCAGCCTCTAGGAAATTGATCGATTGGTTCAGATCGTTTGCA
CCTGGTTTCATTTTCACCACGACTTTACCACCTTCAGTTATGGCAGGCGCTACCGCAGCA
ATTAGATACCAACGTTGCCACATCGACCTAAGAACCTCGCAACAGAAACATACCATGTAC
GTAAAGAAAGCTTTCCATGAGTTGGGCATTCCAGTTATTCCAAATCCTTCTCATATCGTC
CCAGTGTTGATTGGTAATGCTGATTTGGCTAAGCAAGCTTCTGACATCTTAATCAATAAG
CATCAAATCTACGTACAAGCTATCAACTTCCCTACGGTTGCTCGCGGTACCGAAAGATTG
AGAATTACCCCAACGCCAGGTCACACCAACGATTTATCTGACATCTTAATCAATGCAGTT
GATGATGTGTTCAATGAGCTACAGTTACCACGTGTCAGAGACTGGGAAAGCCAAGGTGGC
TTATTGGGTGTTGGAGAGAGCGGATTTGTGGAAGAGTCTAACTTATGGACATCAAGCCAA
CTATCTTTAACTAATGACGACTTGAACCCTAATGTTAGAGACCCCATCGTTAAACAACTA
GAGGTTTCTAGTGGTATCAAGCAGTAA

Predicted translation product    

>SACE0D10252g.aa
MQRSIFARFGNSSAAVSTLNRLSTTAAPHAKNGYATATGAGAAAATATASSTHAAAAAAA
AANHSTQESGFDYEGLIDSELQKKRLDKSYRYFNNINRLAKEFPLAHRQREADKVTVWCS
NDYLALSKHPEVLDAMHKTIDKYGCGAGGTRNIAGHNIPTLNLEAELATLHKKEGALVFS
SCYVANDAVLSLLGQKMKDLVIFSDELNHASMIVGIKHANVKKHIFKHNDLNELEQLLQS
YPKSVPKLIAFESVYSMAGSVADIEKICDLADKYGALTFLDEVHAVGLYGPHGAGVAEHC
DFESHRASGIATPKTNDKGGAKTVMDRVDMITGTLGKSFGSVGGYVAASRKLIDWFRSFA
PGFIFTTTLPPSVMAGATAAIRYQRCHIDLRTSQQKHTMYVKKAFHELGIPVIPNPSHIV
PVLIGNADLAKQASDILINKHQIYVQAINFPTVARGTERLRITPTPGHTNDLSDILINAV
DDVFNELQLPRVRDWESQGGLLGVGESGFVEESNLWTSSQLSLTNDDLNPNVRDPIVKQL
EVSSGIKQ*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites