DEHA2B05478g


similar to uniprot|P09950 Saccharomyces cerevisiae YDR232w HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway

Genomic environment map

Element type: CDS
Element length: 1728 nucleotides,
on sense strand of
Deha2B: 441580..443307.
Other names:
DEHA-CDS1779.1
DEHA-IPF9580
DEHA0B05357g
Coding sequence: 576 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0100 GL3C0100.N3
Orthologs: strict determination not possible; homologs must be refined manually

Protein DEHA2B05478p  


similar to uniprot|P09950 Saccharomyces cerevisiae YDR232w HEM1 5-aminolevulinate synthase

Protein domain map

Protein length: 575 amino acids
Protein family: GL3C0100
Database cross references:
InterPro: IPR001917
InterPro: IPR004839
InterPro: IPR010961
InterPro: IPR015421
UniProtKB/Swiss-Prot: Q6BX71

Computed results for DEHA2B05478p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  

Nucleotide sequence

>DEHA2B05478g.nt
TGGTATAGGGCAAAATGAATACGGCGATTACAATATAGTGCTACTATACAATGCATTAAA
CCCAATCAGTTTGTACTTAAGGTAGAGGTGGATCGACATTCGAAATTGAATATTGCAAAG
TCATATATAATGACAAGGGGTATGAAGATAAAGGCGGAAGTGTCAATCATGATATTATAC
CATTAAGGTTTGTCTGTTAGTGGCAGTTGTACTGTAGTATTTGAGTATTTGAATCGTTCG
GTTAGACTAAATAAATTTACATTAAAACAAAATAAAACGAGTATAAATAAAGTAATAAGG
AATGAGAACAGAAGAATATTTGTACCAAACCGGTACTTGTATATCGAGTTTGTTTTCAGG
ACATATAATGAAACGTTTAGAAGATCTACGGATGTTCTATTGCGCCTATGAATTTTCTTC
GGTCTTAACCCTCAGTTCCCGACTTCGTGGCCAAGTTGGCTAAGGCGTGAGACTGTTAAT
CTCAAGATCGTGAGTTCGACCCTCACCGAGGTCGTTTACATTTTTACTCCATTTTTTTTT
TTTAAAATCGAGAAACATGACAGAATGATTATTTGAGAGTTCGATGATTCGCTGCATAAA
TACTTCATAACTTCATAACTTATCATTAATTTACTAAGCAGTATTTACTCTATAATTGGA
CGCTTGTGATTAGGATTATTGCTACGTGATGACGTGACAGATGCGCTTAATATTACTCCA
TGCTAAGTCATTCTTGTAGACCTCTCATAAGACAAAAACCATCATGCGAGCGTTACATAA
CCTTCCGCGGGATTGAAAAAAAATTGCTCAACCACAAGATTTCTTTGTCTTGAAAAATAA
GTCTATTGATAATTTCTTCTTCCACATATAACCCACTAAACATCTTGAAAGTTACTTAAA
GAAGCAACAAGCTAGGTATTACGGTGAAATATACTGGATCTACAAAAAACATTAAAAAAA
GTATATTTTTGAAGTAAAAAAAAAGTCATCTGAATTAGCCATGGAATCAATTACTCGAGT
CTCCATGTCCGTTTGTCCATTCGTGAAATCGTCATCGGCCCAAGCGTTACGTCAATTGAG
TAAAAACTCAGCTTTGACTAGTCAAGCACGCCAATGTCCATTTATGGGTGCGGCCTTGAA
TGCTAAGGAATCGACTAGATCGTACTCGTCAGCCACTAAACCTGTCAGAGCTACAGCTTC
TTCTCTTGCTTCGAATCCACCTTCGACTATGCAATCAAAGTACTCTTTCAAGGCAGAGGA
ACTCGTTGGAAACAAGGATGCGATTAATCTTGAGTCAAAAGAGAATACTTTTGACTTCAA
AGGTTACTTAAACTCCGAATTGAGCAAGAAGAGAACTGACAAGTCGTATCGTTTCTTCAA
CAACATTAACCGTTTAGCTAATGAATTCCCTAAGGCTCATCGTTCTGAAGAGAATGACAA
GGTTACTGTTTGGTGCTCTAATGATTATCTTGGTATGGGTAAAAATGAGAATACGATAAA
CGAAATGAAGCGAGTCTTGACCAAGTATGGTTCAGGTGCCGGTGGTACCAGAAACATTGC
TGGACACAACATCCATGCTCTTAAGTTAGAATCGGAGTTAGCTGCTTTGCACAAACATGA
AGCTGCATTAGTTTTTTCGTCTTGTTTTGTTGCTAACGATGCAGTTTTATCGTTATTTGG
CCAAAAGATCAAAGACTTAGTCATCTTTTCTGATGAATTAAATCATGCCTCTATGATCCA
GGGTATTAGAAATTCAAGAGCTAAAAAGCAAGTCTTTAAGCATAATGATTTGGCTGATTT
GGAAGAAAAATTAGCTCAATATCCAAAATCAACCCCAAAATTGATTGCATTTGAGTCTGT
CTACTCTATGTGTGGTTCTATCGCCCCTATCGAAGCTATTTGTGACTTAGCAGAAAAATA
CGGAGCATTGACTTTCCTTGATGAAGTCCACGCAGTAGGTATGTATGGTCCTCATGGTGC
TGGTGTTGCTGAACACTTAGATTTCGACGCCCATTTGAAATCGGGTATTGCTTCGCCACA
AACCCAAACAGTTATGAATAGAGTAGATATGGTTACAGGTACTTTAGGTAAGGCCTACGG
TACTGTAGGTGGTTACATCACCGGTAAGGCTAACTTGATTGATTGGTTTAGGTCTTTTGC
TCCAGGTTTTATCTTCACTACCACTTTACCTCCATCAATTATGGCTGGATCAAGTGCTTC
CATTCGTTACCAAAGATCAACCTTGCAAGACCGAATCGCTCAACAAACTAATACCAGATA
CGTCAAGAACAATTTAACCGATATTGGTATTCCTGTTATTCCAAACCCATCGCATATTGT
GCCAGTTTTAGTTGGTAATGCGCTCGACGCTAAAAAGGCCTCTGATTTGTTATTAGATAA
GTACAACATTTACGTTCAAGCCATCAATTTCCCAACTGTACCAATCGGTCAAGAAAGATT
GAGAATCACCCCAACTCCAGGCCACGGACCCGAACTTTCAAACCAATTGATTGGTGCCCT
TGATTCGGTTTTCAATGAATTGAGTTTGTCAAGAATCGGCGATTGGGAAGGTAAAGGAGG
TTTGTGTGGTGTTGGTGAACCTGATATCGAACCAATAGAACACATTTGGACTTCCGAACA
ATTAGCTTTGACTGATGCTGATATAAACCCTAACGTGATCGACCCTGTTATTCAACCTAT
TGGAGTCAGTTCTGGTGTTAGGGATTAAAAGTAATATACTACATAATTTTTTTCACATTA
TTGTGTTTTTTTTTTACGTTAGCTATGCCATCTTTTAATAAAACTTTCATCTATTTACTG
AAAACACTCTCTGTTTATCTTGTAGTTCAGTCTAATACTATCAGGTTGAAGCATATATAC
TATAATAGTTGAGTAACACAAAATCGTAGGATCAAAACATCAACGCGCTTGCAAAAGAAA
ACAATCACAATATACATTACGTTATAATCTAAAAGGATAGCCATTATGACACTAGATAAG
GACCAACTAATAAAGGTTTATACTAGTA

Coding sequence

>DEHA2B05478g.cds
ATGGAATCAATTACTCGAGTCTCCATGTCCGTTTGTCCATTCGTGAAATCGTCATCGGCC
CAAGCGTTACGTCAATTGAGTAAAAACTCAGCTTTGACTAGTCAAGCACGCCAATGTCCA
TTTATGGGTGCGGCCTTGAATGCTAAGGAATCGACTAGATCGTACTCGTCAGCCACTAAA
CCTGTCAGAGCTACAGCTTCTTCTCTTGCTTCGAATCCACCTTCGACTATGCAATCAAAG
TACTCTTTCAAGGCAGAGGAACTCGTTGGAAACAAGGATGCGATTAATCTTGAGTCAAAA
GAGAATACTTTTGACTTCAAAGGTTACTTAAACTCCGAATTGAGCAAGAAGAGAACTGAC
AAGTCGTATCGTTTCTTCAACAACATTAACCGTTTAGCTAATGAATTCCCTAAGGCTCAT
CGTTCTGAAGAGAATGACAAGGTTACTGTTTGGTGCTCTAATGATTATCTTGGTATGGGT
AAAAATGAGAATACGATAAACGAAATGAAGCGAGTCTTGACCAAGTATGGTTCAGGTGCC
GGTGGTACCAGAAACATTGCTGGACACAACATCCATGCTCTTAAGTTAGAATCGGAGTTA
GCTGCTTTGCACAAACATGAAGCTGCATTAGTTTTTTCGTCTTGTTTTGTTGCTAACGAT
GCAGTTTTATCGTTATTTGGCCAAAAGATCAAAGACTTAGTCATCTTTTCTGATGAATTA
AATCATGCCTCTATGATCCAGGGTATTAGAAATTCAAGAGCTAAAAAGCAAGTCTTTAAG
CATAATGATTTGGCTGATTTGGAAGAAAAATTAGCTCAATATCCAAAATCAACCCCAAAA
TTGATTGCATTTGAGTCTGTCTACTCTATGTGTGGTTCTATCGCCCCTATCGAAGCTATT
TGTGACTTAGCAGAAAAATACGGAGCATTGACTTTCCTTGATGAAGTCCACGCAGTAGGT
ATGTATGGTCCTCATGGTGCTGGTGTTGCTGAACACTTAGATTTCGACGCCCATTTGAAA
TCGGGTATTGCTTCGCCACAAACCCAAACAGTTATGAATAGAGTAGATATGGTTACAGGT
ACTTTAGGTAAGGCCTACGGTACTGTAGGTGGTTACATCACCGGTAAGGCTAACTTGATT
GATTGGTTTAGGTCTTTTGCTCCAGGTTTTATCTTCACTACCACTTTACCTCCATCAATT
ATGGCTGGATCAAGTGCTTCCATTCGTTACCAAAGATCAACCTTGCAAGACCGAATCGCT
CAACAAACTAATACCAGATACGTCAAGAACAATTTAACCGATATTGGTATTCCTGTTATT
CCAAACCCATCGCATATTGTGCCAGTTTTAGTTGGTAATGCGCTCGACGCTAAAAAGGCC
TCTGATTTGTTATTAGATAAGTACAACATTTACGTTCAAGCCATCAATTTCCCAACTGTA
CCAATCGGTCAAGAAAGATTGAGAATCACCCCAACTCCAGGCCACGGACCCGAACTTTCA
AACCAATTGATTGGTGCCCTTGATTCGGTTTTCAATGAATTGAGTTTGTCAAGAATCGGC
GATTGGGAAGGTAAAGGAGGTTTGTGTGGTGTTGGTGAACCTGATATCGAACCAATAGAA
CACATTTGGACTTCCGAACAATTAGCTTTGACTGATGCTGATATAAACCCTAACGTGATC
GACCCTGTTATTCAACCTATTGGAGTCAGTTCTGGTGTTAGGGATTAA

Predicted translation product

>DEHA2B05478g.aa
MESITRVSMSVCPFVKSSSAQALRQLSKNSALTSQARQCPFMGAALNAKESTRSYSSATK
PVRATASSLASNPPSTMQSKYSFKAEELVGNKDAINLESKENTFDFKGYLNSELSKKRTD
KSYRFFNNINRLANEFPKAHRSEENDKVTVWCSNDYLGMGKNENTINEMKRVLTKYGSGA
GGTRNIAGHNIHALKLESELAALHKHEAALVFSSCFVANDAVLSLFGQKIKDLVIFSDEL
NHASMIQGIRNSRAKKQVFKHNDLADLEEKLAQYPKSTPKLIAFESVYSMCGSIAPIEAI
CDLAEKYGALTFLDEVHAVGMYGPHGAGVAEHLDFDAHLKSGIASPQTQTVMNRVDMVTG
TLGKAYGTVGGYITGKANLIDWFRSFAPGFIFTTTLPPSIMAGSSASIRYQRSTLQDRIA
QQTNTRYVKNNLTDIGIPVIPNPSHIVPVLVGNALDAKKASDLLLDKYNIYVQAINFPTV
PIGQERLRITPTPGHGPELSNQLIGALDSVFNELSLSRIGDWEGKGGLCGVGEPDIEPIE
HIWTSEQLALTDADINPNVIDPVIQPIGVSSGVRD*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites