CAGL0B02607g


highly similar to uniprot|P09950 Saccharomyces cerevisiae YDR232w HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway

Genomic environment map

Element type: CDS
Element length: 1593 nucleotides,
on anti-sense strand of
Cagl0B: complement(248795..250387).
Other names:
CAGL-CDS1907.1
CAGL-IPF837
Coding sequence: 531 codons.
Database cross references:
EMBL: CR380948
GeneID: 2886690
HOGENOM: Q6FXE3

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0100 GL3C0100.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0B02607p  


highly similar to uniprot|P09950 Saccharomyces cerevisiae YDR232w HEM1

Protein domain map

Protein length: 530 amino acids
Protein family: GL3C0100
Database cross references:
Gene3D: G3DSA:3.40.640.10
InterPro: IPR001917
InterPro: IPR004839
InterPro: IPR010961
InterPro: IPR015421
KEGG: cgr:CAGL0B02607g
PROSITE: PS00599
Pfam: PF00155
RefSeq: XP_445084.1
TIGRFAMs: TIGR01821
UniProtKB/Swiss-Prot: Q6FXE3
UniprotKB: HEM1_CANGA

Computed results for CAGL0B02607p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0005759 mitochondrial matrix

Sequence data  


Nucleotide sequence    

>CAGL0B02607g.nt
CCGAGTCGCATAGTTTAACATTTCCTGCTTTGGTGTAATGTTGGATAGAATTAGAGTCTA
TTGTAAACCTCAGGTCATAGTCAGTTACGTAGCCAAATATGTTATGATGTGGAATGGCTG
TTCCTACCACTTCTGTGACAAATTTTTGTAAGTTGATGGCACTTGACTCGTTGATCACAA
GCTTCGTGTTTGTATTGTTGAACAAAGGCAACACCTCGTTGACCCGTTTGGATAATGCAT
GTAGCGACGAAGCTGCAGCATTCACAAACACCAAATCGAAACCCGCGTTGCCTCTGCTGT
CAAGTAAATCGCCAACACTCATAAACACTTTGGTCGGGTCCACTTGCAAACCACCGTTCC
CATACTGCCTGGTCCGCACCGTAACATCCTGGGAGTACTTGCTACCAACGACACAGAGCT
CAATGCTCTTCGCTAGCGAGAACCTCCATGCCAGGAACAAGTCAAATGGATCATCTCCAT
AGACTAATACACTCAACGACATTCAGCAATTTGTATCTGTCGCCACTACTAAGCTGCAAT
AAAAACGGAAGTGTGTGAGGGGGAAGAACTCAACTGCCTTTATACCTCTCTTACAGTTAC
AATCAGTGATATCTGTTAACAATGTTGCAAGATTCTAAAATAGAAGTTAACTCGTTTCGT
AGTCACGGGGAAAAAAAGCTTAAACTAATTCAATGCAAGTGACAACTCAGATGATTGCCT
TCTTTCGTTCAACTTAGACAGAGTACCTTGTTCTGGACCTCGTAAGTCCAATCCCGATCC
TCCGTTGCTGTTTGCGATGTTTGTCGTTGCTTCATGTTTGGGCTCTGACTAAGAATTTTT
CACTTTTCTAATGACAGGAAGGTTGAAAAATTATTCTAATGGGCTTGACTGAAAAAAAAA
ATTGTATAAAAAGTAGATTGCGATGAGCTGTTCCTTTGTCTGTGGTCTGAGGAGAGTATA
GCACATACAAAAGCGCTGTATTAACTTGTTCTGTTTTGCGATGTTCCGTCCTGTGTTGAA
GGTGAGGCCATCGTTCTCGTACCCATATAGCATTGTTTCTTCGAGGTCAGTGAGACTAGC
ATCTACTGCCACTGCCAACGCTAACACTGCCGCTGCCACTTCTACAGTTGCTGCCCATGG
TACCCAGGAGACGCCCTTTGACTTTGAAGGCCACTTCGAGAGCGAGTTGGCCAAGAAGAG
ACTGGATAAGTCGTACAGGTACTTCAATAACATCAACAGGCTTGCGAAGGAGTTCCCCTT
GGCTCACCGTCAGTTAGAGGACGATAAAGTCACTGTGTGGTGTTCCAACGATTACTTGGC
TTTGTCTAAGAACCCCCAAGTCTTGGACGCTATGCGCAAGACCATCGACAAGTACGGTGC
TGGTGCAGGTGGTACCAGAAACATTGCCGGCCACAACATTCCAACCATGAGACTGGAGGC
TGAGCTAGCGGCCTTGCACAAGAAGGAAGGTGCTCTAGTGTTCTCCTCTTGTTATGTTGC
CAACGATGCCGTCATCTCTCTATTGGGTCAAAAGGTGAAGGACTTGGTGATCTTCTCCGA
TGAACTAAACCATGCCTCCATGATCGTTGGTATTAAGCACGCCAACAGACCTAAGCACAT
CTTCAGACACAACGACTTGGCTCAGCTGGAGGAGATGCTACAGATGTATCCAAAGTCTAC
TCCAAAGTTGATCGCTTTCGAGTCTGTCTACTCTATGGCTGGTTCTGTTGCAGACATCAA
CAAGATATGTGACTTAGCTGAGAAATACGGTGCGCTAACTTTCCTAGATGAAGTGCATGC
CGTCGGTCTATACGGTCCACATGGTGCCGGTGTCGCTGAGCATTGTGACTTCGAAGCCCA
CCGTGTTGCAGGTATCGCCACTCCACCACAAGGTGACAACGGTAGACTCAGAACAGTCAT
GGACCGTGTTGATATGATTACCGGTACTTTGGGTAAGTCCTTCGGTACTGTTGGTGGTTA
CGTCGCTGCTTCGAGCAAGCTAATCGACTGGGTTAGATCTTACGCCCCAGGTTTCATCTT
CACCACTACCTTGCCTCCAGCTGTCATGGCTGGTGCTGCTGAGGCCATCAGATTCCAACG
TTCTCACTTGAACTTAAGACAAGATCAACAAAGGCACACCGCATACGTCAAGAAAGGTTT
GCATGATCTTGGCATTCCAGTTATTCCTAACCCATCCCACATTGTCCCAGTTCTAATTGG
TAACCCAGATTTGGCTAAGCAAGCCTCTGATATTCTAATGGAAAAGCATCGCATCTATGT
TCAAGCTATCAACTTCCCAACTGTTTCTCGTGGTACCGAGAGATTGAGAATTACTCCTAC
CCCAGGTCACACAAACGATCTATCTGATATTTTGATTGCTGCAGTCGATGATGTCTTTAA
TGAATTGCAATTGCCACGTATCAGAGATTGGGAAATGCAAGGTGGTCTATTGGGTGTCGG
TGACAAGAACTTCGTCCCAGAACCAAATCTATGGACTGAAGAGCAACTATCTTTCAGCAA
CGAGGATTTGAACTCTAACGTCTTTGAGCCAGTTATCGACCAACTTGAAGTTTCCAGTGG
TGTCAAATTATAAATGAGAGTATAAATGAAGATAAAAAAAATGCATTTCCATTTTGCATA
AATACCGGGCTAGGTTATTTGTTTCAGTGGTTTTTTTCTTTTTCATGATTTTACCTAATA
TGCTCTAACCACTTAGTATGTGGTAGGCGCCATTGCCATCAATAAGGTAATCTTAATGGA
CACGATACCATGTTTTTTTAACGACTTTTTGAGGGAATATCTAATGATTCTTTATTATAG
ATTGTAAAGAATGAATGTTTTTATTTTTTGTTGTTTGTGTTATTTTTCCTAATTCTAATA
AGACTATTTTTTA

Coding sequence    

>CAGL0B02607g.cds
ATGTTCCGTCCTGTGTTGAAGGTGAGGCCATCGTTCTCGTACCCATATAGCATTGTTTCT
TCGAGGTCAGTGAGACTAGCATCTACTGCCACTGCCAACGCTAACACTGCCGCTGCCACT
TCTACAGTTGCTGCCCATGGTACCCAGGAGACGCCCTTTGACTTTGAAGGCCACTTCGAG
AGCGAGTTGGCCAAGAAGAGACTGGATAAGTCGTACAGGTACTTCAATAACATCAACAGG
CTTGCGAAGGAGTTCCCCTTGGCTCACCGTCAGTTAGAGGACGATAAAGTCACTGTGTGG
TGTTCCAACGATTACTTGGCTTTGTCTAAGAACCCCCAAGTCTTGGACGCTATGCGCAAG
ACCATCGACAAGTACGGTGCTGGTGCAGGTGGTACCAGAAACATTGCCGGCCACAACATT
CCAACCATGAGACTGGAGGCTGAGCTAGCGGCCTTGCACAAGAAGGAAGGTGCTCTAGTG
TTCTCCTCTTGTTATGTTGCCAACGATGCCGTCATCTCTCTATTGGGTCAAAAGGTGAAG
GACTTGGTGATCTTCTCCGATGAACTAAACCATGCCTCCATGATCGTTGGTATTAAGCAC
GCCAACAGACCTAAGCACATCTTCAGACACAACGACTTGGCTCAGCTGGAGGAGATGCTA
CAGATGTATCCAAAGTCTACTCCAAAGTTGATCGCTTTCGAGTCTGTCTACTCTATGGCT
GGTTCTGTTGCAGACATCAACAAGATATGTGACTTAGCTGAGAAATACGGTGCGCTAACT
TTCCTAGATGAAGTGCATGCCGTCGGTCTATACGGTCCACATGGTGCCGGTGTCGCTGAG
CATTGTGACTTCGAAGCCCACCGTGTTGCAGGTATCGCCACTCCACCACAAGGTGACAAC
GGTAGACTCAGAACAGTCATGGACCGTGTTGATATGATTACCGGTACTTTGGGTAAGTCC
TTCGGTACTGTTGGTGGTTACGTCGCTGCTTCGAGCAAGCTAATCGACTGGGTTAGATCT
TACGCCCCAGGTTTCATCTTCACCACTACCTTGCCTCCAGCTGTCATGGCTGGTGCTGCT
GAGGCCATCAGATTCCAACGTTCTCACTTGAACTTAAGACAAGATCAACAAAGGCACACC
GCATACGTCAAGAAAGGTTTGCATGATCTTGGCATTCCAGTTATTCCTAACCCATCCCAC
ATTGTCCCAGTTCTAATTGGTAACCCAGATTTGGCTAAGCAAGCCTCTGATATTCTAATG
GAAAAGCATCGCATCTATGTTCAAGCTATCAACTTCCCAACTGTTTCTCGTGGTACCGAG
AGATTGAGAATTACTCCTACCCCAGGTCACACAAACGATCTATCTGATATTTTGATTGCT
GCAGTCGATGATGTCTTTAATGAATTGCAATTGCCACGTATCAGAGATTGGGAAATGCAA
GGTGGTCTATTGGGTGTCGGTGACAAGAACTTCGTCCCAGAACCAAATCTATGGACTGAA
GAGCAACTATCTTTCAGCAACGAGGATTTGAACTCTAACGTCTTTGAGCCAGTTATCGAC
CAACTTGAAGTTTCCAGTGGTGTCAAATTATAA

Predicted translation product    

>CAGL0B02607g.aa
MFRPVLKVRPSFSYPYSIVSSRSVRLASTATANANTAAATSTVAAHGTQETPFDFEGHFE
SELAKKRLDKSYRYFNNINRLAKEFPLAHRQLEDDKVTVWCSNDYLALSKNPQVLDAMRK
TIDKYGAGAGGTRNIAGHNIPTMRLEAELAALHKKEGALVFSSCYVANDAVISLLGQKVK
DLVIFSDELNHASMIVGIKHANRPKHIFRHNDLAQLEEMLQMYPKSTPKLIAFESVYSMA
GSVADINKICDLAEKYGALTFLDEVHAVGLYGPHGAGVAEHCDFEAHRVAGIATPPQGDN
GRLRTVMDRVDMITGTLGKSFGTVGGYVAASSKLIDWVRSYAPGFIFTTTLPPAVMAGAA
EAIRFQRSHLNLRQDQQRHTAYVKKGLHDLGIPVIPNPSHIVPVLIGNPDLAKQASDILM
EKHRIYVQAINFPTVSRGTERLRITPTPGHTNDLSDILIAAVDDVFNELQLPRIRDWEMQ
GGLLGVGDKNFVPEPNLWTEEQLSFSNEDLNSNVFEPVIDQLEVSSGVKL*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites