CAGL0B02607g
highly similar to uniprot|P09950 Saccharomyces cerevisiae YDR232w HEM1 5-aminolevulinate synthase, catalyzes the first step in the heme biosynthetic pathway
Element type: CDS
Element length: 1593 nucleotides,
on anti-sense strand of
Cagl0B: complement(248795..250387).
Other names:
CAGL-CDS1907.1
CAGL-IPF837
Coding sequence: 531 codons.
Element length: 1593 nucleotides,
on anti-sense strand of
Cagl0B: complement(248795..250387).
Other names:
CAGL-CDS1907.1
CAGL-IPF837
Coding sequence: 531 codons.
Database cross references:
EMBL: CR380948
GeneID: 2886690
HOGENOM: Q6FXE3
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR380948
GeneID: 2886690
HOGENOM: Q6FXE3
Homologs and Orthologs
Homologs in protein families: GL3C0100 GL3C0100.N2Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
Gene3D: G3DSA:3.40.640.10
InterPro: IPR001917
InterPro: IPR004839
InterPro: IPR010961
InterPro: IPR015421
KEGG: cgr:CAGL0B02607g
PROSITE: PS00599
Pfam: PF00155
RefSeq: XP_445084.1
TIGRFAMs: TIGR01821
UniProtKB/Swiss-Prot: Q6FXE3
UniprotKB: HEM1_CANGA
Gene3D: G3DSA:3.40.640.10
InterPro: IPR001917
InterPro: IPR004839
InterPro: IPR010961
InterPro: IPR015421
KEGG: cgr:CAGL0B02607g
PROSITE: PS00599
Pfam: PF00155
RefSeq: XP_445084.1
TIGRFAMs: TIGR01821
UniProtKB/Swiss-Prot: Q6FXE3
UniprotKB: HEM1_CANGA
Sequence data 
>CAGL0B02607g.nt CCGAGTCGCATAGTTTAACATTTCCTGCTTTGGTGTAATGTTGGATAGAATTAGAGTCTA TTGTAAACCTCAGGTCATAGTCAGTTACGTAGCCAAATATGTTATGATGTGGAATGGCTG TTCCTACCACTTCTGTGACAAATTTTTGTAAGTTGATGGCACTTGACTCGTTGATCACAA GCTTCGTGTTTGTATTGTTGAACAAAGGCAACACCTCGTTGACCCGTTTGGATAATGCAT GTAGCGACGAAGCTGCAGCATTCACAAACACCAAATCGAAACCCGCGTTGCCTCTGCTGT CAAGTAAATCGCCAACACTCATAAACACTTTGGTCGGGTCCACTTGCAAACCACCGTTCC CATACTGCCTGGTCCGCACCGTAACATCCTGGGAGTACTTGCTACCAACGACACAGAGCT CAATGCTCTTCGCTAGCGAGAACCTCCATGCCAGGAACAAGTCAAATGGATCATCTCCAT AGACTAATACACTCAACGACATTCAGCAATTTGTATCTGTCGCCACTACTAAGCTGCAAT AAAAACGGAAGTGTGTGAGGGGGAAGAACTCAACTGCCTTTATACCTCTCTTACAGTTAC AATCAGTGATATCTGTTAACAATGTTGCAAGATTCTAAAATAGAAGTTAACTCGTTTCGT AGTCACGGGGAAAAAAAGCTTAAACTAATTCAATGCAAGTGACAACTCAGATGATTGCCT TCTTTCGTTCAACTTAGACAGAGTACCTTGTTCTGGACCTCGTAAGTCCAATCCCGATCC TCCGTTGCTGTTTGCGATGTTTGTCGTTGCTTCATGTTTGGGCTCTGACTAAGAATTTTT CACTTTTCTAATGACAGGAAGGTTGAAAAATTATTCTAATGGGCTTGACTGAAAAAAAAA ATTGTATAAAAAGTAGATTGCGATGAGCTGTTCCTTTGTCTGTGGTCTGAGGAGAGTATA GCACATACAAAAGCGCTGTATTAACTTGTTCTGTTTTGCGATGTTCCGTCCTGTGTTGAA GGTGAGGCCATCGTTCTCGTACCCATATAGCATTGTTTCTTCGAGGTCAGTGAGACTAGC ATCTACTGCCACTGCCAACGCTAACACTGCCGCTGCCACTTCTACAGTTGCTGCCCATGG TACCCAGGAGACGCCCTTTGACTTTGAAGGCCACTTCGAGAGCGAGTTGGCCAAGAAGAG ACTGGATAAGTCGTACAGGTACTTCAATAACATCAACAGGCTTGCGAAGGAGTTCCCCTT GGCTCACCGTCAGTTAGAGGACGATAAAGTCACTGTGTGGTGTTCCAACGATTACTTGGC TTTGTCTAAGAACCCCCAAGTCTTGGACGCTATGCGCAAGACCATCGACAAGTACGGTGC TGGTGCAGGTGGTACCAGAAACATTGCCGGCCACAACATTCCAACCATGAGACTGGAGGC TGAGCTAGCGGCCTTGCACAAGAAGGAAGGTGCTCTAGTGTTCTCCTCTTGTTATGTTGC CAACGATGCCGTCATCTCTCTATTGGGTCAAAAGGTGAAGGACTTGGTGATCTTCTCCGA TGAACTAAACCATGCCTCCATGATCGTTGGTATTAAGCACGCCAACAGACCTAAGCACAT CTTCAGACACAACGACTTGGCTCAGCTGGAGGAGATGCTACAGATGTATCCAAAGTCTAC TCCAAAGTTGATCGCTTTCGAGTCTGTCTACTCTATGGCTGGTTCTGTTGCAGACATCAA CAAGATATGTGACTTAGCTGAGAAATACGGTGCGCTAACTTTCCTAGATGAAGTGCATGC CGTCGGTCTATACGGTCCACATGGTGCCGGTGTCGCTGAGCATTGTGACTTCGAAGCCCA CCGTGTTGCAGGTATCGCCACTCCACCACAAGGTGACAACGGTAGACTCAGAACAGTCAT GGACCGTGTTGATATGATTACCGGTACTTTGGGTAAGTCCTTCGGTACTGTTGGTGGTTA CGTCGCTGCTTCGAGCAAGCTAATCGACTGGGTTAGATCTTACGCCCCAGGTTTCATCTT CACCACTACCTTGCCTCCAGCTGTCATGGCTGGTGCTGCTGAGGCCATCAGATTCCAACG TTCTCACTTGAACTTAAGACAAGATCAACAAAGGCACACCGCATACGTCAAGAAAGGTTT GCATGATCTTGGCATTCCAGTTATTCCTAACCCATCCCACATTGTCCCAGTTCTAATTGG TAACCCAGATTTGGCTAAGCAAGCCTCTGATATTCTAATGGAAAAGCATCGCATCTATGT TCAAGCTATCAACTTCCCAACTGTTTCTCGTGGTACCGAGAGATTGAGAATTACTCCTAC CCCAGGTCACACAAACGATCTATCTGATATTTTGATTGCTGCAGTCGATGATGTCTTTAA TGAATTGCAATTGCCACGTATCAGAGATTGGGAAATGCAAGGTGGTCTATTGGGTGTCGG TGACAAGAACTTCGTCCCAGAACCAAATCTATGGACTGAAGAGCAACTATCTTTCAGCAA CGAGGATTTGAACTCTAACGTCTTTGAGCCAGTTATCGACCAACTTGAAGTTTCCAGTGG TGTCAAATTATAAATGAGAGTATAAATGAAGATAAAAAAAATGCATTTCCATTTTGCATA AATACCGGGCTAGGTTATTTGTTTCAGTGGTTTTTTTCTTTTTCATGATTTTACCTAATA TGCTCTAACCACTTAGTATGTGGTAGGCGCCATTGCCATCAATAAGGTAATCTTAATGGA CACGATACCATGTTTTTTTAACGACTTTTTGAGGGAATATCTAATGATTCTTTATTATAG ATTGTAAAGAATGAATGTTTTTATTTTTTGTTGTTTGTGTTATTTTTCCTAATTCTAATA AGACTATTTTTTA
>CAGL0B02607g.cds ATGTTCCGTCCTGTGTTGAAGGTGAGGCCATCGTTCTCGTACCCATATAGCATTGTTTCT TCGAGGTCAGTGAGACTAGCATCTACTGCCACTGCCAACGCTAACACTGCCGCTGCCACT TCTACAGTTGCTGCCCATGGTACCCAGGAGACGCCCTTTGACTTTGAAGGCCACTTCGAG AGCGAGTTGGCCAAGAAGAGACTGGATAAGTCGTACAGGTACTTCAATAACATCAACAGG CTTGCGAAGGAGTTCCCCTTGGCTCACCGTCAGTTAGAGGACGATAAAGTCACTGTGTGG TGTTCCAACGATTACTTGGCTTTGTCTAAGAACCCCCAAGTCTTGGACGCTATGCGCAAG ACCATCGACAAGTACGGTGCTGGTGCAGGTGGTACCAGAAACATTGCCGGCCACAACATT CCAACCATGAGACTGGAGGCTGAGCTAGCGGCCTTGCACAAGAAGGAAGGTGCTCTAGTG TTCTCCTCTTGTTATGTTGCCAACGATGCCGTCATCTCTCTATTGGGTCAAAAGGTGAAG GACTTGGTGATCTTCTCCGATGAACTAAACCATGCCTCCATGATCGTTGGTATTAAGCAC GCCAACAGACCTAAGCACATCTTCAGACACAACGACTTGGCTCAGCTGGAGGAGATGCTA CAGATGTATCCAAAGTCTACTCCAAAGTTGATCGCTTTCGAGTCTGTCTACTCTATGGCT GGTTCTGTTGCAGACATCAACAAGATATGTGACTTAGCTGAGAAATACGGTGCGCTAACT TTCCTAGATGAAGTGCATGCCGTCGGTCTATACGGTCCACATGGTGCCGGTGTCGCTGAG CATTGTGACTTCGAAGCCCACCGTGTTGCAGGTATCGCCACTCCACCACAAGGTGACAAC GGTAGACTCAGAACAGTCATGGACCGTGTTGATATGATTACCGGTACTTTGGGTAAGTCC TTCGGTACTGTTGGTGGTTACGTCGCTGCTTCGAGCAAGCTAATCGACTGGGTTAGATCT TACGCCCCAGGTTTCATCTTCACCACTACCTTGCCTCCAGCTGTCATGGCTGGTGCTGCT GAGGCCATCAGATTCCAACGTTCTCACTTGAACTTAAGACAAGATCAACAAAGGCACACC GCATACGTCAAGAAAGGTTTGCATGATCTTGGCATTCCAGTTATTCCTAACCCATCCCAC ATTGTCCCAGTTCTAATTGGTAACCCAGATTTGGCTAAGCAAGCCTCTGATATTCTAATG GAAAAGCATCGCATCTATGTTCAAGCTATCAACTTCCCAACTGTTTCTCGTGGTACCGAG AGATTGAGAATTACTCCTACCCCAGGTCACACAAACGATCTATCTGATATTTTGATTGCT GCAGTCGATGATGTCTTTAATGAATTGCAATTGCCACGTATCAGAGATTGGGAAATGCAA GGTGGTCTATTGGGTGTCGGTGACAAGAACTTCGTCCCAGAACCAAATCTATGGACTGAA GAGCAACTATCTTTCAGCAACGAGGATTTGAACTCTAACGTCTTTGAGCCAGTTATCGAC CAACTTGAAGTTTCCAGTGGTGTCAAATTATAA
>CAGL0B02607g.aa MFRPVLKVRPSFSYPYSIVSSRSVRLASTATANANTAAATSTVAAHGTQETPFDFEGHFE SELAKKRLDKSYRYFNNINRLAKEFPLAHRQLEDDKVTVWCSNDYLALSKNPQVLDAMRK TIDKYGAGAGGTRNIAGHNIPTMRLEAELAALHKKEGALVFSSCYVANDAVISLLGQKVK DLVIFSDELNHASMIVGIKHANRPKHIFRHNDLAQLEEMLQMYPKSTPKLIAFESVYSMA GSVADINKICDLAEKYGALTFLDEVHAVGLYGPHGAGVAEHCDFEAHRVAGIATPPQGDN GRLRTVMDRVDMITGTLGKSFGTVGGYVAASSKLIDWVRSYAPGFIFTTTLPPAVMAGAA EAIRFQRSHLNLRQDQQRHTAYVKKGLHDLGIPVIPNPSHIVPVLIGNPDLAKQASDILM EKHRIYVQAINFPTVSRGTERLRITPTPGHTNDLSDILIAAVDDVFNELQLPRIRDWEMQ GGLLGVGDKNFVPEPNLWTEEQLSFSNEDLNSNVFEPVIDQLEVSSGVKL*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/CAGL/CAGL0B02607p