YALI0C06083g
similar to uniprot|Q9Y8A4 Aspergillus oryzae 5-aminolevulinic acid synthase (HEM1p homologue)
Element type: CDS
Element length: 1692 nucleotides, on sense strand of Yali0C: 782180..783871.
Other names:
YALI-CDS1866.1
YALI-IPF6541
Coding sequence: 564 codons.
Database cross references:
EMBL: CR382129
GeneID: 2909236
HOGENOM: Q6CCW0
Homologs and Orthologs
Homologs in protein families: GL3C0100 GL3C0100.N1
Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
Gene3D: G3DSA:3.40.640.10
InterPro: IPR001917
InterPro: IPR004839
InterPro: IPR010961
InterPro: IPR015421
KEGG: yli:YALI0C06083g
PROSITE: PS00599
Pfam: PF00155
RefSeq: XP_501502.1
TIGRFAMs: TIGR01821
UniProtKB/Swiss-Prot: Q6CCW0
UniProtKB: HEM1_YARLI
Sequence data 
>YALI0C06083g.nt CAGCTCGAAAGTAGCGGTTGATGGGTTGGAAAGAGGGTGAATATACGGGCAGAATTCGCA CGAGTCCATTGCAACATTTTGCGTCGTCTCATTGCATCCGTCATCGTCACGGCATGCAAT AAAAACATGGAATCAGGGGCACGCAGGGAAAAAAAAAAAAAAAAAAAAAAACCAAACAAA CACTGAAAAACACTGAAAAACACTGAAAAGTGTGAAAACTGGGAGAAAAAAATACACTGA AGAGCTGCAAAGAGCTGCAAACAACTGCACAAATGCTACAAGTAGTGGATGGAGCGTTGA AGAGAATCTGTTGCCATGTACAATGACGACCGGGGCCCCTTATCCGGTCATCCCTCGTGG AACATTGCTAAACAGCAATATACCTTGGGCCTGATTGTCACACCTTTCTCGGGTTTGGCT TACGTTACTGGCAACTGGCTCTCATCCAGGTCCTCATCACATGCTTGTGGGGTTAGAGTT GTGGGGAGGGCGGTTGAGTAGCAGAGTTTGGAGGTGGAACGTGGCCAATGAGAAAGCTTA CAATGTGAGTAATAATAGGGAATAAGCAGCAGCATCGGCAACAGCGTAGATGTGAGCTAT CTGTAAGCCGTGCCAGGCTAGAGGTGCTACAAGTAGTGGGAGGTAGTCTGGTGTTTCTCG AGATACTGTCGTTCAAGTATCTGTTACAATGGCTCTCTCTATCTCGTCACGTGAACACCA TATATACATGCCCTCTCGTGGTCCAGCAGCGATGACACTACCCGGTCGTCTGTTCGTTTG CGCGGTTGCCACGAGGAAGGGCCAAATAACCTCAATTACTAACAATGCACGTCATTGGTG ACTTTTTTAGTTGGGACAATGATGGTGACGGAGCAGAGTGTGGAAATCACAAAAATAAGG TTTCGCAGCAGCACAGAAAAAGGGCCCAAATTTTACCTCACCCTCCTTCTCATCACCCAA CCCAAACGTCCCAGACGACCCCAGACCAGTCTCGACAACCATGGAATCTCTCGTTCGACA GTCCAAAAAGCTCTGCCCCTACATTGGTCGAACCTCGGCTTCCAGCCTCAAGCAGCTCGG AAACGGGCGACTGACCCAGAAGGCAGGCCAATGCCCCATCATGGGCAAGGCCATGGGAGT GCGTGGCTTCAAGTCCGACGCCGGATCCAACGCCGAGTCCGCCACCGTTGACGTCCATGC CGCCGTGGACACCTCCAAGGGCACCTGCCCCCACGCGGCCCAGTACTCGCCCGTCTACCC TTCGTCTCGGCTCGACAACTACCCGTTCGGAATGACCCAGCGAGGCCTCGGAAAGGTGCC CACCCAGGACGCCCACAATGCCACCACCTTCAACTACGAGTCCTTCTACGAGAACAAAAT CAACGCCAAGCACCAGGACAAGTCATACCGGTACTTCAACAACATCAACCGACTGGCCGC CGAGTTCCCCCGAGCCCACAGAGGCTCCATTGAGGAGGACAAGGTGACCGTCTGGTGCGC AAACGACTACCTCGGCATGGGCCGAAACCCCGTCGTCGTCGACGCCATGCACGAGACTCT GGACAAGTACGGAGCCGGCGCTGGTGGAACCCGGAACATTGCTGGCCACAACCGACATGC CGTCGAGCTCGAGGCCGCTATCGCCGACCTCCACAAGAAGGAGGCTGCTCTGGTCTTCTC CTCGTGCTACGTCGCCAACGACTCCACCCTGTCGTTGCTCGGCCAGGCTCTCCCCAACTG CGTCTACTTCTCCGACGCTTCTAACCACGCCTCCATGATCCACGGAATCCGACACGGAGG GTCCGAGAAGGTTGTGTGGAAGCACAACGACCTCGCTGATCTGGAAGCCAAGCTCGCTCG ATACCCCAAGAGCACTCCCAAGGTGATTGCATTTGAGTCCGTCTACTCCATGTGTGGTTC 
CATTGGCCCTATTGAGGAGATTTGTGACCTGGCCGACAAGTACGGCGCCATCACGTTCCT CGACGAGGTCCACGCCGTTGGTATGTACGGCCCCACCGGTGCCGGTGTTGCCGAGCACCT CGACTTTGAGCACTACCACTCTGGTGCTCAGACCCAGCGACAGCCCATCATGGACCGAGT GGATATCTTCACCGGAACCCTCGGAAAGGCTTACGGATGTGTTGGTGGCTACATTGCTGG CTCTGCCAAGTTTGTTGACATGGTTCGATCTTATGCTCCCGGTTTCATCTTCACCACCAC TCTTCCCCCTGCCACCATGGCCGGTGCTCGAGCTGCTATCAACTACCAGAAGGCTACCAT GAAGGACCGAGTTGCCCAGCAGACTCACACTCGATACGTCAAGGACAAGCTGGCCAACCG AGGAATTCCCGTTGTTCCTAACCCCTCTCACATCGTGCCTGTTCTTGTTGGTGACGCCCA GAAGGCAAAGGCCGCCTCCGATCTTCTGCTGACCAAGCACCAGATCTATGTGCAGGCTAT CAACTTCCCCACTGTTCCTATTGGCCAGGAGCGACTGCGTGTGACCCCCACCCCCGGTCA CCATGAGGGACTCTGTGACGAGCTGGTTGCTGCCCTGGAAGACGTGTGGCAGGAGCTTGA TCTTAAGCGAGTTGAGGACTGGACTGCCGAGGGCGGTCTGTGTGGCGTTGGCGAGGGTGT TGAGGTTGAGCCTCTGTGGAGCGAGGAGCAGCTGTCTTACGGACGAGACTAAGTCTGTGC TCGCTTCATGAACAATGAGAGTGTGTGCTTTGACTAGTTTCGCACTCTCACTTTACTTCA CACACCCAAAGTGCACTTCAAATTAGTGCGGTGCACTCGAACTCCCCACAAAATCAACTC TTCAAAAACAAAACATGTAATTGTAAAGCATTTGGCTGGACGCCAGTGGTTTAGAACCAA ACGAGCGTAGATGGTGGAGCGACGAACTTGTAGGGCATGCTGCGACGTGAGTCGTTCCTG ACACGTGAGCCGTTCCTGAGTGAGTCGTCCCTGAGTCGTGCGTGTGTGAAGC
>YALI0C06083g.cds ATGGAATCTCTCGTTCGACAGTCCAAAAAGCTCTGCCCCTACATTGGTCGAACCTCGGCT TCCAGCCTCAAGCAGCTCGGAAACGGGCGACTGACCCAGAAGGCAGGCCAATGCCCCATC ATGGGCAAGGCCATGGGAGTGCGTGGCTTCAAGTCCGACGCCGGATCCAACGCCGAGTCC GCCACCGTTGACGTCCATGCCGCCGTGGACACCTCCAAGGGCACCTGCCCCCACGCGGCC CAGTACTCGCCCGTCTACCCTTCGTCTCGGCTCGACAACTACCCGTTCGGAATGACCCAG CGAGGCCTCGGAAAGGTGCCCACCCAGGACGCCCACAATGCCACCACCTTCAACTACGAG TCCTTCTACGAGAACAAAATCAACGCCAAGCACCAGGACAAGTCATACCGGTACTTCAAC AACATCAACCGACTGGCCGCCGAGTTCCCCCGAGCCCACAGAGGCTCCATTGAGGAGGAC AAGGTGACCGTCTGGTGCGCAAACGACTACCTCGGCATGGGCCGAAACCCCGTCGTCGTC GACGCCATGCACGAGACTCTGGACAAGTACGGAGCCGGCGCTGGTGGAACCCGGAACATT GCTGGCCACAACCGACATGCCGTCGAGCTCGAGGCCGCTATCGCCGACCTCCACAAGAAG GAGGCTGCTCTGGTCTTCTCCTCGTGCTACGTCGCCAACGACTCCACCCTGTCGTTGCTC GGCCAGGCTCTCCCCAACTGCGTCTACTTCTCCGACGCTTCTAACCACGCCTCCATGATC CACGGAATCCGACACGGAGGGTCCGAGAAGGTTGTGTGGAAGCACAACGACCTCGCTGAT CTGGAAGCCAAGCTCGCTCGATACCCCAAGAGCACTCCCAAGGTGATTGCATTTGAGTCC GTCTACTCCATGTGTGGTTCCATTGGCCCTATTGAGGAGATTTGTGACCTGGCCGACAAG TACGGCGCCATCACGTTCCTCGACGAGGTCCACGCCGTTGGTATGTACGGCCCCACCGGT GCCGGTGTTGCCGAGCACCTCGACTTTGAGCACTACCACTCTGGTGCTCAGACCCAGCGA CAGCCCATCATGGACCGAGTGGATATCTTCACCGGAACCCTCGGAAAGGCTTACGGATGT GTTGGTGGCTACATTGCTGGCTCTGCCAAGTTTGTTGACATGGTTCGATCTTATGCTCCC GGTTTCATCTTCACCACCACTCTTCCCCCTGCCACCATGGCCGGTGCTCGAGCTGCTATC AACTACCAGAAGGCTACCATGAAGGACCGAGTTGCCCAGCAGACTCACACTCGATACGTC AAGGACAAGCTGGCCAACCGAGGAATTCCCGTTGTTCCTAACCCCTCTCACATCGTGCCT GTTCTTGTTGGTGACGCCCAGAAGGCAAAGGCCGCCTCCGATCTTCTGCTGACCAAGCAC CAGATCTATGTGCAGGCTATCAACTTCCCCACTGTTCCTATTGGCCAGGAGCGACTGCGT GTGACCCCCACCCCCGGTCACCATGAGGGACTCTGTGACGAGCTGGTTGCTGCCCTGGAA GACGTGTGGCAGGAGCTTGATCTTAAGCGAGTTGAGGACTGGACTGCCGAGGGCGGTCTG TGTGGCGTTGGCGAGGGTGTTGAGGTTGAGCCTCTGTGGAGCGAGGAGCAGCTGTCTTAC GGACGAGACTAA
>YALI0C06083g.aa MESLVRQSKKLCPYIGRTSASSLKQLGNGRLTQKAGQCPIMGKAMGVRGFKSDAGSNAES ATVDVHAAVDTSKGTCPHAAQYSPVYPSSRLDNYPFGMTQRGLGKVPTQDAHNATTFNYE SFYENKINAKHQDKSYRYFNNINRLAAEFPRAHRGSIEEDKVTVWCANDYLGMGRNPVVV DAMHETLDKYGAGAGGTRNIAGHNRHAVELEAAIADLHKKEAALVFSSCYVANDSTLSLL GQALPNCVYFSDASNHASMIHGIRHGGSEKVVWKHNDLADLEAKLARYPKSTPKVIAFES VYSMCGSIGPIEEICDLADKYGAITFLDEVHAVGMYGPTGAGVAEHLDFEHYHSGAQTQR QPIMDRVDIFTGTLGKAYGCVGGYIAGSAKFVDMVRSYAPGFIFTTTLPPATMAGARAAI NYQKATMKDRVAQQTHTRYVKDKLANRGIPVVPNPSHIVPVLVGDAQKAKAASDLLLTKH QIYVQAINFPTVPIGQERLRVTPTPGHHEGLCDELVAALEDVWQELDLKRVEDWTAEGGL CGVGEGVEVEPLWSEEQLSYGRD*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
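As a quick check of this convention against the figures on this page, the sketch below derives the element length, the codon count and the protein length from the coordinates. The function name `cds_lengths` is hypothetical, and the code assumes an intron-free element with inclusive, 1-based coordinates.

```python
def cds_lengths(start, end):
    """Return (nucleotides, codons, residues) for an intron-free CDS.

    Assumes inclusive, 1-based coordinates and that the stop codon is
    counted as a codon, as described in the legend.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = nucleotides // 3
    residues = codons - 1  # the stop codon encodes no amino acid
    return nucleotides, codons, residues


# Coordinates of YALI0C06083g on Yali0C, taken from this page.
print(cds_lengths(782180, 783871))  # → (1692, 564, 563)
```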
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, so that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
The Genemark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the protein-coding potential of the open reading frame appears on frame #3.
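The extent of that window can be worked out from the element coordinates. The sketch below is only an illustration: `genemark_window` is a hypothetical helper, it assumes a sense-strand element with inclusive, 1-based coordinates, and it clips only at the start of the chromosome (the chromosome length is not given on this page).

```python
def genemark_window(orf_start, orf_end, upstream=1000, downstream=300):
    """Return (window_start, window_end, orf_offset) for a sense-strand ORF.

    Coordinates are inclusive and 1-based. orf_offset is the 0-based index
    of the first ORF nucleotide inside the window sequence.
    """
    window_start = max(1, orf_start - upstream)  # do not run off the chromosome start
    window_end = orf_end + downstream            # chromosome end is not checked here
    orf_offset = orf_start - window_start
    return window_start, window_end, orf_offset


# Window around YALI0C06083g (Yali0C: 782180..783871).
start, end, offset = genemark_window(782180, 783871)
print(start, end, offset, end - start + 1)  # → 781180 784171 1000 2992
```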
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/YALI/YALI0C06083p