YALI0C06083g


similar to uniprot|Q9Y8A4 Aspergillus oryzae 5-aminolevulinic acid synthase (HEM1p homologue)

Genomic environment map

Element type: CDS
Element length: 1692 nucleotides, on sense strand of Yali0C: 782180..783871.
Other names:
YALI-CDS1866.1
YALI-IPF6541
Coding sequence: 564 codons.
Database cross references:
EMBL: CR382129
GeneID: 2909236
HOGENOM: Q6CCW0

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0100 GL3C0100.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0C06083p  


Protein domain map

Protein length: 563 amino acids
Protein family: GL3C0100
Database cross references:
Gene3D: G3DSA:3.40.640.10
InterPro: IPR001917
InterPro: IPR004839
InterPro: IPR010961
InterPro: IPR015421
KEGG: yli:YALI0C06083g
PROSITE: PS00599
Pfam: PF00155
RefSeq: XP_501502.1
TIGRFAMs: TIGR01821
UniProtKB/Swiss-Prot: Q6CCW0
UniProtKB: HEM1_YARLI

Computed results for YALI0C06083p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0005759 mitochondrial matrix

Sequence data  


Nucleotide sequence    

>YALI0C06083g.nt
CAGCTCGAAAGTAGCGGTTGATGGGTTGGAAAGAGGGTGAATATACGGGCAGAATTCGCA
CGAGTCCATTGCAACATTTTGCGTCGTCTCATTGCATCCGTCATCGTCACGGCATGCAAT
AAAAACATGGAATCAGGGGCACGCAGGGAAAAAAAAAAAAAAAAAAAAAAACCAAACAAA
CACTGAAAAACACTGAAAAACACTGAAAAGTGTGAAAACTGGGAGAAAAAAATACACTGA
AGAGCTGCAAAGAGCTGCAAACAACTGCACAAATGCTACAAGTAGTGGATGGAGCGTTGA
AGAGAATCTGTTGCCATGTACAATGACGACCGGGGCCCCTTATCCGGTCATCCCTCGTGG
AACATTGCTAAACAGCAATATACCTTGGGCCTGATTGTCACACCTTTCTCGGGTTTGGCT
TACGTTACTGGCAACTGGCTCTCATCCAGGTCCTCATCACATGCTTGTGGGGTTAGAGTT
GTGGGGAGGGCGGTTGAGTAGCAGAGTTTGGAGGTGGAACGTGGCCAATGAGAAAGCTTA
CAATGTGAGTAATAATAGGGAATAAGCAGCAGCATCGGCAACAGCGTAGATGTGAGCTAT
CTGTAAGCCGTGCCAGGCTAGAGGTGCTACAAGTAGTGGGAGGTAGTCTGGTGTTTCTCG
AGATACTGTCGTTCAAGTATCTGTTACAATGGCTCTCTCTATCTCGTCACGTGAACACCA
TATATACATGCCCTCTCGTGGTCCAGCAGCGATGACACTACCCGGTCGTCTGTTCGTTTG
CGCGGTTGCCACGAGGAAGGGCCAAATAACCTCAATTACTAACAATGCACGTCATTGGTG
ACTTTTTTAGTTGGGACAATGATGGTGACGGAGCAGAGTGTGGAAATCACAAAAATAAGG
TTTCGCAGCAGCACAGAAAAAGGGCCCAAATTTTACCTCACCCTCCTTCTCATCACCCAA
CCCAAACGTCCCAGACGACCCCAGACCAGTCTCGACAACCATGGAATCTCTCGTTCGACA
GTCCAAAAAGCTCTGCCCCTACATTGGTCGAACCTCGGCTTCCAGCCTCAAGCAGCTCGG
AAACGGGCGACTGACCCAGAAGGCAGGCCAATGCCCCATCATGGGCAAGGCCATGGGAGT
GCGTGGCTTCAAGTCCGACGCCGGATCCAACGCCGAGTCCGCCACCGTTGACGTCCATGC
CGCCGTGGACACCTCCAAGGGCACCTGCCCCCACGCGGCCCAGTACTCGCCCGTCTACCC
TTCGTCTCGGCTCGACAACTACCCGTTCGGAATGACCCAGCGAGGCCTCGGAAAGGTGCC
CACCCAGGACGCCCACAATGCCACCACCTTCAACTACGAGTCCTTCTACGAGAACAAAAT
CAACGCCAAGCACCAGGACAAGTCATACCGGTACTTCAACAACATCAACCGACTGGCCGC
CGAGTTCCCCCGAGCCCACAGAGGCTCCATTGAGGAGGACAAGGTGACCGTCTGGTGCGC
AAACGACTACCTCGGCATGGGCCGAAACCCCGTCGTCGTCGACGCCATGCACGAGACTCT
GGACAAGTACGGAGCCGGCGCTGGTGGAACCCGGAACATTGCTGGCCACAACCGACATGC
CGTCGAGCTCGAGGCCGCTATCGCCGACCTCCACAAGAAGGAGGCTGCTCTGGTCTTCTC
CTCGTGCTACGTCGCCAACGACTCCACCCTGTCGTTGCTCGGCCAGGCTCTCCCCAACTG
CGTCTACTTCTCCGACGCTTCTAACCACGCCTCCATGATCCACGGAATCCGACACGGAGG
GTCCGAGAAGGTTGTGTGGAAGCACAACGACCTCGCTGATCTGGAAGCCAAGCTCGCTCG
ATACCCCAAGAGCACTCCCAAGGTGATTGCATTTGAGTCCGTCTACTCCATGTGTGGTTC
CATTGGCCCTATTGAGGAGATTTGTGACCTGGCCGACAAGTACGGCGCCATCACGTTCCT
CGACGAGGTCCACGCCGTTGGTATGTACGGCCCCACCGGTGCCGGTGTTGCCGAGCACCT
CGACTTTGAGCACTACCACTCTGGTGCTCAGACCCAGCGACAGCCCATCATGGACCGAGT
GGATATCTTCACCGGAACCCTCGGAAAGGCTTACGGATGTGTTGGTGGCTACATTGCTGG
CTCTGCCAAGTTTGTTGACATGGTTCGATCTTATGCTCCCGGTTTCATCTTCACCACCAC
TCTTCCCCCTGCCACCATGGCCGGTGCTCGAGCTGCTATCAACTACCAGAAGGCTACCAT
GAAGGACCGAGTTGCCCAGCAGACTCACACTCGATACGTCAAGGACAAGCTGGCCAACCG
AGGAATTCCCGTTGTTCCTAACCCCTCTCACATCGTGCCTGTTCTTGTTGGTGACGCCCA
GAAGGCAAAGGCCGCCTCCGATCTTCTGCTGACCAAGCACCAGATCTATGTGCAGGCTAT
CAACTTCCCCACTGTTCCTATTGGCCAGGAGCGACTGCGTGTGACCCCCACCCCCGGTCA
CCATGAGGGACTCTGTGACGAGCTGGTTGCTGCCCTGGAAGACGTGTGGCAGGAGCTTGA
TCTTAAGCGAGTTGAGGACTGGACTGCCGAGGGCGGTCTGTGTGGCGTTGGCGAGGGTGT
TGAGGTTGAGCCTCTGTGGAGCGAGGAGCAGCTGTCTTACGGACGAGACTAAGTCTGTGC
TCGCTTCATGAACAATGAGAGTGTGTGCTTTGACTAGTTTCGCACTCTCACTTTACTTCA
CACACCCAAAGTGCACTTCAAATTAGTGCGGTGCACTCGAACTCCCCACAAAATCAACTC
TTCAAAAACAAAACATGTAATTGTAAAGCATTTGGCTGGACGCCAGTGGTTTAGAACCAA
ACGAGCGTAGATGGTGGAGCGACGAACTTGTAGGGCATGCTGCGACGTGAGTCGTTCCTG
ACACGTGAGCCGTTCCTGAGTGAGTCGTCCCTGAGTCGTGCGTGTGTGAAGC

Coding sequence    

>YALI0C06083g.cds
ATGGAATCTCTCGTTCGACAGTCCAAAAAGCTCTGCCCCTACATTGGTCGAACCTCGGCT
TCCAGCCTCAAGCAGCTCGGAAACGGGCGACTGACCCAGAAGGCAGGCCAATGCCCCATC
ATGGGCAAGGCCATGGGAGTGCGTGGCTTCAAGTCCGACGCCGGATCCAACGCCGAGTCC
GCCACCGTTGACGTCCATGCCGCCGTGGACACCTCCAAGGGCACCTGCCCCCACGCGGCC
CAGTACTCGCCCGTCTACCCTTCGTCTCGGCTCGACAACTACCCGTTCGGAATGACCCAG
CGAGGCCTCGGAAAGGTGCCCACCCAGGACGCCCACAATGCCACCACCTTCAACTACGAG
TCCTTCTACGAGAACAAAATCAACGCCAAGCACCAGGACAAGTCATACCGGTACTTCAAC
AACATCAACCGACTGGCCGCCGAGTTCCCCCGAGCCCACAGAGGCTCCATTGAGGAGGAC
AAGGTGACCGTCTGGTGCGCAAACGACTACCTCGGCATGGGCCGAAACCCCGTCGTCGTC
GACGCCATGCACGAGACTCTGGACAAGTACGGAGCCGGCGCTGGTGGAACCCGGAACATT
GCTGGCCACAACCGACATGCCGTCGAGCTCGAGGCCGCTATCGCCGACCTCCACAAGAAG
GAGGCTGCTCTGGTCTTCTCCTCGTGCTACGTCGCCAACGACTCCACCCTGTCGTTGCTC
GGCCAGGCTCTCCCCAACTGCGTCTACTTCTCCGACGCTTCTAACCACGCCTCCATGATC
CACGGAATCCGACACGGAGGGTCCGAGAAGGTTGTGTGGAAGCACAACGACCTCGCTGAT
CTGGAAGCCAAGCTCGCTCGATACCCCAAGAGCACTCCCAAGGTGATTGCATTTGAGTCC
GTCTACTCCATGTGTGGTTCCATTGGCCCTATTGAGGAGATTTGTGACCTGGCCGACAAG
TACGGCGCCATCACGTTCCTCGACGAGGTCCACGCCGTTGGTATGTACGGCCCCACCGGT
GCCGGTGTTGCCGAGCACCTCGACTTTGAGCACTACCACTCTGGTGCTCAGACCCAGCGA
CAGCCCATCATGGACCGAGTGGATATCTTCACCGGAACCCTCGGAAAGGCTTACGGATGT
GTTGGTGGCTACATTGCTGGCTCTGCCAAGTTTGTTGACATGGTTCGATCTTATGCTCCC
GGTTTCATCTTCACCACCACTCTTCCCCCTGCCACCATGGCCGGTGCTCGAGCTGCTATC
AACTACCAGAAGGCTACCATGAAGGACCGAGTTGCCCAGCAGACTCACACTCGATACGTC
AAGGACAAGCTGGCCAACCGAGGAATTCCCGTTGTTCCTAACCCCTCTCACATCGTGCCT
GTTCTTGTTGGTGACGCCCAGAAGGCAAAGGCCGCCTCCGATCTTCTGCTGACCAAGCAC
CAGATCTATGTGCAGGCTATCAACTTCCCCACTGTTCCTATTGGCCAGGAGCGACTGCGT
GTGACCCCCACCCCCGGTCACCATGAGGGACTCTGTGACGAGCTGGTTGCTGCCCTGGAA
GACGTGTGGCAGGAGCTTGATCTTAAGCGAGTTGAGGACTGGACTGCCGAGGGCGGTCTG
TGTGGCGTTGGCGAGGGTGTTGAGGTTGAGCCTCTGTGGAGCGAGGAGCAGCTGTCTTAC
GGACGAGACTAA

Predicted translation product    

>YALI0C06083g.aa
MESLVRQSKKLCPYIGRTSASSLKQLGNGRLTQKAGQCPIMGKAMGVRGFKSDAGSNAES
ATVDVHAAVDTSKGTCPHAAQYSPVYPSSRLDNYPFGMTQRGLGKVPTQDAHNATTFNYE
SFYENKINAKHQDKSYRYFNNINRLAAEFPRAHRGSIEEDKVTVWCANDYLGMGRNPVVV
DAMHETLDKYGAGAGGTRNIAGHNRHAVELEAAIADLHKKEAALVFSSCYVANDSTLSLL
GQALPNCVYFSDASNHASMIHGIRHGGSEKVVWKHNDLADLEAKLARYPKSTPKVIAFES
VYSMCGSIGPIEEICDLADKYGAITFLDEVHAVGMYGPTGAGVAEHLDFEHYHSGAQTQR
QPIMDRVDIFTGTLGKAYGCVGGYIAGSAKFVDMVRSYAPGFIFTTTLPPATMAGARAAI
NYQKATMKDRVAQQTHTRYVKDKLANRGIPVVPNPSHIVPVLVGDAQKAKAASDLLLTKH
QIYVQAINFPTVPIGQERLRVTPTPGHHEGLCDELVAALEDVWQELDLKRVEDWTAEGGL
CGVGEGVEVEPLWSEEQLSYGRD*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
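The length arithmetic on this page can be checked with a minimal Python sketch. The function names are illustrative and not part of the database; the coordinates and lengths are the ones listed for this element.

```python
def element_length(start: int, end: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end - start + 1


def codon_count(nucleotides: int) -> int:
    """Number of codons in a coding sequence, stop codon included."""
    if nucleotides % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return nucleotides // 3


def protein_length(codons: int) -> int:
    """Protein length: the stop codon does not encode an amino acid."""
    return codons - 1


# Values taken from this record: Yali0C: 782180..783871
nt = element_length(782180, 783871)
codons = codon_count(nt)
aa = protein_length(codons)
print(nt, codons, aa)  # 1692 564 563
```

The printed values match the element length (1692 nucleotides), the coding sequence length (564 codons) and the protein length (563 amino acids) given above.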

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarily chosen so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
The Genemark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the protein-coding potential of the open reading frame appears on frame #3.
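As a minimal sketch, a sequence carrying such flanks can be split into its upstream, open reading frame and downstream parts as follows. The function name is hypothetical, and the sketch assumes the sequence being split uses exactly the flank sizes stated above (1000 and 300 nucleotides).

```python
UPSTREAM = 1000    # nucleotides before the open reading frame (from the note above)
DOWNSTREAM = 300   # nucleotides after the open reading frame (from the note above)


def split_flanked(seq: str, upstream: int = UPSTREAM, downstream: int = DOWNSTREAM):
    """Split a flanked sequence into (upstream, ORF, downstream) parts."""
    if len(seq) < upstream + downstream:
        raise ValueError("sequence is shorter than the two flanks combined")
    orf_end = len(seq) - downstream
    return seq[:upstream], seq[upstream:orf_end], seq[orf_end:]


# Toy example with short flanks, for illustration only
up, orf, down = split_flanked("aa" + "ATGAAATAA" + "ttt", upstream=2, downstream=3)
print(up, orf, down)  # aa ATGAAATAA ttt

# Expected window size for this element: both flanks plus the 1692-nucleotide CDS
print(UPSTREAM + 1692 + DOWNSTREAM)  # 2992
```

With the default flank sizes, the middle part returned for a 2992-nucleotide window would be 1692 nucleotides long, the element length listed for this gene.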

Sequences
Color | Nucleotide sequence and Coding sequence | Predicted translation product
RED | start and stop codons | Initial methionine and sequence end
BLUE | coding sequence | protein sequence
grey | non-coding sequence (upstream, downstream or intron) |
grey | donor and acceptor splicing sites |