KLLA0C15059g


uniprot|P78698 Kluyveromyces lactis KLLA0C15059g HEM1 5- aminolevulinate synthase mitochondrial precursor

Genomic environment map

Element type: CDS
Element length: 1713 nucleotides,
on sense strand of
Klla0C: 1311550..1313262.
Other names:
KLLA-ORF7215
Coding sequence: 571 codons.
Database cross references:
EMBL: CR382123
EMBL: X92944
GeneID: 2892616
HOGENOM: P78698

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0100 GL3C0100.F2 GL3C0100.N2
Orthologs by synteny: ZYRO0A10032g SAKL0H12364g KLTH0E10538g ERGO0B02486g

Protein KLLA0C15059p  


Protein domain map

Protein length: 570 amino acids
Protein family: GL3C0100
Database cross references:
Gene3D: G3DSA:3.40.640.10
InterPro: IPR001917
InterPro: IPR004839
InterPro: IPR010961
InterPro: IPR015421
KEGG: kla:KLLA0C15059g
PROSITE: PS00599
Pfam: PF00155
RefSeq: XP_452875.1
TIGRFAMs: TIGR01821
UniProtKB/Swiss-Prot: P78698
UniprotKB: HEM1_KLULA

Computed results for KLLA0C15059p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0005759 mitochondrial matrix

Sequence data  


Nucleotide sequence    

>KLLA0C15059g.nt
TTAGCCATGGTAGCACTTAGTGGCTGGTACCTTACCGGTGCTGAGGGGATATTGAAACCG
AACGACGTAGAGGCCTTCAAGTGGGTGTCACGTGCTTCGAAATTATCTGACGGGAAATTA
CCAAGAGCAGAGTATGCATTAGCATTTTACTTGGAGAAAGGCCTTGGATGTCAGCCAAAC
ATACAAGAGGCCAAGGTACACCACGAAACTGCTGCTCGTCTTGGCCATCCGAAAGCCATC
GAGGCCTTGAGGAAAATGTGACCTTGCGGTTTTATTTTTACTGTCCTGCGTAGCTAGCTT
CATTGTAATGATATGTAATAATAAGATTTGGTTAGAGTAGTTAGTTATTGGTATTCTCTA
GAAGTTAAATTTCTGGCGGTGGTTGGTGTCGCGAAGCGACACCAACATAGCTTAAACTCG
GACTTGCTCCCGCTAAGCGAACCATGCTCATCCAGGCCAATCCCCAGCTTCAACTGCCAG
GTCCAGGGCCAGGGCCAGCATTACTGCCACAGCTCTTCGCTTCGCTCACATCATCCTCTA
CTCTTATTGGCTGAAAACCTCCAAAAAGAAAAGCTCCTAAGTCCAGCATTTCTTTTTTTC
CCTTTTCCCTTCCCTGTCTTCCTACAATATCTTCTGCGGAACTGGCAATGTCGGAATCAT
TTCAAAGGTAAAATCCTCTCATATTGCCTGAAAATGAAAATGACTGAGGAAAAAATGACT
GAAAACTCAAAAAAAGATATAATATGATATGTTTTCATAATTATATAATAAGAGATGAGC
TTCAGGTTTAACTTAAGATACACCTTTGGTTTTGGTTCAGTGCTTCTCTAATCTAGAGTA
AAATATTCTTTGATACACCAGGTTTCACTTTCGTCACTGAAGAGAAAAATTTAAAAAGAT
TCTACCTGTTAGAAGTGGCTATATCTTTGCGAGTGAGAAAGCTTCTTTTATTGGTTTCCC
TTGCACACACTTAACCGTCTTCATTTTGGATTGAAAAAACATGGAATCTGTTATTCGTTC
TTCTGCCAAGATCTGTCCATTTATGCACTCTGCCACTGGATCAATGCAGAGTGTCAAGGC
TTTGAAGAATGCGAACTTACCAGCTATCGCTCAACAATGTCCATTTATGGGTAAGGCTAT
GGAACAACGTAGGGGTTATGCTTCTAGTGCTTCTGGAGCCTCTGCTGCCGCTGCCGCTAC
TGCTACTGCAAGCACAAGCGCTTCTAATTCTAATTCATCCGTTGAAGCTTCTGCTTCTGC
AGATGTAGTCGATCATGCCACTAAGGAAGCGTCTTTTGATTACCAAGGTTTGTTCGATTC
TGATTTGGCTAAGAAGAGAATGGATAAGTCTTACAGGTTCTTCAACAATATCAACCGTTT
GGCTAAGGAGTTCCCAATGGCTCATAGAAAGCTAGAAGATGACAAGGTTACTGTTTGGTG
TTCTAATGATTATTTGGCCTTATCTAAGAACCAAGAGGTCATTGAAGTGATGAAAAAGAC
ATTGGATAAGTACGGTGCTGGTGCTGGTGGTACCAGAAATATTGCTGGTCATAATAAACA
CGCGTTGCAATTGGAAGCTGAATTGGCTACTTTACACAAGAAGGAAGGTGCCTTGGTTTT
CTCCTCTTGTTTTGTTGCCAACGACGCTGTCATCTCATTGTTAGGTCAAAAGATCAAGGA
CTTGGTCATTTTCTCTGACGAATTAAACCACGCTTCTATGATTGTGGGTATCAAACATGC
TTCGACCAAGAAGCACATTTTCAAGCATAACAACTTGGACCAATTGGAAGAGCTGTTGGC
TATGTATCCAAAATCTACTCCAAAATTGATTGCATTCGAATCCGTTTACTCCATGTCTGG
TTCCGTTGCTGATATTGATAAGATCTGTGATTTGGCTGAAAAGTACGGCGCTTTGACTTT
CTTAGATGAAGTTCACGCTGTTGGTTTGTATGGTCCACATGGTGCAGGTGTCGCTGAACA
TTGCAACTTTGATGCTCACCGCAAGGCTGGTATTGCTTCTCCTGAATTCCGCACCGTTAT
GGATCGTGTTGATATGATCACTGGTACCTTGGGTAAATCTTTCGGTACTGTTGGTGGTTA
CGTTGCTGGTTCTTTGCAGCTAATTGACTGGGTGAGATCTTATGCTCCTGGTTTCATCTT
CACCACTACTTTACCACCTGCTGTCATGGCAGGTGCTGCTGAGGCTATCAGATACCAACG
TTCTCATTTGGACTTGAGACAAGACCAACAAAGACATACAACCTATGTTAAAGACGGTTT
AGCTGATTTGGGTATTCCAGTGATGCCAAACCCATCTCATATTGTTCCAGTTTTGGTTGG
TAACCCTCACTTGGCCAAACAAGCATCTGATATCTTGATGGACAAGCATCGTATTTACGT
CCAAGCCATCAACTTCCCAACTGTCGCTAGAGGTACCGAAAGGTTGAGAATTACCCCAAC
TCCGGGCCACACTAACGATCTATCTGACATCTTAATGGATGCTTTGGAAGATGTCTGGTC
CACTCTACAATTACCAAGGGTACGTGACTGGGAAGCCCAAGGCGGTTTGTTGGGTGTTGG
TGATCCAAACCACGTTCCCCAACCAAACTTATGGACGAAGGATCAACTGACTTTGACCAA
TAATGATTTGCATCCAAATGTTAAACAGCCAATCATCGAACAATTAGAAGTCTCTTCTGG
TATTAGATACTAGTCGAACTTGCTTAGACGACACATACTTTGAACCGTCTATTCGGAGGT
ACAATTTTGATCTTTTTACAACCCCCACCCCCACCCCCTCCCCCCTTTGGAGAAAGAATA
CTGATACTATTATATTATTATTCCTATTTTTTTTCATGATTATGAACAATCATGACTATT
ATATTCTAATTGTTTACTTTCTTTCTAAGAGGCTTAACAGAAACGAAAGATCGCCATCTA
CAAGGAATCAATAAAAGAAAAAAATATGAAAAAAAAGTTGTCACATTTAATGATTTATTC
TACAAAATATTCA

Coding sequence    

>KLLA0C15059g.cds
ATGGAATCTGTTATTCGTTCTTCTGCCAAGATCTGTCCATTTATGCACTCTGCCACTGGA
TCAATGCAGAGTGTCAAGGCTTTGAAGAATGCGAACTTACCAGCTATCGCTCAACAATGT
CCATTTATGGGTAAGGCTATGGAACAACGTAGGGGTTATGCTTCTAGTGCTTCTGGAGCC
TCTGCTGCCGCTGCCGCTACTGCTACTGCAAGCACAAGCGCTTCTAATTCTAATTCATCC
GTTGAAGCTTCTGCTTCTGCAGATGTAGTCGATCATGCCACTAAGGAAGCGTCTTTTGAT
TACCAAGGTTTGTTCGATTCTGATTTGGCTAAGAAGAGAATGGATAAGTCTTACAGGTTC
TTCAACAATATCAACCGTTTGGCTAAGGAGTTCCCAATGGCTCATAGAAAGCTAGAAGAT
GACAAGGTTACTGTTTGGTGTTCTAATGATTATTTGGCCTTATCTAAGAACCAAGAGGTC
ATTGAAGTGATGAAAAAGACATTGGATAAGTACGGTGCTGGTGCTGGTGGTACCAGAAAT
ATTGCTGGTCATAATAAACACGCGTTGCAATTGGAAGCTGAATTGGCTACTTTACACAAG
AAGGAAGGTGCCTTGGTTTTCTCCTCTTGTTTTGTTGCCAACGACGCTGTCATCTCATTG
TTAGGTCAAAAGATCAAGGACTTGGTCATTTTCTCTGACGAATTAAACCACGCTTCTATG
ATTGTGGGTATCAAACATGCTTCGACCAAGAAGCACATTTTCAAGCATAACAACTTGGAC
CAATTGGAAGAGCTGTTGGCTATGTATCCAAAATCTACTCCAAAATTGATTGCATTCGAA
TCCGTTTACTCCATGTCTGGTTCCGTTGCTGATATTGATAAGATCTGTGATTTGGCTGAA
AAGTACGGCGCTTTGACTTTCTTAGATGAAGTTCACGCTGTTGGTTTGTATGGTCCACAT
GGTGCAGGTGTCGCTGAACATTGCAACTTTGATGCTCACCGCAAGGCTGGTATTGCTTCT
CCTGAATTCCGCACCGTTATGGATCGTGTTGATATGATCACTGGTACCTTGGGTAAATCT
TTCGGTACTGTTGGTGGTTACGTTGCTGGTTCTTTGCAGCTAATTGACTGGGTGAGATCT
TATGCTCCTGGTTTCATCTTCACCACTACTTTACCACCTGCTGTCATGGCAGGTGCTGCT
GAGGCTATCAGATACCAACGTTCTCATTTGGACTTGAGACAAGACCAACAAAGACATACA
ACCTATGTTAAAGACGGTTTAGCTGATTTGGGTATTCCAGTGATGCCAAACCCATCTCAT
ATTGTTCCAGTTTTGGTTGGTAACCCTCACTTGGCCAAACAAGCATCTGATATCTTGATG
GACAAGCATCGTATTTACGTCCAAGCCATCAACTTCCCAACTGTCGCTAGAGGTACCGAA
AGGTTGAGAATTACCCCAACTCCGGGCCACACTAACGATCTATCTGACATCTTAATGGAT
GCTTTGGAAGATGTCTGGTCCACTCTACAATTACCAAGGGTACGTGACTGGGAAGCCCAA
GGCGGTTTGTTGGGTGTTGGTGATCCAAACCACGTTCCCCAACCAAACTTATGGACGAAG
GATCAACTGACTTTGACCAATAATGATTTGCATCCAAATGTTAAACAGCCAATCATCGAA
CAATTAGAAGTCTCTTCTGGTATTAGATACTAG

Predicted translation product    

>KLLA0C15059g.aa
MESVIRSSAKICPFMHSATGSMQSVKALKNANLPAIAQQCPFMGKAMEQRRGYASSASGA
SAAAAATATASTSASNSNSSVEASASADVVDHATKEASFDYQGLFDSDLAKKRMDKSYRF
FNNINRLAKEFPMAHRKLEDDKVTVWCSNDYLALSKNQEVIEVMKKTLDKYGAGAGGTRN
IAGHNKHALQLEAELATLHKKEGALVFSSCFVANDAVISLLGQKIKDLVIFSDELNHASM
IVGIKHASTKKHIFKHNNLDQLEELLAMYPKSTPKLIAFESVYSMSGSVADIDKICDLAE
KYGALTFLDEVHAVGLYGPHGAGVAEHCNFDAHRKAGIASPEFRTVMDRVDMITGTLGKS
FGTVGGYVAGSLQLIDWVRSYAPGFIFTTTLPPAVMAGAAEAIRYQRSHLDLRQDQQRHT
TYVKDGLADLGIPVMPNPSHIVPVLVGNPHLAKQASDILMDKHRIYVQAINFPTVARGTE
RLRITPTPGHTNDLSDILMDALEDVWSTLQLPRVRDWEAQGGLLGVGDPNHVPQPNLWTK
DQLTLTNNDLHPNVKQPIIEQLEVSSGIRY*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites