KLLA0E10561g
highly similar to uniprot|P32316 Saccharomyces cerevisiae YBL015W ACH1 Acetyl-coA hydrolase, primarily localized to mitochondria
Element type: CDS
Element length: 1572 nucleotides,
on sense strand of
Klla0E: 932415..933986.
Other names:
KLLA-ORF4043
KLLA0E10549g
Coding sequence: 524 codons.
Element length: 1572 nucleotides,
on sense strand of
Klla0E: 932415..933986.
Other names:
KLLA-ORF4043
KLLA0E10549g
Coding sequence: 524 codons.
Homologs and Orthologs
Homologs in protein families: GL3R1841 GL3R1841.F1 GL3R1841.N1Orthologs by synteny: ZYRO0C13134g SAKL0H19690g KLTH0H08338g ERGO0F05984g
Protein KLLA0E10561p 
highly similar to uniprot|P32316 Saccharomyces cerevisiae YBL015W ACH1 Acetyl-coA hydrolase primarily localized to mitochondria required for acetate utilization and for diploid pseudohyphal growth
Protein domain map
Sequence data 
>KLLA0E10561g.nt TCCGGACTTTAACACGCATGACTCTTTCCTTCACTATGGTATGCTCTTCAGTCATACATT GCCACATGCAGTAATTCTTACGACCCGATACGATTCCTTCCCCTCTCTATCCTTGCAAAC TTAGTAAACTGAGTAAGCTGCACCATTAATGTTTCGGGATGTCCGAGCTATTGTGTGATT CGAGTAGCGCACAGCAGTTAGACGCGATTTTTTTTTTCTAACGCGCTGCATTACTTCGCT TAGTATATCCAGCCATTTTTTGTTGTTTTGGTTTTTAGTTTTCGCTTTGATTAACAACGG TTAAACTCCAGTTAGTAGCAGACAGAAATCAACTTGTGTTCTTGCGTTTTTTTGTCCTAT TATTTTAATTTTATAAGACATTATGGTGAACAATTAATGTACAACTGGCATATGATAAAA CTCGGGGTGCAAACGTGACACATACGTACACCCCCAAATCGGCGGGGAAAGTCACTGGGT AAGCCAATGTATGAAAAAAACAGGCGAGTATTTTTGTTTTTTGTTCTGTATTTTTAGCTT AATTTCGAGCGCTCTTATTCTACGTGTATTTCTGCGTTTTTTGAGTCTCCAGAGTTATGT CGTCGTACAAAATTCGTGCGTTTTTTCATTTTTCGTTTTACAGAACTGAAATCATCCGCA TTAAAACCAATGAAAAGCGATTTTTCAACGAGATTTTCTATATAAGCCAGAGAGATTTGG TTCAATATCGATATATAGGAAAAATTGAAATTATGCCACCCCTATGTATTACATGTAAGC TTGAAGAATAAATATTATATATATACATATACTCCACATTCAATTTATATCAAGGTTCAG ACACAATTGAAATTTTCTTTGATAGACAGAGGGAAGAAAGGTGGATAGGTCATTTTTTGT ATCGCTTCTCTGATCTTAATATATATTAGTGTAAAATACAAAAAAGAAAAATAAATTAAG GGAAAATAGCCCATATAACATACCATTAAAGGAATTAAAAATGACAGTTTCCAGATTGTT GAAAGATAGAGTTAGATATGCTCCATATTTGAAGAAAGTTAAGCCAGTAGAAGAGCTGAT TCCATTGTTTAAGGATGGTCAATACATTGGTTGGTCTGGTTTCACCGGTGTCGGTGCTCC AAAAGCTGTTCCAGAAGCTTTGATCAAACATGTTGAGGAAAATAACTTGCAAGGTAAGCT TAGATTCAACCTTTTCGTCGGTGCTTCTGCTGGTCCAGAAGAATGTAAATGGGCCGAACA TGATATGATCTTGAGAAGAGCCCCTCATCAAGTCGGTAAGCCAATCGCCAAGGCTATTAA CGATGGTAGAATCCAATTTTTCGACAAGCATTTGTCCATGTTTCCACAAGATTTGACCTA CGGTTACTATAGCAGGAACAGAACCGATGGTAAGATCTTGGATTACACCATCATTGAAGC AACTGCCATTAAAGAAGATGGGTCTATTGTTCCAGGTCCTTCTGTCGGTGGTTCTCCAGA ATTTATTTCTGTTTCTGATAAAATTATTATCGAAGTTAACACTGCTACTCCATCGTTCGA AGGTCTACACGATATTGATATGCCTGTTAACCCACCATTCAGACAACCATACCCATACAC CGCTGTGGACCAAAAGAACGGTCTTGATTCTATCCCAGTGGACCCTGAACGTGTTGTTGC TGTGGTTGAATCTACTCAAAGGGATGTCGTTGGTCCAAACACTCCATCTGATGCTACATC TCAATCCATCGCTCGTCACTTGGTCGAGTTCTTCGAAAATGAAGTGAGACATGGTAGACT ACCTGAAAACTTGCATCCATTGCAATCCGGTATCGGTAACATCGCTAACGCCGTGATTGA AGGTTTGACTGACTCCTCCTTCAAGAACTTGACTGTGTGGACTGAAGTCTTGCAAGATTC ATTCTTGGATTTGTTCGAAAACGGTGCCTTGGATTACGCCACTGCTACCTCTATCAGATT GACCGAAGCTGGTTTCCAAAAGTTTTTCGATAACTGGGATGATTTCTCAAAGAAGCTATG TTTAAGATCCCAAGTTGTTTCTAACAACCCAGAATTGATCCGTCGTCTAGGTGTTATCGC TATGAACACCCCTGTCGAAGTTGACATTTACGCACACGCTAACTCCACCAACGTCTCTGG TTCTCGTATGTTGAACGGTTTGGGTGGTTCTGCTGATTTCTTGAGAAACGCTAAGTTGTC TATCATGCACGCTCCAGCTGCAAGACCTACAAAGACTGACCCAACTGGTATCTCTACCAT TGTCCCAATGGCTTCTCATGTCGATCAAACTGAACACGATTTGGATGTCTTAGTCACCGA TCAAGGTTTGGCTGACTTAAGAGGTCTATCTCCAAGAGAAAGAGCAAGAGAAATCATTAA GAACTGTGCTCATCCAGATTACCAACCAATCTTAACTGACTACTTGGACAGATCAGAACA TTATGCCAAGTTGCACAAGTGCATGCACGAACCTCACATGTTAAAGAATGCATTCAAATT CCACTTGAACTTGTCTGAAAAGGGTACCATGAAAGTCGATAACTGGGATTAAGCTTGTTT ATCTCGCTAGACAGGTATGATAAGATGAAAACTTGTTGGCATGACATTCATTGTCCGCAA TATGTCCTTTTTTTTTTTCTCTTTAAAAGCCATGCTTCAAGAAACTGCTATTTTGATGAC CCCCAAAAACTTATTACACAATAACATATAGTTCGTTTCGTCTTGAGTACACATTTTTGC CAACGAAACTGAATTTGACATGATATATTCGAAGAACAAACACCATACAGAAGCCTACAT ATGTATTTCTATATTATCTACATAGACGAACTATCTTATGGAACGAAGAAAT
>KLLA0E10561g.cds ATGACAGTTTCCAGATTGTTGAAAGATAGAGTTAGATATGCTCCATATTTGAAGAAAGTT AAGCCAGTAGAAGAGCTGATTCCATTGTTTAAGGATGGTCAATACATTGGTTGGTCTGGT TTCACCGGTGTCGGTGCTCCAAAAGCTGTTCCAGAAGCTTTGATCAAACATGTTGAGGAA AATAACTTGCAAGGTAAGCTTAGATTCAACCTTTTCGTCGGTGCTTCTGCTGGTCCAGAA GAATGTAAATGGGCCGAACATGATATGATCTTGAGAAGAGCCCCTCATCAAGTCGGTAAG CCAATCGCCAAGGCTATTAACGATGGTAGAATCCAATTTTTCGACAAGCATTTGTCCATG TTTCCACAAGATTTGACCTACGGTTACTATAGCAGGAACAGAACCGATGGTAAGATCTTG GATTACACCATCATTGAAGCAACTGCCATTAAAGAAGATGGGTCTATTGTTCCAGGTCCT TCTGTCGGTGGTTCTCCAGAATTTATTTCTGTTTCTGATAAAATTATTATCGAAGTTAAC ACTGCTACTCCATCGTTCGAAGGTCTACACGATATTGATATGCCTGTTAACCCACCATTC AGACAACCATACCCATACACCGCTGTGGACCAAAAGAACGGTCTTGATTCTATCCCAGTG GACCCTGAACGTGTTGTTGCTGTGGTTGAATCTACTCAAAGGGATGTCGTTGGTCCAAAC ACTCCATCTGATGCTACATCTCAATCCATCGCTCGTCACTTGGTCGAGTTCTTCGAAAAT GAAGTGAGACATGGTAGACTACCTGAAAACTTGCATCCATTGCAATCCGGTATCGGTAAC ATCGCTAACGCCGTGATTGAAGGTTTGACTGACTCCTCCTTCAAGAACTTGACTGTGTGG ACTGAAGTCTTGCAAGATTCATTCTTGGATTTGTTCGAAAACGGTGCCTTGGATTACGCC ACTGCTACCTCTATCAGATTGACCGAAGCTGGTTTCCAAAAGTTTTTCGATAACTGGGAT GATTTCTCAAAGAAGCTATGTTTAAGATCCCAAGTTGTTTCTAACAACCCAGAATTGATC CGTCGTCTAGGTGTTATCGCTATGAACACCCCTGTCGAAGTTGACATTTACGCACACGCT AACTCCACCAACGTCTCTGGTTCTCGTATGTTGAACGGTTTGGGTGGTTCTGCTGATTTC TTGAGAAACGCTAAGTTGTCTATCATGCACGCTCCAGCTGCAAGACCTACAAAGACTGAC CCAACTGGTATCTCTACCATTGTCCCAATGGCTTCTCATGTCGATCAAACTGAACACGAT TTGGATGTCTTAGTCACCGATCAAGGTTTGGCTGACTTAAGAGGTCTATCTCCAAGAGAA AGAGCAAGAGAAATCATTAAGAACTGTGCTCATCCAGATTACCAACCAATCTTAACTGAC TACTTGGACAGATCAGAACATTATGCCAAGTTGCACAAGTGCATGCACGAACCTCACATG TTAAAGAATGCATTCAAATTCCACTTGAACTTGTCTGAAAAGGGTACCATGAAAGTCGAT AACTGGGATTAA
>KLLA0E10561g.aa MTVSRLLKDRVRYAPYLKKVKPVEELIPLFKDGQYIGWSGFTGVGAPKAVPEALIKHVEE NNLQGKLRFNLFVGASAGPEECKWAEHDMILRRAPHQVGKPIAKAINDGRIQFFDKHLSM FPQDLTYGYYSRNRTDGKILDYTIIEATAIKEDGSIVPGPSVGGSPEFISVSDKIIIEVN TATPSFEGLHDIDMPVNPPFRQPYPYTAVDQKNGLDSIPVDPERVVAVVESTQRDVVGPN TPSDATSQSIARHLVEFFENEVRHGRLPENLHPLQSGIGNIANAVIEGLTDSSFKNLTVW TEVLQDSFLDLFENGALDYATATSIRLTEAGFQKFFDNWDDFSKKLCLRSQVVSNNPELI RRLGVIAMNTPVEVDIYAHANSTNVSGSRMLNGLGGSADFLRNAKLSIMHAPAARPTKTD PTGISTIVPMASHVDQTEHDLDVLVTDQGLADLRGLSPRERAREIIKNCAHPDYQPILTD YLDRSEHYAKLHKCMHEPHMLKNAFKFHLNLSEKGTMKVDNWD*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/KLLA/KLLA0E10561p