YALI0A12573g


weakly similar to uniprot|P08540 Kluyveromyces lactis Potential acid phosphatase

Genomic environment map

Element type: CDS
Element length: 1818 nucleotides,
on sense strand of
Yali0A: 1303228..1305045.
Other names:
YALI-CDS1584.1
YALI-IPF9576
Coding sequence: 606 codons.
Database cross references:
EMBL: CR382127
GeneID: 2906361
GenomeReviews: CR382127_GR
HOGENOM: HBG736259

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C0576
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0A12573p  


weakly similar to uniprot|P08540 Kluyveromyces lactis Potential acid phosphatase; SubName: Full=YALI0A12573p;

Protein domain map

Protein length: 605 amino acids
Protein family: GL3C0576
Database cross references:
InterPro: IPR007312
KEGG: yli:YALI0A12573g
Pfam: PF04185
RefSeq: XP_500016.1
UniProtKB/TrEMBL: Q6CH46
UniProtKB: Q6CH46_YARLI

Phylogeny  

PhylomeDB:YALI0A12573g

Computed results for YALI0A12573p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>YALI0A12573g.nt
ATGACCACCATCTCCCCCAATCTGGTGGAGATTGCTGCTGCCCAGTCCACCGCCGTCTCC
AACCAGTTTACCGCCCCTGTTGTTCCCGGATGTGCGTTCGACAAGTACTACCAGATCTGG
CTGGAGAACACCGACTACGACAAGGCGTTTGAGCAGGAGGATATGGCTTGGCTCCGGGGC
CAGGGCATCACTCTGACCAACTACTGGTCCCTCACCCACCCTTCCGAGCCCAACTACGTC
GCGGCTGTTGGCGGTGACTACTTTGGCATCAATAACGACGACTTCTTCCGAATCCCCGAC
AATGTCTTCACTATTGCCGACCTGCTCGATACCAAGGGCATCTCCTGGGGAGAGTACCAG
GAGCACCAGCCCTACACCGGTTTCCAGGGCATGAACTTCTCTCGACAGAGCGACTTTGCC
CCCGACTACGTGCGAAAGCACAACCCCCTGGTCCACTACGACTCCGTGGTCAACGTTCCC
GAGCGACTCGGCAACCTCAAGAACTTCACCGAGTTCAACAAGGACTTTTCCAACGCCCAG
ATCCCCCAGTGGGCCTTCATCACCCCCAATATGACCAACGATGCCCACGATACCGATATT
GAATTCGCTGGCAAGTGGACCCGGGGCTTCCTCGAGCCCCTGCTGGCTAACAAGGAGTTC
ATGGACAAGAACCTGATCATCGTCACCTTTGACGAGAACGAAACCTACCGTCTGGAGAAC
AAGGTCCTGGCAGTTCTTCTTGGTGGAGCTGTCCCCAAGGAGCTCCAGGGCACCTCCGAC
GACACCTACTACTCGCACTACTCCAACCTGGCCACCGTCCAGGCCAACTGGCAACTTCCC
CATCTCGGACGAGGCGACGCGCTTGCCAACCCCTTCAAGTTTGTTGCCGACCAGCTCAAC
ATTTCCGTCGTCGACTACGATACTACTGGCCAGTTCAACAACCAGTCTGTTGTCGGCTAC
TTCAACGACGAGCGAGTACCCATCCCCGCCCCCAACGCCTCCGCCATTGGAGTGGTTGGA
AACACCATTTACCAGAAGATTGCCGACATTTGGGGCGGCGAGGTCCCCAAGGTGGCCACC
CCCTCCACTTGTCCCACCAAGATTGTCGAGGAGTCTGCCATCTCTTTTAACAATGAGACC
GAGTCTGCCACCCCCTCCAACAATGCGGCCGAGTCTGCTGATGTCCAGTCCACCTCTATC
GAGGGCCAGTACGTCACTATCTCCTCCAACTGTTCTACTTCCGCTAACGCTACTCTTCAG
TCTGTTGCCGTTTCTACCAAGTCCATCGCCACCGCTACCGTTGTTGATGGCGAGTACGTG
ACCGTTTACGTCACTGACTGTCCCGTTACCACCGTCGATTCCAATGGAGACATTACCACT
CGGGTTGTTCAGTCCACCGTCACCGAGACCGTTTGTCCCAAGTGCACTAAGATCGAGACT
AAGGAGTCTTCTAAGGAGTCCCCCAAGGAGTCTCCTAAGGCCACCTCTGTTCAGACCCCC
AAGGAGACCCCCAAGGGATCCCCCAAGGCTGTTGAGACCCCGAAGGAGTCTCCTAAGACC
ACTTCTGCCCCTCAGACCCCCAAGGAGTCCCCCAAGGCTGCCGAGACCCCCAAGGCTTCC
CCCAAGGAGAACGATTCTACCGTGACAATCAACAAGACCGTGACCAAGACCCAGTCCGCA
GCTACCACCATGGTGCCCAGCAAGTCGGCTCCTGCCCAGGCCCCCAAGAGCGAGACTACT
CCTGCCCAGGCCAACGGCGCTGCCAAGGCTGTTGTTGGAGCTGCTGCTGTCATTCCCGCC
CTTATGGCTCTGTTCTAA

Coding sequence    

>YALI0A12573g.cds
ATGACCACCATCTCCCCCAATCTGGTGGAGATTGCTGCTGCCCAGTCCACCGCCGTCTCC
AACCAGTTTACCGCCCCTGTTGTTCCCGGATGTGCGTTCGACAAGTACTACCAGATCTGG
CTGGAGAACACCGACTACGACAAGGCGTTTGAGCAGGAGGATATGGCTTGGCTCCGGGGC
CAGGGCATCACTCTGACCAACTACTGGTCCCTCACCCACCCTTCCGAGCCCAACTACGTC
GCGGCTGTTGGCGGTGACTACTTTGGCATCAATAACGACGACTTCTTCCGAATCCCCGAC
AATGTCTTCACTATTGCCGACCTGCTCGATACCAAGGGCATCTCCTGGGGAGAGTACCAG
GAGCACCAGCCCTACACCGGTTTCCAGGGCATGAACTTCTCTCGACAGAGCGACTTTGCC
CCCGACTACGTGCGAAAGCACAACCCCCTGGTCCACTACGACTCCGTGGTCAACGTTCCC
GAGCGACTCGGCAACCTCAAGAACTTCACCGAGTTCAACAAGGACTTTTCCAACGCCCAG
ATCCCCCAGTGGGCCTTCATCACCCCCAATATGACCAACGATGCCCACGATACCGATATT
GAATTCGCTGGCAAGTGGACCCGGGGCTTCCTCGAGCCCCTGCTGGCTAACAAGGAGTTC
ATGGACAAGAACCTGATCATCGTCACCTTTGACGAGAACGAAACCTACCGTCTGGAGAAC
AAGGTCCTGGCAGTTCTTCTTGGTGGAGCTGTCCCCAAGGAGCTCCAGGGCACCTCCGAC
GACACCTACTACTCGCACTACTCCAACCTGGCCACCGTCCAGGCCAACTGGCAACTTCCC
CATCTCGGACGAGGCGACGCGCTTGCCAACCCCTTCAAGTTTGTTGCCGACCAGCTCAAC
ATTTCCGTCGTCGACTACGATACTACTGGCCAGTTCAACAACCAGTCTGTTGTCGGCTAC
TTCAACGACGAGCGAGTACCCATCCCCGCCCCCAACGCCTCCGCCATTGGAGTGGTTGGA
AACACCATTTACCAGAAGATTGCCGACATTTGGGGCGGCGAGGTCCCCAAGGTGGCCACC
CCCTCCACTTGTCCCACCAAGATTGTCGAGGAGTCTGCCATCTCTTTTAACAATGAGACC
GAGTCTGCCACCCCCTCCAACAATGCGGCCGAGTCTGCTGATGTCCAGTCCACCTCTATC
GAGGGCCAGTACGTCACTATCTCCTCCAACTGTTCTACTTCCGCTAACGCTACTCTTCAG
TCTGTTGCCGTTTCTACCAAGTCCATCGCCACCGCTACCGTTGTTGATGGCGAGTACGTG
ACCGTTTACGTCACTGACTGTCCCGTTACCACCGTCGATTCCAATGGAGACATTACCACT
CGGGTTGTTCAGTCCACCGTCACCGAGACCGTTTGTCCCAAGTGCACTAAGATCGAGACT
AAGGAGTCTTCTAAGGAGTCCCCCAAGGAGTCTCCTAAGGCCACCTCTGTTCAGACCCCC
AAGGAGACCCCCAAGGGATCCCCCAAGGCTGTTGAGACCCCGAAGGAGTCTCCTAAGACC
ACTTCTGCCCCTCAGACCCCCAAGGAGTCCCCCAAGGCTGCCGAGACCCCCAAGGCTTCC
CCCAAGGAGAACGATTCTACCGTGACAATCAACAAGACCGTGACCAAGACCCAGTCCGCA
GCTACCACCATGGTGCCCAGCAAGTCGGCTCCTGCCCAGGCCCCCAAGAGCGAGACTACT
CCTGCCCAGGCCAACGGCGCTGCCAAGGCTGTTGTTGGAGCTGCTGCTGTCATTCCCGCC
CTTATGGCTCTGTTCTAA

Predicted translation product    

>YALI0A12573g.aa
MTTISPNLVEIAAAQSTAVSNQFTAPVVPGCAFDKYYQIWLENTDYDKAFEQEDMAWLRG
QGITLTNYWSLTHPSEPNYVAAVGGDYFGINNDDFFRIPDNVFTIADLLDTKGISWGEYQ
EHQPYTGFQGMNFSRQSDFAPDYVRKHNPLVHYDSVVNVPERLGNLKNFTEFNKDFSNAQ
IPQWAFITPNMTNDAHDTDIEFAGKWTRGFLEPLLANKEFMDKNLIIVTFDENETYRLEN
KVLAVLLGGAVPKELQGTSDDTYYSHYSNLATVQANWQLPHLGRGDALANPFKFVADQLN
ISVVDYDTTGQFNNQSVVGYFNDERVPIPAPNASAIGVVGNTIYQKIADIWGGEVPKVAT
PSTCPTKIVEESAISFNNETESATPSNNAAESADVQSTSIEGQYVTISSNCSTSANATLQ
SVAVSTKSIATATVVDGEYVTVYVTDCPVTTVDSNGDITTRVVQSTVTETVCPKCTKIET
KESSKESPKESPKATSVQTPKETPKGSPKAVETPKESPKTTSAPQTPKESPKAAETPKAS
PKENDSTVTINKTVTKTQSAATTMVPSKSAPAQAPKSETTPAQANGAAKAVVGAAAVIPA
LMALF*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites