ERGO0F05984g


Syntenic homolog of Saccharomyces cerevisiae YBL015W (ACH1)

Genomic environment map

Element type: CDS
Element length: 1971 nucleotides,
on sense strand of
Ergo0F: 472535..474505.
Other names:
AFR020W
AGOS_AFR020W
Coding sequence: 657 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R1841 GL3R1841.F1 GL3R1841.N1
Orthologs by synteny: ZYRO0C13134g SAKL0H19690g KLTH0H08338g KLLA0E10561g

Protein ERGO0F05984p  


Protein domain map

Protein length: 656 amino acids
Protein family: GL3R1841
Database cross references:
InterPro: IPR003702
UniProtKB/Swiss-Prot: Q754Q2

Computed results for ERGO0F05984p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>ERGO0F05984g.nt
ATGCGCCGGTCCGCCCGGACGCGTAGCGTGCATACATACATACATATAACTACTGCCGAA
CTCCGCGACGCATCGCTCGCGCTTCGGCGCGTGGCTGGTCGCCGATCGTTTTGGTGCGCT
TGGGGAAGCTCTTGCATTCCCGTTGTGCCGGCGCCCCCGGATTCCAACAACCCCGTCATG
CGACTGTCTAAACGCATGACTGCCATCAGATGGTACAGTATAGCGCGTGATACCGCGAGC
GTATCCATTTTGCCGTCCCGGGAGCTGACGGTGGCAATAGCTCCAACAACCGGCGGCATT
GTGGTTATACGAGTTCAACAGTATAAAAGGGGCCGCGGTGGCAAATGGTACGATCGGGTT
TTTGCGTGGGCGTTGTGTAAGCGGCAACGGACATACCGAATGACAGTTTCGCAGCTTTTG
AAACAGAGGGTCCGGTACGCCCCATACCTATCCAAAGTACGGAGAGCAGAGGAGCTGTTG
CCTCTGTTTAAGCATGGGCAGTATATCGGGTGGTCCGGGTTCACGGGCGTGGGCGCGCCC
AAAGTGATCCCCACGGCGCTTGCAGACCATGTGGAGAAGAACGGGCTACAGGGCCAGCTG
GCATTCAATCTTTTCGTGGGCGCGTCGGCTGGGCCGGAGGAGAACAGGTGGGCGGACCTG
GACATGATTTTGCGGCGCGCGCCGCACCAGGTCGGCAAGCCGATCGCGCGCGCGATCAAC
GATGGGCGCATCAAGTTCTTCGACAAGCACCTGTCGATGTTTCCGCAGGACCTGACGTAC
GGGTACTACACCCGGGAGCGGACGGACGGCAAGATCCTGGACTACGCGATCGTGGAAGCG
ACGGCGATCAAGGAGGACGGCTCGATTGTGCTGGGCCCGTCGGTGGGCGGCTCGCCGGAG
TTCATGTCTGCGGCGGACAAGCTGATCGTGGAGGTCAACACCGCGACGCCGTCGTTCGAG
GGGCTGCATGACATCGACATGCCGGTGCTGCCACCGCACCGCGTGCCGTACCCATACACG
CGGGTGGACGAGCGCAGCGGGCTGGACGCGGTGCCGGTGGACCCTGCGCGCGTGGTCGCG
CTAGTGGAGAGCACCGAGCGCGACAAGGTGGGGCCCAACACGCCCTCGGACGAGGGGTCG
CGCGCGATTGCGGGGCATCTGGTGGAGTTCTTCGAGAACGAGGTCAGGCACGGGCGGCTG
CCAGCCAACCTGCTGCCGCTGCAGAGCGGCATCGGCAACATTGCGAACGCGGTGATCGAG
GGGCTTGCAGGCGCCTCATTCCGGAACCTGACAGTTTGGACCGAGGTGTTGCAGGACTCG
TTCCTGGATCTGTTCGAGAACGGCTCCCTAGAATTCGCGACCGCCACCAGTATCCGCCTG
ACCGAGGCCGGCTTCGAGAAGTTCTTTGCCAACTGGGACGAATACTCCTCCAAGCTGTGC
TTGCGCTCCCAGGTCGTGTCCAACAGCCCAGAGATGATCCGCCGGCTCGGTGTCATCGCA
ATGAATACGCCTGTGGAGGTCGACATCTACGCGCACGCGAACTCGACCAACGTGTCTGGC
TCTCGGATGCTCAACGGGCTCGGCGGCTCCGCGGACTTCCTCCGGAACGCAAAGCTGTCC
ATCATGCATGCGCCTTCTGCCAGACCTAGCAAGACCGATCCTACCGGCATCTCGACCATC
GTGCCCATGGCATCCCATGTCGACCAGCTTGAGCATGACCTAGACGTCCTAGTCACGGAT
CAGGGTCTGGCCGACCTGCGTGGTCTCTGTCCGCGGGAGCGCGCGCGCGAGATTATCCGC
CAGTGCGCACATCCAGACTACAAGCCAATTTTGACTGACTACCTAGACAGAGCTGAGCAT
TATGCGCAGCGCTCGCGCTCGATGCACGAGCCTCATATTTTGCAGCAAGCTCTCAGATTC
CATACCCATCTCGCTGAAAAAGGCACCATGAAGGTCCCTTCGTGGGACTAA

Coding sequence    

>ERGO0F05984g.cds
ATGCGCCGGTCCGCCCGGACGCGTAGCGTGCATACATACATACATATAACTACTGCCGAA
CTCCGCGACGCATCGCTCGCGCTTCGGCGCGTGGCTGGTCGCCGATCGTTTTGGTGCGCT
TGGGGAAGCTCTTGCATTCCCGTTGTGCCGGCGCCCCCGGATTCCAACAACCCCGTCATG
CGACTGTCTAAACGCATGACTGCCATCAGATGGTACAGTATAGCGCGTGATACCGCGAGC
GTATCCATTTTGCCGTCCCGGGAGCTGACGGTGGCAATAGCTCCAACAACCGGCGGCATT
GTGGTTATACGAGTTCAACAGTATAAAAGGGGCCGCGGTGGCAAATGGTACGATCGGGTT
TTTGCGTGGGCGTTGTGTAAGCGGCAACGGACATACCGAATGACAGTTTCGCAGCTTTTG
AAACAGAGGGTCCGGTACGCCCCATACCTATCCAAAGTACGGAGAGCAGAGGAGCTGTTG
CCTCTGTTTAAGCATGGGCAGTATATCGGGTGGTCCGGGTTCACGGGCGTGGGCGCGCCC
AAAGTGATCCCCACGGCGCTTGCAGACCATGTGGAGAAGAACGGGCTACAGGGCCAGCTG
GCATTCAATCTTTTCGTGGGCGCGTCGGCTGGGCCGGAGGAGAACAGGTGGGCGGACCTG
GACATGATTTTGCGGCGCGCGCCGCACCAGGTCGGCAAGCCGATCGCGCGCGCGATCAAC
GATGGGCGCATCAAGTTCTTCGACAAGCACCTGTCGATGTTTCCGCAGGACCTGACGTAC
GGGTACTACACCCGGGAGCGGACGGACGGCAAGATCCTGGACTACGCGATCGTGGAAGCG
ACGGCGATCAAGGAGGACGGCTCGATTGTGCTGGGCCCGTCGGTGGGCGGCTCGCCGGAG
TTCATGTCTGCGGCGGACAAGCTGATCGTGGAGGTCAACACCGCGACGCCGTCGTTCGAG
GGGCTGCATGACATCGACATGCCGGTGCTGCCACCGCACCGCGTGCCGTACCCATACACG
CGGGTGGACGAGCGCAGCGGGCTGGACGCGGTGCCGGTGGACCCTGCGCGCGTGGTCGCG
CTAGTGGAGAGCACCGAGCGCGACAAGGTGGGGCCCAACACGCCCTCGGACGAGGGGTCG
CGCGCGATTGCGGGGCATCTGGTGGAGTTCTTCGAGAACGAGGTCAGGCACGGGCGGCTG
CCAGCCAACCTGCTGCCGCTGCAGAGCGGCATCGGCAACATTGCGAACGCGGTGATCGAG
GGGCTTGCAGGCGCCTCATTCCGGAACCTGACAGTTTGGACCGAGGTGTTGCAGGACTCG
TTCCTGGATCTGTTCGAGAACGGCTCCCTAGAATTCGCGACCGCCACCAGTATCCGCCTG
ACCGAGGCCGGCTTCGAGAAGTTCTTTGCCAACTGGGACGAATACTCCTCCAAGCTGTGC
TTGCGCTCCCAGGTCGTGTCCAACAGCCCAGAGATGATCCGCCGGCTCGGTGTCATCGCA
ATGAATACGCCTGTGGAGGTCGACATCTACGCGCACGCGAACTCGACCAACGTGTCTGGC
TCTCGGATGCTCAACGGGCTCGGCGGCTCCGCGGACTTCCTCCGGAACGCAAAGCTGTCC
ATCATGCATGCGCCTTCTGCCAGACCTAGCAAGACCGATCCTACCGGCATCTCGACCATC
GTGCCCATGGCATCCCATGTCGACCAGCTTGAGCATGACCTAGACGTCCTAGTCACGGAT
CAGGGTCTGGCCGACCTGCGTGGTCTCTGTCCGCGGGAGCGCGCGCGCGAGATTATCCGC
CAGTGCGCACATCCAGACTACAAGCCAATTTTGACTGACTACCTAGACAGAGCTGAGCAT
TATGCGCAGCGCTCGCGCTCGATGCACGAGCCTCATATTTTGCAGCAAGCTCTCAGATTC
CATACCCATCTCGCTGAAAAAGGCACCATGAAGGTCCCTTCGTGGGACTAA

Predicted translation product    

>ERGO0F05984g.aa
MRRSARTRSVHTYIHITTAELRDASLALRRVAGRRSFWCAWGSSCIPVVPAPPDSNNPVM
RLSKRMTAIRWYSIARDTASVSILPSRELTVAIAPTTGGIVVIRVQQYKRGRGGKWYDRV
FAWALCKRQRTYRMTVSQLLKQRVRYAPYLSKVRRAEELLPLFKHGQYIGWSGFTGVGAP
KVIPTALADHVEKNGLQGQLAFNLFVGASAGPEENRWADLDMILRRAPHQVGKPIARAIN
DGRIKFFDKHLSMFPQDLTYGYYTRERTDGKILDYAIVEATAIKEDGSIVLGPSVGGSPE
FMSAADKLIVEVNTATPSFEGLHDIDMPVLPPHRVPYPYTRVDERSGLDAVPVDPARVVA
LVESTERDKVGPNTPSDEGSRAIAGHLVEFFENEVRHGRLPANLLPLQSGIGNIANAVIE
GLAGASFRNLTVWTEVLQDSFLDLFENGSLEFATATSIRLTEAGFEKFFANWDEYSSKLC
LRSQVVSNSPEMIRRLGVIAMNTPVEVDIYAHANSTNVSGSRMLNGLGGSADFLRNAKLS
IMHAPSARPSKTDPTGISTIVPMASHVDQLEHDLDVLVTDQGLADLRGLCPRERAREIIR
QCAHPDYKPILTDYLDRAEHYAQRSRSMHEPHILQQALRFHTHLAEKGTMKVPSWD*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites