Element type: CDS
Element length: 1971 nucleotides,
on sense strand of
Ergo0F: 472535..474505.
Other names:
AFR020W
AGOS_AFR020W
Coding sequence: 657 codons.
Element length: 1971 nucleotides,
on sense strand of
Ergo0F: 472535..474505.
Other names:
AFR020W
AGOS_AFR020W
Coding sequence: 657 codons.
Homologs and Orthologs
Homologs in protein families: GL3R1841 GL3R1841.F1 GL3R1841.N1Orthologs by synteny: ZYRO0C13134g SAKL0H19690g KLTH0H08338g KLLA0E10561g
Protein domain map
Sequence data 
Nucleotide sequence
>ERGO0F05984g.nt ATGCGCCGGTCCGCCCGGACGCGTAGCGTGCATACATACATACATATAACTACTGCCGAA CTCCGCGACGCATCGCTCGCGCTTCGGCGCGTGGCTGGTCGCCGATCGTTTTGGTGCGCT TGGGGAAGCTCTTGCATTCCCGTTGTGCCGGCGCCCCCGGATTCCAACAACCCCGTCATG CGACTGTCTAAACGCATGACTGCCATCAGATGGTACAGTATAGCGCGTGATACCGCGAGC GTATCCATTTTGCCGTCCCGGGAGCTGACGGTGGCAATAGCTCCAACAACCGGCGGCATT GTGGTTATACGAGTTCAACAGTATAAAAGGGGCCGCGGTGGCAAATGGTACGATCGGGTT TTTGCGTGGGCGTTGTGTAAGCGGCAACGGACATACCGAATGACAGTTTCGCAGCTTTTG AAACAGAGGGTCCGGTACGCCCCATACCTATCCAAAGTACGGAGAGCAGAGGAGCTGTTG CCTCTGTTTAAGCATGGGCAGTATATCGGGTGGTCCGGGTTCACGGGCGTGGGCGCGCCC AAAGTGATCCCCACGGCGCTTGCAGACCATGTGGAGAAGAACGGGCTACAGGGCCAGCTG GCATTCAATCTTTTCGTGGGCGCGTCGGCTGGGCCGGAGGAGAACAGGTGGGCGGACCTG GACATGATTTTGCGGCGCGCGCCGCACCAGGTCGGCAAGCCGATCGCGCGCGCGATCAAC GATGGGCGCATCAAGTTCTTCGACAAGCACCTGTCGATGTTTCCGCAGGACCTGACGTAC GGGTACTACACCCGGGAGCGGACGGACGGCAAGATCCTGGACTACGCGATCGTGGAAGCG ACGGCGATCAAGGAGGACGGCTCGATTGTGCTGGGCCCGTCGGTGGGCGGCTCGCCGGAG TTCATGTCTGCGGCGGACAAGCTGATCGTGGAGGTCAACACCGCGACGCCGTCGTTCGAG GGGCTGCATGACATCGACATGCCGGTGCTGCCACCGCACCGCGTGCCGTACCCATACACG CGGGTGGACGAGCGCAGCGGGCTGGACGCGGTGCCGGTGGACCCTGCGCGCGTGGTCGCG CTAGTGGAGAGCACCGAGCGCGACAAGGTGGGGCCCAACACGCCCTCGGACGAGGGGTCG CGCGCGATTGCGGGGCATCTGGTGGAGTTCTTCGAGAACGAGGTCAGGCACGGGCGGCTG CCAGCCAACCTGCTGCCGCTGCAGAGCGGCATCGGCAACATTGCGAACGCGGTGATCGAG GGGCTTGCAGGCGCCTCATTCCGGAACCTGACAGTTTGGACCGAGGTGTTGCAGGACTCG TTCCTGGATCTGTTCGAGAACGGCTCCCTAGAATTCGCGACCGCCACCAGTATCCGCCTG ACCGAGGCCGGCTTCGAGAAGTTCTTTGCCAACTGGGACGAATACTCCTCCAAGCTGTGC TTGCGCTCCCAGGTCGTGTCCAACAGCCCAGAGATGATCCGCCGGCTCGGTGTCATCGCA ATGAATACGCCTGTGGAGGTCGACATCTACGCGCACGCGAACTCGACCAACGTGTCTGGC TCTCGGATGCTCAACGGGCTCGGCGGCTCCGCGGACTTCCTCCGGAACGCAAAGCTGTCC ATCATGCATGCGCCTTCTGCCAGACCTAGCAAGACCGATCCTACCGGCATCTCGACCATC GTGCCCATGGCATCCCATGTCGACCAGCTTGAGCATGACCTAGACGTCCTAGTCACGGAT CAGGGTCTGGCCGACCTGCGTGGTCTCTGTCCGCGGGAGCGCGCGCGCGAGATTATCCGC CAGTGCGCACATCCAGACTACAAGCCAATTTTGACTGACTACCTAGACAGAGCTGAGCAT TATGCGCAGCGCTCGCGCTCGATGCACGAGCCTCATATTTTGCAGCAAGCTCTCAGATTC CATACCCATCTCGCTGAAAAAGGCACCATGAAGGTCCCTTCGTGGGACTAA
Coding sequence
>ERGO0F05984g.cds ATGCGCCGGTCCGCCCGGACGCGTAGCGTGCATACATACATACATATAACTACTGCCGAA CTCCGCGACGCATCGCTCGCGCTTCGGCGCGTGGCTGGTCGCCGATCGTTTTGGTGCGCT TGGGGAAGCTCTTGCATTCCCGTTGTGCCGGCGCCCCCGGATTCCAACAACCCCGTCATG CGACTGTCTAAACGCATGACTGCCATCAGATGGTACAGTATAGCGCGTGATACCGCGAGC GTATCCATTTTGCCGTCCCGGGAGCTGACGGTGGCAATAGCTCCAACAACCGGCGGCATT GTGGTTATACGAGTTCAACAGTATAAAAGGGGCCGCGGTGGCAAATGGTACGATCGGGTT TTTGCGTGGGCGTTGTGTAAGCGGCAACGGACATACCGAATGACAGTTTCGCAGCTTTTG AAACAGAGGGTCCGGTACGCCCCATACCTATCCAAAGTACGGAGAGCAGAGGAGCTGTTG CCTCTGTTTAAGCATGGGCAGTATATCGGGTGGTCCGGGTTCACGGGCGTGGGCGCGCCC AAAGTGATCCCCACGGCGCTTGCAGACCATGTGGAGAAGAACGGGCTACAGGGCCAGCTG GCATTCAATCTTTTCGTGGGCGCGTCGGCTGGGCCGGAGGAGAACAGGTGGGCGGACCTG GACATGATTTTGCGGCGCGCGCCGCACCAGGTCGGCAAGCCGATCGCGCGCGCGATCAAC GATGGGCGCATCAAGTTCTTCGACAAGCACCTGTCGATGTTTCCGCAGGACCTGACGTAC GGGTACTACACCCGGGAGCGGACGGACGGCAAGATCCTGGACTACGCGATCGTGGAAGCG ACGGCGATCAAGGAGGACGGCTCGATTGTGCTGGGCCCGTCGGTGGGCGGCTCGCCGGAG TTCATGTCTGCGGCGGACAAGCTGATCGTGGAGGTCAACACCGCGACGCCGTCGTTCGAG GGGCTGCATGACATCGACATGCCGGTGCTGCCACCGCACCGCGTGCCGTACCCATACACG CGGGTGGACGAGCGCAGCGGGCTGGACGCGGTGCCGGTGGACCCTGCGCGCGTGGTCGCG CTAGTGGAGAGCACCGAGCGCGACAAGGTGGGGCCCAACACGCCCTCGGACGAGGGGTCG CGCGCGATTGCGGGGCATCTGGTGGAGTTCTTCGAGAACGAGGTCAGGCACGGGCGGCTG CCAGCCAACCTGCTGCCGCTGCAGAGCGGCATCGGCAACATTGCGAACGCGGTGATCGAG GGGCTTGCAGGCGCCTCATTCCGGAACCTGACAGTTTGGACCGAGGTGTTGCAGGACTCG TTCCTGGATCTGTTCGAGAACGGCTCCCTAGAATTCGCGACCGCCACCAGTATCCGCCTG ACCGAGGCCGGCTTCGAGAAGTTCTTTGCCAACTGGGACGAATACTCCTCCAAGCTGTGC TTGCGCTCCCAGGTCGTGTCCAACAGCCCAGAGATGATCCGCCGGCTCGGTGTCATCGCA ATGAATACGCCTGTGGAGGTCGACATCTACGCGCACGCGAACTCGACCAACGTGTCTGGC TCTCGGATGCTCAACGGGCTCGGCGGCTCCGCGGACTTCCTCCGGAACGCAAAGCTGTCC ATCATGCATGCGCCTTCTGCCAGACCTAGCAAGACCGATCCTACCGGCATCTCGACCATC GTGCCCATGGCATCCCATGTCGACCAGCTTGAGCATGACCTAGACGTCCTAGTCACGGAT CAGGGTCTGGCCGACCTGCGTGGTCTCTGTCCGCGGGAGCGCGCGCGCGAGATTATCCGC CAGTGCGCACATCCAGACTACAAGCCAATTTTGACTGACTACCTAGACAGAGCTGAGCAT TATGCGCAGCGCTCGCGCTCGATGCACGAGCCTCATATTTTGCAGCAAGCTCTCAGATTC CATACCCATCTCGCTGAAAAAGGCACCATGAAGGTCCCTTCGTGGGACTAA
Predicted translation product
>ERGO0F05984g.aa MRRSARTRSVHTYIHITTAELRDASLALRRVAGRRSFWCAWGSSCIPVVPAPPDSNNPVM RLSKRMTAIRWYSIARDTASVSILPSRELTVAIAPTTGGIVVIRVQQYKRGRGGKWYDRV FAWALCKRQRTYRMTVSQLLKQRVRYAPYLSKVRRAEELLPLFKHGQYIGWSGFTGVGAP KVIPTALADHVEKNGLQGQLAFNLFVGASAGPEENRWADLDMILRRAPHQVGKPIARAIN DGRIKFFDKHLSMFPQDLTYGYYTRERTDGKILDYAIVEATAIKEDGSIVLGPSVGGSPE FMSAADKLIVEVNTATPSFEGLHDIDMPVLPPHRVPYPYTRVDERSGLDAVPVDPARVVA LVESTERDKVGPNTPSDEGSRAIAGHLVEFFENEVRHGRLPANLLPLQSGIGNIANAVIE GLAGASFRNLTVWTEVLQDSFLDLFENGSLEFATATSIRLTEAGFEKFFANWDEYSSKLC LRSQVVSNSPEMIRRLGVIAMNTPVEVDIYAHANSTNVSGSRMLNGLGGSADFLRNAKLS IMHAPSARPSKTDPTGISTIVPMASHVDQLEHDLDVLVTDQGLADLRGLCPRERAREIIR QCAHPDYKPILTDYLDRAEHYAQRSRSMHEPHILQQALRFHTHLAEKGTMKVPSWD*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/ERGO/ERGO0F05984g