Element type: CDS
Element length: 2211 nucleotides,
on sense strand of
Ergo0C: 774173..776383.
Other names:
ACR236W
AGOS_ACR236W
Coding sequence: 737 codons.
Homologs and Orthologs
Homologs in protein families: GL3C0136 GL3C0136.F2 GL3C0136.N4
Orthologs by synteny: ZYRO0B02882g SAKL0F11946g KLTH0B04994g KLLA0E19911g
Protein domain map
Sequence data 
Nucleotide sequence
>ERGO0C10164g.nt ATGACGATTAGTGCTGACTCTCAGGCAAAGAAGAGCAAGATGCTCGATGGGCTCGATATC AAGGACATCATACAAGGTGATGAGCAAGACGGCGCAGGAAACGGCGGCAGCAGACGCGTA CACAGGCCGCGCGATGGGAATGGGCCGGACTACCGCATACACAAACCCATGGCGGCGGTA GGGTGGCTACAGGGACGGATTCCGTCGGATGTGGCGGAGTGGGGGCGGTGGAACGCACAC GTAGATTTTCCGGATGGCACGAAGAACAAGGTGAGGATACGATACGACCCGCGCAAGCTG TACGACTTCCGGCGGGTGCTGCGGGCGTCGGCGGCGGAGACGGGGCCACCGCCGCACCTG GACTTTGAGATCTACGAGACCGAGAATCTGCTGAACATGCTGGACAACGAGTCGTACCCG TACCGCGGGGCGATTGCCAACAAGAAGGACTACTCGACCAGCAAGACGACGCCGACGCTG CGCGATCGCCAGTTTTTTCAGAAGCTGCTGTTTCAGAGCGCGAACGCGGCGCGCACCAAC GGGAACACGTTACTATCTTTCACAGAGGAAGAGCCCAAAGGCAGGCCGAGCAAGGCCGAG CGGGCGCACAAGGTTTCGTACATCTATTTCAATGACCACGAGATCAAAACGTGGTATACA GCGCCGTATCCTGAAGAATATAGCAAAAACAGAATTCTCTACATTTGCGATTCGTGTCTG AAGTACATGAACTCGAAATATATATACTACCGACACAAGCTGAAATGTTCGATGATGCAC CCACCTGGCAATGAAATCTACAGGGATGGCAAAATCTCGATATGGGAGATCGACGGCCGG GAACAGGTAATATATTGCCAGAACTTGTGTTTACTTGCCAAGCTTTTCTTGAACTCCAAG ACGCTTTACTATGATGTCGAGCCGTTTGTGTTTTACCTGCTAACGGAGAACGTTAAGACA GGCCAAGGCTTAAAATTCAACGTCGTGGGGTACTTTTCAAAGGAGAAGTTGAACACAACC GATTATAACCTGAGTTGCATTCTGGTGCTTCCGACGCACCAGCGGTTAGGTTATGGCCAT CTATTAATGGACTTTTCGTACCTGCTATCTCGAAGAGAGTTCAAATGGGGCACCCCAGAG AAGCCGCTTAGCGATCTCGGGCTGCTCTCATACCGGAACTATTGGAAGATCAAGATGGCA CAAGTCCTGCGAAGCTTGAAACCGATAATAGCGAAGTCGACTTATTTTTGCATTTCTTTG GAGGATATCGCCAATTTGACGGGGATGACACCAACCGATGTCATATTCGGTCTGGAACAG TTGGGGTGCTTCTACAGATACTCATCTATTGACGGGAGAAAATACGCCATAAGAATAGAC AGCTGGGATGAGATAGAACATATTTACCAGCAATGGGAATCCAAGGCATACATCACTTTG GACCCAGATAAGCTGATCTGGCGCCCGCTAATCTTCGGACCCTCCTGTGGGATAAACGCT GTAGGACCATGTGAAACTAGCACAGGCGCACACAGAAATACTGGTGCTGTGGGCTCCAGT CAAGGGACCGATTTATTCAAGAACAGTATATCTGTTTTGGTTAACTTTATGCATGATGAT ATTTCAGATTCCAGGACCATGGAAAAAATGGCATGGGACAAAATTGTAGCACAACAGAAG AATGCGCGTGAGGTCCAGAAGGATGCAACAGATCCTGCCATTCTGGAGTTGGCGTACGTG CACCCTTCTTTCACAAATGGCAGCACAGCGGAATATGGGGTCAAACCTGTAAGAAGAAAC CCGCCTGCGATTCGCGAAACCCCGAAGGTAGGCTCACCTGCAACGCCGGAGGAGAGGGAT GATATGACACCCTTAGAGGACGTGTATGAGGATTCATTTGATCAGACACACCTCAGCGAC 
CAGGATGACGAATACACGGATCAGGGAGAAGAGGAGGACGATGAAGAGGAGGACGATGAA GAGGACGATGGAATGAATGTCGGCCGATTTAACAGCAACGCATTACCCAGTAGGGCGTAC TCGAGACGAAGTGCTTCAGAGCATTCCAATAATCGAACAAGGTCGTCACGTCACCAGCAG TCTGTACCACATGTGGAATCAGACGATGACGAAGATCTTGAAGAACATTGGGTGGACGCA CTAGAAACTGTTTCGAGAACAAGGAGGATATTGCGAAACCGCACAGCTTAG
Coding sequence
>ERGO0C10164g.cds ATGACGATTAGTGCTGACTCTCAGGCAAAGAAGAGCAAGATGCTCGATGGGCTCGATATC AAGGACATCATACAAGGTGATGAGCAAGACGGCGCAGGAAACGGCGGCAGCAGACGCGTA CACAGGCCGCGCGATGGGAATGGGCCGGACTACCGCATACACAAACCCATGGCGGCGGTA GGGTGGCTACAGGGACGGATTCCGTCGGATGTGGCGGAGTGGGGGCGGTGGAACGCACAC GTAGATTTTCCGGATGGCACGAAGAACAAGGTGAGGATACGATACGACCCGCGCAAGCTG TACGACTTCCGGCGGGTGCTGCGGGCGTCGGCGGCGGAGACGGGGCCACCGCCGCACCTG GACTTTGAGATCTACGAGACCGAGAATCTGCTGAACATGCTGGACAACGAGTCGTACCCG TACCGCGGGGCGATTGCCAACAAGAAGGACTACTCGACCAGCAAGACGACGCCGACGCTG CGCGATCGCCAGTTTTTTCAGAAGCTGCTGTTTCAGAGCGCGAACGCGGCGCGCACCAAC GGGAACACGTTACTATCTTTCACAGAGGAAGAGCCCAAAGGCAGGCCGAGCAAGGCCGAG CGGGCGCACAAGGTTTCGTACATCTATTTCAATGACCACGAGATCAAAACGTGGTATACA GCGCCGTATCCTGAAGAATATAGCAAAAACAGAATTCTCTACATTTGCGATTCGTGTCTG AAGTACATGAACTCGAAATATATATACTACCGACACAAGCTGAAATGTTCGATGATGCAC CCACCTGGCAATGAAATCTACAGGGATGGCAAAATCTCGATATGGGAGATCGACGGCCGG GAACAGGTAATATATTGCCAGAACTTGTGTTTACTTGCCAAGCTTTTCTTGAACTCCAAG ACGCTTTACTATGATGTCGAGCCGTTTGTGTTTTACCTGCTAACGGAGAACGTTAAGACA GGCCAAGGCTTAAAATTCAACGTCGTGGGGTACTTTTCAAAGGAGAAGTTGAACACAACC GATTATAACCTGAGTTGCATTCTGGTGCTTCCGACGCACCAGCGGTTAGGTTATGGCCAT CTATTAATGGACTTTTCGTACCTGCTATCTCGAAGAGAGTTCAAATGGGGCACCCCAGAG AAGCCGCTTAGCGATCTCGGGCTGCTCTCATACCGGAACTATTGGAAGATCAAGATGGCA CAAGTCCTGCGAAGCTTGAAACCGATAATAGCGAAGTCGACTTATTTTTGCATTTCTTTG GAGGATATCGCCAATTTGACGGGGATGACACCAACCGATGTCATATTCGGTCTGGAACAG TTGGGGTGCTTCTACAGATACTCATCTATTGACGGGAGAAAATACGCCATAAGAATAGAC AGCTGGGATGAGATAGAACATATTTACCAGCAATGGGAATCCAAGGCATACATCACTTTG GACCCAGATAAGCTGATCTGGCGCCCGCTAATCTTCGGACCCTCCTGTGGGATAAACGCT GTAGGACCATGTGAAACTAGCACAGGCGCACACAGAAATACTGGTGCTGTGGGCTCCAGT CAAGGGACCGATTTATTCAAGAACAGTATATCTGTTTTGGTTAACTTTATGCATGATGAT ATTTCAGATTCCAGGACCATGGAAAAAATGGCATGGGACAAAATTGTAGCACAACAGAAG AATGCGCGTGAGGTCCAGAAGGATGCAACAGATCCTGCCATTCTGGAGTTGGCGTACGTG CACCCTTCTTTCACAAATGGCAGCACAGCGGAATATGGGGTCAAACCTGTAAGAAGAAAC CCGCCTGCGATTCGCGAAACCCCGAAGGTAGGCTCACCTGCAACGCCGGAGGAGAGGGAT GATATGACACCCTTAGAGGACGTGTATGAGGATTCATTTGATCAGACACACCTCAGCGAC 
CAGGATGACGAATACACGGATCAGGGAGAAGAGGAGGACGATGAAGAGGAGGACGATGAA GAGGACGATGGAATGAATGTCGGCCGATTTAACAGCAACGCATTACCCAGTAGGGCGTAC TCGAGACGAAGTGCTTCAGAGCATTCCAATAATCGAACAAGGTCGTCACGTCACCAGCAG TCTGTACCACATGTGGAATCAGACGATGACGAAGATCTTGAAGAACATTGGGTGGACGCA CTAGAAACTGTTTCGAGAACAAGGAGGATATTGCGAAACCGCACAGCTTAG
Predicted translation product
>ERGO0C10164g.aa MTISADSQAKKSKMLDGLDIKDIIQGDEQDGAGNGGSRRVHRPRDGNGPDYRIHKPMAAV GWLQGRIPSDVAEWGRWNAHVDFPDGTKNKVRIRYDPRKLYDFRRVLRASAAETGPPPHL DFEIYETENLLNMLDNESYPYRGAIANKKDYSTSKTTPTLRDRQFFQKLLFQSANAARTN GNTLLSFTEEEPKGRPSKAERAHKVSYIYFNDHEIKTWYTAPYPEEYSKNRILYICDSCL KYMNSKYIYYRHKLKCSMMHPPGNEIYRDGKISIWEIDGREQVIYCQNLCLLAKLFLNSK TLYYDVEPFVFYLLTENVKTGQGLKFNVVGYFSKEKLNTTDYNLSCILVLPTHQRLGYGH LLMDFSYLLSRREFKWGTPEKPLSDLGLLSYRNYWKIKMAQVLRSLKPIIAKSTYFCISL EDIANLTGMTPTDVIFGLEQLGCFYRYSSIDGRKYAIRIDSWDEIEHIYQQWESKAYITL DPDKLIWRPLIFGPSCGINAVGPCETSTGAHRNTGAVGSSQGTDLFKNSISVLVNFMHDD ISDSRTMEKMAWDKIVAQQKNAREVQKDATDPAILELAYVHPSFTNGSTAEYGVKPVRRN PPAIRETPKVGSPATPEERDDMTPLEDVYEDSFDQTHLSDQDDEYTDQGEEEDDEEEDDE EDDGMNVGRFNSNALPSRAYSRRSASEHSNNRTRSSRHQQSVPHVESDDDEDLEEHWVDA LETVSRTRRILRNRTA*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
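The length convention above can be checked against this element's own figures. The following is a minimal sketch, assuming an intronless CDS whose coordinates are 1-based and inclusive; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start: int, end: int) -> tuple[int, int, int]:
    """Return (nucleotides, codons, residues) for a 1-based inclusive span.

    Assumes an intronless CDS, so the genomic span equals the coding length.
    """
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    codons = nucleotides // 3   # codon count includes the stop codon
    residues = codons - 1       # the protein is one unit shorter
    return nucleotides, codons, residues


# Coordinates of this element on Ergo0C
print(cds_lengths(774173, 776383))  # (2211, 737, 736)
```

The result agrees with the record: 2211 nucleotides, 737 codons, and a 736-residue predicted translation product followed by the stop symbol `*`.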
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarily chosen so that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
The Genemark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the protein-coding potential of the open reading frame appears on frame #3.
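The analysed window can be derived from the element coordinates. This is a minimal sketch, assuming 1-based inclusive coordinates and clamping only at the start of the chromosome (the chromosome length is not given here); `genemark_window` is a hypothetical helper, not a Genemark function, and it does not reproduce Genemark's frame numbering.

```python
def genemark_window(orf_start: int, orf_end: int,
                    upstream: int = 1000, downstream: int = 300) -> tuple[int, int]:
    """Return the (start, end) of the region scanned around an ORF.

    Coordinates are 1-based and inclusive. The start is clamped to 1;
    the end is not clamped because the chromosome length is unknown here.
    """
    start = max(1, orf_start - upstream)
    end = orf_end + downstream
    return start, end


# Window around this element on Ergo0C
start, end = genemark_window(774173, 776383)
print(start, end, end - start + 1)  # 773173 776683 3511
```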
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| --- | --- | --- |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://www.genolevures.org/elt/ERGO/ERGO0C10164p