ERGO0G00528g
Syntenic homolog of Saccharomyces cerevisiae YMR307W (GAS1); Tandem gene duplication in this genome
Element type: CDS
Element length: 1668 nucleotides,
on sense strand of
Ergo0G: 49757..51424.
Other names:
AGL351W
AGOS_AGL351W
Coding sequence: 556 codons.
Element length: 1668 nucleotides,
on sense strand of
Ergo0G: 49757..51424.
Other names:
AGL351W
AGOS_AGL351W
Coding sequence: 556 codons.
Homologs and Orthologs
Homologs in protein families: GL3R0042 GL3R0042.F1 GL3R0042.N2Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Sequence data 
>ERGO0G00528g.nt ATGTTGCAGACAGCGATATACGGAACGGTTGCATGCCTCCTACAGGCTGTGGTCGCACAG AGAAACAATGGTGGGGCAGCGCTTAGCGCGACAGCCGGCACGGCGCCCTCGATTGCGACC TCCAGTCCTTCGACCAACAAGGTGCCGCCTATTGAGATCCATGGAAACAAGTTCTTCTAT TCAAACAACGGCTCCCAGTTTTACATGAAGGGGATTGCATACCAGGCGGAGACGTCTGAC GCTTCTGCGAGTGCCACGATCAACGACCCGCTAGCGGATTATGAGAGCTGTTCCCGGGAT TTGCCGTACTTGTTGGAGCTAAACACCAACACGCTTCGGGTATATGCAGTGAACAGTAGC TTGGACCATTCAAGATGCATGGAACTTTTTGAAAGCAACGGGATTTATATTGTTGCGGAT TTATCGGAACCAGCACTATCGATCAACAGAAACAACCCTGAATGGTCTTTGAAGCTTTTC GAAAGATATGCAGGTGTGGTTGACGAAATGCAGAAATACAGCAATGTGTTGGGGTTCTTT GCAGGAAATGAAGTAACTAACGAGATAAATAATACGGCAGCTTCCGCTTTTGTGAAAGCT GCTATTCGTGACACTAAGGCATACATTAAGGAAAAGGGCTATAGGGAAATTCCAGTGGGG TACTCAACCAATGATGACAGTAACTTTAGACAAGAGATTGCGGATTACTTTGCGTGCGGG TCTCAAGAGGAGAAGGCGGACTTTCTATGGGTTCAACGTCTATTTCATGGTGCGGGAGAT TCTACCTTTGAAAAATCAGGCTACTCAGACCAGACGGAAGAGTTTTCCAACTTGGGTATC CCAGTGTTCTTCTCCGAATATGGCTGTAACGAAGTCAGGCCTCGGAAGTTTGGCGAAGTT TCTGCTCTATACGGTAGCGATATGACAGATGTCTGGTCAGGAGGTATTGTATACATGTAT TTTGAAGAAACCAATCAATACGGCTTGGTAACCGTCGACAGCAGTGGCCGCGTTTCTACC AACGATGACTATAACAACTTGAAGACAGCATTGGCCACAATTTCGCCATCATCTGCAAAC AAAGATTCATACACCGCAAGTTCTGGCTCTGTCGCATGTCCTACAACTGGCTCAAACTGG CAAGCAGCAACTAGCTTACCACCATCACCACAAAAGGATGTTTGTGATTGTATAAAGAGT GCATTAAGATGTGTCATTTCTCCAGATGTCGACCAGAAGGATTACTCGGAATTATTTGGC TATCTATGCTCCGAAGCGGACGTTGATTGTTCAGACATTTCTGCCGATGGAACTACAGGA AACTATGGAGCATTTTCCTTCTGCGACGACGAGACCAAGCTATCATACCTATTGAACAAG TATTACCAAGAAAAGGGCAGGTCCTCCTCCTCTGCATGTGACTTCAGCGGCTCTGCAACG CTAGTCTCCGCCACAGGCACAGCATCTACATGTGTTCCTACATCTACCGGGATAAACTTG GGCTCATCTTCATCCAACAGCGGCAGTGGCAGAGGTGACAGCACTACCGCCACCTCTTCA GCAGCCGCAGCCGCAGATGCTATTCAACCGAACACCTTCTCAATGGCTTTATTCATCCCT ACCGCTTTAGTATTTATGCTTACCGGTTTTGGCATAGCTATGTCTTGA
>ERGO0G00528g.cds ATGTTGCAGACAGCGATATACGGAACGGTTGCATGCCTCCTACAGGCTGTGGTCGCACAG AGAAACAATGGTGGGGCAGCGCTTAGCGCGACAGCCGGCACGGCGCCCTCGATTGCGACC TCCAGTCCTTCGACCAACAAGGTGCCGCCTATTGAGATCCATGGAAACAAGTTCTTCTAT TCAAACAACGGCTCCCAGTTTTACATGAAGGGGATTGCATACCAGGCGGAGACGTCTGAC GCTTCTGCGAGTGCCACGATCAACGACCCGCTAGCGGATTATGAGAGCTGTTCCCGGGAT TTGCCGTACTTGTTGGAGCTAAACACCAACACGCTTCGGGTATATGCAGTGAACAGTAGC TTGGACCATTCAAGATGCATGGAACTTTTTGAAAGCAACGGGATTTATATTGTTGCGGAT TTATCGGAACCAGCACTATCGATCAACAGAAACAACCCTGAATGGTCTTTGAAGCTTTTC GAAAGATATGCAGGTGTGGTTGACGAAATGCAGAAATACAGCAATGTGTTGGGGTTCTTT GCAGGAAATGAAGTAACTAACGAGATAAATAATACGGCAGCTTCCGCTTTTGTGAAAGCT GCTATTCGTGACACTAAGGCATACATTAAGGAAAAGGGCTATAGGGAAATTCCAGTGGGG TACTCAACCAATGATGACAGTAACTTTAGACAAGAGATTGCGGATTACTTTGCGTGCGGG TCTCAAGAGGAGAAGGCGGACTTTCTATGGGTTCAACGTCTATTTCATGGTGCGGGAGAT TCTACCTTTGAAAAATCAGGCTACTCAGACCAGACGGAAGAGTTTTCCAACTTGGGTATC CCAGTGTTCTTCTCCGAATATGGCTGTAACGAAGTCAGGCCTCGGAAGTTTGGCGAAGTT TCTGCTCTATACGGTAGCGATATGACAGATGTCTGGTCAGGAGGTATTGTATACATGTAT TTTGAAGAAACCAATCAATACGGCTTGGTAACCGTCGACAGCAGTGGCCGCGTTTCTACC AACGATGACTATAACAACTTGAAGACAGCATTGGCCACAATTTCGCCATCATCTGCAAAC AAAGATTCATACACCGCAAGTTCTGGCTCTGTCGCATGTCCTACAACTGGCTCAAACTGG CAAGCAGCAACTAGCTTACCACCATCACCACAAAAGGATGTTTGTGATTGTATAAAGAGT GCATTAAGATGTGTCATTTCTCCAGATGTCGACCAGAAGGATTACTCGGAATTATTTGGC TATCTATGCTCCGAAGCGGACGTTGATTGTTCAGACATTTCTGCCGATGGAACTACAGGA AACTATGGAGCATTTTCCTTCTGCGACGACGAGACCAAGCTATCATACCTATTGAACAAG TATTACCAAGAAAAGGGCAGGTCCTCCTCCTCTGCATGTGACTTCAGCGGCTCTGCAACG CTAGTCTCCGCCACAGGCACAGCATCTACATGTGTTCCTACATCTACCGGGATAAACTTG GGCTCATCTTCATCCAACAGCGGCAGTGGCAGAGGTGACAGCACTACCGCCACCTCTTCA GCAGCCGCAGCCGCAGATGCTATTCAACCGAACACCTTCTCAATGGCTTTATTCATCCCT ACCGCTTTAGTATTTATGCTTACCGGTTTTGGCATAGCTATGTCTTGA
>ERGO0G00528g.aa MLQTAIYGTVACLLQAVVAQRNNGGAALSATAGTAPSIATSSPSTNKVPPIEIHGNKFFY SNNGSQFYMKGIAYQAETSDASASATINDPLADYESCSRDLPYLLELNTNTLRVYAVNSS LDHSRCMELFESNGIYIVADLSEPALSINRNNPEWSLKLFERYAGVVDEMQKYSNVLGFF AGNEVTNEINNTAASAFVKAAIRDTKAYIKEKGYREIPVGYSTNDDSNFRQEIADYFACG SQEEKADFLWVQRLFHGAGDSTFEKSGYSDQTEEFSNLGIPVFFSEYGCNEVRPRKFGEV SALYGSDMTDVWSGGIVYMYFEETNQYGLVTVDSSGRVSTNDDYNNLKTALATISPSSAN KDSYTASSGSVACPTTGSNWQAATSLPPSPQKDVCDCIKSALRCVISPDVDQKDYSELFG YLCSEADVDCSDISADGTTGNYGAFSFCDDETKLSYLLNKYYQEKGRSSSSACDFSGSAT LVSATGTASTCVPTSTGINLGSSSSNSGSGRGDSTTATSSAAAAADAIQPNTFSMALFIP TALVFMLTGFGIAMS*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/ERGO/ERGO0G00528p