Element type: CDS
Element length: 2196 nucleotides,
on sense strand of
Ergo0G: 529321..531516.
Other names:
AGL094W
AGOS_AGL094W
Coding sequence: 732 codons.
Element length: 2196 nucleotides,
on sense strand of
Ergo0G: 529321..531516.
Other names:
AGL094W
AGOS_AGL094W
Coding sequence: 732 codons.
Homologs and Orthologs
Homologs in protein families: GL3C0213 GL3C0213.F1 GL3C0213.N2Orthologs by synteny: ZYRO0A09284g SAKL0H13090g KLTH0F07942g KLLA0A04235g
Protein domain map
Sequence data 
Nucleotide sequence
>ERGO0G06292g.nt ATGGCAGGAAAGAAGCAGAGAGCCAAGGGCCAGTCTGGAAACGCGAATAGTGCCGGTAAA AGGAATGAAGTGGTGCAGGATGAGAGGTTGCAAGCTGTGGTGTTGACGGACTCGTTTGAG ACCCGGTTTATGCCACTGACATATGAGATGCCGCGGTGCTTGATGCCATTGGCAAATGTG CCGCTGATAGAGTATACGCTCGAGTTTCTGGCCAAGGCTGGTGTACATGAGGTGTACCTG ATTTGTACTTCCCACGCAGAGCAGGTACAGGAATACATCGACAACTCGAAGTGGAACTTG CCGTGGTCGCCATTCAAGGTCTCGACTATCTTGGCTCTTGAGGCGCGGTCTGTGGGGGAT GCTATGCGAGATCTGGACAACCGTGGGCTCATAACTGGTGATTTCATTCTTGTCAGCGGA GACCTAGTGACCAACATGGAGTTTGACAGGGCCTTAGAGGCGCACCGTGCCAGACGTGCG GAGGACAAAGAGCATATCGTAACGATGTGCTTGAGTACAGCAACACAGTCGCACAGAACG AGATCATGCGAGCCGGCAGTCTTCATGCTGGATAAATCCAACGACAGATGTCTGTACTAC CAGGACATCCCGTTGGCCTCTTCCAAGAGGAAGACAGCAGTAGATATTGACCCAGAGCTG CTAGAAGGCGTTGAAGAGTTTAAGCTGCGCAATGATCTGATTGACTGCCACATCGACATT TGTTCGCCTCTTGTCCCGGCAATTTTCCAGGAGAATTTTGACTACCAGTACCTGCGTAGA GATTTTGTGAAAGGTGTTCTGTCCAGCGATCTGCTAAAGAAGCACATCTATGCATATATC ACGAAGGAATATGCGGCAAGGGTAGAAAGCTGGCAGACGTACGATGCCATATCTCAGGAC TTTTTGGCCAGGTGGTGCTACCCTCTGGTCCTGGACAGCAACTTACTGGAAGATCAGACA TACTCGTACGAGTCCAAGCACGTTTACAAAGAGAAAGACGTTATACTGGCGCAGTCATGC AAGATTGGCAAGTGCACTGCAATCGGCTCTGGTTCTACCATCGGCGAGGGGACCTTTATT GAAAACTCGGTGATCGGCCGCAACTGCCAGATTGGCGCCAATGTCAAAATCATCAACAGT TATATCTGGGAAAATAGCATCATCGGCGACAACAGCGTCCTCAACCACTCCATTGTAGCA GCCGGGGCAAAGCTCGGCTCCGCCGTGACGCTGGAAGATGGCTGCGTCATCGGCTTCAAC GTCGTCGTGGCCTCGAGCAAGACCATCCCCAGCGGCACCAGGATTTCCGCCGCGCCCATC ACCGTGTCTTCGATGTCGGTGCCCACGCCGTCGTACAGCAGCGACGAGGATAGCGACGAG GACAGCGGTCTGCACCCCGCCACCAGGTCCAAGTCCACTGCAAAGGCTGTGCAGCTGGTG GGCGAGAGCGGCGTCGGCTACATCTACGAGAGCGATGACTCGGACGACTCGGAGGAAACC GACGACTGCGACGGCGCAGGAGCAAACACGCTCTGCACCCGCATGGACGAGCTCTATCTG TCCGACTCGTCCATCTATTCCGCCTCCAAGACCAAGAAGCGCAGAACCCTCTCCACCGCT AGCTACTACACCGACTGGGAGGACGGCTCCGACGCCTCAGAGGAGGAGGAGGACTTCGAG AAGGAGGCCATCGCCACCGTCGAACGTGCCATGGAGCACAACCACGACATCGACACCGCC CTGCTCGAGCTCAACACCCTGCGCATGTCGCTCAACGTCACCTACCACGAGGTACGCGCC GCCACCGTCTCCGCCATGCTCAAGCGCGTCTACCACTTCATCGCCACGCAGACCCTCGGC CCCAAGGACGCCGTCGCCAAGGTCTTCGCTCACTGGGGCCCGCTCTTCCGCCGCCAAGCC TTCACCGCGGCCGAGTACGTCGACCTTATGGACGTCATCCTCCACCGTGTCGTCTCCATG CGCTTCGACCGCCCGGACTTCATCCTCTTCTGCGTCTATAGCTGTCTCTACGACAGCGAC ATCCTCGACGAGGACGTAGTATACCAGTGGTGGGCCGCCGCCGCCGCCGACCCCGCCCAC GCTGACGTCAAGACCCTGACCGCTAAGTGGGTCGACTGGCTGCAGACCGCCGACGAGGAG TCGGGCGACGACTCGGGCGACGACTCCGATGAGTAA
Coding sequence
>ERGO0G06292g.cds ATGGCAGGAAAGAAGCAGAGAGCCAAGGGCCAGTCTGGAAACGCGAATAGTGCCGGTAAA AGGAATGAAGTGGTGCAGGATGAGAGGTTGCAAGCTGTGGTGTTGACGGACTCGTTTGAG ACCCGGTTTATGCCACTGACATATGAGATGCCGCGGTGCTTGATGCCATTGGCAAATGTG CCGCTGATAGAGTATACGCTCGAGTTTCTGGCCAAGGCTGGTGTACATGAGGTGTACCTG ATTTGTACTTCCCACGCAGAGCAGGTACAGGAATACATCGACAACTCGAAGTGGAACTTG CCGTGGTCGCCATTCAAGGTCTCGACTATCTTGGCTCTTGAGGCGCGGTCTGTGGGGGAT GCTATGCGAGATCTGGACAACCGTGGGCTCATAACTGGTGATTTCATTCTTGTCAGCGGA GACCTAGTGACCAACATGGAGTTTGACAGGGCCTTAGAGGCGCACCGTGCCAGACGTGCG GAGGACAAAGAGCATATCGTAACGATGTGCTTGAGTACAGCAACACAGTCGCACAGAACG AGATCATGCGAGCCGGCAGTCTTCATGCTGGATAAATCCAACGACAGATGTCTGTACTAC CAGGACATCCCGTTGGCCTCTTCCAAGAGGAAGACAGCAGTAGATATTGACCCAGAGCTG CTAGAAGGCGTTGAAGAGTTTAAGCTGCGCAATGATCTGATTGACTGCCACATCGACATT TGTTCGCCTCTTGTCCCGGCAATTTTCCAGGAGAATTTTGACTACCAGTACCTGCGTAGA GATTTTGTGAAAGGTGTTCTGTCCAGCGATCTGCTAAAGAAGCACATCTATGCATATATC ACGAAGGAATATGCGGCAAGGGTAGAAAGCTGGCAGACGTACGATGCCATATCTCAGGAC TTTTTGGCCAGGTGGTGCTACCCTCTGGTCCTGGACAGCAACTTACTGGAAGATCAGACA TACTCGTACGAGTCCAAGCACGTTTACAAAGAGAAAGACGTTATACTGGCGCAGTCATGC AAGATTGGCAAGTGCACTGCAATCGGCTCTGGTTCTACCATCGGCGAGGGGACCTTTATT GAAAACTCGGTGATCGGCCGCAACTGCCAGATTGGCGCCAATGTCAAAATCATCAACAGT TATATCTGGGAAAATAGCATCATCGGCGACAACAGCGTCCTCAACCACTCCATTGTAGCA GCCGGGGCAAAGCTCGGCTCCGCCGTGACGCTGGAAGATGGCTGCGTCATCGGCTTCAAC GTCGTCGTGGCCTCGAGCAAGACCATCCCCAGCGGCACCAGGATTTCCGCCGCGCCCATC ACCGTGTCTTCGATGTCGGTGCCCACGCCGTCGTACAGCAGCGACGAGGATAGCGACGAG GACAGCGGTCTGCACCCCGCCACCAGGTCCAAGTCCACTGCAAAGGCTGTGCAGCTGGTG GGCGAGAGCGGCGTCGGCTACATCTACGAGAGCGATGACTCGGACGACTCGGAGGAAACC GACGACTGCGACGGCGCAGGAGCAAACACGCTCTGCACCCGCATGGACGAGCTCTATCTG TCCGACTCGTCCATCTATTCCGCCTCCAAGACCAAGAAGCGCAGAACCCTCTCCACCGCT AGCTACTACACCGACTGGGAGGACGGCTCCGACGCCTCAGAGGAGGAGGAGGACTTCGAG AAGGAGGCCATCGCCACCGTCGAACGTGCCATGGAGCACAACCACGACATCGACACCGCC CTGCTCGAGCTCAACACCCTGCGCATGTCGCTCAACGTCACCTACCACGAGGTACGCGCC GCCACCGTCTCCGCCATGCTCAAGCGCGTCTACCACTTCATCGCCACGCAGACCCTCGGC CCCAAGGACGCCGTCGCCAAGGTCTTCGCTCACTGGGGCCCGCTCTTCCGCCGCCAAGCC TTCACCGCGGCCGAGTACGTCGACCTTATGGACGTCATCCTCCACCGTGTCGTCTCCATG CGCTTCGACCGCCCGGACTTCATCCTCTTCTGCGTCTATAGCTGTCTCTACGACAGCGAC ATCCTCGACGAGGACGTAGTATACCAGTGGTGGGCCGCCGCCGCCGCCGACCCCGCCCAC GCTGACGTCAAGACCCTGACCGCTAAGTGGGTCGACTGGCTGCAGACCGCCGACGAGGAG TCGGGCGACGACTCGGGCGACGACTCCGATGAGTAA
Predicted translation product
>ERGO0G06292g.aa MAGKKQRAKGQSGNANSAGKRNEVVQDERLQAVVLTDSFETRFMPLTYEMPRCLMPLANV PLIEYTLEFLAKAGVHEVYLICTSHAEQVQEYIDNSKWNLPWSPFKVSTILALEARSVGD AMRDLDNRGLITGDFILVSGDLVTNMEFDRALEAHRARRAEDKEHIVTMCLSTATQSHRT RSCEPAVFMLDKSNDRCLYYQDIPLASSKRKTAVDIDPELLEGVEEFKLRNDLIDCHIDI CSPLVPAIFQENFDYQYLRRDFVKGVLSSDLLKKHIYAYITKEYAARVESWQTYDAISQD FLARWCYPLVLDSNLLEDQTYSYESKHVYKEKDVILAQSCKIGKCTAIGSGSTIGEGTFI ENSVIGRNCQIGANVKIINSYIWENSIIGDNSVLNHSIVAAGAKLGSAVTLEDGCVIGFN VVVASSKTIPSGTRISAAPITVSSMSVPTPSYSSDEDSDEDSGLHPATRSKSTAKAVQLV GESGVGYIYESDDSDDSEETDDCDGAGANTLCTRMDELYLSDSSIYSASKTKKRRTLSTA SYYTDWEDGSDASEEEEDFEKEAIATVERAMEHNHDIDTALLELNTLRMSLNVTYHEVRA ATVSAMLKRVYHFIATQTLGPKDAVAKVFAHWGPLFRRQAFTAAEYVDLMDVILHRVVSM RFDRPDFILFCVYSCLYDSDILDEDVVYQWWAAAAADPAHADVKTLTAKWVDWLQTADEE SGDDSGDDSDE*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/ERGO/ERGO0G06292p