Element type: CDS
Element length: 2166 nucleotides,
on anti-sense strand of
Ergo0E: complement(1295961..1298126).
Other names:
AER358C
AGOS_AER358C
Coding sequence: 722 codons.
Element length: 2166 nucleotides,
on anti-sense strand of
Ergo0E: complement(1295961..1298126).
Other names:
AER358C
AGOS_AER358C
Coding sequence: 722 codons.
Homologs and Orthologs
Homologs in protein families: GL3C0456 GL3C0456.F1 GL3C0456.N1Orthologs by synteny: ZYRO0E08338g SAKL0F02068g KLTH0G01914g KLLA0F09933g
Protein domain map
Sequence data 
Nucleotide sequence
>ERGO0E16500g.nt ATGACAAAGGCATCAGTGGTGGACCAGTCCGCGCCGGCGTACGCGCCCAAGCGGCTGCTG GCAGAGGCGCGCGCGGCGTCGAAGGTGAACATCGAGCAGGTCTTCGCGTTTCTGGAAGGC TCGCCGGAGAAGGCGGCGCTGACGAACGAGCTACTGGCGGAGTTTGCAGCCGACCCTGCG ATCACGCAGGGCCCGGAGTACTACGACCTCACAAAGGCCGAGCAGCGGGAGCAGACGGTG AAGAAGATCGCGCGGCTGGCGCTGTACTTGGAGAATGACATTAAGCTGGCACGCAAGCAG CACCACAAGGACGTGGTGCGGGACCTGCAGTCGCCGGACGCGCCGATGGTGACTATGAGC GACATGGAACGCTTCGAGAAGCGCTCAACGCTGGTGGCGCTGATCGACCCGCAGCTGGCA ACGCGGCTGGGCGTGAACCTGAGCTTGTTCGGTAATGCCGTGCGGGGTAACGGCACGGAC GAACAGATCAAGTATTGGCTGCAGGAGCGCGGGCTCATCTTCGTGAAGGGCATCTATGGC TGCTTCGCGATGACAGAGCTAGGCCATGGGTCCAACGTGGCGAACCTGCAGACACGCGCT ACGTACGACCCTGCGAGCGACTCGTTTGTGATTCAGACGCCCGACCTTGTCGCGACGAAG TGGTGGATCGGCGGTGCTGCGCACAGCGCGACGCACTCGACCGTGTACGCCCGTCTGATC GTGGAGGGCAAGGACTACGGCGTGAAGGTCTTCGTGGTGCCTCTGCGCAACCCCAAGACC ATGGAGTTGCTGGCCGGGATTTCCATCGGCGACATCGGCTCCAAGATGGGCCGCGACGGT ATCGACAACGGCTGGATCCAGTTTAACAATGTGCGTATTCCCCGTGAGTACATGCTGAGC CGGTTTACGAAGGTGATCCCCGGCAACCCGCCAAAGGTTGAGATGGAGCCTCTGTTGGAC TCCATCTCCGGCTACGCCGCGTTGCTGTCCGGACGTGTGAGCATGGTATTGGACTCCTAC CGCTTTGGCGCACGCTTCTCCACCATCGCCACGCGGTATGCCTTTGGCAGACAGCAGTTT GGTGACCCAACCAATGAGACCCAGCTAATAGAGTACCCATTGCACCAGTTCCGTGTTCTC CCTCAGCTTGCCATAATATACATGATGGCGCCGGGCGCGATGAAGTTGATGGACACATAC AACAGCTGTTTGGGTGAGTTGTACGGTGCTGGCGATGACAAGAAGAAGTTGACTACTGTT AGCGCCAGAATGAAGGACTTGTTTGTGGAGTCTGCCAGTTTGAAGGCCACCTGCACTTGG TTGACTTCGACGTTGATCGACGAGTTGAGACAGACCTGCGGTGGCCACGGGTACTCCAGC TACAACGGTTTCGGAAAGGCATACAACGACTGGGTCGTTCAGTGCACTTGGGAAGGCGAC AACAACGTTCTGTGTTTGACCTCTGGTAAGTCGCTGCTCAAGAAGTTCGCTGGTATTGTT CGTGGCAAGAAGGTGACTATCTGTGACACCTCCATGGACTACCTCCGCATGGACTACATC CAGAAGGTGGTTATGGGCGGCACCAAAAAGGTGAGCAACTTATCCACACTTCCAGACTAC TACCAGATCTGGTCGGTTATCTTGGTGAAGTACTTGAAGCGCTGCGCCGAGACTGTCCGT GACAACAACGACCCAGAATCTGTGTCCAAGCTGCTCGTGAGTATCGCCAAGTTCCACGCA TTCTACTCTATGCTCCAGGAGTTCCACCGCAAGTTGGCCTCTGACCAGAGCCACGTGGGC GACGCCGCAACCAAGGAGGTCTTGTGGAAGGTCTACAAGCTCTCCTCGCTCTACTTCATC GACAAGTTCAGCGGCGAGTTCCAGCAGTTGAAGGTCATGTCCCCAGACCAGATGACGAAC GTGCAGGAGCAGATGTTGGCTATCCTGCCTGAGATCAAGACACACGCCATCCGTCTAACT GACGCCTTTCACCTTCCTGACGCCGTGATCAACTCATCTATCGGCAACTACGACGGCGAC ATTTACCACAACTACTTCAACGATGTCACCCGTGTTGCCGCCAAGGACAAGGCTCCAGGT GTGCCCCCATACGCGGACAATGCTTGTCAACTTAGTGGGCTCGTGACGACCAGTTCGACA ATTTGA
Coding sequence
>ERGO0E16500g.cds ATGACAAAGGCATCAGTGGTGGACCAGTCCGCGCCGGCGTACGCGCCCAAGCGGCTGCTG GCAGAGGCGCGCGCGGCGTCGAAGGTGAACATCGAGCAGGTCTTCGCGTTTCTGGAAGGC TCGCCGGAGAAGGCGGCGCTGACGAACGAGCTACTGGCGGAGTTTGCAGCCGACCCTGCG ATCACGCAGGGCCCGGAGTACTACGACCTCACAAAGGCCGAGCAGCGGGAGCAGACGGTG AAGAAGATCGCGCGGCTGGCGCTGTACTTGGAGAATGACATTAAGCTGGCACGCAAGCAG CACCACAAGGACGTGGTGCGGGACCTGCAGTCGCCGGACGCGCCGATGGTGACTATGAGC GACATGGAACGCTTCGAGAAGCGCTCAACGCTGGTGGCGCTGATCGACCCGCAGCTGGCA ACGCGGCTGGGCGTGAACCTGAGCTTGTTCGGTAATGCCGTGCGGGGTAACGGCACGGAC GAACAGATCAAGTATTGGCTGCAGGAGCGCGGGCTCATCTTCGTGAAGGGCATCTATGGC TGCTTCGCGATGACAGAGCTAGGCCATGGGTCCAACGTGGCGAACCTGCAGACACGCGCT ACGTACGACCCTGCGAGCGACTCGTTTGTGATTCAGACGCCCGACCTTGTCGCGACGAAG TGGTGGATCGGCGGTGCTGCGCACAGCGCGACGCACTCGACCGTGTACGCCCGTCTGATC GTGGAGGGCAAGGACTACGGCGTGAAGGTCTTCGTGGTGCCTCTGCGCAACCCCAAGACC ATGGAGTTGCTGGCCGGGATTTCCATCGGCGACATCGGCTCCAAGATGGGCCGCGACGGT ATCGACAACGGCTGGATCCAGTTTAACAATGTGCGTATTCCCCGTGAGTACATGCTGAGC CGGTTTACGAAGGTGATCCCCGGCAACCCGCCAAAGGTTGAGATGGAGCCTCTGTTGGAC TCCATCTCCGGCTACGCCGCGTTGCTGTCCGGACGTGTGAGCATGGTATTGGACTCCTAC CGCTTTGGCGCACGCTTCTCCACCATCGCCACGCGGTATGCCTTTGGCAGACAGCAGTTT GGTGACCCAACCAATGAGACCCAGCTAATAGAGTACCCATTGCACCAGTTCCGTGTTCTC CCTCAGCTTGCCATAATATACATGATGGCGCCGGGCGCGATGAAGTTGATGGACACATAC AACAGCTGTTTGGGTGAGTTGTACGGTGCTGGCGATGACAAGAAGAAGTTGACTACTGTT AGCGCCAGAATGAAGGACTTGTTTGTGGAGTCTGCCAGTTTGAAGGCCACCTGCACTTGG TTGACTTCGACGTTGATCGACGAGTTGAGACAGACCTGCGGTGGCCACGGGTACTCCAGC TACAACGGTTTCGGAAAGGCATACAACGACTGGGTCGTTCAGTGCACTTGGGAAGGCGAC AACAACGTTCTGTGTTTGACCTCTGGTAAGTCGCTGCTCAAGAAGTTCGCTGGTATTGTT CGTGGCAAGAAGGTGACTATCTGTGACACCTCCATGGACTACCTCCGCATGGACTACATC CAGAAGGTGGTTATGGGCGGCACCAAAAAGGTGAGCAACTTATCCACACTTCCAGACTAC TACCAGATCTGGTCGGTTATCTTGGTGAAGTACTTGAAGCGCTGCGCCGAGACTGTCCGT GACAACAACGACCCAGAATCTGTGTCCAAGCTGCTCGTGAGTATCGCCAAGTTCCACGCA TTCTACTCTATGCTCCAGGAGTTCCACCGCAAGTTGGCCTCTGACCAGAGCCACGTGGGC GACGCCGCAACCAAGGAGGTCTTGTGGAAGGTCTACAAGCTCTCCTCGCTCTACTTCATC GACAAGTTCAGCGGCGAGTTCCAGCAGTTGAAGGTCATGTCCCCAGACCAGATGACGAAC GTGCAGGAGCAGATGTTGGCTATCCTGCCTGAGATCAAGACACACGCCATCCGTCTAACT GACGCCTTTCACCTTCCTGACGCCGTGATCAACTCATCTATCGGCAACTACGACGGCGAC ATTTACCACAACTACTTCAACGATGTCACCCGTGTTGCCGCCAAGGACAAGGCTCCAGGT GTGCCCCCATACGCGGACAATGCTTGTCAACTTAGTGGGCTCGTGACGACCAGTTCGACA ATTTGA
Predicted translation product
>ERGO0E16500g.aa MTKASVVDQSAPAYAPKRLLAEARAASKVNIEQVFAFLEGSPEKAALTNELLAEFAADPA ITQGPEYYDLTKAEQREQTVKKIARLALYLENDIKLARKQHHKDVVRDLQSPDAPMVTMS DMERFEKRSTLVALIDPQLATRLGVNLSLFGNAVRGNGTDEQIKYWLQERGLIFVKGIYG CFAMTELGHGSNVANLQTRATYDPASDSFVIQTPDLVATKWWIGGAAHSATHSTVYARLI VEGKDYGVKVFVVPLRNPKTMELLAGISIGDIGSKMGRDGIDNGWIQFNNVRIPREYMLS RFTKVIPGNPPKVEMEPLLDSISGYAALLSGRVSMVLDSYRFGARFSTIATRYAFGRQQF GDPTNETQLIEYPLHQFRVLPQLAIIYMMAPGAMKLMDTYNSCLGELYGAGDDKKKLTTV SARMKDLFVESASLKATCTWLTSTLIDELRQTCGGHGYSSYNGFGKAYNDWVVQCTWEGD NNVLCLTSGKSLLKKFAGIVRGKKVTICDTSMDYLRMDYIQKVVMGGTKKVSNLSTLPDY YQIWSVILVKYLKRCAETVRDNNDPESVSKLLVSIAKFHAFYSMLQEFHRKLASDQSHVG DAATKEVLWKVYKLSSLYFIDKFSGEFQQLKVMSPDQMTNVQEQMLAILPEIKTHAIRLT DAFHLPDAVINSSIGNYDGDIYHNYFNDVTRVAAKDKAPGVPPYADNACQLSGLVTTSST I*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/ERGO/ERGO0E16500p