Element type: CDS
Element length: 2112 nucleotides,
on sense strand of
Ergo0E: 550556..552667.
Other names:
AEL044W
AGOS_AEL044W
Coding sequence: 704 codons.
Element length: 2112 nucleotides,
on sense strand of
Ergo0E: 550556..552667.
Other names:
AEL044W
AGOS_AEL044W
Coding sequence: 704 codons.
Homologs and Orthologs
Homologs in protein families: GL3R2291 GL3R2291.F1 GL3R2291.N1Orthologs by synteny: ZYRO0E03828g SAKL0D10538g KLTH0H13794g KLLA0A11176g
Protein domain map
Sequence data 
Nucleotide sequence
>ERGO0E07282g.nt ATGGCTGGTGTACCTGATAACGTCAAGGGCGTGGTTGAGCTGGACCCCTGGTTAGCTCCT TACGGGGACATCCTCTCTGCGAGACGGTTCCTTGCCGACAAGTGGAGGCACGATATCGAA CATGCGGTGCCCGGCGGGCGGCGCAGTCTAGTTGAGTTTGCGCGCGACGCATACAAGAGC TACGGGCTGCACGCGGACGCGCAGAGCAAAAGCATAACGTACAGGGAGTGGGCGCCCAAT GCAACCCGGGCGTTTCTAGTCGGCGACTTCAACGGGTGGGATGAGACCTCGCACGAGCTC CAGAACAAGGACGAGTTCGGGGTGTTCACGGGTGTGTTCGGACCTGGGGCGGACGGCGAT TTCATGATTCCGCATGACTCACGCGTGAAGGTGGTGTTCGAGCTTGCCGACGGGAGCCGG ATACACCGGTTGCCAGCGTGGATCAAAAGGGCGACGCAGCCCAGCAAGGAGACCGCGAAG GAGTGGGGGCCGTCGTACGAGGCGCGGTTCTGGAACCCTGCCAGCCCCTACAAATTCAAG CACGAAAGGCCGCGCCTGGACCCGAACGTGGAGTCTCTGAGAATATACGAGGCACACGTG GGCATCTCGACGCCGGAGCCGCGGGTTGGCAGCTACAGCGAGTTCACCAAGGATGTGCTG CCGCGCATCAGGGATCTCGGATACAACGCGATACAGCTGATGGCGATCATGGAGCACGCG TACTACGCGTCGTTCGGCTACCAGGTCACGAACTTCTTTGCTGTTTCTTCGCGGTACGGG ACGCCAGAGGAGCTCAAGGAGCTCATCGACACTGCCCACGGAATGGGCATCCAGGTGCTG CTTGACGTTGTGCACTCCCACGCCTCGAAGAACGTCTCCGACGGACTGAACATGTTCGAT GGCACCGACTACCAGTACTTCCACTCCATCAGCTCCGGACGCGGCGAGCACCCGCTGTGG GACTCGAGGCTGTTCAACTACGGCAGCTTTGAGGTGCAACGGTTTTTGCTGGCGAACCTT GCATTTTACATCGATGTCTATCAGTTCGACGGGTTCCGCTTCGATGGCGTGACCTCGATG CTTTATCATCACCACGGCGTCGGAGAGCGCGGCGCGTTCAGTGGTGACTATAACGAGTAT CTCTCCGACCATTCGGGCGTTGACCACGAGGCGCTGGCGTACCTCATGCTGGCCAATGAC TTGATCCATGACATGCTGCCCGCCAACGGCGTGACCGTTGCTGAAGACGTGTCCGGTTAT CCAACCCTTTGCTTGCCCCGGTCTGTGGGTGGCTGTGGATTTGATTACCGCCTCGCCATG GCGCTGCCAGATATGTGGATTAAGCTGCTAAAGGAGAGCAAGGACGAGGACTGGAGCATG GGCCACATTGTCTACACGCTTGTCAACAGGCGCTACAAAGAAAAGGTCGTCGCGTATGCA GAGTCGCACGACCAGGCGCTCGTGGGCGATAAAACGCTCGCGTTCTGGATGATGGACGCC GCGATGTACACCGACATGACGGTGCTGAAGGAGCTCACGCCGGTGGTCGACCGGGGCATC GCGCTGCACAAGCTGATCCGCCTGATCACGCACTCGCTCGGCGGCGAATCCTACCTGAAC TTTGAAGGGAATGAGTTTGGTCATCCCGAGTGGCTGGACTTCCCGAACGCCAACAATGGC GACAGCTACCAGTACGCCCGCCGCCAGTTCAATCTGGTGGACGACGGCCTCCTCCGCTAT AAGCACCTCTATGCCTTCGATAAGGCCATGCAGGAGGCAGAGGGCAAGCACAAGTGGCTG AATACTCCCCAGGCCTACGTCTCGCTGAAACATGAGACAGACAAGGTCATCTCCTTCGAG CGCAACGGCCTCGTGTTCATCTTCAACTTCCATCCGACCCAGTCCTTCACGGACTACCGC ATCGGAGTTGACGAGGCGGGCGCGTACCGTATAATCCTCAACTCGGACAGAGAGGAGTTC GGCGGGCACCGCCGCATAGAGGAGGAAAACTCCGTATTCCACACCACAGACCTCGAGTGG AACGGCAGAAGAAACTTCATCCAAGTCTATCTGCCCTCGAGAACCGCGCTGGTTCTGGCG CGCAACCCCTGA
Coding sequence
>ERGO0E07282g.cds ATGGCTGGTGTACCTGATAACGTCAAGGGCGTGGTTGAGCTGGACCCCTGGTTAGCTCCT TACGGGGACATCCTCTCTGCGAGACGGTTCCTTGCCGACAAGTGGAGGCACGATATCGAA CATGCGGTGCCCGGCGGGCGGCGCAGTCTAGTTGAGTTTGCGCGCGACGCATACAAGAGC TACGGGCTGCACGCGGACGCGCAGAGCAAAAGCATAACGTACAGGGAGTGGGCGCCCAAT GCAACCCGGGCGTTTCTAGTCGGCGACTTCAACGGGTGGGATGAGACCTCGCACGAGCTC CAGAACAAGGACGAGTTCGGGGTGTTCACGGGTGTGTTCGGACCTGGGGCGGACGGCGAT TTCATGATTCCGCATGACTCACGCGTGAAGGTGGTGTTCGAGCTTGCCGACGGGAGCCGG ATACACCGGTTGCCAGCGTGGATCAAAAGGGCGACGCAGCCCAGCAAGGAGACCGCGAAG GAGTGGGGGCCGTCGTACGAGGCGCGGTTCTGGAACCCTGCCAGCCCCTACAAATTCAAG CACGAAAGGCCGCGCCTGGACCCGAACGTGGAGTCTCTGAGAATATACGAGGCACACGTG GGCATCTCGACGCCGGAGCCGCGGGTTGGCAGCTACAGCGAGTTCACCAAGGATGTGCTG CCGCGCATCAGGGATCTCGGATACAACGCGATACAGCTGATGGCGATCATGGAGCACGCG TACTACGCGTCGTTCGGCTACCAGGTCACGAACTTCTTTGCTGTTTCTTCGCGGTACGGG ACGCCAGAGGAGCTCAAGGAGCTCATCGACACTGCCCACGGAATGGGCATCCAGGTGCTG CTTGACGTTGTGCACTCCCACGCCTCGAAGAACGTCTCCGACGGACTGAACATGTTCGAT GGCACCGACTACCAGTACTTCCACTCCATCAGCTCCGGACGCGGCGAGCACCCGCTGTGG GACTCGAGGCTGTTCAACTACGGCAGCTTTGAGGTGCAACGGTTTTTGCTGGCGAACCTT GCATTTTACATCGATGTCTATCAGTTCGACGGGTTCCGCTTCGATGGCGTGACCTCGATG CTTTATCATCACCACGGCGTCGGAGAGCGCGGCGCGTTCAGTGGTGACTATAACGAGTAT CTCTCCGACCATTCGGGCGTTGACCACGAGGCGCTGGCGTACCTCATGCTGGCCAATGAC TTGATCCATGACATGCTGCCCGCCAACGGCGTGACCGTTGCTGAAGACGTGTCCGGTTAT CCAACCCTTTGCTTGCCCCGGTCTGTGGGTGGCTGTGGATTTGATTACCGCCTCGCCATG GCGCTGCCAGATATGTGGATTAAGCTGCTAAAGGAGAGCAAGGACGAGGACTGGAGCATG GGCCACATTGTCTACACGCTTGTCAACAGGCGCTACAAAGAAAAGGTCGTCGCGTATGCA GAGTCGCACGACCAGGCGCTCGTGGGCGATAAAACGCTCGCGTTCTGGATGATGGACGCC GCGATGTACACCGACATGACGGTGCTGAAGGAGCTCACGCCGGTGGTCGACCGGGGCATC GCGCTGCACAAGCTGATCCGCCTGATCACGCACTCGCTCGGCGGCGAATCCTACCTGAAC TTTGAAGGGAATGAGTTTGGTCATCCCGAGTGGCTGGACTTCCCGAACGCCAACAATGGC GACAGCTACCAGTACGCCCGCCGCCAGTTCAATCTGGTGGACGACGGCCTCCTCCGCTAT AAGCACCTCTATGCCTTCGATAAGGCCATGCAGGAGGCAGAGGGCAAGCACAAGTGGCTG AATACTCCCCAGGCCTACGTCTCGCTGAAACATGAGACAGACAAGGTCATCTCCTTCGAG CGCAACGGCCTCGTGTTCATCTTCAACTTCCATCCGACCCAGTCCTTCACGGACTACCGC ATCGGAGTTGACGAGGCGGGCGCGTACCGTATAATCCTCAACTCGGACAGAGAGGAGTTC GGCGGGCACCGCCGCATAGAGGAGGAAAACTCCGTATTCCACACCACAGACCTCGAGTGG AACGGCAGAAGAAACTTCATCCAAGTCTATCTGCCCTCGAGAACCGCGCTGGTTCTGGCG CGCAACCCCTGA
Predicted translation product
>ERGO0E07282g.aa MAGVPDNVKGVVELDPWLAPYGDILSARRFLADKWRHDIEHAVPGGRRSLVEFARDAYKS YGLHADAQSKSITYREWAPNATRAFLVGDFNGWDETSHELQNKDEFGVFTGVFGPGADGD FMIPHDSRVKVVFELADGSRIHRLPAWIKRATQPSKETAKEWGPSYEARFWNPASPYKFK HERPRLDPNVESLRIYEAHVGISTPEPRVGSYSEFTKDVLPRIRDLGYNAIQLMAIMEHA YYASFGYQVTNFFAVSSRYGTPEELKELIDTAHGMGIQVLLDVVHSHASKNVSDGLNMFD GTDYQYFHSISSGRGEHPLWDSRLFNYGSFEVQRFLLANLAFYIDVYQFDGFRFDGVTSM LYHHHGVGERGAFSGDYNEYLSDHSGVDHEALAYLMLANDLIHDMLPANGVTVAEDVSGY PTLCLPRSVGGCGFDYRLAMALPDMWIKLLKESKDEDWSMGHIVYTLVNRRYKEKVVAYA ESHDQALVGDKTLAFWMMDAAMYTDMTVLKELTPVVDRGIALHKLIRLITHSLGGESYLN FEGNEFGHPEWLDFPNANNGDSYQYARRQFNLVDDGLLRYKHLYAFDKAMQEAEGKHKWL NTPQAYVSLKHETDKVISFERNGLVFIFNFHPTQSFTDYRIGVDEAGAYRIILNSDREEF GGHRRIEEENSVFHTTDLEWNGRRNFIQVYLPSRTALVLARNP*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/ERGO/ERGO0E07282p