Element type: CDS
Element length: 2406 nucleotides, on anti-sense strand of Ergo0G: complement(969095..971500).
Other names:
AGOS_AGR119C
AGR119C
Coding sequence: 802 codons.
Database cross references:
EMBL: AE016820
GeneID: 4623087
GenomeReviews: AE016820_GR
NMPDR: fig|33169.1.peg.4430
Homologs and Orthologs
Homologs in protein family: GL3C3499
Orthologs: strict determination not possible; homologs must be refined manually
Protein domain map
Database cross references:
AGD: AGR119C
InterPro: IPR002048
KEGG: ago:AGOS_AGR119C
RefSeq: NP_986785.1
UniProtKB/TrEMBL: Q74ZS9
UniProtKB: Q74ZS9_ASHGO
Phylogeny
PhylomeDB: Q74ZS9
Sequence data
>ERGO0G11044g.nt ATGCGGCCTCAGAAGACACAGTTGCGCGCAACTGCCGATGATTGCGGTGGTGCGATCGAA TCGACGTCGAAGTGGAAGATCCCGCACTACTACAAGAAGGCGTCCTCGGTGACACAGACA GTGATTTCGGATCTGAACCCGGCTACGAGCTCGCAGCAGTGTGCGACGTACTCACCGACC ATGGCGCTGAACGGGGGCAACAATCGCATCAATATTATGAACACACCGGTGGCGGTCCAG CTGGAGGACCCAGACCAGCCGAAGACGCGGAAGGGCCAGAAGGCGAAGGCTAAGAAGAAT GGCGAGATGGTGTTTGTGAACTTTACAGTGCAGGACACGACCGAGTCCCCAAAAAAGCCC AAGTCGTCACGTCGCGAGAAGATGCTGCGCATCTTCAAGAGCAGTGAGCAGCTGAAGCTG CGGCCGCATGATCTGGGCGAGCTGGCGCCCAGCTTGTCCAAAGAGCCCCGCTCGGCGCCC GCGTCGACGCCGGCCTCCGCCTCGAAGCGTAACTACAGTTCATTCCTCAAGTGCGGCAAG ATTGGGAGCAGCATCAACAGCAGTGCCGGGAGCGCGGTCGGAAAACGCAGCGGCGTGCCG TCATCGCTGCATAACTGGAAAGCTGCTAGTAGCGAGGACGTGCTCATTAAGCGTTCCAGT TCAAGCGTTGCGGCGCTGCACCCACTACGGCAGCACACCAGGCCCCAACTTAGCAGATCC ATGAGCGCCAACATTATGGGTGTCAACCACAGTCGGTACAATAGTCTCTCGCCTGTTGTA TCCCACAACAGCGCAGGCTCATCGGGTGCAGTGGGTGCTGTTGGAGAACCGGATAGCAAG ATTGATTTGCTGAAGAAGAGGTTTCTTCAGACGGCCACTTATCTGCATGGCGACCCCAGC GACTACGGTACTGCGACGAAGTCCCCGACAGGAAATCCCCACACTAGCAAGATTCCTATG GGGATGGACGTTCAGGACTTCGGCAGCACAAACATGTATGATGATTATGATGATAGTGAC GAGATAGGGGAGGCAGATTTCGATCCACATTATTTGGGAACATCAGAAGATGGCTACAAA ATGCAGCTCTCAGATAACGGACAGGACGGTGCTGCTACTTTGAAACCTAATTTGCATACG TCTGACACATTCTCTGGTAGCTATTATCAGGTGGACGAAAATGATGCCTCGATTGCTTTT AGCAAAATGTTCACAAGGAAAAGAGCAAACACTGGCGGGTCCCTAAGTTCTTTTGTTTCT AATACACAAACCATATCGCCCTATAACATCCCCGGTAATATCAGCACGCATTCAATTCCT TCGCAGCGCTATTCTCCCATCAGATCCCACTCCCCGGGGCGGCCGCGATCCAATACACGT GGTTCTTTAACACACAGAAATACGCGTGATCTGAGTTCAGCATACTTGTCTGCAAATAAT GTTAGTGGCTTGTCTTCGCAGTCAGGTGCTGCGTTGTTCAACCCGTCCGAACCTGTAGTC GGAATGGATACGTTTATTGATAGTCAGTACAAGTCAAGGTACGGGCAGAAGAAAAAGCAT GACGGTGTTTGCGATACGCGCAAACCATACCAGGCTCCTGCAGGCTTCAGCAGTACTTCT TCCACATCGCATGCTTCCACTCCTTCGGTTGCAGATGCTGGAGGTCTCGGCAGCGGTCCC TTATCCATGGACCTAAATGGATTCTGGGCATTGGAACGGGATACGTCATTCATACACGAC GTATACGACGAGCAACCTCTACCTTCCGCCAAGGGTATTCAAGCGGCAACTATAGAAGAA GACGAGGAGGTCGAGGTTGATCTGGGTTCCCTTCCAAACACCAGTAATAGTAGCAAATGC ATTGAGCTGTCTTTGGAGCCGTCTACGTCTGTGGGGAGCTCTTCATGTCAGTATTACCAT 
GAAACTCAAGGGTTTACGCTCATTAATACTGGTCTTGGTGCCACTACTGTCGCGCTACCG AAAAAGGATCGCCTGAAGCAGGAAGAAACTACACAGCATGGGGCCATGTTCTTTGGGGAG CCCCTGGGAGGTTATGCCGCTGCCCCAACAGATAGCAATCTTCTAGATGCTTACATGGAC CTTGATCTGGAAAATACACTAACCCTGCTTTCAGGTGAAAATGATATAACCGGCTTGGAC GTCAGTGGCGAAGGGAACATGGGGTATTCGACTATGTATCATGATGGGCTTGCCCCCTCG GCTGCTACTAACATATCTCCGACAACCCTCACTGGTGCGGAGGCCCATTCTGAGGATGAA ACAACCAGTTCCGACGTACGCCACACACAACAGGCTCAACAGGTTTTTCTGCATTCCCGC TCGGTAGACGCTGACCAGCCACAGCCGCGAGTGGACTTTTATGGGTTAAATGATGGTAAG ACATAG
>ERGO0G11044g.cds ATGCGGCCTCAGAAGACACAGTTGCGCGCAACTGCCGATGATTGCGGTGGTGCGATCGAA TCGACGTCGAAGTGGAAGATCCCGCACTACTACAAGAAGGCGTCCTCGGTGACACAGACA GTGATTTCGGATCTGAACCCGGCTACGAGCTCGCAGCAGTGTGCGACGTACTCACCGACC ATGGCGCTGAACGGGGGCAACAATCGCATCAATATTATGAACACACCGGTGGCGGTCCAG CTGGAGGACCCAGACCAGCCGAAGACGCGGAAGGGCCAGAAGGCGAAGGCTAAGAAGAAT GGCGAGATGGTGTTTGTGAACTTTACAGTGCAGGACACGACCGAGTCCCCAAAAAAGCCC AAGTCGTCACGTCGCGAGAAGATGCTGCGCATCTTCAAGAGCAGTGAGCAGCTGAAGCTG CGGCCGCATGATCTGGGCGAGCTGGCGCCCAGCTTGTCCAAAGAGCCCCGCTCGGCGCCC GCGTCGACGCCGGCCTCCGCCTCGAAGCGTAACTACAGTTCATTCCTCAAGTGCGGCAAG ATTGGGAGCAGCATCAACAGCAGTGCCGGGAGCGCGGTCGGAAAACGCAGCGGCGTGCCG TCATCGCTGCATAACTGGAAAGCTGCTAGTAGCGAGGACGTGCTCATTAAGCGTTCCAGT TCAAGCGTTGCGGCGCTGCACCCACTACGGCAGCACACCAGGCCCCAACTTAGCAGATCC ATGAGCGCCAACATTATGGGTGTCAACCACAGTCGGTACAATAGTCTCTCGCCTGTTGTA TCCCACAACAGCGCAGGCTCATCGGGTGCAGTGGGTGCTGTTGGAGAACCGGATAGCAAG ATTGATTTGCTGAAGAAGAGGTTTCTTCAGACGGCCACTTATCTGCATGGCGACCCCAGC GACTACGGTACTGCGACGAAGTCCCCGACAGGAAATCCCCACACTAGCAAGATTCCTATG GGGATGGACGTTCAGGACTTCGGCAGCACAAACATGTATGATGATTATGATGATAGTGAC GAGATAGGGGAGGCAGATTTCGATCCACATTATTTGGGAACATCAGAAGATGGCTACAAA ATGCAGCTCTCAGATAACGGACAGGACGGTGCTGCTACTTTGAAACCTAATTTGCATACG TCTGACACATTCTCTGGTAGCTATTATCAGGTGGACGAAAATGATGCCTCGATTGCTTTT AGCAAAATGTTCACAAGGAAAAGAGCAAACACTGGCGGGTCCCTAAGTTCTTTTGTTTCT AATACACAAACCATATCGCCCTATAACATCCCCGGTAATATCAGCACGCATTCAATTCCT TCGCAGCGCTATTCTCCCATCAGATCCCACTCCCCGGGGCGGCCGCGATCCAATACACGT GGTTCTTTAACACACAGAAATACGCGTGATCTGAGTTCAGCATACTTGTCTGCAAATAAT GTTAGTGGCTTGTCTTCGCAGTCAGGTGCTGCGTTGTTCAACCCGTCCGAACCTGTAGTC GGAATGGATACGTTTATTGATAGTCAGTACAAGTCAAGGTACGGGCAGAAGAAAAAGCAT GACGGTGTTTGCGATACGCGCAAACCATACCAGGCTCCTGCAGGCTTCAGCAGTACTTCT TCCACATCGCATGCTTCCACTCCTTCGGTTGCAGATGCTGGAGGTCTCGGCAGCGGTCCC TTATCCATGGACCTAAATGGATTCTGGGCATTGGAACGGGATACGTCATTCATACACGAC GTATACGACGAGCAACCTCTACCTTCCGCCAAGGGTATTCAAGCGGCAACTATAGAAGAA GACGAGGAGGTCGAGGTTGATCTGGGTTCCCTTCCAAACACCAGTAATAGTAGCAAATGC ATTGAGCTGTCTTTGGAGCCGTCTACGTCTGTGGGGAGCTCTTCATGTCAGTATTACCAT 
GAAACTCAAGGGTTTACGCTCATTAATACTGGTCTTGGTGCCACTACTGTCGCGCTACCG AAAAAGGATCGCCTGAAGCAGGAAGAAACTACACAGCATGGGGCCATGTTCTTTGGGGAG CCCCTGGGAGGTTATGCCGCTGCCCCAACAGATAGCAATCTTCTAGATGCTTACATGGAC CTTGATCTGGAAAATACACTAACCCTGCTTTCAGGTGAAAATGATATAACCGGCTTGGAC GTCAGTGGCGAAGGGAACATGGGGTATTCGACTATGTATCATGATGGGCTTGCCCCCTCG GCTGCTACTAACATATCTCCGACAACCCTCACTGGTGCGGAGGCCCATTCTGAGGATGAA ACAACCAGTTCCGACGTACGCCACACACAACAGGCTCAACAGGTTTTTCTGCATTCCCGC TCGGTAGACGCTGACCAGCCACAGCCGCGAGTGGACTTTTATGGGTTAAATGATGGTAAG ACATAG
>ERGO0G11044g.aa MRPQKTQLRATADDCGGAIESTSKWKIPHYYKKASSVTQTVISDLNPATSSQQCATYSPT MALNGGNNRINIMNTPVAVQLEDPDQPKTRKGQKAKAKKNGEMVFVNFTVQDTTESPKKP KSSRREKMLRIFKSSEQLKLRPHDLGELAPSLSKEPRSAPASTPASASKRNYSSFLKCGK IGSSINSSAGSAVGKRSGVPSSLHNWKAASSEDVLIKRSSSSVAALHPLRQHTRPQLSRS MSANIMGVNHSRYNSLSPVVSHNSAGSSGAVGAVGEPDSKIDLLKKRFLQTATYLHGDPS DYGTATKSPTGNPHTSKIPMGMDVQDFGSTNMYDDYDDSDEIGEADFDPHYLGTSEDGYK MQLSDNGQDGAATLKPNLHTSDTFSGSYYQVDENDASIAFSKMFTRKRANTGGSLSSFVS NTQTISPYNIPGNISTHSIPSQRYSPIRSHSPGRPRSNTRGSLTHRNTRDLSSAYLSANN VSGLSSQSGAALFNPSEPVVGMDTFIDSQYKSRYGQKKKHDGVCDTRKPYQAPAGFSSTS STSHASTPSVADAGGLGSGPLSMDLNGFWALERDTSFIHDVYDEQPLPSAKGIQAATIEE DEEVEVDLGSLPNTSNSSKCIELSLEPSTSVGSSSCQYYHETQGFTLINTGLGATTVALP KKDRLKQEETTQHGAMFFGEPLGGYAAAPTDSNLLDAYMDLDLENTLTLLSGENDITGLD VSGEGNMGYSTMYHDGLAPSAATNISPTTLTGAEAHSEDETTSSDVRHTQQAQQVFLHSR SVDADQPQPRVDFYGLNDGKT*
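The records above are FASTA-like, but the header and the sequence blocks share one line and are separated by spaces. The following is a minimal Python sketch of how such a record could be split and a coding sequence translated. It assumes the standard genetic code (NCBI translation table 1) applies to this element; the function names `parse_flat_record` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this table is the appropriate one for the element shown here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def parse_flat_record(text):
    """Split '>name SEQ SEQ ...' into (name, sequence) with whitespace removed."""
    header, *chunks = text.split()
    if not header.startswith(">"):
        raise ValueError("record must start with '>'")
    return header[1:], "".join(chunks).upper()


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the .cds record, with a stop codon appended
# purely for illustration (the real record is 2406 nucleotides long).
name, seq = parse_flat_record(">ERGO0G11044g.cds ATGCGGCCTCAGAAGACACAGTTGCGCGCA TAG")
print(name, translate(seq))  # ERGO0G11044g.cds MRPQKTQLRA*
```

The ten residues produced from the first ten codons agree with the start of the `.aa` record (`MRPQKTQLRA`).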
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
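As a worked check of this rule, the sketch below derives the three lengths from the location string given for this element. It assumes 1-based inclusive coordinates and a single uninterrupted span (no introns); `parse_location` and `element_lengths` are hypothetical helper names.

```python
import re


def parse_location(location):
    """Parse 'start..end' or 'complement(start..end)' into (start, end, strand)."""
    match = re.fullmatch(r"(complement\()?(\d+)\.\.(\d+)\)?", location.strip())
    if match is None:
        raise ValueError(f"unsupported location: {location!r}")
    strand = "-" if match.group(1) else "+"
    return int(match.group(2)), int(match.group(3)), strand


def element_lengths(location):
    """Return (nucleotides, codons including stop, protein residues)."""
    start, end, _ = parse_location(location)
    nucleotides = end - start + 1  # assumption: coordinates are 1-based, inclusive
    codons, remainder = divmod(nucleotides, 3)
    if remainder:
        raise ValueError("span is not a whole number of codons")
    return nucleotides, codons, codons - 1  # the stop codon encodes no residue


print(element_lengths("complement(969095..971500)"))  # (2406, 802, 801)
```

The result matches the record: 2406 nucleotides, 802 codons, and a translation product of 801 residues followed by the `*` stop marker.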
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
The Genemark computation of the protein-coding potential of the DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the protein-coding potential of the open reading frame appears in frame #3.
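The analysed window can be expressed in chromosome coordinates. The sketch below is an illustration only: `analysis_window` is a hypothetical helper, coordinates are assumed to be 1-based and inclusive, and the window is not clipped to the chromosome end because the chromosome length is not given here.

```python
def analysis_window(start, end, strand, upstream=1000, downstream=300):
    """Return the 1-based inclusive (lo, hi) span covering the ORF plus flanks."""
    if strand == "+":
        return max(1, start - upstream), end + downstream
    # On the reverse strand, "upstream" lies at higher chromosome coordinates.
    return max(1, start - downstream), end + upstream


lo, hi = analysis_window(969095, 971500, "-")
print(lo, hi, hi - lo + 1)  # 968795 972500 3706
```

For this element the window spans 1000 + 2406 + 300 = 3706 nucleotides.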
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| --- | --- | --- |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites | |
URL: http://192.168.122.177/elt/ERGO/ERGO0G11044g