ERGO0G11044g


AGOS_AGR119C, Syntenic homolog of Saccharomyces cerevisiae YER167W (BCK2)

Genomic environment map

Element type: CDS
Element length: 2406 nucleotides,
on anti-sense strand of
Ergo0G: complement(969095..971500).
Other names:
AGOS_AGR119C
AGR119C
Coding sequence: 802 codons.
Database cross references:
EMBL: AE016820
GeneID: 4623087
GenomeReviews: AE016820_GR
NMPDR: fig|33169.1.peg.4430

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C3499
Orthologs: strict determination not possible; homologs must be refined manually

Protein ERGO0G11044p  


Protein domain map

Protein length: 801 amino acids
Protein family: GL3C3499
Database cross references:
AGD: AGR119C
InterPro: IPR002048
KEGG: ago:AGOS_AGR119C
RefSeq: NP_986785.1
UniProtKB/TrEMBL: Q74ZS9
UniProtKB: Q74ZS9_ASHGO

Phylogeny  

PhylomeDB:Q74ZS9

Computed results for ERGO0G11044p  

None available yet

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>ERGO0G11044g.nt
ATGCGGCCTCAGAAGACACAGTTGCGCGCAACTGCCGATGATTGCGGTGGTGCGATCGAA
TCGACGTCGAAGTGGAAGATCCCGCACTACTACAAGAAGGCGTCCTCGGTGACACAGACA
GTGATTTCGGATCTGAACCCGGCTACGAGCTCGCAGCAGTGTGCGACGTACTCACCGACC
ATGGCGCTGAACGGGGGCAACAATCGCATCAATATTATGAACACACCGGTGGCGGTCCAG
CTGGAGGACCCAGACCAGCCGAAGACGCGGAAGGGCCAGAAGGCGAAGGCTAAGAAGAAT
GGCGAGATGGTGTTTGTGAACTTTACAGTGCAGGACACGACCGAGTCCCCAAAAAAGCCC
AAGTCGTCACGTCGCGAGAAGATGCTGCGCATCTTCAAGAGCAGTGAGCAGCTGAAGCTG
CGGCCGCATGATCTGGGCGAGCTGGCGCCCAGCTTGTCCAAAGAGCCCCGCTCGGCGCCC
GCGTCGACGCCGGCCTCCGCCTCGAAGCGTAACTACAGTTCATTCCTCAAGTGCGGCAAG
ATTGGGAGCAGCATCAACAGCAGTGCCGGGAGCGCGGTCGGAAAACGCAGCGGCGTGCCG
TCATCGCTGCATAACTGGAAAGCTGCTAGTAGCGAGGACGTGCTCATTAAGCGTTCCAGT
TCAAGCGTTGCGGCGCTGCACCCACTACGGCAGCACACCAGGCCCCAACTTAGCAGATCC
ATGAGCGCCAACATTATGGGTGTCAACCACAGTCGGTACAATAGTCTCTCGCCTGTTGTA
TCCCACAACAGCGCAGGCTCATCGGGTGCAGTGGGTGCTGTTGGAGAACCGGATAGCAAG
ATTGATTTGCTGAAGAAGAGGTTTCTTCAGACGGCCACTTATCTGCATGGCGACCCCAGC
GACTACGGTACTGCGACGAAGTCCCCGACAGGAAATCCCCACACTAGCAAGATTCCTATG
GGGATGGACGTTCAGGACTTCGGCAGCACAAACATGTATGATGATTATGATGATAGTGAC
GAGATAGGGGAGGCAGATTTCGATCCACATTATTTGGGAACATCAGAAGATGGCTACAAA
ATGCAGCTCTCAGATAACGGACAGGACGGTGCTGCTACTTTGAAACCTAATTTGCATACG
TCTGACACATTCTCTGGTAGCTATTATCAGGTGGACGAAAATGATGCCTCGATTGCTTTT
AGCAAAATGTTCACAAGGAAAAGAGCAAACACTGGCGGGTCCCTAAGTTCTTTTGTTTCT
AATACACAAACCATATCGCCCTATAACATCCCCGGTAATATCAGCACGCATTCAATTCCT
TCGCAGCGCTATTCTCCCATCAGATCCCACTCCCCGGGGCGGCCGCGATCCAATACACGT
GGTTCTTTAACACACAGAAATACGCGTGATCTGAGTTCAGCATACTTGTCTGCAAATAAT
GTTAGTGGCTTGTCTTCGCAGTCAGGTGCTGCGTTGTTCAACCCGTCCGAACCTGTAGTC
GGAATGGATACGTTTATTGATAGTCAGTACAAGTCAAGGTACGGGCAGAAGAAAAAGCAT
GACGGTGTTTGCGATACGCGCAAACCATACCAGGCTCCTGCAGGCTTCAGCAGTACTTCT
TCCACATCGCATGCTTCCACTCCTTCGGTTGCAGATGCTGGAGGTCTCGGCAGCGGTCCC
TTATCCATGGACCTAAATGGATTCTGGGCATTGGAACGGGATACGTCATTCATACACGAC
GTATACGACGAGCAACCTCTACCTTCCGCCAAGGGTATTCAAGCGGCAACTATAGAAGAA
GACGAGGAGGTCGAGGTTGATCTGGGTTCCCTTCCAAACACCAGTAATAGTAGCAAATGC
ATTGAGCTGTCTTTGGAGCCGTCTACGTCTGTGGGGAGCTCTTCATGTCAGTATTACCAT
GAAACTCAAGGGTTTACGCTCATTAATACTGGTCTTGGTGCCACTACTGTCGCGCTACCG
AAAAAGGATCGCCTGAAGCAGGAAGAAACTACACAGCATGGGGCCATGTTCTTTGGGGAG
CCCCTGGGAGGTTATGCCGCTGCCCCAACAGATAGCAATCTTCTAGATGCTTACATGGAC
CTTGATCTGGAAAATACACTAACCCTGCTTTCAGGTGAAAATGATATAACCGGCTTGGAC
GTCAGTGGCGAAGGGAACATGGGGTATTCGACTATGTATCATGATGGGCTTGCCCCCTCG
GCTGCTACTAACATATCTCCGACAACCCTCACTGGTGCGGAGGCCCATTCTGAGGATGAA
ACAACCAGTTCCGACGTACGCCACACACAACAGGCTCAACAGGTTTTTCTGCATTCCCGC
TCGGTAGACGCTGACCAGCCACAGCCGCGAGTGGACTTTTATGGGTTAAATGATGGTAAG
ACATAG

Coding sequence    

>ERGO0G11044g.cds
ATGCGGCCTCAGAAGACACAGTTGCGCGCAACTGCCGATGATTGCGGTGGTGCGATCGAA
TCGACGTCGAAGTGGAAGATCCCGCACTACTACAAGAAGGCGTCCTCGGTGACACAGACA
GTGATTTCGGATCTGAACCCGGCTACGAGCTCGCAGCAGTGTGCGACGTACTCACCGACC
ATGGCGCTGAACGGGGGCAACAATCGCATCAATATTATGAACACACCGGTGGCGGTCCAG
CTGGAGGACCCAGACCAGCCGAAGACGCGGAAGGGCCAGAAGGCGAAGGCTAAGAAGAAT
GGCGAGATGGTGTTTGTGAACTTTACAGTGCAGGACACGACCGAGTCCCCAAAAAAGCCC
AAGTCGTCACGTCGCGAGAAGATGCTGCGCATCTTCAAGAGCAGTGAGCAGCTGAAGCTG
CGGCCGCATGATCTGGGCGAGCTGGCGCCCAGCTTGTCCAAAGAGCCCCGCTCGGCGCCC
GCGTCGACGCCGGCCTCCGCCTCGAAGCGTAACTACAGTTCATTCCTCAAGTGCGGCAAG
ATTGGGAGCAGCATCAACAGCAGTGCCGGGAGCGCGGTCGGAAAACGCAGCGGCGTGCCG
TCATCGCTGCATAACTGGAAAGCTGCTAGTAGCGAGGACGTGCTCATTAAGCGTTCCAGT
TCAAGCGTTGCGGCGCTGCACCCACTACGGCAGCACACCAGGCCCCAACTTAGCAGATCC
ATGAGCGCCAACATTATGGGTGTCAACCACAGTCGGTACAATAGTCTCTCGCCTGTTGTA
TCCCACAACAGCGCAGGCTCATCGGGTGCAGTGGGTGCTGTTGGAGAACCGGATAGCAAG
ATTGATTTGCTGAAGAAGAGGTTTCTTCAGACGGCCACTTATCTGCATGGCGACCCCAGC
GACTACGGTACTGCGACGAAGTCCCCGACAGGAAATCCCCACACTAGCAAGATTCCTATG
GGGATGGACGTTCAGGACTTCGGCAGCACAAACATGTATGATGATTATGATGATAGTGAC
GAGATAGGGGAGGCAGATTTCGATCCACATTATTTGGGAACATCAGAAGATGGCTACAAA
ATGCAGCTCTCAGATAACGGACAGGACGGTGCTGCTACTTTGAAACCTAATTTGCATACG
TCTGACACATTCTCTGGTAGCTATTATCAGGTGGACGAAAATGATGCCTCGATTGCTTTT
AGCAAAATGTTCACAAGGAAAAGAGCAAACACTGGCGGGTCCCTAAGTTCTTTTGTTTCT
AATACACAAACCATATCGCCCTATAACATCCCCGGTAATATCAGCACGCATTCAATTCCT
TCGCAGCGCTATTCTCCCATCAGATCCCACTCCCCGGGGCGGCCGCGATCCAATACACGT
GGTTCTTTAACACACAGAAATACGCGTGATCTGAGTTCAGCATACTTGTCTGCAAATAAT
GTTAGTGGCTTGTCTTCGCAGTCAGGTGCTGCGTTGTTCAACCCGTCCGAACCTGTAGTC
GGAATGGATACGTTTATTGATAGTCAGTACAAGTCAAGGTACGGGCAGAAGAAAAAGCAT
GACGGTGTTTGCGATACGCGCAAACCATACCAGGCTCCTGCAGGCTTCAGCAGTACTTCT
TCCACATCGCATGCTTCCACTCCTTCGGTTGCAGATGCTGGAGGTCTCGGCAGCGGTCCC
TTATCCATGGACCTAAATGGATTCTGGGCATTGGAACGGGATACGTCATTCATACACGAC
GTATACGACGAGCAACCTCTACCTTCCGCCAAGGGTATTCAAGCGGCAACTATAGAAGAA
GACGAGGAGGTCGAGGTTGATCTGGGTTCCCTTCCAAACACCAGTAATAGTAGCAAATGC
ATTGAGCTGTCTTTGGAGCCGTCTACGTCTGTGGGGAGCTCTTCATGTCAGTATTACCAT
GAAACTCAAGGGTTTACGCTCATTAATACTGGTCTTGGTGCCACTACTGTCGCGCTACCG
AAAAAGGATCGCCTGAAGCAGGAAGAAACTACACAGCATGGGGCCATGTTCTTTGGGGAG
CCCCTGGGAGGTTATGCCGCTGCCCCAACAGATAGCAATCTTCTAGATGCTTACATGGAC
CTTGATCTGGAAAATACACTAACCCTGCTTTCAGGTGAAAATGATATAACCGGCTTGGAC
GTCAGTGGCGAAGGGAACATGGGGTATTCGACTATGTATCATGATGGGCTTGCCCCCTCG
GCTGCTACTAACATATCTCCGACAACCCTCACTGGTGCGGAGGCCCATTCTGAGGATGAA
ACAACCAGTTCCGACGTACGCCACACACAACAGGCTCAACAGGTTTTTCTGCATTCCCGC
TCGGTAGACGCTGACCAGCCACAGCCGCGAGTGGACTTTTATGGGTTAAATGATGGTAAG
ACATAG

Predicted translation product    

>ERGO0G11044g.aa
MRPQKTQLRATADDCGGAIESTSKWKIPHYYKKASSVTQTVISDLNPATSSQQCATYSPT
MALNGGNNRINIMNTPVAVQLEDPDQPKTRKGQKAKAKKNGEMVFVNFTVQDTTESPKKP
KSSRREKMLRIFKSSEQLKLRPHDLGELAPSLSKEPRSAPASTPASASKRNYSSFLKCGK
IGSSINSSAGSAVGKRSGVPSSLHNWKAASSEDVLIKRSSSSVAALHPLRQHTRPQLSRS
MSANIMGVNHSRYNSLSPVVSHNSAGSSGAVGAVGEPDSKIDLLKKRFLQTATYLHGDPS
DYGTATKSPTGNPHTSKIPMGMDVQDFGSTNMYDDYDDSDEIGEADFDPHYLGTSEDGYK
MQLSDNGQDGAATLKPNLHTSDTFSGSYYQVDENDASIAFSKMFTRKRANTGGSLSSFVS
NTQTISPYNIPGNISTHSIPSQRYSPIRSHSPGRPRSNTRGSLTHRNTRDLSSAYLSANN
VSGLSSQSGAALFNPSEPVVGMDTFIDSQYKSRYGQKKKHDGVCDTRKPYQAPAGFSSTS
STSHASTPSVADAGGLGSGPLSMDLNGFWALERDTSFIHDVYDEQPLPSAKGIQAATIEE
DEEVEVDLGSLPNTSNSSKCIELSLEPSTSVGSSSCQYYHETQGFTLINTGLGATTVALP
KKDRLKQEETTQHGAMFFGEPLGGYAAAPTDSNLLDAYMDLDLENTLTLLSGENDITGLD
VSGEGNMGYSTMYHDGLAPSAATNISPTTLTGAEAHSEDETTSSDVRHTQQAQQVFLHSR
SVDADQPQPRVDFYGLNDGKT*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites