ERGO0G16808g


AGOS_AGR379W, Syntenic homolog of Saccharomyces cerevisiae YGL150C (INO80)

Genomic environment map

Element type: CDS
Element length: 4245 nucleotides,
on sense strand of
Ergo0G: 1427329..1431573.
Other names:
AGOS_AGR379W
AGR379W
Coding sequence: 1415 codons.
Database cross references:
EMBL: AE016820
GeneID: 4623349
GenomeReviews: AE016820_GR
HOGENOM: HBG398013
NMPDR: fig|33169.1.peg.4690

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C0282
Orthologs: strict determination not possible; homologs must be refined manually

Protein ERGO0G16808p  


Protein domain map

Protein length: 1414 amino acids
Protein family: GL3C0282
Database cross references:
AGD: AGR379W
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
InterPro: IPR020838
KEGG: ago:AGOS_AGR379W
PROSITE: PS51192
PROSITE: PS51194
PROSITE: PS51413
Pfam: PF00176
Pfam: PF00271
RefSeq: NP_987045.2
SMART: SM00487
SMART: SM00490
UniProtKB/Swiss-Prot: Q74Z27
UniProtKB: INO80_ASHGO

Phylogeny  

PhylomeDB:Q74Z27

Computed results for ERGO0G16808p  

None available yet

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>ERGO0G16808g.nt
ATGTCGCTGGAGGCGCTACTAAACAAGGAGGAGAAGGGGTCGGGGGCGGCGCGCGAGGCG
TTCATGCGGCGGATGAACGAGCGGTTCAACAGCGTGTGCCACAAGGACGCGCGCGAGCAG
CAGTACCAGGACTGGAAGTACTTGAGCTACCAGGAGTTCGAGCTGGTGAACGAGTGGAGC
GCGGCGAGCCGGGAGCTGACGGTGAACCAGTGCGGGCAGCTGCTGTCGAACGTGAAGAGC
GCGCAGGCGGAGTGGCTGGCGTACGAGGAGTTCGTGGCGGGGCGTGCGCGGCTGGTGGCG
GAGGTGGAGGCGAAGCGCGAGCGGGAGCAGGAGGAGCAGGAGGAGCGGCAGCCGGCGCGG
GCGAAGGCGAAGGAGCGGGCGCGGGCGCGGGGCACGGCGAAGCGGGCCGTGGGCCGCAAG
GCGACGAAGGCGCCGGCAGCGCCCGCGGAGCAGGAGGGGGGCGTGCCGGCGGCGCGGGCG
GCGGCGAAGGCGCCGGTGCGCGCGGAGGGCGCGGCGGAGCCGCCATTGAAGCGCGAGCTG
AGCGGCGCGAAGCTGGAGGACGAGGGCGAGGGCGAGGATGACGACGAGGAGGAGGATGAG
GACGACGAGGACGGCGAGGACGAGGACGAGGACGAGGACGAGGAGGAGGAGGAGGAGGAG
GAGCTCGAGGAGCTCGAAGATATCCAGCTTCTGGACGATGACAACGACAAGGACTTCTCG
CCCGAAGGCGGTCGCTCGAAGTCCTCGATTAAGCTGAATACGAAGCTGGACATAAACTCC
GACATCGCCTTGATCCAGCGGGAGCTGCTTAAGATGGCGCAGAAGCATAAGAGTGCCAAG
GCGAAAAAGAGGAAGTTCACCAGCTGCGTCGTCCAGCGCTACGACTCGGACCACACACGG
CTAGAAGTAAAGGTGACCTTGAAGCAGCTCCACATCAAGCGCCTTAAGCGCTTGCTCAAT
GAGGCCAAGCGGAAGCGTGCTGCCGAGGAAGCGCTCGCAGCGAATGAGCAGCAGGGTAAC
CTGGCCAAGCGTCGCAAGACCGCGCAGAAACAGAAGCCCGCGGCGAACGGTGCGCCTGCT
GCTTCATCTTCAGAAGACGTGACCGTGACAGAGGCTGCCACGCCGGTACCCGCGAAAGTC
AACGGAGAGGCACCTTCGAACGAGTCTCCAGTAGCTGCTATTCCGGATATTAACCCCACC
ACTGGTTTGCCGACGTACGGGATGAAAATGACGGCGAAGGAGGCCAGAGCTATCCAGAGA
CACTATGATACTACATACATTACGGTATGGAAGGATATGGCACGCAAGGATTCCGCAAAG
CTTTCTCGGTTGGTGCAACAGATACAGTCGATCCGCTCCGCCAATTTCAAGAAAACATCA
TCTCTTGTTGCTCGGGAGGCCAGAAAATGGCAAAGTAGAAATTTCAGGCAAGTGAAGGAT
TTCCAGACGCGGGCGCGCAGAGGTGTCCGGGAAATGAGCAGCTTCTGGAAGAAGAACGAG
CGCGAAGAAAGAGAGTTGAAGAAGCGAGCCGAAAGGGAAGCCATCGAGCAGGCGAAGAAA
GAAGAGGAGGAAAGAGAGTCAAAGAGACAGGCTAGAAAGCTAAACTTTTTGTTGACGCAA
ACCGAACTATTTTCGCACTTTATTGGCAGCAAAATCAAGACCAATGAGCTAGAAGGCAAT
ATGGCAGATTCGAATCTGGCCACCGCTCCGGATGTCAGTGCAATTGACTTAAGTAAACCC
CCAACCAGAAAGAACGAAGTGCATACAATAGATTTTGACAACGAGGATGACGAGGAGCTA
CATCGCAAAGCCGCTCAAAATGCATCTAATGCTTTGAAGGAAACACGAGAGAAAGCAAAG
GCCTTTGATGGTATGTCTGGAGACGATGAGGAGCTAAACTTCCAGAATCCGACATCTTTG
GGTGAAATCACCATCGAGCAGCCAAAGATTCTCGCATGCACCCTAAAGGAGTACCAATTG
AAGGGTTTGAACTGGTTGGCAAACTTATATGACCAGGGCATCAATGGCATCCTAGCAGAT
GAAATGGGTTTGGGTAAGACAGTCCAGTCCATCTCAGTGCTAGCCCATTTAGCTGAGCGC
TATAATATTTGGGGGCCATTTATTGTTGTGACACCTGCGTCTACCCTACATAATTGGGTG
AACGAGATTCAGAAATTTGTTCCGGATTTCAAGATCCTACCATACTGGGGCAACGGTAAT
GATCGAAAGATATTGAGACGTTTTTGGGATCGGAAGCACTTGAGATATAGTAAGGATGCG
CCATTCCACGTTATGATTACTTCTTACCAAATGATTGTATCCGATGCTGCATATCTTCAG
AAAATGAAATGGCAATATATGATTTTGGATGAAGCCCAAGCAATCAAATCCTCGCAGTCC
TCTAGGTGGAAGAACTTGTTAAGCTTCCACTGTCGTAACAGGCTTCTACTAACCGGTACC
CCGATTCAAAATAGCATGCAGGAGCTATGGGCCTTGTTGCATTTTATTATGCCCTCCCTC
TTCGATTCTCACGATGAATTTAACGACTGGTTTTCGAAAGATATCGAGTCTCATGCTCAG
TCAAACACTCAATTGAATCAGCAACAACTACGCAGGCTACATATGATATTGAAGCCATTT
ATGTTGCGTCGGATTAAGAAGAATGTCCAATCTGAGTTAGGCGATAAGATCGAGATTGAT
GTCATGTGTGATCTAACTCATAGACAAGCAAAGCTTTACCAAGTTTTGAAATCTCAGGTG
TCTGCTTCTTATGATGCCATCGAAAACGCTGCTAGTAACAGCAGTGGTGATGACTCTGGA
AATATGTCGTTGTCTGACTCAAAGATAATGAACACAGTAATGGAATTCAGAAAAGTCTGT
AATCACCCTGATCTTTTTGAGCGTGCCGATGTGTCATCCCCCTTTTCCTTTACGTCCTTT
GGCCAAACAGGTTCCATAATGCGGGAGGGAGATGTAATTGATGTTCAGTATTCTTCAAAA
AACCCGGTATCCTTCCACTTGCCCAGGTTGATATATGATGATTTGATTCTTCCGAACTAC
AATCACGACAGTGACATGAGGACCAAGATCTTGAATCATATGATGAGTATATTTGCTCCT
GCGAATTCACCAGATTTATGTGCCACACTTTCAAAGGTAGCTGGTGTGGAGCCCAATAGC
ATTTTACGGCTATCGCAAGAGCATATTGTAAAGCGAGCCATCGATCTATCTGCGCACTCT
CCGAATGTAACTCGAAGTGGTATTTTTTCTGTTGTCTATGAGGATGACAAGAGTTCTCTT
TCTTCTCTAGATAAGACCTTACTCATCAATGATAAAAGTGACTACTTGCACACTATTGCG
CGGACTACGCAAAATGGTGTGTTAGCCTCGTTATTGAATATCCAGGGAAACTTTTATGAA
AATGAATATATGAACGTTTTGCGCCCGGCTTACCGCCCTGCCGCCGCAGCTCCACCAATT
AGTATTCATGTCATGGGCTCTAGCAACTTTTCAATCAAGAGAGATAATGCATTGTTTGAG
CCATATATTACGAGGAGTCTCGGCATAATACCGCCTGAACTACAGACAAGATTAACGGAG
AAAGAAAACAATATTTTTACCGCATTGCCTATTTCAGAGTTGTACCCAGCACCTTTGAAT
AAGAGTTTCTCATCCTACATTTCTATGCCTTCGATGGATAGGTTCATTACAGAATCTGCC
AAACTAAAGAAGTTAGATGAACTTCTAGTTCGCTTGAAAGCGGGAGAGCATCGTGTTTTG
ATATATTTCCAGATGACAAGAATGATGGATTTGATCGAAGAATATTTGACCTACAGACAA
TACAAACATATCAGATTAGATGGTTCTTCGAAACTAGAAGATCGCCGTGATTTAGTTCAC
GATTGGCAAACTAAATCCGATATTTTTATTTTCTTACTATCGACAAGGGCTGGTGGCTTA
GGTATTAATCTTACATCTGCTGACACAGTTATCTTCTATGATTCTGATTGGAATCCTACC
ATAGATTCTCAAGCCATGGATAGAGCTCATAGATTGGGTCAAACAAAACAAGTTACTGTT
TATAGATTGTTGATCAAGGGCACCATTGAAGAGAGAATGAGGGATCGTGCAAAACAGAAA
GAACATGTTCAGCAGGTTGTCATGGAGGGTAAAACCAAAGAGAACAACGTGCAGACTATT
ACAGCAAATGGTAAAACTCTTGAAAACCTGCCTCTGCCACTTTAA

Coding sequence    

>ERGO0G16808g.cds
ATGTCGCTGGAGGCGCTACTAAACAAGGAGGAGAAGGGGTCGGGGGCGGCGCGCGAGGCG
TTCATGCGGCGGATGAACGAGCGGTTCAACAGCGTGTGCCACAAGGACGCGCGCGAGCAG
CAGTACCAGGACTGGAAGTACTTGAGCTACCAGGAGTTCGAGCTGGTGAACGAGTGGAGC
GCGGCGAGCCGGGAGCTGACGGTGAACCAGTGCGGGCAGCTGCTGTCGAACGTGAAGAGC
GCGCAGGCGGAGTGGCTGGCGTACGAGGAGTTCGTGGCGGGGCGTGCGCGGCTGGTGGCG
GAGGTGGAGGCGAAGCGCGAGCGGGAGCAGGAGGAGCAGGAGGAGCGGCAGCCGGCGCGG
GCGAAGGCGAAGGAGCGGGCGCGGGCGCGGGGCACGGCGAAGCGGGCCGTGGGCCGCAAG
GCGACGAAGGCGCCGGCAGCGCCCGCGGAGCAGGAGGGGGGCGTGCCGGCGGCGCGGGCG
GCGGCGAAGGCGCCGGTGCGCGCGGAGGGCGCGGCGGAGCCGCCATTGAAGCGCGAGCTG
AGCGGCGCGAAGCTGGAGGACGAGGGCGAGGGCGAGGATGACGACGAGGAGGAGGATGAG
GACGACGAGGACGGCGAGGACGAGGACGAGGACGAGGACGAGGAGGAGGAGGAGGAGGAG
GAGCTCGAGGAGCTCGAAGATATCCAGCTTCTGGACGATGACAACGACAAGGACTTCTCG
CCCGAAGGCGGTCGCTCGAAGTCCTCGATTAAGCTGAATACGAAGCTGGACATAAACTCC
GACATCGCCTTGATCCAGCGGGAGCTGCTTAAGATGGCGCAGAAGCATAAGAGTGCCAAG
GCGAAAAAGAGGAAGTTCACCAGCTGCGTCGTCCAGCGCTACGACTCGGACCACACACGG
CTAGAAGTAAAGGTGACCTTGAAGCAGCTCCACATCAAGCGCCTTAAGCGCTTGCTCAAT
GAGGCCAAGCGGAAGCGTGCTGCCGAGGAAGCGCTCGCAGCGAATGAGCAGCAGGGTAAC
CTGGCCAAGCGTCGCAAGACCGCGCAGAAACAGAAGCCCGCGGCGAACGGTGCGCCTGCT
GCTTCATCTTCAGAAGACGTGACCGTGACAGAGGCTGCCACGCCGGTACCCGCGAAAGTC
AACGGAGAGGCACCTTCGAACGAGTCTCCAGTAGCTGCTATTCCGGATATTAACCCCACC
ACTGGTTTGCCGACGTACGGGATGAAAATGACGGCGAAGGAGGCCAGAGCTATCCAGAGA
CACTATGATACTACATACATTACGGTATGGAAGGATATGGCACGCAAGGATTCCGCAAAG
CTTTCTCGGTTGGTGCAACAGATACAGTCGATCCGCTCCGCCAATTTCAAGAAAACATCA
TCTCTTGTTGCTCGGGAGGCCAGAAAATGGCAAAGTAGAAATTTCAGGCAAGTGAAGGAT
TTCCAGACGCGGGCGCGCAGAGGTGTCCGGGAAATGAGCAGCTTCTGGAAGAAGAACGAG
CGCGAAGAAAGAGAGTTGAAGAAGCGAGCCGAAAGGGAAGCCATCGAGCAGGCGAAGAAA
GAAGAGGAGGAAAGAGAGTCAAAGAGACAGGCTAGAAAGCTAAACTTTTTGTTGACGCAA
ACCGAACTATTTTCGCACTTTATTGGCAGCAAAATCAAGACCAATGAGCTAGAAGGCAAT
ATGGCAGATTCGAATCTGGCCACCGCTCCGGATGTCAGTGCAATTGACTTAAGTAAACCC
CCAACCAGAAAGAACGAAGTGCATACAATAGATTTTGACAACGAGGATGACGAGGAGCTA
CATCGCAAAGCCGCTCAAAATGCATCTAATGCTTTGAAGGAAACACGAGAGAAAGCAAAG
GCCTTTGATGGTATGTCTGGAGACGATGAGGAGCTAAACTTCCAGAATCCGACATCTTTG
GGTGAAATCACCATCGAGCAGCCAAAGATTCTCGCATGCACCCTAAAGGAGTACCAATTG
AAGGGTTTGAACTGGTTGGCAAACTTATATGACCAGGGCATCAATGGCATCCTAGCAGAT
GAAATGGGTTTGGGTAAGACAGTCCAGTCCATCTCAGTGCTAGCCCATTTAGCTGAGCGC
TATAATATTTGGGGGCCATTTATTGTTGTGACACCTGCGTCTACCCTACATAATTGGGTG
AACGAGATTCAGAAATTTGTTCCGGATTTCAAGATCCTACCATACTGGGGCAACGGTAAT
GATCGAAAGATATTGAGACGTTTTTGGGATCGGAAGCACTTGAGATATAGTAAGGATGCG
CCATTCCACGTTATGATTACTTCTTACCAAATGATTGTATCCGATGCTGCATATCTTCAG
AAAATGAAATGGCAATATATGATTTTGGATGAAGCCCAAGCAATCAAATCCTCGCAGTCC
TCTAGGTGGAAGAACTTGTTAAGCTTCCACTGTCGTAACAGGCTTCTACTAACCGGTACC
CCGATTCAAAATAGCATGCAGGAGCTATGGGCCTTGTTGCATTTTATTATGCCCTCCCTC
TTCGATTCTCACGATGAATTTAACGACTGGTTTTCGAAAGATATCGAGTCTCATGCTCAG
TCAAACACTCAATTGAATCAGCAACAACTACGCAGGCTACATATGATATTGAAGCCATTT
ATGTTGCGTCGGATTAAGAAGAATGTCCAATCTGAGTTAGGCGATAAGATCGAGATTGAT
GTCATGTGTGATCTAACTCATAGACAAGCAAAGCTTTACCAAGTTTTGAAATCTCAGGTG
TCTGCTTCTTATGATGCCATCGAAAACGCTGCTAGTAACAGCAGTGGTGATGACTCTGGA
AATATGTCGTTGTCTGACTCAAAGATAATGAACACAGTAATGGAATTCAGAAAAGTCTGT
AATCACCCTGATCTTTTTGAGCGTGCCGATGTGTCATCCCCCTTTTCCTTTACGTCCTTT
GGCCAAACAGGTTCCATAATGCGGGAGGGAGATGTAATTGATGTTCAGTATTCTTCAAAA
AACCCGGTATCCTTCCACTTGCCCAGGTTGATATATGATGATTTGATTCTTCCGAACTAC
AATCACGACAGTGACATGAGGACCAAGATCTTGAATCATATGATGAGTATATTTGCTCCT
GCGAATTCACCAGATTTATGTGCCACACTTTCAAAGGTAGCTGGTGTGGAGCCCAATAGC
ATTTTACGGCTATCGCAAGAGCATATTGTAAAGCGAGCCATCGATCTATCTGCGCACTCT
CCGAATGTAACTCGAAGTGGTATTTTTTCTGTTGTCTATGAGGATGACAAGAGTTCTCTT
TCTTCTCTAGATAAGACCTTACTCATCAATGATAAAAGTGACTACTTGCACACTATTGCG
CGGACTACGCAAAATGGTGTGTTAGCCTCGTTATTGAATATCCAGGGAAACTTTTATGAA
AATGAATATATGAACGTTTTGCGCCCGGCTTACCGCCCTGCCGCCGCAGCTCCACCAATT
AGTATTCATGTCATGGGCTCTAGCAACTTTTCAATCAAGAGAGATAATGCATTGTTTGAG
CCATATATTACGAGGAGTCTCGGCATAATACCGCCTGAACTACAGACAAGATTAACGGAG
AAAGAAAACAATATTTTTACCGCATTGCCTATTTCAGAGTTGTACCCAGCACCTTTGAAT
AAGAGTTTCTCATCCTACATTTCTATGCCTTCGATGGATAGGTTCATTACAGAATCTGCC
AAACTAAAGAAGTTAGATGAACTTCTAGTTCGCTTGAAAGCGGGAGAGCATCGTGTTTTG
ATATATTTCCAGATGACAAGAATGATGGATTTGATCGAAGAATATTTGACCTACAGACAA
TACAAACATATCAGATTAGATGGTTCTTCGAAACTAGAAGATCGCCGTGATTTAGTTCAC
GATTGGCAAACTAAATCCGATATTTTTATTTTCTTACTATCGACAAGGGCTGGTGGCTTA
GGTATTAATCTTACATCTGCTGACACAGTTATCTTCTATGATTCTGATTGGAATCCTACC
ATAGATTCTCAAGCCATGGATAGAGCTCATAGATTGGGTCAAACAAAACAAGTTACTGTT
TATAGATTGTTGATCAAGGGCACCATTGAAGAGAGAATGAGGGATCGTGCAAAACAGAAA
GAACATGTTCAGCAGGTTGTCATGGAGGGTAAAACCAAAGAGAACAACGTGCAGACTATT
ACAGCAAATGGTAAAACTCTTGAAAACCTGCCTCTGCCACTTTAA

Predicted translation product    

>ERGO0G16808g.aa
MSLEALLNKEEKGSGAAREAFMRRMNERFNSVCHKDAREQQYQDWKYLSYQEFELVNEWS
AASRELTVNQCGQLLSNVKSAQAEWLAYEEFVAGRARLVAEVEAKREREQEEQEERQPAR
AKAKERARARGTAKRAVGRKATKAPAAPAEQEGGVPAARAAAKAPVRAEGAAEPPLKREL
SGAKLEDEGEGEDDDEEEDEDDEDGEDEDEDEDEEEEEEEELEELEDIQLLDDDNDKDFS
PEGGRSKSSIKLNTKLDINSDIALIQRELLKMAQKHKSAKAKKRKFTSCVVQRYDSDHTR
LEVKVTLKQLHIKRLKRLLNEAKRKRAAEEALAANEQQGNLAKRRKTAQKQKPAANGAPA
ASSSEDVTVTEAATPVPAKVNGEAPSNESPVAAIPDINPTTGLPTYGMKMTAKEARAIQR
HYDTTYITVWKDMARKDSAKLSRLVQQIQSIRSANFKKTSSLVAREARKWQSRNFRQVKD
FQTRARRGVREMSSFWKKNEREERELKKRAEREAIEQAKKEEEERESKRQARKLNFLLTQ
TELFSHFIGSKIKTNELEGNMADSNLATAPDVSAIDLSKPPTRKNEVHTIDFDNEDDEEL
HRKAAQNASNALKETREKAKAFDGMSGDDEELNFQNPTSLGEITIEQPKILACTLKEYQL
KGLNWLANLYDQGINGILADEMGLGKTVQSISVLAHLAERYNIWGPFIVVTPASTLHNWV
NEIQKFVPDFKILPYWGNGNDRKILRRFWDRKHLRYSKDAPFHVMITSYQMIVSDAAYLQ
KMKWQYMILDEAQAIKSSQSSRWKNLLSFHCRNRLLLTGTPIQNSMQELWALLHFIMPSL
FDSHDEFNDWFSKDIESHAQSNTQLNQQQLRRLHMILKPFMLRRIKKNVQSELGDKIEID
VMCDLTHRQAKLYQVLKSQVSASYDAIENAASNSSGDDSGNMSLSDSKIMNTVMEFRKVC
NHPDLFERADVSSPFSFTSFGQTGSIMREGDVIDVQYSSKNPVSFHLPRLIYDDLILPNY
NHDSDMRTKILNHMMSIFAPANSPDLCATLSKVAGVEPNSILRLSQEHIVKRAIDLSAHS
PNVTRSGIFSVVYEDDKSSLSSLDKTLLINDKSDYLHTIARTTQNGVLASLLNIQGNFYE
NEYMNVLRPAYRPAAAAPPISIHVMGSSNFSIKRDNALFEPYITRSLGIIPPELQTRLTE
KENNIFTALPISELYPAPLNKSFSSYISMPSMDRFITESAKLKKLDELLVRLKAGEHRVL
IYFQMTRMMDLIEEYLTYRQYKHIRLDGSSKLEDRRDLVHDWQTKSDIFIFLLSTRAGGL
GINLTSADTVIFYDSDWNPTIDSQAMDRAHRLGQTKQVTVYRLLIKGTIEERMRDRAKQK
EHVQQVVMEGKTKENNVQTITANGKTLENLPLPL*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites