ERGO0G00528g


Syntenic homolog of Saccharomyces cerevisiae YMR307W (GAS1); tandem gene duplication in this genome

Genomic environment map

Element type: CDS
Element length: 1668 nucleotides, on sense strand of Ergo0G: 49757..51424.
Other names: AGL351W, AGOS_AGL351W
Coding sequence: 556 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R0042 GL3R0042.F1 GL3R0042.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein ERGO0G00528p  


Protein domain map

Protein length: 555 amino acids
Protein family: GL3R0042
Database cross references:
InterPro: IPR004886
InterPro: IPR012946
UniProtKB/TrEMBL: Q751S4

Computed results for ERGO0G00528p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>ERGO0G00528g.nt
ATGTTGCAGACAGCGATATACGGAACGGTTGCATGCCTCCTACAGGCTGTGGTCGCACAG
AGAAACAATGGTGGGGCAGCGCTTAGCGCGACAGCCGGCACGGCGCCCTCGATTGCGACC
TCCAGTCCTTCGACCAACAAGGTGCCGCCTATTGAGATCCATGGAAACAAGTTCTTCTAT
TCAAACAACGGCTCCCAGTTTTACATGAAGGGGATTGCATACCAGGCGGAGACGTCTGAC
GCTTCTGCGAGTGCCACGATCAACGACCCGCTAGCGGATTATGAGAGCTGTTCCCGGGAT
TTGCCGTACTTGTTGGAGCTAAACACCAACACGCTTCGGGTATATGCAGTGAACAGTAGC
TTGGACCATTCAAGATGCATGGAACTTTTTGAAAGCAACGGGATTTATATTGTTGCGGAT
TTATCGGAACCAGCACTATCGATCAACAGAAACAACCCTGAATGGTCTTTGAAGCTTTTC
GAAAGATATGCAGGTGTGGTTGACGAAATGCAGAAATACAGCAATGTGTTGGGGTTCTTT
GCAGGAAATGAAGTAACTAACGAGATAAATAATACGGCAGCTTCCGCTTTTGTGAAAGCT
GCTATTCGTGACACTAAGGCATACATTAAGGAAAAGGGCTATAGGGAAATTCCAGTGGGG
TACTCAACCAATGATGACAGTAACTTTAGACAAGAGATTGCGGATTACTTTGCGTGCGGG
TCTCAAGAGGAGAAGGCGGACTTTCTATGGGTTCAACGTCTATTTCATGGTGCGGGAGAT
TCTACCTTTGAAAAATCAGGCTACTCAGACCAGACGGAAGAGTTTTCCAACTTGGGTATC
CCAGTGTTCTTCTCCGAATATGGCTGTAACGAAGTCAGGCCTCGGAAGTTTGGCGAAGTT
TCTGCTCTATACGGTAGCGATATGACAGATGTCTGGTCAGGAGGTATTGTATACATGTAT
TTTGAAGAAACCAATCAATACGGCTTGGTAACCGTCGACAGCAGTGGCCGCGTTTCTACC
AACGATGACTATAACAACTTGAAGACAGCATTGGCCACAATTTCGCCATCATCTGCAAAC
AAAGATTCATACACCGCAAGTTCTGGCTCTGTCGCATGTCCTACAACTGGCTCAAACTGG
CAAGCAGCAACTAGCTTACCACCATCACCACAAAAGGATGTTTGTGATTGTATAAAGAGT
GCATTAAGATGTGTCATTTCTCCAGATGTCGACCAGAAGGATTACTCGGAATTATTTGGC
TATCTATGCTCCGAAGCGGACGTTGATTGTTCAGACATTTCTGCCGATGGAACTACAGGA
AACTATGGAGCATTTTCCTTCTGCGACGACGAGACCAAGCTATCATACCTATTGAACAAG
TATTACCAAGAAAAGGGCAGGTCCTCCTCCTCTGCATGTGACTTCAGCGGCTCTGCAACG
CTAGTCTCCGCCACAGGCACAGCATCTACATGTGTTCCTACATCTACCGGGATAAACTTG
GGCTCATCTTCATCCAACAGCGGCAGTGGCAGAGGTGACAGCACTACCGCCACCTCTTCA
GCAGCCGCAGCCGCAGATGCTATTCAACCGAACACCTTCTCAATGGCTTTATTCATCCCT
ACCGCTTTAGTATTTATGCTTACCGGTTTTGGCATAGCTATGTCTTGA

Coding sequence    

>ERGO0G00528g.cds
ATGTTGCAGACAGCGATATACGGAACGGTTGCATGCCTCCTACAGGCTGTGGTCGCACAG
AGAAACAATGGTGGGGCAGCGCTTAGCGCGACAGCCGGCACGGCGCCCTCGATTGCGACC
TCCAGTCCTTCGACCAACAAGGTGCCGCCTATTGAGATCCATGGAAACAAGTTCTTCTAT
TCAAACAACGGCTCCCAGTTTTACATGAAGGGGATTGCATACCAGGCGGAGACGTCTGAC
GCTTCTGCGAGTGCCACGATCAACGACCCGCTAGCGGATTATGAGAGCTGTTCCCGGGAT
TTGCCGTACTTGTTGGAGCTAAACACCAACACGCTTCGGGTATATGCAGTGAACAGTAGC
TTGGACCATTCAAGATGCATGGAACTTTTTGAAAGCAACGGGATTTATATTGTTGCGGAT
TTATCGGAACCAGCACTATCGATCAACAGAAACAACCCTGAATGGTCTTTGAAGCTTTTC
GAAAGATATGCAGGTGTGGTTGACGAAATGCAGAAATACAGCAATGTGTTGGGGTTCTTT
GCAGGAAATGAAGTAACTAACGAGATAAATAATACGGCAGCTTCCGCTTTTGTGAAAGCT
GCTATTCGTGACACTAAGGCATACATTAAGGAAAAGGGCTATAGGGAAATTCCAGTGGGG
TACTCAACCAATGATGACAGTAACTTTAGACAAGAGATTGCGGATTACTTTGCGTGCGGG
TCTCAAGAGGAGAAGGCGGACTTTCTATGGGTTCAACGTCTATTTCATGGTGCGGGAGAT
TCTACCTTTGAAAAATCAGGCTACTCAGACCAGACGGAAGAGTTTTCCAACTTGGGTATC
CCAGTGTTCTTCTCCGAATATGGCTGTAACGAAGTCAGGCCTCGGAAGTTTGGCGAAGTT
TCTGCTCTATACGGTAGCGATATGACAGATGTCTGGTCAGGAGGTATTGTATACATGTAT
TTTGAAGAAACCAATCAATACGGCTTGGTAACCGTCGACAGCAGTGGCCGCGTTTCTACC
AACGATGACTATAACAACTTGAAGACAGCATTGGCCACAATTTCGCCATCATCTGCAAAC
AAAGATTCATACACCGCAAGTTCTGGCTCTGTCGCATGTCCTACAACTGGCTCAAACTGG
CAAGCAGCAACTAGCTTACCACCATCACCACAAAAGGATGTTTGTGATTGTATAAAGAGT
GCATTAAGATGTGTCATTTCTCCAGATGTCGACCAGAAGGATTACTCGGAATTATTTGGC
TATCTATGCTCCGAAGCGGACGTTGATTGTTCAGACATTTCTGCCGATGGAACTACAGGA
AACTATGGAGCATTTTCCTTCTGCGACGACGAGACCAAGCTATCATACCTATTGAACAAG
TATTACCAAGAAAAGGGCAGGTCCTCCTCCTCTGCATGTGACTTCAGCGGCTCTGCAACG
CTAGTCTCCGCCACAGGCACAGCATCTACATGTGTTCCTACATCTACCGGGATAAACTTG
GGCTCATCTTCATCCAACAGCGGCAGTGGCAGAGGTGACAGCACTACCGCCACCTCTTCA
GCAGCCGCAGCCGCAGATGCTATTCAACCGAACACCTTCTCAATGGCTTTATTCATCCCT
ACCGCTTTAGTATTTATGCTTACCGGTTTTGGCATAGCTATGTCTTGA

Predicted translation product    

>ERGO0G00528g.aa
MLQTAIYGTVACLLQAVVAQRNNGGAALSATAGTAPSIATSSPSTNKVPPIEIHGNKFFY
SNNGSQFYMKGIAYQAETSDASASATINDPLADYESCSRDLPYLLELNTNTLRVYAVNSS
LDHSRCMELFESNGIYIVADLSEPALSINRNNPEWSLKLFERYAGVVDEMQKYSNVLGFF
AGNEVTNEINNTAASAFVKAAIRDTKAYIKEKGYREIPVGYSTNDDSNFRQEIADYFACG
SQEEKADFLWVQRLFHGAGDSTFEKSGYSDQTEEFSNLGIPVFFSEYGCNEVRPRKFGEV
SALYGSDMTDVWSGGIVYMYFEETNQYGLVTVDSSGRVSTNDDYNNLKTALATISPSSAN
KDSYTASSGSVACPTTGSNWQAATSLPPSPQKDVCDCIKSALRCVISPDVDQKDYSELFG
YLCSEADVDCSDISADGTTGNYGAFSFCDDETKLSYLLNKYYQEKGRSSSSACDFSGSAT
LVSATGTASTCVPTSTGINLGSSSSNSGSGRGDSTTATSSAAAAADAIQPNTFSMALFIP
TALVFMLTGFGIAMS*
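
The predicted translation product can be reproduced from the coding sequence. The following is a minimal Python sketch, assuming the standard nuclear genetic code applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
# Assumption: the standard nuclear code is appropriate for this sequence.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from FASTA wrapping
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First five codons and last three codons of ERGO0G00528g.cds
print(translate("ATGTTGCAGACAGCG"))  # MLQTA
print(translate("ATGTCTTGA"))        # MS*
```

Passing the full `.cds` record body to `translate` should give the `.aa` record, with the trailing `*` marking the stop codon.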




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
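
As a worked check of this convention against the values reported on this page, here is a small Python sketch. It assumes the genomic coordinates are 1-based and inclusive, which is consistent with the stated element length; the function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def codon_count(n_nucleotides: int) -> int:
    """Number of codons in a coding sequence, stop codon included."""
    if n_nucleotides % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return n_nucleotides // 3


def protein_length(n_codons: int) -> int:
    """Protein length: every codon except the stop codon encodes a residue."""
    return n_codons - 1


nucleotides = span_length(49757, 51424)
codons = codon_count(nucleotides)
print(nucleotides, codons, protein_length(codons))  # 1668 556 555
```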

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of the protein-coding potential of the DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the protein-coding potential of the open reading frame appears in frame #3.

Sequences
Color   Nucleotide sequence and Coding sequence                 Predicted translation product
RED     start and stop codons                                   Initial methionine and sequence end
BLUE    coding sequence                                         protein sequence
grey    non-coding sequence (upstream, downstream or intron)
grey    donor and acceptor splicing sites