ERGO0D01562g


Syntenic homolog of Saccharomyces cerevisiae YBR110W (ALG1)

Genomic environment map

Element type: CDS
Element length: 1416 nucleotides,
on anti-sense strand of
Ergo0D: complement(112082..113497).
Other names:
ADL338C
AGOS_ADL338C
Coding sequence: 472 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R2097 GL3R2097.F1 GL3R2097.N1
Orthologs by synteny: ZYRO0C16368g SAKL0B07832g KLTH0F10604g KLLA0B09405g

Protein ERGO0D01562p  


Protein domain map

Protein length: 471 amino acids
Protein family: GL3R2097
Database cross references:
InterPro: IPR001296
UniProtKB/Swiss-Prot: Q75BA5

Computed results for ERGO0D01562p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>ERGO0D01562g.nt
ATGGACACGTCCTCCGTTACTATGCACACTGAACGCGCGTGCTGCCACCAAGCTCAACGG
GCGGTCGCTGCGATGTTGGATAAGGCTCCTAGCTGGTTGATCTGGACGGCTGTTCTCTAT
GTAGGGCTTCCGTTTATGCTATACTGGGCCGTTCCGTACCTATTCTACCACAACAAGACA
AAGAGCAGACGCATTGCGATCTATGTACTAGGCGATCTTGGGCACTCTCCGCGGATCTGC
TACCACGCGCGCTCTTTCAGCGCTGCGGGCTGGGAGGTGGAGCTGTGTGGTTATCTGGAG
GAGCAGCCACCGAAGGACTTGCTGGACGATCCGCGCGTGACGATCCGGGCGTTGCCAGGA
GCCTCTAATGCAGGCAAGAGCCTGGGCCAGACTGCGCGCAAGGTCGTATTGCAGACATGC
CACATTGTGCGGCAGCTGTGGGAGCTGCGCGGGTGCGACTACATTCTGATCCAGAATCCG
CCCAGCATCCCGCTTCTGCCCATCGTGGCGATCTTCAAGGTGCTGACGCGCACTCGGTTA
ATCTTGGACTGGCACAATTTTGCGTATACCGTCCTGCAGTTGCGGGTGGGGCGTTTTCTG
CACCCGCTCGTACTTGTCTCGTATGCTGTGGAGTTTCTGTTCAGCCGCATGGCTGACTAC
CATATCACCGTGACCGCCGCCATGAAGGATTATCTCGTACAGAGCTTTCTGTTGCCCGCG
CGGCGTATTGCTGTCATGTACGATAGGCCTGGCGAGCAATTTAGGCCACTGCCGGCGGGG
GAGCGGGGGGCTGCGCTCGCAGAGCCATTCATCAGAGGTTACATTCCGGCAGGATTCGAC
GTCCAGCGAGGCGATACCATACTGGTGACGTCAACATCCTTCACTCTGGACGAAGATATT
AATGTGCTATTTGGTGCGCTCAAGATATACGAGAGTGCTGCGGCGAAGTTCGATACTACT
CTTCCGCGCATCCTGCTTTTCGTCACTGGTAAGGGCCCCCTCAAGGGCAAGTACATGGAG
GAAGTGAGGAATTACAAATGGGAACGCTGCACAATCCACTTCCTGTGGCTCTCTGCTGAG
GACTACCCGCGGCTTCTGCAGCTGTGTGACTTCGGTGTTTCGCTACACACTTCGACTTCG
GGGCTGGATCTGCCCATGAAGGTGCTTGACATGTTCGGGTCGGGTCTGCCGGCCTTTGTA
ATGGACTATCCCGCCATCGGCGAATTGGTCCAGGACCGCGTCAACGGCTTGAGGTTTACA
ACCCGGCGTGAGCTAGAGCAGTGTCTTATTTTCGCAATCAAGGATGAACACACTAGGAAG
GTGCTGAAGGAGAACGCGCTTCTTGAGAGTAAGAACAGATGGCACCAGCGTTGGGCCTCC
GCCATGAGCGAGCTGCAAGTTGTTCGGCAATCTTAG

Coding sequence    

>ERGO0D01562g.cds
ATGGACACGTCCTCCGTTACTATGCACACTGAACGCGCGTGCTGCCACCAAGCTCAACGG
GCGGTCGCTGCGATGTTGGATAAGGCTCCTAGCTGGTTGATCTGGACGGCTGTTCTCTAT
GTAGGGCTTCCGTTTATGCTATACTGGGCCGTTCCGTACCTATTCTACCACAACAAGACA
AAGAGCAGACGCATTGCGATCTATGTACTAGGCGATCTTGGGCACTCTCCGCGGATCTGC
TACCACGCGCGCTCTTTCAGCGCTGCGGGCTGGGAGGTGGAGCTGTGTGGTTATCTGGAG
GAGCAGCCACCGAAGGACTTGCTGGACGATCCGCGCGTGACGATCCGGGCGTTGCCAGGA
GCCTCTAATGCAGGCAAGAGCCTGGGCCAGACTGCGCGCAAGGTCGTATTGCAGACATGC
CACATTGTGCGGCAGCTGTGGGAGCTGCGCGGGTGCGACTACATTCTGATCCAGAATCCG
CCCAGCATCCCGCTTCTGCCCATCGTGGCGATCTTCAAGGTGCTGACGCGCACTCGGTTA
ATCTTGGACTGGCACAATTTTGCGTATACCGTCCTGCAGTTGCGGGTGGGGCGTTTTCTG
CACCCGCTCGTACTTGTCTCGTATGCTGTGGAGTTTCTGTTCAGCCGCATGGCTGACTAC
CATATCACCGTGACCGCCGCCATGAAGGATTATCTCGTACAGAGCTTTCTGTTGCCCGCG
CGGCGTATTGCTGTCATGTACGATAGGCCTGGCGAGCAATTTAGGCCACTGCCGGCGGGG
GAGCGGGGGGCTGCGCTCGCAGAGCCATTCATCAGAGGTTACATTCCGGCAGGATTCGAC
GTCCAGCGAGGCGATACCATACTGGTGACGTCAACATCCTTCACTCTGGACGAAGATATT
AATGTGCTATTTGGTGCGCTCAAGATATACGAGAGTGCTGCGGCGAAGTTCGATACTACT
CTTCCGCGCATCCTGCTTTTCGTCACTGGTAAGGGCCCCCTCAAGGGCAAGTACATGGAG
GAAGTGAGGAATTACAAATGGGAACGCTGCACAATCCACTTCCTGTGGCTCTCTGCTGAG
GACTACCCGCGGCTTCTGCAGCTGTGTGACTTCGGTGTTTCGCTACACACTTCGACTTCG
GGGCTGGATCTGCCCATGAAGGTGCTTGACATGTTCGGGTCGGGTCTGCCGGCCTTTGTA
ATGGACTATCCCGCCATCGGCGAATTGGTCCAGGACCGCGTCAACGGCTTGAGGTTTACA
ACCCGGCGTGAGCTAGAGCAGTGTCTTATTTTCGCAATCAAGGATGAACACACTAGGAAG
GTGCTGAAGGAGAACGCGCTTCTTGAGAGTAAGAACAGATGGCACCAGCGTTGGGCCTCC
GCCATGAGCGAGCTGCAAGTTGTTCGGCAATCTTAG

Predicted translation product    

>ERGO0D01562g.aa
MDTSSVTMHTERACCHQAQRAVAAMLDKAPSWLIWTAVLYVGLPFMLYWAVPYLFYHNKT
KSRRIAIYVLGDLGHSPRICYHARSFSAAGWEVELCGYLEEQPPKDLLDDPRVTIRALPG
ASNAGKSLGQTARKVVLQTCHIVRQLWELRGCDYILIQNPPSIPLLPIVAIFKVLTRTRL
ILDWHNFAYTVLQLRVGRFLHPLVLVSYAVEFLFSRMADYHITVTAAMKDYLVQSFLLPA
RRIAVMYDRPGEQFRPLPAGERGAALAEPFIRGYIPAGFDVQRGDTILVTSTSFTLDEDI
NVLFGALKIYESAAAKFDTTLPRILLFVTGKGPLKGKYMEEVRNYKWERCTIHFLWLSAE
DYPRLLQLCDFGVSLHTSTSGLDLPMKVLDMFGSGLPAFVMDYPAIGELVQDRVNGLRFT
TRRELEQCLIFAIKDEHTRKVLKENALLESKNRWHQRWASAMSELQVVRQS*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites