ERGO0F02398g


Syntenic homolog of Saccharomyces cerevisiae YPL228W (CET1) and YMR180C (CTL1)

Genomic environment map

Element type: CDS
Element length: 1437 nucleotides,
on sense strand of
Ergo0F: 181202..182638.
Other names:
AFL134W
AGOS_AFL134W
Coding sequence: 479 codons.

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0218 GL3C0218.F1 GL3C0218.N2
Orthologs by synteny: ZYRO0E07700g SAKL0A03520g KLTH0H03234g KLLA0C16049g

Protein ERGO0F02398p  


Protein domain map

Protein length: 478 amino acids
Protein family: GL3C0218
Database cross references:
InterPro: IPR004206
UniProtKB/Swiss-Prot: Q755F7

Computed results for ERGO0F02398p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>ERGO0F02398g.nt
ATGTCGAGCGAAAAGCAAGAGGGCTCCCCGAGAAGGGTACTTTCGCTGTCAGATCTCGTC
AATCGTGAGGATAGCAAGGATGCTAGTGGTAACACGGCGGGCGCGGCACTGAAGCCGCTC
TCAAACGACGAAGTCAGGAGGCGGCTGGCGCACGAGGACTACAGTTCGAATACGGTGTCG
CAGGTGGACGACGAAACAGACACGGATGACGGGCTCGGGGAGGTGGCTTTCGACCGGGCG
GATTTCCGGTTTGATTTCGAGGGCCACCAGAGAAGCGGCGGGCGGGCGAGCGGAGAAGCC
GAGGCAGAGAGTGGGGGTCCAGGGCACGAAAAAGCGACGCAGCAGCCAAGTGACGCTATC
GCCACCTCGCCGGACAGGGAGAGCACAGAGCCACGAGCGATGGACATCTTCGAAGAAAGG
GCTTCGCTGGAGTCGAAGAAGAACAATCTGCGCAAGGACCTGCGGGTACTGAACGAGATT
GCGTCGACGGCCCGGCCGGGGCGGTACAAGGTGGCGCCGATCTGGGCGCAGAAGTGGAAG
CCGACCGTGCGGGCGCTGCAGAACGTGAACTCGAAGGACCTGATGAAGATCGATGTATCG
TTCACGCAGGTGATACCGGACGACGACCTGACGAAGTCGGTGCAGGACTGGGTGTACGCG
ACGCTGCTGAGCATCCCGCCCGAGCAGCGGCAGTACGTGGAGGTGGAAATGAAGTTTGGG
ATTCTGATGGACAGAAGCTCGGACTCGCAGCGGGTGACGCCGCCGGTGTCCTCCCAGACG
GTGTACATGGAGGCAGACGCGCGGATGAAGCCGGACGTGGACGAGCGGGTCTTTGTGGAG
CTGAACCGCTACGTCAAGGGCATATCCGAGCTGACGGAGAACACGGGCAAGTTTAACATC
ATCGAGTCGCACAACAAAGACGAGATGTACCGTGCCGGGATTAACACGCAGCGGCCGCGG
TTCCTGCGTATGAGCAAGGACGTCAAGACTGGCCGTGTGGGCGAGTTCATCGAAAAGCGG
CGGATCTCGCAGCTCCTGCTCTTTTCCCCCAAAGATAGCTACGATGTCAAGATTTCGATC
AACGTGGAACTGCCTGTACCGGAGAACGATCCTCCCGAGAAGTACATGGGCCAGGCGCCG
CTGAATTCTCGAACCAAAGAGCGCATTAGTTACATACACAATGACTCGTGCACCCGCATA
GACATTACGAAGGTTACCAACCACAACAAGGGCAAGCGTGACGATGCAGAGGTGACGCAC
GAGATCGAGCTAGAACTCAACTCACAGGCGTTGCTCGCTGCCTTCGACAAAATTGCGCAG
GATAGTAAAGACTATGCCACCATCGTTCGCACATTCCTGAATAATGGAACCATTATCAGG
AGGAAGCTGACCTCTCTCTCTTACGAGATTTTCGAGGGTGGCAAGAAGGTTGTGTAG

Coding sequence    

>ERGO0F02398g.cds
ATGTCGAGCGAAAAGCAAGAGGGCTCCCCGAGAAGGGTACTTTCGCTGTCAGATCTCGTC
AATCGTGAGGATAGCAAGGATGCTAGTGGTAACACGGCGGGCGCGGCACTGAAGCCGCTC
TCAAACGACGAAGTCAGGAGGCGGCTGGCGCACGAGGACTACAGTTCGAATACGGTGTCG
CAGGTGGACGACGAAACAGACACGGATGACGGGCTCGGGGAGGTGGCTTTCGACCGGGCG
GATTTCCGGTTTGATTTCGAGGGCCACCAGAGAAGCGGCGGGCGGGCGAGCGGAGAAGCC
GAGGCAGAGAGTGGGGGTCCAGGGCACGAAAAAGCGACGCAGCAGCCAAGTGACGCTATC
GCCACCTCGCCGGACAGGGAGAGCACAGAGCCACGAGCGATGGACATCTTCGAAGAAAGG
GCTTCGCTGGAGTCGAAGAAGAACAATCTGCGCAAGGACCTGCGGGTACTGAACGAGATT
GCGTCGACGGCCCGGCCGGGGCGGTACAAGGTGGCGCCGATCTGGGCGCAGAAGTGGAAG
CCGACCGTGCGGGCGCTGCAGAACGTGAACTCGAAGGACCTGATGAAGATCGATGTATCG
TTCACGCAGGTGATACCGGACGACGACCTGACGAAGTCGGTGCAGGACTGGGTGTACGCG
ACGCTGCTGAGCATCCCGCCCGAGCAGCGGCAGTACGTGGAGGTGGAAATGAAGTTTGGG
ATTCTGATGGACAGAAGCTCGGACTCGCAGCGGGTGACGCCGCCGGTGTCCTCCCAGACG
GTGTACATGGAGGCAGACGCGCGGATGAAGCCGGACGTGGACGAGCGGGTCTTTGTGGAG
CTGAACCGCTACGTCAAGGGCATATCCGAGCTGACGGAGAACACGGGCAAGTTTAACATC
ATCGAGTCGCACAACAAAGACGAGATGTACCGTGCCGGGATTAACACGCAGCGGCCGCGG
TTCCTGCGTATGAGCAAGGACGTCAAGACTGGCCGTGTGGGCGAGTTCATCGAAAAGCGG
CGGATCTCGCAGCTCCTGCTCTTTTCCCCCAAAGATAGCTACGATGTCAAGATTTCGATC
AACGTGGAACTGCCTGTACCGGAGAACGATCCTCCCGAGAAGTACATGGGCCAGGCGCCG
CTGAATTCTCGAACCAAAGAGCGCATTAGTTACATACACAATGACTCGTGCACCCGCATA
GACATTACGAAGGTTACCAACCACAACAAGGGCAAGCGTGACGATGCAGAGGTGACGCAC
GAGATCGAGCTAGAACTCAACTCACAGGCGTTGCTCGCTGCCTTCGACAAAATTGCGCAG
GATAGTAAAGACTATGCCACCATCGTTCGCACATTCCTGAATAATGGAACCATTATCAGG
AGGAAGCTGACCTCTCTCTCTTACGAGATTTTCGAGGGTGGCAAGAAGGTTGTGTAG

Predicted translation product    

>ERGO0F02398g.aa
MSSEKQEGSPRRVLSLSDLVNREDSKDASGNTAGAALKPLSNDEVRRRLAHEDYSSNTVS
QVDDETDTDDGLGEVAFDRADFRFDFEGHQRSGGRASGEAEAESGGPGHEKATQQPSDAI
ATSPDRESTEPRAMDIFEERASLESKKNNLRKDLRVLNEIASTARPGRYKVAPIWAQKWK
PTVRALQNVNSKDLMKIDVSFTQVIPDDDLTKSVQDWVYATLLSIPPEQRQYVEVEMKFG
ILMDRSSDSQRVTPPVSSQTVYMEADARMKPDVDERVFVELNRYVKGISELTENTGKFNI
IESHNKDEMYRAGINTQRPRFLRMSKDVKTGRVGEFIEKRRISQLLLFSPKDSYDVKISI
NVELPVPENDPPEKYMGQAPLNSRTKERISYIHNDSCTRIDITKVTNHNKGKRDDAEVTH
EIELELNSQALLAAFDKIAQDSKDYATIVRTFLNNGTIIRRKLTSLSYEIFEGGKKVV*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites