SAKL0A01738g


highly similar to uniprot|P25636 Saccharomyces cerevisiae YCR057C PWP2 Conserved 90S pre-ribosomal component essential for proper endonucleolytic cleavage of the 35S rRNA precursor at A0, A1, and A2 sites

Genomic environment map

Element type: CDS
Element length: 2730 nucleotides, on sense strand of Sakl0A: 157360..160089.
Other names: SAKL-ORF15606
Coding sequence: 910 codons.

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C0129
Orthologs: strict determination not possible; homologs must be refined manually

Protein SAKL0A01738p  


highly similar to uniprot|P25636 Saccharomyces cerevisiae YCR057C PWP2 Conserved 90S pre-ribosomal component essential for proper endonucleolytic cleavage of the 35S rRNA precursor at A0, A1, and A2 sites

Protein domain map

Protein length: 909 amino acids
Protein family: GL3C0129
Database cross references:

Phylogeny  

PhylomeDB:SAKL0A01738g

Computed results for SAKL0A01738p  

None available yet

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>SAKL0A01738g.nt
ATGAAGTCGGATTTTAAGTTCTCCAACCTTTTGGGTACTGTTTACAGGCAAGGAAATGTT
ATTTTTTCTGATGACGGAACTAAATTGCTAAGCCCTGTCGGGAATAGAGTGTCAGTGTTT
GACTTAATTAATAACAAGTCTTTCACTTTTGAATATGAACATAGGAAGAACATCGCAAGA
ATTGACTTAAACAAACAGGGGACATTGCTTTTGTCGGTTGATGAGGATGGGCGTGCCATC
CTGGTCAATTTTAAGTCTCGTAACGTCTTGCATCATTTCAACTTTAGAGAAAAGGTTTAC
GATTTGAAGTTCTCTCCGGATGGTAGACTGTTTGCTTTAGCTTGTGGAAGATTTATTCAA
ATATGGAAGACTCCAGACGTTAGTGAAGATAGACAATTTGCCCCATTCGTTCGTTATAGG
GTCCATGCAGGCCATTTTGCAGATATCATCTCTTTAACATGGTCTCCAGATTCTAGATTT
ATTACTTCCACTTCTAAAGATCTAACTGCGAGGATATGGTCCATCGACTCCGAAGAAAAA
GACTTGGCATCTATGACATTTGCAGGCCACAAAGATTACGTTGTGAACTCCTTTTTTAGC
GCTGACCAAGAAATAATTTACACCATCAGTAAGGATGGTGCGCTGTTTCAATGGGAATAT
ACAAGAAGACCAGACGAAGTTGATGAAGAAGAAAGTGAAGATGAAGAAAATATTGATTTG
TCAAAATATAGCTGGAGAATCACTAAAAAAAACTTTTTCTACACCAATCAAGCCAAAGTG
AAATGTGCTATTTTTCATCCACAGTCCAACCTGTTAATCGTTGGATTTAGCAACGGTGAG
TTCAGGCTTTATGAACTACCACATTTCGCTTTAATTCAACAATTGTCTATGGGCCAAAAT
GCTGTTAACACTGTTTCCATTAACAACTCTGGTGAATGGTTGGCTTTTGGTTCCAGCAAA
TTAGGACAGCTGTTGGTCTATGAATGGCAGTCAGAATCCTATATTTTGAAGCAACAAGGA
CACTTTGATTCTATGAACTCCTTAACATACTCTCCGGATGGATCACGTGTTGTTACCTCT
TCCGATGACGGCAAGATCAAAATATGGGACGTTGTCTCCGGTTTCTGTCTAGCTACTTTC
GATGATCATACTTCATCTGTTACTGCGGTTCAATTTGCCAAGAAAGGCCAAGTGCTATTC
TCCGCATCTTTAGATGGCACTGTTAAAGCATGGGATTTAATCAGGTATCGTAATTTCAGA
ACTTTTACTGCTGCTGAAAGAATTCAATTTAACTGTCTGGCAGTAGATCCAAGTGGTGAA
GTGGTTTGTGCTGGCTCTGTGGACAGTTTTGAAATCCACGTTTGGTCTGTACAAACCGGG
AATTTATTGGACACTCTAGCTGGTCACGAAGGGCCTGTTTCTTGTCTATCGTTCAGTAAT
GAAAACAGTGTCCTGGCATCTGCTTCATGGGATAAGACAATCAGGATTTGGAGTATATTC
GGTAGATCTCAACAGGTCGAACCGTTTGATGTTTATTCTGATGTCCTGGCTATTTCTATG
AGACCAGATGGTAAGCAGGTTGCTGCGACTACCCTAAATGGCCAAATCTCCTTTTTCGAC
ATTCAAAGTGGTAAGCAAGTCGGAAATATTGACTGTAGAAAAGACATTGTTTCTGGTAGA
CATCTCGAAGATAGGTTTACTGCAAAGAATTCTGCAAGATCCAAGTATTTTACCACTATA
AACTATAGTTTTGATGGTCTCTCTATTGTTGCAGGTGGTAATAATAACTCTATTTGCCTG
TACGACATCTCGAATGAAGTTTTATTAAGAAGATTTGTGGTTTCTAGAAATATGACTCTA
AACGGCACTATGGAGTTTTTGAATAGCAGTAAGATGACTGAAGCTGGAACTCTCGATTTG
ATCGATCAAGATGCGGAAAATTCAGATTTGGAAGATCGTATCGACAACTCTCTACCAGGA
TCTAACAGGGGTGGTGATCTCTCGACTAGGAGAGTGAGGCCAGAAATTAGAGTGATAGCT
GTTCAGTTCTCGCCTACTGCAAATGCATTTGCAGCAGCATCAACGGAAGGGTTACTGGTT
TATTCCGTTGACGAAACAGTATTTTTTGATCCGTTTGATCTAGACGTTGATGTTACTCCT
CAAACAACTTTAGAAGCTTTGGAAAACAAAGAATACTTAAACTCACTGGTCATGGCCTTT
AGATTAAATGAAGAGTATCTAATTAACAGGGTCTACGAATCTGTACCCATCAAGGATATT
CCATTAGTCAGTGCTAACTTACCAATCGTCTATGCGGCTAGAATTTTACGGTTTATTGGT
AACTTCTCTATGGACTCTCAACACATTGAGTTCAACTTACTGTGGGTCAAATCATTGCTT
TCAGCACACGGCAAATACATTAATGCTCACAAGCAAGAGTTTACCAGTGCATTGAGAGCA
GTTCAAAGATTCATTGGTAGAGTGGCGAAAGATGTTGTAGCTGCCTCTAAAGACAACAAA
TATGCCTACTACTTTCTTACTTCTACTGATGGGACTTTAGAACAAAGTGATGCGGAAGAT
GAAATTAATTTAGATCGAGAGGAGGATAGTCTTGATGAAGACGCAATGGAAGCTTCTGAC
GATGAAGAGGAGTGGGTAGGTTTCAGCGATAAGGACAACAAGCTACCATTACAACAGGAC
GAAGATGATTCAGATGAAGATTTAATCTAA

Coding sequence    

>SAKL0A01738g.cds
ATGAAGTCGGATTTTAAGTTCTCCAACCTTTTGGGTACTGTTTACAGGCAAGGAAATGTT
ATTTTTTCTGATGACGGAACTAAATTGCTAAGCCCTGTCGGGAATAGAGTGTCAGTGTTT
GACTTAATTAATAACAAGTCTTTCACTTTTGAATATGAACATAGGAAGAACATCGCAAGA
ATTGACTTAAACAAACAGGGGACATTGCTTTTGTCGGTTGATGAGGATGGGCGTGCCATC
CTGGTCAATTTTAAGTCTCGTAACGTCTTGCATCATTTCAACTTTAGAGAAAAGGTTTAC
GATTTGAAGTTCTCTCCGGATGGTAGACTGTTTGCTTTAGCTTGTGGAAGATTTATTCAA
ATATGGAAGACTCCAGACGTTAGTGAAGATAGACAATTTGCCCCATTCGTTCGTTATAGG
GTCCATGCAGGCCATTTTGCAGATATCATCTCTTTAACATGGTCTCCAGATTCTAGATTT
ATTACTTCCACTTCTAAAGATCTAACTGCGAGGATATGGTCCATCGACTCCGAAGAAAAA
GACTTGGCATCTATGACATTTGCAGGCCACAAAGATTACGTTGTGAACTCCTTTTTTAGC
GCTGACCAAGAAATAATTTACACCATCAGTAAGGATGGTGCGCTGTTTCAATGGGAATAT
ACAAGAAGACCAGACGAAGTTGATGAAGAAGAAAGTGAAGATGAAGAAAATATTGATTTG
TCAAAATATAGCTGGAGAATCACTAAAAAAAACTTTTTCTACACCAATCAAGCCAAAGTG
AAATGTGCTATTTTTCATCCACAGTCCAACCTGTTAATCGTTGGATTTAGCAACGGTGAG
TTCAGGCTTTATGAACTACCACATTTCGCTTTAATTCAACAATTGTCTATGGGCCAAAAT
GCTGTTAACACTGTTTCCATTAACAACTCTGGTGAATGGTTGGCTTTTGGTTCCAGCAAA
TTAGGACAGCTGTTGGTCTATGAATGGCAGTCAGAATCCTATATTTTGAAGCAACAAGGA
CACTTTGATTCTATGAACTCCTTAACATACTCTCCGGATGGATCACGTGTTGTTACCTCT
TCCGATGACGGCAAGATCAAAATATGGGACGTTGTCTCCGGTTTCTGTCTAGCTACTTTC
GATGATCATACTTCATCTGTTACTGCGGTTCAATTTGCCAAGAAAGGCCAAGTGCTATTC
TCCGCATCTTTAGATGGCACTGTTAAAGCATGGGATTTAATCAGGTATCGTAATTTCAGA
ACTTTTACTGCTGCTGAAAGAATTCAATTTAACTGTCTGGCAGTAGATCCAAGTGGTGAA
GTGGTTTGTGCTGGCTCTGTGGACAGTTTTGAAATCCACGTTTGGTCTGTACAAACCGGG
AATTTATTGGACACTCTAGCTGGTCACGAAGGGCCTGTTTCTTGTCTATCGTTCAGTAAT
GAAAACAGTGTCCTGGCATCTGCTTCATGGGATAAGACAATCAGGATTTGGAGTATATTC
GGTAGATCTCAACAGGTCGAACCGTTTGATGTTTATTCTGATGTCCTGGCTATTTCTATG
AGACCAGATGGTAAGCAGGTTGCTGCGACTACCCTAAATGGCCAAATCTCCTTTTTCGAC
ATTCAAAGTGGTAAGCAAGTCGGAAATATTGACTGTAGAAAAGACATTGTTTCTGGTAGA
CATCTCGAAGATAGGTTTACTGCAAAGAATTCTGCAAGATCCAAGTATTTTACCACTATA
AACTATAGTTTTGATGGTCTCTCTATTGTTGCAGGTGGTAATAATAACTCTATTTGCCTG
TACGACATCTCGAATGAAGTTTTATTAAGAAGATTTGTGGTTTCTAGAAATATGACTCTA
AACGGCACTATGGAGTTTTTGAATAGCAGTAAGATGACTGAAGCTGGAACTCTCGATTTG
ATCGATCAAGATGCGGAAAATTCAGATTTGGAAGATCGTATCGACAACTCTCTACCAGGA
TCTAACAGGGGTGGTGATCTCTCGACTAGGAGAGTGAGGCCAGAAATTAGAGTGATAGCT
GTTCAGTTCTCGCCTACTGCAAATGCATTTGCAGCAGCATCAACGGAAGGGTTACTGGTT
TATTCCGTTGACGAAACAGTATTTTTTGATCCGTTTGATCTAGACGTTGATGTTACTCCT
CAAACAACTTTAGAAGCTTTGGAAAACAAAGAATACTTAAACTCACTGGTCATGGCCTTT
AGATTAAATGAAGAGTATCTAATTAACAGGGTCTACGAATCTGTACCCATCAAGGATATT
CCATTAGTCAGTGCTAACTTACCAATCGTCTATGCGGCTAGAATTTTACGGTTTATTGGT
AACTTCTCTATGGACTCTCAACACATTGAGTTCAACTTACTGTGGGTCAAATCATTGCTT
TCAGCACACGGCAAATACATTAATGCTCACAAGCAAGAGTTTACCAGTGCATTGAGAGCA
GTTCAAAGATTCATTGGTAGAGTGGCGAAAGATGTTGTAGCTGCCTCTAAAGACAACAAA
TATGCCTACTACTTTCTTACTTCTACTGATGGGACTTTAGAACAAAGTGATGCGGAAGAT
GAAATTAATTTAGATCGAGAGGAGGATAGTCTTGATGAAGACGCAATGGAAGCTTCTGAC
GATGAAGAGGAGTGGGTAGGTTTCAGCGATAAGGACAACAAGCTACCATTACAACAGGAC
GAAGATGATTCAGATGAAGATTTAATCTAA

Predicted translation product    

>SAKL0A01738g.aa
MKSDFKFSNLLGTVYRQGNVIFSDDGTKLLSPVGNRVSVFDLINNKSFTFEYEHRKNIAR
IDLNKQGTLLLSVDEDGRAILVNFKSRNVLHHFNFREKVYDLKFSPDGRLFALACGRFIQ
IWKTPDVSEDRQFAPFVRYRVHAGHFADIISLTWSPDSRFITSTSKDLTARIWSIDSEEK
DLASMTFAGHKDYVVNSFFSADQEIIYTISKDGALFQWEYTRRPDEVDEEESEDEENIDL
SKYSWRITKKNFFYTNQAKVKCAIFHPQSNLLIVGFSNGEFRLYELPHFALIQQLSMGQN
AVNTVSINNSGEWLAFGSSKLGQLLVYEWQSESYILKQQGHFDSMNSLTYSPDGSRVVTS
SDDGKIKIWDVVSGFCLATFDDHTSSVTAVQFAKKGQVLFSASLDGTVKAWDLIRYRNFR
TFTAAERIQFNCLAVDPSGEVVCAGSVDSFEIHVWSVQTGNLLDTLAGHEGPVSCLSFSN
ENSVLASASWDKTIRIWSIFGRSQQVEPFDVYSDVLAISMRPDGKQVAATTLNGQISFFD
IQSGKQVGNIDCRKDIVSGRHLEDRFTAKNSARSKYFTTINYSFDGLSIVAGGNNNSICL
YDISNEVLLRRFVVSRNMTLNGTMEFLNSSKMTEAGTLDLIDQDAENSDLEDRIDNSLPG
SNRGGDLSTRRVRPEIRVIAVQFSPTANAFAAASTEGLLVYSVDETVFFDPFDLDVDVTP
QTTLEALENKEYLNSLVMAFRLNEEYLINRVYESVPIKDIPLVSANLPIVYAARILRFIG
NFSMDSQHIEFNLLWVKSLLSAHGKYINAHKQEFTSALRAVQRFIGRVAKDVVAASKDNK
YAYYFLTSTDGTLEQSDAEDEINLDREEDSLDEDAMEASDDEEEWVGFSDKDNKLPLQQD
EDDSDEDLI*
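
As a quick consistency check between the coding sequence and the predicted translation product above, the first and last lines of the CDS can be translated by hand or with a short script. The sketch below is illustrative only: it assumes the standard nuclear genetic code applies to this organism, and the helper name `translate` is hypothetical, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: this organism uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    if len(nucleotides) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[nucleotides[i:i + 3]]
        for i in range(0, len(nucleotides), 3)
    )


# First and last lines of >SAKL0A01738g.cds, copied from this record.
first_line = "ATGAAGTCGGATTTTAAGTTCTCCAACCTTTTGGGTACTGTTTACAGGCAAGGAAATGTT"
last_line = "GAAGATGATTCAGATGAAGATTTAATCTAA"

print(translate(first_line))  # MKSDFKFSNLLGTVYRQGNV
print(translate(last_line))   # EDDSDEDLI*
```

Both fragments agree with the start and end of the >SAKL0A01738g.aa entry, including the trailing `*` for the TAA stop codon.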




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
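
The relation between the figures reported for this element can be checked with a few lines of arithmetic. This is a minimal sketch using the values from this record; it assumes the coordinates are 1-based and inclusive (which is consistent with the stated element length), and the function names are illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def codon_count(nucleotides: int) -> int:
    """Number of codons in a CDS, stop codon included."""
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nucleotides // 3


# Values copied from this record: Sakl0A: 157360..160089
start, end = 157360, 160089

element_length = span_length(start, end)   # nucleotides
codons = codon_count(element_length)       # includes the stop codon
protein_length = codons - 1                # stop codon encodes no residue

print(element_length, codons, protein_length)  # 2730 910 909
```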

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of the protein-coding potential of DNA was performed from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
Color   Nucleotide sequence and Coding sequence                 Predicted translation product
RED     start and stop codons                                   Initial methionine and sequence end
BLUE    coding sequence                                         protein sequence
grey    non-coding sequence (upstream, downstream or intron)
grey    donor and acceptor splicing sites