ZYRO0F10406g


similar to uniprot|Q12220 Saccharomyces cerevisiae YLR129W DIP2 Nucleolar protein specifically associated with the U3 snoRNA part of the large ribonucleoprotein complex known as the small subunit (SSU) processome required for 18S rRNA biogenesis part of the active pre- rRNA processing complex

Genomic environment map

Element type: CDS
Element length: 2817 nucleotides,
on sense strand of
Zyro0F: 843429..846245.
Other names:
ZYRO-ORF2595
Coding sequence: 939 codons.
Database cross references:
EMBL: CU928178
GeneID: 8205415
GenomeReviews: CU928178_GR

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C0129
Orthologs: strict determination not possible; homologs must be refined manually

Protein ZYRO0F10406p  


similar to uniprot|Q12220 Saccharomyces cerevisiae YLR129W DIP2 Nucleolar protein specifically associated with the U3 snoRNA part of the large ribonucleoprotein complex known as the small subunit (SSU) processome required for 18S rRNA biogenesis part of the active pre- rRNA processing complex; SubName: Full=ZYRO0F10406p;

Protein domain map

Protein length: 938 amino acids
Protein family: GL3C0129
Database cross references:
Gene3D: G3DSA:2.130.10.10
InterPro: IPR001680
InterPro: IPR007148
InterPro: IPR011046
InterPro: IPR011047
InterPro: IPR015943
InterPro: IPR017986
InterPro: IPR019775
InterPro: IPR019781
InterPro: IPR019782
InterPro: IPR020472
KEGG: zro:ZYRO0F10406g
PRINTS: PR00320
PROSITE: PS00678
PROSITE: PS50082
PROSITE: PS50294
Pfam: PF00400
Pfam: PF04003
RefSeq: XP_002497650.1
SMART: SM00320
UniProtKB/TrEMBL: C5DY56
UniProtKB: C5DY56_ZYGRC

Phylogeny  

PhylomeDB:ZYRO0F10406g

Computed results for ZYRO0F10406p  

None available yet

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>ZYRO0F10406g.nt
ATGGTGAAATCGTATCAACGTTTTGAACAATCCTCCGTGTTTGGTGTTGTTTCCTCAGGT
GATAACTCTATTTGGATACCAGCTGAAAACAAAAGGACTAATGGTCCTGGTCAAATCATT
ACTGGTGCTTTAGAAAATGTCAATATTTGGGATATTAAGACTGGAGAATTAGTCTCTCAT
CTTTCTGATGGGTTACCACCAGGTGCAATTGATGCAAGAACAACTAAACCTGCAGAAGTT
ACATATATGCAATACTACAGTGATACAAGTCTTTTGGCTGTTGGTTATGCTGATGGTGTT
ATTAAGATTTGGGATATGTTCTCAAAAACGGTTCTTTTGAGTTTCAACGGTCATAGGTCT
GCTGTAACTCAACTTTTATTCGATTCCACGGGTACAAGATTAATTTCAGGTTCCAGAGAC
TCGAATATCATCGTCTGGGATTTAGTAAGTGAAGTAGGCCTTTACAAATTACGTTCACAT
AAGGATTCCATTACAGGTCTTTCGCTGCCCGATGAAAATTGGTTGATGAGTGTTTCTAAG
GATGGACTCATTAAGCTTTGGGATTTAAAGATTCAACAGTGTGTAGAAACACATATTGCA
CATCCGGGTGAATGTTGGGCCCTTGGTGTTGATGAAGACGTTGTTGTCACTACAAGTTCA
GATTCTCAAATAAAAATTTGGCATTTGGATTTAGACGCCGATGCTGGTGTTAAAGTTACA
GAGAAGGGTATTTTCGAGAAGCAGAGTAAACAAAGAGGTGTAGCAGTTAGTTTTATTACT
GTTTCAGATGGTACCAAATTCTTCTATGTTCAAAATGCTGATAAAACCCTAGAAATTATT
AGGCTGAGGAAAGAAGAAGAGATATCAAGAGCTTTAAAGAAAAGAGAGAAAAGGTTCAAG
GAAAAAGGAATGACTGATGAAGAAATCCAACAAAATTTTAAGGAATCTTATATCTCTATT
ATAATGCACTCATTCCAAGTCTTGAGGTCAGCGTTCAAAATCAGAGCTGCTTCATGGGCT
GTCACTACGTCTTCCAAATTGGAGTTAACAGTTACTACCTCTGGTAATACAATTGAGTAC
TACTCAATACCGTACGAGAAAAGAGAACCTAAGCATCCAACACCGATCAGGGTTCATTCC
ATCGAGTTACAAGGTCACAGGACTGACGTTCGTGCCATCGATATCAGTGATGATGGTAAA
TTGTTGGCTACTGCTTCTAATGGTCTATTGAAGATTTGGAATATTAGAACAAAGACTTGT
TTAAGAACTTTTGAATGTGGATATGCATTGACCTGTAAATTCTTGCCAGGTGGTGTTCTG
GTGATAGTTGGTACTAGGAGTGGTCAATTACAATTGTTTGATCTTGCATCTTCAACTATA
TTAGAAAATAACGAAGAAGCTCACGATGCCGCAATCTGGTCTCTTGACATTACTTCTGAT
GGTAGGAGATTGGTTACAGGATCAGCAGATAAAACAGTCAAATTTTGGAATTTCATTGTG
GAAGAAAAGAAAGTCCCTGGTACAACCGATAAGTTGATCCCTACGATGAGTCTTCATCAC
GATACTACTTTAGAGATGAGCGATGACATATTGAGTGTGAAGATTTCACCAGAGAACAAG
TTGTTAGCAGTTTCACTGTTGGATAACACGGTCAAGGTCTTTTATTTGGAAACTATGAAA
TTTTTCTTAAGCTTGTACGGTCATAAATTACCAGTATTGTCGATTGACATTTCCTTTGAT
TCTAAACTTATCATCACTTCTTCGGCAGATAAGAACATTAAAATATGGGGTTTAGATTTT
GGTGATTGTCATAAATCACTTTTCGCCCATCAAGACTCTATTATGAACGTAAAATTTTTA
CCAGAATCCCATAATTTCTTCAGTTGTTCTAAAGATGCAACTGTTAAATACTGGGACGGT
GATAAATTTGAAACGATTCAGAAGCTAGCTGGCCATCAGAGTGAAGTTTGGGCTTTAGCG
GTAGCTAGGAGTGGTACTTGCGTCATATCTGTTTCCCATGACAGCAGTATAAGAGTTTGG
GAAGAAACTGATGATCAAGTGTTTTTAGAAGAAGAACGTGAAAGAGAATTGGAGGAACAA
AATGAAGAAGGTCTACTGACCTCGTTAGAGGAAGGTTCTGGGGATTCGGCTTTCAAGCAA
GATGAAAAGGATGATGATCATGATGATGCAGTCGATGTGCATAAGCAGACCGTGGAGTCC
TTAAAGGCTGGTGAAAGATTAATGGAAGCAATTGATTTGGGGATCCCAGAGATAGAGGCA
TGGGAAATTTACGAGAAGGAATTACAACTTTGGAAGAAGAAGAAACAGGGTGTGGAGCCT
GAGAGACCACAAGATAATGCTATTTTGTTGGCCATTAACAAAAGACCAGAAGAATATATC
ATGGACACCTTAGTGAGGATAAAACCATCACAATTAGAAGACGCCTTATTGACACTTCCT
TTTTCATACGTTTTGAAATTCTTAAAGTTCCTAGATACTGTGCTCCAGGACAAGAAATTA
CTCCACAATCATTTGTCGCTTATTTGCAAGAACCTATTCTTCATAGTTCAATCAAATCAT
CGGGAATTAGTGTCTCAAAAGAATGAAGAACTGAAGCAAAGAATTACTAGAGTAAAGAAC
GAACTTAGAGAAGCATTGAAAAACAACGAAGACGATCTAGGGTTCAACATTGAAGGACTA
AAATTCATTAAACAACAATGGAACCTCAAGCACAACTATGAATTTGTTGATGAATTTGAC
CAAAGGAAGCAAGCAGAGAAAACCGCGAAGAAGAGAGTTTTCGAAACTCTAAGTTAA

Coding sequence    

>ZYRO0F10406g.cds
ATGGTGAAATCGTATCAACGTTTTGAACAATCCTCCGTGTTTGGTGTTGTTTCCTCAGGT
GATAACTCTATTTGGATACCAGCTGAAAACAAAAGGACTAATGGTCCTGGTCAAATCATT
ACTGGTGCTTTAGAAAATGTCAATATTTGGGATATTAAGACTGGAGAATTAGTCTCTCAT
CTTTCTGATGGGTTACCACCAGGTGCAATTGATGCAAGAACAACTAAACCTGCAGAAGTT
ACATATATGCAATACTACAGTGATACAAGTCTTTTGGCTGTTGGTTATGCTGATGGTGTT
ATTAAGATTTGGGATATGTTCTCAAAAACGGTTCTTTTGAGTTTCAACGGTCATAGGTCT
GCTGTAACTCAACTTTTATTCGATTCCACGGGTACAAGATTAATTTCAGGTTCCAGAGAC
TCGAATATCATCGTCTGGGATTTAGTAAGTGAAGTAGGCCTTTACAAATTACGTTCACAT
AAGGATTCCATTACAGGTCTTTCGCTGCCCGATGAAAATTGGTTGATGAGTGTTTCTAAG
GATGGACTCATTAAGCTTTGGGATTTAAAGATTCAACAGTGTGTAGAAACACATATTGCA
CATCCGGGTGAATGTTGGGCCCTTGGTGTTGATGAAGACGTTGTTGTCACTACAAGTTCA
GATTCTCAAATAAAAATTTGGCATTTGGATTTAGACGCCGATGCTGGTGTTAAAGTTACA
GAGAAGGGTATTTTCGAGAAGCAGAGTAAACAAAGAGGTGTAGCAGTTAGTTTTATTACT
GTTTCAGATGGTACCAAATTCTTCTATGTTCAAAATGCTGATAAAACCCTAGAAATTATT
AGGCTGAGGAAAGAAGAAGAGATATCAAGAGCTTTAAAGAAAAGAGAGAAAAGGTTCAAG
GAAAAAGGAATGACTGATGAAGAAATCCAACAAAATTTTAAGGAATCTTATATCTCTATT
ATAATGCACTCATTCCAAGTCTTGAGGTCAGCGTTCAAAATCAGAGCTGCTTCATGGGCT
GTCACTACGTCTTCCAAATTGGAGTTAACAGTTACTACCTCTGGTAATACAATTGAGTAC
TACTCAATACCGTACGAGAAAAGAGAACCTAAGCATCCAACACCGATCAGGGTTCATTCC
ATCGAGTTACAAGGTCACAGGACTGACGTTCGTGCCATCGATATCAGTGATGATGGTAAA
TTGTTGGCTACTGCTTCTAATGGTCTATTGAAGATTTGGAATATTAGAACAAAGACTTGT
TTAAGAACTTTTGAATGTGGATATGCATTGACCTGTAAATTCTTGCCAGGTGGTGTTCTG
GTGATAGTTGGTACTAGGAGTGGTCAATTACAATTGTTTGATCTTGCATCTTCAACTATA
TTAGAAAATAACGAAGAAGCTCACGATGCCGCAATCTGGTCTCTTGACATTACTTCTGAT
GGTAGGAGATTGGTTACAGGATCAGCAGATAAAACAGTCAAATTTTGGAATTTCATTGTG
GAAGAAAAGAAAGTCCCTGGTACAACCGATAAGTTGATCCCTACGATGAGTCTTCATCAC
GATACTACTTTAGAGATGAGCGATGACATATTGAGTGTGAAGATTTCACCAGAGAACAAG
TTGTTAGCAGTTTCACTGTTGGATAACACGGTCAAGGTCTTTTATTTGGAAACTATGAAA
TTTTTCTTAAGCTTGTACGGTCATAAATTACCAGTATTGTCGATTGACATTTCCTTTGAT
TCTAAACTTATCATCACTTCTTCGGCAGATAAGAACATTAAAATATGGGGTTTAGATTTT
GGTGATTGTCATAAATCACTTTTCGCCCATCAAGACTCTATTATGAACGTAAAATTTTTA
CCAGAATCCCATAATTTCTTCAGTTGTTCTAAAGATGCAACTGTTAAATACTGGGACGGT
GATAAATTTGAAACGATTCAGAAGCTAGCTGGCCATCAGAGTGAAGTTTGGGCTTTAGCG
GTAGCTAGGAGTGGTACTTGCGTCATATCTGTTTCCCATGACAGCAGTATAAGAGTTTGG
GAAGAAACTGATGATCAAGTGTTTTTAGAAGAAGAACGTGAAAGAGAATTGGAGGAACAA
AATGAAGAAGGTCTACTGACCTCGTTAGAGGAAGGTTCTGGGGATTCGGCTTTCAAGCAA
GATGAAAAGGATGATGATCATGATGATGCAGTCGATGTGCATAAGCAGACCGTGGAGTCC
TTAAAGGCTGGTGAAAGATTAATGGAAGCAATTGATTTGGGGATCCCAGAGATAGAGGCA
TGGGAAATTTACGAGAAGGAATTACAACTTTGGAAGAAGAAGAAACAGGGTGTGGAGCCT
GAGAGACCACAAGATAATGCTATTTTGTTGGCCATTAACAAAAGACCAGAAGAATATATC
ATGGACACCTTAGTGAGGATAAAACCATCACAATTAGAAGACGCCTTATTGACACTTCCT
TTTTCATACGTTTTGAAATTCTTAAAGTTCCTAGATACTGTGCTCCAGGACAAGAAATTA
CTCCACAATCATTTGTCGCTTATTTGCAAGAACCTATTCTTCATAGTTCAATCAAATCAT
CGGGAATTAGTGTCTCAAAAGAATGAAGAACTGAAGCAAAGAATTACTAGAGTAAAGAAC
GAACTTAGAGAAGCATTGAAAAACAACGAAGACGATCTAGGGTTCAACATTGAAGGACTA
AAATTCATTAAACAACAATGGAACCTCAAGCACAACTATGAATTTGTTGATGAATTTGAC
CAAAGGAAGCAAGCAGAGAAAACCGCGAAGAAGAGAGTTTTCGAAACTCTAAGTTAA

Predicted translation product    

>ZYRO0F10406g.aa
MVKSYQRFEQSSVFGVVSSGDNSIWIPAENKRTNGPGQIITGALENVNIWDIKTGELVSH
LSDGLPPGAIDARTTKPAEVTYMQYYSDTSLLAVGYADGVIKIWDMFSKTVLLSFNGHRS
AVTQLLFDSTGTRLISGSRDSNIIVWDLVSEVGLYKLRSHKDSITGLSLPDENWLMSVSK
DGLIKLWDLKIQQCVETHIAHPGECWALGVDEDVVVTTSSDSQIKIWHLDLDADAGVKVT
EKGIFEKQSKQRGVAVSFITVSDGTKFFYVQNADKTLEIIRLRKEEEISRALKKREKRFK
EKGMTDEEIQQNFKESYISIIMHSFQVLRSAFKIRAASWAVTTSSKLELTVTTSGNTIEY
YSIPYEKREPKHPTPIRVHSIELQGHRTDVRAIDISDDGKLLATASNGLLKIWNIRTKTC
LRTFECGYALTCKFLPGGVLVIVGTRSGQLQLFDLASSTILENNEEAHDAAIWSLDITSD
GRRLVTGSADKTVKFWNFIVEEKKVPGTTDKLIPTMSLHHDTTLEMSDDILSVKISPENK
LLAVSLLDNTVKVFYLETMKFFLSLYGHKLPVLSIDISFDSKLIITSSADKNIKIWGLDF
GDCHKSLFAHQDSIMNVKFLPESHNFFSCSKDATVKYWDGDKFETIQKLAGHQSEVWALA
VARSGTCVISVSHDSSIRVWEETDDQVFLEEERERELEEQNEEGLLTSLEEGSGDSAFKQ
DEKDDDHDDAVDVHKQTVESLKAGERLMEAIDLGIPEIEAWEIYEKELQLWKKKKQGVEP
ERPQDNAILLAINKRPEEYIMDTLVRIKPSQLEDALLTLPFSYVLKFLKFLDTVLQDKKL
LHNHLSLICKNLFFIVQSNHRELVSQKNEELKQRITRVKNELREALKNNEDDLGFNIEGL
KFIKQQWNLKHNYEFVDEFDQRKQAEKTAKKRVFETLS*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites