SACE0L12627g


YLR222C UTP13, Nucleolar protein, component of the small subunit (SSU) processome containing the U3 snoRNA that is involved in processing of pre-18S rRNA

Genomic environment map

Element type: CDS
Element length: 2454 nucleotides,
on anti-sense strand of
Sace0L: complement(579320..581773).
Other names:
UTP13
YLR222C
Coding sequence: 818 codons.
Database cross references:
ArrayExpress: Q05946
CYGD: YLR222c
EMBL: AAB67411.1
EMBL: BK006945
EMBL: U19027
GeneID: 850919
HOGENOM: HBG397335
NMPDR: fig|4932.3.peg.4333

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3C0129
Orthologs: strict determination not possible; homologs must be refined manually

Protein SACE0L12627p  


Protein domain map

Protein length: 817 amino acids
Protein family: GL3C0129
Database cross references:
DIP: DIP-4790N
Gene3D: G3DSA:2.130.10.10
GermOnline: YLR222C
IntAct: Q05946
InterPro: IPR001680
InterPro: IPR011046
InterPro: IPR013934
InterPro: IPR015943
InterPro: IPR017986
InterPro: IPR019775
InterPro: IPR019781
InterPro: IPR019782
InterPro: IPR020472
KEGG: sce:YLR222C
NextBio: 967338
PIR: S51445
PRINTS: PR00320
PROSITE: PS00678
PROSITE: PS50082
PROSITE: PS50294
PeptideAtlas: Q05946
Pfam: PF00400
Pfam: PF08625
RefSeq: NP_013323.1
SGD: S000004212
SMART: SM00320
SMR: Q05946
UniProtKB/Swiss-Prot: Q05946
UniProtKB: UTP13_YEAST

Phylogeny  

PhylomeDB:SACE0L12627g

Computed results for SACE0L12627p  

None available yet

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>SACE0L12627g.nt
ATGGATCTGAAAACCTCATATAAAGGTATATCGTTAAACCCTATTTATGCAGGAAGCAGT
GCTGTTGCCACTGTTTCAGAAAATGGTAAAATACTAGCTACTCCGGTGCTTGACGAAATC
AACATAATCGATTTAACCCCAGGCTCCAGAAAAATTTTGCACAAGATTTCTAATGAAGAC
GAGCAAGAAATTACTGCACTGAAATTAACTCCTGATGGTCAATATCTCACGTATGTTTCA
CAAGCTCAGCTTTTAAAAATCTTTCACCTTAAGACCGGGAAAGTTGTCAGATCAATGAAA
ATCTCTTCTCCATCGTACATTCTCGACGCAGATTCCACTTCAACGCTTCTCGCAGTCGGA
GGGACAGATGGTAGTATAATTGTTGTGGATATTGAAAATGGTTACATTACTCATTCGTTC
AAAGGTCATGGTGGTACAATTTCCAGTTTGAAGTTTTATGGACAATTGAATAGTAAAATA
TGGTTATTAGCATCCGGTGACACCAATGGTATGGTAAAGGTTTGGGATCTTGTTAAAAGG
AAGTGCTTGCACACATTACAAGAGCACACATCTGCTGTGAGAGGGTTGGATATCATTGAG
GTACCAGATAATGATGAGCCAAGCCTAAATTTACTTTCCGGTGGTAGAGACGATATCATC
AATCTCTGGGATTTCAATATGAAAAAGAAATGTAAATTGTTGAAAACACTTCCAGTAAAT
CAACAGGTAGAGTCATGCGGATTCCTAAAGGATGGTGACGGTAAACGCATAATTTATACA
GCCGGCGGTGATGCTATCTTTCAGTTAATTGACTCTGAATCTGGTAGCGTGCTTAAAAGA
ACCAATAAACCTATAGAAGAGTTATTCATTATTGGTGTACTTCCAATATTGAGCAATTCA
CAGATGTTTTTAGTATTGTCTGACCAGACATTACAACTAATTAACGTTGAAGAAGACTTA
AAGAATGATGAAGACACAATACAGGTCACTTCGAGCATTGCTGGTAATCATGGTATTATT
GCTGACATGAGGTATGTTGGCCCAGAGCTGAATAAATTAGCTTTAGCTACCAATTCTCCA
TCGCTTAGGATAATACCCGTCCCCGATTTATCAGGCCCAGAAGCTTCGCTGCCTTTGGAT
GTCGAAATTTATGAAGGTCATGAAGATTTGCTAAATTCATTGGATGCTACTGAGGATGGC
CTGTGGATCGCAACCGCCTCTAAAGATAATACGGCTATTGTTTGGAGATACAATGAGAAC
AGCTGCAAGTTTGATATTTATGCCAAGTATATTGGTCATTCAGCTGCCGTAACTGCTGTT
GGATTGCCAAATATAGTTTCAAAGGGATATCCTGAATTTTTATTGACAGCATCGAATGAT
TTAACAATTAAGAAATGGATAATTCCAAAGCCAACTGCTAGTATGGATGTTCAAATCATC
AAGGTATCCGAATATACTCGTCATGCCCATGAAAAGGATATCAATGCTTTGTCAGTTTCT
CCTAACGATTCTATTTTTGCAACAGCATCATACGACAAGACCTGTAAGATCTGGAACCTA
GAAAATGGTGAATTGGAAGCCACGTTGGCCAACCACAAGCGTGGACTATGGGATGTATCA
TTTTGCCAATATGATAAATTATTGGCAACTTCTTCAGGTGATAAAACAGTCAAGATATGG
TCATTGGATACATTCAGCGTTATGAAAACATTGGAAGGTCATACCAATGCGGTTCAAAGA
TGTTCGTTTATTAATAAGCAAAAACAACTGATCAGTTGTGGTGCTGATGGCTTGATCAAA
ATATGGGATTGTTCTAGCGGTGAATGTCTGAAGACTTTGGATGGTCATAATAATAGATTA
TGGGCTTTAAGTACTATGAATGATGGTGATATGATCGTAAGTGCTGATGCAGATGGTGTT
TTTCAGTTTTGGAAAGATTGTACGGAACAAGAAATAGAAGAAGAACAAGAAAAAGCTAAA
TTACAAGTCGAACAAGAACAATCGCTTCAAAATTATATGAGCAAAGGTGATTGGACAAAT
GCATTTTTGTTAGCAATGACTTTAGATCACCCAATGAGGTTGTTTAATGTTTTGAAAAGA
GCTTTGGGCGAATCAAGGTCTAGACAAGATACCGAAGAGGGCAAAATTGAAGTCATTTTC
AATGAAGAATTGGACCAAGCCATCTCTATCCTAAATGATGAACAGTTAATTTTGTTAATG
AAACGATGCAGAGATTGGAATACAAATGCAAAAACACATACCATAGCGCAAAGAACAATA
AGATGTATTTTGATGCATCATAACATAGCAAAATTGAGTGAGATACCCGGAATGGTAAAG
ATAGTTGATGCAATAATTCCATACACGCAAAGGCATTTCACAAGGGTTGATAACTTAGTT
GAACAAAGTTACATATTAGACTACGCGCTAGTGGAAATGGATAAGCTATTCTAG

Coding sequence    

>SACE0L12627g.cds
ATGGATCTGAAAACCTCATATAAAGGTATATCGTTAAACCCTATTTATGCAGGAAGCAGT
GCTGTTGCCACTGTTTCAGAAAATGGTAAAATACTAGCTACTCCGGTGCTTGACGAAATC
AACATAATCGATTTAACCCCAGGCTCCAGAAAAATTTTGCACAAGATTTCTAATGAAGAC
GAGCAAGAAATTACTGCACTGAAATTAACTCCTGATGGTCAATATCTCACGTATGTTTCA
CAAGCTCAGCTTTTAAAAATCTTTCACCTTAAGACCGGGAAAGTTGTCAGATCAATGAAA
ATCTCTTCTCCATCGTACATTCTCGACGCAGATTCCACTTCAACGCTTCTCGCAGTCGGA
GGGACAGATGGTAGTATAATTGTTGTGGATATTGAAAATGGTTACATTACTCATTCGTTC
AAAGGTCATGGTGGTACAATTTCCAGTTTGAAGTTTTATGGACAATTGAATAGTAAAATA
TGGTTATTAGCATCCGGTGACACCAATGGTATGGTAAAGGTTTGGGATCTTGTTAAAAGG
AAGTGCTTGCACACATTACAAGAGCACACATCTGCTGTGAGAGGGTTGGATATCATTGAG
GTACCAGATAATGATGAGCCAAGCCTAAATTTACTTTCCGGTGGTAGAGACGATATCATC
AATCTCTGGGATTTCAATATGAAAAAGAAATGTAAATTGTTGAAAACACTTCCAGTAAAT
CAACAGGTAGAGTCATGCGGATTCCTAAAGGATGGTGACGGTAAACGCATAATTTATACA
GCCGGCGGTGATGCTATCTTTCAGTTAATTGACTCTGAATCTGGTAGCGTGCTTAAAAGA
ACCAATAAACCTATAGAAGAGTTATTCATTATTGGTGTACTTCCAATATTGAGCAATTCA
CAGATGTTTTTAGTATTGTCTGACCAGACATTACAACTAATTAACGTTGAAGAAGACTTA
AAGAATGATGAAGACACAATACAGGTCACTTCGAGCATTGCTGGTAATCATGGTATTATT
GCTGACATGAGGTATGTTGGCCCAGAGCTGAATAAATTAGCTTTAGCTACCAATTCTCCA
TCGCTTAGGATAATACCCGTCCCCGATTTATCAGGCCCAGAAGCTTCGCTGCCTTTGGAT
GTCGAAATTTATGAAGGTCATGAAGATTTGCTAAATTCATTGGATGCTACTGAGGATGGC
CTGTGGATCGCAACCGCCTCTAAAGATAATACGGCTATTGTTTGGAGATACAATGAGAAC
AGCTGCAAGTTTGATATTTATGCCAAGTATATTGGTCATTCAGCTGCCGTAACTGCTGTT
GGATTGCCAAATATAGTTTCAAAGGGATATCCTGAATTTTTATTGACAGCATCGAATGAT
TTAACAATTAAGAAATGGATAATTCCAAAGCCAACTGCTAGTATGGATGTTCAAATCATC
AAGGTATCCGAATATACTCGTCATGCCCATGAAAAGGATATCAATGCTTTGTCAGTTTCT
CCTAACGATTCTATTTTTGCAACAGCATCATACGACAAGACCTGTAAGATCTGGAACCTA
GAAAATGGTGAATTGGAAGCCACGTTGGCCAACCACAAGCGTGGACTATGGGATGTATCA
TTTTGCCAATATGATAAATTATTGGCAACTTCTTCAGGTGATAAAACAGTCAAGATATGG
TCATTGGATACATTCAGCGTTATGAAAACATTGGAAGGTCATACCAATGCGGTTCAAAGA
TGTTCGTTTATTAATAAGCAAAAACAACTGATCAGTTGTGGTGCTGATGGCTTGATCAAA
ATATGGGATTGTTCTAGCGGTGAATGTCTGAAGACTTTGGATGGTCATAATAATAGATTA
TGGGCTTTAAGTACTATGAATGATGGTGATATGATCGTAAGTGCTGATGCAGATGGTGTT
TTTCAGTTTTGGAAAGATTGTACGGAACAAGAAATAGAAGAAGAACAAGAAAAAGCTAAA
TTACAAGTCGAACAAGAACAATCGCTTCAAAATTATATGAGCAAAGGTGATTGGACAAAT
GCATTTTTGTTAGCAATGACTTTAGATCACCCAATGAGGTTGTTTAATGTTTTGAAAAGA
GCTTTGGGCGAATCAAGGTCTAGACAAGATACCGAAGAGGGCAAAATTGAAGTCATTTTC
AATGAAGAATTGGACCAAGCCATCTCTATCCTAAATGATGAACAGTTAATTTTGTTAATG
AAACGATGCAGAGATTGGAATACAAATGCAAAAACACATACCATAGCGCAAAGAACAATA
AGATGTATTTTGATGCATCATAACATAGCAAAATTGAGTGAGATACCCGGAATGGTAAAG
ATAGTTGATGCAATAATTCCATACACGCAAAGGCATTTCACAAGGGTTGATAACTTAGTT
GAACAAAGTTACATATTAGACTACGCGCTAGTGGAAATGGATAAGCTATTCTAG

Predicted translation product    

>SACE0L12627g.aa
MDLKTSYKGISLNPIYAGSSAVATVSENGKILATPVLDEINIIDLTPGSRKILHKISNED
EQEITALKLTPDGQYLTYVSQAQLLKIFHLKTGKVVRSMKISSPSYILDADSTSTLLAVG
GTDGSIIVVDIENGYITHSFKGHGGTISSLKFYGQLNSKIWLLASGDTNGMVKVWDLVKR
KCLHTLQEHTSAVRGLDIIEVPDNDEPSLNLLSGGRDDIINLWDFNMKKKCKLLKTLPVN
QQVESCGFLKDGDGKRIIYTAGGDAIFQLIDSESGSVLKRTNKPIEELFIIGVLPILSNS
QMFLVLSDQTLQLINVEEDLKNDEDTIQVTSSIAGNHGIIADMRYVGPELNKLALATNSP
SLRIIPVPDLSGPEASLPLDVEIYEGHEDLLNSLDATEDGLWIATASKDNTAIVWRYNEN
SCKFDIYAKYIGHSAAVTAVGLPNIVSKGYPEFLLTASNDLTIKKWIIPKPTASMDVQII
KVSEYTRHAHEKDINALSVSPNDSIFATASYDKTCKIWNLENGELEATLANHKRGLWDVS
FCQYDKLLATSSGDKTVKIWSLDTFSVMKTLEGHTNAVQRCSFINKQKQLISCGADGLIK
IWDCSSGECLKTLDGHNNRLWALSTMNDGDMIVSADADGVFQFWKDCTEQEIEEEQEKAK
LQVEQEQSLQNYMSKGDWTNAFLLAMTLDHPMRLFNVLKRALGESRSRQDTEEGKIEVIF
NEELDQAISILNDEQLILLMKRCRDWNTNAKTHTIAQRTIRCILMHHNIAKLSEIPGMVK
IVDAIIPYTQRHFTRVDNLVEQSYILDYALVEMDKLF*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites