ZYRO0G20306g


similar to uniprot|Q03957 Saccharomyces cerevisiae YKL139W CTK1 Catalytic (alpha) subunit of C-terminal domain kinase I (CTDK-I) which phosphorylates the C- terminal repeated domain of the RNA polymerase II large subunit (Rpo21p) to affect both transcription and pre-mRNA 3' end processing

Genomic environment map

Element type: CDS
Element length: 1650 nucleotides,
on anti-sense strand of
Zyro0G: complement(1675578..1677227).
Other names:
ZYRO-ORF204
Coding sequence: 550 codons.
Database cross references:
EMBL: CU928179
GeneID: 8206694
GenomeReviews: CU928179_GR

Computed results  

None available yet


Homologs and Orthologs

Homologs in protein family: GL3M4606
Orthologs: strict determination not possible; homologs must be refined manually

Protein ZYRO0G20306p  


similar to uniprot|Q03957 Saccharomyces cerevisiae YKL139W CTK1 Catalytic (alpha) subunit of C-terminal domain kinase I (CTDK-I) which phosphorylates the C- terminal repeated domain of the RNA polymerase II large subunit (Rpo21p) to affect both transcription and pre-mRNA 3' end processing; SubName: Full=ZYRO0G20306p;

Protein domain map

Protein length: 549 amino acids
Protein family: GL3M4606
Database cross references:
InterPro: IPR000719
InterPro: IPR002290
InterPro: IPR008271
InterPro: IPR011009
InterPro: IPR017441
InterPro: IPR017442
KEGG: zro:ZYRO0G20306g
PROSITE: PS00107
PROSITE: PS00108
PROSITE: PS50011
Pfam: PF00069
RefSeq: XP_002498860.1
SMART: SM00220
UniProtKB/TrEMBL: C5E1E3
UniProtKB: C5E1E3_ZYGRC

Phylogeny  

PhylomeDB:ZYRO0G20306g

Computed results for ZYRO0G20306p  

None available yet

Gene Ontology terms  


Sequence data  


Nucleotide sequence    

>ZYRO0G20306g.nt
ATGCCATACGGTAATAGTATGTATCTCACGGATTTCAGTCATACAACTTTTATATTATGG
TTCCCATTACTAACGTTAGTAGATCGTCAAAATTTTAGAAAACGAGCTGGTAATCCACCA
CCGCCAAGACCTGCTCCTAAAAGACTAAGGGCTAATAATCGACCACAACCTCCATTACCG
ACACGTCCAAGTGGACAACAATCACCACGTCATCCAGGGAATAACTTTCGTTTTAACAGG
AATAATAATGCCAATAGTAATGATGCTAGACCGATGTCACGATATGAAGGCTCACGATAT
AATAATGCAAGTAATACCAACAATAGTGGGGATGTGAATCGTCCTATTGGCTTAAATAAG
AGGCAACAACAACAGAATATGGATCCACTTCTTTCACAGTTACCGAAAGGCCCTAAGGTT
AGCATGTCGAGGTATGATAATAACAATAATAATGGTCATAATCGTAACGGCCATCATAGT
CATAATAATAATAATAACAATAATAATAACGGTAATAGTAATAATAGTAGGCCGGCAAAT
GGTGAATTAAGAAGAGTAGATTCGAGGAAAATAAATCCATCATTTCTAATTCAAAAGAGA
AACACATCTATTTATGAAAGGATCCTACAAGTTGGAGAAGGTACTTATGGTAAAGTTTAC
AAGGCACGTAATACTGTGACCGGAAGGATGGTTGCATTGAAGAAATTACGATTAGAAAGT
GAGAGAGAAGGTTTTCCCATTACATCAATTAGAGAGATTAAACTGCTGCAAAGCTTCGAT
CATCCGAACGTATCGACATTAACAGAAATCATGGTGGAATCTCAAAAGACCGTTTATATG
ATATTTGATTACGCTGATAACGATTTAAGTGGACTTTTATTAAACAAACAGATAGAAATT
AACGTTGCGCAGTGTAAACATATTTTCCAACAATTGTTACAAGGAATGGAGTATCTTCAT
GATAATGGTGTACTGCATCGTGATATTAAAGGATCCAATATTTTAGTTGACAATAAAGGA
CGTTTAAGGATAACAGATTTTGGATTAGCGAGAAGAATGAAGAGGGATAAGGATTATACA
AATCGTGTCATTACACTTTGGTATAGACCACCAGAATTATTATTGGGTACGACTAAATAT
TCTGAAGAAGTGGACATGTGGGGGTGTGGATGTGTCCTAGTAGAATTGTTCAACAAAACG
GCGGCATTCCAGGGCCAAAATGAACTAGAACAATTGGATTCTATTTTCAAAATCATGGGG
ACACCAACGATTGATCAATGGCCTAATTTATTTGAGATGCCTTGGTTTTTTATGGTCATA
CCACAACACAGTGAAAAGTATCCAACTGTCTTTAGAAATAGATTTGGCAATGTGATTCCA
TCAGAATCGTGTTTCCAATTAGCTGAGGGTCTCCTTGATTATAATCAGGACAAAAGGTTA
TCAGCAACAACGGCATTACAGAGTCCATTTTTCAAAGAAGACCCTCAACCAGAACCGCTT
GTCTTAGAAGGATATGCTGGTTGCCATGAATATGAAGTTAAATTAGCTAGGAAACAGAGG
AAAATGGAGGAACAGGCATCAAAACGAGAATCTCAATCACAATCACAATCACAATCACAA
TCACAAACACAATCGACTAATGGTAAATAG

Coding sequence    

>ZYRO0G20306g.cds
ATGCCATACGGTAATAGTATGTATCTCACGGATTTCAGTCATACAACTTTTATATTATGG
TTCCCATTACTAACGTTAGTAGATCGTCAAAATTTTAGAAAACGAGCTGGTAATCCACCA
CCGCCAAGACCTGCTCCTAAAAGACTAAGGGCTAATAATCGACCACAACCTCCATTACCG
ACACGTCCAAGTGGACAACAATCACCACGTCATCCAGGGAATAACTTTCGTTTTAACAGG
AATAATAATGCCAATAGTAATGATGCTAGACCGATGTCACGATATGAAGGCTCACGATAT
AATAATGCAAGTAATACCAACAATAGTGGGGATGTGAATCGTCCTATTGGCTTAAATAAG
AGGCAACAACAACAGAATATGGATCCACTTCTTTCACAGTTACCGAAAGGCCCTAAGGTT
AGCATGTCGAGGTATGATAATAACAATAATAATGGTCATAATCGTAACGGCCATCATAGT
CATAATAATAATAATAACAATAATAATAACGGTAATAGTAATAATAGTAGGCCGGCAAAT
GGTGAATTAAGAAGAGTAGATTCGAGGAAAATAAATCCATCATTTCTAATTCAAAAGAGA
AACACATCTATTTATGAAAGGATCCTACAAGTTGGAGAAGGTACTTATGGTAAAGTTTAC
AAGGCACGTAATACTGTGACCGGAAGGATGGTTGCATTGAAGAAATTACGATTAGAAAGT
GAGAGAGAAGGTTTTCCCATTACATCAATTAGAGAGATTAAACTGCTGCAAAGCTTCGAT
CATCCGAACGTATCGACATTAACAGAAATCATGGTGGAATCTCAAAAGACCGTTTATATG
ATATTTGATTACGCTGATAACGATTTAAGTGGACTTTTATTAAACAAACAGATAGAAATT
AACGTTGCGCAGTGTAAACATATTTTCCAACAATTGTTACAAGGAATGGAGTATCTTCAT
GATAATGGTGTACTGCATCGTGATATTAAAGGATCCAATATTTTAGTTGACAATAAAGGA
CGTTTAAGGATAACAGATTTTGGATTAGCGAGAAGAATGAAGAGGGATAAGGATTATACA
AATCGTGTCATTACACTTTGGTATAGACCACCAGAATTATTATTGGGTACGACTAAATAT
TCTGAAGAAGTGGACATGTGGGGGTGTGGATGTGTCCTAGTAGAATTGTTCAACAAAACG
GCGGCATTCCAGGGCCAAAATGAACTAGAACAATTGGATTCTATTTTCAAAATCATGGGG
ACACCAACGATTGATCAATGGCCTAATTTATTTGAGATGCCTTGGTTTTTTATGGTCATA
CCACAACACAGTGAAAAGTATCCAACTGTCTTTAGAAATAGATTTGGCAATGTGATTCCA
TCAGAATCGTGTTTCCAATTAGCTGAGGGTCTCCTTGATTATAATCAGGACAAAAGGTTA
TCAGCAACAACGGCATTACAGAGTCCATTTTTCAAAGAAGACCCTCAACCAGAACCGCTT
GTCTTAGAAGGATATGCTGGTTGCCATGAATATGAAGTTAAATTAGCTAGGAAACAGAGG
AAAATGGAGGAACAGGCATCAAAACGAGAATCTCAATCACAATCACAATCACAATCACAA
TCACAAACACAATCGACTAATGGTAAATAG

Predicted translation product    

>ZYRO0G20306g.aa
MPYGNSMYLTDFSHTTFILWFPLLTLVDRQNFRKRAGNPPPPRPAPKRLRANNRPQPPLP
TRPSGQQSPRHPGNNFRFNRNNNANSNDARPMSRYEGSRYNNASNTNNSGDVNRPIGLNK
RQQQQNMDPLLSQLPKGPKVSMSRYDNNNNNGHNRNGHHSHNNNNNNNNNGNSNNSRPAN
GELRRVDSRKINPSFLIQKRNTSIYERILQVGEGTYGKVYKARNTVTGRMVALKKLRLES
EREGFPITSIREIKLLQSFDHPNVSTLTEIMVESQKTVYMIFDYADNDLSGLLLNKQIEI
NVAQCKHIFQQLLQGMEYLHDNGVLHRDIKGSNILVDNKGRLRITDFGLARRMKRDKDYT
NRVITLWYRPPELLLGTTKYSEEVDMWGCGCVLVELFNKTAAFQGQNELEQLDSIFKIMG
TPTIDQWPNLFEMPWFFMVIPQHSEKYPTVFRNRFGNVIPSESCFQLAEGLLDYNQDKRL
SATTALQSPFFKEDPQPEPLVLEGYAGCHEYEVKLARKQRKMEEQASKRESQSQSQSQSQ
SQTQSTNGK*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites