CAGL0L09042g


similar to uniprot|Q12341 Saccharomyces cerevisiae YPL001w HAT1 Catalytic subunit of the Hat1p-Hat2p histone acetyltransferase complex that uses the cofactor acetyl coenzyme A, to acetylate free nuclear and cytoplasmic histone H4

Genomic environment map

Element type: CDS
Element length: 1167 nucleotides,
on sense strand of
Cagl0L: 985819..986985.
Other names:
CAGL-CDS2840.1
CAGL-IPF7485
Coding sequence: 389 codons.
Database cross references:
EMBL: CR380958
GeneID: 2891100
HOGENOM: Q6FKS5

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R2212 GL3R2212.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0L09042p  


similar to uniprot|Q12341 Saccharomyces cerevisiae YPL001w HAT1 histone acetyltransferase subunit

Protein domain map

Protein length: 388 amino acids
Protein family: GL3R2212
Database cross references:
Gene3D: G3DSA:1.10.10.390
Gene3D: G3DSA:3.40.630.30
InterPro: IPR000182
InterPro: IPR013523
InterPro: IPR016181
InterPro: IPR017380
KEGG: cgr:CAGL0L09042g
PIRSF: PIRSF038084
PROSITE: PS51186
Pfam: PF00583
RefSeq: XP_449169.1
UniProtKB/Swiss-Prot: Q6FKS5
UniprotKB: HAT1_CANGA

Computed results for CAGL0L09042p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>CAGL0L09042g.nt
TAGAATCTCATTTCCGGCTTGATTATCGATCCATAACATACTATTCGCCGCCATATCTTC
CAATACTGACCGACCTCTTACAGCATCCCACTGAAAATTTGCCCTGAGAAGCGATATACT
AGCATAACCCATAATAGAGCAAACCTCCAGAATTTTTGTCTGATCTCCTGTTATTTCATT
TGGTACACTTCTCAGAAATTTTTTTCCTCGAATGCTGAATACTTTAAATCCACCATCTAG
CACTTGAATCATATCTATTGATTTTTCTAAATCTTCCATGGTAACATTTAGATTCTTAAA
ATAACCATTTCGCAGTTCATCAAAAGATATTACTCCACCATTCATGTCTTTAGTTTCCCT
ACACACTTCTATAATTTTAACCGATATTTCATAATAAAAATCATCAACATTAAATAAATG
TTGGTCTTTCTCAAATATTTCTAGAGGGTCTATACCGATAGATGAGCACATACGTAAAAA
CCTTGATCTAAATTCAGGATTAGCCTTCAGATCTTTATTGTGCTTTTTGGCAAAGTCAAT
TAACCTGCCCTGGAAAACTTCTAATTGCTGCTCGAGTTCCAATGATTGCCTTTCCAGAAT
ATTTGAACTTACATTTTTATAGCTTTCATCATTAAGATGATCAAAAGCAGACAAACCAAA
GTTTGGCATCAACAATTCCTCAGATAGTATTTTATTCTTTATAACTCGTTAGTCTGTAGC
TACTAAATACTTTGTAGCTTAAAGATATTTCAATTTAAGGATCTAGCTAGTAGCGTAGTT
CAAACAACCACATATAAACCTTTCTATATACTATCATTATTTTATCTCTGTTTTTATGTT
TGACCTTTTTTAAACAAGATCCTTGTACGGCTCATTTCAACGTTCGAAGTTGCGAAACAC
GTAAAATAAGCTTACCAAATTCTTATCATCTAATATAGTGTTATCTTAGTCTTCATTACA
GTTACCAGACTAGATTTATTAAGAAACACGATAGAAAGCAATGTCAATTGATGATTTCAA
GCCTGAAAAGTGGACGATATCTTCAAATGAAGCTTTGAAGTTGTCGCTAGTTTCTGAGGA
TAACGCGATACAGTTCTCGCCAACCTTTACCTATCCTATATTTGGCACGGAAGAACAAAT
ATTTGGCTATAAGGACTTAGTTATTCATCTAGCATTCGATGCTATCACATTCAAACCATT
TTTGAATGTGAAGTTTTCATCTAAGTTCGAGGGAAGTGAAGAAGAGTTAGTTAACATAAA
AGAGAAGCTATTGGAATATTTACCGATTGACGATACCATTTATAAGGATGAAGAGAAATG
GATTGACTCTTTTAAGAAGGAACAGGAATCTATAGAGGCCTACAAAAATGACCAAAATAT
TGATGAGTATAAAATCGACAACGCTGACTTTGAGATTTACAAAGTAAACTTACAAGACCC
CAAAATGAAAAGATTTCACAGGAGAATTCAAATATTTTCCTTACTTTTCATTGAAGCAGC
TTCATATATTGATGAAGATGATCCAAAATGGGAAATATTTATAGTACAGACCAAGAAAGA
CAAAAAGTTTGTTGGTTATGCCACCGCATACAATTACTGGTACTACCCTGGTGCTAATAA
TTTTGATAGCGAGAGTAAATACCGCTACAGAGGCAAAATTTCACAATTTCTAATTTTACC
ACCATATCAAGGTAGGGGTCATGGTTCACATCTGTATAACAGCATTGTAAAAAATTGGCG
AAACGATTCCTCAATTTTGGAAATCGTTGTCGAAGACCCAAATGAAAGTTTCGATGACTT
AAGAGATGTTAATGACCTTGAAATGTTATATAAAGATGGATTCTTCAACAAACTACCACA
AGAGAGGCCAATCCCTAATGCATGGATAGAATCTACGAGATTAAAATATAAAATAGAAAA
GCGACAATTCAGTAGGCTACTTGAAATGATATTGCTATCAACTGGTTCAAATAATTTTGA
GTACCAAGTCAAACAGAGACTGCTGATAAAAAATAAAGATGGATTGGAAGGTATGGAAGT
TTCTGATATCAAGGATGCCTTAAATAAATCCTTTGAGTCACTGCGTGAGGATTATGATCG
CATTCTTGGTAAATGTCAATTTTCCAACGACGCGGATGGACCATCAAAAAAGAAAATAAA
GACTTGAAATAGATTTATAGACTACAAAAGCTAACCATCTATATAATTATTGTATTCTCG
ATAAAAAAAATCTTTATATCGATTCTCAATTAGCCTTACAAGCCATTCCAACTGTTATAA
TTAGATATTATAATTTTTCTACTCCATTGTCATGTGATTGAAAAACAGTTAAAATGATTT
TAATAAAAAAAAGTTAATAATACTAATTCTAATATTATATATGACGAAGTATATGGCTTT
GATTTCCGAAATAAGAAAAGCAGTCTATAAATCGAAGTTTGGTTTTGATATACTATTTTT
TCAGGTC

Coding sequence    

>CAGL0L09042g.cds
ATGTCAATTGATGATTTCAAGCCTGAAAAGTGGACGATATCTTCAAATGAAGCTTTGAAG
TTGTCGCTAGTTTCTGAGGATAACGCGATACAGTTCTCGCCAACCTTTACCTATCCTATA
TTTGGCACGGAAGAACAAATATTTGGCTATAAGGACTTAGTTATTCATCTAGCATTCGAT
GCTATCACATTCAAACCATTTTTGAATGTGAAGTTTTCATCTAAGTTCGAGGGAAGTGAA
GAAGAGTTAGTTAACATAAAAGAGAAGCTATTGGAATATTTACCGATTGACGATACCATT
TATAAGGATGAAGAGAAATGGATTGACTCTTTTAAGAAGGAACAGGAATCTATAGAGGCC
TACAAAAATGACCAAAATATTGATGAGTATAAAATCGACAACGCTGACTTTGAGATTTAC
AAAGTAAACTTACAAGACCCCAAAATGAAAAGATTTCACAGGAGAATTCAAATATTTTCC
TTACTTTTCATTGAAGCAGCTTCATATATTGATGAAGATGATCCAAAATGGGAAATATTT
ATAGTACAGACCAAGAAAGACAAAAAGTTTGTTGGTTATGCCACCGCATACAATTACTGG
TACTACCCTGGTGCTAATAATTTTGATAGCGAGAGTAAATACCGCTACAGAGGCAAAATT
TCACAATTTCTAATTTTACCACCATATCAAGGTAGGGGTCATGGTTCACATCTGTATAAC
AGCATTGTAAAAAATTGGCGAAACGATTCCTCAATTTTGGAAATCGTTGTCGAAGACCCA
AATGAAAGTTTCGATGACTTAAGAGATGTTAATGACCTTGAAATGTTATATAAAGATGGA
TTCTTCAACAAACTACCACAAGAGAGGCCAATCCCTAATGCATGGATAGAATCTACGAGA
TTAAAATATAAAATAGAAAAGCGACAATTCAGTAGGCTACTTGAAATGATATTGCTATCA
ACTGGTTCAAATAATTTTGAGTACCAAGTCAAACAGAGACTGCTGATAAAAAATAAAGAT
GGATTGGAAGGTATGGAAGTTTCTGATATCAAGGATGCCTTAAATAAATCCTTTGAGTCA
CTGCGTGAGGATTATGATCGCATTCTTGGTAAATGTCAATTTTCCAACGACGCGGATGGA
CCATCAAAAAAGAAAATAAAGACTTGA

Predicted translation product    

>CAGL0L09042g.aa
MSIDDFKPEKWTISSNEALKLSLVSEDNAIQFSPTFTYPIFGTEEQIFGYKDLVIHLAFD
AITFKPFLNVKFSSKFEGSEEELVNIKEKLLEYLPIDDTIYKDEEKWIDSFKKEQESIEA
YKNDQNIDEYKIDNADFEIYKVNLQDPKMKRFHRRIQIFSLLFIEAASYIDEDDPKWEIF
IVQTKKDKKFVGYATAYNYWYYPGANNFDSESKYRYRGKISQFLILPPYQGRGHGSHLYN
SIVKNWRNDSSILEIVVEDPNESFDDLRDVNDLEMLYKDGFFNKLPQERPIPNAWIESTR
LKYKIEKRQFSRLLEMILLSTGSNNFEYQVKQRLLIKNKDGLEGMEVSDIKDALNKSFES
LREDYDRILGKCQFSNDADGPSKKKIKT*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites