YALI0E04136g


some similarities with uniprot|P31244 Saccharomyces cerevisiae YBR114w RAD16 Protein that recognizes and binds damaged DNA in an ATP-dependent manner (with Rad7p) during nucleotide excision repair

Genomic environment map

Element type: CDS
Element length: 2880 nucleotides, on anti-sense strand of Yali0E: complement(460440..463319).
Other names:
YALI-CDS0509.1
YALI-IPF4218
Coding sequence: 960 codons.
Database cross references:
EMBL: CR382131
GeneID: 2912181
HOGENOM: Q6C733
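
The location string in this record appears to follow the feature-location style used in EMBL entries: 1-based, inclusive coordinates, with complement(...) marking the reverse strand. The following is a minimal Python sketch, under that assumption, of how such a string could be parsed and the element length recovered. The names parse_location and reverse_complement are illustrative only and do not belong to any Génolevures tool.

```python
import re

# Accepts only the two simple forms "a..b" and "complement(a..b)".
_LOCATION = re.compile(r"^(?:complement\((\d+)\.\.(\d+)\)|(\d+)\.\.(\d+))$")


def parse_location(text):
    """Return (start, end, strand) for a simple location string."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unsupported location: {text!r}")
    if match.group(1) is not None:
        # complement(...) form: feature lies on the reverse strand
        return int(match.group(1)), int(match.group(2)), "-"
    return int(match.group(3)), int(match.group(4)), "+"


def reverse_complement(seq):
    """Reverse-complement a DNA string (A/C/G/T only, case preserved)."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


start, end, strand = parse_location("complement(460440..463319)")
length = end - start + 1  # inclusive coordinates
print(start, end, strand, length)
```

With the coordinates of this element, the computed length agrees with the 2880 nucleotides stated in the record.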

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0553 GL3C0553.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0E04136p  


some similarities with uniprot|P31244 Saccharomyces cerevisiae YBR114w RAD16 nucleotide excision repair protein

Protein domain map

Protein length: 959 amino acids
Protein family: GL3C0553
Database cross references:
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
KEGG: yli:YALI0E04136g
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_503529.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6C733
UniProtKB: Q6C733_YARLI

Computed results for YALI0E04136p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0005524 ATP binding
GO:0004386 helicase activity
GO:0003677 DNA binding

Sequence data  


Nucleotide sequence    

>YALI0E04136g.nt
TATAGTAATTAAAGAGCTCCGAAGTGGACAATAAGACCTTGGGATTCAATAGAGGCTCCA
CTGTCTCCATCCTTTGTCAATTCTTTGTCAAGTCGTAGCGACTGACAGGTATATTCGAGC
CTGTTTGTCTCTCAAGATCACATCTCAAACTTCCCACACTCTCGTTACCCCTCAGAAGAT
GACCTGAACCCCACTCAACCCCTCCCCGGATTCGGAGATTTAATATGAATATTGGCTCCC
ATCTGAGACAGCTCTAGTGTAAACTATTGTAGTTTTGTACTTACCGATATCGTACTGTAG
TTAACTATTGTACAGATGTCGGACCCTTCCACACAGTATCCTCTCCAACTTCCTACAAGT
CAATATTGTAGCCTGTGTGCACTGCAGGAAGAACCAAAATACAGTAAGTACAGCAACCCG
TAATTTGAACGTGGGTAAGAGCTACTGTATCGGTAAGCGTAGCTACTTGTAGTGTAGGTG
ACAATAGTTTTCGGATGTGTGTGACAACTCAACTACTTATGTTTTTACACACGTGGTACT
CGTACTTGTACGATACGATAGGTGTCTTGATCCCGCTTCTATTCGCCATCAGAACCTTTT
TGAACGGTCCTCTTCCCTCAGGTATGTAACCGTGGCAACAACCCCACGTGACGTTGTAAA
CAATGGAACCTTAAGCAGCCACTCAACACTTCCACCTATTGTTGAGAATAGCTGAAAAGG
AGTTGTACTGGAATCAATCAAAACTTGCAATTGACGTTTCATATATTGTATGGCATGATC
ATCCAGCAATATTGACGCGATTTCCCCCGTTTTCAGCCACTTTCAGAGCAAAAACGCCAA
TACTTGTCCTGTCCGACGTGCTGCAAATGCACATATTGTCTTTCTTAACTGCATATAGTG
CATAGAAGAAAAATACTTAACGATTCGATAGGTCCTCGTCTCTATCGTCACTACACAGCC
TGATACCCGCCAGAACATACAGCCAACACAGCCTACCGTCATGGCTCGACTCTCCGCCAA
GATGGATGCCCATCCCCCTTCGACCCGTTCTAGACGCCAAATCATGTCTGACAGTGATGA
AACTGAATCGTGTTCTGATGATAGCGATCACAGCGATGCTAGCAGTGCAGCCGCTCCTCG
AACTACGAAGCCCTGGCAAACCAAGAGCTACCAGGAAAACCTCAGTCGAGGTGGCGACTC
CGGCGACGACTCGGAAGAAGGCGATTCTGACGATTCTGATGATTCTGATGATTCTGATGA
TTCTGACAATTCTGACGACTCGGAGGATGGTCTGAATGGTATCCTTAATAAGTCTGACAG
CCCCAAGGCCATTGCACCAGATTTCCAACGTCACACTTCTCCCGTCAAGCGAATGCCACA
ACCTGATCGGCCTGTCGCTACTCCAGTACGACGCAACCGATTGTCCATGTCGCCCAAAAA
CTCCGTCTTTGACAGTGCCAACCCTCTGCTCTCAGCATCCAAGAAGGCCACTGCTGGAAC
CAAGAAGAACATATTTGATGATGAACCCATGGCACCCATGAAGATCCCCGATTTGCATGC
AGCCCCCAAACCCAAGCCCAAGTCTGATCTTTCTGCTGCTGGACTGCGGCGCCCATCCAA
ACCCATTTCCACGCCCATCAAGACCCCCAGCAACTCAATGGAGTCTGCAAACAGAGCCGC
CTACGAGGAGTATAAGCAGTCGATGAAACCTCGAGTTCCCTCCAGATCGGCCTTCTCCAT
TTCCTCGCCCATCAAGCAGAACATTGAGCATCAGCAGCGATTCCAGCAAGCTTCTGTCGG
ACTCTCTGAGTCTGAACTGGCCATCATGTCTGGTGCTGGTAAGCAGCAGTCTGGATACGA
TCGATATAACCAGCCCAACAACACTCCACGTGTCAACTATGGTGTTCGTGGCCCTCCCGG
CCTTGTTAAGGACGCTGCCCTAACCGGCGTGAACGCCCACTCGTCTGCCCCCGCACCCGA
GTTCAACTACACCCCTGCATCTACGGCTGGAGATGTTGAAGCGTTACTTCCCTATATTCT
AAGCAATAATGACGAAGACGAGGAGAGCTCTGATGAGGAGGAAGAGGAGCCAGAAGACAA
GACAAAGAAGGGCGCTTCTAGCAAGAAGGCCACCTCTCGAAGGTCTGAGGAGCGAGCTGC
TCAGGAGAAGGAGATGGGAGATCGAGGTCATCTCGTCAGAAAACATTGGGACAGTGAGTT
CCTCAAGAACATGCGGTTCAAGGGTATTGTTCCTGGACTCTCTGTGACTCTTATGCAACA
TCAGCGGAAGGGTGTTCGGTGGCTTCTTGGACGAGAGGTCCCCACCAACAAACACAAGGG
GGGCATGCTTTGTGATGACATGGGTCTAGGTAAGACTGTTCAGTCTATCAGTTTGATTCT
ATCTAACCCTCGAGGTCTGCATGCGAAGACCGCTTCCAAGGACGGTGAGCCTCGAGAGTG
CAAGGCTACCCTGGTCATCGCACCACTGTCCCTGGCCACTCAGTGGGAGCAGGAAATCAA
AGATAAGTCTCCCGGTCTGAGAGTTCTGAAGCACCACGGTCCTGGTCGAACCTCTGATTC
TCATGTCTTTAGAGACTATGATGTCATTGTCACCACCTACCAGACACTCTCCTCCGAGAT
CAAGAAGGACAACTCTCCTCTTCTGGGAGTCAAGTTCTGGCGAGTCATTCTTGATGAAGC
CCATACCATCAAGAACAAGCGGTCTCAGATGTACCAGGCTGCCTGCCGAGTCTTCGCTGA
TCGAAGATGGTGTCTCACAGGTACTCCCGTTCAGAACAATATTGATGAACTTCAAGCCTT
GCTTCAGTTCATTCGAGTTCCTCCTTACGATGATCCTGTCGTATGGAAGGAACAGATCTC
TGGGCCTCTTTCCAAGGAGGGAGCCGCTCGAACTGCAATGGCGAAGCTGCATCTTGTGCT
TTCTGGTCTCATGTTGAGACGAACAAAAGCTGTTCTCAAGGACTCCAAGATGAACATGAA
GGCTCGACGAGTTCACCAGGTTGACATTGAGTTCCAGCCTGATGAGAGAGCTTTCTACGA
CGCCGTTAACGAGCGAATTGGAAGTCAGATTGACACCATTTCCAACGGATCCATGATGCA
GGCCCTGACTCTGCTCTTGCGTTTGCGGCAGATTTGTGACCATCGATATCTTGTCTCCAA
GGAAGCTGCCACGGGTGGGCATCTCGACGAATTTGAGGGTTACTCTGCTGAAGCCGACGA
TGGCAAAGACCTCGATGATCTTGCTGATATGTTTGCTGACATGGGTATGGATGGAGCTGG
AAGCTCCAGTACTTCTTCTGGAGACAACAAGGTGTCTATTAACGGCAAGGATGTACATGC
CAGTGCCAAGGTTGTCAAGCTTCTGGAGCTTCTAAAGGCTGATCCTCGAAAGACCATTGT
TTTCAGTCAGTTCACCAAGTTCTTTGATGTTCTTGAACCATTCCTGATTCGAGAGAACAT
CCGTTACGTAAAGTACGATGGAAGCATGCCTATTAGGAAGCGAGACGCTGCCCTCGCTAC
TCTACGAGCTGATCCCGATACTACAGTGCTGCTCTGTTCGCTAAAGTGTGGTGCTCTCGG
TCTCAATCTGACTTGTGCCAACCGAGTGGTGCTCTTGGACCCCTGGTGGAACCCCATGGT
CTCCGAGCAAGCTATTGATCGAGTCCACAGAATTGGTCAGACCGTCGATGTGGATGTCTA
CGAGTTTAGCGTGGTTGATTCTGTGGAAAAGAAGATCATGCAGCTTCAGGACAAGAAGCG
TAAGCTGGCCGGCTCTGTCATCAACGGTGACAGAGAGCTCATGAAGGAGGTCAGCACTCT
GTCTCGGGCAGAGCTGCTCTTTGTGTTTGGAAGAGCTTGAAGGAACTGCAATTGCTGTTT
GATGTTTATTTCCAATAGAGTACTGTAATTATTTATTGATGATGTTATCCAGACTTACAC
ACTATGGCGTGTACAGTAGGAAGTGGTGAAGGACCGCGACGTTCAATGTACTTCTACAAG
TTCGATATCTTTCTGCAGAACGATTTTAATGCAATGGCGCTTGTACAAGTACTTGTACAA
GTAACAGCTTGTGCTTGTTTTCAGATTCTCCAATGAAACACTGCTTCCAGATAGATGAAA
TCGAAAATTTGTTTTTACGCTACCAGAATGGTGGATGTAT

Coding sequence    

>YALI0E04136g.cds
ATGGCTCGACTCTCCGCCAAGATGGATGCCCATCCCCCTTCGACCCGTTCTAGACGCCAA
ATCATGTCTGACAGTGATGAAACTGAATCGTGTTCTGATGATAGCGATCACAGCGATGCT
AGCAGTGCAGCCGCTCCTCGAACTACGAAGCCCTGGCAAACCAAGAGCTACCAGGAAAAC
CTCAGTCGAGGTGGCGACTCCGGCGACGACTCGGAAGAAGGCGATTCTGACGATTCTGAT
GATTCTGATGATTCTGATGATTCTGACAATTCTGACGACTCGGAGGATGGTCTGAATGGT
ATCCTTAATAAGTCTGACAGCCCCAAGGCCATTGCACCAGATTTCCAACGTCACACTTCT
CCCGTCAAGCGAATGCCACAACCTGATCGGCCTGTCGCTACTCCAGTACGACGCAACCGA
TTGTCCATGTCGCCCAAAAACTCCGTCTTTGACAGTGCCAACCCTCTGCTCTCAGCATCC
AAGAAGGCCACTGCTGGAACCAAGAAGAACATATTTGATGATGAACCCATGGCACCCATG
AAGATCCCCGATTTGCATGCAGCCCCCAAACCCAAGCCCAAGTCTGATCTTTCTGCTGCT
GGACTGCGGCGCCCATCCAAACCCATTTCCACGCCCATCAAGACCCCCAGCAACTCAATG
GAGTCTGCAAACAGAGCCGCCTACGAGGAGTATAAGCAGTCGATGAAACCTCGAGTTCCC
TCCAGATCGGCCTTCTCCATTTCCTCGCCCATCAAGCAGAACATTGAGCATCAGCAGCGA
TTCCAGCAAGCTTCTGTCGGACTCTCTGAGTCTGAACTGGCCATCATGTCTGGTGCTGGT
AAGCAGCAGTCTGGATACGATCGATATAACCAGCCCAACAACACTCCACGTGTCAACTAT
GGTGTTCGTGGCCCTCCCGGCCTTGTTAAGGACGCTGCCCTAACCGGCGTGAACGCCCAC
TCGTCTGCCCCCGCACCCGAGTTCAACTACACCCCTGCATCTACGGCTGGAGATGTTGAA
GCGTTACTTCCCTATATTCTAAGCAATAATGACGAAGACGAGGAGAGCTCTGATGAGGAG
GAAGAGGAGCCAGAAGACAAGACAAAGAAGGGCGCTTCTAGCAAGAAGGCCACCTCTCGA
AGGTCTGAGGAGCGAGCTGCTCAGGAGAAGGAGATGGGAGATCGAGGTCATCTCGTCAGA
AAACATTGGGACAGTGAGTTCCTCAAGAACATGCGGTTCAAGGGTATTGTTCCTGGACTC
TCTGTGACTCTTATGCAACATCAGCGGAAGGGTGTTCGGTGGCTTCTTGGACGAGAGGTC
CCCACCAACAAACACAAGGGGGGCATGCTTTGTGATGACATGGGTCTAGGTAAGACTGTT
CAGTCTATCAGTTTGATTCTATCTAACCCTCGAGGTCTGCATGCGAAGACCGCTTCCAAG
GACGGTGAGCCTCGAGAGTGCAAGGCTACCCTGGTCATCGCACCACTGTCCCTGGCCACT
CAGTGGGAGCAGGAAATCAAAGATAAGTCTCCCGGTCTGAGAGTTCTGAAGCACCACGGT
CCTGGTCGAACCTCTGATTCTCATGTCTTTAGAGACTATGATGTCATTGTCACCACCTAC
CAGACACTCTCCTCCGAGATCAAGAAGGACAACTCTCCTCTTCTGGGAGTCAAGTTCTGG
CGAGTCATTCTTGATGAAGCCCATACCATCAAGAACAAGCGGTCTCAGATGTACCAGGCT
GCCTGCCGAGTCTTCGCTGATCGAAGATGGTGTCTCACAGGTACTCCCGTTCAGAACAAT
ATTGATGAACTTCAAGCCTTGCTTCAGTTCATTCGAGTTCCTCCTTACGATGATCCTGTC
GTATGGAAGGAACAGATCTCTGGGCCTCTTTCCAAGGAGGGAGCCGCTCGAACTGCAATG
GCGAAGCTGCATCTTGTGCTTTCTGGTCTCATGTTGAGACGAACAAAAGCTGTTCTCAAG
GACTCCAAGATGAACATGAAGGCTCGACGAGTTCACCAGGTTGACATTGAGTTCCAGCCT
GATGAGAGAGCTTTCTACGACGCCGTTAACGAGCGAATTGGAAGTCAGATTGACACCATT
TCCAACGGATCCATGATGCAGGCCCTGACTCTGCTCTTGCGTTTGCGGCAGATTTGTGAC
CATCGATATCTTGTCTCCAAGGAAGCTGCCACGGGTGGGCATCTCGACGAATTTGAGGGT
TACTCTGCTGAAGCCGACGATGGCAAAGACCTCGATGATCTTGCTGATATGTTTGCTGAC
ATGGGTATGGATGGAGCTGGAAGCTCCAGTACTTCTTCTGGAGACAACAAGGTGTCTATT
AACGGCAAGGATGTACATGCCAGTGCCAAGGTTGTCAAGCTTCTGGAGCTTCTAAAGGCT
GATCCTCGAAAGACCATTGTTTTCAGTCAGTTCACCAAGTTCTTTGATGTTCTTGAACCA
TTCCTGATTCGAGAGAACATCCGTTACGTAAAGTACGATGGAAGCATGCCTATTAGGAAG
CGAGACGCTGCCCTCGCTACTCTACGAGCTGATCCCGATACTACAGTGCTGCTCTGTTCG
CTAAAGTGTGGTGCTCTCGGTCTCAATCTGACTTGTGCCAACCGAGTGGTGCTCTTGGAC
CCCTGGTGGAACCCCATGGTCTCCGAGCAAGCTATTGATCGAGTCCACAGAATTGGTCAG
ACCGTCGATGTGGATGTCTACGAGTTTAGCGTGGTTGATTCTGTGGAAAAGAAGATCATG
CAGCTTCAGGACAAGAAGCGTAAGCTGGCCGGCTCTGTCATCAACGGTGACAGAGAGCTC
ATGAAGGAGGTCAGCACTCTGTCTCGGGCAGAGCTGCTCTTTGTGTTTGGAAGAGCTTGA


Predicted translation product    

>YALI0E04136g.aa
MARLSAKMDAHPPSTRSRRQIMSDSDETESCSDDSDHSDASSAAAPRTTKPWQTKSYQEN
LSRGGDSGDDSEEGDSDDSDDSDDSDDSDNSDDSEDGLNGILNKSDSPKAIAPDFQRHTS
PVKRMPQPDRPVATPVRRNRLSMSPKNSVFDSANPLLSASKKATAGTKKNIFDDEPMAPM
KIPDLHAAPKPKPKSDLSAAGLRRPSKPISTPIKTPSNSMESANRAAYEEYKQSMKPRVP
SRSAFSISSPIKQNIEHQQRFQQASVGLSESELAIMSGAGKQQSGYDRYNQPNNTPRVNY
GVRGPPGLVKDAALTGVNAHSSAPAPEFNYTPASTAGDVEALLPYILSNNDEDEESSDEE
EEEPEDKTKKGASSKKATSRRSEERAAQEKEMGDRGHLVRKHWDSEFLKNMRFKGIVPGL
SVTLMQHQRKGVRWLLGREVPTNKHKGGMLCDDMGLGKTVQSISLILSNPRGLHAKTASK
DGEPRECKATLVIAPLSLATQWEQEIKDKSPGLRVLKHHGPGRTSDSHVFRDYDVIVTTY
QTLSSEIKKDNSPLLGVKFWRVILDEAHTIKNKRSQMYQAACRVFADRRWCLTGTPVQNN
IDELQALLQFIRVPPYDDPVVWKEQISGPLSKEGAARTAMAKLHLVLSGLMLRRTKAVLK
DSKMNMKARRVHQVDIEFQPDERAFYDAVNERIGSQIDTISNGSMMQALTLLLRLRQICD
HRYLVSKEAATGGHLDEFEGYSAEADDGKDLDDLADMFADMGMDGAGSSSTSSGDNKVSI
NGKDVHASAKVVKLLELLKADPRKTIVFSQFTKFFDVLEPFLIRENIRYVKYDGSMPIRK
RDAALATLRADPDTTVLLCSLKCGALGLNLTCANRVVLLDPWWNPMVSEQAIDRVHRIGQ
TVDVDVYEFSVVDSVEKKIMQLQDKKRKLAGSVINGDRELMKEVSTLSRAELLFVFGRA*
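
The relationship between the coding sequence and the predicted translation product can be checked on short fragments. The sketch below assumes the standard genetic code (an assumption, the page does not state which translation table was used) and translates the first 30 and last 12 nucleotides of the coding sequence listed above. The function name translate is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the page does not state the translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


first_codons = "ATGGCTCGACTCTCCGCCAAGATGGATGCC"  # first 30 nt of the CDS
last_codons = "GGAAGAGCTTGA"                     # last 12 nt of the CDS

print(translate(first_codons))  # MARLSAKMDA
print(translate(last_codons))   # GRA*
```

Both fragments match the start and the end of the predicted translation product shown above, including the terminal "*" for the stop codon.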





Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
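
As a worked check of this convention, the short Python sketch below derives the codon count and the protein length from the element length. The function name expected_lengths is illustrative.

```python
def expected_lengths(nucleotides):
    """Return (codons, protein_length) for a CDS length given in nucleotides.

    The codon count includes the stop codon, so the protein is one unit shorter.
    """
    if nucleotides % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = nucleotides // 3
    return codons, codons - 1


print(expected_lengths(2880))  # (960, 959)
```

For this element, 2880 nucleotides give 960 codons and a protein of 959 amino acids, as listed on the page.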

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
The Genemark protein-coding potential was computed over a window running from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the open reading frame protein-coding potential appears on frame #3.
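
The window described here can be split back into its three parts by position alone. The Python sketch below assumes a contiguous open reading frame (no introns) and the flank sizes of 1000 and 300 nucleotides given in the note. The function name split_window and the toy sequence are illustrative, not actual Genemark input.

```python
UPSTREAM = 1000    # nucleotides before the open reading frame
DOWNSTREAM = 300   # nucleotides after the open reading frame


def split_window(window, upstream=UPSTREAM, downstream=DOWNSTREAM):
    """Split a flanked sequence into (upstream, orf, downstream) parts."""
    if len(window) <= upstream + downstream:
        raise ValueError("window is too short to contain an open reading frame")
    orf_end = len(window) - downstream
    return window[:upstream], window[upstream:orf_end], window[orf_end:]


# Toy example: a 9-nucleotide ORF between artificial flanks.
toy = "A" * UPSTREAM + "ATGAAATGA" + "C" * DOWNSTREAM
up, orf, down = split_window(toy)
print(len(up), orf, len(down))
```

For an element of 2880 nucleotides, a window built this way would span 1000 + 2880 + 300 = 4180 nucleotides.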

Sequences
Color | Nucleotide sequence and Coding sequence | Predicted translation product
RED | start and stop codons | Initial methionine and sequence end
BLUE | coding sequence | protein sequence
grey | non-coding sequence (upstream, downstream or intron) |
grey | donor and acceptor splicing sites |