YALI0E04136g
some similarities with uniprot|P31244 Saccharomyces cerevisiae YBR114w RAD16 Protein that recognizes and binds damaged DNA in an ATP-dependent manner (with Rad7p) during nucleotide excision repair
Element type: CDS
Element length: 2880 nucleotides,
on anti-sense strand of
Yali0E: complement(460440..463319).
Other names:
YALI-CDS0509.1
YALI-IPF4218
Coding sequence: 960 codons.
Element length: 2880 nucleotides,
on anti-sense strand of
Yali0E: complement(460440..463319).
Other names:
YALI-CDS0509.1
YALI-IPF4218
Coding sequence: 960 codons.
Database cross references:
EMBL: CR382131
GeneID: 2912181
HOGENOM: Q6C733
Orthologs: strict determination not possible; homologs must be refined manually
EMBL: CR382131
GeneID: 2912181
HOGENOM: Q6C733
Homologs and Orthologs
Homologs in protein families: GL3C0553 GL3C0553.N1Orthologs: strict determination not possible; homologs must be refined manually
Protein YALI0E04136p 
some similarities with uniprot|P31244 Saccharomyces cerevisiae YBR114w RAD16 nucleotide excision repair protein
Protein domain map
Database cross references:
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
KEGG: yli:YALI0E04136g
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_503529.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6C733
UniprotKB: Q6C733_YARLI
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
KEGG: yli:YALI0E04136g
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_503529.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6C733
UniprotKB: Q6C733_YARLI
Sequence data 
>YALI0E04136g.nt TATAGTAATTAAAGAGCTCCGAAGTGGACAATAAGACCTTGGGATTCAATAGAGGCTCCA CTGTCTCCATCCTTTGTCAATTCTTTGTCAAGTCGTAGCGACTGACAGGTATATTCGAGC CTGTTTGTCTCTCAAGATCACATCTCAAACTTCCCACACTCTCGTTACCCCTCAGAAGAT GACCTGAACCCCACTCAACCCCTCCCCGGATTCGGAGATTTAATATGAATATTGGCTCCC ATCTGAGACAGCTCTAGTGTAAACTATTGTAGTTTTGTACTTACCGATATCGTACTGTAG TTAACTATTGTACAGATGTCGGACCCTTCCACACAGTATCCTCTCCAACTTCCTACAAGT CAATATTGTAGCCTGTGTGCACTGCAGGAAGAACCAAAATACAGTAAGTACAGCAACCCG TAATTTGAACGTGGGTAAGAGCTACTGTATCGGTAAGCGTAGCTACTTGTAGTGTAGGTG ACAATAGTTTTCGGATGTGTGTGACAACTCAACTACTTATGTTTTTACACACGTGGTACT CGTACTTGTACGATACGATAGGTGTCTTGATCCCGCTTCTATTCGCCATCAGAACCTTTT TGAACGGTCCTCTTCCCTCAGGTATGTAACCGTGGCAACAACCCCACGTGACGTTGTAAA CAATGGAACCTTAAGCAGCCACTCAACACTTCCACCTATTGTTGAGAATAGCTGAAAAGG AGTTGTACTGGAATCAATCAAAACTTGCAATTGACGTTTCATATATTGTATGGCATGATC ATCCAGCAATATTGACGCGATTTCCCCCGTTTTCAGCCACTTTCAGAGCAAAAACGCCAA TACTTGTCCTGTCCGACGTGCTGCAAATGCACATATTGTCTTTCTTAACTGCATATAGTG CATAGAAGAAAAATACTTAACGATTCGATAGGTCCTCGTCTCTATCGTCACTACACAGCC TGATACCCGCCAGAACATACAGCCAACACAGCCTACCGTCATGGCTCGACTCTCCGCCAA GATGGATGCCCATCCCCCTTCGACCCGTTCTAGACGCCAAATCATGTCTGACAGTGATGA AACTGAATCGTGTTCTGATGATAGCGATCACAGCGATGCTAGCAGTGCAGCCGCTCCTCG AACTACGAAGCCCTGGCAAACCAAGAGCTACCAGGAAAACCTCAGTCGAGGTGGCGACTC CGGCGACGACTCGGAAGAAGGCGATTCTGACGATTCTGATGATTCTGATGATTCTGATGA TTCTGACAATTCTGACGACTCGGAGGATGGTCTGAATGGTATCCTTAATAAGTCTGACAG CCCCAAGGCCATTGCACCAGATTTCCAACGTCACACTTCTCCCGTCAAGCGAATGCCACA ACCTGATCGGCCTGTCGCTACTCCAGTACGACGCAACCGATTGTCCATGTCGCCCAAAAA CTCCGTCTTTGACAGTGCCAACCCTCTGCTCTCAGCATCCAAGAAGGCCACTGCTGGAAC CAAGAAGAACATATTTGATGATGAACCCATGGCACCCATGAAGATCCCCGATTTGCATGC AGCCCCCAAACCCAAGCCCAAGTCTGATCTTTCTGCTGCTGGACTGCGGCGCCCATCCAA ACCCATTTCCACGCCCATCAAGACCCCCAGCAACTCAATGGAGTCTGCAAACAGAGCCGC CTACGAGGAGTATAAGCAGTCGATGAAACCTCGAGTTCCCTCCAGATCGGCCTTCTCCAT TTCCTCGCCCATCAAGCAGAACATTGAGCATCAGCAGCGATTCCAGCAAGCTTCTGTCGG ACTCTCTGAGTCTGAACTGGCCATCATGTCTGGTGCTGGTAAGCAGCAGTCTGGATACGA TCGATATAACCAGCCCAACAACACTCCACGTGTCAACTATGGTGTTCGTGGCCCTCCCGG CCTTGTTAAGGACGCTGCCCTAACCGGCGTGAACGCCCACTCGTCTGCCCCCGCACCCGA GTTCAACTACACCCCTGCATCTACGGCTGGAGATGTTGAAGCGTTACTTCCCTATATTCT AAGCAATAATGACGAAGACGAGGAGAGCTCTGATGAGGAGGAAGAGGAGCCAGAAGACAA GACAAAGAAGGGCGCTTCTAGCAAGAAGGCCACCTCTCGAAGGTCTGAGGAGCGAGCTGC TCAGGAGAAGGAGATGGGAGATCGAGGTCATCTCGTCAGAAAACATTGGGACAGTGAGTT CCTCAAGAACATGCGGTTCAAGGGTATTGTTCCTGGACTCTCTGTGACTCTTATGCAACA TCAGCGGAAGGGTGTTCGGTGGCTTCTTGGACGAGAGGTCCCCACCAACAAACACAAGGG GGGCATGCTTTGTGATGACATGGGTCTAGGTAAGACTGTTCAGTCTATCAGTTTGATTCT ATCTAACCCTCGAGGTCTGCATGCGAAGACCGCTTCCAAGGACGGTGAGCCTCGAGAGTG CAAGGCTACCCTGGTCATCGCACCACTGTCCCTGGCCACTCAGTGGGAGCAGGAAATCAA AGATAAGTCTCCCGGTCTGAGAGTTCTGAAGCACCACGGTCCTGGTCGAACCTCTGATTC TCATGTCTTTAGAGACTATGATGTCATTGTCACCACCTACCAGACACTCTCCTCCGAGAT CAAGAAGGACAACTCTCCTCTTCTGGGAGTCAAGTTCTGGCGAGTCATTCTTGATGAAGC CCATACCATCAAGAACAAGCGGTCTCAGATGTACCAGGCTGCCTGCCGAGTCTTCGCTGA TCGAAGATGGTGTCTCACAGGTACTCCCGTTCAGAACAATATTGATGAACTTCAAGCCTT GCTTCAGTTCATTCGAGTTCCTCCTTACGATGATCCTGTCGTATGGAAGGAACAGATCTC TGGGCCTCTTTCCAAGGAGGGAGCCGCTCGAACTGCAATGGCGAAGCTGCATCTTGTGCT TTCTGGTCTCATGTTGAGACGAACAAAAGCTGTTCTCAAGGACTCCAAGATGAACATGAA GGCTCGACGAGTTCACCAGGTTGACATTGAGTTCCAGCCTGATGAGAGAGCTTTCTACGA CGCCGTTAACGAGCGAATTGGAAGTCAGATTGACACCATTTCCAACGGATCCATGATGCA GGCCCTGACTCTGCTCTTGCGTTTGCGGCAGATTTGTGACCATCGATATCTTGTCTCCAA GGAAGCTGCCACGGGTGGGCATCTCGACGAATTTGAGGGTTACTCTGCTGAAGCCGACGA TGGCAAAGACCTCGATGATCTTGCTGATATGTTTGCTGACATGGGTATGGATGGAGCTGG AAGCTCCAGTACTTCTTCTGGAGACAACAAGGTGTCTATTAACGGCAAGGATGTACATGC CAGTGCCAAGGTTGTCAAGCTTCTGGAGCTTCTAAAGGCTGATCCTCGAAAGACCATTGT TTTCAGTCAGTTCACCAAGTTCTTTGATGTTCTTGAACCATTCCTGATTCGAGAGAACAT CCGTTACGTAAAGTACGATGGAAGCATGCCTATTAGGAAGCGAGACGCTGCCCTCGCTAC TCTACGAGCTGATCCCGATACTACAGTGCTGCTCTGTTCGCTAAAGTGTGGTGCTCTCGG TCTCAATCTGACTTGTGCCAACCGAGTGGTGCTCTTGGACCCCTGGTGGAACCCCATGGT CTCCGAGCAAGCTATTGATCGAGTCCACAGAATTGGTCAGACCGTCGATGTGGATGTCTA CGAGTTTAGCGTGGTTGATTCTGTGGAAAAGAAGATCATGCAGCTTCAGGACAAGAAGCG TAAGCTGGCCGGCTCTGTCATCAACGGTGACAGAGAGCTCATGAAGGAGGTCAGCACTCT GTCTCGGGCAGAGCTGCTCTTTGTGTTTGGAAGAGCTTGAAGGAACTGCAATTGCTGTTT GATGTTTATTTCCAATAGAGTACTGTAATTATTTATTGATGATGTTATCCAGACTTACAC ACTATGGCGTGTACAGTAGGAAGTGGTGAAGGACCGCGACGTTCAATGTACTTCTACAAG TTCGATATCTTTCTGCAGAACGATTTTAATGCAATGGCGCTTGTACAAGTACTTGTACAA GTAACAGCTTGTGCTTGTTTTCAGATTCTCCAATGAAACACTGCTTCCAGATAGATGAAA TCGAAAATTTGTTTTTACGCTACCAGAATGGTGGATGTAT
>YALI0E04136g.cds ATGGCTCGACTCTCCGCCAAGATGGATGCCCATCCCCCTTCGACCCGTTCTAGACGCCAA ATCATGTCTGACAGTGATGAAACTGAATCGTGTTCTGATGATAGCGATCACAGCGATGCT AGCAGTGCAGCCGCTCCTCGAACTACGAAGCCCTGGCAAACCAAGAGCTACCAGGAAAAC CTCAGTCGAGGTGGCGACTCCGGCGACGACTCGGAAGAAGGCGATTCTGACGATTCTGAT GATTCTGATGATTCTGATGATTCTGACAATTCTGACGACTCGGAGGATGGTCTGAATGGT ATCCTTAATAAGTCTGACAGCCCCAAGGCCATTGCACCAGATTTCCAACGTCACACTTCT CCCGTCAAGCGAATGCCACAACCTGATCGGCCTGTCGCTACTCCAGTACGACGCAACCGA TTGTCCATGTCGCCCAAAAACTCCGTCTTTGACAGTGCCAACCCTCTGCTCTCAGCATCC AAGAAGGCCACTGCTGGAACCAAGAAGAACATATTTGATGATGAACCCATGGCACCCATG AAGATCCCCGATTTGCATGCAGCCCCCAAACCCAAGCCCAAGTCTGATCTTTCTGCTGCT GGACTGCGGCGCCCATCCAAACCCATTTCCACGCCCATCAAGACCCCCAGCAACTCAATG GAGTCTGCAAACAGAGCCGCCTACGAGGAGTATAAGCAGTCGATGAAACCTCGAGTTCCC TCCAGATCGGCCTTCTCCATTTCCTCGCCCATCAAGCAGAACATTGAGCATCAGCAGCGA TTCCAGCAAGCTTCTGTCGGACTCTCTGAGTCTGAACTGGCCATCATGTCTGGTGCTGGT AAGCAGCAGTCTGGATACGATCGATATAACCAGCCCAACAACACTCCACGTGTCAACTAT GGTGTTCGTGGCCCTCCCGGCCTTGTTAAGGACGCTGCCCTAACCGGCGTGAACGCCCAC TCGTCTGCCCCCGCACCCGAGTTCAACTACACCCCTGCATCTACGGCTGGAGATGTTGAA GCGTTACTTCCCTATATTCTAAGCAATAATGACGAAGACGAGGAGAGCTCTGATGAGGAG GAAGAGGAGCCAGAAGACAAGACAAAGAAGGGCGCTTCTAGCAAGAAGGCCACCTCTCGA AGGTCTGAGGAGCGAGCTGCTCAGGAGAAGGAGATGGGAGATCGAGGTCATCTCGTCAGA AAACATTGGGACAGTGAGTTCCTCAAGAACATGCGGTTCAAGGGTATTGTTCCTGGACTC TCTGTGACTCTTATGCAACATCAGCGGAAGGGTGTTCGGTGGCTTCTTGGACGAGAGGTC CCCACCAACAAACACAAGGGGGGCATGCTTTGTGATGACATGGGTCTAGGTAAGACTGTT CAGTCTATCAGTTTGATTCTATCTAACCCTCGAGGTCTGCATGCGAAGACCGCTTCCAAG GACGGTGAGCCTCGAGAGTGCAAGGCTACCCTGGTCATCGCACCACTGTCCCTGGCCACT CAGTGGGAGCAGGAAATCAAAGATAAGTCTCCCGGTCTGAGAGTTCTGAAGCACCACGGT CCTGGTCGAACCTCTGATTCTCATGTCTTTAGAGACTATGATGTCATTGTCACCACCTAC CAGACACTCTCCTCCGAGATCAAGAAGGACAACTCTCCTCTTCTGGGAGTCAAGTTCTGG CGAGTCATTCTTGATGAAGCCCATACCATCAAGAACAAGCGGTCTCAGATGTACCAGGCT GCCTGCCGAGTCTTCGCTGATCGAAGATGGTGTCTCACAGGTACTCCCGTTCAGAACAAT ATTGATGAACTTCAAGCCTTGCTTCAGTTCATTCGAGTTCCTCCTTACGATGATCCTGTC GTATGGAAGGAACAGATCTCTGGGCCTCTTTCCAAGGAGGGAGCCGCTCGAACTGCAATG GCGAAGCTGCATCTTGTGCTTTCTGGTCTCATGTTGAGACGAACAAAAGCTGTTCTCAAG GACTCCAAGATGAACATGAAGGCTCGACGAGTTCACCAGGTTGACATTGAGTTCCAGCCT GATGAGAGAGCTTTCTACGACGCCGTTAACGAGCGAATTGGAAGTCAGATTGACACCATT TCCAACGGATCCATGATGCAGGCCCTGACTCTGCTCTTGCGTTTGCGGCAGATTTGTGAC CATCGATATCTTGTCTCCAAGGAAGCTGCCACGGGTGGGCATCTCGACGAATTTGAGGGT TACTCTGCTGAAGCCGACGATGGCAAAGACCTCGATGATCTTGCTGATATGTTTGCTGAC ATGGGTATGGATGGAGCTGGAAGCTCCAGTACTTCTTCTGGAGACAACAAGGTGTCTATT AACGGCAAGGATGTACATGCCAGTGCCAAGGTTGTCAAGCTTCTGGAGCTTCTAAAGGCT GATCCTCGAAAGACCATTGTTTTCAGTCAGTTCACCAAGTTCTTTGATGTTCTTGAACCA TTCCTGATTCGAGAGAACATCCGTTACGTAAAGTACGATGGAAGCATGCCTATTAGGAAG CGAGACGCTGCCCTCGCTACTCTACGAGCTGATCCCGATACTACAGTGCTGCTCTGTTCG CTAAAGTGTGGTGCTCTCGGTCTCAATCTGACTTGTGCCAACCGAGTGGTGCTCTTGGAC CCCTGGTGGAACCCCATGGTCTCCGAGCAAGCTATTGATCGAGTCCACAGAATTGGTCAG ACCGTCGATGTGGATGTCTACGAGTTTAGCGTGGTTGATTCTGTGGAAAAGAAGATCATG CAGCTTCAGGACAAGAAGCGTAAGCTGGCCGGCTCTGTCATCAACGGTGACAGAGAGCTC ATGAAGGAGGTCAGCACTCTGTCTCGGGCAGAGCTGCTCTTTGTGTTTGGAAGAGCTTGA
>YALI0E04136g.aa MARLSAKMDAHPPSTRSRRQIMSDSDETESCSDDSDHSDASSAAAPRTTKPWQTKSYQEN LSRGGDSGDDSEEGDSDDSDDSDDSDDSDNSDDSEDGLNGILNKSDSPKAIAPDFQRHTS PVKRMPQPDRPVATPVRRNRLSMSPKNSVFDSANPLLSASKKATAGTKKNIFDDEPMAPM KIPDLHAAPKPKPKSDLSAAGLRRPSKPISTPIKTPSNSMESANRAAYEEYKQSMKPRVP SRSAFSISSPIKQNIEHQQRFQQASVGLSESELAIMSGAGKQQSGYDRYNQPNNTPRVNY GVRGPPGLVKDAALTGVNAHSSAPAPEFNYTPASTAGDVEALLPYILSNNDEDEESSDEE EEEPEDKTKKGASSKKATSRRSEERAAQEKEMGDRGHLVRKHWDSEFLKNMRFKGIVPGL SVTLMQHQRKGVRWLLGREVPTNKHKGGMLCDDMGLGKTVQSISLILSNPRGLHAKTASK DGEPRECKATLVIAPLSLATQWEQEIKDKSPGLRVLKHHGPGRTSDSHVFRDYDVIVTTY QTLSSEIKKDNSPLLGVKFWRVILDEAHTIKNKRSQMYQAACRVFADRRWCLTGTPVQNN IDELQALLQFIRVPPYDDPVVWKEQISGPLSKEGAARTAMAKLHLVLSGLMLRRTKAVLK DSKMNMKARRVHQVDIEFQPDERAFYDAVNERIGSQIDTISNGSMMQALTLLLRLRQICD HRYLVSKEAATGGHLDEFEGYSAEADDGKDLDDLADMFADMGMDGAGSSSTSSGDNKVSI NGKDVHASAKVVKLLELLKADPRKTIVFSQFTKFFDVLEPFLIRENIRYVKYDGSMPIRK RDAALATLRADPDTTVLLCSLKCGALGLNLTCANRVVLLDPWWNPMVSEQAIDRVHRIGQ TVDVDVYEFSVVDSVEKKIMQLQDKKRKLAGSVINGDRELMKEVSTLSRAELLFVFGRA*
Legend and notes 
Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.
Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.
Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
Sequences
| Color | Nucleotide sequence and Coding sequence | Predicted translation product |
| RED | start and stop codons | Initial methionine and sequence end |
| BLUE | coding sequence | protein sequence |
| grey | non-coding sequence (upstream, downstream or intron) | |
| grey | donor and acceptor splicing sites |
Home
URL: http://www.genolevures.org/elt/YALI/YALI0E04136g