YALI0F04356g


some similarities with uniprot|P43610 Saccharomyces cerevisiae YFR038w IRC5 Putative ATPase containing the DEAD/H helicase-related sequence motif

Genomic environment map

Element type: CDS
Element length: 3729 nucleotides,
on sense strand of
Yali0F: 671238..674966.
Other names:
YALI-CDS0138.1
YALI-IPF1403
Coding sequence: 1243 codons.
Database cross references:
EMBL: CR382132
GeneID: 2909027
HOGENOM: Q6C2X3

Computed results  

None available yet

Protein YALI0F04356p  


some similarities with uniprot|P43610 Saccharomyces cerevisiae YFR038w

Protein domain map

Protein length: 1242 amino acids
Protein family: GL3M4588
Database cross references:
InterPro: IPR000330
InterPro: IPR001650
InterPro: IPR014001
InterPro: IPR014021
KEGG: yli:YALI0F04356g
PROSITE: PS51192
PROSITE: PS51194
Pfam: PF00176
Pfam: PF00271
RefSeq: XP_504989.1
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6C2X3
UniprotKB: Q6C2X3_YARLI

Computed results for YALI0F04356p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0005524 ATP binding
GO:0004386 helicase activity
GO:0003677 DNA binding

Sequence data  


Nucleotide sequence    

>YALI0F04356g.nt
CTGGACAACTTCAGAGGACCATTGTACTGGTGTGTGCTGCTTTTGTACTGTTGTCGTACG
GTCGATGGAATGACTTTTTGAGACTGGGCTGAAACTATTGCAGACGACCTTGCCGCTTAG
CGAGACCGATTATAGTCTCCGCAATGAGCATAGCCAAGGCCTTTCCGGCTCTACTGTCAA
AAACAATTTCTTCCTTGAGGATCAAGGCCAAGTTCATTCCTGAAAACTACTCAGTAGCAA
CTGTGCCCAGACAGTGGTGGAATTCTGGTGCCGATCTGAGTATTGGACTATGACTGACTG
GTGTCTGACTAGTTTAACTACTGCCGACGGACTTTGCAGCGACTAGATTGGTCTAGGTTT
GGTCAGGAGATCATCTGCTCCTCGACTCATCCCCATAATGAGATGACTAGACATACTGCC
CATAACCGACGTCTGGGTGGCTCGTGGAGTTCTTCAGCGACACAAGTGACTAGAGAACCA
GTTGATCGTAGTCACAGTCCACAGCTGTCTTACCAAGTCCCTGACGAGTCCCTGACAGAG
TTTACAGGCTGTTGCTTGTCCACCTGTTCCACCTTGATGGGTCATGAGGACCTTGACCTT
TATATCCGACCGATTCAGGACAGCAACTTCATGTACGGCATTCACATGTTGCTACGTTCG
ACCTTCGGCCTCACTTCAGTACAGCACTAACCCAGTGGACATTTCCTCGCTCTCCGAGTC
CGACCGGTTCAACCGAATCTCGCATCTCCTGGAACAGTCTAAAGTCTACTCTTCGGTCCT
CCAAGACCGGCTGCAGAAAGAGCGCGAAGCCCGGGAGCTCGCCCGACAAGAGGAGGAAAA
GGCACAAGAAGCGTCACGAGACCGCAAAGATCGCCGCGAGGCGCGTGAGCGCGAGAAGCA
AAAGAATCTCGAAAAGCTCGAGGAGAAAAAGAATGCCACCAGGGGCAAGACGCGAGCCAG
CCGGCGCAAGGCGGTTCTGTCCAAAACTGAGGTTCAGGAAATGGACAAGTCCAAGAAATC
CAAAAACTTCAAGAAAATCGGCCAGCCGCGAATCATCACCGGCGCGTCCATGTACGACTA
CCAGATCCATGGAATCGAGTGGATGGCGTCGTTGTACGAGAACGGGCTCAATGGCATTCT
GGCCGACGAAATGGGATTGGGCAAGACCCTTCAGACCATTGCATTCCTTTCTTTCTTGAT
AGAGAAGCAGGTTGGCGGGCCCTATCTGGTCGTGGTTCCTCTGAGCACGCTCAACAACTG
GGAAAATGAGTTTCGTAAGTTTGCACCGAGTATTCCCGTTGTCAAGTTTTATGGCGACAA
AAAGGAGCGGGCGGCACTTTGGAAGGGTGTGCGAGTGGACTATGAGATGAGGGGTCTGAA
AAAAAGGGGTGGGAAAGACGGCGAGTTTGTGGAGACCTTCCCGGTAGTCATCACCACTTA
CGAGACGGTTGTGATGGAGACCCGACGTCTGCAGATGATGACGTGGAAGTATCTCATTGT
GGACGAGGGCCATCGCATCAAGAACGTCAACAGTTTGCTTCTGAAGAAACTGAAGCTTCT
GGATACGTCGAATCGTCTGCTTTTGACCGGCACTCCTCTGCAGAACAATTTGACGGAGCT
TTGGTCTCTTCTCAACTTTCTGCTTCCCGACGTGTTTTCCGACCTCAGTATGTTCCAATC
GTGGTTTGACGAGAAAGAAAACGGGTCTGGCGACGGGTTTGGCGGCGAGAATCGATCGGC
GGAGTTGGTGGAGACTCTTCATTCGATTCTGAAGCCGTTTTTGCTGCGACGGCTCAAGTC
GGAAGTCTATTCGAATCTGCCCGACAAACGCGAGTACTTGATTTACATCCAGATGGCTCC
TCTTCAGGAGGCTCTGGAGCACCGCCTGCGACGTCATGCACGGACAATCAACACCAAGGC
CCAGAAAAAAGACCGGGTCACTCCCGAGATCGTTCCCGAGCCTCTTGGCAAGCGGGTCCG
AAAGCAAATCGTGCGGGAAGACGTGGTTGACACTTTTGACGCTGATTTGGGGTCCGACGA
TGATGGATACGACTCGAACGAGAGTGTCGATCGAGACCTGAAGCGCCAGAAGAAGGAGGA
CTATGAAGAAGAAGAGGATTATGAGGACGGAAGTGACGGAGAAGAGGAGGAGGAAGAGGA
GGACGATGAAGAGGGGGGTTCTGAGGTCGAAGATGAGGGCATCAGAAAAAAGAAGAAGAA
GCACGTGAATTCAACTACCTCGTCTTCGTTGCTTCCTCCGGCCGACAAAGCGGCGCCCCT
AACCCAACTCCAAAAAGACATTATCGAGGGGCGGGTCAAATGGGTTGATAACAAGTGGGT
CCCCAAGACGGAGGAGGAACTGGCCAAGATGGAGGCCGATAGAGCGCTGGCCGAGAATTT
GATGAACGAGCACACCATGAGCGAAACGGCCCAAATGGCTGATCACGTGCCCAAAGACAT
CCTGGACGTGATTAACGATACGAAAAACGCCTCTACTACCATCCCCTCCCTGCAACCCAT
GCACCCGGACGCGCCCAGGCACGCCATCAAGTCTGATCCTCACCAAATGCCGCCGAAAAA
CTACTACTCGGCTCCAGATGGCAACTGGTACCCGCTGTACCTTCATCCGTTGTTCCCAAA
AAGGAAACAGACGGTGACAGGGATGACCAATCACGTGATTGGCAGTGACGGGTTTGGGAA
ACCGCTCAAGGAGTCACATCAAAAGTCTCGTGAGCAAAAGTCTCGTGAGCAAAAGTCTCG
TGAGCAAAAGTCTCGTGAGCAAAAGTCTCGTGAGCAGTCGCCGTCGCAGAAGTTAGGCTC
TTGCGTCAGCACTGCCGCCAAAGTCGAGTCCCGTGACCTCACCTCTTCCTTCTCGGTGAA
AAAGGAGTTGGATCCTGTTGATCAGTACATGCTTCGGTTACAACTGCTCAAAGAGAAGCG
CGAGAAAGAGGAGGCTGAGGCGGCAGCTGAGCGGGCGAAAATGGATCCTGTGGATGCCTA
TGTCGACAAACTGAGGCAATTGCAGCGTTTGTCGCCTGAGGACGAGGAGGAGACGGATGT
GGAGACGAAGATGGAAGAGGAATTGAGACGGTTGAGAGAAGAGACTCTCAATTCACTAGA
TTCGGTGGAAGCAGAGACAGAGACCGAGTCCCGTGAAATGAGGGAGTTGAGGGCGGAGTT
GTTGGGTTTGAAGAAGCAAGAGGATGTCAAAAGTGAGTCTGAGACTGTCAAGAAGGGGTC
TTTCGGGGAGGACAAGGACTCGATGAAGAAGGAGCAAGTATCTATTTCGTCTTTTGTGAA
GAGTGAATCGGAGGGTGTTACCTCCAAAAATGATTCTGAGTCTTCTGGTGTTTCTGAGCC
TTCCGCGGTCACTGTCACATCTGGAATGGCTTTTGAGCCGATTTACAGTGCTTCCCAGTC
TTCTGAAGTGACTCATAATGCCCCTGAAACGGTTCAGTCGCCTTCTATGGTCTCTGACAC
GACGCTGAACCCTTCTACGCTCTCTGACACGGCTCTGAACCCTTCTATCGTCTCTGACAC
GGCTCCAAAACCTTCCACGAGCCCTTCTGTGACGCCAACAGCCCCTAAAACGGTTGTGAA
GCCTTCTACGACCTCTCAACTCTCTGCTGTCCCTCTCACGACGATCTCAGACTCGGACTC
TGACGAGTTTGCCTCTGCCAACGAGGACTTGCAAGAATCCAACGGAAAGGTTCCACCAGA
AGATCTGGACGAAGAAGTTAGTGAGAAACTCAAGGAGGAGGTCAAGAGCGAGCAGAAGGT
GGTGGATTCGGTTGATGAGAAAGCCGCCGAGGTGGTGCAGGAGCTGTCGACTTCGGTCAT
GCCTTCACGCGACCTCACTCCCGAGGAGCTGGCTCCGGATATTACGGAGTTCTTGTCTAC
GATTGCGTTTGACAAAATCTCCACTTCACAGACCCTGGCTAATCTCCGAAAAGTGGCCAA
TTCGCCATACCTCGTCAAGTTCCCCTGGGGCGAGGAAGAACCTGTTGACGAGCGAATCAT
TTCTGATTCAGGCAAGATGAGGGTGTTCGATCAGCTGGCGATGGAGCTGGTGAGCAGAAA
GCACAAAATGCTAGTGTTTTCGCAGTTCTCCGGCACCCTGGATCTGCTCACCGAATGGTG
CGAGTTCCGTCACTTGCCTTACTGCATGTTGATCGGATCCATGGGTTTGGAGGAGCGTCA
GGAGATGATTGATGCGTTCAACGAGGAAAGTGGCCCGTCCATTTTCTTGATCACTACACG
CGCAGGCGGCACCGGCATCAATTTGACCGCTGCGGACTCGGTGGTCATTTTTGACAGTGA
TTGGAACCCCCAACAAGACAAGCAGGCGATCGATCGGAGCCATCGAATTGGCCAGAAGAA
GCCTTGTGTCATTTACCGGCTCATCTCAACCAACACCATGGAAGAAATGCTGGTTAGAGT
GGCTAGTGATAAAAAGAGACTGGATGAGATGGTCATTCAGGCCGGAGATTACTCCGGCTT
CTCCAAGGACGCCAACAAAACCGTGGATATCGACAAGACGTTTCTGCGAGAGATTATGAC
CGGTAAGAGTGATTATGAAGCGCATGAGGGCATCGAGAAGATTGATGACGAGCTGATGGA
AACCCTGTTGGATCGAAGTGACGAGAGTTACAAGAGACACAGAGATAAGACGGTGGAGCT
ACCTAAGAGTATCGAGGTGATTATTGGCTCGCGAGATGAGCATACGTGATGAGCACACAT
ACGCTCACGTGCTGAGCGCAGCGAGTCAGGTGTTTTTACCCTTGGTACCCGTTTCCTGTT
CTGAAGCGGTTCGTGCTGCGTCCATGGATGTATTTTAAATGTATTATTAGGGTGTAATGT
GGTGATGTGATATTATGATGTCCACGTGGTATTTTTTTCGTCGTATTTTTCGAGGTATCT
TCACGTGACATATTCACGTGACTCCGCTAGCCGTTTTTGTTGGCTGACTATTTTGTTGAA
ATTTCTGCAGAGACACGTTGTGTAGAAAAAATGCCCTCCTCTCTACAAG

Coding sequence    

>YALI0F04356g.cds
ATGGACAAGTCCAAGAAATCCAAAAACTTCAAGAAAATCGGCCAGCCGCGAATCATCACC
GGCGCGTCCATGTACGACTACCAGATCCATGGAATCGAGTGGATGGCGTCGTTGTACGAG
AACGGGCTCAATGGCATTCTGGCCGACGAAATGGGATTGGGCAAGACCCTTCAGACCATT
GCATTCCTTTCTTTCTTGATAGAGAAGCAGGTTGGCGGGCCCTATCTGGTCGTGGTTCCT
CTGAGCACGCTCAACAACTGGGAAAATGAGTTTCGTAAGTTTGCACCGAGTATTCCCGTT
GTCAAGTTTTATGGCGACAAAAAGGAGCGGGCGGCACTTTGGAAGGGTGTGCGAGTGGAC
TATGAGATGAGGGGTCTGAAAAAAAGGGGTGGGAAAGACGGCGAGTTTGTGGAGACCTTC
CCGGTAGTCATCACCACTTACGAGACGGTTGTGATGGAGACCCGACGTCTGCAGATGATG
ACGTGGAAGTATCTCATTGTGGACGAGGGCCATCGCATCAAGAACGTCAACAGTTTGCTT
CTGAAGAAACTGAAGCTTCTGGATACGTCGAATCGTCTGCTTTTGACCGGCACTCCTCTG
CAGAACAATTTGACGGAGCTTTGGTCTCTTCTCAACTTTCTGCTTCCCGACGTGTTTTCC
GACCTCAGTATGTTCCAATCGTGGTTTGACGAGAAAGAAAACGGGTCTGGCGACGGGTTT
GGCGGCGAGAATCGATCGGCGGAGTTGGTGGAGACTCTTCATTCGATTCTGAAGCCGTTT
TTGCTGCGACGGCTCAAGTCGGAAGTCTATTCGAATCTGCCCGACAAACGCGAGTACTTG
ATTTACATCCAGATGGCTCCTCTTCAGGAGGCTCTGGAGCACCGCCTGCGACGTCATGCA
CGGACAATCAACACCAAGGCCCAGAAAAAAGACCGGGTCACTCCCGAGATCGTTCCCGAG
CCTCTTGGCAAGCGGGTCCGAAAGCAAATCGTGCGGGAAGACGTGGTTGACACTTTTGAC
GCTGATTTGGGGTCCGACGATGATGGATACGACTCGAACGAGAGTGTCGATCGAGACCTG
AAGCGCCAGAAGAAGGAGGACTATGAAGAAGAAGAGGATTATGAGGACGGAAGTGACGGA
GAAGAGGAGGAGGAAGAGGAGGACGATGAAGAGGGGGGTTCTGAGGTCGAAGATGAGGGC
ATCAGAAAAAAGAAGAAGAAGCACGTGAATTCAACTACCTCGTCTTCGTTGCTTCCTCCG
GCCGACAAAGCGGCGCCCCTAACCCAACTCCAAAAAGACATTATCGAGGGGCGGGTCAAA
TGGGTTGATAACAAGTGGGTCCCCAAGACGGAGGAGGAACTGGCCAAGATGGAGGCCGAT
AGAGCGCTGGCCGAGAATTTGATGAACGAGCACACCATGAGCGAAACGGCCCAAATGGCT
GATCACGTGCCCAAAGACATCCTGGACGTGATTAACGATACGAAAAACGCCTCTACTACC
ATCCCCTCCCTGCAACCCATGCACCCGGACGCGCCCAGGCACGCCATCAAGTCTGATCCT
CACCAAATGCCGCCGAAAAACTACTACTCGGCTCCAGATGGCAACTGGTACCCGCTGTAC
CTTCATCCGTTGTTCCCAAAAAGGAAACAGACGGTGACAGGGATGACCAATCACGTGATT
GGCAGTGACGGGTTTGGGAAACCGCTCAAGGAGTCACATCAAAAGTCTCGTGAGCAAAAG
TCTCGTGAGCAAAAGTCTCGTGAGCAAAAGTCTCGTGAGCAAAAGTCTCGTGAGCAGTCG
CCGTCGCAGAAGTTAGGCTCTTGCGTCAGCACTGCCGCCAAAGTCGAGTCCCGTGACCTC
ACCTCTTCCTTCTCGGTGAAAAAGGAGTTGGATCCTGTTGATCAGTACATGCTTCGGTTA
CAACTGCTCAAAGAGAAGCGCGAGAAAGAGGAGGCTGAGGCGGCAGCTGAGCGGGCGAAA
ATGGATCCTGTGGATGCCTATGTCGACAAACTGAGGCAATTGCAGCGTTTGTCGCCTGAG
GACGAGGAGGAGACGGATGTGGAGACGAAGATGGAAGAGGAATTGAGACGGTTGAGAGAA
GAGACTCTCAATTCACTAGATTCGGTGGAAGCAGAGACAGAGACCGAGTCCCGTGAAATG
AGGGAGTTGAGGGCGGAGTTGTTGGGTTTGAAGAAGCAAGAGGATGTCAAAAGTGAGTCT
GAGACTGTCAAGAAGGGGTCTTTCGGGGAGGACAAGGACTCGATGAAGAAGGAGCAAGTA
TCTATTTCGTCTTTTGTGAAGAGTGAATCGGAGGGTGTTACCTCCAAAAATGATTCTGAG
TCTTCTGGTGTTTCTGAGCCTTCCGCGGTCACTGTCACATCTGGAATGGCTTTTGAGCCG
ATTTACAGTGCTTCCCAGTCTTCTGAAGTGACTCATAATGCCCCTGAAACGGTTCAGTCG
CCTTCTATGGTCTCTGACACGACGCTGAACCCTTCTACGCTCTCTGACACGGCTCTGAAC
CCTTCTATCGTCTCTGACACGGCTCCAAAACCTTCCACGAGCCCTTCTGTGACGCCAACA
GCCCCTAAAACGGTTGTGAAGCCTTCTACGACCTCTCAACTCTCTGCTGTCCCTCTCACG
ACGATCTCAGACTCGGACTCTGACGAGTTTGCCTCTGCCAACGAGGACTTGCAAGAATCC
AACGGAAAGGTTCCACCAGAAGATCTGGACGAAGAAGTTAGTGAGAAACTCAAGGAGGAG
GTCAAGAGCGAGCAGAAGGTGGTGGATTCGGTTGATGAGAAAGCCGCCGAGGTGGTGCAG
GAGCTGTCGACTTCGGTCATGCCTTCACGCGACCTCACTCCCGAGGAGCTGGCTCCGGAT
ATTACGGAGTTCTTGTCTACGATTGCGTTTGACAAAATCTCCACTTCACAGACCCTGGCT
AATCTCCGAAAAGTGGCCAATTCGCCATACCTCGTCAAGTTCCCCTGGGGCGAGGAAGAA
CCTGTTGACGAGCGAATCATTTCTGATTCAGGCAAGATGAGGGTGTTCGATCAGCTGGCG
ATGGAGCTGGTGAGCAGAAAGCACAAAATGCTAGTGTTTTCGCAGTTCTCCGGCACCCTG
GATCTGCTCACCGAATGGTGCGAGTTCCGTCACTTGCCTTACTGCATGTTGATCGGATCC
ATGGGTTTGGAGGAGCGTCAGGAGATGATTGATGCGTTCAACGAGGAAAGTGGCCCGTCC
ATTTTCTTGATCACTACACGCGCAGGCGGCACCGGCATCAATTTGACCGCTGCGGACTCG
GTGGTCATTTTTGACAGTGATTGGAACCCCCAACAAGACAAGCAGGCGATCGATCGGAGC
CATCGAATTGGCCAGAAGAAGCCTTGTGTCATTTACCGGCTCATCTCAACCAACACCATG
GAAGAAATGCTGGTTAGAGTGGCTAGTGATAAAAAGAGACTGGATGAGATGGTCATTCAG
GCCGGAGATTACTCCGGCTTCTCCAAGGACGCCAACAAAACCGTGGATATCGACAAGACG
TTTCTGCGAGAGATTATGACCGGTAAGAGTGATTATGAAGCGCATGAGGGCATCGAGAAG
ATTGATGACGAGCTGATGGAAACCCTGTTGGATCGAAGTGACGAGAGTTACAAGAGACAC
AGAGATAAGACGGTGGAGCTACCTAAGAGTATCGAGGTGATTATTGGCTCGCGAGATGAG
CATACGTGA

Predicted translation product    

>YALI0F04356g.aa
MDKSKKSKNFKKIGQPRIITGASMYDYQIHGIEWMASLYENGLNGILADEMGLGKTLQTI
AFLSFLIEKQVGGPYLVVVPLSTLNNWENEFRKFAPSIPVVKFYGDKKERAALWKGVRVD
YEMRGLKKRGGKDGEFVETFPVVITTYETVVMETRRLQMMTWKYLIVDEGHRIKNVNSLL
LKKLKLLDTSNRLLLTGTPLQNNLTELWSLLNFLLPDVFSDLSMFQSWFDEKENGSGDGF
GGENRSAELVETLHSILKPFLLRRLKSEVYSNLPDKREYLIYIQMAPLQEALEHRLRRHA
RTINTKAQKKDRVTPEIVPEPLGKRVRKQIVREDVVDTFDADLGSDDDGYDSNESVDRDL
KRQKKEDYEEEEDYEDGSDGEEEEEEEDDEEGGSEVEDEGIRKKKKKHVNSTTSSSLLPP
ADKAAPLTQLQKDIIEGRVKWVDNKWVPKTEEELAKMEADRALAENLMNEHTMSETAQMA
DHVPKDILDVINDTKNASTTIPSLQPMHPDAPRHAIKSDPHQMPPKNYYSAPDGNWYPLY
LHPLFPKRKQTVTGMTNHVIGSDGFGKPLKESHQKSREQKSREQKSREQKSREQKSREQS
PSQKLGSCVSTAAKVESRDLTSSFSVKKELDPVDQYMLRLQLLKEKREKEEAEAAAERAK
MDPVDAYVDKLRQLQRLSPEDEEETDVETKMEEELRRLREETLNSLDSVEAETETESREM
RELRAELLGLKKQEDVKSESETVKKGSFGEDKDSMKKEQVSISSFVKSESEGVTSKNDSE
SSGVSEPSAVTVTSGMAFEPIYSASQSSEVTHNAPETVQSPSMVSDTTLNPSTLSDTALN
PSIVSDTAPKPSTSPSVTPTAPKTVVKPSTTSQLSAVPLTTISDSDSDEFASANEDLQES
NGKVPPEDLDEEVSEKLKEEVKSEQKVVDSVDEKAAEVVQELSTSVMPSRDLTPEELAPD
ITEFLSTIAFDKISTSQTLANLRKVANSPYLVKFPWGEEEPVDERIISDSGKMRVFDQLA
MELVSRKHKMLVFSQFSGTLDLLTEWCEFRHLPYCMLIGSMGLEERQEMIDAFNEESGPS
IFLITTRAGGTGINLTAADSVVIFDSDWNPQQDKQAIDRSHRIGQKKPCVIYRLISTNTM
EEMLVRVASDKKRLDEMVIQAGDYSGFSKDANKTVDIDKTFLREIMTGKSDYEAHEGIEK
IDDELMETLLDRSDESYKRHRDKTVELPKSIEVIIGSRDEHT*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites