YALI0C01243g


similar to uniprot|P32597 Saccharomyces cerevisiae YIL126W STH1 ATPase component of the RSC chromatin remodeling complex

Genomic environment map

Element type: CDS
Element length: 4006 nucleotides, on anti-sense strand of Yali0C: complement(join(178398..178523,174518..178099)).
Other names:
YALI-CDS0263.1
YALI-IPF7766
Coding sequence: 1236 codons.
Database cross references:
EMBL: CR382129
GeneID: 2909161
HOGENOM: Q6CDE1
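
The element length, the location string and the codon count above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the location uses inclusive 1-based coordinates in the usual feature-table notation; the helper name parse_location is hypothetical and not part of any database API.

```python
import re

def parse_location(location):
    """Return (is_complement, [(start, end), ...]) from a location string.

    Hypothetical helper: handles only simple complement(join(a..b,c..d)) forms.
    """
    is_complement = location.startswith("complement(")
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments

location = "complement(join(178398..178523,174518..178099))"
is_complement, exons = parse_location(location)

# Coordinates are inclusive, so each segment spans end - start + 1 nucleotides.
exon_lengths = [end - start + 1 for start, end in exons]
cds_length = sum(exon_lengths)

# The element runs from the lowest to the highest coordinate, introns included.
element_length = max(e for _, e in exons) - min(s for s, _ in exons) + 1
intron_length = element_length - cds_length  # a single intron in this record

print("exon lengths:", exon_lengths)
print("coding nucleotides:", cds_length, "codons:", cds_length // 3)
print("element length:", element_length, "intron length:", intron_length)
```

With the values of this record, the two exons (126 and 3582 nucleotides) add up to 3708 coding nucleotides, i.e. 1236 codons, and the 4006-nucleotide element includes one 298-nucleotide intron.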

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0284 GL3C0284.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0C01243p  


similar to uniprot|P32597 Saccharomyces cerevisiae YIL126W Nuclear protein STH1/NPS1

Protein domain map

Protein length: 1235 amino acids
Protein family: GL3C0284
Database cross references:
Gene3D: G3DSA:1.20.920.10
InterPro: IPR000330
InterPro: IPR001487
InterPro: IPR001650
InterPro: IPR003016
InterPro: IPR006562
InterPro: IPR014001
InterPro: IPR014012
InterPro: IPR014021
KEGG: yli:YALI0C01243g
PRINTS: PR00503
PROSITE: PS00189
PROSITE: PS00633
PROSITE: PS50014
PROSITE: PS51192
PROSITE: PS51194
PROSITE: PS51204
Pfam: PF00176
Pfam: PF00271
Pfam: PF00439
Pfam: PF07529
RefSeq: XP_501321.1
SMART: SM00297
SMART: SM00487
SMART: SM00490
UniProtKB/TrEMBL: Q6CDE1
UniprotKB: Q6CDE1_YARLI

Computed results for YALI0C01243p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0005524 ATP binding
GO:0004386 helicase activity
GO:0003677 DNA binding

Sequence data  


Nucleotide sequence    

>YALI0C01243g.nt
TTGATAGTGTGTCCCTGGTTAGCTCTGATCCAGTATCCCGTGCAATCGTTGGGGTCTGTG
GGTTCTTTGGGCTGTTTTTCTGTGGGCTCAAAGATGAGCTTGTATCTCTGTTTGTCGTTA
TCTCGCACAACCCGCTTGGTGTCTTCCAGGTTCCATTTCCCCTTGATTCCAGCTCTATTG
AGCAGCACACCTACAGACAAGTATCCGTCTTCTCGAATGGGCAGCTTTTCCTTGGTGGCG
CCGTGTCGCAAGATGTACGACAGTTTGCGAGAGTACTCCTCGTCAGGCGTCATTTCTCGT
CGGGGTCGAGGCATTGTTGTTGGAGTGAACTGTGTGAAGAAGATTCAATTTAGATGCAGA
TAAAGATCCTGGTTATGCATGCAAATGTGACAGTGGGATACATGACAGAGCAGCATAGGT
ATGGGCGATATAGGACGGGGATGCGTCACGTGACTGATTGAGAAGTTGGCTTAGACCCCA
TTGAGGCCCCCGGGCACCTGTGTAGTGTCAAACGAGAAAGTATGAACGTTCCTTAACGAG
GTGCTGTATGTTCCACTTCAGCTGAAAGATGAAGGCCCTTCAGTAATTAGTCTACAAGAG
CACTGTTTTAACGTACATACTGTACTGGACCTACTCTGTCCTCCCCCCAAGTAAGTCAAA
GCAGTGCAACAGGACATTTTAACACCATGCGGACCGTCTGAGTAAATATTTCGGCCGTGT
CTGCCAGCCACAACCCGCTATTTCGACATCTTCCTTCTGACTGGTGCACCTTCATTCCAG
CCAAACTTTCCAAGCATACTCACCCAAAGCGGAGTCGAACCGGCAACATCCATACCGCGC
CGCGTAAAACGTGGTCCGAAACCGTGGGTGGGAGCAGATTAGACTTTGGTCGACCCTCAC
ACACTTTCACCCACTGCTGTCGAGCGAAACTATCAAGCGTGTAGAACCGTGCGGTATTCG
ACGAAACCGAATCATCAACGGCACACACAGCACTGGCAAGATGGATCTCGAAAACGGGCA
TTCGCCAGCACCCAATGGCGGTGCTCTCAACGGCTCCGCCAAACAGCCCCACTATTATTC
CATCCCCACCCCCGAGAGCGTGGAGCAGGCCCAGTTGTTGTTTTTGgtgagtatcagagg
aagaggaagacaagaggaggggagacagacccaacacgacggacggaacaccatcacgtg
acaccgaggacgtgacgacaccacgtgacattgtcagagtgtgattgtgttatggacacg
acagccgacgttggacgggtcgacacgacattatcgatgtcatgggtcatgggtcattgg
tatcgtatcgtatcgttgcacataacgtgacgacacacgtgtctcaacacagcacagtgt
gcaagtctattgcttgaccgccatctattctcattctaacacagCGATTCAATGCTCTTA
ATCAATCCGACCACCCGGTGACACCCGAACTCGAGGGCATCGTCACCCGGCTGCACACCA
TGGGCCAGGAGCAGCAGCTGTACAACAAGCGTCTGGCAGAGGTCAACAGCGCCCGAAAAG
CCGCCCACAACAAGATCGACCTCGACCTGCTCCGAACCCAGGTCAACGCGTTCAGATACC
TGAGCCGAGGTCTCGACATGCCCACCGACGTGCAGGAGCGCCTAATGGAGACGGTAGGCA
TCGAAAGTGACATCACCGAAGAGGAGGCTGACGCAGCACTTGAAAAGTCACTCACCCCTT
ACCAGCGACGTCTCAAGGACAGCGGAGTGCCCCCCCGAGGCGGGTTTGTGGACAACGGGC
GGGGAGTCGGTGGACGACACGAGTTCATCATCACGAACCGAATCTCGCAGCGGATTCTGG
AGCTCGAGTCTCTGCCATCAAACCTCGGAGTGTTGGATGCTTCCCGGGATGATCTCGGCA
AAGCCGAAGGCACTGATGCGCTCAAAGTACGAGCTGTGGCCGAGCTCAAGGCCCTGCGGC
TGCTAACCAAGCAAAAAAGTCTGCGGCAACATCTAGTCTTCTGCAAGGCCCAGACGTCCC
AGGTGACCGACTCGGTCATCAACCGGGCTCTCAACCGACGAGCCAAGGTGCAGACCGCGC
ACGAGATGCGACTGACGGAGCAGCTAGAGCGACAGCAGCGCATGGAGCGTGAACGCAAGC
AGCGGGAGGAAATCATGGCGTACGTCAACTCGGTGTGCCACCAGTCGGACCGAATCAGAG
AAGAGGCCCATCGTCGACGAACCAAGCAGTACGCCGTGGCCAAGGGCATCCAGCAGTTCC
ACTCGTACGTTGAGCGAGAAGAGAGTCGACGAGTCGAGCGAACTGCCAAGCAGCGTCTGC
AAGCTCTCAAGTCCAACGACGAGGAGGCCTACCTCAAGCTCCTGGACCAGACCAAGGATA
CACGTATCACCCACTTGCTGCGACAGACCAATTCGTTCTTGGACTCTCTGTCATCTGCCG
TTCGTGCTCAGCAAGGAGAAGCCGGAACTCAAATGCCCATTCCTGCTGCGGAGGAAGAAG
GAGGAGAGGGAGAAGAGGACCGCGAGAAGATCGACTACTACCATATTGCTCATCGAGTCA
AGGAGACAGTCTCCAAGCAGCCTTCTATCTTGGTTGGGGGTCAGCTCAAGGAGTACCAGC
TCAAGGGTCTGCAGTGGATGGTGTCTCTGTATAACAACTCTCTGAATGGTATCTTGGCCG
ACGAAATGGGCCTTGGAAAAACTATCCAGTCCATTTCCCTCATTACCTACCTTATTGAAG
TCAAGAGACAGACCCGGCCTTACCTGGTCATTGTCCCTCTCTCGACACTCACTAACTGGA
CCAATGAGTTTGAAAAGTGGGCTCCCAGCGTTAAGAAGATTGTGTTCAAGGGTTCGCCCA
ACCAGCGTAAGGAGTTGTCCAACCAGGTACGAGCAGGAGACTTCCAGGTGCTGCTCACCA
CCTACGAGTACATCATCAAGGACAAGGCGCTTCTGGGCCGAATCCGGTGGGTGCATATGA
TCATTGACGAAGGTCATCGAATGAAAAACACCCAATCCAAGCTTGCCCAGACGCTCACGC
AGTTCTACTACTCTCGATACCGACTCATTCTTACAGGTACCCCTCTGCAGAACAACCTGC
CCGAACTGTGGGCGCTGCTCAACTTTGTGCTACCCAAGATTTTCAACTCGGTTAAGACGT
TCGACGAGTGGTTCAACACGCCGTTTGCTAACAGCGGATCACAGGACAAGATGGAGCTCA
CGGAAGAAGAGACGCTGCTGGTCATCCGACGTCTTCATAAGGTGCTGCGTCCCTTCCTGC
TGCGACGTCTTAAGAAGGACGTTGAGAAGGATCTCCCCGACAAGGTGGAGACCGTCATCA
AGTGCAAGATGAGCGCTCTTCAGCTCAAAATGTACCAGCAGATGCTCAAGTACAACGCTT
TGTACGTTGGTGACGATTCTGGCGCGGCTGGTGTCAACAAGAGCGGAGTCAAGGGTCTCA
ACAACAAGATTATGCAACTGCGTAAGATTTGTAACCATCCTTATGTATACGAGGAAGTCG
AGACTCTTCTGAACCCATCACATGGCAACAACGACCTACTGTGGCGTTCTGCAGGTAAGT
TTGAGCTTTTGGACCGAATTCTGCCCAAGTTCAAGGCTAGGGACCATAGGGTGCTCATGT
TCTTCCAAATGACCCAGATTATGGACATTATGGAGGACTATTTGCGGCTCCGTGGCTTGC
AGTACCTGCGACTAGATGGTAACACCAAGGCCGACGATCGATCCGAGATGCTGAAGCTCT
TCAACGCTCCCGATTCGCCCTATTTCTGTTTTCTGCTGTCCACTCGAGCTGGTGGTCTAG
GTCTGAACCTGCAGACAGCCGACACAGTTATCATCTACGACACTGATTGGAACCCTCATC
AGGATCTGCAGGCACAGGATCGAGCACATCGAATTGGTCAAACCAAGGAGGTGCGAATTC
TGCGTCTGATCACAGAGGATTCCGTTGAGGAGGTTATTCTCCAGCGAGCTCACGCAAAGC
TTGAAATCGACGGAAAGGTCATTCAGGCCGGAAAGTTCGATAACAAGTCGACGGCCGAGG
AGCAGGAGGCGTTCCTGCGAGGGCTGCTGGAGCAGGAGGAGTCTAAGCGTGGCTCTCGAG
AGGCAGAGGACGAGGACCTTGACGATGAGGAGCTCAACGACCTGCTGGCTCGAAACGAGG
AGGAGCGGGCTTTCTACGCCCAGATGGACGCCGAGCGGAACGCCACAAGCGATTATGGCA
AGGGCGCAGGACGACCCGATCGGCTGCTTTCCGAGTCCGAGTTGCCGGACCAGTTCACTC
AGGATGTGTCTGAGCATTTCCGAGAGGACGACATGGCCGACTCGGACAAGTACGGCCGAG
GAGCTCGAGAGCGAAAGGAAGTGTACTACGACGACGGCCTCACAGAGGAGCAGTGGCTCA
ACATTGTGGACAATGACGAGGATCTGGACGCTGCCATCAAGAAGCGACGCCGAAGCCGAC
GGTCCCGAGCCCCCACTGATCTGGATGACGGTCTGGACGCCGATGATATCATCGTTGACG
ATGCCGATGGAGACGACCAGTTTGTCGACGAAGAGGCGTCTCCTCCTCCCAAGAAGAAGA
ACAAGAACAAGATCCCCAAGGTCCGGGAAACCTTGTCTGTGGCTGACCGGGAGGAGCTCA
ACGAGTCATGCCAGCTGGCTCTCAACGAGTGTTTTGCGCTGACGGAGCCCTGGGATCCTG
AAGATGAGTCTGCGGGCCTGCGGCATGTGTGTGACCTGTTCCGGTCCATTCCCTCCAAAA
AGCTGTATCCCGACTACTTCAAGCTCATCAAGAATCCGATTGCGCTTGACGTGATCCAGA
AGAAGGTGAAGCAGAATGTGTACCAGAACCGGTACGATTTTGTGGCTGATTTCGAGCTCA
TGAACCAGAATGCTCGAACCTACAACGAGGAGAACTCTCCGGTTGTTACGGATGCCGACC
GTCTTTTGGACATTGCCCGGGGTTAGAAGATATATAATGGGCAAAAAGGTCCCACTAAGC
TGTATAATATGATGTGTTTGTTTATTACACAGCTGGAGCGAAGCTGTCTATCCGTACGTC
TACTTGTAGTAATATCGCCTTTACCTTCTACTCAACGAAACCAAGCACTCGCAGTGTATC
TATCGAAATTCGTCTCATTGTATGTTCTTGCCAAGGGCGCTCAGAAAAAGCGCCTCACAG
CTAGTCTAGACAATTAGAGATTAGAACACACCCTTTTTTGCAAGTTTTACTACAGTACTT
GTTTTGTCATGTGACATAATCTATGG

Coding sequence    

>YALI0C01243g.cds
ATGGATCTCGAAAACGGGCATTCGCCAGCACCCAATGGCGGTGCTCTCAACGGCTCCGCC
AAACAGCCCCACTATTATTCCATCCCCACCCCCGAGAGCGTGGAGCAGGCCCAGTTGTTG
TTTTTGCGATTCAATGCTCTTAATCAATCCGACCACCCGGTGACACCCGAACTCGAGGGC
ATCGTCACCCGGCTGCACACCATGGGCCAGGAGCAGCAGCTGTACAACAAGCGTCTGGCA
GAGGTCAACAGCGCCCGAAAAGCCGCCCACAACAAGATCGACCTCGACCTGCTCCGAACC
CAGGTCAACGCGTTCAGATACCTGAGCCGAGGTCTCGACATGCCCACCGACGTGCAGGAG
CGCCTAATGGAGACGGTAGGCATCGAAAGTGACATCACCGAAGAGGAGGCTGACGCAGCA
CTTGAAAAGTCACTCACCCCTTACCAGCGACGTCTCAAGGACAGCGGAGTGCCCCCCCGA
GGCGGGTTTGTGGACAACGGGCGGGGAGTCGGTGGACGACACGAGTTCATCATCACGAAC
CGAATCTCGCAGCGGATTCTGGAGCTCGAGTCTCTGCCATCAAACCTCGGAGTGTTGGAT
GCTTCCCGGGATGATCTCGGCAAAGCCGAAGGCACTGATGCGCTCAAAGTACGAGCTGTG
GCCGAGCTCAAGGCCCTGCGGCTGCTAACCAAGCAAAAAAGTCTGCGGCAACATCTAGTC
TTCTGCAAGGCCCAGACGTCCCAGGTGACCGACTCGGTCATCAACCGGGCTCTCAACCGA
CGAGCCAAGGTGCAGACCGCGCACGAGATGCGACTGACGGAGCAGCTAGAGCGACAGCAG
CGCATGGAGCGTGAACGCAAGCAGCGGGAGGAAATCATGGCGTACGTCAACTCGGTGTGC
CACCAGTCGGACCGAATCAGAGAAGAGGCCCATCGTCGACGAACCAAGCAGTACGCCGTG
GCCAAGGGCATCCAGCAGTTCCACTCGTACGTTGAGCGAGAAGAGAGTCGACGAGTCGAG
CGAACTGCCAAGCAGCGTCTGCAAGCTCTCAAGTCCAACGACGAGGAGGCCTACCTCAAG
CTCCTGGACCAGACCAAGGATACACGTATCACCCACTTGCTGCGACAGACCAATTCGTTC
TTGGACTCTCTGTCATCTGCCGTTCGTGCTCAGCAAGGAGAAGCCGGAACTCAAATGCCC
ATTCCTGCTGCGGAGGAAGAAGGAGGAGAGGGAGAAGAGGACCGCGAGAAGATCGACTAC
TACCATATTGCTCATCGAGTCAAGGAGACAGTCTCCAAGCAGCCTTCTATCTTGGTTGGG
GGTCAGCTCAAGGAGTACCAGCTCAAGGGTCTGCAGTGGATGGTGTCTCTGTATAACAAC
TCTCTGAATGGTATCTTGGCCGACGAAATGGGCCTTGGAAAAACTATCCAGTCCATTTCC
CTCATTACCTACCTTATTGAAGTCAAGAGACAGACCCGGCCTTACCTGGTCATTGTCCCT
CTCTCGACACTCACTAACTGGACCAATGAGTTTGAAAAGTGGGCTCCCAGCGTTAAGAAG
ATTGTGTTCAAGGGTTCGCCCAACCAGCGTAAGGAGTTGTCCAACCAGGTACGAGCAGGA
GACTTCCAGGTGCTGCTCACCACCTACGAGTACATCATCAAGGACAAGGCGCTTCTGGGC
CGAATCCGGTGGGTGCATATGATCATTGACGAAGGTCATCGAATGAAAAACACCCAATCC
AAGCTTGCCCAGACGCTCACGCAGTTCTACTACTCTCGATACCGACTCATTCTTACAGGT
ACCCCTCTGCAGAACAACCTGCCCGAACTGTGGGCGCTGCTCAACTTTGTGCTACCCAAG
ATTTTCAACTCGGTTAAGACGTTCGACGAGTGGTTCAACACGCCGTTTGCTAACAGCGGA
TCACAGGACAAGATGGAGCTCACGGAAGAAGAGACGCTGCTGGTCATCCGACGTCTTCAT
AAGGTGCTGCGTCCCTTCCTGCTGCGACGTCTTAAGAAGGACGTTGAGAAGGATCTCCCC
GACAAGGTGGAGACCGTCATCAAGTGCAAGATGAGCGCTCTTCAGCTCAAAATGTACCAG
CAGATGCTCAAGTACAACGCTTTGTACGTTGGTGACGATTCTGGCGCGGCTGGTGTCAAC
AAGAGCGGAGTCAAGGGTCTCAACAACAAGATTATGCAACTGCGTAAGATTTGTAACCAT
CCTTATGTATACGAGGAAGTCGAGACTCTTCTGAACCCATCACATGGCAACAACGACCTA
CTGTGGCGTTCTGCAGGTAAGTTTGAGCTTTTGGACCGAATTCTGCCCAAGTTCAAGGCT
AGGGACCATAGGGTGCTCATGTTCTTCCAAATGACCCAGATTATGGACATTATGGAGGAC
TATTTGCGGCTCCGTGGCTTGCAGTACCTGCGACTAGATGGTAACACCAAGGCCGACGAT
CGATCCGAGATGCTGAAGCTCTTCAACGCTCCCGATTCGCCCTATTTCTGTTTTCTGCTG
TCCACTCGAGCTGGTGGTCTAGGTCTGAACCTGCAGACAGCCGACACAGTTATCATCTAC
GACACTGATTGGAACCCTCATCAGGATCTGCAGGCACAGGATCGAGCACATCGAATTGGT
CAAACCAAGGAGGTGCGAATTCTGCGTCTGATCACAGAGGATTCCGTTGAGGAGGTTATT
CTCCAGCGAGCTCACGCAAAGCTTGAAATCGACGGAAAGGTCATTCAGGCCGGAAAGTTC
GATAACAAGTCGACGGCCGAGGAGCAGGAGGCGTTCCTGCGAGGGCTGCTGGAGCAGGAG
GAGTCTAAGCGTGGCTCTCGAGAGGCAGAGGACGAGGACCTTGACGATGAGGAGCTCAAC
GACCTGCTGGCTCGAAACGAGGAGGAGCGGGCTTTCTACGCCCAGATGGACGCCGAGCGG
AACGCCACAAGCGATTATGGCAAGGGCGCAGGACGACCCGATCGGCTGCTTTCCGAGTCC
GAGTTGCCGGACCAGTTCACTCAGGATGTGTCTGAGCATTTCCGAGAGGACGACATGGCC
GACTCGGACAAGTACGGCCGAGGAGCTCGAGAGCGAAAGGAAGTGTACTACGACGACGGC
CTCACAGAGGAGCAGTGGCTCAACATTGTGGACAATGACGAGGATCTGGACGCTGCCATC
AAGAAGCGACGCCGAAGCCGACGGTCCCGAGCCCCCACTGATCTGGATGACGGTCTGGAC
GCCGATGATATCATCGTTGACGATGCCGATGGAGACGACCAGTTTGTCGACGAAGAGGCG
TCTCCTCCTCCCAAGAAGAAGAACAAGAACAAGATCCCCAAGGTCCGGGAAACCTTGTCT
GTGGCTGACCGGGAGGAGCTCAACGAGTCATGCCAGCTGGCTCTCAACGAGTGTTTTGCG
CTGACGGAGCCCTGGGATCCTGAAGATGAGTCTGCGGGCCTGCGGCATGTGTGTGACCTG
TTCCGGTCCATTCCCTCCAAAAAGCTGTATCCCGACTACTTCAAGCTCATCAAGAATCCG
ATTGCGCTTGACGTGATCCAGAAGAAGGTGAAGCAGAATGTGTACCAGAACCGGTACGAT
TTTGTGGCTGATTTCGAGCTCATGAACCAGAATGCTCGAACCTACAACGAGGAGAACTCT
CCGGTTGTTACGGATGCCGACCGTCTTTTGGACATTGCCCGGGGTTAG

Predicted translation product    

>YALI0C01243g.aa
MDLENGHSPAPNGGALNGSAKQPHYYSIPTPESVEQAQLLFLRFNALNQSDHPVTPELEG
IVTRLHTMGQEQQLYNKRLAEVNSARKAAHNKIDLDLLRTQVNAFRYLSRGLDMPTDVQE
RLMETVGIESDITEEEADAALEKSLTPYQRRLKDSGVPPRGGFVDNGRGVGGRHEFIITN
RISQRILELESLPSNLGVLDASRDDLGKAEGTDALKVRAVAELKALRLLTKQKSLRQHLV
FCKAQTSQVTDSVINRALNRRAKVQTAHEMRLTEQLERQQRMERERKQREEIMAYVNSVC
HQSDRIREEAHRRRTKQYAVAKGIQQFHSYVEREESRRVERTAKQRLQALKSNDEEAYLK
LLDQTKDTRITHLLRQTNSFLDSLSSAVRAQQGEAGTQMPIPAAEEEGGEGEEDREKIDY
YHIAHRVKETVSKQPSILVGGQLKEYQLKGLQWMVSLYNNSLNGILADEMGLGKTIQSIS
LITYLIEVKRQTRPYLVIVPLSTLTNWTNEFEKWAPSVKKIVFKGSPNQRKELSNQVRAG
DFQVLLTTYEYIIKDKALLGRIRWVHMIIDEGHRMKNTQSKLAQTLTQFYYSRYRLILTG
TPLQNNLPELWALLNFVLPKIFNSVKTFDEWFNTPFANSGSQDKMELTEEETLLVIRRLH
KVLRPFLLRRLKKDVEKDLPDKVETVIKCKMSALQLKMYQQMLKYNALYVGDDSGAAGVN
KSGVKGLNNKIMQLRKICNHPYVYEEVETLLNPSHGNNDLLWRSAGKFELLDRILPKFKA
RDHRVLMFFQMTQIMDIMEDYLRLRGLQYLRLDGNTKADDRSEMLKLFNAPDSPYFCFLL
STRAGGLGLNLQTADTVIIYDTDWNPHQDLQAQDRAHRIGQTKEVRILRLITEDSVEEVI
LQRAHAKLEIDGKVIQAGKFDNKSTAEEQEAFLRGLLEQEESKRGSREAEDEDLDDEELN
DLLARNEEERAFYAQMDAERNATSDYGKGAGRPDRLLSESELPDQFTQDVSEHFREDDMA
DSDKYGRGARERKEVYYDDGLTEEQWLNIVDNDEDLDAAIKKRRRSRRSRAPTDLDDGLD
ADDIIVDDADGDDQFVDEEASPPPKKKNKNKIPKVRETLSVADREELNESCQLALNECFA
LTEPWDPEDESAGLRHVCDLFRSIPSKKLYPDYFKLIKNPIALDVIQKKVKQNVYQNRYD
FVADFELMNQNARTYNEENSPVVTDADRLLDIARG*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
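
As an illustration of this rule, the sketch below translates the first and last codons of the coding sequence listed on this page. It assumes the standard genetic code (an assumption, not stated on this page); the function name translate is hypothetical.

```python
# Standard genetic code, bases ordered T, C, A, G (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six and last four codons of YALI0C01243g.cds as listed on this page.
first_residues = translate("ATGGATCTCGAAAACGGG")
last_residues = translate("GCCCGGGGTTAG")

# The stop codon is counted as a codon but is not part of the protein.
codons = 3708 // 3
protein_length = codons - 1

print(first_residues, last_residues, codons, protein_length)
```

The translated fragments agree with the start (MDLENG) and end (ARG*) of the predicted translation product, and 1236 codons correspond to a protein of 1235 amino acids.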

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarily chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of the protein-coding potential of DNA was performed from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.
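
The extent of such a window can be sketched as follows. This is an illustration only: it assumes inclusive 1-based chromosome coordinates and that, for an element on the complement strand, "upstream" means higher coordinates. The function name flanked_window is hypothetical, and the chromosome length is not known here, so the upper bound is not clamped.

```python
def flanked_window(start, end, on_complement, upstream=1000, downstream=300):
    """Return the (start, end) of an element extended by its flanks.

    On the complement strand the 5' (upstream) side lies at higher coordinates.
    """
    if on_complement:
        low, high = start - downstream, end + upstream
    else:
        low, high = start - upstream, end + downstream
    # Clamp at the chromosome start; if clamping applies, the offsets below shift.
    return max(1, low), high

# Element span of YALI0C01243g on Yali0C (complement strand).
window = flanked_window(174518, 178523, on_complement=True)
window_length = window[1] - window[0] + 1

# Read 5' to 3' on the coding strand, the ORF starts right after the upstream flank.
orf_start_in_window = 1000 + 1

print(window, window_length, orf_start_in_window)
```

For this element the window is 5306 nucleotides long (4006 + 1000 + 300), which is also the length of the nucleotide sequence shown on this page, where the initial ATG sits at position 1001.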

Sequences
Color    Nucleotide sequence and Coding sequence                 Predicted translation product
RED      start and stop codons                                   Initial methionine and sequence end
BLUE     coding sequence                                         protein sequence
grey     non-coding sequence (upstream, downstream or intron)
grey     donor and acceptor splicing sites
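
In a plain-text copy of the nucleotide sequence the colors are lost, but the intron is still recognizable because it is written in lowercase, while the flanks are uppercase like the exons. The sketch below recovers the coding sequence from such a listing. It assumes that the flank sizes are known (1000 upstream and 300 downstream on this page) and that lowercase marks introns only; the function names are hypothetical and the example uses a short toy sequence, not the real one.

```python
import re

def extract_element(flanked_seq, upstream=1000, downstream=300):
    """Trim the upstream and downstream flanks from a displayed sequence."""
    return flanked_seq[upstream:len(flanked_seq) - downstream]

def extract_cds(flanked_seq, upstream=1000, downstream=300):
    """Drop the flanks, then drop lowercase (intron) bases to get the CDS."""
    element = extract_element(flanked_seq, upstream, downstream)
    return "".join(base for base in element if base.isupper())

def introns(flanked_seq, upstream=1000, downstream=300):
    """Return the lowercase runs (introns) found inside the element."""
    return re.findall(r"[a-z]+", extract_element(flanked_seq, upstream, downstream))

# Toy example: 2-nt flanks, two exons and one intron with gt...ag splice sites.
toy = "AC" + "ATGGTT" + "gtaagtttag" + "GATTAA" + "TT"
toy_cds = extract_cds(toy, upstream=2, downstream=2)
toy_introns = introns(toy, upstream=2, downstream=2)

print(toy_cds)
print(toy_introns)
```

Applied to the full nucleotide listing with the default flank sizes, the same steps should yield the 3708-nucleotide coding sequence, and the single intron can be checked for the gt donor and ag acceptor sites.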