YALI0E13970g


some similarities with DEHA0D03223g Debaryomyces hansenii

Genomic environment map

Element type: CDS
Element length: 1899 nucleotides, on anti-sense strand of Yali0E: complement(1681783..1683681).
Other names:
YALI-CDS1436.1
YALI-IPF4380
Coding sequence: 633 codons.
Database cross references:
EMBL: CR382131
GeneID: 2911493
HOGENOM: Q6C5Y9
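
The element length and codon count above follow directly from the coordinates. The sketch below is a minimal illustration, assuming the location string uses the inclusive, 1-based `complement(start..end)` convention; the helper names `parse_location` and `reverse_complement` are hypothetical and not part of any database tooling.

```python
import re

def parse_location(location: str):
    """Parse 'start..end' or 'complement(start..end)' into (start, end, strand).

    Assumes inclusive, 1-based coordinates.
    """
    m = re.fullmatch(r"(complement\()?(\d+)\.\.(\d+)\)?", location.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    strand = "-" if m.group(1) else "+"
    return int(m.group(2)), int(m.group(3)), strand

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

start, end, strand = parse_location("complement(1681783..1683681)")
length = end - start + 1   # inclusive coordinates
codons = length // 3       # codon count includes the stop codon
print(length, codons, strand)  # 1899 633 -
```

The sequences listed further down this page already start with ATG, so they appear to be given in coding orientation; a helper like `reverse_complement` would only be needed when extracting the element from the forward strand of the chromosome.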

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0218 GL3C0218.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0E13970p  


Protein domain map

Protein length: 632 amino acids
Protein family: GL3C0218
Database cross references:
InterPro: IPR007518
KEGG: yli:YALI0E13970g
PANTHER: PTHR18063
Pfam: PF04424
RefSeq: XP_503923.1
UniProtKB/TrEMBL: Q6C5Y9
UniProtKB: Q6C5Y9_YARLI

Computed results for YALI0E13970p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  

Nucleotide sequence

>YALI0E13970g.nt
GACGAGCTCAAGGGTACATTGCTGGAGCTGGGAATCGCTGAGGTTCTGATTCCCCTCACC
TTGTCGGACAACATTGAGGTGCAGGGCAACAGTGCCGCTGCGCTTGGCAACCTGTCTTCC
AAGGTGGGCAACTACGACACGTTTGTCAACCACTGGAACGAGCCCAGCGGAGGAATCCGG
GAGTTTTTGATCCGGTTCCTCACCAGCGGCGACTCTACGTTTGGCCACATTGCCGTGTGG
ACCGTGCTGCAGCTGCTGGAGTCCAAGGACCAGCGTCTTAAGGATCTGCTCAAGAACTCC
AATGAGATTGTCGATGCCATCCACCAGCTATCCAGCATGAACTCTGATGGGGCTGACGGC
GATGGAGAGGACATGCGGGATTCCAAGAGCGAGGTTGTCATGTTGGCTAAAAAGGTGGCT
CCTTTGCTCAAATAGTAGGATGCGATGATTTATTATATATGGAGAAGGAAGACAACTAAT
GTATTTGTAATCGGCACTTGGAGTTGTAGATACTGTAGTTTCCATCGTCATATCATATCT
CCTTGGAGATAACACAAACTTTGTTATCGGTGTTACCTGTAATAGGCACAATGTAACCTT
GTTCATTCACATGTGTGGATGAATAAAAGTGGAGGTGGTGACCTACAATTGTAGATGGAT
GACAATCTTAATGGGCCGGCTTCATAGACCAGTGGGGCCGTTGAGGGTGCAAACCGTGTT
TTGATAGAGTGGAAAAAGCAACAATCGGGTGAGATTAAAGGTCACGTGTCCTCTTCCTGT
TGGTCCGTCGGTCACTCTATGCTGTGTGTCCTCCGAAATGCTCCTGTTGTAGGCTGTGGT
ACACCCCCCTAGCACAGCCCCTTTAAACTCGGTCACCGTGCATAGATAGCTCCGATATAA
GCATGCTCCCACGGGGAAAACAAGCCCACTACACCGACTTGAAAATTGGAACCGGAAGCG
AAAAAACAAACCAACCATATCAAAACAGACAGCTATAGTTATGGATGGATCTGAAGCGAG
CATACGGCACATGGAGACGACCGCGAGCGGGGCCATGGAGGACGGCGAAAAGGCGGCGTT
GAACACGAGCTCGGAGCCCAGTCGCAACCCCTTTCGCCAGTCTCTGTCTACCGAGCCTCA
CTTGATGCCCTCTCCTCCGGCGGACTCGACGCTAAACTCCGATATCGCAAATACCATTGC
GGCGGAGGAGACTGACGACCATGCCAAGAGCGCCACAGCGGTCGCGGGCCACGCCCAGGC
ACACTCTTGTCCGGCCGTGTATACCACAGAGGCTATGCAGGAACCAGGCGAAGTAGCACA
GCCCTCCTCGATCCAGGGGCACACTACAACCACGCCCACGACCTCGGACTTCATTGTCGA
ACGCAGCCAGGTCGACATTCCTGCCACCGTGCCCACCACCTCTGTGTCTGCCGTCAACAT
CCCCGCCGAACCCGAAAAGATGCACTATCTCGACGTCCAGGGAGGCTCCAGCAACACGCC
GTCCGACGTGGACTTTACAAGCTCCATTCGAAACTCGATCCATTCCTTCACATCGTTCCA
ACCCAACCCCGCAAACAGAGCCAGCTATATTGGAGACGGCCATGTGGAGGAGCTGGGTCT
GCCAGAGGCTGGCAATTCAGACAGTGCCCTGGCTCCCCTGACAGTCAGGTCACCAGTCAG
ATCGCCTATCCTGAAGTCTGCTTCTCCTCCTCCTCAGAATATCATCCCAGCACCGCTCAC
CTTCAACCCACCAAAGGACAATATCACGTTCCAGGTCAAGGTCATCAAGTGGAGAACACC
AGCCAACTACCTGGTCAAAACACCCATCATCCTCCAGGACGAAAACGGCCCGTGTCCCTT
CATCGCACTGGTCAACACCCTCGTTTTCACCGAGGCCATGTCTCCCATCCCTCCAGGCCC
CGGAAGACCACTTTCTGCTCTTTTGGAGAACAAAGAGATGGTGAGCAAAAACCTGCTACT
GGATCATCTCGGACAGTGGCTCCTCAGTATCGGCAGCCGCCAAAGCGGACCTCATATCAA
CCCAGACGATCTCAACACCTGTCTACGACTTTTGCCAGAACTATACTCCGGTCTCAACAT
CAACCCTCGGTTCGACGGCACGTTTGAAGAAGGGCCCGAGCTGGCTCTCTTCAGAGCCTT
TGAGGTGGATGTAGTGCATGGATGGATCGCAGACCCCAAGGAGCCGTACCATGACGATGT
AATGGAGGTGGGATCATACGATGCAGCCCAACTATTACAGATCGAGGTTACCGAAGACGG
AAAGATGAAACAAAGGGAACGAGAGGTACTGCATCGACAACTGGCAGCCACGTTCGACTT
CATGGACGAGAACCCCTCACAGCTGACCACCTACGGTATCCGGTACATTGAGGAGATCCT
GGTTCCTGGTTCTGTGTGTGTCTTCTTCAGGAACAACCACTTTGCCACTCTGTACAAACA
GCCAACCTCAGGGCGTCTCTTTAGTCTCGTGACAGACAGAGAGCTGTGTGGACGGAACGG
AATTGTGTGGATTTCGCTGGAGGGAACTTCGGGAACAGACGATACCTTTTACACCGGAGG
CTTTGATTTGGTACAAATGATGACAGACCAGGAACAGGAAGAGTCGAGAAGAAGAGCCCA
CCAAACAGTCGAGGCCACCAACGACTTCCATCTCGCCAAGCAGATCCAGGAGCAGGACGA
CGCCGAGTACGCCAGACAGATTCAGGAAGAAGACCAACAGCGTCGACGACCCCAGCAGAC
AACCACTAGCACGGCAGGTACGACTGCTCGACGACAGCAGCAGCAGACCCGAAGCGGCAA
GGCCACTAAAAGCAGACCCGATAAGACCAAAGGAAAAAAATCCAAAGATGGCAAGGATAA
AAAGTGTGTCGTGATGTGACTTACAGTCAAAAGAACATCCTTGAAAATCAAAATAAAAGT
TACGAATTACGAATAATGATGTTTGGATGGTGCTTGTACGAACAACTATCTGGTTACTGA
TGTAGCCATCGACTTCTCTGCTTGAGATCATGTATTTTTCTATTTCGATCTGCGATCATC
GCATCATAGACAACCATTGCTCTGTATTATGAATTTTGCTCCCCCAACTCACCATGTACT
GCAATATAGCTATGGTTATTACGCAATAAATTACCTATGATACTCATACTCCGCTGAGAC
CGTTTTCTTCGTTGTGGAC

Coding sequence

>YALI0E13970g.cds
ATGGATGGATCTGAAGCGAGCATACGGCACATGGAGACGACCGCGAGCGGGGCCATGGAG
GACGGCGAAAAGGCGGCGTTGAACACGAGCTCGGAGCCCAGTCGCAACCCCTTTCGCCAG
TCTCTGTCTACCGAGCCTCACTTGATGCCCTCTCCTCCGGCGGACTCGACGCTAAACTCC
GATATCGCAAATACCATTGCGGCGGAGGAGACTGACGACCATGCCAAGAGCGCCACAGCG
GTCGCGGGCCACGCCCAGGCACACTCTTGTCCGGCCGTGTATACCACAGAGGCTATGCAG
GAACCAGGCGAAGTAGCACAGCCCTCCTCGATCCAGGGGCACACTACAACCACGCCCACG
ACCTCGGACTTCATTGTCGAACGCAGCCAGGTCGACATTCCTGCCACCGTGCCCACCACC
TCTGTGTCTGCCGTCAACATCCCCGCCGAACCCGAAAAGATGCACTATCTCGACGTCCAG
GGAGGCTCCAGCAACACGCCGTCCGACGTGGACTTTACAAGCTCCATTCGAAACTCGATC
CATTCCTTCACATCGTTCCAACCCAACCCCGCAAACAGAGCCAGCTATATTGGAGACGGC
CATGTGGAGGAGCTGGGTCTGCCAGAGGCTGGCAATTCAGACAGTGCCCTGGCTCCCCTG
ACAGTCAGGTCACCAGTCAGATCGCCTATCCTGAAGTCTGCTTCTCCTCCTCCTCAGAAT
ATCATCCCAGCACCGCTCACCTTCAACCCACCAAAGGACAATATCACGTTCCAGGTCAAG
GTCATCAAGTGGAGAACACCAGCCAACTACCTGGTCAAAACACCCATCATCCTCCAGGAC
GAAAACGGCCCGTGTCCCTTCATCGCACTGGTCAACACCCTCGTTTTCACCGAGGCCATG
TCTCCCATCCCTCCAGGCCCCGGAAGACCACTTTCTGCTCTTTTGGAGAACAAAGAGATG
GTGAGCAAAAACCTGCTACTGGATCATCTCGGACAGTGGCTCCTCAGTATCGGCAGCCGC
CAAAGCGGACCTCATATCAACCCAGACGATCTCAACACCTGTCTACGACTTTTGCCAGAA
CTATACTCCGGTCTCAACATCAACCCTCGGTTCGACGGCACGTTTGAAGAAGGGCCCGAG
CTGGCTCTCTTCAGAGCCTTTGAGGTGGATGTAGTGCATGGATGGATCGCAGACCCCAAG
GAGCCGTACCATGACGATGTAATGGAGGTGGGATCATACGATGCAGCCCAACTATTACAG
ATCGAGGTTACCGAAGACGGAAAGATGAAACAAAGGGAACGAGAGGTACTGCATCGACAA
CTGGCAGCCACGTTCGACTTCATGGACGAGAACCCCTCACAGCTGACCACCTACGGTATC
CGGTACATTGAGGAGATCCTGGTTCCTGGTTCTGTGTGTGTCTTCTTCAGGAACAACCAC
TTTGCCACTCTGTACAAACAGCCAACCTCAGGGCGTCTCTTTAGTCTCGTGACAGACAGA
GAGCTGTGTGGACGGAACGGAATTGTGTGGATTTCGCTGGAGGGAACTTCGGGAACAGAC
GATACCTTTTACACCGGAGGCTTTGATTTGGTACAAATGATGACAGACCAGGAACAGGAA
GAGTCGAGAAGAAGAGCCCACCAAACAGTCGAGGCCACCAACGACTTCCATCTCGCCAAG
CAGATCCAGGAGCAGGACGACGCCGAGTACGCCAGACAGATTCAGGAAGAAGACCAACAG
CGTCGACGACCCCAGCAGACAACCACTAGCACGGCAGGTACGACTGCTCGACGACAGCAG
CAGCAGACCCGAAGCGGCAAGGCCACTAAAAGCAGACCCGATAAGACCAAAGGAAAAAAA
TCCAAAGATGGCAAGGATAAAAAGTGTGTCGTGATGTGA

Predicted translation product

>YALI0E13970g.aa
MDGSEASIRHMETTASGAMEDGEKAALNTSSEPSRNPFRQSLSTEPHLMPSPPADSTLNS
DIANTIAAEETDDHAKSATAVAGHAQAHSCPAVYTTEAMQEPGEVAQPSSIQGHTTTTPT
TSDFIVERSQVDIPATVPTTSVSAVNIPAEPEKMHYLDVQGGSSNTPSDVDFTSSIRNSI
HSFTSFQPNPANRASYIGDGHVEELGLPEAGNSDSALAPLTVRSPVRSPILKSASPPPQN
IIPAPLTFNPPKDNITFQVKVIKWRTPANYLVKTPIILQDENGPCPFIALVNTLVFTEAM
SPIPPGPGRPLSALLENKEMVSKNLLLDHLGQWLLSIGSRQSGPHINPDDLNTCLRLLPE
LYSGLNINPRFDGTFEEGPELALFRAFEVDVVHGWIADPKEPYHDDVMEVGSYDAAQLLQ
IEVTEDGKMKQREREVLHRQLAATFDFMDENPSQLTTYGIRYIEEILVPGSVCVFFRNNH
FATLYKQPTSGRLFSLVTDRELCGRNGIVWISLEGTSGTDDTFYTGGFDLVQMMTDQEQE
ESRRRAHQTVEATNDFHLAKQIQEQDDAEYARQIQEEDQQRRRPQQTTTSTAGTTARRQQ
QQTRSGKATKSRPDKTKGKKSKDGKDKKCVVM*
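
The predicted translation product can be reproduced from the coding sequence. The sketch below assumes the standard genetic code (an assumption; the page does not state which translation table was used) and uses a hypothetical helper `translate`. Only the first 60 nucleotides of the coding sequence are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First line of the coding sequence shown above.
first_line = "ATGGATGGATCTGAAGCGAGCATACGGCACATGGAGACGACCGCGAGCGGGGCCATGGAG"
print(translate(first_line))  # MDGSEASIRHMETTASGAME
```

Applied to the full coding sequence, the same function would be expected to return the 632 residues followed by `*`, as listed above.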




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
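
As a worked check of this rule against the figures on this page, here is a minimal sketch; the function name `protein_length_from_cds` is hypothetical.

```python
def protein_length_from_cds(cds_nucleotides: int) -> int:
    """Protein length implied by a CDS length that includes the stop codon."""
    if cds_nucleotides % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    codons = cds_nucleotides // 3  # includes the stop codon
    return codons - 1              # the stop codon encodes no residue

print(1899 // 3, protein_length_from_cds(1899))  # 633 632
```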

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.
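
The nucleotide sequence listed on this page appears to use the same window: 1000 + 1899 + 300 = 3199 nucleotides, with the ATG start codon at position 1001. The sketch below shows how such a window could be split into its three parts; `split_window` is a hypothetical helper, and the flank sizes are taken from the note above.

```python
def split_window(window: str, upstream: int = 1000, downstream: int = 300):
    """Split a flanked sequence into (upstream, ORF, downstream) parts."""
    if len(window) < upstream + downstream:
        raise ValueError("window is shorter than the two flanks combined")
    orf_end = len(window) - downstream
    return window[:upstream], window[upstream:orf_end], window[orf_end:]

# Toy window: artificial flanks around a minimal ORF (not real data).
toy = "A" * 1000 + "ATGTGA" + "C" * 300
up, orf, down = split_window(toy)
print(len(up), orf, len(down))  # 1000 ATGTGA 300
```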

Sequences
Color   Nucleotide sequence and Coding sequence                  Predicted translation product
RED     start and stop codons                                    Initial methionine and sequence end
BLUE    coding sequence                                          protein sequence
grey    non-coding sequence (upstream, downstream or intron)
grey    donor and acceptor splicing sites