KLLA0B14839g


conserved hypothetical protein

Genomic environment map

Element type: CDS
Element length: 1266 nucleotides, on anti-sense strand of Klla0B: complement(1304089..1305354).
Other names:
KLLA-ORF8550
Coding sequence: 422 codons.
Database cross references:
EMBL: CR382122
EMBL: X06997
GeneID: 2897102
HOGENOM: P08540

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0576 GL3C0576.F2 GL3C0576.N1
Orthologs: strict determination not possible; homologs must be refined manually

Protein KLLA0B14839p  


Protein domain map

Protein length: 421 amino acids
Protein family: GL3C0576
Database cross references:
InterPro: IPR007312
KEGG: kla:KLLA0B14839g
PIR: B31776
Pfam: PF04185
RefSeq: XP_452192.1
UniProtKB/Swiss-Prot: P08540
UniProtKB: PHOX_KLULA

Computed results for KLLA0B14839p  

Blastp Genolevures
Blastp UniProt

Gene Ontology terms  

None available yet

Sequence data  

Nucleotide sequence

>KLLA0B14839g.nt
CTCCAATGCAATCCATGTACTCAACAGAAGTGTCTACAAACTTGACGAGATCTAAGGCCC
AACTCCTCAACTTTGTGGTTTCTGGTGTTGCCCAATTTGTTAATCAATTTGCTACTCCAA
AGGCAATGAAGAATATCAAATATTGGTTCTATGTGTTCTACGTTTTCTTCGATATTTTCG
AATTTATTGTTATCTACTTCTTCTTCGTTGAAACTAAGGGTAGAAGCTTAGAAGAATTAG
AAGTTGTCTTTGAAGCTCCAAACCCAAGAAAGGCATCCGTTGATCAAGCATTCTTGGCTC
AAGTCAGGGCAACTTTGGTCCAACGAAATGACGTTAGAGTTGCAAATGCTCAAAATTTGA
AAGAGCAAGAGCCTCTAAAGAGCGATGCTGATCATGTCGAAAAGCTTTCAGAGGCAGAAT
CTGTTTAAAGCCATCTTTTCAATATATTTTGTTAGGTGCAAGAAGTTTCCGTACTTCATA
ATTTGTTTTTTATTCTGCTTGATCTTCTCCTAATTGCAGCAAAAAGTCTTGTGGAATTCA
TCAATTAAAAAGCCACAGGCTTTAGACTCTTAATGGATTATTCTAACAGTTACTGTTAGA
CGTTAGCACATGGGCCGGTATGTCATTGAAGTACGTTTAAATTGCCTGAATTGGGAAAAG
TATTGACCTTTGAGCCGGTTGCATACCGCCTTGCGGTATGCAGTTTGGCTCGGTTTTCCC
AGCACGCATGTGGGCATCATTTCCACACGTGTGAACCCTCGGCAGTTAAATGTGTGCATT
TAGGATCGGCTAATAGTTGTTTTTAGCTTCAGCTTCAGCTCATCGGATTTTGTGAAACAT
ATAAATCTGGCGTCAATTGAGTCTCTCGAATGGTTGAAAGGTCAATTCAGGTTGGAATTT
TGTATTATGTTAGCTATATGGGATTGATCAAAAAAACCAGCCAAGGATTCAGAGAATTAC
AGCGCAAGCTCAGGTAGCACTCCAGTTTTAAACATAAAAAATGAAATTCTCCGATTTTAG
TGTTCTCGGATTAGGTGCTCTTGCCTTAAATGCCGTTACCGTCAGTGCTAACACTGCTGA
CACCGCTTTGTTGAGAACTTACTCTACTATCTCTCCAAGTTTGAGTGAAATTGAATCCGC
CGCTTCTGCTACTGAGGTTGCTGAGGTGGTTTCAGATGTCGAAGGTGCCGCTTTCAAGAG
ATTCTTCATTATCTTCTTGGAAAACACCGATTACGACAAGGCTGCTGGTGATGAATCCCT
TTCATGGCTGGCTGAACAAGGTATTACCTTGACCAACTACTGGGCTTTGACTCACCCTTC
AGAACCAAACTATTTGGCTTCTGTCGGTGGTGATTACTTCGCCTTGGATGATGACAGATT
TATTTCTATGCCATCCAATGTGTCTAACATCGTTGATTTGTTGGACACTAAGGGTATCTC
TTGGGCTGAATACCAAGAACACTCTCCATATGCTGGTTTCCAAGGTATGAACTTCTCTAA
CCAAGAGACATACGCCAGTGATTATGTCAGAAAACATAACCCATTGATTTTGTTCGATAA
CGTTGTCAACAACGATACTCGTTTGGCTAACATTAAGAACTTTGAAGATTTCAACAATGA
CGTAGAAAACGAAAAGTTACCTCAATACGCTTTCATTACTCCAAATATGACTAACGATGG
TCATGACACCACTATCCAATTTGCTGGTAAATGGTCAAAGGACTTCTTGGCTCCATTATT
AGAAAACGATTACTTCATGGAGGACACTTTGGTCTTGTTGACCTTCGATGAAAATGAAAC
CTATGGTATCAAGAACAAGGTCTTCTCCATCTTGTTGGGTGGCGTTATCCCAGATGAATT
GAAGGGTACCAAGGATGATACTTTCTATGACCATTACTCTCAATTGGCTAGTGTTGAAGC
CAACTGGGATTTGCCTCATTTAGGTAGACACGATGGTGACGCCAATGTTTTGGAAATTGT
TGCTAACGCTACCAACATCACCAACGTTGAAGTTGACACCACTTACATGATCAACGAAAC
CTATATTGGTTACTTGAACGATTACAACATTGAATTGCCAGCTCCAAATGTTACTGCTAT
TAACAGAAACGGTCAACCAATCTTGGACTCCATCAAGGAAACTTGGGAAGATGAATATTC
TAAGCAAGTCTCTGAATCCTACTATACTTCCACCACTACTACCGTTTCTGCAGATGTTAC
TGATGCTGAAACCTTCTCCAATTTCTACCGCTACCGACAGTGCTGATGAAACTAGTGCTA
CCGCTACATCTTCTTCCAACAGCACTTCTGCTGAAACTTTAAGCTCTTCCGCCGGTGCTC
CATCAATGAGCACCTTCTCCGGTAACGTCTTGTTGGGTGTCATTGCTGCCTTTTTCTTAT
GAAAAGATATCGACTCAAAGGAGTAGCAGCACACTCCTATAGAGAAACTAGAGTGAAAAC
GGCAGTTCACTCTTAGTTAATATTATTTATTCCGTACAAATAGAACAGTTGAAAAGTGAC
GCTTTTATAAAATAAGAACTTGATTTTGATTTGACCTTTTAATTAT

Coding sequence

>KLLA0B14839g.cds
ATGAAATTCTCCGATTTTAGTGTTCTCGGATTAGGTGCTCTTGCCTTAAATGCCGTTACC
GTCAGTGCTAACACTGCTGACACCGCTTTGTTGAGAACTTACTCTACTATCTCTCCAAGT
TTGAGTGAAATTGAATCCGCCGCTTCTGCTACTGAGGTTGCTGAGGTGGTTTCAGATGTC
GAAGGTGCCGCTTTCAAGAGATTCTTCATTATCTTCTTGGAAAACACCGATTACGACAAG
GCTGCTGGTGATGAATCCCTTTCATGGCTGGCTGAACAAGGTATTACCTTGACCAACTAC
TGGGCTTTGACTCACCCTTCAGAACCAAACTATTTGGCTTCTGTCGGTGGTGATTACTTC
GCCTTGGATGATGACAGATTTATTTCTATGCCATCCAATGTGTCTAACATCGTTGATTTG
TTGGACACTAAGGGTATCTCTTGGGCTGAATACCAAGAACACTCTCCATATGCTGGTTTC
CAAGGTATGAACTTCTCTAACCAAGAGACATACGCCAGTGATTATGTCAGAAAACATAAC
CCATTGATTTTGTTCGATAACGTTGTCAACAACGATACTCGTTTGGCTAACATTAAGAAC
TTTGAAGATTTCAACAATGACGTAGAAAACGAAAAGTTACCTCAATACGCTTTCATTACT
CCAAATATGACTAACGATGGTCATGACACCACTATCCAATTTGCTGGTAAATGGTCAAAG
GACTTCTTGGCTCCATTATTAGAAAACGATTACTTCATGGAGGACACTTTGGTCTTGTTG
ACCTTCGATGAAAATGAAACCTATGGTATCAAGAACAAGGTCTTCTCCATCTTGTTGGGT
GGCGTTATCCCAGATGAATTGAAGGGTACCAAGGATGATACTTTCTATGACCATTACTCT
CAATTGGCTAGTGTTGAAGCCAACTGGGATTTGCCTCATTTAGGTAGACACGATGGTGAC
GCCAATGTTTTGGAAATTGTTGCTAACGCTACCAACATCACCAACGTTGAAGTTGACACC
ACTTACATGATCAACGAAACCTATATTGGTTACTTGAACGATTACAACATTGAATTGCCA
GCTCCAAATGTTACTGCTATTAACAGAAACGGTCAACCAATCTTGGACTCCATCAAGGAA
ACTTGGGAAGATGAATATTCTAAGCAAGTCTCTGAATCCTACTATACTTCCACCACTACT
ACCGTTTCTGCAGATGTTACTGATGCTGAAACCTTCTCCAATTTCTACCGCTACCGACAG
TGCTGA

Predicted translation product

>KLLA0B14839g.aa
MKFSDFSVLGLGALALNAVTVSANTADTALLRTYSTISPSLSEIESAASATEVAEVVSDV
EGAAFKRFFIIFLENTDYDKAAGDESLSWLAEQGITLTNYWALTHPSEPNYLASVGGDYF
ALDDDRFISMPSNVSNIVDLLDTKGISWAEYQEHSPYAGFQGMNFSNQETYASDYVRKHN
PLILFDNVVNNDTRLANIKNFEDFNNDVENEKLPQYAFITPNMTNDGHDTTIQFAGKWSK
DFLAPLLENDYFMEDTLVLLTFDENETYGIKNKVFSILLGGVIPDELKGTKDDTFYDHYS
QLASVEANWDLPHLGRHDGDANVLEIVANATNITNVEVDTTYMINETYIGYLNDYNIELP
APNVTAINRNGQPILDSIKETWEDEYSKQVSESYYTSTTTTVSADVTDAETFSNFYRYRQ
C*
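
The predicted translation product above can be checked against the coding sequence. The following is a minimal sketch, assuming the standard genetic code applies to this gene; the helper name `translate` and the table constants are illustrative and not part of the database. It translates the first and last codons of the coding sequence listed above.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven and last eight codons of KLLA0B14839g.cds, copied from the record.
first_codons = "ATGAAATTCTCCGATTTTAGT"
last_codons = "TTCTACCGCTACCGACAGTGCTGA"

print(translate(first_codons))  # MKFSDFS
print(translate(last_codons))   # FYRYRQC*
```

Passing the full coding sequence (with line breaks removed) to `translate` should give the protein followed by a single `*` for the stop codon.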




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
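
The relationship between these lengths can be worked through with the coordinates given for this element. This is a small illustrative sketch; the function name `feature_length` is hypothetical, and the coordinates are the ones shown in the record above.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


# Coordinates from the record: complement(1304089..1305354)
cds_nt = feature_length(1304089, 1305354)

# Codon count includes the stop codon, so the protein is one residue shorter.
codons = cds_nt // 3
protein_aa = codons - 1

print(cds_nt, codons, protein_aa)  # 1266 422 421
```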

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
The Genemark computation of the protein-coding potential of the DNA covers the region from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the protein-coding potential of the open reading frame appears on frame #3.
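
As an illustration of the window described here, the sketch below computes the window size and the position of the open reading frame within it for this element. The constant and function names are hypothetical, the 1266-nucleotide length is taken from the record above, and this is not the Genemark procedure itself.

```python
UPSTREAM_NT = 1000    # nucleotides included before the open reading frame
DOWNSTREAM_NT = 300   # nucleotides included after the open reading frame
ORF_NT = 1266         # element length of KLLA0B14839g, from the record

# Total window length and 1-based, inclusive ORF position inside the window.
window_nt = UPSTREAM_NT + ORF_NT + DOWNSTREAM_NT
orf_start = UPSTREAM_NT + 1
orf_end = UPSTREAM_NT + ORF_NT


def extract_orf(window_seq: str) -> str:
    """Slice the open reading frame out of a window laid out as described."""
    return window_seq[UPSTREAM_NT:UPSTREAM_NT + ORF_NT]


print(window_nt, orf_start, orf_end)  # 2566 1001 2266
```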

Sequences
Color | Nucleotide sequence and Coding sequence | Predicted translation product
RED | start and stop codons | Initial methionine and sequence end
BLUE | coding sequence | protein sequence
grey | non-coding sequence (upstream, downstream or intron) |
grey | donor and acceptor splicing sites |