YALI0C06644g


similar to ca|CA0882|CaPHR3 Candida albicans surface glycoprotein (by homology)

Genomic environment map

Element type: CDS
Element length: 1308 nucleotides, on the anti-sense strand of Yali0C: complement(join(891654..892767,891460..891566)).
Other names:
YALI-CDS2921.1
YALI-IPF6496
Coding sequence: 407 codons.
Database cross references:
EMBL: CR382129
GeneID: 2909217
HOGENOM: Q6CCT6
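
The element length and the codon count listed above can both be recovered from the location string. The following is a minimal sketch, assuming the location uses 1-based, inclusive coordinates as in EMBL-style feature tables; the helper name parse_location is hypothetical and not part of any database tool.

```python
import re

def parse_location(location):
    """Hypothetical helper: return (is_complement, [(start, end), ...])
    from an EMBL-style location string."""
    is_complement = location.startswith("complement(")
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments

location = "complement(join(891654..892767,891460..891566))"
is_complement, segments = parse_location(location)

# Coordinates are assumed 1-based and inclusive: a..b covers b - a + 1 bases.
exon_lengths = [end - start + 1 for start, end in segments]
spliced_length = sum(exon_lengths)
element_length = max(e for _, e in segments) - min(s for s, _ in segments) + 1
intron_length = element_length - spliced_length

print(is_complement)        # True
print(exon_lengths)         # [1114, 107]
print(spliced_length)       # 1221
print(spliced_length // 3)  # 407
print(element_length)       # 1308
print(intron_length)        # 87
```

Under these assumptions the two joined segments add up to 1221 nucleotides (407 codons), and the 1308-nucleotide element length additionally includes the 87 nucleotides between them.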

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R0042 GL3R0042.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein YALI0C06644p  


Protein domain map

Protein length: 406 amino acids
Protein family: GL3R0042
Database cross references:
InterPro: IPR004886
KEGG: yli:YALI0C06644g
Pfam: PF03198
RefSeq: XP_501526.1
UniProtKB/TrEMBL: Q6CCT6
UniProtKB: Q6CCT6_YARLI

Computed results for YALI0C06644p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>YALI0C06644g.nt
AGCTGTGGTATGGTGTGATGTGCATGGGTGCGACAGAAAGGTGGGGTGCAGAAATATGAG
TATTTTAGGATGGTTTAATTGGATTAAATAAATAAATAAATGAAAGAGAAATACGATAAA
TTCATGACAGCGTATGGAGAGTGTATTGGATAGAAGAGTTCTATAGTAGTAAAATCAGTA
GTATAATGTATACCAAGGAACCATTAGCAGCTCAAAGATCCCACCACCTCTGTTATGTGA
CCTTGTGGGCAGTGATATGGCGTGTCTCTGATGTATAAATAGAGATCCACGGAAGCTCAC
TGACTGCTGCTGTTTTTTTTTTTTGCATATAAAAGCAAGGGAGGTTGCCTAAGACGCTTC
AGCCTAATTACACTACAAGTACAAGTACAAGTACAGTACAGTACAGTACATACTTCATTG
CTCCCATACCCCAATTAAGGCGGCGGCCATAACACCGTCTCCGTGGTAACGTGAAAACGC
TTTTGTGACCATATCTAAAAAGACATGGCCTAATTCTCAGCTACTCACAACCTGCCTCTC
TTCATCCACCACACCCTATCAAGCAACACACTGAAGAAACATGAGGACCGTCCTGCGACT
TTTCGGACTGGCGGCCACAGTTCTGGCCGGAATTGCTCCTATCGAGTGGGAAGGCAGCAC
TTCCGCGACTCGGTCACCAAAGAACCGGTATGATACCTTCCTCCTTGACACCCCCCCTTT
TGATTCCACCCTCAACACATCAGAGCTTGTGATACCAGTTTGTGTTACTAACGCAGTTCT
TCTTCAAGGGCATAGACTACCAGCCCGGCGGGTCGTCGGCGTTTGTGGGCCGATCGGACC
CGCTGTCAGACTACGAGGCATGTTCCAGAGATATTTACCTGTTTCAGCAGCTCGGAGTCA
ATGTGAGTATAATCCACTCTTAACAGTGACTTGCACCCCTCAGCTAACATAGGCAATCCG
CGTCTACACAGTCAACCCAGACATCAACCACGACGAATGCATGACCCTGCTGGCAGAAGC
AGGCATCTACCTCATTCTCGACGTCAACTCCCCACGCATTGGCGAGTCGCTAAACCGATA
CGAGCCCTGGACCACCTACCATGAAAAGTACCTGGAGCACATCTTCAAGGTTGTCGAGCA
GTTCTCCCATTACAACAACACTCTGGCTTTCTTTGCCGGCAACGAGGTCGTCAACGATGA
CCAGTCCGCCATGGTGTCGCCCAACTACATCAAGGCTGTAGTGAGAGACCTCAAGTACTA
CCTAGCCAACCAGTCGCCCCGAAAGATTCCCGTCGGATACTCTGCCGCAGACGATCTCAA
GTACAGAACCTCGTTGGCACAATACCTCGAGTGTGGTGACGAGATGTCTTCCGTCGACTT
TTACGGAGTCAACTCCTATCAGTGGTGTGGCGAACAGTCTTTCGTGTCTTCCGGATACGA
CAGACTCGTGGACGACTACAGGGACTACTCGCTGCCGCTCATCTTCTCCGAGTACGGATG
CAACGAAGTCAAGCCCCGTACTTTCCAGGAGGTCCGAGCAGTTTACTCAAGCAGCATGAC
TGACGTATTTTCTGGAGGGCTCATTTACGAGTTCTCCCAGGAGCCCAACGACTATGGTCT
TGTCCAGATCTACAAGAACCACTCCGCGCAGGTTCTGGAGGACTTTGAGGCACTCAAGAA
AGCATACCACGACGCCCCCAAGGCCAAGCTGTCAGACTTCCGAGCTGTGGAGCGACCTCA
GAGGTGCGAGCTCGTGTATCCCAACATCAACACCATGAACCCTCTCCCCGACACGTTTGG
TCTGGACATGATAACTCGTGGCGTGCGGGCACCCAGGGGCAAGTATGTTGACTTGCACAA
GCGGGGTACATCGTACACCATCTACGACCTGGAAGGAAATGTGATCGAGGACACTGAGGT
GCAGCAGGTGATTGATCTGAAGGAGCCTCTTGCGGCACCGCCCCCATCGCCCCCCACACG
GAAGGCCCCCGCTGTCCCAGAGCCAAAGGACCCTGTGGAGCCAGTCTACGATGAGTATCC
GAAGAAAGTCGCTACCGTTGAAGAGGAGGATGATCACGAGGTGGTGAACCCACGGAGACC
ATTTGAGAGAAAGAgtgagtgtgatagttgcggtaccgacatcgacggtgatggctggag
ccgagatttcttttcatcctttaattatgctaacttactagAGACAAGCACCGGGTCATA
CAGAGACACGGCAGTGCTGGGGTGGATCAAGTACCTGGTGTCAGTGATGCTTGGCGTGGT
GGTGGTGGAAGTGGTGCGGCTGATGTAGGTCGAGGAGGTTTTGGTGTTTGGTCTGTGCGT
AAGCCTGGCACTCCTGCTGGCCGACGGCAACGTTTTCGAAGTCAGAACAAGCCGGCTCCA
CTCTCACAGAGGGAGCTCAAGCACCGAGTCTATGCACTGTCCAACCTTGGTGCGTTCCCT
CGGCGTGCACGACGACCTCTTGGGGGCGTGAGGCTTGCGCGGAACCAACAAAATCAGCTT
GTGATTATTTTATAAGAGAATTCGCTTTCACTATTCTTTTCCGTATTGTTGCTAACAATA
AATACATTAATATTATTATGGTAAAAAT

Coding sequence    

>YALI0C06644g.cds
ATGACCCTGCTGGCAGAAGCAGGCATCTACCTCATTCTCGACGTCAACTCCCCACGCATT
GGCGAGTCGCTAAACCGATACGAGCCCTGGACCACCTACCATGAAAAGTACCTGGAGCAC
ATCTTCAAGGTTGTCGAGCAGTTCTCCCATTACAACAACACTCTGGCTTTCTTTGCCGGC
AACGAGGTCGTCAACGATGACCAGTCCGCCATGGTGTCGCCCAACTACATCAAGGCTGTA
GTGAGAGACCTCAAGTACTACCTAGCCAACCAGTCGCCCCGAAAGATTCCCGTCGGATAC
TCTGCCGCAGACGATCTCAAGTACAGAACCTCGTTGGCACAATACCTCGAGTGTGGTGAC
GAGATGTCTTCCGTCGACTTTTACGGAGTCAACTCCTATCAGTGGTGTGGCGAACAGTCT
TTCGTGTCTTCCGGATACGACAGACTCGTGGACGACTACAGGGACTACTCGCTGCCGCTC
ATCTTCTCCGAGTACGGATGCAACGAAGTCAAGCCCCGTACTTTCCAGGAGGTCCGAGCA
GTTTACTCAAGCAGCATGACTGACGTATTTTCTGGAGGGCTCATTTACGAGTTCTCCCAG
GAGCCCAACGACTATGGTCTTGTCCAGATCTACAAGAACCACTCCGCGCAGGTTCTGGAG
GACTTTGAGGCACTCAAGAAAGCATACCACGACGCCCCCAAGGCCAAGCTGTCAGACTTC
CGAGCTGTGGAGCGACCTCAGAGGTGCGAGCTCGTGTATCCCAACATCAACACCATGAAC
CCTCTCCCCGACACGTTTGGTCTGGACATGATAACTCGTGGCGTGCGGGCACCCAGGGGC
AAGTATGTTGACTTGCACAAGCGGGGTACATCGTACACCATCTACGACCTGGAAGGAAAT
GTGATCGAGGACACTGAGGTGCAGCAGGTGATTGATCTGAAGGAGCCTCTTGCGGCACCG
CCCCCATCGCCCCCCACACGGAAGGCCCCCGCTGTCCCAGAGCCAAAGGACCCTGTGGAG
CCAGTCTACGATGAGTATCCGAAGAAAGTCGCTACCGTTGAAGAGGAGGATGATCACGAG
GTGGTGAACCCACGGAGACCATTTGAGAGAAAGAAGACAAGCACCGGGTCATACAGAGAC
ACGGCAGTGCTGGGGTGGATCAAGTACCTGGTGTCAGTGATGCTTGGCGTGGTGGTGGTG
GAAGTGGTGCGGCTGATGTAG

Predicted translation product    

>YALI0C06644g.aa
MTLLAEAGIYLILDVNSPRIGESLNRYEPWTTYHEKYLEHIFKVVEQFSHYNNTLAFFAG
NEVVNDDQSAMVSPNYIKAVVRDLKYYLANQSPRKIPVGYSAADDLKYRTSLAQYLECGD
EMSSVDFYGVNSYQWCGEQSFVSSGYDRLVDDYRDYSLPLIFSEYGCNEVKPRTFQEVRA
VYSSSMTDVFSGGLIYEFSQEPNDYGLVQIYKNHSAQVLEDFEALKKAYHDAPKAKLSDF
RAVERPQRCELVYPNINTMNPLPDTFGLDMITRGVRAPRGKYVDLHKRGTSYTIYDLEGN
VIEDTEVQQVIDLKEPLAAPPPSPPTRKAPAVPEPKDPVEPVYDEYPKKVATVEEEDDHE
VVNPRRPFERKKTSTGSYRDTAVLGWIKYLVSVMLGVVVVEVVRLM*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.
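
The rule can be checked with a short translation sketch. It assumes the standard nuclear genetic code, which is consistent with the translation shown on this page; the excerpt joins only the first ten and the last four codons of the coding sequence, for illustration, and is not the full sequence.

```python
from itertools import product

# Standard genetic code (an assumption), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# Illustrative excerpt: first ten codons plus last four codons of the CDS.
excerpt = "ATGACCCTGCTGGCAGAAGCAGGCATCTAC" + "CGGCTGATGTAG"
translation = translate(excerpt)
codon_count = len(excerpt) // 3
protein_length = len(translation.rstrip("*"))

print(translation)                  # MTLLAEAGIYRLM*
print(codon_count, protein_length)  # 14 13

# The same rule applied to the lengths reported on this page.
cds_nucleotides = 1221
print(cds_nucleotides // 3, cds_nucleotides // 3 - 1)  # 407 406
```

The stop codon is counted as a codon but contributes no amino acid, which is why the page reports 407 codons for a 406-residue protein.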

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are chosen arbitrarily, so that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of the protein-coding potential of DNA was made from 1000 nucleotides upstream of the open reading frame to 300 nucleotides downstream of it. Thus the open reading frame protein-coding potential appears on frame #3.
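
The nucleotide sequence shown on this page appears to use the same window of 1000 nucleotides upstream and 300 nucleotides downstream, since its length equals 1000 + 1308 + 300. The sketch below, under that assumption, converts the genomic exon coordinates of a complement-strand element into 0-based offsets within such a flanked sequence displayed in coding orientation. The function names and the synthetic test sequence are hypothetical.

```python
def exon_offsets(segments, upstream=1000):
    """Map 1-based inclusive genomic segments of a complement-strand element
    (listed in translation order) to 0-based half-open offsets in a flanked
    sequence shown in coding orientation."""
    top = max(end for _, end in segments)  # highest coordinate = first coding base
    return [(upstream + top - end, upstream + top - start + 1)
            for start, end in segments]

def splice(flanked, offsets):
    """Concatenate the exon slices of the flanked sequence."""
    return "".join(flanked[a:b] for a, b in offsets)

segments = [(891654, 892767), (891460, 891566)]
offsets = exon_offsets(segments)
print(offsets)  # [(1000, 2114), (2201, 2308)]

# Synthetic stand-in for the flanked sequence (NOT the real data):
# upstream flank, exon 1, intron in lowercase, exon 2, downstream flank.
flanked = "N" * 1000 + "A" * 1114 + "g" * 87 + "C" * 107 + "N" * 300
spliced = splice(flanked, offsets)

print(len(flanked))  # 2608
print(len(spliced))  # 1221
```

With the real sequence in place of the synthetic one, the same slices would be expected to reproduce the coding sequence, provided the flank sizes are as assumed.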

Sequences
Color   Nucleotide sequence and Coding sequence                 Predicted translation product
RED     start and stop codons                                   Initial methionine and sequence end
BLUE    coding sequence                                         protein sequence
grey    non-coding sequence (upstream, downstream or intron)
grey    donor and acceptor splicing sites