CAGL0G00286g


uniprot|Q8X0Z7 Candida glabrata CAGL0G00286g GAS1 homologue

Genomic environment map

Element type: CDS
Element length: 1680 nucleotides,
on anti-sense strand of
Cagl0G: complement(27239..28918).
Other names:
CAGL-CDS1723.1
CAGL-IPF8924
Coding sequence: 560 codons.
Database cross references:
EMBL: CR380953
GeneID: 2888417
HOGENOM: Q6FTR7

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3R0042 GL3R0042.N2
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0G00286p  


Protein domain map

Protein length: 559 amino acids
Protein family: GL3R0042
Database cross references:
InterPro: IPR000873
InterPro: IPR004886
InterPro: IPR012946
KEGG: cgr:CAGL0G00286g
PROSITE: PS00455
Pfam: PF03198
Pfam: PF07983
RefSeq: XP_446377.1
SMART: SM00768
UniProtKB/TrEMBL: Q6FTR7
UniprotKB: Q6FTR7_CANGA

Computed results for CAGL0G00286p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

None available yet

Sequence data  


Nucleotide sequence    

>CAGL0G00286g.nt
ACTCATACTCTGAACAAAGATGATGACCGGAGGGTAGTTGGAGCACTAAGTGCAGAGAAT
GTAGCGTACACCCTCTGCCATTAATCCTTCCCCCGCTTTTTCATAGCCAGCACATTATAC
CTCTTCGAGCGATAAAACAACAGAAACTGCAAAAATGTTTCCCCATCCGGCACAAATTAT
CCTACTGAGCGGTCCCGCGACCAGTGGTAGCTCAGAAATGGCCGCAATAACCACTAAAAA
ACCTGGCATGCTGATCTGAAATACACAGACAAACACAGACACACTGCCATACCTGTGCAA
ACCCAGAGAAGGCAGTACCAGAGAGCACAATCCGCCCCCAAAAAGAAACAATCTTCAGAG
AAAAGCTTCTGTATCCTCCCCCCACTGCAAGCAACGCACAACTGCGCAATAGAAAGTATA
AAACCTTCGAAAAATAGAAGAAAATCAGAGAGTGCGGGTCCTGCAGAAGTGTCTGGGCAG
CATATCTAGAAAGGGCTTGAAAGGGCTTGAAAGGGCTTCAAAAGTAGCTTCGAAAGAAAA
CCGCTTTGTTCCTCTCCCCCCCATCTCAAGCCTTTGTGTTACGTGCACTACCTTCGTTCT
CCCCCCACCGATAATCCAGCAAATCCACCAAAAACTCAGCTCACATATTTTGGTGTTTTC
GTGTTGGTTAGCATTGCCCTTGCAGCGAAATATTCTATCCAGTGAAAAAAATGAATTAAA
CAATCAAACAATAACACAGCCTACTTTAAAATTCACAGGACGCAATGCGTAACCTTGCTT
TTTGAACAGAACTAATTAAGGCAGGAAAAGGAAAAGATTGAATTTATCTTGACAACAACA
GGATGAGGATATATAAGGACCACATTTGATCCTTTCTTGAAACGTAATCTTAAACTCTAT
GTTTACTTCTAGTGCTGTGTTCTAATAGTTTTTTTTCTTCACTTAGCTGTATCAAACAAC
TCACTGTATCAATCACTATTTTACTATAACTAGATCAATAATGCAATACAGTCTGGTTTC
TTTCATAATAGCTGCTACATTGCTGTTGTCGTCAGTTATGGCTGATGACCTGCCGGCTAT
TGAGATCAAGGGTAACAAGTTCTTTTTCTCCAATAACGGTTCCCAATTCTACATGAAGGG
TATCGCTTACCAAGCTGATACCGCTAACGTCACAGGCGGTGCCACCATCAACGACCCATT
GGCCGACTGGGACACCTGTTCCAGAGATATTCCATACTTGCAACAACTAGCTACTAACGT
CATCCGTGTCTACGCCGTCAACACTTCCTTGGACCACTCCAGATGTATGAACGCTCTGAA
CGATGCCGGTATCTACGTCATTGCCGATTTGTCCTCTCCAAAGGTTTCCGTCAACAGAAA
GTCTCCTTCCTGGGACTTGGAAATCTTCGACCGTTACAAGTCCGTTGTTGACATGTTCGC
TAACTACTCCAACGTTCTAGGTTTCTTCGCAGGTAACGAGGTTACCAACGATGCCACCAA
CACTGACGCTTCCGCTTTCGTTAAGGCCGCCATTAGAGACACCAAGTCCTACATCAAGGA
AAAGGGTTACAGAGGTATCCCAGTTGGTTACTCTTCTAACGATGACGCCGACACCAGAGT
TGACATCGCTGATTACTTCGCTTGTGGTGACGACGCTGAAAGAGCTGACTTCTACGGTAT
TAACATGTACGAATGGTGTGGTAACTCTACTTTCCAAAAGTCTGGTTACGCTGACAGAAC
CAAGGAATTCGCCAACTTGTCCATCCCATTGTTCTTCTCCGAATACGGTTGTAACGAAGT
TCAACCAAGAGAGTTCACTGAAGTTCAAGCACTATACGGCCCTGATATGACTGATGTCTG
GTCCGGTGGTATTGTCTACATGTATTTCCAAGAAGCAAACAACTACGGTTTGGTTAGCAT
CGATGGCTCTAGTGTTAAGACTTTGGAAGATTTCAACTACTACTCTAAAGAAATCCACTC
CATCTCCCCATCCTCAGTAAACTCCAAGACTTACACTCCAACCGCAACCTCTTTGGCTTG
CCCATCTACTAACCAATACTGGAAGGCTGCCACTAACTTGCCACCAACCCCACAACTAGA
TCTATGTGAATGTATGGATGCTGCTAACTCTTGTATTGTTCAAGATGATGTCGATGAAGA
CGACTACCAAGATTTGTTCTCCTACTTGTGTGGTAAGATTGACTGTGGTGGTATTACTGG
TAACGGTACCACCGGTAAGTACGGTTCTTACTCTTTCTGCTCTCCAAAGGAAAAGCTAAA
CTTCGTTCTAAACCTATACTACAACGCCCAAGGTGGTTCCAAGTCTAACTGTGACTTCAG
TGGTTCTGCTACATTGAGAAGTGGAACTACCCAAGCTGGTTGTGCCTCTGCCTTGAAGGA
AATTGGTAGCGTCGGTACTAACTCTGCTACCGATTCTGTTACTTTCTCTGGTGGCTCTAC
TGGTACTTCCAAGGCATCTGCTACCGGCTCTAACTCTTCCAAGTCCGGCTCAAGCAAATC
CGGCTCTTCTACAAGTTCTTCTGCTAAGAGCTCTTCCTCTGGTAAGAGTAACAAGAAGTC
TAACAGCTCTAGCTCCGTCCAAGTTGGTCTATACCAACTTCTTTTCTCAGCTTTCATCAC
ATTAGGTGCAGTCGCCGGTGCTGGTTTCGCTCTTATTTAATTTCAGAATGATTTAAATAT
TTGATGAATCCTAATGTTCATATGATCAACGGGAAGAATATAGAAAAATATTCATTTTAA
TGCATGTATACTAAAGACATTTGTTTCATTTTTTCTCAACTAATTTCATTCATATCCAAT
TAACCAAAATATTTATTTGTAATTTTAGCTATGATCTTTAATGGAATTGTGTTTTTTTTT
GGACGCGATTTATTGTCGCCTAAAGTATTCTAATTTGCTGACATATATTTTCTATAAAAG
CGATGCAGGGGTATACAGTTCTGTGTTGAGGTAATATTAG

Coding sequence    

>CAGL0G00286g.cds
ATGCAATACAGTCTGGTTTCTTTCATAATAGCTGCTACATTGCTGTTGTCGTCAGTTATG
GCTGATGACCTGCCGGCTATTGAGATCAAGGGTAACAAGTTCTTTTTCTCCAATAACGGT
TCCCAATTCTACATGAAGGGTATCGCTTACCAAGCTGATACCGCTAACGTCACAGGCGGT
GCCACCATCAACGACCCATTGGCCGACTGGGACACCTGTTCCAGAGATATTCCATACTTG
CAACAACTAGCTACTAACGTCATCCGTGTCTACGCCGTCAACACTTCCTTGGACCACTCC
AGATGTATGAACGCTCTGAACGATGCCGGTATCTACGTCATTGCCGATTTGTCCTCTCCA
AAGGTTTCCGTCAACAGAAAGTCTCCTTCCTGGGACTTGGAAATCTTCGACCGTTACAAG
TCCGTTGTTGACATGTTCGCTAACTACTCCAACGTTCTAGGTTTCTTCGCAGGTAACGAG
GTTACCAACGATGCCACCAACACTGACGCTTCCGCTTTCGTTAAGGCCGCCATTAGAGAC
ACCAAGTCCTACATCAAGGAAAAGGGTTACAGAGGTATCCCAGTTGGTTACTCTTCTAAC
GATGACGCCGACACCAGAGTTGACATCGCTGATTACTTCGCTTGTGGTGACGACGCTGAA
AGAGCTGACTTCTACGGTATTAACATGTACGAATGGTGTGGTAACTCTACTTTCCAAAAG
TCTGGTTACGCTGACAGAACCAAGGAATTCGCCAACTTGTCCATCCCATTGTTCTTCTCC
GAATACGGTTGTAACGAAGTTCAACCAAGAGAGTTCACTGAAGTTCAAGCACTATACGGC
CCTGATATGACTGATGTCTGGTCCGGTGGTATTGTCTACATGTATTTCCAAGAAGCAAAC
AACTACGGTTTGGTTAGCATCGATGGCTCTAGTGTTAAGACTTTGGAAGATTTCAACTAC
TACTCTAAAGAAATCCACTCCATCTCCCCATCCTCAGTAAACTCCAAGACTTACACTCCA
ACCGCAACCTCTTTGGCTTGCCCATCTACTAACCAATACTGGAAGGCTGCCACTAACTTG
CCACCAACCCCACAACTAGATCTATGTGAATGTATGGATGCTGCTAACTCTTGTATTGTT
CAAGATGATGTCGATGAAGACGACTACCAAGATTTGTTCTCCTACTTGTGTGGTAAGATT
GACTGTGGTGGTATTACTGGTAACGGTACCACCGGTAAGTACGGTTCTTACTCTTTCTGC
TCTCCAAAGGAAAAGCTAAACTTCGTTCTAAACCTATACTACAACGCCCAAGGTGGTTCC
AAGTCTAACTGTGACTTCAGTGGTTCTGCTACATTGAGAAGTGGAACTACCCAAGCTGGT
TGTGCCTCTGCCTTGAAGGAAATTGGTAGCGTCGGTACTAACTCTGCTACCGATTCTGTT
ACTTTCTCTGGTGGCTCTACTGGTACTTCCAAGGCATCTGCTACCGGCTCTAACTCTTCC
AAGTCCGGCTCAAGCAAATCCGGCTCTTCTACAAGTTCTTCTGCTAAGAGCTCTTCCTCT
GGTAAGAGTAACAAGAAGTCTAACAGCTCTAGCTCCGTCCAAGTTGGTCTATACCAACTT
CTTTTCTCAGCTTTCATCACATTAGGTGCAGTCGCCGGTGCTGGTTTCGCTCTTATTTAA


Predicted translation product    

>CAGL0G00286g.aa
MQYSLVSFIIAATLLLSSVMADDLPAIEIKGNKFFFSNNGSQFYMKGIAYQADTANVTGG
ATINDPLADWDTCSRDIPYLQQLATNVIRVYAVNTSLDHSRCMNALNDAGIYVIADLSSP
KVSVNRKSPSWDLEIFDRYKSVVDMFANYSNVLGFFAGNEVTNDATNTDASAFVKAAIRD
TKSYIKEKGYRGIPVGYSSNDDADTRVDIADYFACGDDAERADFYGINMYEWCGNSTFQK
SGYADRTKEFANLSIPLFFSEYGCNEVQPREFTEVQALYGPDMTDVWSGGIVYMYFQEAN
NYGLVSIDGSSVKTLEDFNYYSKEIHSISPSSVNSKTYTPTATSLACPSTNQYWKAATNL
PPTPQLDLCECMDAANSCIVQDDVDEDDYQDLFSYLCGKIDCGGITGNGTTGKYGSYSFC
SPKEKLNFVLNLYYNAQGGSKSNCDFSGSATLRSGTTQAGCASALKEIGSVGTNSATDSV
TFSGGSTGTSKASATGSNSSKSGSSKSGSSTSSSAKSSSSGKSNKKSNSSSSVQVGLYQL
LFSAFITLGAVAGAGFALI*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites