CAGL0M14113g


similar to uniprot|Q05998 Saccharomyces cerevisiae YLR237w THI7 Plasma membrane transporter responsible for the uptake of thiamine, member of the major facilitator superfamily of transporters

Genomic environment map

Element type: CDS
Element length: 1746 nucleotides,
on anti-sense strand of
Cagl0M: complement(1397407..1399152).
Other names:
CAGL-CDS1607.1
CAGL-IPF6031
Coding sequence: 582 codons.
Database cross references:
EMBL: CR380959
GeneID: 2891620
HOGENOM: Q6FII6

Computed results  

None available yet

Homologs and Orthologs

Homologs in protein families: GL3C0733 GL3C0733.N6
Orthologs: strict determination not possible; homologs must be refined manually

Protein CAGL0M14113p  


similar to uniprot|Q05998 Saccharomyces cerevisiae YLR237w THI7 or uniprot|Q08485 Saccharomyces cerevisiae YOR071c or uniprot|Q08579 Saccharomyces cerevisiae YOR192c

Protein domain map

Protein length: 581 amino acids
Protein family: GL3C0733
Database cross references:
InterPro: IPR001248
InterPro: IPR012681
KEGG: cgr:CAGL0M14113g
Pfam: PF02133
RefSeq: XP_449958.1
TIGRFAMs: TIGR00800
UniProtKB/TrEMBL: Q6FII6
UniprotKB: Q6FII6_CANGA

Computed results for CAGL0M14113p  

Blastp Genolevures
Blastp Uniprot

Gene Ontology terms  

GO:0016020 membrane
GO:0015931 nucleobase, nucleoside, nucleotide and nucleic acid transport
GO:0015205 nucleobase transmembrane transporter activity

Sequence data  


Nucleotide sequence    

>CAGL0M14113g.nt
TCTGTACGAAAAGCAATGGATTATCACCTTAGAAGGTGAGGTGCGGACAAAAGAACGTGG
GCAGCAAGTTTTCTCACATCCAAAGGACTTGACTACAAACCGAAGAGGGGTGGGAATGGC
AGCAATCAGCATTACTACGGGTGACTGTGTGTACACAGAAGGTGGGGTGGAAACGTACAA
AAGGGTTTCTTCCAAGGATTGGTCGTTCGTTATCGGAAAAGTATTTCATGTGACCGCTAT
CATGGCCGGAACAATTTGTGCTGTAGTTTGAGTGCGGATTCCCTCTCAGTTAGCTTGGTG
GGCCAGTTCTACATAAAATATTGTGCGACTCACTTAGTGTCAAAATAAAACAGGGCAGCT
CTCTGGTGTTGTACCTTGTAAAGGATGGTGGAAGGTAGATTGCCAAACAATTTGAGTGTT
TAGATAAGAATCTGCTGCCTTCATTTGTTTGACTCTTTTAAGTTGTTGTTCACTGCTGTG
TCATGAATAATCAAATCAAGGAAATGCATAGTTTTTGGAGAGCGACGGAATTTTGATTAA
ATATAGGAAGCTGTGGTGTATTGTTTTTCAGTTTTTTTCTTTGATGTTGGCTTTATATGA
TGAAGAAAAGCTTTATAAATACTATTTGAGCTAGTTAATTCAAGATGACATGACCAGAAA
AAAATGTTTCGACGTCCATGAATTTCATTACAACGGCTTTGCAGTTTTTATCAACCAGGG
AAAAAAAAATATATAAATAACAGATGTTTTTACAAGTAGTTCTGAAATTTAGTTGACAAT
ATTATATAGAAAATTTACTGGAATTTTTGTACTGTCGTGTTTAGATTTATTATTTAATTA
TTTTTGACGCCTAAGTATTTTTTAATTTTTTTTCTATTTTTTTTTTATTTTTAATTAAAT
TTTTATTTAAATTTAATTCATTTACTTCTATATGCAAAATAAATTTATTTTGACTGTTTA
GATTTAATATAAATCTCAACTGAATAAAGACGTACCGATAATGAGAGGATTTAAGTTTCT
CAAGTATTTGGAAGTCCCTGTGGAAGAAAGACAAACTTTAAGTTTCTTAAAAAATCCAGA
CTTGGTACCAATCCCTAAAAGCCACCAAACATGGGGGTTCTGGTCTAACTTTGCCTACTG
GGGTACAATTGCCTTTACAGTAGGTACATGGATGGGCGCTACAGCAGCATTGACAGTTGG
GCTAAGTTATCCAGAGACTATCGCAACATTTATTCTGGGGAATGCTCTAACAATTGTTTA
CTCACTGGCCAACTGTTATCCAGGTTGGGACTGGAAAGTTGGCTATACGCTTTCTCAAAG
ATTTACCTTCGGTATCTATGGATCTGCTTTTGGTGTTATTATTAGAGTTTTATTAAGTAT
TGTGAACTATGGCTCAAACGCGTGGTTAGGAGGGTTATGTATTAACCTGATATTAAATTC
ATGGTCACACCACTATCTAGAATTAAAAAATACTTTATCTCCGCATGTTGCTATGACCAC
AAAAGAATTGATCGGTTTCGTTATTTTCCATATTGTCTGTGCTTTATGCTATCTAATGAA
ACCATATCAAATTAATAGGATATTAATAATTGCTTGTGCCGGGACATGTTTCTCAATGCT
GGGGATAATCATATATCTGTGTCATGCAAATGGGGGTGCTGGTTCATTATTTCACACCCA
AAAAACAACGGTTAGTGGTTCTGATAGGGCTTGGGCATGGGTTTACATGATATCATATTG
GTTTGGCGCTGTTTCACCTGGTGCTGTAAACCAAAGTGACTATTCCAGATTTGGTTCATC
TTTAAAAGGAATTTATCTTGGAACTATTGTTGGTTTAATGATTCCCCCAAACCTAGTTCC
AGCATATGGTGTTATTGGTGCTTCTACTACACAAGAATTGTATGGTGAGTCCATGTGGAT
GCCAACAGAAATCTGTAATTATTGGTTAAATCACGGCTACCATCCTGCAGCTAGAGCAGC
CTCGTTTTTTTGCGGTGTTTTCTTCGCATTATCTCAAATTGCTTATACCATCTCTAATGC
TGGATTCGCAAGCGGTATGGATTTGGCTGGTTTATTACCCAAATACGTCAACATTAAGAG
AGGTGCTTTTTTTACTGCTATTGTATCGGTTGCAGTTCAACCGTGGAACTTTTATAATTC
CTCCTCAACCTTTTTAACAGTAATGAGTTCTTTCGGTGTTGTCATGGTGCCTATACTAGC
TGTTATGATTTGTGATAACTTCATTATCAGGAAACGAAACTATTCAGCATCACAGGCTTT
TATTTTAAAAGGTGAATATTATTTTACAAAGGGTTTCAACTGGAGAGCGTTTATTGCGAT
GATTGTGCCTATGGCACCAGGTCTACCGGGAATCGCATGGCAAGTCAATCATAATGCTTT
TAACAATAGAGGTATAGTCAATTTTTATTATGGTGACTCCTTTTTCGCGTTTGTTATGTC
ATTTTGCTTGTACTGGATTTTGTGTATTATATTCCCCTTCAAAATTAACGTTCTACAGGA
TGACAAAGACTATTATGGTGCCTTCGATGAAAAAACCGCGATTAAAAAGGGTATGGTTCC
TTATAGTGGATTGACAGAGGCTGAAAGACAAGAGTACTTCATTCCAACCGCTAATGAGGT
CATGAGAGAAGAGCAACCAACTGAGTCGGATAGTGAGATATCACGTGCATATGTTGAGGA
AGGAACTGAAAAGGGTAGCACAAAGGACGGAGTTGCAGAAATATGATTTATTTAGCAACT
TGCACTGAAGTTGCGAATTTCAATTACATTTTGTTTTCTTATTTGCTAAAAAGACATTTA
TATATACAGGTAGATTTTTTTTCATCTATCATTGCTAGAGAGTATGTATTGTATACTATG
ATTAACTTTTTTAAATCAATTCATGAAATTAAATGTTTAGTAGATATAACATTATTAGTC
AAACAGTAATTCATAGCACTCTTCGAAAAGCAAAATACCTATATTGTATAATGATCGTAG
ATTAGATAATAATTTGAAATGGTACGGCGGCATAATAGGGAATGGA

Coding sequence    

>CAGL0M14113g.cds
ATGAGAGGATTTAAGTTTCTCAAGTATTTGGAAGTCCCTGTGGAAGAAAGACAAACTTTA
AGTTTCTTAAAAAATCCAGACTTGGTACCAATCCCTAAAAGCCACCAAACATGGGGGTTC
TGGTCTAACTTTGCCTACTGGGGTACAATTGCCTTTACAGTAGGTACATGGATGGGCGCT
ACAGCAGCATTGACAGTTGGGCTAAGTTATCCAGAGACTATCGCAACATTTATTCTGGGG
AATGCTCTAACAATTGTTTACTCACTGGCCAACTGTTATCCAGGTTGGGACTGGAAAGTT
GGCTATACGCTTTCTCAAAGATTTACCTTCGGTATCTATGGATCTGCTTTTGGTGTTATT
ATTAGAGTTTTATTAAGTATTGTGAACTATGGCTCAAACGCGTGGTTAGGAGGGTTATGT
ATTAACCTGATATTAAATTCATGGTCACACCACTATCTAGAATTAAAAAATACTTTATCT
CCGCATGTTGCTATGACCACAAAAGAATTGATCGGTTTCGTTATTTTCCATATTGTCTGT
GCTTTATGCTATCTAATGAAACCATATCAAATTAATAGGATATTAATAATTGCTTGTGCC
GGGACATGTTTCTCAATGCTGGGGATAATCATATATCTGTGTCATGCAAATGGGGGTGCT
GGTTCATTATTTCACACCCAAAAAACAACGGTTAGTGGTTCTGATAGGGCTTGGGCATGG
GTTTACATGATATCATATTGGTTTGGCGCTGTTTCACCTGGTGCTGTAAACCAAAGTGAC
TATTCCAGATTTGGTTCATCTTTAAAAGGAATTTATCTTGGAACTATTGTTGGTTTAATG
ATTCCCCCAAACCTAGTTCCAGCATATGGTGTTATTGGTGCTTCTACTACACAAGAATTG
TATGGTGAGTCCATGTGGATGCCAACAGAAATCTGTAATTATTGGTTAAATCACGGCTAC
CATCCTGCAGCTAGAGCAGCCTCGTTTTTTTGCGGTGTTTTCTTCGCATTATCTCAAATT
GCTTATACCATCTCTAATGCTGGATTCGCAAGCGGTATGGATTTGGCTGGTTTATTACCC
AAATACGTCAACATTAAGAGAGGTGCTTTTTTTACTGCTATTGTATCGGTTGCAGTTCAA
CCGTGGAACTTTTATAATTCCTCCTCAACCTTTTTAACAGTAATGAGTTCTTTCGGTGTT
GTCATGGTGCCTATACTAGCTGTTATGATTTGTGATAACTTCATTATCAGGAAACGAAAC
TATTCAGCATCACAGGCTTTTATTTTAAAAGGTGAATATTATTTTACAAAGGGTTTCAAC
TGGAGAGCGTTTATTGCGATGATTGTGCCTATGGCACCAGGTCTACCGGGAATCGCATGG
CAAGTCAATCATAATGCTTTTAACAATAGAGGTATAGTCAATTTTTATTATGGTGACTCC
TTTTTCGCGTTTGTTATGTCATTTTGCTTGTACTGGATTTTGTGTATTATATTCCCCTTC
AAAATTAACGTTCTACAGGATGACAAAGACTATTATGGTGCCTTCGATGAAAAAACCGCG
ATTAAAAAGGGTATGGTTCCTTATAGTGGATTGACAGAGGCTGAAAGACAAGAGTACTTC
ATTCCAACCGCTAATGAGGTCATGAGAGAAGAGCAACCAACTGAGTCGGATAGTGAGATA
TCACGTGCATATGTTGAGGAAGGAACTGAAAAGGGTAGCACAAAGGACGGAGTTGCAGAA
ATATGA

Predicted translation product    

>CAGL0M14113g.aa
MRGFKFLKYLEVPVEERQTLSFLKNPDLVPIPKSHQTWGFWSNFAYWGTIAFTVGTWMGA
TAALTVGLSYPETIATFILGNALTIVYSLANCYPGWDWKVGYTLSQRFTFGIYGSAFGVI
IRVLLSIVNYGSNAWLGGLCINLILNSWSHHYLELKNTLSPHVAMTTKELIGFVIFHIVC
ALCYLMKPYQINRILIIACAGTCFSMLGIIIYLCHANGGAGSLFHTQKTTVSGSDRAWAW
VYMISYWFGAVSPGAVNQSDYSRFGSSLKGIYLGTIVGLMIPPNLVPAYGVIGASTTQEL
YGESMWMPTEICNYWLNHGYHPAARAASFFCGVFFALSQIAYTISNAGFASGMDLAGLLP
KYVNIKRGAFFTAIVSVAVQPWNFYNSSSTFLTVMSSFGVVMVPILAVMICDNFIIRKRN
YSASQAFILKGEYYFTKGFNWRAFIAMIVPMAPGLPGIAWQVNHNAFNNRGIVNFYYGDS
FFAFVMSFCLYWILCIIFPFKINVLQDDKDYYGAFDEKTAIKKGMVPYSGLTEAERQEYF
IPTANEVMREEQPTESDSEISRAYVEEGTEKGSTKDGVAEI*




Legend and notes  


Lengths
The length, in codons, of coding sequences includes the stop codon, hence it is one unit longer than the protein length.

Genomic environment map
Click on the symbol of an element or a family to go to its corresponding page. Colors in the lane "protein encoding genes" indicate strandedness: shades of blue for direct orientation, shades of red for reverse orientation. Colors in the lane "protein family" are arbitrarly chosen in such a way that different protein families have different colors in the map.

Protein domain map
Domains are extracted from SwissProt files. Click on the symbol of a domain to extract the domain sequence.

Genemark image and list
Genemark computation of protein-coding potential of DNA was made from 1000 nucleotides upstream the open reading frame to 300 nucleotides downstream. Thus the open reading frame protein-coding potential appears on frame #3.

Sequences
ColorNucleotide sequence and Coding sequencePredicted translation product
REDstart and stop codonsInitial methionine and sequence end
BLUEcoding sequenceprotein sequence
greynon-coding sequence (upstream, downstream or intron)
greydonor and acceptor splicing sites