Conserved domains on  [gi|1268004713|ref|WP_098644393|]

transglutaminase domain-containing protein [Bacillus toyonensis]

Protein Classification

CYK3 family protein (domain architecture ID 11474529)

CYK3 family protein

List of domain hits

Name Accession Description Interval E-value
CYK3 COG5279
Cytokinesis protein 3, contains TGc (transglutaminase/protease-like) domain [Cell cycle ...
162-357 7.05e-50

Cytokinesis protein 3, contains TGc (transglutaminase/protease-like) domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444090 [Multi-domain]  Cd Length: 250  Bit Score: 168.27  E-value: 7.05e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713 162 VKEYDKAVESNEYLNHNISHTQYSVRGIPGNYTFTVKITYRESKGQTDYVKAQAK-----SIINSIIKAGMDEHEKVKVI 236
Cdd:COG5279    29 KDTTLTGLGSLYELAVLLFALLEGNSLAAFLKDAISSSTIYALKDEEGLLTEKATiadesKDDDYIITPGMSDYEKVRAI 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713 237 HDYVVKHVSYDTS------FQAYTAYEALANRSAVCQGYALLTYQLLKEAGIETHIVTGTGNGQ-----PHAWNQVKIEG 305
Cdd:COG5279   109 HDWIVDNIEYDYEaynsgkSDSHSAYGALKNGKGVCEGYAKLFKLLCNKAGIECYIVTGYARGSggesgNHAWNAVKIDG 188
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1268004713 306 KWYHLDTTFDDPIPD-VQGRVTYSYYNLSDEQIARNHQ-----WDRNKFAPATTNYAN 357
Cdd:COG5279   189 KWYLVDATWDDGVPDnGGGDVNYDYFLLSDEEFAKDHLpedpkWQLLDYPISKDEFAN 246
PspC_relate_1 super family cl41464
PspC-related protein choline-binding protein 1; Members of this family share C-terminal ...
23-110 6.34e-30

PspC-related protein choline-binding protein 1; Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.


The actual alignment was detected with superfamily member NF033840:

Pssm-ID: 411409 [Multi-domain]  Cd Length: 648  Bit Score: 121.34  E-value: 6.34e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  23 TYTSADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLE 102
Cdd:NF033840  524 TDGSMATGWVQVNGSWYYLNSNGSMATGWVQVNGSWYYLNSNGSMATGWVQVDGSWYYLNDNGSMETGWLQNNGSWYYLN 603

                  ....*...
gi 1268004713 103 SNGVWNPN 110
Cdd:NF033840  604 SNGSMKAN 611
 
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
29-105 6.64e-30

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 121.27  E-value: 6.64e-30
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  486 TGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANG 562
pneumo_PspA NF033930
pneumococcal surface protein A; The pneumococcal surface proteins, found in ...
26-105 9.43e-30

pneumococcal surface protein A; The pneumococcal surface proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names, such that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA, which has a similar C-terminus but a YSIRK-containing region at the N-terminus.


Pssm-ID: 468251 [Multi-domain]  Cd Length: 660  Bit Score: 120.79  E-value: 9.43e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  26 SADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033930  460 SMATGWLQNNGSWYYLNSNGAMATGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANGAMATGWLQYNGSWYYLNANG 539
PspC_relate_1 NF033840
PspC-related protein choline-binding protein 1; Members of this family share C-terminal ...
26-105 1.16e-29

PspC-related protein choline-binding protein 1; Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.


Pssm-ID: 411409 [Multi-domain]  Cd Length: 648  Bit Score: 120.57  E-value: 1.16e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  26 SADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033840  507 APTTGWKQENGMWYFYNTDGSMATGWVQVNGSWYYLNSNGSMATGWVQVNGSWYYLNSNGSMATGWVQVDGSWYYLNDNG 586
pneumo_PspA NF033930
pneumococcal surface protein A; The pneumococcal surface proteins, found in ...
29-105 1.50e-29

pneumococcal surface protein A; The pneumococcal surface proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names, such that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA, which has a similar C-terminus but a YSIRK-containing region at the N-terminus.


Pssm-ID: 468251 [Multi-domain]  Cd Length: 660  Bit Score: 120.40  E-value: 1.50e-29
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033930  443 TGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANG 519
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
26-105 1.31e-28

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 117.42  E-value: 1.31e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  26 SADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  503 SMATGWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANGAMATGWLQYNGSWYYLNANG 582
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
29-105 2.73e-28

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 116.65  E-value: 2.73e-28
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  526 TGWLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANGAMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANG 602
pneumo_PspA NF033930
pneumococcal surface protein A; The pneumococcal surface proteins, found in ...
29-105 4.24e-27

pneumococcal surface protein A; The pneumococcal surface proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names, such that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA, which has a similar C-terminus but a YSIRK-containing region at the N-terminus.


Pssm-ID: 468251 [Multi-domain]  Cd Length: 660  Bit Score: 113.08  E-value: 4.24e-27
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033930  543 TGWLKYNGSWYYLNANGAMATGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASG 619
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
26-105 1.22e-26

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 111.64  E-value: 1.22e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  26 SADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  543 SMATGWLQNNGSWYYLNANGAMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANG 622
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
29-117 2.64e-24

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 104.18  E-value: 2.64e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNGVWN 108
Cdd:COG5263   386 TGWVKVDGKWYYFDSSGAMATGWLKIDGKWYYFDSDGAMATGWQKIGGKWYYFDSNGAMATGWVKVDGKWYYFDSDGAMA 465

                  ....*....
gi 1268004713 109 PNTTSANNN 117
Cdd:COG5263   466 TGWQTIDGK 474
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
29-105 1.08e-23

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 102.78  E-value: 1.08e-23
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  566 TGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGSMATGWVKDGDTWYYLEASG 642
PspC_relate_1 NF033840
PspC-related protein choline-binding protein 1; Members of this family share C-terminal ...
26-112 1.11e-22

PspC-related protein choline-binding protein 1; Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.


Pssm-ID: 411409 [Multi-domain]  Cd Length: 648  Bit Score: 99.77  E-value: 1.11e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  26 SADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTG-WIQDNGKQYYLESN 104
Cdd:NF033840  547 SMATGWVQVNGSWYYLNSNGSMATGWVQVDGSWYYLNDNGSMETGWLQNNGSWYYLNSNGSMKANqWFQVGSKWYYVNAS 626

                  ....*...
gi 1268004713 105 GVWNPNTT 112
Cdd:NF033840  627 GELAVNTS 634
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
48-105 1.59e-21

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 96.24  E-value: 1.59e-21
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1268004713  48 QTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  485 KTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNANG 542
pneumo_PspA NF033930
pneumococcal surface protein A; The pneumococcal surface proteins, found in ...
9-112 6.63e-19

pneumococcal surface protein A; The pneumococcal surface proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names, such that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA, which has a similar C-terminus but a YSIRK-containing region at the N-terminus.


Pssm-ID: 468251 [Multi-domain]  Cd Length: 660  Bit Score: 88.43  E-value: 6.63e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713   9 AMAVG-LTVMGSTVatYTSAD----TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNG 83
Cdd:NF033930  540 AMATGwLKYNGSWY--YLNANgamaTGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEA 617
                          90       100       110
                  ....*....|....*....|....*....|
gi 1268004713  84 SGAMQTG-WIQDNGKQYYLESNGVWNPNTT 112
Cdd:NF033930  618 SGAMKASqWFKVSDKWYYVNGLGALAVNTT 647
Transglut_core pfam01841
Transglutaminase-like superfamily; This family includes animal transglutaminases and other ...
216-311 6.40e-18

Transglutaminase-like superfamily; This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterized transglutaminase, the human blood clotting factor XIIIa'. On the basis of the experimentally demonstrated activity of the Methanobacterium phage pseudomurein endoisopeptidase, it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease.


Pssm-ID: 376628 [Multi-domain]  Cd Length: 108  Bit Score: 78.60  E-value: 6.40e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713 216 KSIINSIIKAGMDEHEKVKVIHDYVVKHVSYD---TSFQAYTAYEALANRSAVCQGYALLTYQLLKEAGIETHIVTGT-- 290
Cdd:pfam01841   1 KALADRITGGATDPLEKARAIYDYVRKNITYDlpgRSPGDGDAEEFLFTGKGDCEDFASLFVALLRALGIPARYVTGYlr 80
                          90       100
                  ....*....|....*....|....*..
gi 1268004713 291 -----GNGQPHAWNQVKIEG-KWYHLD 311
Cdd:pfam01841  81 gpdtvRGGDAHAWVEVYLPGyGWVPVD 107
TGc smart00460
Transglutaminase/protease-like homologues; Transglutaminases are enzymes that establish ...
258-314 4.75e-10

Transglutaminase/protease-like homologues; Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.


Pssm-ID: 214673  Cd Length: 68  Bit Score: 55.08  E-value: 4.75e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1268004713  258 ALANRSAVCQGYALLTYQLLKEAGIETHIVTG-----------TGNGQPHAWNQVKIEGKWYHLDTTF 314
Cdd:smart00460   1 LLKTKYGTCGEFAALFVALLRSLGIPARVVSGylkapdtigglRSIWEAHAWAEVYLEGGWVPVDPTP 68
Choline_bind_3 pfam19127
Choline-binding repeat; Pair of presumed choline-binding repeats often found adjacent to ...
49-87 6.18e-09

Choline-binding repeat; Pair of presumed choline-binding repeats often found adjacent to pfam01473.


Pssm-ID: 465978 [Multi-domain]  Cd Length: 47  Bit Score: 51.39  E-value: 6.18e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1268004713  49 TGWQQVNGAWYYFNGSGAMQTNWQ-QVNGAWYYFNG-SGAM 87
Cdd:pfam19127   2 TGWQTINGQTLYFDSDGKQVKGWVvTIDGKWYYFDAdSGEM 42
glucan_65_rpt TIGR04035
glucan-binding repeat; This model describes a region of about 63 amino acids that is composed ...
40-96 1.59e-03

glucan-binding repeat; This model describes a region of about 63 amino acids that is composed of three repeats of a more broadly distributed family of shorter repeats modeled by pfam01473. While the shorter repeats are often associated with choline binding (and therefore with cell wall binding), the longer repeat described here represents a subgroup of repeat sequences associated with glucan binding, as found in a number of glycosyl hydrolases. Shah, et al. describe a repeat consensus, WYYFDANGKAVTGAQTINGQTLYFDQDGKQVKG, that corresponds to half of the repeat as modeled here and one and a half copies of the repeat as modeled by pfam01473.


Pssm-ID: 274933 [Multi-domain]  Cd Length: 62  Bit Score: 36.34  E-value: 1.59e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1268004713  40 YYNVNGVKQTGWQQVNGAWYYFNGSGamqtnwQQVNGAW--------YYFNGSGAMQT-GWIQDNG 96
Cdd:TIGR04035   2 YFDADGKAVTGAQTIDGVTYYFDENG------KQVKGDFvtngggtyYYDKDSGALVTnRFVTIKD 61
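
Each hit in the list above reports an interval on the query together with a pairwise alignment against the domain model. The following is a minimal Python sketch, not part of the CD-Search output, for sanity-checking an alignment copied from this page: it confirms that the number of residues in a row agrees with the reported interval and counts the columns in which both rows carry the same residue. The function names are hypothetical. The sketch assumes that `-` marks a gap and compares letters case-insensitively, on the assumption that lowercase letters in these alignments only mark residues outside the aligned block.

```python
def ungapped_length(row: str) -> int:
    """Count residues in an alignment row, ignoring '-' gap characters."""
    return sum(1 for ch in row if ch != "-")


def interval_matches(row: str, start: int, end: int) -> bool:
    """Return True if the row holds exactly end - start + 1 residues."""
    return ungapped_length(row) == end - start + 1


def identical_columns(query: str, subject: str) -> int:
    """Count columns where both rows carry the same residue.

    Gap columns are skipped and letters are compared case-insensitively
    (an assumption about how lowercase residues should be treated).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    return sum(
        1
        for q, s in zip(query, subject)
        if q != "-" and s != "-" and q.upper() == s.upper()
    )


# Rows copied from the Choline_bind_3 (pfam19127) hit listed above:
# query interval 49-87, model positions 2-42.
QUERY = "TGWQQVNGAWYYFNGSGAMQTNWQ-QVNGAWYYFNG-SGAM"
SUBJECT = "TGWQTINGQTLYFDSDGKQVKGWVvTIDGKWYYFDAdSGEM"

if __name__ == "__main__":
    print("query interval consistent:", interval_matches(QUERY, 49, 87))
    print("model interval consistent:", interval_matches(SUBJECT, 2, 42))
    print("identical columns:", identical_columns(QUERY, SUBJECT))
```

For a hit whose alignment spans several blocks, the rows of each block would need to be concatenated before calling these functions.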
 
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
29-105 2.73e-28

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 116.65  E-value: 2.73e-28
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  526 TGWLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANGAMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANG 602
pneumo_PspA NF033930
pneumococcal surface protein A; The pneumococcal surface protein proteins, found in ...
29-105 4.24e-27

pneumococcal surface protein A; The pneumococcal surface proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names, such that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA, which has a similar C-terminus but a YSIRK-containing region at the N-terminus.


Pssm-ID: 468251 [Multi-domain]  Cd Length: 660  Bit Score: 113.08  E-value: 4.24e-27
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033930  543 TGWLKYNGSWYYLNANGAMATGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASG 619
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
26-105 1.22e-26

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 111.64  E-value: 1.22e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  26 SADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  543 SMATGWLQNNGSWYYLNANGAMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANG 622
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
29-117 2.64e-24

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 104.18  E-value: 2.64e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNGVWN 108
Cdd:COG5263   386 TGWVKVDGKWYYFDSSGAMATGWLKIDGKWYYFDSDGAMATGWQKIGGKWYYFDSNGAMATGWVKVDGKWYYFDSDGAMA 465

                  ....*....
gi 1268004713 109 PNTTSANNN 117
Cdd:COG5263   466 TGWQTIDGK 474
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
29-105 1.08e-23

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 102.78  E-value: 1.08e-23
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  566 TGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGSMATGWVKDGDTWYYLEASG 642
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
29-106 1.63e-23

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 101.87  E-value: 1.63e-23
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNGV 106
Cdd:COG5263   346 TGWVTDDGKWYYLGSDGAMATGWQKIDGKWYYFDSNGAMATGWVKVDGKWYYFDSSGAMATGWLKIDGKWYYFDSDGA 423
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
26-107 5.04e-23

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 100.33  E-value: 5.04e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  26 SADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:COG5263   363 AMATGWQKIDGKWYYFDSNGAMATGWVKVDGKWYYFDSSGAMATGWLKIDGKWYYFDSDGAMATGWQKIGGKWYYFDSNG 442

                  ..
gi 1268004713 106 VW 107
Cdd:COG5263   443 AM 444
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
29-106 6.69e-23

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 99.94  E-value: 6.69e-23
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNGV 106
Cdd:COG5263   326 TGWQKINGKWYYFDEDGAMATGWVTDDGKWYYLGSDGAMATGWQKIDGKWYYFDSNGAMATGWVKVDGKWYYFDSSGA 403
PspC_relate_1 NF033840
PspC-related protein choline-binding protein 1; Members of this family share C-terminal ...
26-112 1.11e-22

PspC-related protein choline-binding protein 1; Members of this family share C-terminal homology to the choline-binding form of the pneumococcal surface antigen PspC, but not to its allelic LPXTG-anchored forms because they lack the choline-binding repeat region. Members of this family should not be confused with PspC itself, whose identity and function reflect regions N-terminal to the choline-binding region. See Iannelli, et al. (PMID: 11891047) for information about the different allelic forms of PspC.


Pssm-ID: 411409 [Multi-domain]  Cd Length: 648  Bit Score: 99.77  E-value: 1.11e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  26 SADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTG-WIQDNGKQYYLESN 104
Cdd:NF033840  547 SMATGWVQVNGSWYYLNSNGSMATGWVQVDGSWYYLNDNGSMETGWLQNNGSWYYLNSNGSMKANqWFQVGSKWYYVNAS 626

                  ....*...
gi 1268004713 105 GVWNPNTT 112
Cdd:NF033840  627 GELAVNTS 634
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
29-106 3.17e-22

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 98.02  E-value: 3.17e-22
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNGV 106
Cdd:COG5263   306 TGWQTINGKWYYFDSDGAMATGWQKINGKWYYFDEDGAMATGWVTDDGKWYYLGSDGAMATGWQKIDGKWYYFDSNGA 383
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
48-105 1.59e-21

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 96.24  E-value: 1.59e-21
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1268004713  48 QTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGKQYYLESNG 105
Cdd:NF033838  485 KTGWKQENGMWYFYNTDGSMATGWLQNNGSWYYLNANGAMATGWLQNNGSWYYLNANG 542
pneumo_PspA NF033930
pneumococcal surface protein A; The pneumococcal surface protein proteins, found in ...
9-112 6.63e-19

pneumococcal surface protein A; The pneumococcal surface proteins, found in Streptococcus pneumoniae, are repetitive, with patterns of localized high sequence identity across pairs of proteins given different specific names, such that recombination may be presumed. This protein, PspA, has an N-terminal region that lacks a cross-wall-targeting YSIRK type extended signal peptide, in contrast to the closely related choline-binding protein CbpA, which has a similar C-terminus but a YSIRK-containing region at the N-terminus.


Pssm-ID: 468251 [Multi-domain]  Cd Length: 660  Bit Score: 88.43  E-value: 6.63e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713   9 AMAVG-LTVMGSTVatYTSAD----TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNG 83
Cdd:NF033930  540 AMATGwLKYNGSWY--YLNANgamaTGWLQYNGSWYYLNANGAMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEA 617
                          90       100       110
                  ....*....|....*....|....*....|
gi 1268004713  84 SGAMQTG-WIQDNGKQYYLESNGVWNPNTT 112
Cdd:NF033930  618 SGAMKASqWFKVSDKWYYVNGLGALAVNTT 647
Transglut_core pfam01841
Transglutaminase-like superfamily; This family includes animal transglutaminases and other ...
216-311 6.40e-18

Transglutaminase-like superfamily; This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterized transglutaminase, the human blood clotting factor XIIIa'. On the basis of the experimentally demonstrated activity of the Methanobacterium phage pseudomurein endoisopeptidase, it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease.


Pssm-ID: 376628 [Multi-domain]  Cd Length: 108  Bit Score: 78.60  E-value: 6.40e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713 216 KSIINSIIKAGMDEHEKVKVIHDYVVKHVSYD---TSFQAYTAYEALANRSAVCQGYALLTYQLLKEAGIETHIVTGT-- 290
Cdd:pfam01841   1 KALADRITGGATDPLEKARAIYDYVRKNITYDlpgRSPGDGDAEEFLFTGKGDCEDFASLFVALLRALGIPARYVTGYlr 80
                          90       100
                  ....*....|....*....|....*..
gi 1268004713 291 -----GNGQPHAWNQVKIEG-KWYHLD 311
Cdd:pfam01841  81 gpdtvRGGDAHAWVEVYLPGyGWVPVD 107
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
15-106 9.89e-18

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 84.54  E-value: 9.89e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  15 TVMGSTVATYTSADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQD 94
Cdd:COG5263   272 YDDAGAAGVDGTGTTGTVGWVDGKWYYFDAGKMVTGWQTINGKWYYFDSDGAMATGWQKINGKWYYFDEDGAMATGWVTD 351
                          90
                  ....*....|..
gi 1268004713  95 NGKQYYLESNGV 106
Cdd:COG5263   352 DGKWYYLGSDGA 363
YebA COG1305
Transglutaminase-like enzyme, putative cysteine protease [Posttranslational modification, ...
186-313 3.19e-17

Transglutaminase-like enzyme, putative cysteine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 440916 [Multi-domain]  Cd Length: 174  Bit Score: 78.51  E-value: 3.19e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713 186 VRGIPGNYTFTVKITYRESKGQTDYVKAQAKSIInsiiKAGMDEHEKVKVIHDYVVKHVSYDTSF--QAYTAYEALANRS 263
Cdd:COG1305    38 SVVPGGGTELLAGPGELLSASYDPELRALAAELT----GGATTPYEKARALYDWVRDNIRYDPGStgVGTTALETLERRR 113
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1268004713 264 AVCQGYALLTYQLLKEAGIETHIVTG----------TGNGQPHAWNQVKIEGK-WYHLDTT 313
Cdd:COG1305   114 GVCRDFAHLLVALLRALGIPARYVSGylpgepppggGRADDAHAWVEVYLPGAgWVPFDPT 174
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
29-89 2.96e-14

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 73.75  E-value: 2.96e-14
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNGSGAMQTNWQQVNGAWYYFNGSGAMQT 89
Cdd:COG5263   426 TGWQKIGGKWYYFDSNGAMATGWVKVDGKWYYFDSDGAMATGWQTIDGKTYYFDSNGAWVG 486
TGc smart00460
Transglutaminase/protease-like homologues; Transglutaminases are enzymes that establish ...
258-314 4.75e-10

Transglutaminase/protease-like homologues; Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.


Pssm-ID: 214673  Cd Length: 68  Bit Score: 55.08  E-value: 4.75e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1268004713  258 ALANRSAVCQGYALLTYQLLKEAGIETHIVTG-----------TGNGQPHAWNQVKIEGKWYHLDTTF 314
Cdd:smart00460   1 LLKTKYGTCGEFAALFVALLRSLGIPARVVSGylkapdtigglRSIWEAHAWAEVYLEGGWVPVDPTP 68
COG5263 COG5263
Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];
18-106 6.82e-10

Glucan-binding domain (YG repeat) [Carbohydrate transport and metabolism];


Pssm-ID: 444077 [Multi-domain]  Cd Length: 486  Bit Score: 60.27  E-value: 6.82e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1268004713  18 GSTVATYTSADTGWKQTGSVWNYYNVNGVKQTGWQQVNGAWYYFNgSGAMQTNWQQVNGAWYYFNGSGAMQTGWIQDNGK 97
Cdd:COG5263   256 ESGGENNQSLAGNGTSYDDAGAAGVDGTGTTGTVGWVDGKWYYFD-AGKMVTGWQTINGKWYYFDSDGAMATGWQKINGK 334

                  ....*....
gi 1268004713  98 QYYLESNGV 106
Cdd:COG5263   335 WYYFDEDGA 343
Choline_bind_3 pfam19127
Choline-binding repeat; Pair of presumed choline-binding repeats often found adjacent to ...
49-87 6.18e-09

Choline-binding repeat; Pair of presumed choline-binding repeats often found adjacent to pfam01473.


Pssm-ID: 465978 [Multi-domain]  Cd Length: 47  Bit Score: 51.39  E-value: 6.18e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1268004713  49 TGWQQVNGAWYYFNGSGAMQTNWQ-QVNGAWYYFNG-SGAM 87
Cdd:pfam19127   2 TGWQTINGQTLYFDSDGKQVKGWVvTIDGKWYYFDAdSGEM 42
Choline_bind_3 pfam19127
Choline-binding repeat; Pair of presumed choline-binding repeats often found adjacent to ...
29-70 4.20e-06

Choline-binding repeat; Pair of presumed choline-binding repeats often found adjacent to pfam01473.


Pssm-ID: 465978 [Multi-domain]  Cd Length: 47  Bit Score: 43.30  E-value: 4.20e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 1268004713  29 TGWKQTGSVWNYYNVNGVKQTGWQ-QVNGAWYYFNG-SGAMQTN 70
Cdd:pfam19127   2 TGWQTINGQTLYFDSDGKQVKGWVvTIDGKWYYFDAdSGEMVTN 45
Choline_bind_3 pfam19127
Choline-binding repeat; Pair of presumed choline-binding repeats often found adjacent to ...
69-101 1.42e-05

Choline-binding repeat; Pair of presumed choline-binding repeats often found adjacent to pfam01473.


Pssm-ID: 465978 [Multi-domain]  Cd Length: 47  Bit Score: 41.76  E-value: 1.42e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1268004713  69 TNWQQVNGAWYYFNGSGAMQTGW-IQDNGKQYYL 101
Cdd:pfam19127   2 TGWQTINGQTLYFDSDGKQVKGWvVTIDGKWYYF 35
Choline_bind_1 pfam01473
Putative cell wall binding repeat; These repeats are characterized by conserved aromatic ...
49-67 1.81e-04

Putative cell wall binding repeat; These repeats are characterized by conserved aromatic residues and glycines and are found in multiple tandem copies in a number of proteins. The CW repeat is 20 amino acid residues long. The exact domain boundaries may not be correct. It has been suggested that these repeats in Swiss:P15057 might be responsible for the specific recognition of choline-containing cell walls. Similar but longer repeats are found in the glucosyltransferases and glucan-binding proteins of oral streptococci, where they have been shown to be involved in glucan binding, as well as in the related dextransucrases of Leuconostoc mesenteroides. Repeats also occur in toxins of Clostridium difficile and other clostridia, though the ligands are not always known.


Pssm-ID: 366661 [Multi-domain]  Cd Length: 19  Bit Score: 38.14  E-value: 1.81e-04
                          10
                  ....*....|....*....
gi 1268004713  49 TGWQQVNGAWYYFNGSGAM 67
Cdd:pfam01473   1 TGWVKINGNWYYFDSNGVM 19
glucan_65_rpt TIGR04035
glucan-binding repeat; This model describes a region of about 63 amino acids that is composed ...
40-96 1.59e-03

glucan-binding repeat; This model describes a region of about 63 amino acids that is composed of three repeats of a more broadly distributed family of shorter repeats modeled by pfam01473. While the shorter repeats are often associated with choline binding (and therefore with cell wall binding), the longer repeat described here represents a subgroup of repeat sequences associated with glucan binding, as found in a number of glycosylhydrolases. Shah, et al. describe a repeat consensus, WYYFDANGKAVTGAQTINGQTLYFDQDGKQVKG, that corresponds to half of the repeat as modeled here and one and a half copies of the repeat as modeled by pfam01473.


Pssm-ID: 274933 [Multi-domain]  Cd Length: 62  Bit Score: 36.34  E-value: 1.59e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1268004713  40 YYNVNGVKQTGWQQVNGAWYYFNGSGamqtnwQQVNGAW--------YYFNGSGAMQT-GWIQDNG 96
Cdd:TIGR04035   2 YFDADGKAVTGAQTIDGVTYYFDENG------KQVKGDFvtngggtyYYDKDSGALVTnRFVTIKD 61
glucan_65_rpt TIGR04035
glucan-binding repeat; This model describes a region of about 63 amino acids that is composed ...
59-104 1.81e-03

glucan-binding repeat; This model describes a region of about 63 amino acids that is composed of three repeats of a more broadly distributed family of shorter repeats modeled by pfam01473. While the shorter repeats are often associated with choline binding (and therefore with cell wall binding), the longer repeat described here represents a subgroup of repeat sequences associated with glucan binding, as found in a number of glycosylhydrolases. Shah, et al. describe a repeat consensus, WYYFDANGKAVTGAQTINGQTLYFDQDGKQVKG, that corresponds to half of the repeat as modeled here and one and a half copies of the repeat as modeled by pfam01473.


Pssm-ID: 274933 [Multi-domain]  Cd Length: 62  Bit Score: 36.34  E-value: 1.81e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1268004713  59 YYFNGSGAMQTNWQQVNGAWYYFNGSGAMQTG-WIQDNGKQYYLESN 104
Cdd:TIGR04035   1 YYFDADGKAVTGAQTIDGVTYYFDENGKQVKGdFVTNGGGTYYYDKD 47
Choline_bind_1 pfam01473
Putative cell wall binding repeat; These repeats are characterized by conserved aromatic ...
69-87 2.94e-03

Putative cell wall binding repeat; These repeats are characterized by conserved aromatic residues and glycines and are found in multiple tandem copies in a number of proteins. The CW repeat is 20 amino acid residues long. The exact domain boundaries may not be correct. It has been suggested that these repeats in Swiss:P15057 might be responsible for the specific recognition of choline-containing cell walls. Similar but longer repeats are found in the glucosyltransferases and glucan-binding proteins of oral streptococci, where they have been shown to be involved in glucan binding, as well as in the related dextransucrases of Leuconostoc mesenteroides. Repeats also occur in toxins of Clostridium difficile and other clostridia, though the ligands are not always known.


Pssm-ID: 366661 [Multi-domain]  Cd Length: 19  Bit Score: 34.67  E-value: 2.94e-03
                          10
                  ....*....|....*....
gi 1268004713  69 TNWQQVNGAWYYFNGSGAM 87
Cdd:pfam01473   1 TGWVKINGNWYYFDSNGVM 19
DUF553 pfam04473
Transglutaminase-like domain; This family of uncharacterized archaeal proteins are related to ...
253-311 3.10e-03

Transglutaminase-like domain; This family of uncharacterized archaeal proteins is related to Transglutaminase-like domains. This family has previously been called DUF553 and UPF0252.


Pssm-ID: 282345  Cd Length: 140  Bit Score: 37.82  E-value: 3.10e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1268004713 253 YTAYEALANRSAVCQGYALLTYQLLKEAGIETH---IVTGTGNGQPHAWNQVKIEGKWYHLD 311
Cdd:pfam04473  51 QTPSETIKTRKGVCSDYAILTAALLLDLNVSPVylvDVTFENSITGHAAAAVKINGTLFVLD 112
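Each hit above pairs a row of query residues with a row from the domain model, so a rough percent identity can be read straight off the two strings. The sketch below is illustrative only and is not how CD-Search scores hits (bit scores come from position-specific scoring matrices). The function name `percent_identity` is hypothetical, and the code assumes that `-` marks a gap and that lowercase letters may be compared case-insensitively. It uses the pfam01473 hit at query residues 49-67 as input.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over aligned columns, skipping any column with a gap ('-').

    Assumption: both rows come from the same alignment block and have equal
    length; lowercase residues are compared case-insensitively.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Keep only columns where neither row has a gap.
    pairs = [
        (q.upper(), s.upper())
        for q, s in zip(query, subject)
        if q != "-" and s != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(q == s for q, s in pairs)
    return 100.0 * matches / len(pairs)


# Rows copied from the pfam01473 hit covering query residues 49-67.
query_row = "TGWQQVNGAWYYFNGSGAM"
model_row = "TGWVKINGNWYYFDSNGVM"
identity = percent_identity(query_row, model_row)
print(f"{identity:.1f}% identity over {len(query_row)} columns")
```

For longer hits that span several alignment blocks, the rows from each block would need to be concatenated before calling the function.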
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
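The E-value threshold listed here is the cutoff that decides which hits are reported. As a minimal sketch of applying such a cutoff to text copied from this page, the snippet below parses the "Pssm-ID ... E-value" summary lines and keeps hits at or below a chosen threshold. The regular expression and the names `parse_summary` and `filter_hits` are assumptions made for illustration; they are not part of any NCBI tool or API.

```python
import re

# Pattern for the per-hit summary line, e.g.
# "Pssm-ID: 411409 [Multi-domain]  Cd Length: 648  Bit Score: 120.57  E-value: 1.16e-29"
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str):
    """Return a dict of fields from one summary line, or None if it does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def filter_hits(lines, threshold=0.01):
    """Parse summary lines and keep hits whose E-value is at or below the threshold."""
    hits = (parse_summary(line) for line in lines)
    return [h for h in hits if h is not None and h["e_value"] <= threshold]


# Two summary lines copied from the hit list above.
lines = [
    "Pssm-ID: 411409 [Multi-domain]  Cd Length: 648  Bit Score: 120.57  E-value: 1.16e-29",
    "Pssm-ID: 282345  Cd Length: 140  Bit Score: 37.82  E-value: 3.10e-03",
]
for hit in filter_hits(lines, threshold=0.01):
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Both sample lines pass the page's 0.01 threshold; a stricter value such as `threshold=1e-10` would keep only the first.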

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.