Conserved domains on  [gi|2793962408|ref|WP_372123004|]
retention module-containing protein, partial [Vibrio sp. 10N.222.54.C2]

List of domain hits

Name Accession Description Interval E-value
retention_LapA  NF033682  retention module-containing protein  7-156  3.66e-47

retention module-containing protein; The retention module, as described for the giant adhesin LapA of Pseudomonas fluorescens and for an ice-binding giant adhesin of an Antarctic bacterium, appears at the N-terminus of a number of very large repetitive proteins, many of which have C-terminal regions that make them substrates for type I secretion systems.


Pssm-ID: 468140  Cd Length: 145  Bit Score: 165.11  E-value: 3.66e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408    7 RQAAVVEAVSGEVIAVKSDGSARKISVGDIIRENEIVITANQAELLLGNPNGVT-EVASNCVGCVDQDlvwadapiAGEV 85
Cdd:NF033682     1 TQVAVVKAVSGTVFAVNADGSVRVLKVGDTLQAGEIVITGNGAAVELQLADGSTlTLGENCVACVTED--------NGLI 72
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2793962408   86 NFDLQQADAGDFDDDEFAAIQEAILGGADPTQILEATAAG--GGLGSANAGFVTIDYNYTETHPSTFFETAGL 156
Cdd:NF033682    73 EFDAEEAAAASFDDPDIAAIQAAILAGADPTELLEATAAGlaGGAGGAGGGFVTIDRNGDEVLPSTGFPTAGF 145
T1SS_rpt_143 super family  cl42883  T1SS-143 repeat domain  661-799  2.77e-28

T1SS-143 repeat domain; This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion. [Cellular processes, Pathogenesis]


The actual alignment was detected with superfamily member TIGR03660:

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 110.84  E-value: 2.77e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  661 GLFTVDEGADGVAKYELVDEDLVLTGLTSDGESLEWQAVSQNGTTFTYVAqTATSNeAVFEIIFDTsDNSYQFELFKPIK 740
Cdd:TIGR03660    1 GQFTVTQGADGVVSYQLDDSTNPVAGLTSGGQAVTLSETSNADGNFTYTA-TAGGN-PVFTLTLNA-DGSYEFTLEGPLD 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2793962408  741 HPDGSgeNTIDLNFSVVAEDSDQDKSNAVdLQITVTDDVPTITDMTAAStfvVDEDDLS 799
Cdd:TIGR03660   78 HAAGS--DELTLNFPIIATDFDGDTSSIT-LPVTIVDDVPTITDVDALT---VDEDDLP 130
T1SS_rpt_143 super family  cl42883  T1SS-143 repeat domain  807-948  4.53e-26

The actual alignment was detected with superfamily member TIGR03660:

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 104.67  E-value: 4.53e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  807 GAFVTTEGADKVEVYELRNVSALEATLTSGTEAISITEiTGAANTTTYQGTTTSGTPVFTLALANDGSYTFTLLGPLNHP 886
Cdd:TIGR03660    1 GQFTVTQGADGVVSYQLDDSTNPVAGLTSGGQAVTLSE-TSNADGNFTYTATAGGNPVFTLTLNADGSYEFTLEGPLDHA 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2793962408  887 TspNSNTLTIPFDVVAVDGDGDDSNQyVLPIEVLDDAPIMSAPTGETvVDEDDLTGVGSDQS 948
Cdd:TIGR03660   80 A--GSDELTLNFPIIATDFDGDTSSI-TLPVTIVDDVPTITDVDALT-VDEDDLPGGSDGSK 137
T1SS_rpt_143 super family  cl42883  T1SS-143 repeat domain  955-1058  1.00e-19

The actual alignment was detected with superfamily member TIGR03660:

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 86.57  E-value: 1.00e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  955 GLFTVDEGADGVVKYELVDEDLVLTGLTSDGESLEWQAVSQNGTTFTYVAqtTASNEAVFEIIFDTsDNSYQFELFKPLK 1034
Cdd:TIGR03660    1 GQFTVTQGADGVVSYQLDDSTNPVAGLTSGGQAVTLSETSNADGNFTYTA--TAGGNPVFTLTLNA-DGSYEFTLEGPLD 77
                           90       100
                   ....*....|....*....|....
gi 2793962408 1035 HldGAGENAINLNFSVVAEDFDQD 1058
Cdd:TIGR03660   78 H--AAGSDELTLNFPIIATDFDGD 99
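The paired rows in these alignments can be compared column by column. The following is a minimal sketch, not part of the CD-Search output, that counts identical residues over aligned columns for the last row pair above (query residues 1035-1058). It assumes that `-` marks a gap and that lowercase query letters mark residues left unaligned to the model; the function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query: str, model: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for one alignment row pair."""
    if len(query) != len(model):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query, model):
        # Assumption: gap columns and lowercase residues are not aligned
        # to the model, so they are skipped rather than counted as mismatches.
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Row pair copied from the alignment above (query 1035-1058, model 78-99).
query_row = "HldGAGENAINLNFSVVAEDFDQD"
model_row = "H--AAGSDELTLNFPIIATDFDGD"

identical, aligned = aligned_identity(query_row, model_row)
print(f"{identical} of {aligned} aligned columns are identical")
```

The same function can be applied to each row pair of a hit and the counts summed to estimate identity over the whole alignment.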
T1SS_rpt_143 super family  cl42883  T1SS-143 repeat domain  362-487  8.77e-16

The actual alignment was detected with superfamily member TIGR03660:

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 75.01  E-value: 8.77e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  362 GSDNLQSAVFDAGAlDQFDGLLSDNQN-TLARLSDDGTTITLSIQGRGEVVLTISLDTDGTYKFEQFNPIEQ-VGTDSLT 439
Cdd:TIGR03660    8 GADGVVSYQLDDST-NPVAGLTSGGQAvTLSETSNADGNFTYTATAGGNPVFTLTLNADGSYEFTLEGPLDHaAGSDELT 86
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2793962408  440 FDLPITITDFDQDVVTNNINIVITDgDSPVITNVDSISVDEAGIIGGS 487
Cdd:TIGR03660   87 LNFPIIATDFDGDTSSITLPVTIVD-DVPTITDVDALTVDEDDLPGGS 133
T1SS_rpt_143 super family  cl42883  T1SS-143 repeat domain  507-654  4.38e-14

The actual alignment was detected with superfamily member TIGR03660:

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 70.39  E-value: 4.38e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  507 SDIIDHYELEPTeFNTGGTLVSNGEAVLLELIGEIGGVRTYEGYveVNGTRITVFDVKIDspslGNYEFNLYEELSHQGA 586
Cdd:TIGR03660    9 ADGVVSYQLDDS-TNPVAGLTSGGQAVTLSETSNADGNFTYTAT--AGGNPVFTLTLNAD----GSYEFTLEGPLDHAAG 81
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2793962408  587 EDAlLTFALPIYAVDADGDRSalsggsntpeAAEILVNVKDDVPVMTAPTGETvVDEDDLTGIGSDQS 654
Cdd:TIGR03660   82 SDE-LTLNFPIIATDFDGDTS----------SITLPVTIVDDVPTITDVDALT-VDEDDLPGGSDGSK 137
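The five TIGR03660 hits above do not all span the full 137-position model. As a rough illustration, the sketch below computes the fraction of the model covered by each hit from the first and last model positions shown in the `Cdd:TIGR03660` rows. The helper name `model_coverage` is hypothetical, and the coordinates were read by hand from the alignments in this listing.

```python
def model_coverage(model_from: int, model_to: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by a hit (1-based, inclusive)."""
    if not (1 <= model_from <= model_to <= cd_length):
        raise ValueError("expected 1 <= model_from <= model_to <= cd_length")
    return (model_to - model_from + 1) / cd_length


CD_LENGTH = 137  # "Cd Length" reported for Pssm-ID 132699

# Query interval -> (first, last) model position shown in the alignment rows.
model_spans = {
    "661-799": (1, 130),
    "807-948": (1, 137),
    "955-1058": (1, 99),
    "362-487": (8, 133),
    "507-654": (9, 137),
}

coverage = {
    interval: model_coverage(start, end, CD_LENGTH)
    for interval, (start, end) in model_spans.items()
}

for interval, fraction in coverage.items():
    print(f"{interval}: {fraction:.0%} of the model")
```

By this measure the hit at 807-948 spans the whole model, while the hit at 955-1058 stops at model position 99, which is consistent with the query record being labeled partial.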
 
DUF5801  pfam19116  Domain of unknown function (DUF5801)  623-777  1.35e-09

Domain of unknown function (DUF5801); This entry contains a presumed domain that is found as tandem repeats in a number of bacterial proteins.


Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 57.64  E-value: 1.35e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  623 VNVKDDVPVMTAPTGET---VVDEDDLTGIGSDQSEDTIINGLFTVDEGADGVA----KYELVDEDLVLTGL--TSDGES 693
Cdd:pfam19116    1 ISFEDDGPSITASAGEAptlTVDETALGTGGGLADATASFAGLFTSDFGADGAGstgsTYSLSLSAGAASGLtdTATGQA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  694 LEwqaVSQNGTTFtyVAQTATSNEAVFEIIFDTSDNSYQFELFKPIKHPDGSGENtidlnfsvvaeDSDQDKSNAVDLQI 773
Cdd:pfam19116   81 IL---LFLEGGVV--VGRTAGGGDVVFTVSVDAATGEVTLTQYRAVVHPDTSDPD-----------DSVSLAAGLITLTA 144

                   ....
gi 2793962408  774 TVTD 777
Cdd:pfam19116  145 TVTD 148
DUF5801  pfam19116  Domain of unknown function (DUF5801)  917-1058  2.25e-06

Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 48.39  E-value: 2.25e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  917 IEVLDDAPIMSAPTGET---VVDEDDLTGVGSDQSEDSIINGLFTVDEGADG----VVKYELVDEDLVLTGL--TSDGES 987
Cdd:pfam19116    1 ISFEDDGPSITASAGEAptlTVDETALGTGGGLADATASFAGLFTSDFGADGagstGSTYSLSLSAGAASGLtdTATGQA 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2793962408  988 LEwqaVSQNGTTFtyVAQTTASNEAVFEIIFDTSDNSYQFELFKPLKHLDGAGEN------AINLNFSVVAEDFDQD 1058
Cdd:pfam19116   81 IL---LFLEGGVV--VGRTAGGGDVVFTVSVDAATGEVTLTQYRAVVHPDTSDPDdsvslaAGLITLTATVTDGDGD 152
FhaB  COG3210  Large exoprotein involved in heme utilization or adhesion  174-1010  5.14e-04

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 44.37  E-value: 5.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  174 SSGGQSISEVLTEGSISGNTYPQSITTTETIIAGSLALSPDSFVPEALSLASLLTELNSDITSSGQPVTFTYDAATNSII 253
Cdd:COG3210    806 AAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVA 885
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  254 GVQGTDEVLRIDIDVVSVGNNAELSLTTTISQPIDHVTSVGGGQVSYTGDQINITFDIQGEDTAGNPLATPINAQVAVVD 333
Cdd:COG3210    886 TSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGD 965
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  334 GGDPSAESVNISNVETSSAAIEGTFSNIGSDNLQSAVFDAGALDQFDGLLSDNQNTLARL---SDDGTTITLSIQGRGEV 410
Cdd:COG3210    966 TGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTtgtASATGTGTAATAGGQNG 1045
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  411 VLTISLDTDGTYKFEQFNPIEQVGTDSLTFDLPITITDFDQDVVTNNINIVITDGDSPVITNVDSISVDEAGIIGGSQEG 490
Cdd:COG3210   1046 VGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTT 1125
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  491 TAPVAGSGSITADIFESDIIDHYELEPTEFNTGGTLVSNGEAVLLELIGEIGGVRTYEGYVEVNGTRITVFDVKIDSPSL 570
Cdd:COG3210   1126 TVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTG 1205
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  571 GNYEFNLYEELSHQGAEDALLTFALPIYAVDADGDRSALSGGSNTPEAAEILVNVKDDVPVMTAPTGETVVDEDDLTGIG 650
Cdd:COG3210   1206 GSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGST 1285
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  651 SDQSEDTIINGLFTVDEGADGVAKYELVDEDLVLTGLTSDGESLEWQAVSQNGTTFTYVAQTATSNEAVFEIIFDTSDNS 730
Cdd:COG3210   1286 VDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGA 1365
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  731 YQFELFKPIKHPDGSGENTIDLNFSVVAEDSDQDKSNAVDLQITVTDDVPTITDMTAASTFVVDEDDLSSVIAQATGAFV 810
Cdd:COG3210   1366 GSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSV 1445
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  811 TTEGAdkveVYELRNVSALEATLTSGTEAISITEITGAANTTTYQGTTTSGTPVFTLALANDGSYTFTLLGPLNHPTSPN 890
Cdd:COG3210   1446 AGAGG----GNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTAEVAK 1521
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  891 SNTLTIPFDVVAVDGDGDDSNQYVLPIEVLDDAPIMSAPTGETVVDEDDLTGVGSDQSEDSIINGLFTVDEGADGVVKYE 970
Cdd:COG3210   1522 ASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATLTLSL 1601
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|
gi 2793962408  971 LVDEDLVLTGLTSDGESLEWQAVSQNGTTFTYVAQTTASN 1010
Cdd:COG3210   1602 AEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEG 1641
DUF5801  pfam19116  Domain of unknown function (DUF5801)  773-908  5.71e-03

Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 38.38  E-value: 5.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  773 ITVTDDVPTIT-DMTAASTFVVDEDDLSSVIAQAT------GAFVTTEGAD----KVEVYELRNVSALEATLTSGTEAIS 841
Cdd:pfam19116    1 ISFEDDGPSITaSAGEAPTLTVDETALGTGGGLADatasfaGLFTSDFGADgagsTGSTYSLSLSAGAASGLTDTATGQA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  842 IT------EITGAANTTTYQgtttsgtpVFTLAL-ANDGSYTFTLLGPLNHPTSPNSN-TLTIPFDVVAV-----DGDGD 908
Cdd:pfam19116   81 ILlfleggVVVGRTAGGGDV--------VFTVSVdAATGEVTLTQYRAVVHPDTSDPDdSVSLAAGLITLtatvtDGDGD 152
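Each hit in this listing carries a summary line of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`. The sketch below shows one way such lines might be parsed and filtered by E-value. The regular expression, the function name `parse_summary`, and the cutoff of 1e-5 are illustrative assumptions, not something the page defines.

```python
import re

# Pattern for the per-hit summary line; "[Multi-domain]" is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one summary line into typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Summary lines copied from the listing above.
lines = [
    "Pssm-ID: 468140  Cd Length: 145  Bit Score: 165.11  E-value: 3.66e-47",
    "Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 57.64  E-value: 1.35e-09",
    "Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 44.37  E-value: 5.14e-04",
    "Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 38.38  E-value: 5.71e-03",
]

hits = [parse_summary(line) for line in lines]

E_VALUE_CUTOFF = 1e-5  # hypothetical cutoff chosen for illustration only
strong_hits = [hit for hit in hits if hit["e_value"] <= E_VALUE_CUTOFF]

for hit in strong_hits:
    print(hit["pssm_id"], hit["e_value"])
```

With this cutoff the COG3210 hit and the weakest pfam19116 hit are dropped. A stricter or looser cutoff is a judgment call that depends on how the results will be used.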
 
Name Accession Description Interval E-value
retention_LapA NF033682
retention module-containing protein; The retention module, as described for the giant adhesin ...
7-156 3.66e-47

retention module-containing protein; The retention module, as described for the giant adhesin LapA of Pseudomonas fluorescens and for an ice-binding giant adhesin of an Antarctic bacterium, appears at the N-terminus of a number of very large repetitive proteins, many of which have C-terminal regions that make them substrates for type I secretion systems.


Pssm-ID: 468140  Cd Length: 145  Bit Score: 165.11  E-value: 3.66e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408    7 RQAAVVEAVSGEVIAVKSDGSARKISVGDIIRENEIVITANQAELLLGNPNGVT-EVASNCVGCVDQDlvwadapiAGEV 85
Cdd:NF033682     1 TQVAVVKAVSGTVFAVNADGSVRVLKVGDTLQAGEIVITGNGAAVELQLADGSTlTLGENCVACVTED--------NGLI 72
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2793962408   86 NFDLQQADAGDFDDDEFAAIQEAILGGADPTQILEATAAG--GGLGSANAGFVTIDYNYTETHPSTFFETAGL 156
Cdd:NF033682    73 EFDAEEAAAASFDDPDIAAIQAAILAGADPTELLEATAAGlaGGAGGAGGGFVTIDRNGDEVLPSTGFPTAGF 145
T1SS_rpt_143 TIGR03660
T1SS-143 repeat domain; This model represents a domain of about 143 amino acids that may occur ...
661-799 2.77e-28

T1SS-143 repeat domain; This model represents a domain of about 143 amino acids that may occur singly or in up to 23 tandem repeats in very large proteins in the genus Vibrio, and in related species such as Legionella pneumophila, Photobacterium profundum, Rhodopseudomonas palustris, Shewanella pealeana, and Aeromonas hydrophila. Proteins with these domains represent a subset of a broader set of proteins with a particular signal for type 1 secretion, consisting of several glycine-rich repeats modeled by pfam00353, followed by a C-terminal domain modeled by TIGR03661. Proteins with this domain tend to share several properties with the RtxA (Repeats in Toxin) protein of Vibrio cholerae, including a large size often containing tandemly repeated domains and a C-terminal signal for type 1 secretion. [Cellular processes, Pathogenesis]


Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 110.84  E-value: 2.77e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  661 GLFTVDEGADGVAKYELVDEDLVLTGLTSDGESLEWQAVSQNGTTFTYVAqTATSNeAVFEIIFDTsDNSYQFELFKPIK 740
Cdd:TIGR03660    1 GQFTVTQGADGVVSYQLDDSTNPVAGLTSGGQAVTLSETSNADGNFTYTA-TAGGN-PVFTLTLNA-DGSYEFTLEGPLD 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2793962408  741 HPDGSgeNTIDLNFSVVAEDSDQDKSNAVdLQITVTDDVPTITDMTAAStfvVDEDDLS 799
Cdd:TIGR03660   78 HAAGS--DELTLNFPIIATDFDGDTSSIT-LPVTIVDDVPTITDVDALT---VDEDDLP 130
T1SS_rpt_143 TIGR03660
T1SS-143 repeat domain; This model represents a domain of about 143 amino acids that may occur ...
807-948 4.53e-26

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 104.67  E-value: 4.53e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  807 GAFVTTEGADKVEVYELRNVSALEATLTSGTEAISITEiTGAANTTTYQGTTTSGTPVFTLALANDGSYTFTLLGPLNHP 886
Cdd:TIGR03660    1 GQFTVTQGADGVVSYQLDDSTNPVAGLTSGGQAVTLSE-TSNADGNFTYTATAGGNPVFTLTLNADGSYEFTLEGPLDHA 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2793962408  887 TspNSNTLTIPFDVVAVDGDGDDSNQyVLPIEVLDDAPIMSAPTGETvVDEDDLTGVGSDQS 948
Cdd:TIGR03660   80 A--GSDELTLNFPIIATDFDGDTSSI-TLPVTIVDDVPTITDVDALT-VDEDDLPGGSDGSK 137
T1SS_rpt_143 TIGR03660
T1SS-143 repeat domain; This model represents a domain of about 143 amino acids that may occur ...
955-1058 1.00e-19

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 86.57  E-value: 1.00e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  955 GLFTVDEGADGVVKYELVDEDLVLTGLTSDGESLEWQAVSQNGTTFTYVAqtTASNEAVFEIIFDTsDNSYQFELFKPLK 1034
Cdd:TIGR03660    1 GQFTVTQGADGVVSYQLDDSTNPVAGLTSGGQAVTLSETSNADGNFTYTA--TAGGNPVFTLTLNA-DGSYEFTLEGPLD 77
                           90       100
                   ....*....|....*....|....
gi 2793962408 1035 HldGAGENAINLNFSVVAEDFDQD 1058
Cdd:TIGR03660   78 H--AAGSDELTLNFPIIATDFDGD 99
T1SS_rpt_143 TIGR03660
T1SS-143 repeat domain; This model represents a domain of about 143 amino acids that may occur ...
362-487 8.77e-16

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 75.01  E-value: 8.77e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  362 GSDNLQSAVFDAGAlDQFDGLLSDNQN-TLARLSDDGTTITLSIQGRGEVVLTISLDTDGTYKFEQFNPIEQ-VGTDSLT 439
Cdd:TIGR03660    8 GADGVVSYQLDDST-NPVAGLTSGGQAvTLSETSNADGNFTYTATAGGNPVFTLTLNADGSYEFTLEGPLDHaAGSDELT 86
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2793962408  440 FDLPITITDFDQDVVTNNINIVITDgDSPVITNVDSISVDEAGIIGGS 487
Cdd:TIGR03660   87 LNFPIIATDFDGDTSSITLPVTIVD-DVPTITDVDALTVDEDDLPGGS 133
T1SS_rpt_143 TIGR03660
T1SS-143 repeat domain; This model represents a domain of about 143 amino acids that may occur ...
507-654 4.38e-14

Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 70.39  E-value: 4.38e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  507 SDIIDHYELEPTeFNTGGTLVSNGEAVLLELIGEIGGVRTYEGYveVNGTRITVFDVKIDspslGNYEFNLYEELSHQGA 586
Cdd:TIGR03660    9 ADGVVSYQLDDS-TNPVAGLTSGGQAVTLSETSNADGNFTYTAT--AGGNPVFTLTLNAD----GSYEFTLEGPLDHAAG 81
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2793962408  587 EDAlLTFALPIYAVDADGDRSalsggsntpeAAEILVNVKDDVPVMTAPTGETvVDEDDLTGIGSDQS 654
Cdd:TIGR03660   82 SDE-LTLNFPIIATDFDGDTS----------SITLPVTIVDDVPTITDVDALT-VDEDDLPGGSDGSK 137
DUF5801 pfam19116
Domain of unknown function (DUF5801); This entry contains a presumed domain that is found as ...
623-777 1.35e-09

Domain of unknown function (DUF5801); This entry contains a presumed domain that is found as tandem repeats in a number of bacterial proteins.


Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 57.64  E-value: 1.35e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  623 VNVKDDVPVMTAPTGET---VVDEDDLTGIGSDQSEDTIINGLFTVDEGADGVA----KYELVDEDLVLTGL--TSDGES 693
Cdd:pfam19116    1 ISFEDDGPSITASAGEAptlTVDETALGTGGGLADATASFAGLFTSDFGADGAGstgsTYSLSLSAGAASGLtdTATGQA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  694 LEwqaVSQNGTTFtyVAQTATSNEAVFEIIFDTSDNSYQFELFKPIKHPDGSGENtidlnfsvvaeDSDQDKSNAVDLQI 773
Cdd:pfam19116   81 IL---LFLEGGVV--VGRTAGGGDVVFTVSVDAATGEVTLTQYRAVVHPDTSDPD-----------DSVSLAAGLITLTA 144

                   ....
gi 2793962408  774 TVTD 777
Cdd:pfam19116  145 TVTD 148
DUF5801 pfam19116
Domain of unknown function (DUF5801); This entry contains a presumed domain that is found as ...
917-1058 2.25e-06

Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 48.39  E-value: 2.25e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  917 IEVLDDAPIMSAPTGET---VVDEDDLTGVGSDQSEDSIINGLFTVDEGADG----VVKYELVDEDLVLTGL--TSDGES 987
Cdd:pfam19116    1 ISFEDDGPSITASAGEAptlTVDETALGTGGGLADATASFAGLFTSDFGADGagstGSTYSLSLSAGAASGLtdTATGQA 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2793962408  988 LEwqaVSQNGTTFtyVAQTTASNEAVFEIIFDTSDNSYQFELFKPLKHLDGAGEN------AINLNFSVVAEDFDQD 1058
Cdd:pfam19116   81 IL---LFLEGGVV--VGRTAGGGDVVFTVSVDAATGEVTLTQYRAVVHPDTSDPDdsvslaAGLITLTATVTDGDGD 152
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
174-1010 5.14e-04

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 44.37  E-value: 5.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  174 SSGGQSISEVLTEGSISGNTYPQSITTTETIIAGSLALSPDSFVPEALSLASLLTELNSDITSSGQPVTFTYDAATNSII 253
Cdd:COG3210    806 AAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVA 885
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  254 GVQGTDEVLRIDIDVVSVGNNAELSLTTTISQPIDHVTSVGGGQVSYTGDQINITFDIQGEDTAGNPLATPINAQVAVVD 333
Cdd:COG3210    886 TSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGD 965
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  334 GGDPSAESVNISNVETSSAAIEGTFSNIGSDNLQSAVFDAGALDQFDGLLSDNQNTLARL---SDDGTTITLSIQGRGEV 410
Cdd:COG3210    966 TGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTtgtASATGTGTAATAGGQNG 1045
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  411 VLTISLDTDGTYKFEQFNPIEQVGTDSLTFDLPITITDFDQDVVTNNINIVITDGDSPVITNVDSISVDEAGIIGGSQEG 490
Cdd:COG3210   1046 VGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTT 1125
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  491 TAPVAGSGSITADIFESDIIDHYELEPTEFNTGGTLVSNGEAVLLELIGEIGGVRTYEGYVEVNGTRITVFDVKIDSPSL 570
Cdd:COG3210   1126 TVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTG 1205
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  571 GNYEFNLYEELSHQGAEDALLTFALPIYAVDADGDRSALSGGSNTPEAAEILVNVKDDVPVMTAPTGETVVDEDDLTGIG 650
Cdd:COG3210   1206 GSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATATGST 1285
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  651 SDQSEDTIINGLFTVDEGADGVAKYELVDEDLVLTGLTSDGESLEWQAVSQNGTTFTYVAQTATSNEAVFEIIFDTSDNS 730
Cdd:COG3210   1286 VDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGNGATDSAAGA 1365
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  731 YQFELFKPIKHPDGSGENTIDLNFSVVAEDSDQDKSNAVDLQITVTDDVPTITDMTAASTFVVDEDDLSSVIAQATGAFV 810
Cdd:COG3210   1366 GSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGGTGNTTGTSV 1445
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  811 TTEGAdkveVYELRNVSALEATLTSGTEAISITEITGAANTTTYQGTTTSGTPVFTLALANDGSYTFTLLGPLNHPTSPN 890
Cdd:COG3210   1446 AGAGG----GNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTTAEVAK 1521
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  891 SNTLTIPFDVVAVDGDGDDSNQYVLPIEVLDDAPIMSAPTGETVVDEDDLTGVGSDQSEDSIINGLFTVDEGADGVVKYE 970
Cdd:COG3210   1522 ASLEGGEGTYGGSSVAEAGTGGGILGAVSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQAPTAGNTATLTLSL 1601
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|
gi 2793962408  971 LVDEDLVLTGLTSDGESLEWQAVSQNGTTFTYVAQTTASN 1010
Cdd:COG3210   1602 AEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEG 1641
COG4254 COG4254
Uncharacterized peptidoglycan binding protein, contains LysM and FecR domains [General ...
9-53 1.01e-03

Uncharacterized peptidoglycan binding protein, contains LysM and FecR domains [General function prediction only];


Pssm-ID: 443396 [Multi-domain]  Cd Length: 207  Bit Score: 41.59  E-value: 1.01e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2793962408    9 AAVVEAVSGEVIAVKSDGSARKISVGDIIRENEIVITANQAELLL 53
Cdd:COG4254     31 AARVVAVSGEVTAVRADGKWRPLKVGDPLFEGDRIRTGANSRAQL 75
DUF5801 pfam19116
Domain of unknown function (DUF5801); This entry contains a presumed domain that is found as ...
773-908 5.71e-03

Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 38.38  E-value: 5.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  773 ITVTDDVPTIT-DMTAASTFVVDEDDLSSVIAQAT------GAFVTTEGAD----KVEVYELRNVSALEATLTSGTEAIS 841
Cdd:pfam19116    1 ISFEDDGPSITaSAGEAPTLTVDETALGTGGGLADatasfaGLFTSDFGADgagsTGSTYSLSLSAGAASGLTDTATGQA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2793962408  842 IT------EITGAANTTTYQgtttsgtpVFTLAL-ANDGSYTFTLLGPLNHPTSPNSN-TLTIPFDVVAV-----DGDGD 908
Cdd:pfam19116   81 ILlfleggVVVGRTAGGGDV--------VFTVSVdAATGEVTLTQYRAVVHPDTSDPDdSVSLAAGLITLtatvtDGDGD 152
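
The alignment blocks above appear to follow the usual CD-Search display conventions: uppercase letters for residues aligned to the model, lowercase letters for unaligned residues, and `-` for gaps. A minimal Python sketch, assuming those conventions hold, counts identical aligned columns in one block. The function name `aligned_identity` is hypothetical and not part of any NCBI tool; the two strings are the query and model rows of the COG4254 hit (interval 9-53) copied from the listing above.

```python
def aligned_identity(query_row: str, model_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length alignment rows."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        # Assumption: gap columns ('-') and lowercase (unaligned) residues
        # are not counted as aligned positions.
        if q == "-" or m == "-" or not (q.isupper() and m.isupper()):
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Query and model rows of the COG4254 hit, copied from the page.
query = "AAVVEAVSGEVIAVKSDGSARKISVGDIIRENEIVITANQAELLL"
model = "AARVVAVSGEVTAVRADGKWRPLKVGDPLFEGDRIRTGANSRAQL"

identical, aligned = aligned_identity(query, model)
print(f"{identical}/{aligned} aligned columns are identical")
```

Counts from a single block like this are only a rough descriptive figure; the bit score and E-value reported for each hit come from the position-specific scoring matrix, not from simple identity.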
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
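
Each hit in the listing carries a one-line summary (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`), and the preset options give an E-value threshold of 0.01. A minimal Python sketch, assuming the summary lines keep exactly the layout shown on this page, parses them and checks each hit against that threshold. The names `SUMMARY_RE` and `parse_summary` are hypothetical helpers, and treating the threshold as inclusive (`<=`) is an assumption.

```python
import re

# Pattern for the per-hit summary line as it appears on this page (assumed layout).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one 'Pssm-ID: ...' summary line into a dictionary of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Three summary lines copied from the listing above.
lines = [
    "Pssm-ID: 468140  Cd Length: 145  Bit Score: 165.11  E-value: 3.66e-47",
    "Pssm-ID: 132699 [Multi-domain]  Cd Length: 137  Bit Score: 110.84  E-value: 2.77e-28",
    "Pssm-ID: 465976 [Multi-domain]  Cd Length: 152  Bit Score: 38.38  E-value: 5.71e-03",
]

E_VALUE_THRESHOLD = 0.01  # from the preset options on this page

hits = [parse_summary(line) for line in lines]
# Assumption: a hit is reported when its E-value is at or below the threshold.
passing = [hit for hit in hits if hit["e_value"] <= E_VALUE_THRESHOLD]

for hit in passing:
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```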
