Conserved domains on [gi|1002889586|gb|AMN97975|]
type IV pilin biogenesis protein [Salmonella enterica subsp. enterica serovar Enteritidis str. EC20120685]

Protein Classification

protein transport protein HofC (domain architecture ID 11484805)

protein transport protein HofC, a homolog of Pseudomonas aeruginosa type IV pilus assembly protein PilC that is involved in the translocation of the type IV pilin

List of domain hits

Name Accession Description Interval E-value
PRK10573 PRK10573
protein transport protein HofC;
1-397 0e+00

protein transport protein HofC;



Pssm-ID: 182559 [Multi-domain]  Cd Length: 399  Bit Score: 611.89  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586   1 MSVKQLWRWQGVNDKGQLEQDVVWVDNRLALIITLQHQRIMPLRIKR-IGVNAALWKEEQSAEIIHQLATLIHAGLTLSE 79
Cdd:PRK10573    1 MASKQLWRWQAINGKGELQDGMLWATSRLLLYQALQQQGLQPLSLKRgRRINARYWRGEQSAEFIRQLATLLQAGLPLSE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586  80 GLELLAKQHPHRQWQALLRTLAHELEQGVPFSSALVSWPQVFPPLYQTMIRTGELTGKLAECCFELARQQKAQRQITVSV 159
Cdd:PRK10573   81 GLQLLAEQHPSAQWQALLQDLAHQLEQGEAFSEALLQWPQVFPPLYQALIATGELTGKLDECCFQLARQQEAQQQLTKKV 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 160 KKALRYPAIILTMAALVVFAMLHFVLPEFAAIYRSFNTPLPLLTRGIIAIAQWGSAWGWLILFLTMLVAIAHRRVKQK-P 238
Cdd:PRK10573  161 KKALRYPLIILAVALLVVLAMLHFVLPEFAAIYRSFNTPLPLLTRGILALSDFLIQYGWLLLLLLFLLAIAYKRLRRKkP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 239 SWQAQRQRLLLRLPVMGRLIRGQKLAQIFTVLALTQSAGIPFLQGLESAIESLGCPYWSQRLTQVHQEIAAGNPVWLALK 318
Cdd:PRK10573  241 TWQIREQRLLLRLPLVGSLIRGQKLSQIFTILALTQSAGLTLLQGLESAAETLRCPYWQQALTQIQQQIAQGIPLWLALK 320
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002889586 319 NTQEFSPLCLQLVRTGEASGSLDIMLHNLARHHSESTLALADNLASLLEPALLIITGLIIGTLVVAMYLPIFHLGDAMS 397
Cdd:PRK10573  321 NHPLFPPLCLQLVRVGEESGSLDLMLENLAHWHQEQTQALADNLAQLLEPLLMIITGGIVGTLVVAMYLPIFQLGDAMS 399
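
Each alignment block pairs a query row (gi 1002889586) with a domain-model row (Cdd:...). As a rough illustration of how such a block can be summarized, the sketch below computes percent identity over the columns where neither row has a gap. The function name `percent_identity` is hypothetical, and the case-insensitive comparison and the choice to skip gapped columns are assumptions of this sketch, not the scoring used by CD-Search.

```python
def percent_identity(query: str, subject: str, gap: str = "-") -> float:
    """Percent of gap-free aligned columns in which both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Keep only columns where neither row has a gap character.
    pairs = [(q, s) for q, s in zip(query, subject) if q != gap and s != gap]
    if not pairs:
        return 0.0
    matches = sum(q.upper() == s.upper() for q, s in pairs)
    return 100.0 * matches / len(pairs)


# First alignment block of the PRK10573 hit, copied from the listing above.
query_row = "MSVKQLWRWQGVNDKGQLEQDVVWVDNRLALIITLQHQRIMPLRIKR-IGVNAALWKEEQSAEIIHQLATLIHAGLTLSE"
model_row = "MASKQLWRWQAINGKGELQDGMLWATSRLLLYQALQQQGLQPLSLKRgRRINARYWRGEQSAEFIRQLATLLQAGLPLSE"

print(f"{percent_identity(query_row, model_row):.1f}% identity over gap-free columns")
```

The same helper can be applied block by block, or to the concatenated rows of a whole hit, to get a quick per-hit figure.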
 
PulF COG1459
Type II secretory pathway, component PulF [Cell motility, Intracellular trafficking, secretion, ...
6-397 1.22e-137

Type II secretory pathway, component PulF [Cell motility, Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 441068 [Multi-domain]  Cd Length: 399  Bit Score: 398.72  E-value: 1.22e-137
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586   6 LWRWQGVNDKGQLEQDVVWVDNRLALIITLQHQRIMPLRIKRIGVNAAL------WKEEQSAEIIHQLATLIHAGLTLSE 79
Cdd:COG1459     2 TFRYKALDADGKKVKGEIEADSEAEARAQLREQGLTPLSVKEKKEGLAArlfrrkVKAKDLALFTRQLATLLRAGLPLLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586  80 GLELLAKQHPHRQWQALLRTLAHELEQGVPFSSALVSWPQVFPPLYQTMIRTGELTGKLAECCFELARQQKAQRQITVSV 159
Cdd:COG1459    82 ALEILAEQTENPRLRKVLADIREDVEEGASLSEALAKHPKVFPPLYVNMVRAGEASGNLDEVLERLADYLEKQEELRKKI 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 160 KKALRYPAIILTMAALVVFAMLHFVLPEFAAIYRSFNTPLPLLTRGIIAIAQWGSAWGWLILFLTMLVAIAHRRVKQKPS 239
Cdd:COG1459   162 KSALIYPAIVLVVAIGVVLFLLTFVVPQFAGIFESFGAELPLLTRILIALSDFLQNYWWLLLLGLVLLVVGFRRLLRTPK 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 240 WQAQRQRLLLRLPVMGRLIRGQKLAQIFTVLALTQSAGIPFLQGLESAIESLGCPYWSQRLTQVHQEIAAGNPVWLALKN 319
Cdd:COG1459   242 GRLRLDRLLLKLPVIGPLIRKAALARFARTLATLLSSGVPLLEALEIAAEVVGNRVLREALEEARERVREGESLSEALEA 321
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002889586 320 TQEFSPLCLQLVRTGEASGSLDIMLHNLARHHSESTLALADNLASLLEPALLIITGLIIGTLVVAMYLPIFHLGDAMS 397
Cdd:COG1459   322 SGLFPPLVVQMIAVGEESGALDEMLEKVADFYEEEVDRAVDRLTSLLEPLLIVVLGGIVGFIVLAIYLPIFSLGSLVG 399
GspF TIGR02120
type II secretion system protein F; This membrane protein is a component of the terminal ...
8-392 2.64e-68

type II secretion system protein F; This membrane protein is a component of the terminal branch complex of the general secretion pathway (GSP), also known as the "Type II" secretion pathway. The GSP transports proteins (generally virulence-associated cell wall hydrolases) across the outer membrane of the bacterial cell. Transport across the inner membrane is often, but not exclusively, handled by the Sec system. This model was constructed from the broader subfamily model, pfam00482, which includes components of pilin complexes (PilC) as well as other related genes. GspF is nearly always gene clustered with other GSP subunits. Some genes from Xylella and Xanthomonas strains score below the trusted cutoff due to excessive divergence from the family, such that a sequence from Deinococcus which does not appear to be GspF scores higher. [Protein fate, Protein and peptide secretion and trafficking]


Pssm-ID: 273980 [Multi-domain]  Cd Length: 399  Bit Score: 221.05  E-value: 2.64e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586   8 RWQGVNDKGQLEQDVVWVDNRLALIITLQHQRIMPLRIKRIGVNAA---------LWKEEQSAEIIHQLATLIHAGLTLS 78
Cdd:TIGR02120   2 RYRALDAAGRAQKGTLEADSARAARLQLRERGLFPLDVDPVAAKGAgsgrklgrrRLSRAELALFTRQLATLLGAGLPLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586  79 EGLELLAKQHPHRQWQALLRTLAHELEQGVPFSSALVSWPQVFPPLYQTMIRTGELTGKLAECCFELARQQKAQRQITVS 158
Cdd:TIGR02120  82 EALAALLEQAEKPRLKSVLAAIRSRVLEGKSLADALAQHPRDFPPLYRALVAAGEASGALDAVLERLADYLEERQALRSK 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 159 VKKALRYPAIILTMAALVVFAMLHFVLPEFAAIYRSFNTPLPLLTRGIIAIAQWGSAWGWLILFLTMLVAIAHRRVKQKP 238
Cdd:TIGR02120 162 ITTALIYPAVLTVVAIGVVIFLLAYVVPKVVEQFAHMKQTLPLLTRALIALSDFLRSWGWALLAALAALVVLFRRLLRDP 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 239 SWQAQRQRLLLRLPVMGRLIRGQKLAQIFTVLALTQSAGIPFLQGLESAIESLGCPYWSQRLTQVHQEIAAGNPVWLALK 318
Cdd:TIGR02120 242 AFRLRFDRRLLRLPVIGRLVRGLNTARFARTLSILLSSGVPLLRALQIARETLTNRALRAAVEDAAARVREGGSLSRALR 321
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1002889586 319 NTQEFSPLCLQLVRTGEASGSLDIMLHNLARHHSESTLALADNLASLLEPALLIITGLIIGTLVVAMYLPIFHL 392
Cdd:TIGR02120 322 ATGLFPPLLVHMIASGEKSGQLETMLERAADNQEREFERRIATLTALLEPLLIVVMGGVVLFIVLAVLLPILQL 395
T4P_ComGB NF041012
competence type IV pilus assembly protein ComGB; Members of this family occur in Gram-positive ...
59-392 1.88e-29

competence type IV pilus assembly protein ComGB; Members of this family occur in Gram-positive bacteria as part of a type IV pilus system used for DNA binding and uptake when species such as Streptococcus pneumoniae or Bacillus subtilis are in a competent state for natural transformation. Members of this family, typically called ComGB (second protein of the ComG operon), belong more broadly to the family of GspF, part of type 2 secretion systems (T2SS) in Gram-negative bacteria.


Pssm-ID: 468941 [Multi-domain]  Cd Length: 333  Bit Score: 116.42  E-value: 1.88e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586  59 QSAEIIHQLATLIHAGLTLSEGLELLAKQHPHRQWQALLRtLAHELEQGVPFSSALVSWPqvFPPLYQTMIRTGELTGKL 138
Cdd:NF041012    2 QQAKFLQRLGELLESGFSLSEALEFLLRQLPKKSKEYLQK-ILEGLKEGASLSEILKQLG--FSDEIVTQIYFAEKHGNL 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 139 AECCFELARQQKAQRQITVSVKKALRYPAIILTMAALVVFAMLHFVLPEFAAIYRSFNTPLPLLTRGIIAIAQWGSAWGW 218
Cdd:NF041012   79 AETLKEIAEYLKRKEKQKKKLIKVLQYPLLLLLFLILILLGLRQYLLPQFEQLYSSMNVMLSSFTNLLTLFIQHLPLIIL 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 219 LILFLTMLVAIAHRRVKQKPSwQAQRQRLLLRLPVMGRLIR-----------------GQKLAQIFTVLALTQSAgiPFL 281
Cdd:NF041012  159 GFLLLLLLLFLIYIFYFKKLS-PLKQIKFLSKIPLIGSLYKlyltyyfarelgnllkqGLSLQQILQLMQEQKSD--PFL 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 282 QGLESAIEslgcpywsqrltqvhQEIAAGNPVWLALKN----TQEFSplclQLVRTGEASGSLDIMLHNLARHHSESTLA 357
Cdd:NF041012  236 QELAKRLE---------------ERLLKGESLEQILKKypffEKELS----LIIEHGEKKGKLGKELLLYSQLLLEKFEQ 296
                         330       340       350
                  ....*....|....*....|....*....|....*
gi 1002889586 358 LADNLASLLEPALLIITGLIIGTLVVAMYLPIFHL 392
Cdd:NF041012  297 KIEKLLKFIQPIIFLLIALLIVSIYLAILLPMYQM 331
T2SSF pfam00482
Type II secretion system (T2SS), protein F; The original family covered both the regions found ...
63-180 2.44e-21

Type II secretion system (T2SS), protein F; The original family covered both the regions found by the current model. The splitting of the family has allowed the related FlaJ_arch (archaeal FlaJ family) to be merged with it. Proteins with this domain form a platform for the machinery of the Type II secretion system, as well as the Type 4 pili and the archaeal flagella. This domain seems to show some similarity to PF00664, but this may just be due to similarities in the TM helices (personal obs: C Yeats).


Pssm-ID: 425708 [Multi-domain]  Cd Length: 119  Bit Score: 88.55  E-value: 2.44e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586  63 IIHQLATLIHAGLTLSEGLELLAKQHPHRQWQALLRTLAHELEQGVPFSSALVSWPQ-VFPPLYQTMIRTGELTGKLAEC 141
Cdd:pfam00482   1 FLRQLATLLRAGLPLVEALEILAEEAENGPLREELRRIAERVREGGSLSEALARTPSsVFPPLLVALIAAGESGGNLAEV 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 1002889586 142 CFELARQQKAQRQITVSVKKALRYPAIILTMAALVVFAM 180
Cdd:pfam00482  81 LERLADYLEEERELRRKIKAALLYPLILLVVALLVLLIL 119
 
T2SSF pfam00482
Type II secretion system (T2SS), protein F; The original family covered both the regions found ...
266-382 4.65e-17

Type II secretion system (T2SS), protein F; The original family covered both the regions found by the current model. The splitting of the family has allowed the related FlaJ_arch (archaeal FlaJ family) to be merged with it. Proteins with this domain form a platform for the machinery of the Type II secretion system, as well as the Type 4 pili and the archaeal flagella. This domain seems to show some similarity to PF00664, but this may just be due to similarities in the TM helices (personal obs: C Yeats).


Pssm-ID: 425708 [Multi-domain]  Cd Length: 119  Bit Score: 76.61  E-value: 4.65e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 266 IFTVLALTQSAGIPFLQGLESAIESLGCPYWSQRLTQVHQEIAAGNPVWLALKNTQ--EFSPLCLQLVRTGEASGSLDIM 343
Cdd:pfam00482   1 FLRQLATLLRAGLPLVEALEILAEEAENGPLREELRRIAERVREGGSLSEALARTPssVFPPLLVALIAAGESGGNLAEV 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 1002889586 344 LHNLARHHSESTLALADNLASLLEPALLIITGLIIGTLV 382
Cdd:pfam00482  81 LERLADYLEEERELRRKIKAALLYPLILLVVALLVLLIL 119
TadB COG4965
Flp pilus assembly protein TadB [Intracellular trafficking, secretion, and vesicular transport, ...
57-210 4.38e-03

Flp pilus assembly protein TadB [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 443991 [Multi-domain]  Cd Length: 214  Bit Score: 38.25  E-value: 4.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586  57 EEQSAEIIHQLATLIHAGLTLSEGLELLAKQHPHrQWQALLRTLAHELEQGVPFSSALVSWPQVFP-PLYQTMI------ 129
Cdd:COG4965    43 EEQLPDALDLLARALRAGLSLPQALEAVAREAPE-PLREEFRRIVRELRLGVDLEEALRRLAERLPsPELDLFAaalriq 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002889586 130 -RTGeltGKLAECCFELAR----QQKAQRQITVSVKKAlRYPAIILTMAALVVFAMLHFVLPEFAAIYrsFNTPLplltr 204
Cdd:COG4965   122 rRTG---GNLAEVLENLAEtireRLRLRREIRALTAEG-RLSARILAALPVLVLLLLYLLNPDYLAPL--FTTPL----- 190

                  ....*.
gi 1002889586 205 GIIAIA 210
Cdd:COG4965   191 GQILLA 196
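
Every hit in this listing carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The sketch below shows one way to pull those fields out of such a line with a regular expression. The pattern and the function name `parse_summary` are hypothetical and are fitted only to the lines shown on this page; other CD-Search output may be laid out differently.

```python
import re

# Pattern fitted to the summary lines shown in this listing (an assumption, not an official format).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line: str) -> dict:
    """Return the numeric fields of one hit summary line as a dictionary."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "hit_type": match["hit_type"],          # e.g. "Multi-domain", or None if absent
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),     # "0e+00" parses to 0.0
    }


# Summary line of the TadB (COG4965) hit, copied from the listing above.
tadb = parse_summary(
    "Pssm-ID: 443991 [Multi-domain]  Cd Length: 214  Bit Score: 38.25  E-value: 4.38e-03"
)
print(tadb)
```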
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
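
To make the effect of the E-value threshold concrete, the sketch below applies a cutoff to the hits reported on this page. The accessions, intervals, and E-values are copied from the hit list; the tuple layout, the function name `hits_within_threshold`, and the "less than or equal" comparison are assumptions of this illustration rather than a description of how CD-Search applies its threshold internally.

```python
# (accession, (start, end) on the query, E-value), copied from the hit list on this page.
HITS = [
    ("PRK10573", (1, 397), 0.0),
    ("COG1459", (6, 397), 1.22e-137),
    ("TIGR02120", (8, 392), 2.64e-68),
    ("NF041012", (59, 392), 1.88e-29),
    ("pfam00482", (63, 180), 2.44e-21),
    ("pfam00482", (266, 382), 4.65e-17),
    ("COG4965", (57, 210), 4.38e-03),
]


def hits_within_threshold(hits, threshold=0.01):
    """Keep hits whose E-value is at or below the threshold (comparison rule assumed)."""
    return [hit for hit in hits if hit[2] <= threshold]


# With the page's preset of 0.01 every listed hit is retained.
for accession, (start, end), e_value in hits_within_threshold(HITS):
    print(f"{accession:10s} {start:>3d}-{end:<3d} {e_value:.2e}")
```

Tightening the cutoff, for example `hits_within_threshold(HITS, 1e-10)`, would drop the weak TadB (COG4965) match while keeping the others.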
