NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|16128676|ref|NP_415229|]
View 

rhs element protein RhsC [Escherichia coli str. K-12 substr. MG1655]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
RHS_core NF041261
RHS element core protein;
1-1247 0e+00

RHS element core protein;


:

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 2562.66  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676     1 MSGKPAARQGDMTQYGGSIVQGSAGVRIGAPTGVACSVCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSS 80
Cdd:NF041261    1 MSGKPAARQGDMTQYGGPIVQGSAGVRIGAPTGVACSVCPGGMTSGNPVNPLLGAKVLPGETDIALPGPLPFILSRTYSS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676    81 YRTKTPAPVGSLGPGWKMPADIRLQLRDNTLILSDNGGRSLYFEHLFPGEDGYSRSESLWLVRGGVAKLDEGHRLAALWQ 160
Cdd:NF041261   81 YRTRTPAPVGVFGPGWKAPSDIRLQLRDDGLILNDNGGRSIHFEPLFPGEAVYSRSESLWLVRGGVAAQPDGHTLAALWQ 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   161 ALPEELRLSPHRYLATNSPQGPWWLLGWCERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGEFSGEITGVTDG 240
Cdd:NF041261  161 ALPEDIRLSPHLYLATNSAQGPWWILGWSERVPGADEVLPAPLPPYRVLTGMVDRFGRTLTFHREAAGDLAGEITGVTDG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   241 AGRHFRLVLTTQAQRAEEARQQAISGGTEP--------SAFPDTLPGYTEYGRDNGIRLSAVWLTHDPEYPENLPAAPLV 312
Cdd:NF041261  241 AGREFRLVLTTQAQRAEEARKQRTSSLSSPdgprplssSAFPDTLPGGTEYGPDNGIRLSAVWLTHDPAYPESLPAAPLV 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   313 RYGWTPRGELAAVYDRSNTQVRSFTYDDKYRGRMVAHRHTGRPEICYRYDSDGRVTEQLNPAGLSYTYQYEKDRITITDS 392
Cdd:NF041261  321 RYTYTEAGELLAVYDRSNTQVRAFTYDAQHPGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQDRITITDS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   393 LNRREVLHTQGEGGLKRVVKKEHADGSVTQSQFDAVGRLRAQTDAAGRTTEYSPDVVTGLITRITTPDGRASAFYYNHHS 472
Cdd:NF041261  401 LNRREVLHTEGEGGLKRVVKKEHADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVSGDITDITTPDGRETKFYYNDGN 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   473 QLTSATGPDGLEIRREYDEWGRLIQETAPDGDITRYRYDNPHSDLPCATEDATGSRKTMTWSRYGQLLSFTDCSGYVTRY 552
Cdd:NF041261  481 QLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSELPATTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRY 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   553 DHDRFGQVTAVHREEGLSQYRAYDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGSRNGTQYDAWGKAICTTQGGLT 632
Cdd:NF041261  561 EYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQGGLT 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   633 RSMEYDAAGRVIRLTSENGSHTTFRYDVLDRLIQETGFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNG 712
Cdd:NF041261  641 RSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRITHRTVNG 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   713 ETAERWQYDERGWLTDISHISEGHRVTVHYGYDSKGRLASEHLTVHHPQTNELLWQHETRHAYNAQGLANRCIPDSLPAV 792
Cdd:NF041261  721 EPAEQWQYDEHGWLTDISHLSEGHRVAVHYGYDDKGRLTGERQTVENPETGELLWQHETGHAYNEQGLANRVTPDSLPPV 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   793 EWLTYGSGWLSGMKLGDTPLVEYTRDRLHRETLRSFG------RYELTTAYTPAGQLQSQHLNSLLSDRDYTWNDNGELI 866
Cdd:NF041261  801 EWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGgagsnaAYELTTAYTPAGQLQSQHLNSLVYDRDYTWNDNGDLV 880
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   867 RISSPRQTRSYSYSTTGRLTGVHTTAANLDIRIPYTTDPAGNRLPDPELHPDSALSMWPDNRIARDAHYLYRYDRHGRLT 946
Cdd:NF041261  881 RISGPRQTREYGYSATGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLTAWPDNRIAEDAHYVYRYDEYGRLT 960
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   947 EKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQYAEPLVESRYLYDPLGRRVAKRVWRRERDLTGWMSLSRKPQVTW 1026
Cdd:NF041261  961 EKTDRIPEGVIRTDDERTHHYHYDSQHRLVFYTRIQHGEPLVESRYLYDPLGRRMAKRVWRRERDLTGWMSLSRKPEVTW 1040
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  1027 YGWDGDRLTTIQNDRTRIQTIYQPGSFTPLIRVETATGELAKTQRRSLADTLQQSGGEDGGSVVFPPVLVQMLDRLESEI 1106
Cdd:NF041261 1041 YGWDGDRLTTVQTDTTRIQTVYQPGSFTPLIRVETENGERAKAQRRSLAETLQQEGSENGHGVVFPAELVRMLDRLEEEI 1120
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  1107 LADRVSEESRRWLASCGLTVAQMQSQMDPVYTPARKIHLYHCDHRGLPLALISTEGTTAWYAEYDEWGNLLNEENPHQLQ 1186
Cdd:NF041261 1121 RADRVSEESRAWLAQCGLTVEQMARQVEPEYTPARKLHLYHCDHRGLPLALISEEGNTAWQGEYDEWGNLLNEENPHHLQ 1200
                        1210      1220      1230      1240      1250      1260
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 16128676  1187 QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQYPLNPISNIDPLGL 1247
Cdd:NF041261 1201 QPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
 
Name Accession Description Interval E-value
RHS_core NF041261
RHS element core protein;
1-1247 0e+00

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 2562.66  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676     1 MSGKPAARQGDMTQYGGSIVQGSAGVRIGAPTGVACSVCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSS 80
Cdd:NF041261    1 MSGKPAARQGDMTQYGGPIVQGSAGVRIGAPTGVACSVCPGGMTSGNPVNPLLGAKVLPGETDIALPGPLPFILSRTYSS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676    81 YRTKTPAPVGSLGPGWKMPADIRLQLRDNTLILSDNGGRSLYFEHLFPGEDGYSRSESLWLVRGGVAKLDEGHRLAALWQ 160
Cdd:NF041261   81 YRTRTPAPVGVFGPGWKAPSDIRLQLRDDGLILNDNGGRSIHFEPLFPGEAVYSRSESLWLVRGGVAAQPDGHTLAALWQ 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   161 ALPEELRLSPHRYLATNSPQGPWWLLGWCERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGEFSGEITGVTDG 240
Cdd:NF041261  161 ALPEDIRLSPHLYLATNSAQGPWWILGWSERVPGADEVLPAPLPPYRVLTGMVDRFGRTLTFHREAAGDLAGEITGVTDG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   241 AGRHFRLVLTTQAQRAEEARQQAISGGTEP--------SAFPDTLPGYTEYGRDNGIRLSAVWLTHDPEYPENLPAAPLV 312
Cdd:NF041261  241 AGREFRLVLTTQAQRAEEARKQRTSSLSSPdgprplssSAFPDTLPGGTEYGPDNGIRLSAVWLTHDPAYPESLPAAPLV 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   313 RYGWTPRGELAAVYDRSNTQVRSFTYDDKYRGRMVAHRHTGRPEICYRYDSDGRVTEQLNPAGLSYTYQYEKDRITITDS 392
Cdd:NF041261  321 RYTYTEAGELLAVYDRSNTQVRAFTYDAQHPGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQDRITITDS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   393 LNRREVLHTQGEGGLKRVVKKEHADGSVTQSQFDAVGRLRAQTDAAGRTTEYSPDVVTGLITRITTPDGRASAFYYNHHS 472
Cdd:NF041261  401 LNRREVLHTEGEGGLKRVVKKEHADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVSGDITDITTPDGRETKFYYNDGN 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   473 QLTSATGPDGLEIRREYDEWGRLIQETAPDGDITRYRYDNPHSDLPCATEDATGSRKTMTWSRYGQLLSFTDCSGYVTRY 552
Cdd:NF041261  481 QLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSELPATTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRY 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   553 DHDRFGQVTAVHREEGLSQYRAYDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGSRNGTQYDAWGKAICTTQGGLT 632
Cdd:NF041261  561 EYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQGGLT 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   633 RSMEYDAAGRVIRLTSENGSHTTFRYDVLDRLIQETGFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNG 712
Cdd:NF041261  641 RSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRITHRTVNG 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   713 ETAERWQYDERGWLTDISHISEGHRVTVHYGYDSKGRLASEHLTVHHPQTNELLWQHETRHAYNAQGLANRCIPDSLPAV 792
Cdd:NF041261  721 EPAEQWQYDEHGWLTDISHLSEGHRVAVHYGYDDKGRLTGERQTVENPETGELLWQHETGHAYNEQGLANRVTPDSLPPV 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   793 EWLTYGSGWLSGMKLGDTPLVEYTRDRLHRETLRSFG------RYELTTAYTPAGQLQSQHLNSLLSDRDYTWNDNGELI 866
Cdd:NF041261  801 EWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGgagsnaAYELTTAYTPAGQLQSQHLNSLVYDRDYTWNDNGDLV 880
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   867 RISSPRQTRSYSYSTTGRLTGVHTTAANLDIRIPYTTDPAGNRLPDPELHPDSALSMWPDNRIARDAHYLYRYDRHGRLT 946
Cdd:NF041261  881 RISGPRQTREYGYSATGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLTAWPDNRIAEDAHYVYRYDEYGRLT 960
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   947 EKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQYAEPLVESRYLYDPLGRRVAKRVWRRERDLTGWMSLSRKPQVTW 1026
Cdd:NF041261  961 EKTDRIPEGVIRTDDERTHHYHYDSQHRLVFYTRIQHGEPLVESRYLYDPLGRRMAKRVWRRERDLTGWMSLSRKPEVTW 1040
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  1027 YGWDGDRLTTIQNDRTRIQTIYQPGSFTPLIRVETATGELAKTQRRSLADTLQQSGGEDGGSVVFPPVLVQMLDRLESEI 1106
Cdd:NF041261 1041 YGWDGDRLTTVQTDTTRIQTVYQPGSFTPLIRVETENGERAKAQRRSLAETLQQEGSENGHGVVFPAELVRMLDRLEEEI 1120
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  1107 LADRVSEESRRWLASCGLTVAQMQSQMDPVYTPARKIHLYHCDHRGLPLALISTEGTTAWYAEYDEWGNLLNEENPHQLQ 1186
Cdd:NF041261 1121 RADRVSEESRAWLAQCGLTVEQMARQVEPEYTPARKLHLYHCDHRGLPLALISEEGNTAWQGEYDEWGNLLNEENPHHLQ 1200
                        1210      1220      1230      1240      1250      1260
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 16128676  1187 QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQYPLNPISNIDPLGL 1247
Cdd:NF041261 1201 QPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
133-1321 5.71e-32

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 136.04  E-value: 5.71e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  133 YSRSESLWLVRGGVAKLDEGHRLAALWQALPEELRLSPHRYLATNSPQGPWWLLGWCERVPEADEVLPAPLPPYRVLTGL 212
Cdd:COG3209   61 STTDVVGTLTGAGGTSAGGVTALGDASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGAT 140
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  213 VDRFGRTQTFHREAAGEFSGEITGVTDGAGRHFRLVLTTQAQRAEEARQQAISGGTepSAFPDTLPGYTEYGRDNGIRLS 292
Cdd:COG3209  141 AGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLA--GSALLALGSGAILGGLAGAYSG 218
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  293 AVWLTHDPEYPENLPAAPLVRYGWTPRGELAAVYDRSNTQVRSFTYDDKYRGRMVAHRHTGRPEICYRYDSDGRVTEQLN 372
Cdd:COG3209  219 SATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGA 298
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  373 PAGLSYTYQYEKDRITITDSLNRREVLHTQGEGGLKRVVKKEHADGSVTQSQFDAVGRLRAQTDAAGRTTEYSPDVVTGL 452
Cdd:COG3209  299 GLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGST 378
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  453 ITRITTPDGRASAFYYNHHSQLTSATGPDGLEIRREYDEWGRLIQETAPDGDITRYRYDNPHSDLPCATEDATGSRKTMT 532
Cdd:COG3209  379 SGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGAS 458
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  533 WSRYGQLLSFTDCSGYVTRYDHDRFGQVTAVHREEGLSQYRAYDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGSR 612
Cdd:COG3209  459 GTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLS 538
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  613 NGTQYDAWGKAICTTQGGLTRSMEYDAAGRVIRLTSENGSHTTFRYDVLDRLIQETGFDGRTQRYHHDLTGKLIRSEDEG 692
Cdd:COG3209  539 ATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGL 618
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  693 LVTHWHYDEADRLTHRTVNGETAERWQYDERGWLTDISHISEGHRVTVHYGYDSKGRLASEHLTVHHPQTNELLWQHeTR 772
Cdd:COG3209  619 TLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATT-GA 697
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  773 HAYNAQGLANRCIPDSLPAVEWLTYGSGWLSGMKLGDTPLVEYTRDRLHRETLRSFGRYELTTAYTPAGQLQSQHLNSLL 852
Cdd:COG3209  698 TTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGV 777
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  853 SDRD----YTWNDNGELIRISSPR-QTRSYSYSTTGRLTGVHTTAANLDIRIpyttdpagnrlpdpelhpdsalsmwpdn 927
Cdd:COG3209  778 TQGTyttrYTYDALGRLTSVTYPDgETVTYTYDALGRLTSVITVGSGGGTDL---------------------------- 829
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  928 riardAHYLYRYDRHGRLTEKTDLIPEGvirtddERTHRYHYDSQHRLVHYTRTQYAEplvesRYLYDPLGRRVAKRVWr 1007
Cdd:COG3209  830 -----QDRTYTYDAAGNITSITDALRAG------TLTQTYTYDALGRLTSATDPGTTE-----SYTYDANGNLTSRTDG- 892
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676 1008 rerdltgwmslsrkpQVTWYGWDG-DRLTTIQNDrtriqtiyqpgsftplirvetatgelaktqrrsladtlqqsggeDG 1086
Cdd:COG3209  893 ---------------GTTTYTYDAlGRLVSVTKP--------------------------------------------DG 913
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676 1087 GSVVFppvlvqmldrleseiladrvseesrrwlascgltvaqmqsqmdpVYTParkihLYHCDHRGLPLALISTEGTTAW 1166
Cdd:COG3209  914 TTTTY--------------------------------------------TYDA-----LGHTDHLGSVRALTDASGQVVW 944
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676 1167 YAEYDEWGNLLNEENPHQLQQLiRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQYPL-NPISNIDPL 1245
Cdd:COG3209  945 RYDYDPFGNLLAETSGAAANPL-RFTGQEYDAETGLYYNGARYYDPALGRFLSPDPIGLAGGLNLYAYVGnNPVNYVDPL 1023
                       1130      1140      1150      1160      1170      1180      1190
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 16128676 1246 GLETL--KCIKPLHSMGGTGERSGPDIWGNPFYHQYLCVPDGKGDYTCGGQDQRGESKGDGLWGPGKASNDTKEAAGR 1321
Cdd:COG3209 1024 GLAALlgTTGLGGGAGVGAGAAGGGAAAAGGSAGAGAAGGGAGGAGAGGAGGGAGAGAGAAAGAAGGAGGGAGASGAG 1101
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1170-1247 1.52e-30

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 115.29  E-value: 1.52e-30
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 16128676   1170 YDEWGNLLNEENPhqLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQY-PLNPISNIDPLGL 1247
Cdd:TIGR03696    1 YDPYGEVLSESGA--APNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYvGNNPVNWVDPLGL 77
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
46-123 3.98e-18

Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.


Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 79.88  E-value: 3.98e-18
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 16128676     46 GHPVNPLLGAKVLPgETDIALPGPLPFILSRTYSSYRTKTpapvGSLGPGWKMPADIRLQLR-DNTLILSDNGGRSLYF 123
Cdd:pfam20148    1 GDPVNVATGNKVLE-ETDFSLPGPLPLVWTRTYNSSSERD----GPLGPGWSHPYDQRLELEgDGGVVYIDADGREVTF 74
PAAR_2 cd14738
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
1-29 8.35e-08

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269823  Cd Length: 94  Bit Score: 51.48  E-value: 8.35e-08
                         10        20
                 ....*....|....*....|....*....
gi 16128676    1 MSGKPAARQGDMTQYGGSIVQGSAGVRIG 29
Cdd:cd14738   66 IGGKPAARMGDSTAHGGVIVSGVPTVLIG 94
 
Name Accession Description Interval E-value
RHS_core NF041261
RHS element core protein;
1-1247 0e+00

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 2562.66  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676     1 MSGKPAARQGDMTQYGGSIVQGSAGVRIGAPTGVACSVCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSS 80
Cdd:NF041261    1 MSGKPAARQGDMTQYGGPIVQGSAGVRIGAPTGVACSVCPGGMTSGNPVNPLLGAKVLPGETDIALPGPLPFILSRTYSS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676    81 YRTKTPAPVGSLGPGWKMPADIRLQLRDNTLILSDNGGRSLYFEHLFPGEDGYSRSESLWLVRGGVAKLDEGHRLAALWQ 160
Cdd:NF041261   81 YRTRTPAPVGVFGPGWKAPSDIRLQLRDDGLILNDNGGRSIHFEPLFPGEAVYSRSESLWLVRGGVAAQPDGHTLAALWQ 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   161 ALPEELRLSPHRYLATNSPQGPWWLLGWCERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGEFSGEITGVTDG 240
Cdd:NF041261  161 ALPEDIRLSPHLYLATNSAQGPWWILGWSERVPGADEVLPAPLPPYRVLTGMVDRFGRTLTFHREAAGDLAGEITGVTDG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   241 AGRHFRLVLTTQAQRAEEARQQAISGGTEP--------SAFPDTLPGYTEYGRDNGIRLSAVWLTHDPEYPENLPAAPLV 312
Cdd:NF041261  241 AGREFRLVLTTQAQRAEEARKQRTSSLSSPdgprplssSAFPDTLPGGTEYGPDNGIRLSAVWLTHDPAYPESLPAAPLV 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   313 RYGWTPRGELAAVYDRSNTQVRSFTYDDKYRGRMVAHRHTGRPEICYRYDSDGRVTEQLNPAGLSYTYQYEKDRITITDS 392
Cdd:NF041261  321 RYTYTEAGELLAVYDRSNTQVRAFTYDAQHPGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQDRITITDS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   393 LNRREVLHTQGEGGLKRVVKKEHADGSVTQSQFDAVGRLRAQTDAAGRTTEYSPDVVTGLITRITTPDGRASAFYYNHHS 472
Cdd:NF041261  401 LNRREVLHTEGEGGLKRVVKKEHADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVSGDITDITTPDGRETKFYYNDGN 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   473 QLTSATGPDGLEIRREYDEWGRLIQETAPDGDITRYRYDNPHSDLPCATEDATGSRKTMTWSRYGQLLSFTDCSGYVTRY 552
Cdd:NF041261  481 QLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSELPATTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRY 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   553 DHDRFGQVTAVHREEGLSQYRAYDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGSRNGTQYDAWGKAICTTQGGLT 632
Cdd:NF041261  561 EYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQGGLT 640
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   633 RSMEYDAAGRVIRLTSENGSHTTFRYDVLDRLIQETGFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNG 712
Cdd:NF041261  641 RSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGKLTQSEDEGLVTLWHYDESDRITHRTVNG 720
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   713 ETAERWQYDERGWLTDISHISEGHRVTVHYGYDSKGRLASEHLTVHHPQTNELLWQHETRHAYNAQGLANRCIPDSLPAV 792
Cdd:NF041261  721 EPAEQWQYDEHGWLTDISHLSEGHRVAVHYGYDDKGRLTGERQTVENPETGELLWQHETGHAYNEQGLANRVTPDSLPPV 800
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   793 EWLTYGSGWLSGMKLGDTPLVEYTRDRLHRETLRSFG------RYELTTAYTPAGQLQSQHLNSLLSDRDYTWNDNGELI 866
Cdd:NF041261  801 EWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGgagsnaAYELTTAYTPAGQLQSQHLNSLVYDRDYTWNDNGDLV 880
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   867 RISSPRQTRSYSYSTTGRLTGVHTTAANLDIRIPYTTDPAGNRLPDPELHPDSALSMWPDNRIARDAHYLYRYDRHGRLT 946
Cdd:NF041261  881 RISGPRQTREYGYSATGRLTGVHTTAANLDIRIPYATDPAGNRLPDPELHPDSTLTAWPDNRIAEDAHYVYRYDEYGRLT 960
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   947 EKTDLIPEGVIRTDDERTHRYHYDSQHRLVHYTRTQYAEPLVESRYLYDPLGRRVAKRVWRRERDLTGWMSLSRKPQVTW 1026
Cdd:NF041261  961 EKTDRIPEGVIRTDDERTHHYHYDSQHRLVFYTRIQHGEPLVESRYLYDPLGRRMAKRVWRRERDLTGWMSLSRKPEVTW 1040
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  1027 YGWDGDRLTTIQNDRTRIQTIYQPGSFTPLIRVETATGELAKTQRRSLADTLQQSGGEDGGSVVFPPVLVQMLDRLESEI 1106
Cdd:NF041261 1041 YGWDGDRLTTVQTDTTRIQTVYQPGSFTPLIRVETENGERAKAQRRSLAETLQQEGSENGHGVVFPAELVRMLDRLEEEI 1120
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  1107 LADRVSEESRRWLASCGLTVAQMQSQMDPVYTPARKIHLYHCDHRGLPLALISTEGTTAWYAEYDEWGNLLNEENPHQLQ 1186
Cdd:NF041261 1121 RADRVSEESRAWLAQCGLTVEQMARQVEPEYTPARKLHLYHCDHRGLPLALISEEGNTAWQGEYDEWGNLLNEENPHHLQ 1200
                        1210      1220      1230      1240      1250      1260
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 16128676  1187 QLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQYPLNPISNIDPLGL 1247
Cdd:NF041261 1201 QPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
133-1321 5.71e-32

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 136.04  E-value: 5.71e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  133 YSRSESLWLVRGGVAKLDEGHRLAALWQALPEELRLSPHRYLATNSPQGPWWLLGWCERVPEADEVLPAPLPPYRVLTGL 212
Cdd:COG3209   61 STTDVVGTLTGAGGTSAGGVTALGDASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGAT 140
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  213 VDRFGRTQTFHREAAGEFSGEITGVTDGAGRHFRLVLTTQAQRAEEARQQAISGGTepSAFPDTLPGYTEYGRDNGIRLS 292
Cdd:COG3209  141 AGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLA--GSALLALGSGAILGGLAGAYSG 218
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  293 AVWLTHDPEYPENLPAAPLVRYGWTPRGELAAVYDRSNTQVRSFTYDDKYRGRMVAHRHTGRPEICYRYDSDGRVTEQLN 372
Cdd:COG3209  219 SATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGA 298
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  373 PAGLSYTYQYEKDRITITDSLNRREVLHTQGEGGLKRVVKKEHADGSVTQSQFDAVGRLRAQTDAAGRTTEYSPDVVTGL 452
Cdd:COG3209  299 GLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGST 378
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  453 ITRITTPDGRASAFYYNHHSQLTSATGPDGLEIRREYDEWGRLIQETAPDGDITRYRYDNPHSDLPCATEDATGSRKTMT 532
Cdd:COG3209  379 SGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGAS 458
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  533 WSRYGQLLSFTDCSGYVTRYDHDRFGQVTAVHREEGLSQYRAYDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGSR 612
Cdd:COG3209  459 GTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLS 538
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  613 NGTQYDAWGKAICTTQGGLTRSMEYDAAGRVIRLTSENGSHTTFRYDVLDRLIQETGFDGRTQRYHHDLTGKLIRSEDEG 692
Cdd:COG3209  539 ATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGL 618
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  693 LVTHWHYDEADRLTHRTVNGETAERWQYDERGWLTDISHISEGHRVTVHYGYDSKGRLASEHLTVHHPQTNELLWQHeTR 772
Cdd:COG3209  619 TLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATT-GA 697
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  773 HAYNAQGLANRCIPDSLPAVEWLTYGSGWLSGMKLGDTPLVEYTRDRLHRETLRSFGRYELTTAYTPAGQLQSQHLNSLL 852
Cdd:COG3209  698 TTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGV 777
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  853 SDRD----YTWNDNGELIRISSPR-QTRSYSYSTTGRLTGVHTTAANLDIRIpyttdpagnrlpdpelhpdsalsmwpdn 927
Cdd:COG3209  778 TQGTyttrYTYDALGRLTSVTYPDgETVTYTYDALGRLTSVITVGSGGGTDL---------------------------- 829
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  928 riardAHYLYRYDRHGRLTEKTDLIPEGvirtddERTHRYHYDSQHRLVHYTRTQYAEplvesRYLYDPLGRRVAKRVWr 1007
Cdd:COG3209  830 -----QDRTYTYDAAGNITSITDALRAG------TLTQTYTYDALGRLTSATDPGTTE-----SYTYDANGNLTSRTDG- 892
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676 1008 rerdltgwmslsrkpQVTWYGWDG-DRLTTIQNDrtriqtiyqpgsftplirvetatgelaktqrrsladtlqqsggeDG 1086
Cdd:COG3209  893 ---------------GTTTYTYDAlGRLVSVTKP--------------------------------------------DG 913
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676 1087 GSVVFppvlvqmldrleseiladrvseesrrwlascgltvaqmqsqmdpVYTParkihLYHCDHRGLPLALISTEGTTAW 1166
Cdd:COG3209  914 TTTTY--------------------------------------------TYDA-----LGHTDHLGSVRALTDASGQVVW 944
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676 1167 YAEYDEWGNLLNEENPHQLQQLiRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQYPL-NPISNIDPL 1245
Cdd:COG3209  945 RYDYDPFGNLLAETSGAAANPL-RFTGQEYDAETGLYYNGARYYDPALGRFLSPDPIGLAGGLNLYAYVGnNPVNYVDPL 1023
                       1130      1140      1150      1160      1170      1180      1190
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 16128676 1246 GLETL--KCIKPLHSMGGTGERSGPDIWGNPFYHQYLCVPDGKGDYTCGGQDQRGESKGDGLWGPGKASNDTKEAAGR 1321
Cdd:COG3209 1024 GLAALlgTTGLGGGAGVGAGAAGGGAAAAGGSAGAGAAGGGAGGAGAGGAGGGAGAGAGAAAGAAGGAGGGAGASGAG 1101
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1170-1247 1.52e-30

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 115.29  E-value: 1.52e-30
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 16128676   1170 YDEWGNLLNEENPhqLQQLIRLPGQQYDEESGLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQY-PLNPISNIDPLGL 1247
Cdd:TIGR03696    1 YDPYGEVLSESGA--APNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYvGNNPVNWVDPLGL 77
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
46-123 3.98e-18

Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.


Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 79.88  E-value: 3.98e-18
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 16128676     46 GHPVNPLLGAKVLPgETDIALPGPLPFILSRTYSSYRTKTpapvGSLGPGWKMPADIRLQLR-DNTLILSDNGGRSLYF 123
Cdd:pfam20148    1 GDPVNVATGNKVLE-ETDFSLPGPLPLVWTRTYNSSSERD----GPLGPGWSHPYDQRLELEgDGGVVYIDADGREVTF 74
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
5-752 3.19e-17

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 87.89  E-value: 3.19e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676    5 PAARQGDMTQYGGSIVQGSAGVRIGAPTGVACSVCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSSYRTK 84
Cdd:COG3209  134 GGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLA 213
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676   85 TPAPVGSLGPGWKMPADIRLQLRDNTLILSDNGGRSLYFEHLFPGEDGYSRSESLWLVRGGVAKLDEGHRLAALWQALPE 164
Cdd:COG3209  214 GAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAG 293
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  165 ELRLSPHRYLATNSPQGPwwllGWCERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGEFSGEITGVTDGAGRH 244
Cdd:COG3209  294 GLGGAGLGSGGAGGGGTA----GGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTS 369
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  245 FRLVLTTQAQRAEEARQQAISGGTEPSAFPDTLPGYTEYGRDNGIRLSAVWLTHDPEYPENLPAAPLVRYGWTPRGELAA 324
Cdd:COG3209  370 VGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDAT 449
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  325 VYDRSNTQVRSFTYDDKYRGRMVAHRHTGRPEICYRYDSDGRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLHTQGE 404
Cdd:COG3209  450 TTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTL 529
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  405 GGLKRVVKKEHADGSVTQSQFDAVGRLRAQTDAAGRTTEYSPDVVTGLITRITTPDGRASAFYYNHHSQLTSATGPDGLE 484
Cdd:COG3209  530 GTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTT 609
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  485 IRREYDEWGRLIQETAPDGDITRYRYDNPHSDLPCATEDATGSRKTMTWSRYGQLLSFTDCSGYVTRYDHDRFGQVTAVH 564
Cdd:COG3209  610 TSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGT 689
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  565 REEGLSQYRAYDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGSRNGTQYDAWGKAICTTQGGLTRSMEYDAAGRVI 644
Cdd:COG3209  690 TSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLT 769
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16128676  645 RLTSENGS-----HTTFRYDVLDRLIQETGFDGRTQRYHHDLTGKLIR-------SEDEGLVTHWHYDEADRLTHRT--- 709
Cdd:COG3209  770 SETTPGGVtqgtyTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSvitvgsgGGTDLQDRTYTYDAAGNITSITdal 849
                        730       740       750       760
                 ....*....|....*....|....*....|....*....|...
gi 16128676  710 VNGETAERWQYDERGWLTDIShiseGHRVTVHYGYDSKGRLAS 752
Cdd:COG3209  850 RAGTLTQTYTYDALGRLTSAT----DPGTTESYTYDANGNLTS 888
RHS pfam03527
RHS protein;
1143-1180 1.94e-12

RHS protein;


Pssm-ID: 427349 [Multi-domain]  Cd Length: 38  Bit Score: 62.71  E-value: 1.94e-12
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 16128676   1143 IHLYHCDHRGLPLALISTEGTTAWYAEYDEWGNLLNEE 1180
Cdd:pfam03527    1 IYYYHTDHLGTPEELTDEAGEIVWSAEYDAWGNVTEER 38
PAAR_2 cd14738
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
1-29 8.35e-08

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269823  Cd Length: 94  Bit Score: 51.48  E-value: 8.35e-08
                         10        20
                 ....*....|....*....|....*....
gi 16128676    1 MSGKPAARQGDMTQYGGSIVQGSAGVRIG 29
Cdd:cd14738   66 IGGKPAARMGDSTAHGGVIVSGVPTVLIG 94
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
575-611 3.89e-07

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 47.59  E-value: 3.89e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 16128676    575 YDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGS 611
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
637-673 1.94e-06

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 45.67  E-value: 1.94e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 16128676    637 YDAAGRVIRLTSENGSHTTFRYDVLDRLIQETGFDGR 673
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
575-612 1.25e-05

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 43.35  E-value: 1.25e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 16128676    575 YDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGSR 612
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGS 38
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
3-48 2.88e-05

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 44.04  E-value: 2.88e-05
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|...
gi 16128676    3 GKPAARQGDMTQYGGSIVQGSAGVRIG----APTG--VACSVC-PGGVTSGHP 48
Cdd:COG4104    2 PKPAARLGDKTSHGGPVISGSPTVLIGgrpaARVGdkVSCPKHgPDTIAEGSP 54
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
425-462 4.81e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 41.82  E-value: 4.81e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 16128676    425 FDAVGRLRAQTDAAGRTTEYSPDvVTGLITRITTPDGR 462
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYD-AAGRLTAVTDPDGT 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
468-504 5.36e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 41.43  E-value: 5.36e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 16128676    468 YNHHSQLTSATGPDGLEIRREYDEWGRLIQETAPDGD 504
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
3-29 6.24e-05

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 42.88  E-value: 6.24e-05
                         10        20
                 ....*....|....*....|....*..
gi 16128676    3 GKPAARQGDMTQYGGSIVQGSAGVRIG 29
Cdd:COG4104   60 GKPAARVGDKTACGGTIISGSPTVLIG 86
PAAR_CT_1 cd14743
proline-alanine-alanine-arginine (PAAR) domain with C-terminal extension; This domain is found ...
2-46 3.18e-04

proline-alanine-alanine-arginine (PAAR) domain with C-terminal extension; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family of mostly gamma-proteobacteria, and forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). Some members contains C-terminal domain extensions corresponding to Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences as well as uncharacterized domains. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269828  Cd Length: 78  Bit Score: 40.75  E-value: 3.18e-04
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*...
gi 16128676    2 SGKPAARQGDMTQYGGSIVQGSAGVRI-GAPTGVACSV--CPGGVTSG 46
Cdd:cd14743   30 DGLPAARVGDKTSCGATIVSGSINVLInGKPAAVLGSTtsHGGVVIGG 77
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
533-568 1.58e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.58  E-value: 1.58e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 16128676    533 WSRYGQLLSFTDCSGYVTRYDHDRFGQVTAVHREEG 568
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
637-678 1.79e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 37.57  E-value: 1.79e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 16128676    637 YDAAGRVIRLTSENGSHTTFRYDVLDRLIQETGFDGRTQRYH 678
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
PAAR_RHS cd14742
proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement ...
3-29 4.26e-03

proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement hotspot (Rhs) extensions; This PAAR (proline-alanine-alanine-arginine) repeat subfamily, which forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS), contains C- and N-terminal domain extensions. These include Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences at the C-terminal, and various predicted functions at N- and C-terminal extensions. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269827  Cd Length: 86  Bit Score: 37.57  E-value: 4.26e-03
                         10        20
                 ....*....|....*....|....*..
gi 16128676    3 GKPAARQGDMTQYGGSIVQGSAGVRIG 29
Cdd:cd14742   60 GQPAARKGDKTTCSAVISEGSPNVFIG 86
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
489-511 5.04e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.04  E-value: 5.04e-03
                           10        20
                   ....*....|....*....|...
gi 16128676    489 YDEWGRLIQETAPDGDITRYRYD 511
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYD 23
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
3-48 5.04e-03

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 37.49  E-value: 5.04e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....
gi 16128676    3 GKPAARQGDMTQYGG----SIVQGSAGVRIG----APTGVACSvCPGGVTSGHP 48
Cdd:COG4104   29 GRPAARVGDKVSCPKhgpdTIAEGSPTVLINgkpaARVGDKTA-CGGTIISGSP 81
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
473-508 5.36e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 36.03  E-value: 5.36e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 16128676    473 QLTSATGPDGLEIRREYDEWGRLIQETAPDGDITRY 508
Cdd:TIGR01643    6 RLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
PAAR_2 cd14738
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
3-29 7.89e-03

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269823  Cd Length: 94  Bit Score: 37.23  E-value: 7.89e-03
                         10        20        30
                 ....*....|....*....|....*....|
gi 16128676    3 GKPAARQGDMTQYGG---SIVQGSAGVRIG 29
Cdd:cd14738   38 GLPAARVGDMCVCVGppdTIVQGSSTVLIG 67
PAAR_1 cd14737
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
3-29 9.84e-03

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269822  Cd Length: 94  Bit Score: 36.87  E-value: 9.84e-03
                         10        20
                 ....*....|....*....|....*..
gi 16128676    3 GKPAARQGDMTQYGGSIVQGSAGVRIG 29
Cdd:cd14737   68 GKPAARVGDPVSCGGTVAGGSPNVFIG 94
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH