Conserved domains on  [gi|446425357|ref|WP_000503212|]

RHS repeat-associated core domain-containing protein [Salmonella enterica]

Protein Classification


List of domain hits

Name Accession Description Interval E-value
RHS_core super family cl49306
RHS element core protein;
386-1445 1.80e-107

RHS element core protein;


The actual alignment was detected with superfamily member NF041261:

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 372.80  E-value: 1.80e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  386 GINMMVQKAGSALNRPVNAATGAKYLAGDDDVdfSLPGHFTLEWQRTYSSRDERTE---GMFGRGWSVLYEVCLERTpdn 462
Cdd:NF041261   33 GVACSVCPGGMTSGNPVNPLLGAKVLPGETDI--ALPGPLPFILSRTYSSYRTRTPapvGVFGPGWKAPSDIRLQLR--- 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  463 pdENCMTYVAPMGRRIDLQAVEPGSGFYSPGEGLAVRR----------------------------------SEQGHWLI 508
Cdd:NF041261  108 --DDGLILNDNGGRSIHFEPLFPGEAVYSRSESLWLVRggvaaqpdghtlaalwqalpedirlsphlylatnSAQGPWWI 185
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  509 SSddGVYRLFEAD---PSS-PQRRRLKMLGDRNSNCQHLTYDNHGRLV-EISGDRQRPCIRLHYELAAHPQRvTRIFRH- 582
Cdd:NF041261  186 LG--WSERVPGADevlPAPlPPYRVLTGMVDRFGRTLTFHREAAGDLAgEITGVTDGAGREFRLVLTTQAQR-AEEARKq 262
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  583 -----------------------------------------------YPEGEPEL-LRRYRYDEAGRLNGVVDNAGQYQR 614
Cdd:NF041261  263 rtsslsspdgprplsssafpdtlpggteygpdngirlsavwlthdpaYPESLPAApLVRYTYTEAGELLAVYDRSNTQVR 342
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  615 EFAYDDNDC--MTMHREPGGERYYYTWawfegpdDAAWRVTGHHTDSGEQYRLDWNlaERSLCVTDSLGRTRC-HWWDAQ 691
Cdd:NF041261  343 AFTYDAQHPgrMVAHRYAGRPEMCYRY-------DDTGRVTEQLNPAGLSYRYQYE--QDRITITDSLNRREVlHTEGEG 413
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  692 GLVTAYRDE-AGQMTTFRWSDEERLLLGMTDAQGGKWRYVYDRL-GHLTETHDPLGRvEQTQWHPVWHQPETEVDAAGAA 769
Cdd:NF041261  414 GLKRVVKKEhADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVsGDITDITTPDGR-ETKFYYNDGNQLTSVTSPDGLE 492
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  770 WRYEYDERGNLQAVIDPLHQRTVYGYDR-HGQV-VRITDARGGDKYLQWNEDGQLMRHTDCSGSQTAWFYDERTRLERVT 847
Cdd:NF041261  493 SRREYDEPGRLVSETSRSGETTRYRYDDpHSELpATTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRYEYDRFGQMTAVH 572
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  848 DAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDAtGRRTAYEYDAYGRL 927
Cdd:NF041261  573 REEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRI 651
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  928 TTLTNENGESYRFRYDVLDRVTEQTDPGGSRRVYGYNalnaVTAVIYGGERGGEIRHgLERDAAGRLTAK-ITPETRTKY 1006
Cdd:NF041261  652 TTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYD----LTGKLTQSEDEGLVTL-WHYDESDRITHRtVNGEPAEQW 726
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1007 RYDAADRLLEIRRRQHdaaegGEPEVIRFSYDSAGNLLSE-------ETAQGVLQHR----YDVQG--NRtetQMPDG-R 1072
Cdd:NF041261  727 QYDEHGWLTDISHLSE-----GHRVAVHYGYDDKGRLTGErqtvenpETGELLWQHEtghaYNEQGlaNR---VTPDSlP 798
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1073 TLRYLYYGSGHLQQVNLGRDVISEFTRDHLHREVQRSQGRLDTRRMYDRTGRLTR--KLTCKGMRGVVpetfIDREYAYS 1150
Cdd:NF041261  799 PVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGGAGSNAAYELTTAYTPagQLQSQHLNSLV----YDRDYTWN 874
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1151 GQDELLKKRHSRQgVTDYFYDTTGRITACRNEAY-LD---SWQYDAAANLLDrrQGETAQAGAGSVVPFNRITSYRGLHY 1226
Cdd:NF041261  875 DNGDLVRISGPRQ-TREYGYSATGRLTGVHTTAAnLDiriPYATDPAGNRLP--DPELHPDSTLTAWPDNRIAEDAHYVY 951
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1227 RYDEYGRVVEKRGR----------NGTQHYRWDAEHRLTEVAVIR-GSTVRRYGYVYDAPGRRVEK----HKLDAEG--- 1288
Cdd:NF041261  952 RYDEYGRLTEKTDRipegvirtddERTHHYHYDSQHRLVFYTRIQhGEPLVESRYLYDPLGRRMAKrvwrRERDLTGwms 1031
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1289 ---KPyNRTTFLWDGMRLAQ-ECRLGRSSSLYiysDQGSHEPLARVDRAA----------------------------PG 1336
Cdd:NF041261 1032 lsrKP-EVTWYGWDGDRLTTvQTDTTRIQTVY---QPGSFTPLIRVETENgerakaqrrslaetlqqegsenghgvvfPA 1107
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1337 E---------------------------------------------ADEVLYYHTDVNGAPEEMTDGRGNIVWEAGYQVW 1371
Cdd:NF041261 1108 ElvrmldrleeeiradrvseesrawlaqcgltveqmarqvepeytpARKLHLYHCDHRGLPLALISEEGNTAWQGEYDEW 1187
                        1210      1220      1230      1240      1250      1260      1270
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 446425357 1372 GNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISIRGGINLYQYAPNPISWIDPLGL 1445
Cdd:NF041261 1188 GNLLNEENPHHLQQPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
WHH pfam14414
A nuclease of the HNH/ENDO VII superfamily with conserved WHH; WHH is a predicted nuclease of ...
1510-1558 5.84e-13

A nuclease of the HNH/ENDO VII superfamily with conserved WHH; WHH is a predicted nuclease of the HNH/ENDO VII superfamily of the treble clef fold. The name is derived from the conserved motif WHH. It is found in bacterial polymorphic toxin systems and functions as a toxin module. WHH is the shortest version of the HNH nuclease families. Like AHH and LHH, the WHH nuclease contains four conserved histidines, of which the first is predicted to bind a metal ion and the other three are involved in activating a water molecule for hydrolysis.



Pssm-ID: 433943  Cd Length: 43  Bit Score: 64.33  E-value: 5.84e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 446425357  1510 NQKSTPRGYVWHHLDDydpvtnKGTMQLIKQGAHQGISHSGGVSQYKAA 1558
Cdd:pfam14414    1 AKGATPKGYTWHHLDD------TGTMQLVPEELHNATPHTGGVSLWKKG 43
PAAR_like super family cl21497
proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR ...
289-342 6.22e-10

proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat superfamily, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. The PAAR-repeat proteins form a diverse superfamily with several subgroups extended both N- and C-terminally by domains with various predicted functions; the termini are exposed to solution, and do not distort the VgrG binding site. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


The actual alignment was detected with superfamily member cd14742:

Pssm-ID: 451275  Cd Length: 86  Bit Score: 57.21  E-value: 6.22e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 446425357  289 DDEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVVDEpengvhvSGDVRIG 342
Cdd:cd14742    40 SKHPPPPQLIAEGSETVFINGQPAARKGDKTTCSAVISEG-------SPNVFIG 86
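
The per-hit summary lines above (Pssm-ID, Cd Length, Bit Score, E-value) follow a regular pattern, so they can be pulled into structured records. The following is a minimal Python sketch, assuming the line layout seen in this page dump; `SUMMARY_RE` and `parse_summary` are hypothetical helper names, not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this page dump.
# Assumption: the "[Multi-domain]" tag is optional and the field labels
# are spelled exactly as in the text above.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields of one hit as a dict, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    # One of the summary lines from the hit list above.
    line = "Pssm-ID: 433943  Cd Length: 43  Bit Score: 64.33  E-value: 5.84e-13"
    print(parse_summary(line))
```

Storing the E-value as a float keeps hits sortable by significance; very small values such as 1.80e-107 are still representable as ordinary double-precision numbers.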
 
Name Accession Description Interval E-value
RHS_core NF041261
RHS element core protein;
386-1445 1.80e-107

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 372.80  E-value: 1.80e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  386 GINMMVQKAGSALNRPVNAATGAKYLAGDDDVdfSLPGHFTLEWQRTYSSRDERTE---GMFGRGWSVLYEVCLERTpdn 462
Cdd:NF041261   33 GVACSVCPGGMTSGNPVNPLLGAKVLPGETDI--ALPGPLPFILSRTYSSYRTRTPapvGVFGPGWKAPSDIRLQLR--- 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  463 pdENCMTYVAPMGRRIDLQAVEPGSGFYSPGEGLAVRR----------------------------------SEQGHWLI 508
Cdd:NF041261  108 --DDGLILNDNGGRSIHFEPLFPGEAVYSRSESLWLVRggvaaqpdghtlaalwqalpedirlsphlylatnSAQGPWWI 185
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  509 SSddGVYRLFEAD---PSS-PQRRRLKMLGDRNSNCQHLTYDNHGRLV-EISGDRQRPCIRLHYELAAHPQRvTRIFRH- 582
Cdd:NF041261  186 LG--WSERVPGADevlPAPlPPYRVLTGMVDRFGRTLTFHREAAGDLAgEITGVTDGAGREFRLVLTTQAQR-AEEARKq 262
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  583 -----------------------------------------------YPEGEPEL-LRRYRYDEAGRLNGVVDNAGQYQR 614
Cdd:NF041261  263 rtsslsspdgprplsssafpdtlpggteygpdngirlsavwlthdpaYPESLPAApLVRYTYTEAGELLAVYDRSNTQVR 342
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  615 EFAYDDNDC--MTMHREPGGERYYYTWawfegpdDAAWRVTGHHTDSGEQYRLDWNlaERSLCVTDSLGRTRC-HWWDAQ 691
Cdd:NF041261  343 AFTYDAQHPgrMVAHRYAGRPEMCYRY-------DDTGRVTEQLNPAGLSYRYQYE--QDRITITDSLNRREVlHTEGEG 413
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  692 GLVTAYRDE-AGQMTTFRWSDEERLLLGMTDAQGGKWRYVYDRL-GHLTETHDPLGRvEQTQWHPVWHQPETEVDAAGAA 769
Cdd:NF041261  414 GLKRVVKKEhADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVsGDITDITTPDGR-ETKFYYNDGNQLTSVTSPDGLE 492
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  770 WRYEYDERGNLQAVIDPLHQRTVYGYDR-HGQV-VRITDARGGDKYLQWNEDGQLMRHTDCSGSQTAWFYDERTRLERVT 847
Cdd:NF041261  493 SRREYDEPGRLVSETSRSGETTRYRYDDpHSELpATTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRYEYDRFGQMTAVH 572
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  848 DAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDAtGRRTAYEYDAYGRL 927
Cdd:NF041261  573 REEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRI 651
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  928 TTLTNENGESYRFRYDVLDRVTEQTDPGGSRRVYGYNalnaVTAVIYGGERGGEIRHgLERDAAGRLTAK-ITPETRTKY 1006
Cdd:NF041261  652 TTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYD----LTGKLTQSEDEGLVTL-WHYDESDRITHRtVNGEPAEQW 726
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1007 RYDAADRLLEIRRRQHdaaegGEPEVIRFSYDSAGNLLSE-------ETAQGVLQHR----YDVQG--NRtetQMPDG-R 1072
Cdd:NF041261  727 QYDEHGWLTDISHLSE-----GHRVAVHYGYDDKGRLTGErqtvenpETGELLWQHEtghaYNEQGlaNR---VTPDSlP 798
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1073 TLRYLYYGSGHLQQVNLGRDVISEFTRDHLHREVQRSQGRLDTRRMYDRTGRLTR--KLTCKGMRGVVpetfIDREYAYS 1150
Cdd:NF041261  799 PVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGGAGSNAAYELTTAYTPagQLQSQHLNSLV----YDRDYTWN 874
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1151 GQDELLKKRHSRQgVTDYFYDTTGRITACRNEAY-LD---SWQYDAAANLLDrrQGETAQAGAGSVVPFNRITSYRGLHY 1226
Cdd:NF041261  875 DNGDLVRISGPRQ-TREYGYSATGRLTGVHTTAAnLDiriPYATDPAGNRLP--DPELHPDSTLTAWPDNRIAEDAHYVY 951
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1227 RYDEYGRVVEKRGR----------NGTQHYRWDAEHRLTEVAVIR-GSTVRRYGYVYDAPGRRVEK----HKLDAEG--- 1288
Cdd:NF041261  952 RYDEYGRLTEKTDRipegvirtddERTHHYHYDSQHRLVFYTRIQhGEPLVESRYLYDPLGRRMAKrvwrRERDLTGwms 1031
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1289 ---KPyNRTTFLWDGMRLAQ-ECRLGRSSSLYiysDQGSHEPLARVDRAA----------------------------PG 1336
Cdd:NF041261 1032 lsrKP-EVTWYGWDGDRLTTvQTDTTRIQTVY---QPGSFTPLIRVETENgerakaqrrslaetlqqegsenghgvvfPA 1107
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1337 E---------------------------------------------ADEVLYYHTDVNGAPEEMTDGRGNIVWEAGYQVW 1371
Cdd:NF041261 1108 ElvrmldrleeeiradrvseesrawlaqcgltveqmarqvepeytpARKLHLYHCDHRGLPLALISEEGNTAWQGEYDEW 1187
                        1210      1220      1230      1240      1250      1260      1270
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 446425357 1372 GNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISIRGGINLYQYAPNPISWIDPLGL 1445
Cdd:NF041261 1188 GNLLNEENPHHLQQPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
594-1476 1.29e-42

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 170.71  E-value: 1.29e-42
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  594 YRYDEAGRLNGVVDNAGQYQREFAYDDNDCMTMHREPGGERYYYTWAWFEGPDDAAWRVTGHHTDSGEQYRLDWNLAERS 673
Cdd:COG3209   319 GTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGS 398
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  674 LCVTDSLGRTRCHWWDAQGLVTAYRDEAGQMTTFRWSDEERLLLGMTDAQGGKWRYVYDRLGHLTETHDPLGRVEQTQWH 753
Cdd:COG3209   399 STTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTE 478
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  754 PVWHQPETEVDAAGAAWRYEYDERGNLQAVIDPLHQRTVYGYDRHGQVVRIT---DARGGDKYLQWNEDGQLMRHTDCSG 830
Cdd:COG3209   479 AGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGtttTATLSATDATGTGDTTTTGTVGTGT 558
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  831 SQTAWFYDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQT 910
Cdd:COG3209   559 STGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTG 638
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  911 DATGRRTAYEYDAYGRLTTLTNENGESYRFRYDVLDRVTEQTDPGGSRRVYGYNALNAVTAVIYGGERGgeiRHGLERDA 990
Cdd:COG3209   639 STTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTT---VTTLAGGT 715
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  991 AGRLTAKITPETRTKYRYDAADRLLEIRRRQHDAAEGGEPEVIRFSYDSAGNLLSEETAQGV------LQHRYDVQGNRT 1064
Cdd:COG3209   716 TTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVtqgtytTRYTYDALGRLT 795
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1065 ETQMPDGRTLRYLYYGSGHLQQVnlgrdviseftrdhLHREVQRSQGRLDTRRMYDRTGRLTRKltckgmrgvvpetfid 1144
Cdd:COG3209   796 SVTYPDGETVTYTYDALGRLTSV--------------ITVGSGGGTDLQDRTYTYDAAGNITSI---------------- 845
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1145 REYAYSGQDellkkrhsrqgVTDYFYDTTGRITACRNEAYLDSWQYDAAANLLdrrqgetaqagagsvvpfnritsyrgl 1224
Cdd:COG3209   846 TDALRAGTL-----------TQTYTYDALGRLTSATDPGTTESYTYDANGNLT--------------------------- 887
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1225 hyrydeygrvveKRGRNGTQHYRWDAEHRLTEVAVIRGSTVRrygYVYDAPGrrvekhkldaegkpynrttflwdgmrla 1304
Cdd:COG3209   888 ------------SRTDGGTTTYTYDALGRLVSVTKPDGTTTT---YTYDALG---------------------------- 924
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1305 qecrlgrssslyiysdqgsheplarvdraapgeadevlyyHTDVNGAPEEMTDGRGNIVWEAGYQVWGNLTHEKETrPVQ 1384
Cdd:COG3209   925 ----------------------------------------HTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSG-AAA 963
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1385 QNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISIRGGINLYQYA-PNPISWIDPLGLAVDPIAKLEDRGYTGVTR 1463
Cdd:COG3209   964 NPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPDPIGLAGGLNLYAYVgNNPVNYVDPLGLAALLGTTGLGGGAGVGAG 1043
                         890
                  ....*....|...
gi 446425357 1464 TSGGGLDYSDSNA 1476
Cdd:COG3209  1044 AAGGGAAAAGGSA 1056
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1368-1445 4.12e-31

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 117.22  E-value: 4.12e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 446425357  1368 YQVWGNLTHEKEtrPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISIRGGINLYQYAP-NPISWIDPLGL 1445
Cdd:TIGR03696    1 YDPYGEVLSESG--AAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL 77
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
401-478 2.91e-19

Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.


Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 83.35  E-value: 2.91e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 446425357   401 PVNAATGAKYLagdDDVDFSLPGHFTLEWQRTYSSRDERTeGMFGRGWSVLYEVCLERTpdnpDENCMTYVAPMGRRI 478
Cdd:pfam20148    3 PVNVATGNKVL---EETDFSLPGPLPLVWTRTYNSSSERD-GPLGPGWSHPYDQRLELE----GDGGVVYIDADGREV 72
WHH pfam14414
A nuclease of the HNH/ENDO VII superfamily with conserved WHH; WHH is a predicted nuclease of ...
1510-1558 5.84e-13

A nuclease of the HNH/ENDO VII superfamily with conserved WHH; WHH is a predicted nuclease of the HNH/ENDO VII superfamily of the treble clef fold. The name is derived from the conserved motif WHH. It is found in bacterial polymorphic toxin systems and functions as a toxin module. WHH is the shortest version of the HNH nuclease families. Like AHH and LHH, the WHH nuclease contains four conserved histidines, of which the first is predicted to bind a metal ion and the other three are involved in activating a water molecule for hydrolysis.


Pssm-ID: 433943  Cd Length: 43  Bit Score: 64.33  E-value: 5.84e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 446425357  1510 NQKSTPRGYVWHHLDDydpvtnKGTMQLIKQGAHQGISHSGGVSQYKAA 1558
Cdd:pfam14414    1 AKGATPKGYTWHHLDD------TGTMQLVPEELHNATPHTGGVSLWKKG 43
PAAR_RHS cd14742
proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement ...
289-342 6.22e-10

proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement hotspot (Rhs) extensions; This PAAR (proline-alanine-alanine-arginine) repeat subfamily, which forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS), contains C- and N-terminal domain extensions. These include Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences at the C-terminus, and various predicted functions at N- and C-terminal extensions. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269827  Cd Length: 86  Bit Score: 57.21  E-value: 6.22e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 446425357  289 DDEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVVDEpengvhvSGDVRIG 342
Cdd:cd14742    40 SKHPPPPQLIAEGSETVFINGQPAARKGDKTTCSAVISEG-------SPNVFIG 86
PAAR_motif pfam05488
PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. ...
274-326 8.51e-04

PAAR motif; This motif is usually found in pairs in a family of bacterial membrane proteins. It is also found as a triplet of tandem repeats comprising the entire length in another family of hypothetical proteins.


Pssm-ID: 428491  Cd Length: 71  Bit Score: 39.48  E-value: 8.51e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 446425357   274 AASALIGSVSNLFKGD----DEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVV 326
Cdd:pfam05488   15 SPTVLIGGKPAARVGDlvvcPPCGGGGPIAEGSPTVLINGKPAAREGDKTACGATLI 71
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
298-343 3.33e-03

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 38.26  E-value: 3.33e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 446425357  298 IAEGTRDVRINSQPAARSGVRCTCEAKVVDepenGvhvSGDVRIGG 343
Cdd:COG4104    49 IAEGSPTVLINGKPAARVGDKTACGGTIIS----G---SPTVLIGG 87
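
Each alignment block above pairs a query row (gi 446425357) with a model row (Cdd:...), so a rough identity count can be derived from one such pair. The sketch below is an illustration only. It assumes that lowercase letters mark unaligned residues and that dashes mark gaps, which matches how the rows above appear; `alignment_identity` is a hypothetical helper name, and the result is not a score reported by the page.

```python
def alignment_identity(query_row, model_row):
    """Count identical residues over the aligned columns of one row pair.

    Returns (identical, aligned). Columns with a gap ("-") or a lowercase
    letter in either row are skipped; treating lowercase as unaligned is
    an assumption about this output format.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue  # gap or unaligned position
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Sequence portions of the COG4104 rows above, without coordinates.
    query = "IAEGTRDVRINSQPAARSGVRCTCEAKVVDepenGvhvSGDVRIGG"
    model = "IAEGSPTVLINGKPAARVGDKTACGGTIIS----G---SPTVLIGG"
    identical, aligned = alignment_identity(query, model)
    print(f"{identical} identical of {aligned} aligned columns")
```

For hits that span several 80-column blocks, the sequence portions of the rows would first need to be concatenated per sequence before calling the function.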
 
Name Accession Description Interval E-value
RHS_core NF041261
RHS element core protein;
386-1445 1.80e-107

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 372.80  E-value: 1.80e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  386 GINMMVQKAGSALNRPVNAATGAKYLAGDDDVdfSLPGHFTLEWQRTYSSRDERTE---GMFGRGWSVLYEVCLERTpdn 462
Cdd:NF041261   33 GVACSVCPGGMTSGNPVNPLLGAKVLPGETDI--ALPGPLPFILSRTYSSYRTRTPapvGVFGPGWKAPSDIRLQLR--- 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  463 pdENCMTYVAPMGRRIDLQAVEPGSGFYSPGEGLAVRR----------------------------------SEQGHWLI 508
Cdd:NF041261  108 --DDGLILNDNGGRSIHFEPLFPGEAVYSRSESLWLVRggvaaqpdghtlaalwqalpedirlsphlylatnSAQGPWWI 185
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  509 SSddGVYRLFEAD---PSS-PQRRRLKMLGDRNSNCQHLTYDNHGRLV-EISGDRQRPCIRLHYELAAHPQRvTRIFRH- 582
Cdd:NF041261  186 LG--WSERVPGADevlPAPlPPYRVLTGMVDRFGRTLTFHREAAGDLAgEITGVTDGAGREFRLVLTTQAQR-AEEARKq 262
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  583 -----------------------------------------------YPEGEPEL-LRRYRYDEAGRLNGVVDNAGQYQR 614
Cdd:NF041261  263 rtsslsspdgprplsssafpdtlpggteygpdngirlsavwlthdpaYPESLPAApLVRYTYTEAGELLAVYDRSNTQVR 342
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  615 EFAYDDNDC--MTMHREPGGERYYYTWawfegpdDAAWRVTGHHTDSGEQYRLDWNlaERSLCVTDSLGRTRC-HWWDAQ 691
Cdd:NF041261  343 AFTYDAQHPgrMVAHRYAGRPEMCYRY-------DDTGRVTEQLNPAGLSYRYQYE--QDRITITDSLNRREVlHTEGEG 413
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  692 GLVTAYRDE-AGQMTTFRWSDEERLLLGMTDAQGGKWRYVYDRL-GHLTETHDPLGRvEQTQWHPVWHQPETEVDAAGAA 769
Cdd:NF041261  414 GLKRVVKKEhADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVsGDITDITTPDGR-ETKFYYNDGNQLTSVTSPDGLE 492
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  770 WRYEYDERGNLQAVIDPLHQRTVYGYDR-HGQV-VRITDARGGDKYLQWNEDGQLMRHTDCSGSQTAWFYDERTRLERVT 847
Cdd:NF041261  493 SRREYDEPGRLVSETSRSGETTRYRYDDpHSELpATTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRYEYDRFGQMTAVH 572
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  848 DAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDAtGRRTAYEYDAYGRL 927
Cdd:NF041261  573 REEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRI 651
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  928 TTLTNENGESYRFRYDVLDRVTEQTDPGGSRRVYGYNalnaVTAVIYGGERGGEIRHgLERDAAGRLTAK-ITPETRTKY 1006
Cdd:NF041261  652 TTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYD----LTGKLTQSEDEGLVTL-WHYDESDRITHRtVNGEPAEQW 726
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1007 RYDAADRLLEIRRRQHdaaegGEPEVIRFSYDSAGNLLSE-------ETAQGVLQHR----YDVQG--NRtetQMPDG-R 1072
Cdd:NF041261  727 QYDEHGWLTDISHLSE-----GHRVAVHYGYDDKGRLTGErqtvenpETGELLWQHEtghaYNEQGlaNR---VTPDSlP 798
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1073 TLRYLYYGSGHLQQVNLGRDVISEFTRDHLHREVQRSQGRLDTRRMYDRTGRLTR--KLTCKGMRGVVpetfIDREYAYS 1150
Cdd:NF041261  799 PVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGGAGSNAAYELTTAYTPagQLQSQHLNSLV----YDRDYTWN 874
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1151 GQDELLKKRHSRQgVTDYFYDTTGRITACRNEAY-LD---SWQYDAAANLLDrrQGETAQAGAGSVVPFNRITSYRGLHY 1226
Cdd:NF041261  875 DNGDLVRISGPRQ-TREYGYSATGRLTGVHTTAAnLDiriPYATDPAGNRLP--DPELHPDSTLTAWPDNRIAEDAHYVY 951
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1227 RYDEYGRVVEKRGR----------NGTQHYRWDAEHRLTEVAVIR-GSTVRRYGYVYDAPGRRVEK----HKLDAEG--- 1288
Cdd:NF041261  952 RYDEYGRLTEKTDRipegvirtddERTHHYHYDSQHRLVFYTRIQhGEPLVESRYLYDPLGRRMAKrvwrRERDLTGwms 1031
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1289 ---KPyNRTTFLWDGMRLAQ-ECRLGRSSSLYiysDQGSHEPLARVDRAA----------------------------PG 1336
Cdd:NF041261 1032 lsrKP-EVTWYGWDGDRLTTvQTDTTRIQTVY---QPGSFTPLIRVETENgerakaqrrslaetlqqegsenghgvvfPA 1107
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1337 E---------------------------------------------ADEVLYYHTDVNGAPEEMTDGRGNIVWEAGYQVW 1371
Cdd:NF041261 1108 ElvrmldrleeeiradrvseesrawlaqcgltveqmarqvepeytpARKLHLYHCDHRGLPLALISEEGNTAWQGEYDEW 1187
                        1210      1220      1230      1240      1250      1260      1270
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 446425357 1372 GNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISIRGGINLYQYAPNPISWIDPLGL 1445
Cdd:NF041261 1188 GNLLNEENPHHLQQPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
594-1476 1.29e-42

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 170.71  E-value: 1.29e-42
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  594 YRYDEAGRLNGVVDNAGQYQREFAYDDNDCMTMHREPGGERYYYTWAWFEGPDDAAWRVTGHHTDSGEQYRLDWNLAERS 673
Cdd:COG3209   319 GTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGS 398
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  674 LCVTDSLGRTRCHWWDAQGLVTAYRDEAGQMTTFRWSDEERLLLGMTDAQGGKWRYVYDRLGHLTETHDPLGRVEQTQWH 753
Cdd:COG3209   399 STTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTE 478
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  754 PVWHQPETEVDAAGAAWRYEYDERGNLQAVIDPLHQRTVYGYDRHGQVVRIT---DARGGDKYLQWNEDGQLMRHTDCSG 830
Cdd:COG3209   479 AGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGtttTATLSATDATGTGDTTTTGTVGTGT 558
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  831 SQTAWFYDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQT 910
Cdd:COG3209   559 STGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTG 638
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  911 DATGRRTAYEYDAYGRLTTLTNENGESYRFRYDVLDRVTEQTDPGGSRRVYGYNALNAVTAVIYGGERGgeiRHGLERDA 990
Cdd:COG3209   639 STTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTT---VTTLAGGT 715
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  991 AGRLTAKITPETRTKYRYDAADRLLEIRRRQHDAAEGGEPEVIRFSYDSAGNLLSEETAQGV------LQHRYDVQGNRT 1064
Cdd:COG3209   716 TTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVtqgtytTRYTYDALGRLT 795
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1065 ETQMPDGRTLRYLYYGSGHLQQVnlgrdviseftrdhLHREVQRSQGRLDTRRMYDRTGRLTRKltckgmrgvvpetfid 1144
Cdd:COG3209   796 SVTYPDGETVTYTYDALGRLTSV--------------ITVGSGGGTDLQDRTYTYDAAGNITSI---------------- 845
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1145 REYAYSGQDellkkrhsrqgVTDYFYDTTGRITACRNEAYLDSWQYDAAANLLdrrqgetaqagagsvvpfnritsyrgl 1224
Cdd:COG3209   846 TDALRAGTL-----------TQTYTYDALGRLTSATDPGTTESYTYDANGNLT--------------------------- 887
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1225 hyrydeygrvveKRGRNGTQHYRWDAEHRLTEVAVIRGSTVRrygYVYDAPGrrvekhkldaegkpynrttflwdgmrla 1304
Cdd:COG3209   888 ------------SRTDGGTTTYTYDALGRLVSVTKPDGTTTT---YTYDALG---------------------------- 924
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1305 qecrlgrssslyiysdqgsheplarvdraapgeadevlyyHTDVNGAPEEMTDGRGNIVWEAGYQVWGNLTHEKETrPVQ 1384
Cdd:COG3209   925 ----------------------------------------HTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSG-AAA 963
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357 1385 QNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISIRGGINLYQYA-PNPISWIDPLGLAVDPIAKLEDRGYTGVTR 1463
Cdd:COG3209   964 NPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPDPIGLAGGLNLYAYVgNNPVNYVDPLGLAALLGTTGLGGGAGVGAG 1043
                         890
                  ....*....|...
gi 446425357 1464 TSGGGLDYSDSNA 1476
Cdd:COG3209  1044 AAGGGAAAAGGSA 1056
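
Each hit in this listing carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch of pulling those fields out of the text, assuming the line layout shown on this page. The function name `parse_summary` and the returned dictionary keys are hypothetical and not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" tag is optional: some hits (e.g. pfam14414) omit it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "evalue": float(m.group("evalue")),
    }


# Summary line copied from the COG3209 hit above.
cog_hit = parse_summary(
    "Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 170.71  E-value: 1.29e-42"
)
print(cog_hit)
```

Parsing the E-value as a float keeps hits sortable by significance; the regular expression is deliberately tolerant of the variable spacing between fields.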
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1368-1445 4.12e-31

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 117.22  E-value: 4.12e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 446425357  1368 YQVWGNLTHEKEtrPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISIRGGINLYQYAP-NPISWIDPLGL 1445
Cdd:TIGR03696    1 YDPYGEVLSESG--AAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL 77
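
The paired rows in these alignments can be compared column by column. The sketch below counts identical residues over aligned columns for the TIGR03696 hit above. It assumes that uppercase letters mark aligned residues, that lowercase letters mark unaligned residues, and that "-" marks a gap; the function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row, subject_row):
    """Count (identical, aligned) columns in one pair of alignment rows.

    Assumption: a column is aligned only when both characters are
    uppercase residues. Lowercase residues and '-' gaps are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    matches = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                matches += 1
    return matches, aligned


# Rows copied from the TIGR03696 alignment above (coordinates stripped).
QUERY = "YQVWGNLTHEKEtrPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISIRGGINLYQYAP-NPISWIDPLGL"
SUBJECT = "YDPYGEVLSESG--AAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL"

matches, aligned = aligned_identity(QUERY, SUBJECT)
print(f"{matches} identical residues over {aligned} aligned columns")
```

This is a plain identity count, not the bit score reported by CD-Search, which is derived from a position-specific scoring matrix.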
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
401-478 2.91e-19

Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.


Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 83.35  E-value: 2.91e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 446425357   401 PVNAATGAKYLagdDDVDFSLPGHFTLEWQRTYSSRDERTeGMFGRGWSVLYEVCLERTpdnpDENCMTYVAPMGRRI 478
Cdd:pfam20148    3 PVNVATGNKVL---EETDFSLPGPLPLVWTRTYNSSSERD-GPLGPGWSHPYDQRLELE----GDGGVVYIDADGREV 72
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
593-1008 3.46e-18

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 91.36  E-value: 3.46e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  593 RYRYDEAGRLNGVVDNAGQYQREFAYDDNDCMTMHREPGGERYYYTWAWFEGPDDAAWRVTGHHTDSGEQYRLDWNLAER 672
Cdd:COG3209   554 VGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERA 633
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  673 SLCVTDSLGRTRCHWWDAQGLVTAYRDEAGQMTTFRWSDEERLLLGMTDAQGGKWRYVYDRLGHLTETHDPLGRVEQTQW 752
Cdd:COG3209   634 TASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAG 713
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  753 HPVWHQPETEVDAAGAAWRYEYDERGNLQAVIDPLHQRT------VYGYDRHGQVVRITDARGgdkylqwnedgqlmrhT 826
Cdd:COG3209   714 GTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTttagalTYTYDALGRLTSETTPGG----------------V 777
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  827 DCSGSQTAWFYDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGRTE------RYQPDAAGRLVKYTSPAGQITRWQR 900
Cdd:COG3209   778 TQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGtdlqdrTYTYDAAGNITSITDALRAGTLTQT 857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  901 ---DGQGRVRRQTDATGRRTaYEYDAYGRLTTLTNENGESYrfRYDVLDRVTEQTDPGGSRRVYGYNALN------AVTA 971
Cdd:COG3209   858 ytyDALGRLTSATDPGTTES-YTYDANGNLTSRTDGGTTTY--TYDALGRLVSVTKPDGTTTTYTYDALGhtdhlgSVRA 934
                         410       420       430
                  ....*....|....*....|....*....|....*..
gi 446425357  972 VIyggERGGEIRHGLERDAAGRLTAKITPETRTKYRY 1008
Cdd:COG3209   935 LT---DASGQVVWRYDYDPFGNLLAETSGAAANPLRF 968
WHH pfam14414
A nuclease of the HNH/ENDO VII superfamily with conserved WHH; WHH is a predicted nuclease of ...
1510-1558 5.84e-13

A nuclease of the HNH/ENDO VII superfamily with conserved WHH; WHH is a predicted nuclease of the HNH/ENDO VII superfamily of the treble clef fold. The name is derived from the conserved motif WHH. It is found in bacterial polymorphic toxin systems and functions as a toxin module. WHH is the shortest version of the HNH nuclease families. Like AHH and LHH, the WHH nuclease contains four conserved histidines, of which the first is predicted to bind a metal ion and the other three are involved in activating a water molecule for hydrolysis.


Pssm-ID: 433943  Cd Length: 43  Bit Score: 64.33  E-value: 5.84e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 446425357  1510 NQKSTPRGYVWHHLDDydpvtnKGTMQLIKQGAHQGISHSGGVSQYKAA 1558
Cdd:pfam14414    1 AKGATPKGYTWHHLDD------TGTMQLVPEELHNATPHTGGVSLWKKG 43
RHS pfam03527
RHS protein;
1343-1377 8.57e-12

RHS protein;


Pssm-ID: 427349 [Multi-domain]  Cd Length: 38  Bit Score: 61.17  E-value: 8.57e-12
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 446425357  1343 YYHTDVNGAPEEMTDGRGNIVWEAGYQVWGNLTHE 1377
Cdd:pfam03527    3 YYHTDHLGTPEELTDEAGEIVWSAEYDAWGNVTEE 37
PAAR_RHS cd14742
proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement ...
289-342 6.22e-10

proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement hotspot (Rhs) extensions; This PAAR (proline-alanine-alanine-arginine) repeat subfamily, which forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS), contains C- and N-terminal domain extensions. These include Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences at the C-terminal, and various predicted functions at N- and C-terminal extensions. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269827  Cd Length: 86  Bit Score: 57.21  E-value: 6.22e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 446425357  289 DDEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVVDEpengvhvSGDVRIG 342
Cdd:cd14742    40 SKHPPPPQLIAEGSETVFINGQPAARKGDKTTCSAVISEG-------SPNVFIG 86
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
921-957 9.06e-08

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 49.52  E-value: 9.06e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425357   921 YDAYGRLTTLTNENGESYRFRYDVLDRVTEQTDPGGS 957
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
901-936 9.33e-08

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 49.52  E-value: 9.33e-08
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 446425357   901 DGQGRVRRQTDATGRRTAYEYDAYGRLTTLTNENGE 936
Cdd:pfam05593    2 DAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
880-920 4.51e-07

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 47.58  E-value: 4.51e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 446425357   880 DAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDATGRRTAYE 920
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
858-894 7.74e-07

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 46.82  E-value: 7.74e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425357   858 YDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQ 894
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Colicin-DNase pfam12639
DNase/tRNase domain of colicin-like bacteriocin; Colicin-like bacteriocins are complex ...
1474-1550 3.61e-06

DNase/tRNase domain of colicin-like bacteriocin; Colicin-like bacteriocins are complex structures with an N-terminal beta-barrel translocation domain (pfam09000), a long double-alpha-helical receptor-binding domain (pfam11570) and this C-terminal RNAse/DNase domain with endonuclease activity. Their competitor bacteriocidal action is by a process that involves binding to a surface receptor, entering the cell, and, finally, killing it. The lethal action of colicin E3 is a specific cleavage in the ribosomal decoding A site. The crystal structure of colicin E3 reveals a Y-shaped molecule with the receptor binding domain forming a 100 Angstrom long stalk and the two globular heads of the translocation domain and this catalytic domain comprising the two arms.


Pssm-ID: 432688  Cd Length: 96  Bit Score: 47.08  E-value: 3.61e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 446425357  1474 SNALYnKRPGVNPVVTIEYSGDYLKDFERantaaklnqKSTPRGYVWHHLDDYdpvtnkGTMQLIKQGAHQGISHSG 1550
Cdd:pfam12639   36 NKALK-EEVANDPELANQFTKEQLEGIEN---------GKTPEGYTWHHHQDT------GTMQLVPTEIHDKTGHTG 96
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
774-809 8.15e-06

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 44.13  E-value: 8.15e-06
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 446425357   774 YDERGNLQAVIDPLHQRTVYGYDRHGQVVRITDARG 809
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
921-959 1.93e-05

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 42.96  E-value: 1.93e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 446425357   921 YDAYGRLTTLTNENGESYRFRYDVLDRVTEQTDPGGSRR 959
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGST 39
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
880-915 3.70e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 42.20  E-value: 3.70e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 446425357   880 DAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDATGR 915
Cdd:pfam05593    2 DAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
837-878 9.40e-05

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 41.04  E-value: 9.40e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 446425357   837 YDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGRTERYQ 878
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
837-873 1.64e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 40.27  E-value: 1.64e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425357   837 YDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGR 873
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
PAAR_like cd14671
proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR ...
243-326 2.44e-04

proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat superfamily, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. The PAAR-repeat proteins form a diverse superfamily with several subgroups extended both N- and C-terminally by domains with various predicted functions; the termini are exposed to solution, and do not distort the VgrG binding site. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269821  Cd Length: 77  Bit Score: 41.16  E-value: 2.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  243 MAQKLQALTDDPVGTVL--GAMNPFgaleIGFQAAsALIGSVsnlfkgDDEPPAAEYIAEGTRDVRINSQPAARSGVRCT 320
Cdd:cd14671     1 PAARVGDPTAHTPGGPVisGSPNVF----INGRPA-ARVGDV------GDHPGGGNAIVSGSGTVFINGKPAARVGDRTS 69

                  ....*.
gi 446425357  321 CEAKVV 326
Cdd:cd14671    70 CGGVIV 75
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
399-782 4.62e-04

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 45.13  E-value: 4.62e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  399 NRPVNAATGAKYLAGDDDVDFSLPGHFTLEWQRTYSSRDERTEGMFGRGWSVLYEVCLERTPDNPDENCMTYVAPMGRRI 478
Cdd:COG3209   616 AGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATT 695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  479 DLQAVEPGSGFYSPGEGLAVRRSEQGHWLISSDDGVYRLFEADPSSPQRRRLKMLGDRNSNCQHLTYDNHGRLVEISGDR 558
Cdd:COG3209   696 GATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPG 775
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  559 QRPC----IRLHYELAAHPQRVTrifrhYPEGEpelLRRYRYDEAGRLNGVVDNAGQ-----YQREFAYDDNDCMTmhre 629
Cdd:COG3209   776 GVTQgtytTRYTYDALGRLTSVT-----YPDGE---TVTYTYDALGRLTSVITVGSGggtdlQDRTYTYDAAGNIT---- 843
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425357  630 pggeryyytwawfegpddaawRVTGHHTDSGEQYRLDWNLAERSLCVTDSLGRTRCHwWDAQGLVTayRDEAGQMTTFRW 709
Cdd:COG3209   844 ---------------------SITDALRAGTLTQTYTYDALGRLTSATDPGTTESYT-YDANGNLT--SRTDGGTTTYTY 899
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 446425357  710 sDEERLLLGMTDAQGGKWRYVYDRLGHltetHDPLGrveqtqwhpvwhQPETEVDAAGA-AWRYEYDERGNLQA 782
Cdd:COG3209   900 -DALGRLVSVTKPDGTTTTYTYDALGH----TDHLG------------SVRALTDASGQvVWRYDYDPFGNLLA 956
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
774-810 6.15e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 38.73  E-value: 6.15e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425357   774 YDERGNLQAVIDPLHQRTVYGYDRHGQVVRITDARGG 810
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGG 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
816-850 6.45e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 38.73  E-value: 6.45e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 446425357   816 WNEDGQLMRHTDCSGSQTAWFYDERTRLERVTDAE 850
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPD 35
PAAR_motif pfam05488
PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. ...
274-326 8.51e-04

PAAR motif; This motif is usually found in pairs in a family of bacterial membrane proteins. It is also found as a triplet of tandem repeats comprising the entire length in another family of hypothetical proteins.


Pssm-ID: 428491  Cd Length: 71  Bit Score: 39.48  E-value: 8.51e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 446425357   274 AASALIGSVSNLFKGD----DEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVV 326
Cdd:pfam05488   15 SPTVLIGGKPAARVGDlvvcPPCGGGGPIAEGSPTVLINGKPAAREGDKTACGATLI 71
YD_repeat_2x TIGR01643
647-684 9.39e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 38.34  E-value: 9.39e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 446425357   647 DAAWRVTGHHTDSGEQYRLDWNLAERSLCVTDSLGRTR 684
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGST 39
YD_repeat_2x TIGR01643
858-899 9.39e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 38.34  E-value: 9.39e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 446425357   858 YDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQ 899
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
RHS_repeat pfam05593
715-746 1.49e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.58  E-value: 1.49e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 446425357   715 LLLGMTDAQGGKWRYVYDRLGHLTETHDPLGR 746
Cdd:pfam05593    6 RLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
942-976 1.96e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.19  E-value: 1.96e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 446425357   942 YDVLDRVTEQTDPGGSRRVYGYNALNAVTAVIYGG 976
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPD 35
RHS_repeat pfam05593
1037-1072 2.10e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.19  E-value: 2.10e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425357  1037 YDSAGNLLSEETAQG-VLQHRYDVQGNRTETQMPDGR 1072
Cdd:pfam05593    1 YDAAGRLTSVTDPDGrVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
731-786 2.69e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.81  E-value: 2.69e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 446425357   731 YDRLGHLTETHDPLGRVeqtqwhpvwhqpetevdaagaaWRYEYDERGNLQAVIDP 786
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRV----------------------TTYTYDAAGRLTAVTDP 34
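
This alignment contains a 22-residue insertion in the query, shown in lowercase against gap characters in the model row. A minimal sketch of how identity over the aligned columns could be counted follows. It assumes that lowercase residues and `-` mark unaligned columns in this display; the function name `aligned_identity` is hypothetical and is not a CD-Search API.

```python
def aligned_identity(query_row: str, model_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Columns where either row has a gap ('-') or a lowercase residue are
    treated as unaligned and skipped (an assumption about the display).
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Rows copied from the alignment above (query residues 731-786).
query_row = "YDRLGHLTETHDPLGRVeqtqwhpvwhqpetevdaagaaWRYEYDERGNLQAVIDP"
# The 22 gap characters are written as a product to avoid miscounting.
model_row = "YDAAGRLTSVTDPDGRV" + "-" * 22 + "TTYTYDAAGRLTAVTDP"

identical, aligned = aligned_identity(query_row, model_row)
print(f"{identical}/{aligned} aligned columns identical")
```

For these two rows the count is 19 identical residues over 34 aligned columns; the 22 inserted query residues do not contribute to either number.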
PAAR COG4104
298-343 3.33e-03

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 38.26  E-value: 3.33e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 446425357  298 IAEGTRDVRINSQPAARSGVRCTCEAKVVDepenGvhvSGDVRIGG 343
Cdd:COG4104    49 IAEGSPTVLINGKPAARVGDKTACGGTIIS----G---SPTVLIGG 87
YD_repeat_2x TIGR01643
711-751 4.63e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 36.41  E-value: 4.63e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 446425357   711 DEERLLLGMTDAQGGKWRYVYDRLGHLTETHDPLGRVEQTQ 751
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
YD_repeat_2x TIGR01643
901-940 4.91e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 36.41  E-value: 4.91e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 446425357   901 DGQGRVRRQTDATGRRTAYEYDAYGRLTTLTNENGESYRF 940
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
RHS_repeat pfam05593
989-1018 7.63e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 35.65  E-value: 7.63e-03
                           10        20        30
                   ....*....|....*....|....*....|.
gi 446425357   989 DAAGRLTAKITPE-TRTKYRYDAADRLLEIR 1018
Cdd:pfam05593    2 DAAGRLTSVTDPDgRVTTYTYDAAGRLTAVT 32
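
Several of the hit intervals listed above overlap (for example 711-751, 715-746, 731-786, and 774-810), so the number of hits overstates how many distinct regions of the protein are covered. The sketch below merges overlapping intervals into contiguous covered regions. The interval list is copied by hand from the hits in this part of the list, and `merge_intervals` is a hypothetical helper, not something CD-Search provides.

```python
def merge_intervals(intervals: list[tuple[int, int]]) -> list[tuple[int, int]]:
    """Merge overlapping inclusive (start, end) intervals into sorted regions."""
    merged: list[tuple[int, int]] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# Intervals of the PAAR, YD-repeat, and RHS-repeat hits listed above.
hit_intervals = [
    (774, 810), (816, 850), (274, 326), (647, 684), (858, 899),
    (715, 746), (942, 976), (1037, 1072), (731, 786), (298, 343),
    (711, 751), (901, 940), (989, 1018),
]

for start, end in merge_intervals(hit_intervals):
    print(f"{start}-{end}")
```

Under this merge the thirteen hits collapse into nine regions; the two PAAR hits become 274-343 and the four overlapping repeat hits become 711-810. Intervals that merely abut without sharing a residue (such as 858-899 and 901-940) are kept separate.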
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
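
Each hit above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search reports an E-value threshold of 0.01. A minimal sketch of parsing such a line and checking it against the threshold follows. The regular expression is written against the lines as they appear in this text, the names `parse_stats` and `passes_threshold` are hypothetical, and treating the cutoff as inclusive is an assumption.

```python
import re

# Pattern for the per-hit statistics line as it appears in this text.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)

E_VALUE_THRESHOLD = 0.01  # value reported under "Preset Options"


def parse_stats(line: str) -> dict:
    """Extract Pssm-ID, Cd Length, Bit Score, and E-value from one line."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "evalue": float(match["evalue"]),
    }


def passes_threshold(hit: dict, threshold: float = E_VALUE_THRESHOLD) -> bool:
    """True if the hit's E-value is at or below the threshold (assumed inclusive)."""
    return hit["evalue"] <= threshold


# Two statistics lines copied from the hit list above.
lines = [
    "Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 38.73  E-value: 6.15e-04",
    "Pssm-ID: 428491  Cd Length: 71  Bit Score: 39.48  E-value: 8.51e-04",
]
parsed = [parse_stats(line) for line in lines]
for hit in parsed:
    print(hit["pssm_id"], hit["evalue"], passes_threshold(hit))
```

The optional "[Multi-domain]" tag is skipped by the lazy `.*?` rather than captured; a caller that needs it would have to extend the pattern.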
