NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2787272368|ref|WP_370374804|]
View 

RHS repeat-associated core domain-containing protein [Maricaulis sp.]

Protein Classification

RHS repeat-associated core domain-containing protein( domain architecture ID 10022768)

RHS (rearrangement hotspot) repeat-associated core domain-containing protein

PubMed:  23572593

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
38-111 9.05e-33

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


:

Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 116.06  E-value: 9.05e-33
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2787272368  38 YDEYGAPASGNAG---RFGYTGQVWLPETGLYHYKNRAYHPELGRFMQTDPIGVNGGMNLYAYVGGDPVNFVDPLGL 111
Cdd:TIGR03696   1 YDPYGEVLSESGAapnPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGNNPVNWVDPLGL 77
 
Name Accession Description Interval E-value
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
38-111 9.05e-33

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 116.06  E-value: 9.05e-33
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2787272368  38 YDEYGAPASGNAG---RFGYTGQVWLPETGLYHYKNRAYHPELGRFMQTDPIGVNGGMNLYAYVGGDPVNFVDPLGL 111
Cdd:TIGR03696   1 YDPYGEVLSESGAapnPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGNNPVNWVDPLGL 77
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
15-113 2.05e-29

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 118.71  E-value: 2.05e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2787272368   15 LADERGSIVAVTDWSGASIQINSYDEYGAPASGNAG----RFGYTGQVWLPETGLYHYKNRAYHPELGRFMQTDPIGVNG 90
Cdd:COG3209    925 HTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGaaanPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPDPIGLAG 1004
                           90       100
                   ....*....|....*....|...
gi 2787272368   91 GMNLYAYVGGDPVNFVDPLGLEA 113
Cdd:COG3209   1005 GLNLYAYVGNNPVNYVDPLGLAA 1027
RHS_core NF041261
RHS element core protein;
17-111 3.20e-12

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 67.34  E-value: 3.20e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2787272368   17 DERGSIVAVTDWSGASIQINSYDEYGAPASGNAGRFGYT-----GQVWLPETGLYHYKNRAYHPELGRFMQTDPIGVNGG 91
Cdd:NF041261  1163 DHRGLPLALISEEGNTAWQGEYDEWGNLLNEENPHHLQQpyrlpGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGG 1242
                           90       100
                   ....*....|....*....|
gi 2787272368   92 MNLYAYvGGDPVNFVDPLGL 111
Cdd:NF041261  1243 WNLYQY-PLNPIRFIDPLGL 1261
 
Name Accession Description Interval E-value
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
38-111 9.05e-33

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 116.06  E-value: 9.05e-33
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2787272368  38 YDEYGAPASGNAG---RFGYTGQVWLPETGLYHYKNRAYHPELGRFMQTDPIGVNGGMNLYAYVGGDPVNFVDPLGL 111
Cdd:TIGR03696   1 YDPYGEVLSESGAapnPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGNNPVNWVDPLGL 77
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
15-113 2.05e-29

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 118.71  E-value: 2.05e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2787272368   15 LADERGSIVAVTDWSGASIQINSYDEYGAPASGNAG----RFGYTGQVWLPETGLYHYKNRAYHPELGRFMQTDPIGVNG 90
Cdd:COG3209    925 HTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGaaanPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPDPIGLAG 1004
                           90       100
                   ....*....|....*....|...
gi 2787272368   91 GMNLYAYVGGDPVNFVDPLGLEA 113
Cdd:COG3209   1005 GLNLYAYVGNNPVNYVDPLGLAA 1027
RHS_core NF041261
RHS element core protein;
17-111 3.20e-12

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 67.34  E-value: 3.20e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2787272368   17 DERGSIVAVTDWSGASIQINSYDEYGAPASGNAGRFGYT-----GQVWLPETGLYHYKNRAYHPELGRFMQTDPIGVNGG 91
Cdd:NF041261  1163 DHRGLPLALISEEGNTAWQGEYDEWGNLLNEENPHHLQQpyrlpGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGG 1242
                           90       100
                   ....*....|....*....|
gi 2787272368   92 MNLYAYvGGDPVNFVDPLGL 111
Cdd:NF041261  1243 WNLYQY-PLNPIRFIDPLGL 1261
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH