NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|386764301|ref|NP_001245642|]
View 

tenascin accessory, isoform M [Drosophila melanogaster]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2898-2975 1.60e-38

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


:

Pssm-ID: 464783  Cd Length: 78  Bit Score: 139.28  E-value: 1.60e-38
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 386764301  2898 KEQQRLMHHAKLTAVRKAWHREKEALRSGLTTALEWSQQETDEILKQSYANNYEGEYIHDVNLYPELAEDPYNIKFVK 2975
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1248-1553 6.10e-35

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 137.66  E-value: 6.10e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1248 NGVAKDAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTILQL----------SATQVSYQYYLAVSPAdGHLYI 1315
Cdd:cd14953    14 GGGGTAARFNSPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTgtagfadgggAAAQFNTPSGVAVDAA-GNLYV 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1316 SDPERHqilrlvRLEKVkDPSINSDPVVGSGQRcipgdeGNCGDGGpALLARLSHPKGLAIAADRTMYIADGTN--IRAV 1393
Cdd:cd14953    93 ADTGNH------RIRKI-TPDGVVSTLAGTGTA------GFSDDGG-ATAAQFNYPTGVAVDAAGNLYVADTGNhrIRKI 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1394 DPKGVIHTLIGhhghhnhwSPAPCS---GTlmANQAQLQWPTGLALSPlDGSLhFIDDRL---VLRLTSDMKIRVVAGTp 1467
Cdd:cd14953   159 TPDGVVTTVAG--------TGGAGYagdGP--ATAAQFNNPTGVAVDA-AGNL-YVADRGnhrIRKITPDGVVTTVAGT- 225
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1468 lhcsnGGQDGRVNKTGADNVLGTVLAMAFSPFGNLYIADSDSRRvnsIRVVDTAGNMRYFAGKQEGTgsqtcdcaigGGS 1547
Cdd:cd14953   226 -----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHR---IRKITPAGVVTTVAGGGAGF----------SGD 287

                  ....*.
gi 386764301 1548 NGSATN 1553
Cdd:cd14953   288 GGPATS 293
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1992-2641 5.52e-30

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


:

Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 131.03  E-value: 5.52e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1992 YSVYSLVGDVRNPQQTLNREIWVNQSRVIGVEFDQFTNRETFYDARRTPILIVAYDQSGLPKSYYPTNGYPVNITYDRFN 2071
Cdd:COG3209   375 GGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGG 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2072 RVEGWAWGPAELKYSYDRHGLLSEITSQQDGIVSFVYNDWNLVSEIGLASQRkfvlqYDDAGGLRHVVLPSGTRHSFSMQ 2151
Cdd:COG3209   455 AGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL-----GGTTTTTAGARGLVVTTGTTLTL 529
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2152 TSIGFIRCTYTPPGSTRAYLQHYSHAGALLQTILPGDGARIVYRYNAAGQLTEVVHGDGRSEFQYNEATGMPSTVSHTER 2231
Cdd:COG3209   530 GTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTT 609
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2232 ELEY-RWDFEYAAGLLAEERIDYVAKTGLSNAKFSYEYDSQLRVVALQGRIGGQSLPTQAFAYDPRTGRPSLIGQFRFSQ 2310
Cdd:COG3209   610 TSGYtRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGT 689
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2311 PAQNQTQLHDGTASFTRTVDGRFQTQRMALAIHRLEVFRMEFSYGVHGRISQTRTYTRNMAVNSYTNVKNYTWDCDGQLV 2390
Cdd:COG3209   690 TSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLT 769
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2391 GVEAQEP-------WGFRYDDNGNLLSLTY-RGNTIPMEYNAQDRIVK-----------FGEGQYKYDARGLVAQ----- 2446
Cdd:COG3209   770 SETTPGGvtqgtytTRYTYDALGRLTSVTYpDGETVTYTYDALGRLTSvitvgsgggtdLQDRTYTYDAAGNITSitdal 849
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2447 --NAREERFHYNTQGLLVRASKRGRfDVRYYYDHLKRLTTRKDnfGNVTQFFYTNQQRPYEVSQiyspRDGKLMSLTYDD 2524
Cdd:COG3209   850 raGTLTQTYTYDALGRLTSATDPGT-TESYTYDANGNLTSRTD--GGTTTYTYDALGRLVSVTK----PDGTTTTYTYDA 922
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2525 VGHliyaqvyrhkyyvaTDQSGTPLMLFNQYGEGIREIMRSPFGHIVYDSNPYLYLPIDFCGGILDQVTTLVHMGdGRVY 2604
Cdd:COG3209   923 LGH--------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANPLRFTGQEYDAETGLYYNG-ARYY 987
                         650       660       670
                  ....*....|....*....|....*....|....*..
gi 386764301 2605 DPLIGQWMSPDwqrvaeRIITPTRLHLYRFNGNDPIN 2641
Cdd:COG3209   988 DPALGRFLSPD------PIGLAGGLNLYAYVGNNPVN 1018
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
825-851 3.20e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


:

Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 3.20e-09
                          10        20
                  ....*....|....*....|....*..
gi 386764301  825 NCKDNIDNDGDGMTDCSDSECCSHPAC 851
Cdd:NF033662    6 TCSDGIDNDGDGLTDCADPDCAGNPVC 32
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
660-683 2.88e-05

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


:

Pssm-ID: 400365  Cd Length: 26  Bit Score: 43.11  E-value: 2.88e-05
                           10        20
                   ....*....|....*....|....*.
gi 386764301   660 LCSGHGTCVA--GQCYCKAGWQGEDC 683
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
RHS_core super family cl49306
RHS element core protein;
1744-1874 2.86e-04

RHS element core protein;


The actual alignment was detected with superfamily member NF041261:

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 46.92  E-value: 2.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1744 SSEIYVFNRYGQHVATKDLTSGKTRYSFlysknTSFGRLSTVTDASGnkIQFLRDYSN--VVSSIENTQDHKSEIQINGI 1821
Cdd:NF041261  535 STKQMTWSRYGQLLAFTDCSGYQTRYEY-----DRFGQMTAVHREEG--ISTYRRYDNrgQLTSVKDAQGRETRYEYNAA 607
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 386764301 1822 GIMTKLSEKGRQEIELDYDSnTGLLNSRSSGGETYIYQYDEFGRVTGMILPSG 1874
Cdd:NF041261  608 GDLTAVITPDGNRSETQYDA-WGKAVSTTQGGLTRSMEYDAAGRITTLTNENG 659
C_rich_MXAN6577 super family cl49352
MXAN_6577-like cysteine-rich domain;
661-772 5.86e-04

MXAN_6577-like cysteine-rich domain;


The actual alignment was detected with superfamily member NF041328:

Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 5.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301  661 CSGHGTCVAGQCYCKAGwqGEDCGtidqqvyqclpgcsehgtydletGQCVCERhwTGPD-CSQavCSLDCGRNGVCESG 739
Cdd:NF041328   49 CGAGQTCVAGACGCGPG--TVACG-----------------------GACVDTA--SDPAhCGA--CGAACAPGQVCEGG 99
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 386764301  740 KCR--CNSGWT--GNLC-----DQL---PCDSRCSEHGQCKNGTC 772
Cdd:NF041328  100 ACReaCSEGLTrcGGACvdlatDPLhcgACGVACDPGESCRGGAC 144
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
563-586 8.82e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


:

Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 8.82e-04
                           10        20
                   ....*....|....*....|....*.
gi 386764301   563 DCSGRGSCYL--GKCDCIDGYQGVDC 586
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
629-651 1.88e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


:

Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.10  E-value: 1.88e-03
                           10        20
                   ....*....|....*....|....*
gi 386764301   629 CSSHGRCI--EGECHCERGWKGPYC 651
Cdd:pfam07974    2 CSGRGTCVnqCGKCVCDSGYQGATC 26
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
789-818 3.84e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 3.84e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 386764301  789 ENGCSRHGQCTLENGEYRCDCIEGWAGRDC 818
Cdd:cd00054     8 GNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
 
Name Accession Description Interval E-value
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2898-2975 1.60e-38

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 139.28  E-value: 1.60e-38
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 386764301  2898 KEQQRLMHHAKLTAVRKAWHREKEALRSGLTTALEWSQQETDEILKQSYANNYEGEYIHDVNLYPELAEDPYNIKFVK 2975
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1248-1553 6.10e-35

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 137.66  E-value: 6.10e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1248 NGVAKDAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTILQL----------SATQVSYQYYLAVSPAdGHLYI 1315
Cdd:cd14953    14 GGGGTAARFNSPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTgtagfadgggAAAQFNTPSGVAVDAA-GNLYV 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1316 SDPERHqilrlvRLEKVkDPSINSDPVVGSGQRcipgdeGNCGDGGpALLARLSHPKGLAIAADRTMYIADGTN--IRAV 1393
Cdd:cd14953    93 ADTGNH------RIRKI-TPDGVVSTLAGTGTA------GFSDDGG-ATAAQFNYPTGVAVDAAGNLYVADTGNhrIRKI 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1394 DPKGVIHTLIGhhghhnhwSPAPCS---GTlmANQAQLQWPTGLALSPlDGSLhFIDDRL---VLRLTSDMKIRVVAGTp 1467
Cdd:cd14953   159 TPDGVVTTVAG--------TGGAGYagdGP--ATAAQFNNPTGVAVDA-AGNL-YVADRGnhrIRKITPDGVVTTVAGT- 225
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1468 lhcsnGGQDGRVNKTGADNVLGTVLAMAFSPFGNLYIADSDSRRvnsIRVVDTAGNMRYFAGKQEGTgsqtcdcaigGGS 1547
Cdd:cd14953   226 -----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHR---IRKITPAGVVTTVAGGGAGF----------SGD 287

                  ....*.
gi 386764301 1548 NGSATN 1553
Cdd:cd14953   288 GGPATS 293
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1992-2641 5.52e-30

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 131.03  E-value: 5.52e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1992 YSVYSLVGDVRNPQQTLNREIWVNQSRVIGVEFDQFTNRETFYDARRTPILIVAYDQSGLPKSYYPTNGYPVNITYDRFN 2071
Cdd:COG3209   375 GGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGG 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2072 RVEGWAWGPAELKYSYDRHGLLSEITSQQDGIVSFVYNDWNLVSEIGLASQRkfvlqYDDAGGLRHVVLPSGTRHSFSMQ 2151
Cdd:COG3209   455 AGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL-----GGTTTTTAGARGLVVTTGTTLTL 529
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2152 TSIGFIRCTYTPPGSTRAYLQHYSHAGALLQTILPGDGARIVYRYNAAGQLTEVVHGDGRSEFQYNEATGMPSTVSHTER 2231
Cdd:COG3209   530 GTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTT 609
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2232 ELEY-RWDFEYAAGLLAEERIDYVAKTGLSNAKFSYEYDSQLRVVALQGRIGGQSLPTQAFAYDPRTGRPSLIGQFRFSQ 2310
Cdd:COG3209   610 TSGYtRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGT 689
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2311 PAQNQTQLHDGTASFTRTVDGRFQTQRMALAIHRLEVFRMEFSYGVHGRISQTRTYTRNMAVNSYTNVKNYTWDCDGQLV 2390
Cdd:COG3209   690 TSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLT 769
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2391 GVEAQEP-------WGFRYDDNGNLLSLTY-RGNTIPMEYNAQDRIVK-----------FGEGQYKYDARGLVAQ----- 2446
Cdd:COG3209   770 SETTPGGvtqgtytTRYTYDALGRLTSVTYpDGETVTYTYDALGRLTSvitvgsgggtdLQDRTYTYDAAGNITSitdal 849
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2447 --NAREERFHYNTQGLLVRASKRGRfDVRYYYDHLKRLTTRKDnfGNVTQFFYTNQQRPYEVSQiyspRDGKLMSLTYDD 2524
Cdd:COG3209   850 raGTLTQTYTYDALGRLTSATDPGT-TESYTYDANGNLTSRTD--GGTTTYTYDALGRLVSVTK----PDGTTTTYTYDA 922
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2525 VGHliyaqvyrhkyyvaTDQSGTPLMLFNQYGEGIREIMRSPFGHIVYDSNPYLYLPIDFCGGILDQVTTLVHMGdGRVY 2604
Cdd:COG3209   923 LGH--------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANPLRFTGQEYDAETGLYYNG-ARYY 987
                         650       660       670
                  ....*....|....*....|....*....|....*..
gi 386764301 2605 DPLIGQWMSPDwqrvaeRIITPTRLHLYRFNGNDPIN 2641
Cdd:COG3209   988 DPALGRFLSPD------PIGLAGGLNLYAYVGNNPVN 1018
RHS_core NF041261
RHS element core protein;
2076-2528 2.44e-14

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 80.05  E-value: 2.44e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2076 WAWGPA-ELKYSYDRHGllseitSQqdgIVSFVYNDWNLVSEIG--LASQRKFVLQYDDAGGLRHVVLPSGTRHSFSMQT 2152
Cdd:NF041261  322 YTYTEAgELLAVYDRSN------TQ---VRAFTYDAQHPGRMVAhrYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQ 392
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2153 SigfiRCTYTPPGSTRAYLqhYSHAGALLQTILP---GDGARIVYRYNAAGQLTEVVHGDGR-SEFQYNEATGMPSTVSH 2228
Cdd:NF041261  393 D----RITITDSLNRREVL--HTEGEGGLKRVVKkehADGSVTRSGYDAAGRLTAQTDAAGRrTEYSLNVVSGDITDITT 466
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2229 TE-RELEYRWDfeyaagllaeERIDYVAKTGLSNAKFSYEYDSQLRVVALQGRIGgqslPTQAFAYD-PRTGRPSLIG-- 2304
Cdd:NF041261  467 PDgRETKFYYN----------DGNQLTSVTSPDGLESRREYDEPGRLVSETSRSG----ETTRYRYDdPHSELPATTTda 532
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2305 -----QFRFSQPAQnQTQLHDGTASFTRTVDGRFQTQrmaLAIHRLEVFRMEFSYGVHGRISQTRtytrnmavNSYTNVK 2379
Cdd:NF041261  533 tgstkQMTWSRYGQ-LLAFTDCSGYQTRYEYDRFGQM---TAVHREEGISTYRRYDNRGQLTSVK--------DAQGRET 600
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2380 NYTWDCDGQLVGVEAqePWGFR----YDDNGNLLSLTYRGNTIPMEYNAQDRIVKF-----GEGQYKYDARGLVAQ---- 2446
Cdd:NF041261  601 RYEYNAAGDLTAVIT--PDGNRsetqYDAWGKAVSTTQGGLTRSMEYDAAGRITTLtnengSHSTFLYDALDRLVQqrgf 678
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2447 NAREERFHYNTQGLLVRASKRGrFDVRYYYDHLKRLTTRKDNFGNVTQFFYTNQQRPYEVSQIyspRDGKLMSL--TYDD 2524
Cdd:NF041261  679 DGRTQRYHYDLTGKLTQSEDEG-LVTLWHYDESDRITHRTVNGEPAEQWQYDEHGWLTDISHL---SEGHRVAVhyGYDD 754

                  ....
gi 386764301 2525 VGHL 2528
Cdd:NF041261  755 KGRL 758
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1253-1528 1.02e-11

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 68.12  E-value: 1.02e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1253 DAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTILQLSATQvSYQYYLAVSPaDGHLYISDPERHQILRLvrle 1330
Cdd:COG4257    55 LGGGSGPHGIAVDPDGNLWFTDNgnNRIGRIDPKTGEITTFALPGGG-SNPHGIAFDP-DGNLWFTDQGGNRIGRL---- 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1331 kvkdpsinsDPVVGSgqrcIPGDEGNCGDGGPAllarlshpkGLAIAADRTMYIAD-GTN-IRAVDPK-GVIHTLighhg 1407
Cdd:COG4257   129 ---------DPATGE----VTEFPLPTGGAGPY---------GIAVDPDGNLWVTDfGANaIGRIDPDtGTLTEY----- 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1408 hhnhwspapcsgtlmANQAQLQWPTGLALSPlDGSLHFIDdrlvlrltsdmkirvvagtplhcSNGGQDGRVN-KTGADN 1486
Cdd:COG4257   182 ---------------ALPTPGAGPRGLAVDP-DGNLWVAD-----------------------TGSGRIGRFDpKTGTVT 222
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 386764301 1487 VLGTVL------AMAFSPFGNLYIADSDSrrvNSIRVVDTAGNMRYFA 1528
Cdd:COG4257   223 EYPLPGggarpyGVAVDGDGRVWFAESGA---NRIVRFDPDTELTEYV 267
RHS_core NF041261
RHS element core protein;
1849-2486 1.35e-11

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 70.80  E-value: 1.35e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1849 RSSGGETYIYQYDEFGRVTGMILPSGeiVRITSQLADSQgLTVyvhasVESLFSRERIAGEANellvlGGVRSTFLKRgq 1928
Cdd:NF041261  358 RYAGRPEMCYRYDDTGRVTEQLNPAG--LSYRYQYEQDR-ITI-----TDSLNRREVLHTEGE-----GGLKRVVKKE-- 422
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1929 aHADAELKANntlvihgdngvvvEASAVARhplLEAalpveaemlamwshQSVTMGEGLTNSMYSVYSLVGDVRNPQQTL 2008
Cdd:NF041261  423 -HADGSVTRS-------------GYDAAGR---LTA--------------QTDAAGRRTEYSLNVVSGDITDITTPDGRE 471
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2009 NREIWVNQSRVIGV----------EFDQfTNRETFYDARRTPILIVAYD--QSGLPKSYYPTNGYPVNITYDRFNRVEGW 2076
Cdd:NF041261  472 TKFYYNDGNQLTSVtspdglesrrEYDE-PGRLVSETSRSGETTRYRYDdpHSELPATTTDATGSTKQMTWSRYGQLLAF 550
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2077 A-WGPAELKYSYDRHGLLSEItSQQDGIVSF-VYNDWNLVSEIGLASQRKFVLQYDDAGGLRHVVLPSGTRhSFSMQTSI 2154
Cdd:NF041261  551 TdCSGYQTRYEYDRFGQMTAV-HREEGISTYrRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNR-SETQYDAW 628
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2155 GFIRCTyTPPGSTRAylQHYSHAGALLqTILPGDGARIVYRYNAAGQLTEVVHGDGRSEFQYNEATGmpsTVSHTERE-L 2233
Cdd:NF041261  629 GKAVST-TQGGLTRS--MEYDAAGRIT-TLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTG---KLTQSEDEgL 701
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2234 EYRWDFEyaagllAEERIDYVAKTGLSNAKFSYE---------YDSQLRVVAL------QGRIGGQSLPTQafayDPRTG 2298
Cdd:NF041261  702 VTLWHYD------ESDRITHRTVNGEPAEQWQYDehgwltdisHLSEGHRVAVhygyddKGRLTGERQTVE----NPETG 771
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2299 RpsLIGQFR----FSQPAQNQTQLHDGTASFTRTVDGRFQTQRMALA-----------IHRlEVFRMEFSYGVHGRISQT 2363
Cdd:NF041261  772 E--LLWQHEtghaYNEQGLANRVTPDSLPPVEWLTYGSGYLAGMKLGgtplveytrdrLHR-ETVRSFGGAGSNAAYELT 848
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2364 RTYT-----RNMAVNSYTNVKNYTWDCDGQLVGVEA-QEPWGFRYDDNGNLLS-------LTYR--------GNTIP--- 2419
Cdd:NF041261  849 TAYTpagqlQSQHLNSLVYDRDYTWNDNGDLVRISGpRQTREYGYSATGRLTGvhttaanLDIRipyatdpaGNRLPdpe 928
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2420 ------MEYNAQDRIVKFGEGQYKYDARGLVAQ-------------NAREERFHYNTQGLLVRASK----RGRFDVRYYY 2476
Cdd:NF041261  929 lhpdstLTAWPDNRIAEDAHYVYRYDEYGRLTEktdripegvirtdDERTHHYHYDSQHRLVFYTRiqhgEPLVESRYLY 1008
                         730
                  ....*....|
gi 386764301 2477 DHLKRLTTRK 2486
Cdd:NF041261 1009 DPLGRRMAKR 1018
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
825-851 3.20e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 3.20e-09
                          10        20
                  ....*....|....*....|....*..
gi 386764301  825 NCKDNIDNDGDGMTDCSDSECCSHPAC 851
Cdd:NF033662    6 TCSDGIDNDGDGLTDCADPDCAGNPVC 32
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2565-2641 3.17e-08

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 52.89  E-value: 3.17e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 386764301  2565 SPFGHIVYDSNPyLYLPIDFCGGILDQVTTLVHMGdGRVYDPLIGQWMSPDwqrvaeRIITPTRLHLYRFNGNDPIN 2641
Cdd:TIGR03696    2 DPYGEVLSESGA-APNPLRFTGQYYDAETGLYYNG-ARYYDPELGRFLSPD------PIGLGGGLNLYAYVGNNPVN 70
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
660-683 2.88e-05

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 43.11  E-value: 2.88e-05
                           10        20
                   ....*....|....*....|....*.
gi 386764301   660 LCSGHGTCVA--GQCYCKAGWQGEDC 683
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
RHS_core NF041261
RHS element core protein;
1744-1874 2.86e-04

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 46.92  E-value: 2.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1744 SSEIYVFNRYGQHVATKDLTSGKTRYSFlysknTSFGRLSTVTDASGnkIQFLRDYSN--VVSSIENTQDHKSEIQINGI 1821
Cdd:NF041261  535 STKQMTWSRYGQLLAFTDCSGYQTRYEY-----DRFGQMTAVHREEG--ISTYRRYDNrgQLTSVKDAQGRETRYEYNAA 607
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 386764301 1822 GIMTKLSEKGRQEIELDYDSnTGLLNSRSSGGETYIYQYDEFGRVTGMILPSG 1874
Cdd:NF041261  608 GDLTAVITPDGNRSETQYDA-WGKAVSTTQGGLTRSMEYDAAGRITTLTNENG 659
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1295-1396 3.99e-04

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 46.38  E-value: 3.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1295 ATQVSYQYYLAV-SPADGHLYISDPERHQIlrlvrleKVKDPSINS-DPVVGSGQRCIpgdegncgDGGPALLARLSHPK 1372
Cdd:PLN02919  798 GSEVLLQHPLGVlCAKDGQIYVADSYNHKI-------KKLDPATKRvTTLAGTGKAGF--------KDGKALKAQLSEPA 862
                          90       100
                  ....*....|....*....|....*.
gi 386764301 1373 GLAIAADRTMYIADGTN--IRAVDPK 1396
Cdd:PLN02919  863 GLALGENGRLFVADTNNslIRYLDLN 888
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
661-772 5.86e-04

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 5.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301  661 CSGHGTCVAGQCYCKAGwqGEDCGtidqqvyqclpgcsehgtydletGQCVCERhwTGPD-CSQavCSLDCGRNGVCESG 739
Cdd:NF041328   49 CGAGQTCVAGACGCGPG--TVACG-----------------------GACVDTA--SDPAhCGA--CGAACAPGQVCEGG 99
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 386764301  740 KCR--CNSGWT--GNLC-----DQL---PCDSRCSEHGQCKNGTC 772
Cdd:NF041328  100 ACReaCSEGLTrcGGACvdlatDPLhcgACGVACDPGESCRGGAC 144
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
743-786 7.82e-04

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 39.53  E-value: 7.82e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 386764301   743 CNSGWTGNLCDQLpCDSR--------CSEHGQCkngtcVCSQGWNGRHCTLP 786
Cdd:pfam01414    1 CDENYYGSTCSKF-CRPRddkfghytCDANGNK-----VCLPGWTGPYCDKP 46
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
563-586 8.82e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 8.82e-04
                           10        20
                   ....*....|....*....|....*.
gi 386764301   563 DCSGRGSCYL--GKCDCIDGYQGVDC 586
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
629-651 1.88e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.10  E-value: 1.88e-03
                           10        20
                   ....*....|....*....|....*
gi 386764301   629 CSSHGRCI--EGECHCERGWKGPYC 651
Cdd:pfam07974    2 CSGRGTCVnqCGKCVCDSGYQGATC 26
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1839-1875 1.88e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.96  E-value: 1.88e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 386764301  1839 YDSNTGLLNSRSSGGETYIYQYDEFGRVTGMILPSGE 1875
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
789-818 3.84e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 3.84e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 386764301  789 ENGCSRHGQCTLENGEYRCDCIEGWAGRDC 818
Cdd:cd00054     8 GNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
757-784 8.37e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.46  E-value: 8.37e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 386764301  757 CDSR--CSEHGQCKNG----TCVCSQGWNGRHCT 784
Cdd:cd00054     5 CASGnpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2898-2975 1.60e-38

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 139.28  E-value: 1.60e-38
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 386764301  2898 KEQQRLMHHAKLTAVRKAWHREKEALRSGLTTALEWSQQETDEILKQSYANNYEGEYIHDVNLYPELAEDPYNIKFVK 2975
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1248-1553 6.10e-35

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 137.66  E-value: 6.10e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1248 NGVAKDAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTILQL----------SATQVSYQYYLAVSPAdGHLYI 1315
Cdd:cd14953    14 GGGGTAARFNSPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTgtagfadgggAAAQFNTPSGVAVDAA-GNLYV 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1316 SDPERHqilrlvRLEKVkDPSINSDPVVGSGQRcipgdeGNCGDGGpALLARLSHPKGLAIAADRTMYIADGTN--IRAV 1393
Cdd:cd14953    93 ADTGNH------RIRKI-TPDGVVSTLAGTGTA------GFSDDGG-ATAAQFNYPTGVAVDAAGNLYVADTGNhrIRKI 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1394 DPKGVIHTLIGhhghhnhwSPAPCS---GTlmANQAQLQWPTGLALSPlDGSLhFIDDRL---VLRLTSDMKIRVVAGTp 1467
Cdd:cd14953   159 TPDGVVTTVAG--------TGGAGYagdGP--ATAAQFNNPTGVAVDA-AGNL-YVADRGnhrIRKITPDGVVTTVAGT- 225
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1468 lhcsnGGQDGRVNKTGADNVLGTVLAMAFSPFGNLYIADSDSRRvnsIRVVDTAGNMRYFAGKQEGTgsqtcdcaigGGS 1547
Cdd:cd14953   226 -----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHR---IRKITPAGVVTTVAGGGAGF----------SGD 287

                  ....*.
gi 386764301 1548 NGSATN 1553
Cdd:cd14953   288 GGPATS 293
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1992-2641 5.52e-30

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 131.03  E-value: 5.52e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1992 YSVYSLVGDVRNPQQTLNREIWVNQSRVIGVEFDQFTNRETFYDARRTPILIVAYDQSGLPKSYYPTNGYPVNITYDRFN 2071
Cdd:COG3209   375 GGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGG 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2072 RVEGWAWGPAELKYSYDRHGLLSEITSQQDGIVSFVYNDWNLVSEIGLASQRkfvlqYDDAGGLRHVVLPSGTRHSFSMQ 2151
Cdd:COG3209   455 AGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTL-----GGTTTTTAGARGLVVTTGTTLTL 529
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2152 TSIGFIRCTYTPPGSTRAYLQHYSHAGALLQTILPGDGARIVYRYNAAGQLTEVVHGDGRSEFQYNEATGMPSTVSHTER 2231
Cdd:COG3209   530 GTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTT 609
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2232 ELEY-RWDFEYAAGLLAEERIDYVAKTGLSNAKFSYEYDSQLRVVALQGRIGGQSLPTQAFAYDPRTGRPSLIGQFRFSQ 2310
Cdd:COG3209   610 TSGYtRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGT 689
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2311 PAQNQTQLHDGTASFTRTVDGRFQTQRMALAIHRLEVFRMEFSYGVHGRISQTRTYTRNMAVNSYTNVKNYTWDCDGQLV 2390
Cdd:COG3209   690 TSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLT 769
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2391 GVEAQEP-------WGFRYDDNGNLLSLTY-RGNTIPMEYNAQDRIVK-----------FGEGQYKYDARGLVAQ----- 2446
Cdd:COG3209   770 SETTPGGvtqgtytTRYTYDALGRLTSVTYpDGETVTYTYDALGRLTSvitvgsgggtdLQDRTYTYDAAGNITSitdal 849
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2447 --NAREERFHYNTQGLLVRASKRGRfDVRYYYDHLKRLTTRKDnfGNVTQFFYTNQQRPYEVSQiyspRDGKLMSLTYDD 2524
Cdd:COG3209   850 raGTLTQTYTYDALGRLTSATDPGT-TESYTYDANGNLTSRTD--GGTTTYTYDALGRLVSVTK----PDGTTTTYTYDA 922
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2525 VGHliyaqvyrhkyyvaTDQSGTPLMLFNQYGEGIREIMRSPFGHIVYDSNPYLYLPIDFCGGILDQVTTLVHMGdGRVY 2604
Cdd:COG3209   923 LGH--------------TDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANPLRFTGQEYDAETGLYYNG-ARYY 987
                         650       660       670
                  ....*....|....*....|....*....|....*..
gi 386764301 2605 DPLIGQWMSPDwqrvaeRIITPTRLHLYRFNGNDPIN 2641
Cdd:COG3209   988 DPALGRFLSPD------PIGLAGGLNLYAYVGNNPVN 1018
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1246-1515 7.79e-27

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 114.16  E-value: 7.79e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1246 YCNGVAKDAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTIL-----------QLSATQVSYQYYLAVSPAdGH 1312
Cdd:cd14953    66 FADGGGAAAQFNTPSGVAVDAAGNLYVADTgnHRIRKITPDGVVSTLAgtgtagfsddgGATAAQFNYPTGVAVDAA-GN 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1313 LYISDPERHQILRlvrlekvkdpsINSDPVV----GSGqrcipgdEGNCGDGGPALLARLSHPKGLAIAADRTMYIADGT 1388
Cdd:cd14953   145 LYVADTGNHRIRK-----------ITPDGVVttvaGTG-------GAGYAGDGPATAAQFNNPTGVAVDAAGNLYVADRG 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1389 N--IRAVDPKGVIHTLIGHHGhhnhwspAPCSGTLMANQAQLQWPTGLALSPlDGSLhFIDDRL---VLRLTSDMKIRVV 1463
Cdd:cd14953   207 NhrIRKITPDGVVTTVAGTGT-------AGFSGDGGATAAQLNNPTGVAVDA-AGNL-YVADSGnhrIRKITPAGVVTTV 277
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 386764301 1464 AGTPlhCSNGGQDGRVNKTGADNVLGtvlaMAFSPFGNLYIADSDSRRVNSI 1515
Cdd:cd14953   278 AGGG--AGFSGDGGPATSAQFNNPTG----VAVDAAGNLYVADTGNNRIRKI 323
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1253-1512 4.28e-19

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 90.07  E-value: 4.28e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1253 DAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTILQLSATQVSYQYY---LAVSPaDGHLYISDPERHQILRLv 1327
Cdd:cd05819    51 DGQFNEPAGVAVDSDGNLYVADTgnHRIQKFDPDGNFLASFGGSGDGDGEFNGprgIAVDS-SGNIYVADTGNHRIQKF- 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1328 rlekvkDPSinsdpvvGSGQRCIPGDEGNCGDggpallarLSHPKGLAIAADRTMYIADGTN--IRAVDPKGVIHTLIGH 1405
Cdd:cd05819   129 ------DPD-------GEFLTTFGSGGSGPGQ--------FNGPTGVAVDSDGNIYVADTGNhrIQVFDPDGNFLTTFGS 187
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1406 HGHHNhwspapcsgtlmanqAQLQWPTGLALSPlDGSLHFID--DRLVLRLTSDMKIRVVAGTPLhcsnggqdgrvnktG 1483
Cdd:cd05819   188 TGTGP---------------GQFNYPTGIAVDS-DGNIYVADsgNNRVQVFDPDGAGFGGNGNFL--------------G 237
                         250       260
                  ....*....|....*....|....*....
gi 386764301 1484 ADNVLGTVLAMAFSPFGNLYIADSDSRRV 1512
Cdd:cd05819   238 SDGQFNRPSGLAVDSDGNLYVADTGNNRI 266
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1358-1553 1.49e-18

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 89.51  E-value: 1.49e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1358 GDGGPALLARLSHPKGLAIAADRTMYIADGTN--IRAVDPKGVIHTLI--GHHGHHNHWSPApcsgtlmanqAQLQWPTG 1433
Cdd:cd14953    12 FSGGGGTAARFNSPSGVAVDAAGNLYVADRGNhrIRKITPDGVVTTVAgtGTAGFADGGGAA----------AQFNTPSG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1434 LALSPlDGSLhFIDDRL---VLRLTSDMKIRVVAGTplhCSNGGQDGrvnkTGADN-VLGTVLAMAFSPFGNLYIADsds 1509
Cdd:cd14953    82 VAVDA-AGNL-YVADTGnhrIRKITPDGVVSTLAGT---GTAGFSDD----GGATAaQFNYPTGVAVDAAGNLYVAD--- 149
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 386764301 1510 RRVNSIRVVDTAGNMRYFAgkqeGTGSQtcdcaiGGGSNGSATN 1553
Cdd:cd14953   150 TGNHRIRKITPDGVVTTVA----GTGGA------GYAGDGPATA 183
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1252-1537 2.48e-16

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 81.98  E-value: 2.48e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1252 KDAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTILQLSAT---QVSYQYYLAVSPaDGHLYISDPERHQILRL 1326
Cdd:cd05819     3 GPGELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSgdgQFNEPAGVAVDS-DGNLYVADTGNHRIQKF 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1327 vrlekvkdpSINSDPVVGSGqrcIPGDegncGDGGpallarLSHPKGLAIAADRTMYIADGTN--IRAVDPKGVIHTLIG 1404
Cdd:cd05819    82 ---------DPDGNFLASFG---GSGD----GDGE------FNGPRGIAVDSSGNIYVADTGNhrIQKFDPDGEFLTTFG 139
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1405 HHGhhnhwspapcsgtlmANQAQLQWPTGLALSPlDGSLhFIDDRLVLRltsdmkIRVVAgtplhcSNGGQDGRVNKTGA 1484
Cdd:cd05819   140 SGG---------------SGPGQFNGPTGVAVDS-DGNI-YVADTGNHR------IQVFD------PDGNFLTTFGSTGT 190
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....
gi 386764301 1485 -DNVLGTVLAMAFSPFGNLYIADSDSRRvnsIRVVDTAGNMRYFAGKQEGTGSQ 1537
Cdd:cd05819   191 gPGQFNYPTGIAVDSDGNIYVADSGNNR---VQVFDPDGAGFGGNGNFLGSDGQ 241
RHS_core NF041261
RHS element core protein;
2076-2528 2.44e-14

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 80.05  E-value: 2.44e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2076 WAWGPA-ELKYSYDRHGllseitSQqdgIVSFVYNDWNLVSEIG--LASQRKFVLQYDDAGGLRHVVLPSGTRHSFSMQT 2152
Cdd:NF041261  322 YTYTEAgELLAVYDRSN------TQ---VRAFTYDAQHPGRMVAhrYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQ 392
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2153 SigfiRCTYTPPGSTRAYLqhYSHAGALLQTILP---GDGARIVYRYNAAGQLTEVVHGDGR-SEFQYNEATGMPSTVSH 2228
Cdd:NF041261  393 D----RITITDSLNRREVL--HTEGEGGLKRVVKkehADGSVTRSGYDAAGRLTAQTDAAGRrTEYSLNVVSGDITDITT 466
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2229 TE-RELEYRWDfeyaagllaeERIDYVAKTGLSNAKFSYEYDSQLRVVALQGRIGgqslPTQAFAYD-PRTGRPSLIG-- 2304
Cdd:NF041261  467 PDgRETKFYYN----------DGNQLTSVTSPDGLESRREYDEPGRLVSETSRSG----ETTRYRYDdPHSELPATTTda 532
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2305 -----QFRFSQPAQnQTQLHDGTASFTRTVDGRFQTQrmaLAIHRLEVFRMEFSYGVHGRISQTRtytrnmavNSYTNVK 2379
Cdd:NF041261  533 tgstkQMTWSRYGQ-LLAFTDCSGYQTRYEYDRFGQM---TAVHREEGISTYRRYDNRGQLTSVK--------DAQGRET 600
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2380 NYTWDCDGQLVGVEAqePWGFR----YDDNGNLLSLTYRGNTIPMEYNAQDRIVKF-----GEGQYKYDARGLVAQ---- 2446
Cdd:NF041261  601 RYEYNAAGDLTAVIT--PDGNRsetqYDAWGKAVSTTQGGLTRSMEYDAAGRITTLtnengSHSTFLYDALDRLVQqrgf 678
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2447 NAREERFHYNTQGLLVRASKRGrFDVRYYYDHLKRLTTRKDNFGNVTQFFYTNQQRPYEVSQIyspRDGKLMSL--TYDD 2524
Cdd:NF041261  679 DGRTQRYHYDLTGKLTQSEDEG-LVTLWHYDESDRITHRTVNGEPAEQWQYDEHGWLTDISHL---SEGHRVAVhyGYDD 754

                  ....
gi 386764301 2525 VGHL 2528
Cdd:NF041261  755 KGRL 758
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1252-1443 7.82e-13

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 71.58  E-value: 7.82e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1252 KDAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTIL---QLSATQVSYQYYLAVSPaDGHLYISDPERHQILRL 1326
Cdd:cd05819    97 GDGEFNGPRGIAVDSSGNIYVADTgnHRIQKFDPDGEFLTTFgsgGSGPGQFNGPTGVAVDS-DGNIYVADTGNHRIQVF 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1327 vrlekvkDPSINSDPVVGSGqrcipgdegncGDGGpallARLSHPKGLAIAADRTMYIADGTN--IRAVDPKGVIHTlig 1404
Cdd:cd05819   176 -------DPDGNFLTTFGST-----------GTGP----GQFNYPTGIAVDSDGNIYVADSGNnrVQVFDPDGAGFG--- 230
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 386764301 1405 hhghhnhwspapCSGTLMANQAQLQWPTGLALSPlDGSL 1443
Cdd:cd05819   231 ------------GNGNFLGSDGQFNRPSGLAVDS-DGNL 256
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1253-1528 1.02e-11

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 68.12  E-value: 1.02e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1253 DAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTILQLSATQvSYQYYLAVSPaDGHLYISDPERHQILRLvrle 1330
Cdd:COG4257    55 LGGGSGPHGIAVDPDGNLWFTDNgnNRIGRIDPKTGEITTFALPGGG-SNPHGIAFDP-DGNLWFTDQGGNRIGRL---- 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1331 kvkdpsinsDPVVGSgqrcIPGDEGNCGDGGPAllarlshpkGLAIAADRTMYIAD-GTN-IRAVDPK-GVIHTLighhg 1407
Cdd:COG4257   129 ---------DPATGE----VTEFPLPTGGAGPY---------GIAVDPDGNLWVTDfGANaIGRIDPDtGTLTEY----- 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1408 hhnhwspapcsgtlmANQAQLQWPTGLALSPlDGSLHFIDdrlvlrltsdmkirvvagtplhcSNGGQDGRVN-KTGADN 1486
Cdd:COG4257   182 ---------------ALPTPGAGPRGLAVDP-DGNLWVAD-----------------------TGSGRIGRFDpKTGTVT 222
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 386764301 1487 VLGTVL------AMAFSPFGNLYIADSDSrrvNSIRVVDTAGNMRYFA 1528
Cdd:COG4257   223 EYPLPGggarpyGVAVDGDGRVWFAESGA---NRIVRFDPDTELTEYV 267
RHS_core NF041261
RHS element core protein;
1849-2486 1.35e-11

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 70.80  E-value: 1.35e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1849 RSSGGETYIYQYDEFGRVTGMILPSGeiVRITSQLADSQgLTVyvhasVESLFSRERIAGEANellvlGGVRSTFLKRgq 1928
Cdd:NF041261  358 RYAGRPEMCYRYDDTGRVTEQLNPAG--LSYRYQYEQDR-ITI-----TDSLNRREVLHTEGE-----GGLKRVVKKE-- 422
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1929 aHADAELKANntlvihgdngvvvEASAVARhplLEAalpveaemlamwshQSVTMGEGLTNSMYSVYSLVGDVRNPQQTL 2008
Cdd:NF041261  423 -HADGSVTRS-------------GYDAAGR---LTA--------------QTDAAGRRTEYSLNVVSGDITDITTPDGRE 471
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2009 NREIWVNQSRVIGV----------EFDQfTNRETFYDARRTPILIVAYD--QSGLPKSYYPTNGYPVNITYDRFNRVEGW 2076
Cdd:NF041261  472 TKFYYNDGNQLTSVtspdglesrrEYDE-PGRLVSETSRSGETTRYRYDdpHSELPATTTDATGSTKQMTWSRYGQLLAF 550
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2077 A-WGPAELKYSYDRHGLLSEItSQQDGIVSF-VYNDWNLVSEIGLASQRKFVLQYDDAGGLRHVVLPSGTRhSFSMQTSI 2154
Cdd:NF041261  551 TdCSGYQTRYEYDRFGQMTAV-HREEGISTYrRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNR-SETQYDAW 628
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2155 GFIRCTyTPPGSTRAylQHYSHAGALLqTILPGDGARIVYRYNAAGQLTEVVHGDGRSEFQYNEATGmpsTVSHTERE-L 2233
Cdd:NF041261  629 GKAVST-TQGGLTRS--MEYDAAGRIT-TLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTG---KLTQSEDEgL 701
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2234 EYRWDFEyaagllAEERIDYVAKTGLSNAKFSYE---------YDSQLRVVAL------QGRIGGQSLPTQafayDPRTG 2298
Cdd:NF041261  702 VTLWHYD------ESDRITHRTVNGEPAEQWQYDehgwltdisHLSEGHRVAVhygyddKGRLTGERQTVE----NPETG 771
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2299 RpsLIGQFR----FSQPAQNQTQLHDGTASFTRTVDGRFQTQRMALA-----------IHRlEVFRMEFSYGVHGRISQT 2363
Cdd:NF041261  772 E--LLWQHEtghaYNEQGLANRVTPDSLPPVEWLTYGSGYLAGMKLGgtplveytrdrLHR-ETVRSFGGAGSNAAYELT 848
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2364 RTYT-----RNMAVNSYTNVKNYTWDCDGQLVGVEA-QEPWGFRYDDNGNLLS-------LTYR--------GNTIP--- 2419
Cdd:NF041261  849 TAYTpagqlQSQHLNSLVYDRDYTWNDNGDLVRISGpRQTREYGYSATGRLTGvhttaanLDIRipyatdpaGNRLPdpe 928
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 2420 ------MEYNAQDRIVKFGEGQYKYDARGLVAQ-------------NAREERFHYNTQGLLVRASK----RGRFDVRYYY 2476
Cdd:NF041261  929 lhpdstLTAWPDNRIAEDAHYVYRYDEYGRLTEktdripegvirtdDERTHHYHYDSQHRLVFYTRiqhgEPLVESRYLY 1008
                         730
                  ....*....|
gi 386764301 2477 DHLKRLTTRK 2486
Cdd:NF041261 1009 DPLGRRMAKR 1018
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
825-851 3.20e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.44  E-value: 3.20e-09
                          10        20
                  ....*....|....*....|....*..
gi 386764301  825 NCKDNIDNDGDGMTDCSDSECCSHPAC 851
Cdd:NF033662    6 TCSDGIDNDGDGLTDCADPDCAGNPVC 32
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2565-2641 3.17e-08

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 52.89  E-value: 3.17e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 386764301  2565 SPFGHIVYDSNPyLYLPIDFCGGILDQVTTLVHMGdGRVYDPLIGQWMSPDwqrvaeRIITPTRLHLYRFNGNDPIN 2641
Cdd:TIGR03696    2 DPYGEVLSESGA-APNPLRFTGQYYDAETGLYYNG-ARYYDPELGRFLSPD------PIGLGGGLNLYAYVGNNPVN 70
NHL_like_6 cd14962
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1249-1407 5.83e-08

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271332 [Multi-domain]  Cd Length: 271  Bit Score: 56.83  E-value: 5.83e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1249 GVAKDAKLLTPIALATGPDGSLYVGDFNL--VRRITPDGK-VYTI----LQLSATQVsyqyylAVSPADGHLYISDPERH 1321
Cdd:cd14962    49 GNAGPNRFVSPIGVAIDANGNLYVSDAELgkVFVFDRDGKfLRAIgagaLFKRPTGI------AVDPAGKRLYVVDTLAH 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1322 QIlrlvrleKVKDPSinsdpvvGSGQRCIpgdeGNCGDGGpallARLSHPKGLAIAADRTMYIADGTNIR--AVDPKGVI 1399
Cdd:cd14962   123 KV-------KVFDLD-------GRLLFDI----GKRGSGP----GEFNLPTDLAVDRDGNLYVTDTMNFRvqIFDADGKF 180

                  ....*...
gi 386764301 1400 HTLIGHHG 1407
Cdd:cd14962   181 LRSFGERG 188
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1253-1512 8.02e-08

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 56.52  E-value: 8.02e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1253 DAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTILQLSAT---QVSYQYYLAVSPaDGHLYISDPERHQILRLv 1327
Cdd:cd14956    56 PGQFGRPRGLAVDKDGWLYVADYwgDRIQVFTLTGELQTIGGSSGSgpgQFNAPRGVAVDA-DGNLYVADFGNQRIQKF- 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1328 rlekvkdpsinsDP---VVGS-GQRCIPGDEGNcgdggpallarlsHPKGLAIAADRTMYIADGTN--IRAVDPKGVIHT 1401
Cdd:cd14956   134 ------------DPdgsFLRQwGGTGIEPGSFN-------------YPRGVAVDPDGTLYVADTYNdrIQVFDNDGAFLR 188
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1402 LIGHHGHHNHWspapcsgtlmanqaqLQWPTGLALSPlDGSLHFID---DRlVLRLTSDMKIRVVAGTPlhcsnGGQDGR 1478
Cdd:cd14956   189 KWGGRGTGPGQ---------------FNYPYGIAIDP-DGNVFVADfgnNR-IQKFTADGTFLTSWGSP-----GTGPGQ 246
                         250       260       270
                  ....*....|....*....|....*....|....
gi 386764301 1479 vnktgadnvLGTVLAMAFSPFGNLYIADSDSRRV 1512
Cdd:cd14956   247 ---------FKNPWGVVVDADGTVYVADSNNNRV 271
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1355-1523 1.90e-07

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 56.05  E-value: 1.90e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1355 GNCG--DGGPALlARLSHPKGLAIAADRTMYIADGTN--IRAVDP-KGVIHTL--IGHHGHHNhwspapcSGTLMANQAQ 1427
Cdd:cd14951     4 GERGlkDGSFAE-ASFNEPQGLALLPGNILYVADTENhaLRKIDLeTGTVTTLagTGEQGRDG-------EGGGPGREQP 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1428 LQwptglalSPLDGSLHFIDDRLVLR----------LTSDMKIRVVAGTplhcsngGQDGRVNKTGADNvlgTVLA---- 1493
Cdd:cd14951    76 LS-------SPWDVAWGPEDDILYIAmagthqiwayDLDTGTCRVFAGS-------GNEGNRNGPYPHE---AWFAqpsg 138
                         170       180       190
                  ....*....|....*....|....*....|
gi 386764301 1494 MAFSPFGNLYIADSDSrrvNSIRVVDTAGN 1523
Cdd:cd14951   139 LSLAGWGELFVADSES---SAIRAVSLKDG 165
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
660-683 2.88e-05

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 43.11  E-value: 2.88e-05
                           10        20
                   ....*....|....*....|....*.
gi 386764301   660 LCSGHGTCVA--GQCYCKAGWQGEDC 683
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1302-1535 5.36e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 47.71  E-value: 5.36e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1302 YYLAVSPaDGHLYISDPERHQILRLvrlekvkdpsinsDPVVGSGQRcipgdegncgdggpALLARLSHPKGLAIAADRT 1381
Cdd:COG4257    20 RDVAVDP-DGAVWFTDQGGGRIGRL-------------DPATGEFTE--------------YPLGGGSGPHGIAVDPDGN 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1382 MYIADGTN--IRAVDPK-GVIHTLIGhhghhnhwsPAPCSGtlmanqaqlqwPTGLALSPlDGSLHFIDDR--LVLRLT- 1455
Cdd:COG4257    72 LWFTDNGNnrIGRIDPKtGEITTFAL---------PGGGSN-----------PHGIAFDP-DGNLWFTDQGgnRIGRLDp 130
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1456 SDMKIRVVAGTplhcSNGGQDGrvnktgadnvlgtvlAMAFSPFGNLYIADsdsRRVNSIRVVDTA-GNMRYFAGKQEGT 1534
Cdd:COG4257   131 ATGEVTEFPLP----TGGAGPY---------------GIAVDPDGNLWVTD---FGANAIGRIDPDtGTLTEYALPTPGA 188

                  .
gi 386764301 1535 G 1535
Cdd:COG4257   189 G 189
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1249-1331 6.38e-05

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 47.96  E-value: 6.38e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1249 GVAKDAKLLTPIALATGPDGSLYVGD-FN-LVRRITPD-GKVYTI---LQLSATQVSYQYY----LAVSPaDGHLYISDP 1318
Cdd:cd14951   188 GPGAEALLQHPLGVAALPDGSVYVADtYNhKIKRVDPAtGEVSTLagtGKAGYKDLEAQFSepsgLVVDG-DGRLYVADT 266
                          90
                  ....*....|...
gi 386764301 1319 ERHQIlRLVRLEK 1331
Cdd:cd14951   267 NNHRI-RRLDLPT 278
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1304-1443 6.44e-05

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 47.96  E-value: 6.44e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1304 LAVSPaDGHLYISDPERhQILRLVRLEKVKDPSINSDPVVGsgqrcipgdeGNCGDGG----PALLARLSHPKGLAIAAD 1379
Cdd:cd14951   139 LSLAG-WGELFVADSES-SAIRAVSLKDGGVKTLVGGTRVG----------TGLFDFGdrdgPGAEALLQHPLGVAALPD 206
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 386764301 1380 RTMYIADGTN--IRAVDPK-GVIHTLIghhghhnhwspapcsGTLMAN----QAQLQWPTGLALSPlDGSL 1443
Cdd:cd14951   207 GSVYVADTYNhkIKRVDPAtGEVSTLA---------------GTGKAGykdlEAQFSEPSGLVVDG-DGRL 261
RHS_core NF041261
RHS element core protein;
1744-1874 2.86e-04

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 46.92  E-value: 2.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1744 SSEIYVFNRYGQHVATKDLTSGKTRYSFlysknTSFGRLSTVTDASGnkIQFLRDYSN--VVSSIENTQDHKSEIQINGI 1821
Cdd:NF041261  535 STKQMTWSRYGQLLAFTDCSGYQTRYEY-----DRFGQMTAVHREEG--ISTYRRYDNrgQLTSVKDAQGRETRYEYNAA 607
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 386764301 1822 GIMTKLSEKGRQEIELDYDSnTGLLNSRSSGGETYIYQYDEFGRVTGMILPSG 1874
Cdd:NF041261  608 GDLTAVITPDGNRSETQYDA-WGKAVSTTQGGLTRSMEYDAAGRITTLTNENG 659
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
1309-1535 3.49e-04

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 45.27  E-value: 3.49e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1309 ADGHLYISDPERHQILRLvrlekvkDPSinsdpvvGSGQRCIPGDEGncgdggpallarlsHPKGLAIAADRTMYIAD-G 1387
Cdd:COG3386    17 PDGRLYWVDIPGGRIHRY-------DPD-------GGAVEVFAEPSG--------------RPNGLAFDPDGRLLVADhG 68
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1388 TNIRAVDPK-GVIHTLIGHHGHhnhwspapcsgtlmanqaQLQWPTGLALSPlDGSL------HFIDDRLVLRLTSDMKI 1460
Cdd:COG3386    69 RGLVRFDPAdGEVTVLADEYGK------------------PLNRPNDGVVDP-DGRLyftdmgEYLPTGALYRVDPDGSL 129
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 386764301 1461 RVVAgTPLHCSNGgqdgrvnktgadnvlgtvlaMAFSPFGN-LYIADSDSRRVNSIRVVD--TAGNMRYFAGKQEGTG 1535
Cdd:COG3386   130 RVLA-DGLTFPNG--------------------IAFSPDGRtLYVADTGAGRIYRFDLDAdgTLGNRRVFADLPDGPG 186
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1295-1396 3.99e-04

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 46.38  E-value: 3.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1295 ATQVSYQYYLAV-SPADGHLYISDPERHQIlrlvrleKVKDPSINS-DPVVGSGQRCIpgdegncgDGGPALLARLSHPK 1372
Cdd:PLN02919  798 GSEVLLQHPLGVlCAKDGQIYVADSYNHKI-------KKLDPATKRvTTLAGTGKAGF--------KDGKALKAQLSEPA 862
                          90       100
                  ....*....|....*....|....*.
gi 386764301 1373 GLAIAADRTMYIADGTN--IRAVDPK 1396
Cdd:PLN02919  863 GLALGENGRLFVADTNNslIRYLDLN 888
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
1262-1403 4.64e-04

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 44.88  E-value: 4.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1262 LATGPDGSLYVGDFNLVR------RITPDGKVYTIlqlsATQVSYQYYLAVSPADGHLYISDPERHQILRLvrlekvkdp 1335
Cdd:COG3386    98 GVVDPDGRLYFTDMGEYLptgalyRVDPDGSLRVL----ADGLTFPNGIAFSPDGRTLYVADTGAGRIYRF--------- 164
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1336 SINSDPVVGSGQRCIPGDEgncGDGGPAllarlshpkGLAIAADRTMYIA--DGTNIRAVDPKGVIHTLI 1403
Cdd:COG3386   165 DLDADGTLGNRRVFADLPD---GPGGPD---------GLAVDADGNLWVAlwGGGGVVRFDPDGELLGRI 222
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1248-1326 5.02e-04

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 44.62  E-value: 5.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1248 NGVAKDAKLLTPIALATGPDGSLYVGDF--NLVRRITPDGKVYTIL---QLSATQVSYQYYLAVSPaDGHLYISDPERHQ 1322
Cdd:cd05819   187 STGTGPGQFNYPTGIAVDSDGNIYVADSgnNRVQVFDPDGAGFGGNgnfLGSDGQFNRPSGLAVDS-DGNLYVADTGNNR 265

                  ....
gi 386764301 1323 ILRL 1326
Cdd:cd05819   266 IQVF 269
C_rich_MXAN6577 NF041328
MXAN_6577-like cysteine-rich domain;
661-772 5.86e-04

MXAN_6577-like cysteine-rich domain;


Pssm-ID: 469225 [Multi-domain]  Cd Length: 145  Bit Score: 42.82  E-value: 5.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301  661 CSGHGTCVAGQCYCKAGwqGEDCGtidqqvyqclpgcsehgtydletGQCVCERhwTGPD-CSQavCSLDCGRNGVCESG 739
Cdd:NF041328   49 CGAGQTCVAGACGCGPG--TVACG-----------------------GACVDTA--SDPAhCGA--CGAACAPGQVCEGG 99
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 386764301  740 KCR--CNSGWT--GNLC-----DQL---PCDSRCSEHGQCKNGTC 772
Cdd:NF041328  100 ACReaCSEGLTrcGGACvdlatDPLhcgACGVACDPGESCRGGAC 144
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
743-786 7.82e-04

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 39.53  E-value: 7.82e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 386764301   743 CNSGWTGNLCDQLpCDSR--------CSEHGQCkngtcVCSQGWNGRHCTLP 786
Cdd:pfam01414    1 CDENYYGSTCSKF-CRPRddkfghytCDANGNK-----VCLPGWTGPYCDKP 46
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
563-586 8.82e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 8.82e-04
                           10        20
                   ....*....|....*....|....*.
gi 386764301   563 DCSGRGSCYL--GKCDCIDGYQGVDC 586
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
YliI COG2133
Glucose/arabinose dehydrogenase, beta-propeller fold [Carbohydrate transport and metabolism];
1259-1390 1.14e-03

Glucose/arabinose dehydrogenase, beta-propeller fold [Carbohydrate transport and metabolism];


Pssm-ID: 441736 [Multi-domain]  Cd Length: 365  Bit Score: 44.15  E-value: 1.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1259 PIALATGPDGSLYVGD-FNLVRRITPDGKVYTILQLSATQVSYQY---YLAVSP---ADGHLYI--SDPERHQiLRLVRL 1329
Cdd:COG2133    39 PWGLAFLPDGRLLVTErAGRIRLLDDDGKLSTPVADLPVFAGGEGgllGVALDPdfaTNGYLYVayTDPGGAG-TRVARF 117
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 386764301 1330 EKVKDPSINSDPVVGSGqrcIPGDEGNcgdggpallarlsHP-KGLAIAADRTMYIA--DGTNI 1390
Cdd:COG2133   118 TLSDGDTLTSEEVILDG---LPAGGGN-------------HNgGRLAFGPDGKLYVSvgDRGNA 165
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
697-721 1.14e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.48  E-value: 1.14e-03
                           10        20
                   ....*....|....*....|....*
gi 386764301   697 CSEHGTYDLETGQCVCERHWTGPDC 721
Cdd:pfam07974    2 CSGRGTCVNQCGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
629-651 1.88e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.10  E-value: 1.88e-03
                           10        20
                   ....*....|....*....|....*
gi 386764301   629 CSSHGRCI--EGECHCERGWKGPYC 651
Cdd:pfam07974    2 CSGRGTCVnqCGKCVCDSGYQGATC 26
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1839-1875 1.88e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.96  E-value: 1.88e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 386764301  1839 YDSNTGLLNSRSSGGETYIYQYDEFGRVTGMILPSGE 1875
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
761-783 2.73e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 37.33  E-value: 2.73e-03
                           10        20
                   ....*....|....*....|....*
gi 386764301   761 CSEHGQC--KNGTCVCSQGWNGRHC 783
Cdd:pfam07974    2 CSGRGTCvnQCGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
729-752 2.81e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 37.33  E-value: 2.81e-03
                           10        20
                   ....*....|....*....|....*.
gi 386764301   729 DCGRNGVCES--GKCRCNSGWTGNLC 752
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
789-818 3.84e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 3.84e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 386764301  789 ENGCSRHGQCTLENGEYRCDCIEGWAGRDC 818
Cdd:cd00054     8 GNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_Tenascin pfam18720
Tenascin EGF domain; This entry represents the EGF-like domains found in tenascin proteins.
560-587 4.73e-03

Tenascin EGF domain; This entry represents the EGF-like domains found in tenascin proteins.


Pssm-ID: 376143  Cd Length: 29  Bit Score: 36.89  E-value: 4.73e-03
                           10        20
                   ....*....|....*....|....*...
gi 386764301   560 CPNDCSGRGSCYLGKCDCIDGYQGVDCS 587
Cdd:pfam18720    2 CPLGCSSRGVCVDGQCICDSEYSGDDCS 29
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
608-653 6.31e-03

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 36.83  E-value: 6.31e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 386764301   608 CEEGWKGAECDIpvgECEVPN-------CSSHGRCIegechCERGWKGPYCDQ 653
Cdd:pfam01414    1 CDENYYGSTCSK---FCRPRDdkfghytCDANGNKV-----CLPGWTGPYCDK 45
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1262-1396 6.36e-03

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 41.41  E-value: 6.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1262 LATGPDGSLYVGD--FNLVRRITP-DGKVYTILQLS---------------ATQVSYQYYLAVSPA-DGHLYISDPERHQ 1322
Cdd:cd14951   139 LSLAGWGELFVADseSSAIRAVSLkDGGVKTLVGGTrvgtglfdfgdrdgpGAEALLQHPLGVAALpDGSVYVADTYNHK 218
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 386764301 1323 ILRLvrlekvkdpsinsDPVVGSGQRCIPGDEGNCGDggpaLLARLSHPKGLAIAADRTMYIADgTN---IRAVDPK 1396
Cdd:cd14951   219 IKRV-------------DPATGEVSTLAGTGKAGYKD----LEAQFSEPSGLVVDGDGRLYVAD-TNnhrIRRLDLP 277
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
1266-1545 7.28e-03

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 41.03  E-value: 7.28e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1266 PDGSLYVGDF--NLVRRITPDGKVYTILQLSATQVSYqyyLAVSPaDGHLYISDPERhqilRLVRLEKvkdpsinsdpvv 1343
Cdd:COG3386    17 PDGRLYWVDIpgGRIHRYDPDGGAVEVFAEPSGRPNG---LAFDP-DGRLLVADHGR----GLVRFDP------------ 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1344 gsgqrcipgdegncGDGGPALLA-----RLSHPKGLAIAADRTMYIAD------GTNIRAVDPKGVIHTLIGHhghhnhw 1412
Cdd:COG3386    77 --------------ADGEVTVLAdeygkPLNRPNDGVVDPDGRLYFTDmgeylpTGALYRVDPDGSLRVLADG------- 135
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386764301 1413 spapcsgtlmanqaqLQWPTGLALSPlDGS-LHFID--DRLVLRLTSDMKIRVVAGTPLHcsnggqDGRVNKTGADNvlg 1489
Cdd:COG3386   136 ---------------LTFPNGIAFSP-DGRtLYVADtgAGRIYRFDLDADGTLGNRRVFA------DLPDGPGGPDG--- 190
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 386764301 1490 tvlaMAFSPFGNLYIADSDSRRVnsiRVVDTAGNMRyfaGKQEGTGSQTCDCAIGG 1545
Cdd:COG3386   191 ----LAVDADGNLWVALWGGGGV---VRFDPDGELL---GRIELPERRPTNVAFGG 236
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
757-784 8.37e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.46  E-value: 8.37e-03
                          10        20        30
                  ....*....|....*....|....*....|....
gi 386764301  757 CDSR--CSEHGQCKNG----TCVCSQGWNGRHCT 784
Cdd:cd00054     5 CASGnpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
790-818 8.51e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 36.30  E-value: 8.51e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 386764301  790 NGCSRHGQCTLENGEYRCDCIEGWAG-RDC 818
Cdd:cd00053     6 NPCSNGGTCVNTPGSYRCVCPPGYTGdRSC 35
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH