Conserved domains on  [gi|178557739|ref|NP_001002029|]

complement C4-B preproprotein [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
982-1315 5.75e-123

A-macroglobulin TED domain; This entry corresponds to the TED domain of complement components such as C3, C4 and C5. A short, highly conserved region of proteinase-binding alpha-macroglobulins contains the cysteine and glutamine of a thiol-ester bond that is cleaved at the moment of proteinase binding and that mediates covalent binding of the alpha-macroglobulin to the proteinase. The GCGEQ motif is highly conserved.



Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 388.58  E-value: 5.75e-123
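
Each hit in this report carries a one-line summary of the form shown above. As a minimal sketch, assuming the line always uses these labels in this order, the fields can be pulled into a dictionary. The function name `parse_summary` and the dictionary keys are hypothetical choices for illustration, not part of any NCBI tool.

```python
import re

# Labels and their order are taken from the summary lines in this report.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional tag such as [Multi-domain]
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    print(parse_summary(
        "Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  "
        "Bit Score: 388.58  E-value: 5.75e-123"
    ))
```

Lines without a bracketed tag, such as the one for Pssm-ID 239639, parse the same way with `multi_domain` set to `False`.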
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   982 ASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAPTLAASRYLDKTEQWStlpPETKDHAVDLIQKGYMRIQQFRK 1061
Cdd:pfam07678    1 ISVVGDIMGPAIQVVPENLSSLLRLPYGCGEQNMVLFAPNVYVLRYLDKTNQLT---KLIKSKAIDYLEQGYQRQLSYKH 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1062 ADGSYAAWLSRGSSTWLTAFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGGLVGNdetVAL 1141
Cdd:pfam07678   78 PDGSYSAFGHSPGSTWLTAFVLKVFAQARKFIFIDPEEICQSLRWLLSQQKPDGSFREPGPLLHRAMKGGVDGE---VSL 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1142 TAFVTIALHHGLAVFQdegaepLKQRVEASISKASSFLGEKASAGLLGAHAAAITAYALTLTKAPADlRGVAHNNLMAMA 1221
Cdd:pfam07678  155 TAYVTIALLEALDING------LLQRVHPSIRKALTYLEQAQLAGLTSPYTLAILAYALALAGSPET-REELLKSLDAMA 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1222 QETGDNLYWGSVTGSQSNAVSPtpaprnpsdPMPQAPALWIETTAYAlLHLLLHEGKAEMADQAAAWLTRQGSFQGGFRS 1301
Cdd:pfam07678  228 REEGNSRYWERDEKSDPQGVPE---------YPPQAPSLEVETTAYA-LLAYLLLGDLTYADPIVKWLTSQRNSHGGFSS 297
                          330
                   ....*....|....
gi 178557739  1302 TQDTVIALDALSAY 1315
Cdd:pfam07678  298 TQDTVVALQALAEY 311
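
The TED domain description singles out the conserved GCGEQ motif, and the alignment rows carry the query residue number at which each row starts. The sketch below, which assumes only that a row is a string of residues with `-` for gaps and occasional lowercase letters, converts a motif match into query residue numbering. The helper name `find_motif` is hypothetical.

```python
def find_motif(segment, start_residue, motif="GCGEQ"):
    """Return the 1-based query residue numbers at which `motif` starts.

    `segment` is one query row from the alignment; gap characters are
    dropped and lowercase residues are upper-cased before searching.
    """
    seq = "".join(ch.upper() for ch in segment if ch != "-")
    hits = []
    pos = seq.find(motif)
    while pos != -1:
        hits.append(start_residue + pos)
        pos = seq.find(motif, pos + 1)
    return hits


if __name__ == "__main__":
    # First query row of the pfam07678 alignment, which starts at residue 982.
    first_row = ("ASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAPTLAASRYLDKTEQWS"
                 "tlpPETKDHAVDLIQKGYMRIQQFRK")
    print(find_motif(first_row, 982))  # [1009]
```

The result places the motif at residues 1009-1013 of the query, which agrees with the numbering of the other alignments in this report that cover the same region.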
NTR_complement_C4 cd03584
NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known ...
1588-1742 7.40e-82

NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C4 is a key player in the activation of the complement classical pathway. C4 is cleaved by activated C1 to yield C4a anaphylatoxin and the larger fragment C4b, an essential component of the C3- and C5-convertase enzymes. C4b binds covalently to the surface of pathogens through a reactive thioester. The role of the NTR/C345C domain in C4 (C4b) is unclear.



Pssm-ID: 239639  Cd Length: 153  Bit Score: 265.37  E-value: 7.40e-82
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1588 CQCAEGKCPRQRRALErgLQDEDGYRMKFACYYPRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAANQMRNF 1667
Cdd:cd03584     1 CQCAEGGCPKQKSTFS--KEITKTDRFDFACYSPRVDYAYVVKVLNISEKSNFELYETSITDVLQTTGDVSVKPEETRVF 78
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 178557739 1668 LVRASCRLRLEPGKEYLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQLNDFLQEYGTQGC 1742
Cdd:cd03584    79 LKRLSCKLELKKGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEKIPSEKRCKATRNRSACKQLNEFLKEYKINGC 153
A2M_recep pfam07677
A-macroglobulin receptor binding domain; This family includes the receptor binding domain ...
1481-1570 6.90e-34

A-macroglobulin receptor binding domain; This family includes the receptor binding domain region of the alpha-2-macroglobulin family.



Pssm-ID: 462226  Cd Length: 92  Bit Score: 125.76  E-value: 6.90e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1481 SGMAIADVTLLSGFHALRADLEKLTSlsDRYVSHFETEGP-HVLLYFDSVPTSRECVGFEAVQEVPVGLVQPASATLYDY 1559
Cdd:pfam07677    3 SNMAILEVGLPSGFVPDEEDLKKLGV--DPLIKRVETVDDgKVILYLDKLSGEPLCFSFRAEQTFPVANLKPAPVKVYDY 80
                           90
                   ....*....|.
gi 178557739  1560 YNPERRCSVFY 1570
Cdd:pfam07677   81 YEPERRATTFY 91
A2M pfam00207
Alpha-2-macroglobulin family; This family includes the C-terminal region of the ...
781-869 2.08e-33

Alpha-2-macroglobulin family; This family includes the C-terminal region of the alpha-2-macroglobulin family.



Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 124.24  E-value: 2.08e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   781 NWLWRVETV--DRFQILTLWLPDSLTTWEIHGLSLSKTKGLCVATPVQLRVFREFHLHLRLPMSVRRFEQLELRPVLYNY 858
Cdd:pfam00207    1 TWLWDPVLVtdNGKASLSFTLPDSITTWRATAFALSPDTGLGVAEPPELVVFKPFFVDLNLPYSVRRGEQFELKATVFNY 80
                           90
                   ....*....|.
gi 178557739   859 LDKNLTVSVHV 869
Cdd:pfam00207   81 LDKCLKVRVRL 91
YfaS super family cl34462
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
138-1336 1.31e-28

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


The actual alignment was detected with superfamily member COG2373:

Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 125.96  E-value: 1.31e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  138 RGHLFlqTDQPIYNPGQRVRYRVFALDQKMRPSTDT-ITVMVENSHGLRVRKKEVYMPSS-IFQDDFVIPDISEPGTWKI 215
Cdd:COG2373   370 DAFLF--TDRGIYRPGETVHLKALLRDADGKAPAGLpLTLELTDPDGKEVRRQTLTLNEFgGYSFSFPLPEDAPTGTWRL 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  216 SARfSDGLESNSSTQFEVKKYVLPNFEVKITPGKPYILtvPGhlDEMQLDIQARYIYGKP-----VQGVAYVR------- 283
Cdd:COG2373   448 ELY-VDPKPALGSKSFRVEEFKPPRFKVDLTLDKEPLK--PG--DPVTVTVDARYLFGAPaaglkVEGEVTLRpartafp 522
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  284 ------FGLLDEDGKKTFFrgLESQTKL-VNGQSHISLSKAEFQDALeklnmgitdlQGLRLYVAAAIIEsPGGEMEEAE 356
Cdd:COG2373   523 gypgyrFGDPDEEFEPEEL--DLGEGTLdADGKASLSLPLPDAPDAP----------GPLRATVEASVFE-SGGRPVTRS 589
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  357 LTSWYFVSSPF---SLDLSKTKRhlvPGAPFLLQALVREMSGSPASGIPVKV--------SATVSSPG----SVPEVQD- 420
Cdd:COG2373   590 ATVPVHPADFYvgiRLPLFDGDP---EGAPATFEVVAVDPDGKPVAGKGLKVelyreewrYVWYKSDDggwrYESQEKEe 666
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  421 ------IQQNTDGSGQVSIPiiiPQTISELQLSVS-AGSPHPAIARLTVAAPPSGG---PGFLSIErPDSRPPRVGDTLN 490
Cdd:COG2373   667 pvaegtLTTGADGPASLSLT---PVEWGRYRLEVKdPDGGLATSVRFYAGGNASWGaerPDRLELS-LDKESYKPGETAK 742
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  491 LNLRA--VGSGatfshyYYMILSRGQIVFMNREPKRTLTSVSVFVDHHLAPSFYFVAFYYHGDHPVAN----------SL 558
Cdd:COG2373   743 LLIQSpfAGRA------LVTVERDGVLETQWVDVKGGGTTVEIPVTEDWAPNAYVSATLVRPGDSTANdmparaygvaPL 816
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  559 RVDVQAGacegKLELSVDGAKQYRNGESVKLHLETDSL----ALVALGALDTALYA-AGSKSHKPL-------------- 619
Cdd:COG2373   817 PVDPPAR----RLKVELTAPEKLRPGETLTVTVKVKGAagkaAEVTLAAVDEGILNlTGYKTPDPLdffygkralgvetr 892
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  620 -NMGKVFEAMnSYDLGCGPGGGDSALQvfqaaglafsdgdqwtlsrkrlscpKEKTTRKKRNvNFQkaineklgqyasPT 698
Cdd:COG2373   893 dLYGRLIGAF-GGAAGALRSGGDGALG-------------------------RGGNPKPPRK-RFK------------PV 933
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  699 AkrccqdgvtrlpmmrsceqraarvqqpdcrepflsccQFAESLrkKSRDKGQAglqraleilqeedlideddipvrsff 778
Cdd:COG2373   934 A-------------------------------------LFSGPV--KTDADGKA-------------------------- 948
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  779 penwlwrveTVdRFQiltlwLPDSLTTWEIHGLSLSKT------KGLCVATPVQLRvfrefhlhLRLPmsvrRF----EQ 848
Cdd:COG2373   949 ---------TV-SFD-----LPDFNGTLRVMAVAWSDDrfgsaeATVTVRKPLVVR--------PSLP----RFlapgDR 1001
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  849 LELRPVLYNYLDKNLTVSVHVS---PVEGLCLAGggglaQQVLVPAGSARPVAFSVVPTAATAVSLKVVARGSFEfpvGD 925
Cdd:COG2373  1002 FELPVDVFNLTGKAGTVTVTLEasgGLTLEGEAT-----QTVTLAAGGRATVRFPLKAPDAGDAKVTVTATGGGE---SD 1073
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  926 AVSKVLQIEKEGAIHREELVYELNPldhrGRTLEIPGNSDPNMIPDgdFNSY-VRVTASDPLDTlgsegalsPGGVASLL 1004
Cdd:COG2373  1074 AREVELPVRPANPLVTRATSGVLAP----GESWTLPLDLPGGLRPG--TGSLtLSLSSSPPLDL--------AGLLRYLL 1139
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1005 RLPRGCGEQTMIYLAPTLaasrYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAW-LSRGSSTWLTAFVL 1083
Cdd:COG2373  1140 RYPYGCTEQTTSRALPLL----YLSDLAEALGLKGDKDAELRARIQAAIARLLSMQNSDGGFGLWpGGSESDPWLTAYAT 1215
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1084 KVLSLAQEQvGGS-PEK-LQETSNWLLSQQQADGSFQDLSPvihrsmqgglvGNDETVALTAFVtialhhgLAVFQ--DE 1159
Cdd:COG2373  1216 DFLLEAREA-GYAvPDDaLDRALDYLRNYLRNPWEIEYDDA-----------YRLAVRAYALYV-------LARAGkaDL 1276
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1160 GA-EPLKQRVEASIS-KASSFLGekasagllgahaaaitayaLTLTKAPADLRGV-AHNNLMAMAQETGDNLYWGSVTGS 1236
Cdd:COG2373  1277 GDlRYLYDRRKDALSpLAKAQLA-------------------AALALLGDKARAEeLLAAALARLRETGARDYWYGDYGS 1337
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1237 QsnavsptpaprnpsdpmpqapalwIETTAYALLHLLLHEGKAEMADQAAAWLTRQGSfQGGFRSTQDTVIALDALSAYw 1316
Cdd:COG2373  1338 P------------------------LRDQALALALLAELGPDAPLAPKLARWLAKALK-SGRWLSTQETAWALLALAAY- 1391
                        1290      1300
                  ....*....|....*....|
gi 178557739 1317 iASHTTEERGLNVTLSSTGR 1336
Cdd:COG2373  1392 -ARAAGASPDFTATLTLDGK 1410
ANATO cd00017
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
686-750 1.81e-22

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to repeats in fibulins.



Pssm-ID: 237984  Cd Length: 70  Bit Score: 92.52  E-value: 1.81e-22
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 178557739  686 AINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRAARV-QQPDCREPFLSCCQFAESLRKKSRDKG 750
Cdd:cd00017     1 KNSEKAAQYKDKELRKCCLDGMRENPMGQTCEERAAYItDGKECRKAFLECCVYAEELRDEEREDG 66
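
A pair of alignment rows like the one above can be summarized as a count of identical positions. The sketch below assumes the display convention that uppercase letters mark aligned residues while lowercase letters and `-` mark unaligned or gapped columns; that convention is an assumption about this report format, and the function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row, subject_row):
    """Count identical residues over columns where both rows are aligned.

    A column counts as aligned only if both characters are uppercase
    letters; gaps ('-') and lowercase residues are skipped.
    Returns (identical, aligned).
    """
    if len(query_row) != len(subject_row):
        raise ValueError("rows must have the same number of columns")
    identical = aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    # The two rows of the cd00017 (ANATO) alignment, copied from the report.
    query = "AINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRAARV-QQPDCREPFLSCCQFAESLRKKSRDKG"
    model = "KNSEKAAQYKDKELRKCCLDGMRENPMGQTCEERAAYItDGKECRKAFLECCVYAEELRDEEREDG"
    identical, aligned = aligned_identity(query, model)
    print(f"{identical}/{aligned} aligned positions identical")
```

For multi-block alignments the same function can be applied per block and the counts summed.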
 
Name Accession Description Interval E-value
complement_C3_C4_C5 cd02896
Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, ...
996-1315 6.29e-116

Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239226 [Multi-domain]  Cd Length: 297  Bit Score: 368.52  E-value: 6.29e-116
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  996 SPGGVASLLRLPRGCGEQTMIYLAPTLAASRYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSRGSS 1075
Cdd:cd02896     1 SPEGLEKLIRLPTGCGEQTMIKLAPTVYALRYLDTTNQWEKLGPERRDEALKYIRQGYQRQLSYRKPDGSYAAWKNRPSS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1076 TWLTAFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGGLVGNDETVALTAFVTIALHHGLAV 1155
Cdd:cd02896    81 TWLTAFVVKVFSLARKYIPVDQNVICGSVNWLISNQKPDGSFQEPSPVIHREMTGGVEGSEGDVSLTAFVLIALQEARSI 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1156 FqdegaEPLKQRVEASISKASSFLGEKASagllgaHAAAITAYALT---LTKAPADLRGVAHNNLMAMAQETGDNLYWgs 1232
Cdd:cd02896   161 C-----PPEVQNLDQSIRKAISYLENQLP------NLQRPYALAITayaLALADSPLSHAANRKLLSLAKRDGNGWYW-- 227
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1233 vtgsqsnavsptPAPRNPSDPMPQAPALWIETTAYALLHLLLHeGKAEMADQAAAWLTRQGSFQGGFRSTQDTVIALDAL 1312
Cdd:cd02896   228 ------------WTIDSPYWPVPGPSAITVETTAYALLALLKL-GDIEYANPIARWLTEQRNYGGGFGSTQDTVVALQAL 294

                  ...
gi 178557739 1313 SAY 1315
Cdd:cd02896   295 AEY 297
C345C smart00643
Netrin C-terminal Domain;
1614-1724 8.72e-34

Netrin C-terminal Domain;


Pssm-ID: 214759  Cd Length: 114  Bit Score: 126.33  E-value: 8.72e-34
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   1614 MKFACYYPRvEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVK-AAANQMRNFLVRASCR--LRLEPGKEYLIMGLDG 1690
Cdd:smart00643    1 LEKACKSDV-DYVYKVKVLSVEEEGGFDKYTVKILEVIKSGTDELvRGKNKLRVFISRASCRcpLLLKLGKSYLIMGKSG 79
                            90       100       110
                    ....*....|....*....|....*....|....
gi 178557739   1691 ATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQR 1724
Cdd:smart00643   80 DLWDAKGRGQYVLGKNSWVEEWPTEEECRLRRLQ 113
NTR pfam01759
UNC-6/NTR/C345C module; Sequence similarity between netrin UNC-6 and C345C complement protein ...
1615-1724 2.28e-25

UNC-6/NTR/C345C module; Sequence similarity between netrin UNC-6 and C345C complement protein family members, and hence the existence of the UNC-6 module, was first reported in. Subsequently, many additional members of the family were identified on the basis of sequence similarity between the C-terminal domains of netrins, complement proteins C3, C4 and C5, secreted frizzled-related proteins, and type I procollagen C-proteinase enhancer proteins (PCOLCEs), which are homologous with the N-terminal domains of tissue inhibitors of metalloproteinases (TIMPs). The TIMPs are classified as a separate family in Pfam (pfam00965). This expanded domain family has been named the NTR module.


Pssm-ID: 396359  Cd Length: 106  Bit Score: 102.04  E-value: 2.28e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1615 KFACYypRVEYGFQVKVLREDSRAAFRLFETKITQVLHfTKDVKAAANQMRNFLVRASCR-LRLEPGKEYLIMGLDGaty 1693
Cdd:pfam01759    1 KKACK--GSDYVYKVKVLSVEEEGSFDKYTVKVKEVLK-EGTDKIQRGKVRLFLKRGDCRcPQLRLGKEYLIMGKVG--- 74
                           90       100       110
                   ....*....|....*....|....*....|.
gi 178557739  1694 DLEGHPQYLLDSNSWIEEMPSERLCRSTRQR 1724
Cdd:pfam01759   75 DLEGRGRYVLDKNSWVEPWPTKWECKLRELQ 105
A2M_BRD pfam07703
Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins ...
473-610 2.76e-24

Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. A2M-like proteins have also been identified in pathogenically invasive bacteria and in species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain encompasses macroglobulin-like domains MG5 and MG6, including the bait region. In Salmonella enterica ser A2Ms, this domain encompasses MG7 and MG8, including the bait region. The bait region is cleaved by proteases, followed by a large conformational change that blocks the target protease within a cage-like complex. This model of protease entrapment is known as the Venus flytrap mechanism.


Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 100.12  E-value: 2.76e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   473 LSIErPDSRPPRVGDTLNLNLRAVGSGATFSHY-YYMILSRGQIVFMNRepKRTLTSVSVFVDHHLAPSFYFVAFYYHGD 551
Cdd:pfam07703    1 LHLS-TDKTEYKPGETATVTVKSPFDGTVERDGfTYLVLSKGQIVVVGR--GGVTTSFSLPVTAEMAPSARVVAYYVRVD 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 178557739   552 HP----VANSLRVDVQAGaCEGKLELSVDgAKQYRNGESVKLHLETDSLALVALGALDTALYA 610
Cdd:pfam07703   78 LSkpevVADSVWVDVDDT-CENKLKVTLS-AEKYRPGSTVELKVKADPGAYVALAAVDKGVLL 138
ANATO cd00017
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
686-750 1.81e-22

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to repeats in fibulins.


Pssm-ID: 237984  Cd Length: 70  Bit Score: 92.52  E-value: 1.81e-22
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 178557739  686 AINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRAARV-QQPDCREPFLSCCQFAESLRKKSRDKG 750
Cdd:cd00017     1 KNSEKAAQYKDKELRKCCLDGMRENPMGQTCEERAAYItDGKECRKAFLECCVYAEELRDEEREDG 66
ANATO smart00104
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
702-736 8.50e-13

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.


Pssm-ID: 197517  Cd Length: 35  Bit Score: 63.89  E-value: 8.50e-13
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 178557739    702 CCQDGVTRLPMMRSCEQRAARVQQPDCREPFLSCC 736
Cdd:smart00104    1 CCADGMRLAPMGETCEERAARINSGDCRKAFLQCC 35
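As a rough illustration of how an alignment block such as the one above can be summarized, the following Python sketch counts identical columns between the query row and the model row. The function name `alignment_identity` is hypothetical and not part of any NCBI tool. It assumes that `-` marks a gap and that lowercase letters mark unaligned residues, as in the other blocks on this page.

```python
def alignment_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical_columns, aligned_columns) for two equal-length rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Skip gap columns and lowercase (unaligned) residues.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the smart00104 alignment above (query residues 702-736).
query_row = "CCQDGVTRLPMMRSCEQRAARVQQPDCREPFLSCC"
model_row = "CCADGMRLAPMGETCEERAARINSGDCRKAFLQCC"

identical, aligned = alignment_identity(query_row, model_row)
print(f"{identical}/{aligned} identical columns ({100 * identical / aligned:.1f}%)")
```

Percent identity computed this way is only a descriptive summary. The bit score and E-value reported for each hit come from the position-specific scoring model, not from this count.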
ANATO pfam01821
Anaphylotoxin-like domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated ...
702-736 8.93e-13

Anaphylotoxin-like domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.


Pssm-ID: 460347  Cd Length: 36  Bit Score: 63.83  E-value: 8.93e-13
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 178557739   702 CCQDGVTRLPMMRSCEQRAARVQQ-PDCREPFLSCC 736
Cdd:pfam01821    1 CCLDGMKRNPMGRSCEQRAARIKEgPRCRKAFLQCC 36
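Each hit on this page carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". A minimal sketch of how such a line could be parsed into typed fields is shown below. The regular expression and the function name `parse_summary` are assumptions based only on the lines visible here, not an official NCBI format specification.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one 'Pssm-ID: ...' summary line into a dictionary."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        # The optional bracketed flag is "Multi-domain" on some hits.
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


line = "Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 100.12  E-value: 2.76e-24"
print(parse_summary(line))
```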
 
Name Accession Description Interval E-value
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
982-1315 5.75e-123

A-macroglobulin TED domain; This entry corresponds to the TED domain of complement components such as C3, C4 and C5. This domain contains a short, highly conserved region of proteinase-binding alpha-macroglobulins that includes the cysteine and glutamine of a thiol-ester bond. This bond is cleaved at the moment of proteinase binding and mediates the covalent binding of the alpha-macroglobulin to the proteinase. The GCGEQ motif is highly conserved.


Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 388.58  E-value: 5.75e-123
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   982 ASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAPTLAASRYLDKTEQWStlpPETKDHAVDLIQKGYMRIQQFRK 1061
Cdd:pfam07678    1 ISVVGDIMGPAIQVVPENLSSLLRLPYGCGEQNMVLFAPNVYVLRYLDKTNQLT---KLIKSKAIDYLEQGYQRQLSYKH 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1062 ADGSYAAWLSRGSSTWLTAFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGGLVGNdetVAL 1141
Cdd:pfam07678   78 PDGSYSAFGHSPGSTWLTAFVLKVFAQARKFIFIDPEEICQSLRWLLSQQKPDGSFREPGPLLHRAMKGGVDGE---VSL 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1142 TAFVTIALHHGLAVFQdegaepLKQRVEASISKASSFLGEKASAGLLGAHAAAITAYALTLTKAPADlRGVAHNNLMAMA 1221
Cdd:pfam07678  155 TAYVTIALLEALDING------LLQRVHPSIRKALTYLEQAQLAGLTSPYTLAILAYALALAGSPET-REELLKSLDAMA 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1222 QETGDNLYWGSVTGSQSNAVSPtpaprnpsdPMPQAPALWIETTAYAlLHLLLHEGKAEMADQAAAWLTRQGSFQGGFRS 1301
Cdd:pfam07678  228 REEGNSRYWERDEKSDPQGVPE---------YPPQAPSLEVETTAYA-LLAYLLLGDLTYADPIVKWLTSQRNSHGGFSS 297
                          330
                   ....*....|....
gi 178557739  1302 TQDTVIALDALSAY 1315
Cdd:pfam07678  298 TQDTVVALQALAEY 311
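The description of this hit notes that the GCGEQ motif is highly conserved. A minimal sketch of how the motif can be mapped to query residue numbers from an alignment row follows. The helper `motif_positions` is hypothetical. It assumes that the number printed before a row is the position of its first residue and that lowercase letters are real query residues that were simply left unaligned.

```python
def motif_positions(row: str, start: int, motif: str) -> list[tuple[int, int]]:
    """Map motif matches in an alignment row to (first, last) residue numbers."""
    # Gaps are not residues, so drop them; lowercase residues still count.
    residues = row.replace("-", "").upper()
    hits = []
    index = residues.find(motif)
    while index != -1:
        hits.append((start + index, start + index + len(motif) - 1))
        index = residues.find(motif, index + 1)
    return hits


# First query row of the pfam07678 alignment above, which starts at residue 982.
first_row = (
    "ASDPLDTLGSEGALSPGGVASLLRLPRGCGEQTMIYLAPTLAASRYLDKTEQWStlp"
    "PETKDHAVDLIQKGYMRIQQFRK"
)
print(motif_positions(first_row, 982, "GCGEQ"))
```

Applied to this row, the motif maps to residues 1009-1013 of the query, which puts the motif's cysteine at 1010 and its glutamine at 1013 in this record's numbering.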
complement_C3_C4_C5 cd02896
Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, ...
996-1315 6.29e-116

Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239226 [Multi-domain]  Cd Length: 297  Bit Score: 368.52  E-value: 6.29e-116
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  996 SPGGVASLLRLPRGCGEQTMIYLAPTLAASRYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSRGSS 1075
Cdd:cd02896     1 SPEGLEKLIRLPTGCGEQTMIKLAPTVYALRYLDTTNQWEKLGPERRDEALKYIRQGYQRQLSYRKPDGSYAAWKNRPSS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1076 TWLTAFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGGLVGNDETVALTAFVTIALHHGLAV 1155
Cdd:cd02896    81 TWLTAFVVKVFSLARKYIPVDQNVICGSVNWLISNQKPDGSFQEPSPVIHREMTGGVEGSEGDVSLTAFVLIALQEARSI 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1156 FqdegaEPLKQRVEASISKASSFLGEKASagllgaHAAAITAYALT---LTKAPADLRGVAHNNLMAMAQETGDNLYWgs 1232
Cdd:cd02896   161 C-----PPEVQNLDQSIRKAISYLENQLP------NLQRPYALAITayaLALADSPLSHAANRKLLSLAKRDGNGWYW-- 227
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1233 vtgsqsnavsptPAPRNPSDPMPQAPALWIETTAYALLHLLLHeGKAEMADQAAAWLTRQGSFQGGFRSTQDTVIALDAL 1312
Cdd:cd02896   228 ------------WTIDSPYWPVPGPSAITVETTAYALLALLKL-GDIEYANPIARWLTEQRNYGGGFGSTQDTVVALQAL 294

                  ...
gi 178557739 1313 SAY 1315
Cdd:cd02896   295 AEY 297
NTR_complement_C4 cd03584
NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known ...
1588-1742 7.40e-82

NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C4 is a key player in the activation of the complement classical pathway. C4 is cleaved by activated C1 to yield the C4a anaphylatoxin and the larger fragment C4b, an essential component of the C3- and C5-convertase enzymes. C4b binds covalently to the surface of pathogens through a reactive thioester. The role of the NTR/C345C domain in C4 (C4b) is unclear.


Pssm-ID: 239639  Cd Length: 153  Bit Score: 265.37  E-value: 7.40e-82
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1588 CQCAEGKCPRQRRALErgLQDEDGYRMKFACYYPRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAANQMRNF 1667
Cdd:cd03584     1 CQCAEGGCPKQKSTFS--KEITKTDRFDFACYSPRVDYAYVVKVLNISEKSNFELYETSITDVLQTTGDVSVKPEETRVF 78
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 178557739 1668 LVRASCRLRLEPGKEYLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQLNDFLQEYGTQGC 1742
Cdd:cd03584    79 LKRLSCKLELKKGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEKIPSEKRCKATRNRSACKQLNEFLKEYKINGC 153
A2M_like cd02891
Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier ...
996-1315 5.61e-76

Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor. The structural thioester of alpha (2)-M is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor: pregnancy zone protein (PZP). PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement. The vertebrate complement system is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239221 [Multi-domain]  Cd Length: 282  Bit Score: 253.85  E-value: 5.61e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  996 SPGGVASLLRLPRGCGEQTMIYLAPTLAASRYLDKTEQWStlpPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSRG-S 1074
Cdd:cd02891     1 SLGNLDYLLRYPYGCGEQTMSRAAPNLYVLKYLDATGQLT---PEIREKALEYIRKGYQRLLTYQRSDGSFSAWGNSDsG 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1075 STWLTAFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGglvGNDETVALTAFVTIALHHgla 1154
Cdd:cd02891    78 STWLTAYVVKFLSQARKYIDVDENVLARALGWLVPQQKEDGSFRELGPVIHREMKG---GVDDSVSLTAYVLIALAE--- 151
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1155 vfqdegaepLKQRVEASISKASSFLGEKASAGLLgahaaaitayalTLTKA-------PADLRGVAHNNLMAMAQETGDN 1227
Cdd:cd02891   152 ---------AGKACDASIEKALAYLETQLDGLLD------------PYALAilayalaLAGDSTRADEALKKLLEAAREK 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1228 LYWgsvtgsqsnavsptpAPRNPSDPMPQAPALWIETTAYALLHLLLHEGKAEmADQAAAWLTRQGSFQGGFRSTQDTVI 1307
Cdd:cd02891   211 GGT---------------AHWSLSWPGDYGSSLRVEATAYALLALLKLGDLEE-AGPIAKWLAQQRNSGGGFLSTQDTVV 274

                  ....*...
gi 178557739 1308 ALDALSAY 1315
Cdd:cd02891   275 ALQALAAY 282
A2M_2 cd02897
Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy ...
1002-1315 4.41e-59

Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha (2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZP and alpha (2)-M to the CD91 receptor, clearing them from circulation.


Pssm-ID: 239227  Cd Length: 292  Bit Score: 205.89  E-value: 4.41e-59
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1002 SLLRLPRGCGEQTMIYLAPTLAASRYLDKTEQwstLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAWLSR--GSSTWLT 1079
Cdd:cd02897     7 NLLRMPYGCGEQNMVNFAPNIYVLDYLKATGQ---LTPEIESKALGFLRTGYQRQLTYKHSDGSYSAFGESdkSGSTWLT 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1080 AFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMQGglvGNDETVALTAFVTIAL-HHGLAVFQd 1158
Cdd:cd02897    84 AFVLKSFAQARPFIYIDENVLQQALTWLSSHQKSNGCFREVGRVFHKAMQG---GVDDEVALTAYVLIALlEAGLPSER- 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1159 egaeplkqrveASISKASSFLgEKASAGLLGAHAAAITAYALTLTKAPAdlRGVAHNNLMAMAQETGDNLYWGSvtgsqs 1238
Cdd:cd02897   160 -----------PVVEKALSCL-EAALDSISDPYTLALAAYALTLAGSEK--RPEALKKLDELAISEDGTKHWSR------ 219
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 178557739 1239 navspTPAPRNPSDPMPQAPALWIETTAYALLHLLLHEGKA-EMADQAAAWLTRQGSFQGGFRSTQDTVIALDALSAY 1315
Cdd:cd02897   220 -----PPPSEEGPSYYWQAPSAEVEMTAYALLALLSAGGEDlAEALPIVKWLAKQRNSLGGFSSTQDTVVALQALAKY 292
NTR_complement_C345C cd03574
NTR/C345C domain; The NTR domains that are found in the C-termini of complement C3, C4 and C5, ...
1595-1742 3.95e-48

NTR/C345C domain; The NTR domains found at the C-termini of complement C3, C4 and C5 are also called C345C domains. In C5, the domain interacts with various partners during the formation of the membrane attack complex, a fundamental process in the mammalian defense against infection. Its role in complement C3 and C4 is not well understood.


Pssm-ID: 239629  Cd Length: 147  Bit Score: 168.73  E-value: 3.95e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1595 CPRQRRALErglqDEDGYRMKFACYYprVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAANQMRNFLVRASCR 1674
Cdd:cd03574     1 CPICKRELS----DTCENLLDKACTS--VDYVYKVKVTSVEEEAGFRIYKARVTEVIKSGSDDVQNGNARRTFIIRESCD 74
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 178557739 1675 --LRLEPGKEYLIMGLDGATYD---LEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQLNDFLQEYGTQGC 1742
Cdd:cd03574    75 cpLRLKEGRHYLIMGSDGAFYDdrnGEDRYQYVLDSNTWVEEWPTDSKCRNERQQAACDKLKKFEESMVLQGC 147
ISOPREN_C2_like cd00688
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ...
996-1315 1.29e-39

This group contains class II terpene cyclases, the beta subunit of protein prenyltransferases, two broadly specific proteinase inhibitors, alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP), and the C3, C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY); these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II), which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and is involved in the immobilization and entrapment of proteases. PZP is a pregnancy-associated protein. Alpha (2)-M and PZP are known to bind to, and may modulate the activity of, placental protein-14 in T-cell growth and cytokine production, thereby protecting the allogeneic fetus from attack by the maternal immune system.


Pssm-ID: 238362 [Multi-domain]  Cd Length: 300  Bit Score: 150.01  E-value: 1.29e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  996 SPGGVASLLRLPRG--------CGEQTMIYLAPTLAASRYLDKTEqwstlppeTKDHAVDLIQKGYMRIQQFRKADGSYA 1067
Cdd:cd00688     1 IEKHLKYLLRYPYGdghwyqslCGEQTWSTAWPLLALLLLLAATG--------IRDKADENIEKGIQRLLSYQLSDGGFS 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1068 AWLSRG-SSTWLTAFVLKVLSLAQEQVGGSPEKLQETSNWLLSQQQADGSFQDLSPVIHRSMqgglvGNDETVALTAFVT 1146
Cdd:cd00688    73 GWGGNDyPSLWLTAYALKALLLAGDYIAVDRIDLARALNWLLSLQNEDGGFREDGPGNHRIG-----GDESDVRLTAYAL 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1147 IALhhglavfqdegAEPLKQRVEASISKASSFLGEKASAGLLGAHAAAITAYAL-----TLTKAPADLRGVAHNNLMAMA 1221
Cdd:cd00688   148 IAL-----------ALLGKLDPDPLIEKALDYLLSCQNYDGGFGPGGESHGYGTacaaaALALLGDLDSPDAKKALRWLL 216
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1222 QETGDNLYWGSvtgsqsnavsptpaprNPSDPMPQAPALWIETTAYALLHLLLHEGKAEmADQAAAWLTRQGSFQGGFRS 1301
Cdd:cd00688   217 SRQRPDGGWGE----------------GRDRTNKLSDSCYTEWAAYALLALGKLGDLED-AEKLVKWLLSQQNEDGGFSS 279
                         330       340
                  ....*....|....*....|.
gi 178557739 1302 -------TQDTVIALDALSAY 1315
Cdd:cd00688   280 kpgksydTQHTVFALLALSLY 300
A2M_recep pfam07677
A-macroglobulin receptor binding domain; This family includes the receptor binding domain ...
1481-1570 6.90e-34

A-macroglobulin receptor binding domain; This family includes the receptor binding domain region of the alpha-2-macroglobulin family.


Pssm-ID: 462226  Cd Length: 92  Bit Score: 125.76  E-value: 6.90e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1481 SGMAIADVTLLSGFHALRADLEKLTSlsDRYVSHFETEGP-HVLLYFDSVPTSRECVGFEAVQEVPVGLVQPASATLYDY 1559
Cdd:pfam07677    3 SNMAILEVGLPSGFVPDEEDLKKLGV--DPLIKRVETVDDgKVILYLDKLSGEPLCFSFRAEQTFPVANLKPAPVKVYDY 80
                           90
                   ....*....|.
gi 178557739  1560 YNPERRCSVFY 1570
Cdd:pfam07677   81 YEPERRATTFY 91
C345C smart00643
Netrin C-terminal Domain;
1614-1724 8.72e-34

Netrin C-terminal Domain;


Pssm-ID: 214759  Cd Length: 114  Bit Score: 126.33  E-value: 8.72e-34
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   1614 MKFACYYPRvEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVK-AAANQMRNFLVRASCR--LRLEPGKEYLIMGLDG 1690
Cdd:smart00643    1 LEKACKSDV-DYVYKVKVLSVEEEGGFDKYTVKILEVIKSGTDELvRGKNKLRVFISRASCRcpLLLKLGKSYLIMGKSG 79
                            90       100       110
                    ....*....|....*....|....*....|....
gi 178557739   1691 ATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQR 1724
Cdd:smart00643   80 DLWDAKGRGQYVLGKNSWVEEWPTEEECRLRRLQ 113
A2M pfam00207
Alpha-2-macroglobulin family; This family includes the C-terminal region of the ...
781-869 2.08e-33

Alpha-2-macroglobulin family; This family includes the C-terminal region of the alpha-2-macroglobulin family.


Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 124.24  E-value: 2.08e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   781 NWLWRVETV--DRFQILTLWLPDSLTTWEIHGLSLSKTKGLCVATPVQLRVFREFHLHLRLPMSVRRFEQLELRPVLYNY 858
Cdd:pfam00207    1 TWLWDPVLVtdNGKASLSFTLPDSITTWRATAFALSPDTGLGVAEPPELVVFKPFFVDLNLPYSVRRGEQFELKATVFNY 80
                           90
                   ....*....|.
gi 178557739   859 LDKNLTVSVHV 869
Cdd:pfam00207   81 LDKCLKVRVRL 91
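Several of the hits listed up to this point cover overlapping stretches of the query, for example the models aligned to 982-1315, 996-1315 and 1002-1315. The sketch below collapses such hits into non-overlapping regions using a standard interval merge. It is only an illustration and not how CD-Search itself groups or ranks hits. The name `merge_intervals` is hypothetical, and the intervals are copied from the "Interval" values of the entries above.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) residue intervals, both ends inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# (accession, interval) pairs taken from the hits listed above.
hits = [
    ("pfam07678", (982, 1315)),
    ("cd02896", (996, 1315)),
    ("cd03584", (1588, 1742)),
    ("cd02891", (996, 1315)),
    ("cd02897", (1002, 1315)),
    ("cd03574", (1595, 1742)),
    ("cd00688", (996, 1315)),
    ("pfam07677", (1481, 1570)),
    ("smart00643", (1614, 1724)),
    ("pfam00207", (781, 869)),
]

regions = merge_intervals(interval for _, interval in hits)
print(regions)
```

For these ten hits the merge yields four regions, corresponding to the A2M family region, the thioester-containing region, the receptor-binding region and the C-terminal NTR/C345C region.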
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
138-1336 1.31e-28

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 125.96  E-value: 1.31e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  138 RGHLFlqTDQPIYNPGQRVRYRVFALDQKMRPSTDT-ITVMVENSHGLRVRKKEVYMPSS-IFQDDFVIPDISEPGTWKI 215
Cdd:COG2373   370 DAFLF--TDRGIYRPGETVHLKALLRDADGKAPAGLpLTLELTDPDGKEVRRQTLTLNEFgGYSFSFPLPEDAPTGTWRL 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  216 SARfSDGLESNSSTQFEVKKYVLPNFEVKITPGKPYILtvPGhlDEMQLDIQARYIYGKP-----VQGVAYVR------- 283
Cdd:COG2373   448 ELY-VDPKPALGSKSFRVEEFKPPRFKVDLTLDKEPLK--PG--DPVTVTVDARYLFGAPaaglkVEGEVTLRpartafp 522
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  284 ------FGLLDEDGKKTFFrgLESQTKL-VNGQSHISLSKAEFQDALeklnmgitdlQGLRLYVAAAIIEsPGGEMEEAE 356
Cdd:COG2373   523 gypgyrFGDPDEEFEPEEL--DLGEGTLdADGKASLSLPLPDAPDAP----------GPLRATVEASVFE-SGGRPVTRS 589
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  357 LTSWYFVSSPF---SLDLSKTKRhlvPGAPFLLQALVREMSGSPASGIPVKV--------SATVSSPG----SVPEVQD- 420
Cdd:COG2373   590 ATVPVHPADFYvgiRLPLFDGDP---EGAPATFEVVAVDPDGKPVAGKGLKVelyreewrYVWYKSDDggwrYESQEKEe 666
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  421 ------IQQNTDGSGQVSIPiiiPQTISELQLSVS-AGSPHPAIARLTVAAPPSGG---PGFLSIErPDSRPPRVGDTLN 490
Cdd:COG2373   667 pvaegtLTTGADGPASLSLT---PVEWGRYRLEVKdPDGGLATSVRFYAGGNASWGaerPDRLELS-LDKESYKPGETAK 742
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  491 LNLRA--VGSGatfshyYYMILSRGQIVFMNREPKRTLTSVSVFVDHHLAPSFYFVAFYYHGDHPVAN----------SL 558
Cdd:COG2373   743 LLIQSpfAGRA------LVTVERDGVLETQWVDVKGGGTTVEIPVTEDWAPNAYVSATLVRPGDSTANdmparaygvaPL 816
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  559 RVDVQAGacegKLELSVDGAKQYRNGESVKLHLETDSL----ALVALGALDTALYA-AGSKSHKPL-------------- 619
Cdd:COG2373   817 PVDPPAR----RLKVELTAPEKLRPGETLTVTVKVKGAagkaAEVTLAAVDEGILNlTGYKTPDPLdffygkralgvetr 892
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  620 -NMGKVFEAMnSYDLGCGPGGGDSALQvfqaaglafsdgdqwtlsrkrlscpKEKTTRKKRNvNFQkaineklgqyasPT 698
Cdd:COG2373   893 dLYGRLIGAF-GGAAGALRSGGDGALG-------------------------RGGNPKPPRK-RFK------------PV 933
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  699 AkrccqdgvtrlpmmrsceqraarvqqpdcrepflsccQFAESLrkKSRDKGQAglqraleilqeedlideddipvrsff 778
Cdd:COG2373   934 A-------------------------------------LFSGPV--KTDADGKA-------------------------- 948
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  779 penwlwrveTVdRFQiltlwLPDSLTTWEIHGLSLSKT------KGLCVATPVQLRvfrefhlhLRLPmsvrRF----EQ 848
Cdd:COG2373   949 ---------TV-SFD-----LPDFNGTLRVMAVAWSDDrfgsaeATVTVRKPLVVR--------PSLP----RFlapgDR 1001
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  849 LELRPVLYNYLDKNLTVSVHVS---PVEGLCLAGggglaQQVLVPAGSARPVAFSVVPTAATAVSLKVVARGSFEfpvGD 925
Cdd:COG2373  1002 FELPVDVFNLTGKAGTVTVTLEasgGLTLEGEAT-----QTVTLAAGGRATVRFPLKAPDAGDAKVTVTATGGGE---SD 1073
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  926 AVSKVLQIEKEGAIHREELVYELNPldhrGRTLEIPGNSDPNMIPDgdFNSY-VRVTASDPLDTlgsegalsPGGVASLL 1004
Cdd:COG2373  1074 AREVELPVRPANPLVTRATSGVLAP----GESWTLPLDLPGGLRPG--TGSLtLSLSSSPPLDL--------AGLLRYLL 1139
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1005 RLPRGCGEQTMIYLAPTLaasrYLDKTEQWSTLPPETKDHAVDLIQKGYMRIQQFRKADGSYAAW-LSRGSSTWLTAFVL 1083
Cdd:COG2373  1140 RYPYGCTEQTTSRALPLL----YLSDLAEALGLKGDKDAELRARIQAAIARLLSMQNSDGGFGLWpGGSESDPWLTAYAT 1215
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1084 KVLSLAQEQvGGS-PEK-LQETSNWLLSQQQADGSFQDLSPvihrsmqgglvGNDETVALTAFVtialhhgLAVFQ--DE 1159
Cdd:COG2373  1216 DFLLEAREA-GYAvPDDaLDRALDYLRNYLRNPWEIEYDDA-----------YRLAVRAYALYV-------LARAGkaDL 1276
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1160 GA-EPLKQRVEASIS-KASSFLGekasagllgahaaaitayaLTLTKAPADLRGV-AHNNLMAMAQETGDNLYWGSVTGS 1236
Cdd:COG2373  1277 GDlRYLYDRRKDALSpLAKAQLA-------------------AALALLGDKARAEeLLAAALARLRETGARDYWYGDYGS 1337
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1237 QsnavsptpaprnpsdpmpqapalwIETTAYALLHLLLHEGKAEMADQAAAWLTRQGSfQGGFRSTQDTVIALDALSAYw 1316
Cdd:COG2373  1338 P------------------------LRDQALALALLAELGPDAPLAPKLARWLAKALK-SGRWLSTQETAWALLALAAY- 1391
                        1290      1300
                  ....*....|....*....|
gi 178557739 1317 iASHTTEERGLNVTLSSTGR 1336
Cdd:COG2373  1392 -ARAAGASPDFTATLTLDGK 1410
NTR pfam01759
UNC-6/NTR/C345C module; Sequence similarity between netrin UNC-6 and C345C complement protein ...
1615-1724 2.28e-25

UNC-6/NTR/C345C module; Sequence similarity between netrin UNC-6 and C345C complement protein family members, and hence the existence of the UNC-6 module, was first reported in. Subsequently, many additional members of the family were identified on the basis of sequence similarity between the C-terminal domains of netrins, complement proteins C3, C4 and C5, secreted frizzled-related proteins, and type I pro-collagen C-proteinase enhancer proteins (PCOLCEs), which are homologous with the N-terminal domains of tissue inhibitors of metalloproteinases (TIMPs). The TIMPs are classified as a separate family in Pfam (pfam00965). This expanded domain family has been named the NTR module.


Pssm-ID: 396359  Cd Length: 106  Bit Score: 102.04  E-value: 2.28e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739  1615 KFACYypRVEYGFQVKVLREDSRAAFRLFETKITQVLHfTKDVKAAANQMRNFLVRASCR-LRLEPGKEYLIMGLDGaty 1693
Cdd:pfam01759    1 KKACK--GSDYVYKVKVLSVEEEGSFDKYTVKVKEVLK-EGTDKIQRGKVRLFLKRGDCRcPQLRLGKEYLIMGKVG--- 74
                           90       100       110
                   ....*....|....*....|....*....|.
gi 178557739  1694 DLEGHPQYLLDSNSWIEEMPSERLCRSTRQR 1724
Cdd:pfam01759   75 DLEGRGRYVLDKNSWVEPWPTKWECKLRELQ 105
A2M_BRD pfam07703
Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins ...
473-610 2.76e-24

Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins have also been identified in pathogenically invasive bacteria and in species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain encompasses macroglobulin-like domains MG5 and MG6, including the bait region. In Salmonella enterica ser A2Ms, this domain encompasses MG7 and MG8, including the bait region. The bait region is cleaved by proteases, followed by a large conformational change that traps the target protease within a cage-like complex. This model of protease entrapment is known as the Venus flytrap mechanism.


Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 100.12  E-value: 2.76e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   473 LSIErPDSRPPRVGDTLNLNLRAVGSGATFSHY-YYMILSRGQIVFMNRepKRTLTSVSVFVDHHLAPSFYFVAFYYHGD 551
Cdd:pfam07703    1 LHLS-TDKTEYKPGETATVTVKSPFDGTVERDGfTYLVLSKGQIVVVGR--GGVTTSFSLPVTAEMAPSARVVAYYVRVD 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 178557739   552 HP----VANSLRVDVQAGaCEGKLELSVDgAKQYRNGESVKLHLETDSLALVALGALDTALYA 610
Cdd:pfam07703   78 LSkpevVADSVWVDVDDT-CENKLKVTLS-AEKYRPGSTVELKVKADPGAYVALAAVDKGVLL 138
MG3 pfam17791
Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement ...
235-320 1.13e-22

Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement components C3, C4 and C5.


Pssm-ID: 465509  Cd Length: 83  Bit Score: 93.49  E-value: 1.13e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   235 KYVLPNFEVKITPgkPYILTVPGhlDEMQLDIQARYIYGKPVQGVAYVRFGLLDEDGKKTFFrgLESQTKLVNGQSHISL 314
Cdd:pfam17791    1 EYVLPKFEVKVEV--PKFISVKD--EEFQVTICAKYTYGKPVKGKAYVTLCLKDDSKRKCFE--SFSKELDKDGCGSASL 74

                   ....*.
gi 178557739   315 SKAEFQ 320
Cdd:pfam17791   75 STEEFQ 80
ANATO cd00017
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
686-750 1.81e-22

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to repeats in fibulins.


Pssm-ID: 237984  Cd Length: 70  Bit Score: 92.52  E-value: 1.81e-22
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 178557739  686 AINEKLGQYASPTAKRCCQDGVTRLPMMRSCEQRAARV-QQPDCREPFLSCCQFAESLRKKSRDKG 750
Cdd:cd00017     1 KNSEKAAQYKDKELRKCCLDGMRENPMGQTCEERAAYItDGKECRKAFLECCVYAEELRDEEREDG 66
MG2 pfam01835
MG2 domain; This is the MG2 (macroglobulin) domain of alpha-2-macroglobulin in eukaryotes. ...
141-233 5.32e-19

MG2 domain; This is the MG2 (macroglobulin) domain of alpha-2-macroglobulin in eukaryotes. Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins were identified in pathogenically invasive bacteria and species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain is termed macroglobulin-like (MG) domain 2 and in Salmonella enterica ser A2Ms, this is domain 4.


Pssm-ID: 426464 [Multi-domain]  Cd Length: 95  Bit Score: 83.52  E-value: 5.32e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   141 LFLQTDQPIYNPGQRVRYRVFALDQKMRPSTDT-ITVMVENSHGLRVRKKE-VYMPSSIFQDDFVIPDISEPGTWKISAR 218
Cdd:pfam01835    2 AFVYTDRGIYRPGETVHFKGLLRDQDLRPLAGLpVTLTVTDPDGNEVRRLPlTTDEFGGFSGSFPLPETAPTGTYTVVLR 81
                           90
                   ....*....|....*
gi 178557739   219 FSDGlESNSSTQFEV 233
Cdd:pfam01835   82 DGAG-GSLGSGSFRV 95
NTR_complement_C3 cd03583
NTR/C345C domain, complement C3 subfamily; The NTR domain found in complement C3 is also known ...
1590-1742 1.32e-16

NTR/C345C domain, complement C3 subfamily; The NTR domain found in complement C3 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C3 plays a pivotal role in the activation of the complement system, as all pathways (classical, alternative, and lectin) result in the processing of C3 by C3 convertase. The larger fragment, activated C3b, contains the NTR/C345C domain and binds covalently, via a reactive thioester, to cell surface carbohydrates, including components of bacterial cell walls and immune aggregates. The smaller cleavage product, C3a, acts independently as a diffusible signal to mediate local inflammatory processes. The structure of C3 shows that the NTR/C345C domain is located in an exposed position relative to the rest of the molecule. The function of the domain in complement C3 is poorly understood.


Pssm-ID: 239638  Cd Length: 149  Bit Score: 78.55  E-value: 1.32e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1590 CAEGKCPRQRRalERGLQDEDgyRMKFACYyPRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVkAAANQMRNFLV 1669
Cdd:cd03583     1 CAEENCSMQKK--GDKVTNDE--RIDKACE-PGVDYVYKVKLVNVELSDSYDIYTMEILQVIKEGTDE-GPEGKTRTFIS 74
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 178557739 1670 RASCR--LRLEPGKEYLIMGLDGATYDLEGHPQYLLDSNSWIEEMPSERLCRSTRQRAACAQLNDFLQEYGTQGC 1742
Cdd:cd03583    75 HPKCReaLNLKEGKDYLIMGLSSDLWRIKDKYSYVIGKDTWIEYWPTEDECQDEENQKLCLDLAEFSEQLTVFGC 149
ANATO smart00104
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
702-736 8.50e-13

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.


Pssm-ID: 197517  Cd Length: 35  Bit Score: 63.89  E-value: 8.50e-13
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 178557739    702 CCQDGVTRLPMMRSCEQRAARVQQPDCREPFLSCC 736
Cdd:smart00104    1 CCADGMRLAPMGETCEERAARINSGDCRKAFLQCC 35
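
The pairwise rows in these alignment blocks can be summarized as a simple identity count. The following is a minimal sketch, not part of the CD-Search output: the function name `aligned_identity` is hypothetical, and the sketch assumes that `-` marks a gap and that lowercase letters mark residues that are not aligned to the domain model. The two sequences are the smart00104 rows shown above.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: '-' is a gap and lowercase residues are unaligned, so
    columns containing either are skipped rather than counted as mismatches.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned residue: not an aligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the smart00104 block (query residues 702-736).
QUERY = "CCQDGVTRLPMMRSCEQRAARVQQPDCREPFLSCC"
SUBJECT = "CCADGMRLAPMGETCEERAARINSGDCRKAFLQCC"

identical, aligned = aligned_identity(QUERY, SUBJECT)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

For longer hits that span several 80-column blocks, the rows of each block would need to be concatenated before calling the function.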
ANATO pfam01821
Anaphylotoxin-like domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated ...
702-736 8.93e-13

Anaphylotoxin-like domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.


Pssm-ID: 460347  Cd Length: 36  Bit Score: 63.83  E-value: 8.93e-13
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 178557739   702 CCQDGVTRLPMMRSCEQRAARVQQ-PDCREPFLSCC 736
Cdd:pfam01821    1 CCLDGMKRNPMGRSCEQRAARIKEgPRCRKAFLQCC 36
NTR_complement_C5 cd03582
NTR/C345C domain, complement C5 subfamily; The NTR domain found in complement C5 is also known ...
1583-1742 3.78e-11

NTR/C345C domain, complement C5 subfamily; The NTR domain found in complement C5 is also known as C345C because it occurs at the C-terminus of complement C3, C4 and C5. Complement C5 is activated by C5 convertase, which itself is a complex between C3b and C3 convertase. The small cleavage fragment, C5a, is the most important small peptide mediator of inflammation, and the larger active fragment, C5b, initiates late events of complement activation. The NTR/C345C domain is important in the function of C5 as it interacts with enzymes that convert C5 to the active form, C5b. The domain has also been found to bind to complement components C6 and C7, and may specifically interact with their factor I modules.


Pssm-ID: 239637  Cd Length: 150  Bit Score: 62.91  E-value: 3.78e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1583 CSAEVCQCAEGKCPRQRRALERGlqdedgyrmKFACYyPRVEYGFQVKVLREDSRAAFRLFETKITQVLHFTKDVKAAAN 1662
Cdd:cd03582     1 CVAAQCQCFAAACDVTITAARRK---------SETCK-EQIAYAYKVMIKSSAAEGDFVTYKATVLDVLKNGQAELEKDS 70
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739 1663 QMrNFLVRASC-RLRLEPGKEYLIMGLDGATYDLEG--HPQYLLDSNSWIEEMPSERLCRSTRQRAacAQLNDFLQEYGT 1739
Cdd:cd03582    71 EV-TLVKKATCtSVELQEGQQYLIMGKEALKIRLNRsfRYRYPLDSEAWIEWWPTDTGCPECQDFL--NQLDDFAEDLQL 147

                  ...
gi 178557739 1740 QGC 1742
Cdd:cd03582   148 MGC 150
MG4 pfam17789
Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.
369-464 1.23e-07

Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.


Pssm-ID: 465507  Cd Length: 95  Bit Score: 51.10  E-value: 1.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 178557739   369 LDLSKTKRHLVPGAPFLLQALVREMSGSPASGIPVKVSATVSSPGSVPEvqdiqqnTDGSGQVSIPIIIPQTISELQLSV 448
Cdd:pfam17789    1 ITFEKTPKYFKPGLPFSGQVLVVDPDGSPAPNVPVFIEAGNTEFNQNLT-------TDEDGTAQFSINTPGNAASLSITV 73
                           90       100
                   ....*....|....*....|.
gi 178557739   449 SAGSPHP-----AIARLTVAA 464
Cdd:pfam17789   74 KTKDPDLcpehqALAEMYAEA 94
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
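
As an illustration of how the E-value threshold above relates to the hit list, the sketch below filters the hits from this part of the page and merges overlapping intervals into non-redundant regions. It is a hedged example, not NCBI code: the names `Hit`, `passing_hits` and `merge_regions` are hypothetical, treating the cutoff as inclusive is an assumption, and the pfam01759 interval is taken from the residue numbers in its alignment block.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str
    start: int
    end: int
    evalue: float


# Intervals and E-values as listed in the domain hits above.
HITS = [
    Hit("pfam01759", 1615, 1724, 2.28e-25),  # interval read from the alignment rows
    Hit("pfam07703", 473, 610, 2.76e-24),
    Hit("pfam17791", 235, 320, 1.13e-22),
    Hit("cd00017", 686, 750, 1.81e-22),
    Hit("pfam01835", 141, 233, 5.32e-19),
    Hit("cd03583", 1590, 1742, 1.32e-16),
    Hit("smart00104", 702, 736, 8.50e-13),
    Hit("pfam01821", 702, 736, 8.93e-13),
    Hit("cd03582", 1583, 1742, 3.78e-11),
    Hit("pfam17789", 369, 464, 1.23e-07),
]

E_VALUE_THRESHOLD = 0.01  # from the preset options


def passing_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Keep hits at or below the threshold (inclusive cutoff is an assumption)."""
    return [h for h in hits if h.evalue <= threshold]


def merge_regions(hits):
    """Merge overlapping hit intervals into sorted, non-overlapping regions."""
    regions = []
    for h in sorted(hits, key=lambda h: (h.start, h.end)):
        if regions and h.start <= regions[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            regions[-1] = (regions[-1][0], max(regions[-1][1], h.end))
        else:
            regions.append((h.start, h.end))
    return regions


kept = passing_hits(HITS)
regions = merge_regions(kept)
print(len(kept), regions)
```

Merging makes it easy to see that the three ANATO hits and the three NTR hits each describe a single region of the protein.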
