Conserved domains on  [gi|2450112274|ref|NP_002448|]
mucin-2 precursor [Homo sapiens]

Protein Classification

Graphical summary

List of domain hits

Name Accession Description Interval E-value
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
849-1008 1.72e-43

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 157.56  E-value: 1.72e-43
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274   849 WVCTQAVCHGTCSIYGSGHYITFDGKYYDFDGHCSYVAVQDycgqNSSLGSFSIITENVPCGtTGVTCSKAIKIFMGRTE 928
Cdd:smart00216    1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGDE 75
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274   929 LKLEDKHRVVIQRDEGHHVAYTTREV-------GQYLVVESSTGII-VIWDKRTTVFIKLAPSYKGTVCGLCGNFDHRSN 1000
Cdd:smart00216   76 IELKDDNGKVTVNGQQVSLPYKTSDGsiqirssGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPE 155

                    ....*...
gi 2450112274  1001 NDFTTRDH 1008
Cdd:smart00216  156 DDFRTPDG 163
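
The alignment rows above can be compared programmatically. The following is a minimal sketch, not part of the NCBI output, that counts identical residues between a query row and a Cdd row. The function name `percent_identity` is hypothetical, and two conventions are assumptions: columns with a gap (`-`) in either row are skipped, and residues are compared case-insensitively.

```python
def percent_identity(query_row: str, model_row: str) -> float:
    """Percent identity over columns where neither row has a gap."""
    if len(query_row) != len(model_row):
        raise ValueError("aligned rows must have equal length")
    compared = 0
    identical = 0
    for q, m in zip(query_row.upper(), model_row.upper()):
        if q == "-" or m == "-":
            continue  # assumption: gap columns are excluded from the count
        compared += 1
        if q == m:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


# Final segment of the first VWD hit (query 1001-1008, model 156-163).
print(percent_identity("NDFTTRDH", "DDFRTPDG"))  # 4 of 8 columns match -> 50.0
```

Longer segments can be scored the same way by concatenating the row fragments of one hit before calling the function.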
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
4422-4589 7.37e-39

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 144.08  E-value: 7.37e-39
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  4422 CCWHWECDCYCTGWGDPHYVTFDGLYYSYQGNCTYVLVEEISPSvDNFGVYIDNYHCDPNdkVSCPRTLIVRHETQEVLI 4501
Cdd:smart00216    2 CCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE-PTFSVLLKNVPCGGG--ATCLKSVKVELNGDEIEL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  4502 KTVhmmpmQVQVQVNRQAVALPYKKYGLEVYQ-SGINYVVDIPELGVL-VSYNGLS-FSVRLPYhRFGNNTKGQCGTCTN 4578
Cdd:smart00216   79 KDD-----NGKVTVNGQQVSLPYKTSDGSIQIrSSGGYLVVITSLGLIqVTFDGLTlLSVQLPS-KYRGKTCGLCGNFDG 152
                           170
                    ....*....|.
gi 2450112274  4579 TTSDDCILPSG 4589
Cdd:smart00216  153 EPEDDFRTPDG 163
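
Each hit carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The sketch below shows one way to turn that line into typed fields and to compute how much of the domain model an alignment spans. The names `parse_summary` and `model_coverage` are hypothetical, and the regular expression assumes the field order seen in this listing.

```python
import re

# Field order as it appears in this listing; the bracketed tag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one Pssm summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a Pssm summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "tag": match["tag"],  # None when no [..] tag is present
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_from: int, model_to: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned interval."""
    return (model_to - model_from + 1) / cd_length


line = ("Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  "
        "Bit Score: 144.08  E-value: 7.37e-39")
hit = parse_summary(line)
print(hit["pssm_id"], hit["tag"], hit["cd_length"], hit["bit_score"])
# The second VWD alignment above spans model positions 2-163.
print(round(model_coverage(2, 163, hit["cd_length"]), 3))  # 162/163 -> 0.994
```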
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
391-545 3.82e-38

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 141.74  E-value: 3.82e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  391 CALEGGSHITTFDGKTYTFHGDCYYVLAKGDHNDS-YALLGELAPCGSTDKQTCLKTVVLLadKKKNVVVFKSDGSVLLN 469
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPdFSFSVTNKNCNGGASGVCLKSVTVI--VGDLEITLQKGGTVLVN 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2450112274  470 ELQVNLPHVTASFSVFRPSSYHIMVSMAIGVRLQVQLAPVMQLFVTLDQASQGQVQGLCGNFNGLEGDDFKTASGL 545
Cdd:pfam00094   79 GQKVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
37-186 8.41e-35

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 132.11  E-value: 8.41e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274   37 CSTWGNFHYKTFDGDVFRFPGLCDYNFASDCrGSYKEFAVHLKRGPGQAEAPAGVESIL-LTIKDDTIYLTRHLAVL-NG 114
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDC-SEEPDFSFSVTNKNCNGGASGVCLKSVtVIVGDLEITLQKGGTVLvNG 79
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2450112274  115 AVVSTPHYSPGLLIEKSDAYTK-VYSRAGLTLMWNRED--ALMLELDTKFRNHTCGLCGDYNGLQSYSEFLSDGV 186
Cdd:pfam00094   80 QKVSLPYKSDGGEVEILGSGFVvVDLSPGVGLQVDGDGrgQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
Mucin2_WxxW pfam13330
Mucin-2 protein WxxW repeating region; This family is a repeating region found in mucins 2 and 5. ...
1305-1388 2.00e-24

Mucin-2 protein WxxW repeating region; This family is a repeating region found in mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well-conserved cysteine residues.


Pssm-ID: 463846 [Multi-domain]  Cd Length: 85  Bit Score: 100.10  E-value: 2.00e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 1305 WSDWINEDHPSsGSDDGDRETFD------GVCGAPEDIECRSVKDPHLSLEQLGQKVQCDVSVGFICKNEDQFGNGpfgl 1378
Cdd:pfam13330    1 WTPWFDVDNPS-GSGGGDFETLEnlraygKFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDG---- 75
                           90
                   ....*....|
gi 2450112274 1379 CYDYKIRVNC 1388
Cdd:pfam13330   76 CLDYEVRFLC 85
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
582-652 2.08e-24

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 99.72  E-value: 2.08e-24
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2450112274   582 NYAEHWCSLLKKTETPFGRCHSAVDPAEYYKRCKYDTCNCQNNEDCLCAALSSYARACTAKGVMLWGWREH 652
Cdd:smart00832    2 YYACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP 72
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1046-1118 6.07e-23

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 95.49  E-value: 6.07e-23
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2450112274  1046 WAEKQCSILKSS--VFSICHSKVDPKPFYEACVHDSCSCdtGGDCECFCSAVASYAQECTKEGACV-FWRTPDLCP 1118
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
Cys_knot pfam00007
Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of ...
5023-5126 2.20e-22

Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerization.


Pssm-ID: 394966  Cd Length: 105  Bit Score: 94.78  E-value: 2.20e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 5023 RVPCSTVPVTTEVSYAGCT--KTVLMNHCSGSCGTFVMYSAKAQALDHSCSCCKEEKTSQREVVLSCPNGGSLTHTYTHI 5100
Cdd:pfam00007    1 RKKCIPTNYTISVEKEGCTscKTINTTICAGYCYTRDPVYKDGRRAVSQRVCTYRDVTYETVVLPGCPPGVDPTVTYPVA 80
                           90       100
                   ....*....|....*....|....*.
gi 2450112274 5101 ESCQCQdtVCGLPTGTSRRARRSPRH 5126
Cdd:pfam00007   81 LSCHCG--NCPTDNSDCTRLSLQPDS 104
Mucin2_WxxW pfam13330
Mucin-2 protein WxxW repeating region; This family is a repeating region found in mucins 2 and 5. ...
1788-1875 5.53e-17

Mucin-2 protein WxxW repeating region; This family is a repeating region found in mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well-conserved cysteine residues.


Pssm-ID: 463846 [Multi-domain]  Cd Length: 85  Bit Score: 78.92  E-value: 5.53e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 1788 WTGWLDSGKPNfHKPGGDTELI------GDVC-GPgwaANISCRATMYPDVPIGQLGQTVVCDVSVGLICKNEDQKPGGv 1860
Cdd:pfam13330    1 WTPWFDVDNPS-GSGGGDFETLenlrayGKFCeNP---TDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDG- 75
                           90
                   ....*....|....*
gi 2450112274 1861 ipmafCLNYEINVQC 1875
Cdd:pfam13330   76 -----CLDYEVRFLC 85
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
225-291 5.49e-16

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 75.11  E-value: 5.49e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  225 CERLLTAEAFADCQDLVPLEPYLRACQQDRCRCPGGDTCVCSTVAEFSRQCSHAGGRPGNWRTATLC 291
Cdd:pfam08742    2 CGLLSDSGPFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
4643-4706 1.13e-14

C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found in proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 71.64  E-value: 1.13e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274 4643 LCQLIKDS-LFAQCHALVPPQHYYDACVFDSCFMPGS-SLECASLQAYAALCAQQNICL-DWRNHTH 4706
Cdd:pfam08742    1 KCGLLSDSgPFAPCHSVVDPEPYFEACVYDMCSCGGDdECLCAALAAYARACQAAGVCIgDWRTPTF 67
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
295-351 1.11e-08

trypsin inhibitor-like cysteine-rich domain; TIL (trypsin inhibitor-like) cysteine-rich domains are found in smapins (small serine proteinase inhibitors) or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 54.25  E-value: 1.11e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  295 CPGNLVYLESGSPCMDTCSHLEVSSLCEEHRMDGCFCPEGTVYDDIGDsgCVPVSQC 351
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAPPPCTKQCVEGCFCPEGYVRNSGGK--CVPPSQC 55
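
The TIL description gives the disulfide linkage as cysteines 1-7, 2-6, 3-5, 4-10 and 8-9, counted in order of appearance. As an illustration only, the sketch below maps that pattern onto residue positions in the `Cdd:cd19941` row of the alignment above, which contains exactly ten cysteines. The function name `disulfide_pairs` is hypothetical, and stripping the `-` gap characters before numbering is an assumption that matches the 1-55 numbering shown for that row.

```python
# Linkage pattern quoted in the TIL description (ordinal cysteine numbers).
TIL_PAIRING = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def disulfide_pairs(sequence: str, pairing=TIL_PAIRING):
    """Map ordinal cysteine pairs to 1-based residue positions."""
    ungapped = sequence.replace("-", "").upper()
    cys_positions = [i + 1 for i, aa in enumerate(ungapped) if aa == "C"]
    needed = max(max(pair) for pair in pairing)
    if len(cys_positions) < needed:
        raise ValueError(
            f"pattern needs {needed} cysteines, found {len(cys_positions)}"
        )
    return [(cys_positions[a - 1], cys_positions[b - 1]) for a, b in pairing]


# Cdd:cd19941 row from the alignment above (model positions 1-55).
consensus = "CPPNEVYSECGSACPPTCANPNAPPPCTKQCVEGCFCPEGYVRNSGGK--CVPPSQC"
print(disulfide_pairs(consensus))
# -> [(1, 35), (10, 31), (14, 27), (18, 55), (37, 49)]
```

The query segment 295-351 has only eight cysteines in this interval, so the same call raises `ValueError` there; the pattern applies to the domain model, not necessarily to every aligned fragment.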
VWC smart00214
von Willebrand factor (vWF) type C domain;
4877-4941 5.02e-05

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 44.04  E-value: 5.02e-05
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  4877 CVHGNAEYQPGSPVYSSKCQDCVCTDKVdnntllnVIACTHVPCN--TSCSPGfELMEAPGECCKKC 4941
Cdd:smart00214    1 CVHNGRVYNDGETWKPDPCQICTCLDGT-------TVLCDPVECPppPDCPNP-ERVKPPGECCPRC 59
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
4713-4752 5.94e-03

trypsin inhibitor-like cysteine-rich domain; TIL (trypsin inhibitor-like) cysteine-rich domains are found in smapins (small serine proteinase inhibitors) or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 38.07  E-value: 5.94e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2450112274 4713 CPSHREYQACGPAEEPTCKSSSSQQN-NTVLVEGCFCPEGT 4752
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAPPPcTKQCVEGCFCPEGY 41
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
661-718 8.98e-03

trypsin inhibitor-like cysteine-rich domain; TIL (trypsin inhibitor-like) cysteine-rich domains are found in smapins (small serine proteinase inhibitors) or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 37.30  E-value: 8.98e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2450112274  661 CPNSQVFLYNLTTCQQTCRSLsEADSHCLEGFapVDGCGCPDHTFLDEKGRCVPLAKC 718
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPCTKQC--VEGCFCPEGYVRNSGGKCVPPSQC 55
 
Name Accession Description Interval E-value
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
849-1008 1.72e-43

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 157.56  E-value: 1.72e-43
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274   849 WVCTQAVCHGTCSIYGSGHYITFDGKYYDFDGHCSYVAVQDycgqNSSLGSFSIITENVPCGtTGVTCSKAIKIFMGRTE 928
Cdd:smart00216    1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGDE 75
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274   929 LKLEDKHRVVIQRDEGHHVAYTTREV-------GQYLVVESSTGII-VIWDKRTTVFIKLAPSYKGTVCGLCGNFDHRSN 1000
Cdd:smart00216   76 IELKDDNGKVTVNGQQVSLPYKTSDGsiqirssGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPE 155

                    ....*...
gi 2450112274  1001 NDFTTRDH 1008
Cdd:smart00216  156 DDFRTPDG 163
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
4422-4589 7.37e-39

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 144.08  E-value: 7.37e-39
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  4422 CCWHWECDCYCTGWGDPHYVTFDGLYYSYQGNCTYVLVEEISPSvDNFGVYIDNYHCDPNdkVSCPRTLIVRHETQEVLI 4501
Cdd:smart00216    2 CCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE-PTFSVLLKNVPCGGG--ATCLKSVKVELNGDEIEL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  4502 KTVhmmpmQVQVQVNRQAVALPYKKYGLEVYQ-SGINYVVDIPELGVL-VSYNGLS-FSVRLPYhRFGNNTKGQCGTCTN 4578
Cdd:smart00216   79 KDD-----NGKVTVNGQQVSLPYKTSDGSIQIrSSGGYLVVITSLGLIqVTFDGLTlLSVQLPS-KYRGKTCGLCGNFDG 152
                           170
                    ....*....|.
gi 2450112274  4579 TTSDDCILPSG 4589
Cdd:smart00216  153 EPEDDFRTPDG 163
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
391-545 3.82e-38

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 141.74  E-value: 3.82e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  391 CALEGGSHITTFDGKTYTFHGDCYYVLAKGDHNDS-YALLGELAPCGSTDKQTCLKTVVLLadKKKNVVVFKSDGSVLLN 469
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPdFSFSVTNKNCNGGASGVCLKSVTVI--VGDLEITLQKGGTVLVN 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2450112274  470 ELQVNLPHVTASFSVFRPSSYHIMVSMAIGVRLQVQLAPVMQLFVTLDQASQGQVQGLCGNFNGLEGDDFKTASGL 545
Cdd:pfam00094   79 GQKVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
380-544 1.55e-37

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 140.23  E-value: 1.55e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274   380 WVCKDLPCPGTCALEGGSHITTFDGKTYTFHGDCYYVLAKGDH-NDSYALLGELAPCGSTdkQTCLKTVVLLaDKKKNVV 458
Cdd:smart00216    1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSsEPTFSVLLKNVPCGGG--ATCLKSVKVE-LNGDEIE 77
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274   459 VFKSDGSVLLNELQVNLPHVTASFSV-FRPSSYHIMVSMAIGVrLQVQLAPVMQLFVTLDQASQGQVQGLCGNFNGLEGD 537
Cdd:smart00216   78 LKDDNGKVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGL-IQVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPED 156

                    ....*..
gi 2450112274   538 DFKTASG 544
Cdd:smart00216  157 DFRTPDG 163
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
4432-4589 3.41e-36

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 136.35  E-value: 3.41e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 4432 CTGWGDPHYVTFDGLYYSYQGNCTYVLVEEISpSVDNFGVYIDNYHCDPNDKVSCPRTLIVRHETQEVLIKtvhmmpMQV 4511
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCS-EEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQ------KGG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 4512 QVQVNRQAVALPYKKYG--LEVYQSGINYVVDIPELGVLVSYNGLSFSVRLPYHRFGNNTKGQCGTCTNTTSDDCILPSG 4589
Cdd:pfam00094   74 TVLVNGQKVSLPYKSDGgeVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
860-1009 3.58e-36

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 135.96  E-value: 3.58e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  860 CSIYGSGHYITFDGKYYDFDGHCSYVAVQDyCGQNSSLgSFSIITENVPCGTTGVtCSKAIKIFMGRTELKLEDKHRV-V 938
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKD-CSEEPDF-SFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTVlV 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  939 IQRDEGHHVA----YTTREVGQYLVVESSTGII--VIWDKRTTVFIKLAPSYKGTVCGLCGNFDHRSNNDFTTRDHM 1009
Cdd:pfam00094   78 NGQKVSLPYKsdggEVEILGSGFVVVDLSPGVGlqVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
VWD pfam00094
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ...
37-186 8.41e-35

von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.


Pssm-ID: 459671 [Multi-domain]  Cd Length: 154  Bit Score: 132.11  E-value: 8.41e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274   37 CSTWGNFHYKTFDGDVFRFPGLCDYNFASDCrGSYKEFAVHLKRGPGQAEAPAGVESIL-LTIKDDTIYLTRHLAVL-NG 114
Cdd:pfam00094    1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDC-SEEPDFSFSVTNKNCNGGASGVCLKSVtVIVGDLEITLQKGGTVLvNG 79
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2450112274  115 AVVSTPHYSPGLLIEKSDAYTK-VYSRAGLTLMWNRED--ALMLELDTKFRNHTCGLCGDYNGLQSYSEFLSDGV 186
Cdd:pfam00094   80 QKVSLPYKSDGGEVEILGSGFVvVDLSPGVGLQVDGDGrgQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
VWD smart00216
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ...
36-185 2.02e-30

von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.


Pssm-ID: 214566 [Multi-domain]  Cd Length: 163  Bit Score: 120.20  E-value: 2.02e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274    36 VCSTWGNFHYKTFDGDVFRFPGLCDYNFASDCrGSYKEFAVHLKRGPGQAEaPAGVESILLTIKDDTIYLT--RHLAVLN 113
Cdd:smart00216   11 TCSVSGDPHYTTFDGVAYTFPGNCYYVLAQDC-SSEPTFSVLLKNVPCGGG-ATCLKSVKVELNGDEIELKddNGKVTVN 88
                            90       100       110       120       130       140       150
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2450112274   114 GAVVSTPHYSPGLLI--EKSDAYTKVYSRAGL-TLMWNREDALMLELDTKFRNHTCGLCGDYNGLQSYSEFLSDG 185
Cdd:smart00216   89 GQQVSLPYKTSDGSIqiRSSGGYLVVITSLGLiQVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
Mucin2_WxxW pfam13330
Mucin-2 protein WxxW repeating region; This family is a repeating region found in mucins 2 and 5. ...
1305-1388 2.00e-24

Mucin-2 protein WxxW repeating region; This family is a repeating region found in mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well-conserved cysteine residues.


Pssm-ID: 463846 [Multi-domain]  Cd Length: 85  Bit Score: 100.10  E-value: 2.00e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 1305 WSDWINEDHPSsGSDDGDRETFD------GVCGAPEDIECRSVKDPHLSLEQLGQKVQCDVSVGFICKNEDQFGNGpfgl 1378
Cdd:pfam13330    1 WTPWFDVDNPS-GSGGGDFETLEnlraygKFCENPTDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDG---- 75
                           90
                   ....*....|
gi 2450112274 1379 CYDYKIRVNC 1388
Cdd:pfam13330   76 CLDYEVRFLC 85
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
582-652 2.08e-24

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 99.72  E-value: 2.08e-24
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2450112274   582 NYAEHWCSLLKKTETPFGRCHSAVDPAEYYKRCKYDTCNCQNNEDCLCAALSSYARACTAKGVMLWGWREH 652
Cdd:smart00832    2 YYACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP 72
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1046-1118 6.07e-23

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 95.49  E-value: 6.07e-23
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2450112274  1046 WAEKQCSILKSS--VFSICHSKVDPKPFYEACVHDSCSCdtGGDCECFCSAVASYAQECTKEGACV-FWRTPDLCP 1118
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
Cys_knot pfam00007
Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of ...
5023-5126 2.20e-22

Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerization.


Pssm-ID: 394966  Cd Length: 105  Bit Score: 94.78  E-value: 2.20e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 5023 RVPCSTVPVTTEVSYAGCT--KTVLMNHCSGSCGTFVMYSAKAQALDHSCSCCKEEKTSQREVVLSCPNGGSLTHTYTHI 5100
Cdd:pfam00007    1 RKKCIPTNYTISVEKEGCTscKTINTTICAGYCYTRDPVYKDGRRAVSQRVCTYRDVTYETVVLPGCPPGVDPTVTYPVA 80
                           90       100
                   ....*....|....*....|....*.
gi 2450112274 5101 ESCQCQdtVCGLPTGTSRRARRSPRH 5126
Cdd:pfam00007   81 LSCHCG--NCPTDNSDCTRLSLQPDS 104
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
5028-5108 1.22e-19

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.


Pssm-ID: 214482  Cd Length: 82  Bit Score: 86.30  E-value: 1.22e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  5028 TVPVTTEVSYAGCTK-TVLMNHCSGSCGTFVMYSAkaQALDHSCSCCKEEKTSQREVVLSCPNGGSLTHTYTHIESCQCQ 5106
Cdd:smart00041    1 KSPVRQTITYNGCTSvTVKNAFCEGKCGSASSYSI--QDVQHSCSCCQPHKTKTRQVRLRCPDGSTVKKTVMHIEECGCE 78

                    ..
gi 2450112274  5107 DT 5108
Cdd:smart00041   79 PN 80
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
1050-1117 5.48e-19

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 83.97  E-value: 5.48e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 1050 QCSILK-SSVFSICHSKVDPKPFYEACVHDSCSCdtGGDCECFCSAVASYAQECTKEGACVF-WRTPDLC 1117
Cdd:pfam08742    1 KCGLLSdSGPFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIGdWRTPTFC 68
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
587-654 1.32e-18

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 82.81  E-value: 1.32e-18
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2450112274  587 WCSLLKKTEtPFGRCHSAVDPAEYYKRCKYDTCNCQNNEDCLCAALSSYARACTAKGVMLWGWREHVC 654
Cdd:pfam08742    1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTF 67
Mucin2_WxxW pfam13330
Mucin-2 protein WxxW repeating region; This family is a repeating region found on mucins 2 and 5. ...
1788-1875 5.53e-17

Mucin-2 protein WxxW repeating region; This family is a repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.


Pssm-ID: 463846 [Multi-domain]  Cd Length: 85  Bit Score: 78.92  E-value: 5.53e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 1788 WTGWLDSGKPNfHKPGGDTELI------GDVC-GPgwaANISCRATMYPDVPIGQLGQTVVCDVSVGLICKNEDQKPGGv 1860
Cdd:pfam13330    1 WTPWFDVDNPS-GSGGGDFETLenlrayGKFCeNP---TDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDG- 75
                           90
                   ....*....|....*
gi 2450112274 1861 ipmafCLNYEINVQC 1875
Cdd:pfam13330   76 -----CLDYEVRFLC 85
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
225-291 5.49e-16

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 75.11  E-value: 5.49e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  225 CERLLTAEAFADCQDLVPLEPYLRACQQDRCRCPGGDTCVCSTVAEFSRQCSHAGGRPGNWRTATLC 291
Cdd:pfam08742    2 CGLLSDSGPFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
4643-4706 1.13e-14

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 71.64  E-value: 1.13e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274 4643 LCQLIKDS-LFAQCHALVPPQHYYDACVFDSCFMPGS-SLECASLQAYAALCAQQNICL-DWRNHTH 4706
Cdd:pfam08742    1 KCGLLSDSgPFAPCHSVVDPEPYFEACVYDMCSCGGDdECLCAALAAYARACQAAGVCIgDWRTPTF 67
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
4644-4705 6.92e-13

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 66.60  E-value: 6.92e-13
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2450112274  4644 CQLIKDSL--FAQCHALVPPQHYYDACVFDSCFMPGSSL-ECASLQAYAALCAQQNICL-DWRNHT 4705
Cdd:smart00832    8 CGILLSPRgpFAACHSVVDPEPFFENCVYDTCACGGDCEcLCDALAAYAAACAEAGVCIsPWRTPT 73
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
234-292 8.27e-11

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 60.82  E-value: 8.27e-11
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 2450112274   234 FADCQDLVPLEPYLRACQQDRCRCPGGDTCVCSTVAEFSRQCSHAGGRPGNWRTATLCP 292
Cdd:smart00832   18 FAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
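
Several hits align to only part of their domain model (here the Cdd row starts at model position 18 of 76). The sketch below, which assumes the row layout used in this listing (label, start, aligned residues, end), estimates how much of a model a hit spans. The names aligned_span and model_coverage are illustrative, and the span is a simple end-minus-start measure that ignores internal gaps.

```python
import re

# Assumed row layout: "<label> <start> <aligned residues> <end>", where the
# label is either "gi <number>" (query) or "Cdd:<accession>" (domain model).
ROW_RE = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def aligned_span(lines, prefix):
    """Return (first, last) coordinate over all rows whose label starts with prefix."""
    starts, ends = [], []
    for line in lines:
        m = ROW_RE.match(line.strip())
        if m and m.group(1).startswith(prefix):
            starts.append(int(m.group(2)))
            ends.append(int(m.group(4)))
    if not starts:
        return None
    return min(starts), max(ends)


def model_coverage(lines, cd_length):
    """Fraction of the model length spanned by the Cdd rows (gaps are ignored)."""
    span = aligned_span(lines, "Cdd:")
    if span is None:
        return None
    start, end = span
    return (end - start + 1) / cd_length


# Alignment rows of the C8 smart00832 hit at residues 234-292 (Cd Length: 76).
block = [
    "gi 2450112274   234 FADCQDLVPLEPYLRACQQDRCRCPGGDTCVCSTVAEFSRQCSHAGGRPGNWRTATLCP 292",
    "Cdd:smart00832   18 FAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76",
]
query_span = aligned_span(block, "gi ")
coverage = model_coverage(block, 76)
```

For this hit the model rows run from 18 to 76, so 59 of the 76 model positions are spanned.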
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
295-351 1.11e-08

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 54.25  E-value: 1.11e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  295 CPGNLVYLESGSPCMDTCSHLEVSSLCEEHRMDGCFCPEGTVYDDIGDsgCVPVSQC 351
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAPPPCTKQCVEGCFCPEGYVRNSGGK--CVPPSQC 55
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
295-351 1.55e-08

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 53.55  E-value: 1.55e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  295 CPGNLVYLESGSPCMDTCSHLEVSSLCEEHRMDGCFCPEGTVYDDIGdsGCVPVSQC 351
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPPDVCPEPCVEGCVCPPGFVRNSGG--KCVPPSDC 55
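
The TIL descriptions give the disulphide pairing as cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. As a rough sketch, the code below lists the cysteine positions in the query row of the hit at 295-351 and applies that ordinal pattern only when enough cysteines are present. It assumes the ordinals refer to the conserved cysteines of a complete domain, in sequence order, with no extra cysteines in between. The function names are illustrative, and the result is not a structural prediction.

```python
def cysteine_positions(aligned_row, start):
    """Residue numbers of cysteines in one aligned query row.

    aligned_row: residues as printed in the alignment; '-' marks a gap and
                 lowercase letters still count as residues.
    start:       residue number of the first residue in the row.
    """
    positions = []
    residue = start
    for ch in aligned_row:
        if ch == "-":  # a gap consumes no query residue
            continue
        if ch.upper() == "C":
            positions.append(residue)
        residue += 1
    return positions


# Pairing pattern quoted in the TIL descriptions (1-based cysteine ordinals).
TIL_PATTERN = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def pair_cysteines(positions, pattern=TIL_PATTERN):
    """Map the ordinal pattern onto positions, or return None if too few cysteines."""
    needed = max(max(pair) for pair in pattern)
    if len(positions) < needed:
        return None
    return [(positions[a - 1], positions[b - 1]) for a, b in pattern]


# Query row of the TIL hit at residues 295-351.
til_row = "CPGNLVYLESGSPCMDTCSHLEVSSLCEEHRMDGCFCPEGTVYDDIGdsGCVPVSQC"
til_cysteines = cysteine_positions(til_row, 295)
til_pairs = pair_cysteines(til_cysteines)
```

The aligned segment contains eight cysteines, fewer than the ten the pattern refers to, so pair_cysteines returns None here instead of guessing a pairing.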
VWC smart00214
von Willebrand factor (vWF) type C domain;
4877-4941 5.02e-05

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 44.04  E-value: 5.02e-05
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  4877 CVHGNAEYQPGSPVYSSKCQDCVCTDKVdnntllnVIACTHVPCN--TSCSPGfELMEAPGECCKKC 4941
Cdd:smart00214    1 CVHNGRVYNDGETWKPDPCQICTCLDGT-------TVLCDPVECPppPDCPNP-ERVKPPGECCPRC 59
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
4713-4752 5.94e-03

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 38.07  E-value: 5.94e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2450112274 4713 CPSHREYQACGPAEEPTCKSSSSQQN-NTVLVEGCFCPEGT 4752
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAPPPcTKQCVEGCFCPEGY 41
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
661-718 8.98e-03

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 37.30  E-value: 8.98e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2450112274  661 CPNSQVFLYNLTTCQQTCRSLsEADSHCLEGFapVDGCGCPDHTFLDEKGRCVPLAKC 718
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPCTKQC--VEGCFCPEGYVRNSGGKCVPPSQC 55
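
Many intervals in this list are reported more than once because models from different source databases (smart, pfam, cd) match the same region. One simple way to reduce the list, sketched below, is a greedy pass that keeps the lowest E-value hit for each overlapping group. This is an illustrative heuristic and not necessarily how CD-Search builds its own concise view. The Hit type and function names are assumptions made for the example.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int
    end: int
    e_value: float


def overlaps(a, b):
    """True if the two residue intervals share at least one position."""
    return a.start <= b.end and b.start <= a.end


def best_non_overlapping(hits):
    """Greedy selection: visit hits by ascending E-value, keep those that
    do not overlap anything already kept, and return them in sequence order."""
    kept = []
    for hit in sorted(hits, key=lambda h: h.e_value):
        if not any(overlaps(hit, k) for k in kept):
            kept.append(hit)
    return sorted(kept, key=lambda h: h.start)


# A few hits copied from the listing above.
hits = [
    Hit("C8", "pfam08742", 225, 291, 5.49e-16),
    Hit("C8", "smart00832", 234, 292, 8.27e-11),
    Hit("TIL", "cd19941", 295, 351, 1.11e-08),
    Hit("TIL", "pfam01826", 295, 351, 1.55e-08),
    Hit("TIL", "cd19941", 661, 718, 8.98e-03),
]
selected = best_non_overlapping(hits)
```

With these five hits the pass keeps pfam08742 at 225-291, cd19941 at 295-351 and cd19941 at 661-718. A stricter variant could also apply an E-value cutoff before selection, since the last hit is only weakly supported.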
 
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
1046-1118 6.07e-23

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 95.49  E-value: 6.07e-23
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2450112274  1046 WAEKQCSILKSS--VFSICHSKVDPKPFYEACVHDSCSCdtGGDCECFCSAVASYAQECTKEGACV-FWRTPDLCP 1118
Cdd:smart00832    3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
Cys_knot pfam00007
Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of ...
5023-5126 2.20e-22

Cystine-knot domain; The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerization.


Pssm-ID: 394966  Cd Length: 105  Bit Score: 94.78  E-value: 2.20e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 5023 RVPCSTVPVTTEVSYAGCT--KTVLMNHCSGSCGTFVMYSAKAQALDHSCSCCKEEKTSQREVVLSCPNGGSLTHTYTHI 5100
Cdd:pfam00007    1 RKKCIPTNYTISVEKEGCTscKTINTTICAGYCYTRDPVYKDGRRAVSQRVCTYRDVTYETVVLPGCPPGVDPTVTYPVA 80
                           90       100
                   ....*....|....*....|....*.
gi 2450112274 5101 ESCQCQdtVCGLPTGTSRRARRSPRH 5126
Cdd:pfam00007   81 LSCHCG--NCPTDNSDCTRLSLQPDS 104
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
5028-5108 1.22e-19

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.


Pssm-ID: 214482  Cd Length: 82  Bit Score: 86.30  E-value: 1.22e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274  5028 TVPVTTEVSYAGCTK-TVLMNHCSGSCGTFVMYSAkaQALDHSCSCCKEEKTSQREVVLSCPNGGSLTHTYTHIESCQCQ 5106
Cdd:smart00041    1 KSPVRQTITYNGCTSvTVKNAFCEGKCGSASSYSI--QDVQHSCSCCQPHKTKTRQVRLRCPDGSTVKKTVMHIEECGCE 78

                    ..
gi 2450112274  5107 DT 5108
Cdd:smart00041   79 PN 80
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
1050-1117 5.48e-19

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 83.97  E-value: 5.48e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 1050 QCSILK-SSVFSICHSKVDPKPFYEACVHDSCSCdtGGDCECFCSAVASYAQECTKEGACVF-WRTPDLC 1117
Cdd:pfam08742    1 KCGLLSdSGPFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIGdWRTPTFC 68
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
587-654 1.32e-18

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 82.81  E-value: 1.32e-18
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2450112274  587 WCSLLKKTEtPFGRCHSAVDPAEYYKRCKYDTCNCQNNEDCLCAALSSYARACTAKGVMLWGWREHVC 654
Cdd:pfam08742    1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTF 67
Mucin2_WxxW pfam13330
Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. ...
1788-1875 5.53e-17

Mucin-2 protein WxxW repeating region; This family is repeating region found on mucins 2 and 5. The function is not known, but the repeat can be present in up to 32 copies, as in Swiss:C3Y5K5, from Branchiostoma floridae. The region carries a highly conserved WxxW sequence motif and also has at least six well conserved cysteine residues.


Pssm-ID: 463846 [Multi-domain]  Cd Length: 85  Bit Score: 78.92  E-value: 5.53e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 1788 WTGWLDSGKPNfHKPGGDTELI------GDVC-GPgwaANISCRATMYPDVPIGQLGQTVVCDVSVGLICKNEDQKPGGv 1860
Cdd:pfam13330    1 WTPWFDVDNPS-GSGGGDFETLenlrayGKFCeNP---TDIECRAEPPTGVPASETGQVVTCDVTTGLVCRNADQQPDG- 75
                           90
                   ....*....|....*
gi 2450112274 1861 ipmafCLNYEINVQC 1875
Cdd:pfam13330   76 -----CLDYEVRFLC 85
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
225-291 5.49e-16

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 75.11  E-value: 5.49e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  225 CERLLTAEAFADCQDLVPLEPYLRACQQDRCRCPGGDTCVCSTVAEFSRQCSHAGGRPGNWRTATLC 291
Cdd:pfam08742    2 CGLLSDSGPFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
C8 pfam08742
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ...
4643-4706 1.13e-14

C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them because of overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.


Pssm-ID: 462584  Cd Length: 68  Bit Score: 71.64  E-value: 1.13e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274 4643 LCQLIKDS-LFAQCHALVPPQHYYDACVFDSCFMPGS-SLECASLQAYAALCAQQNICL-DWRNHTH 4706
Cdd:pfam08742    1 KCGLLSDSgPFAPCHSVVDPEPYFEACVYDMCSCGGDdECLCAALAAYARACQAAGVCIgDWRTPTF 67
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
4644-4705 6.92e-13

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 66.60  E-value: 6.92e-13
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2450112274  4644 CQLIKDSL--FAQCHALVPPQHYYDACVFDSCFMPGSSL-ECASLQAYAALCAQQNICL-DWRNHT 4705
Cdd:smart00832    8 CGILLSPRgpFAACHSVVDPEPFFENCVYDTCACGGDCEcLCDALAAYAAACAEAGVCIsPWRTPT 73
C8 smart00832
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ...
234-292 8.27e-11

This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.


Pssm-ID: 214843  Cd Length: 76  Bit Score: 60.82  E-value: 8.27e-11
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 2450112274   234 FADCQDLVPLEPYLRACQQDRCRCPGGDTCVCSTVAEFSRQCSHAGGRPGNWRTATLCP 292
Cdd:smart00832   18 FAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
295-351 1.11e-08

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 54.25  E-value: 1.11e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  295 CPGNLVYLESGSPCMDTCSHLEVSSLCEEHRMDGCFCPEGTVYDDIGDsgCVPVSQC 351
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAPPPCTKQCVEGCFCPEGYVRNSGGK--CVPPSQC 55
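
The disulfide connectivity quoted in the TIL description (cysteines 1-7, 2-6, 3-5, 4-10 and 8-9) is given as cysteine ordinals, so it can be mapped onto sequence positions once the cysteines are located. The sketch below is illustrative only. It assumes the representative (Cdd) row of the cd19941 alignment above, with gaps removed, carries all ten cysteines of the model. The function name `disulfide_positions` is hypothetical.

```python
# Representative (Cdd) row of the cd19941 alignment above with gaps removed.
# Assumption: this row carries all ten cysteines of the TIL domain model.
TIL_MODEL = "CPPNEVYSECGSACPPTCANPNAPPPCTKQCVEGCFCPEGYVRNSGGKCVPPSQC"

# Disulfide connectivity quoted in the domain description, as cysteine ordinals.
PAIRING = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def disulfide_positions(seq, pairing):
    """Map cysteine ordinals to 1-based sequence positions for each bond."""
    cys = [i + 1 for i, aa in enumerate(seq) if aa == "C"]
    if len(cys) < max(max(pair) for pair in pairing):
        raise ValueError("sequence has fewer cysteines than the pattern needs")
    return [(cys[a - 1], cys[b - 1]) for a, b in pairing]


bonds = disulfide_positions(TIL_MODEL, PAIRING)
for first, second in bonds:
    print(f"Cys{first} - Cys{second}")
```

The positions are coordinates on the 55-residue model row, not on the mucin-2 sequence. The query segment 295-351 shows only eight cysteines in this alignment, so the same function raises `ValueError` for it rather than guessing a pairing.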
TIL pfam01826
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ...
295-351 1.55e-08

Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.


Pssm-ID: 460351  Cd Length: 55  Bit Score: 53.55  E-value: 1.55e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  295 CPGNLVYLESGSPCMDTCSHLEVSSLCEEHRMDGCFCPEGTVYDDIGdsGCVPVSQC 351
Cdd:pfam01826    1 CPANEVYSECGSACPPTCANLSPPDVCPEPCVEGCVCPPGFVRNSGG--KCVPPSDC 55
DAN pfam03045
DAN domain; This domain contains 9 conserved cysteines and is extracellular. Therefore the ...
5026-5106 1.45e-05

DAN domain; This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerberus and Gremlin. The gremlin protein is an antagonist of bone morphogenetic protein signaling. It is postulated that all members of this family antagonize different TGF-beta (pfam00019) ligands. Recent work shows that the DAN protein is not an efficient antagonist of BMP-2/4 class signals; however, DAN was able to interact with GDF-5 in a frog embryo assay, suggesting that DAN may regulate signaling by the GDF-5/6/7 class of BMPs in vivo.


Pssm-ID: 460786  Cd Length: 108  Bit Score: 46.89  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2450112274 5026 CSTVPVTTEVSYAGC-TKTVLMNHCSGSCGTFVMYSAKAQALDH--SCSCCKEEKTSQREVVLSCPNGGSL-THTYTHIE 5101
Cdd:pfam03045   24 CRTQPFTQTITEEGClSRTVQNRFCYGQCNSFYIPNSIGRGKWSfaSCSRCKPSKFTTVTVTLNCPGGPPTrTKRVMRVK 103

                   ....*
gi 2450112274 5102 SCQCQ 5106
Cdd:pfam03045  104 ECKCK 108
VWC smart00214
von Willebrand factor (vWF) type C domain;
4877-4941 5.02e-05

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 44.04  E-value: 5.02e-05
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2450112274  4877 CVHGNAEYQPGSPVYSSKCQDCVCTDKVdnntllnVIACTHVPCN--TSCSPGfELMEAPGECCKKC 4941
Cdd:smart00214    1 CVHNGRVYNDGETWKPDPCQICTCLDGT-------TVLCDPVECPppPDCPNP-ERVKPPGECCPRC 59
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
4713-4752 5.94e-03

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 38.07  E-value: 5.94e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2450112274 4713 CPSHREYQACGPAEEPTCKSSSSQQN-NTVLVEGCFCPEGT 4752
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANPNAPPPcTKQCVEGCFCPEGY 41
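
Comparing the model positions on the Cdd row with the "Cd Length" value shows how much of a domain model an alignment spans, which can help flag matches that cover only part of a model, such as the one above. The following sketch is an illustration, not CDD's own logic. The `Hit` class and its field names are hypothetical, and the numbers are transcribed by hand from three alignments in this listing.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    label: str
    model_start: int   # first model position shown on the Cdd row
    model_end: int     # last model position shown on the Cdd row
    model_length: int  # "Cd Length" from the Pssm-ID line

    def coverage(self) -> float:
        """Fraction of the domain model spanned by the alignment."""
        return (self.model_end - self.model_start + 1) / self.model_length


# Values transcribed by hand from the alignments in this listing.
hits = [
    Hit("cd19941 (4713-4752)", 1, 41, 55),
    Hit("pfam03045 (5026-5106)", 24, 108, 108),
    Hit("pfam13330 (1788-1875)", 1, 85, 85),
]

for hit in hits:
    print(f"{hit.label}: {hit.coverage():.0%} of model")
```

The span is measured from the first to the last aligned model position, so internal gaps are not subtracted. A stricter measure would count aligned columns only.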
TIL cd19941
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ...
661-718 8.98e-03

trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.


Pssm-ID: 410995  Cd Length: 55  Bit Score: 37.30  E-value: 8.98e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2450112274  661 CPNSQVFLYNLTTCQQTCRSLsEADSHCLEGFapVDGCGCPDHTFLDEKGRCVPLAKC 718
Cdd:cd19941      1 CPPNEVYSECGSACPPTCANP-NAPPPCTKQC--VEGCFCPEGYVRNSGGKCVPPSQC 55
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
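
Several hits in this listing cover nearly the same residues under different accessions (for example pfam08742 and smart00832, or cd19941 and pfam01826). A minimal sketch of applying the E-value threshold of 0.01 and then keeping the lowest E-value hit per overlapping region is given below. It is one possible post-processing step and not a description of how CDD builds its own concise display. The function name `best_per_region` is hypothetical, and the tuples are transcribed by hand from the hit list in this section.

```python
# (name, accession, start, end, e_value) transcribed by hand from the hit list.
HITS = [
    ("C8", "pfam08742", 225, 291, 5.49e-16),
    ("C8", "smart00832", 234, 292, 8.27e-11),
    ("TIL", "cd19941", 295, 351, 1.11e-08),
    ("TIL", "pfam01826", 295, 351, 1.55e-08),
    ("TIL", "cd19941", 661, 718, 8.98e-03),
    ("C8", "pfam08742", 4643, 4706, 1.13e-14),
    ("C8", "smart00832", 4644, 4705, 6.92e-13),
    ("TIL", "cd19941", 4713, 4752, 5.94e-03),
]

E_VALUE_THRESHOLD = 0.01  # from the preset options


def best_per_region(hits, threshold):
    """Keep hits at or under the threshold, then keep the lowest E-value
    hit within each run of overlapping intervals."""
    kept = sorted((h for h in hits if h[4] <= threshold), key=lambda h: h[2])
    regions = []  # each entry: [region_end, best_hit]
    for hit in kept:
        if regions and hit[2] <= regions[-1][0]:
            region = regions[-1]
            region[0] = max(region[0], hit[3])
            if hit[4] < region[1][4]:
                region[1] = hit
        else:
            regions.append([hit[3], hit])
    return [best for _, best in regions]


for name, accession, start, end, e_value in best_per_region(HITS, E_VALUE_THRESHOLD):
    print(f"{name:4s} {accession:11s} {start}-{end}  {e_value:.2e}")
```

Whether the threshold comparison should be inclusive is an assumption here. It makes no difference for these hits, since all of them fall below 0.01.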
