NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1958789605|ref|XP_038935132|]
View 

collagen alpha-1(XIV) chain isoform X1 [Rattus norvegicus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
1029-1193 1.37e-88

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


:

Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 285.33  E-value: 1.37e-88
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGNT 1108
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFE-IGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1109 KTGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQDDVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHVF 1188
Cdd:cd01482     80 RTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVF 159

                   ....*
gi 1958789605 1189 FVDDF 1193
Cdd:cd01482    160 NVADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
158-321 1.20e-82

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


:

Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 268.39  E-value: 1.20e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTL 237
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  238 TGLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTHVYN 317
Cdd:cd01482     81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                   ....
gi 1958789605  318 VAEF 321
Cdd:cd01482    161 VADF 164
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
1227-1422 9.13e-52

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


:

Pssm-ID: 214560  Cd Length: 184  Bit Score: 180.63  E-value: 9.13e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1227 GFKMMEMFGLVEKDFSAVEGVSMEPGtfnlFPCYQIHKDALVSQPTKYLHPEGLPADYTMTFLFRIlpdTPQEPFALWEI 1306
Cdd:smart00210    1 GQDLLQVFDLPSLSFAIRQVVGPEPG----SPAYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQ---TPKSRGVLFAI 73
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1307 LNKKSEPLVGIILDNGGKTLTYFSYDYTGDFQTVTFEGpdiRKMFYGSFHKLHVVVSKTLAKVVVDCKEVGQKAINASA- 1385
Cdd:smart00210   74 YDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRN---LPLADGQWHKLALSVSGSSATLYVDCNEIDSRPLDRPGq 150
                           170       180       190
                    ....*....|....*....|....*....|....*...
gi 1958789605  1386 -NITSDGVEVLGRMVRSRGPngnsAPFQLQMFDIVCST 1422
Cdd:smart00210  151 pPIDTDGIEVRGAQAADRKP----FQGDLQQLKIVCDP 184
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
398-763 1.65e-16

Fibronectin type 3 domain [General function prediction only];


:

Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 85.44  E-value: 1.65e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  398 EVVADGRVSSIVLKNLMSSTEYQIAVFAVSAH---TASEGLRGTeTTLALPMA-SDLELYDVTENSMRVKWDAVP--GAT 471
Cdd:COG3401    184 SLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGgesAPSNEVSVT-TPTTPPSApTGLTATADTPGSVTLSWDPVTesDAT 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  472 GYLILYAPlteglAGDEKEMKIGE-THTDIELSGLFPNTEYTVTVYAM--FGEE--ASDPATGQETTLPLTPPRNLRISN 546
Cdd:COG3401    263 GYRVYRSN-----SGDGPFTKVATvTTTSYTDTGLTNGTTYYYRVTAVdaAGNEsaPSNVVSVTTDLTPPAAPSGLTATA 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  547 VGSNSARLTWDPTSGK-ITGYRIVYTSADGTEINEV-EVDPITTFPLKGLTPLTDYSIAIFSIYEEGQSLPLVGEFTTEE 624
Cdd:COG3401    338 VGSSSITLSWTASSDAdVTGYNVYRSTSGGGTYTKIaETVTTTSYTDTGLTPGTTYYYKVTAVDAAGNESAPSEEVSATT 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  625 VPAQQ----YLEIDEVKTDSFRVTWHPLSAEEGQHKLMWIPVYGGKTQEVALKEEQDSYVVEGLDPGTEYEVSLLAVLDD 700
Cdd:COG3401    418 ASAASgeslTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVPFTTTSSTVTATTTDTTTANLSVTTGSLV 497
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958789605  701 GSESEVVTAVGTTLDDFWTEAPTTVEPTSPVTSVLQTGIRNLVVDDEAATSLRVTWDISDSNV 763
Cdd:COG3401    498 GGSGASSVTNSVSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTSASSS 560
fn3 pfam00041
Fibronectin type III domain;
830-906 2.08e-12

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 64.36  E-value: 2.08e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  830 PQNLRVSEEWYNRLRITWDPP---SAPVKGYRIVYKPVSVPGQTLETFVGADVNTIVMTNLLSGMDYNVKIFASQAAGYS 906
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82
fn3 pfam00041
Fibronectin type III domain;
355-433 1.60e-11

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 62.05  E-value: 1.60e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  355 GPPTELITSEVTARSFMVNWTHSP---GKVEKYRVVYYPTRGGKPE-EVVADGRVSSIVLKNLMSSTEYQIAVFAVSAHT 430
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWnEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ...
gi 1958789605  431 ASE 433
Cdd:pfam00041   81 EGP 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
740-824 7.03e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.20  E-value: 7.03e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  740 RNLVVDDEAATSLRVTWD---ISDSNVEHFRVTYLTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEGV 816
Cdd:cd00063      5 TNLRVTDVTSTSVTLSWTppeDDGGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESP 84

                   ....*....
gi 1958789605  817 -SVSAPGKT 824
Cdd:cd00063     85 pSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
32-112 8.71e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.20  E-value: 8.71e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   32 PPTRLRYNVISHDSIQISWKAPRGKFG---GYKLLVAPASGGKTNQMNLQNG-ATKAIIQGLLPEQNYTVQLIAYYKDKE 107
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSWTPPEDDGGpitGYVVEYREKGSGDWKEVEVTPGsETSYTLTGLKPGTEYEFRVRAVNGGGE 82

                   ....*
gi 1958789605  108 SKPAQ 112
Cdd:cd00063     83 SPPSE 87
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1476-1606 9.91e-11

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 66.08  E-value: 9.91e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1476 QQGEPGPKGPEGPRGETGPagpqgppgpqgpsglsiQGMPGMPGDKGDKGDAGLPGPQGVPGGVGSPGRDGSPGQRGFPG 1555
Cdd:NF038329   166 PQGEAGPQGPAGKDGEAGA-----------------KGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAG 228
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958789605 1556 KDGSSGPPGPPGpigipgapgvPGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   229 PAGDGQQGPDGD----------PGPTGEDGPQGPDGPAGKDGPRGDRGEAG 269
FN3 super family cl21522
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
920-995 5.77e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


The actual alignment was detected with superfamily member pfam00041:

Pssm-ID: 473895 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  920 VTDLQANQVEMTSLCARWQI----HRHATAYRIVLESLQDTQA-QESTVGGGVNRHCFYGLQPDSEYKISVYTKLQEIEG 994
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPppdgNGPITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 1958789605  995 P 995
Cdd:pfam00041   83 P 83
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1681-1755 9.79e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


:

Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 35.93  E-value: 9.79e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958789605 1681 AGVPGTPGERGLTGVKGDKGNPGigtqgprgppgpagpsgesrpgspgppgspgprgppgHLGVPGPQGPSGQPG 1755
Cdd:pfam01391   15 PGPPGPPGPPGPPGPPGEPGPPG-------------------------------------PPGPPGPPGPPGAPG 52
 
Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
1029-1193 1.37e-88

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 285.33  E-value: 1.37e-88
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGNT 1108
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFE-IGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1109 KTGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQDDVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHVF 1188
Cdd:cd01482     80 RTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVF 159

                   ....*
gi 1958789605 1189 FVDDF 1193
Cdd:cd01482    160 NVADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
158-321 1.20e-82

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 268.39  E-value: 1.20e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTL 237
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  238 TGLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTHVYN 317
Cdd:cd01482     81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                   ....
gi 1958789605  318 VAEF 321
Cdd:cd01482    161 VADF 164
VWA pfam00092
von Willebrand factor type A domain;
159-329 3.55e-66

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 221.38  E-value: 3.55e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  159 DIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTL- 237
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTTn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  238 TGLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQD-DIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTHVY 316
Cdd:pfam00092   81 TGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHVF 160
                          170
                   ....*....|...
gi 1958789605  317 NVAEFDLMHTVVE 329
Cdd:pfam00092  161 TVSDFEALEDLQD 173
VWA pfam00092
von Willebrand factor type A domain;
1030-1202 1.09e-65

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 220.22  E-value: 1.09e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1030 DLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGNTK 1109
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLD-IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1110 -TGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQD-DVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHV 1187
Cdd:pfam00092   80 nTGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHV 159
                          170
                   ....*....|....*
gi 1958789605 1188 FFVDDFDAFKKIEDE 1202
Cdd:pfam00092  160 FTVSDFEALEDLQDQ 174
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
1227-1422 9.13e-52

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 180.63  E-value: 9.13e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1227 GFKMMEMFGLVEKDFSAVEGVSMEPGtfnlFPCYQIHKDALVSQPTKYLHPEGLPADYTMTFLFRIlpdTPQEPFALWEI 1306
Cdd:smart00210    1 GQDLLQVFDLPSLSFAIRQVVGPEPG----SPAYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQ---TPKSRGVLFAI 73
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1307 LNKKSEPLVGIILDNGGKTLTYFSYDYTGDFQTVTFEGpdiRKMFYGSFHKLHVVVSKTLAKVVVDCKEVGQKAINASA- 1385
Cdd:smart00210   74 YDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRN---LPLADGQWHKLALSVSGSSATLYVDCNEIDSRPLDRPGq 150
                           170       180       190
                    ....*....|....*....|....*....|....*...
gi 1958789605  1386 -NITSDGVEVLGRMVRSRGPngnsAPFQLQMFDIVCST 1422
Cdd:smart00210  151 pPIDTDGIEVRGAQAADRKP----FQGDLQQLKIVCDP 184
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
159-324 5.75e-50

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 174.95  E-value: 5.75e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   159 DIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYK-GGNTL 237
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGTN 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   238 TGLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQD---DIIPPSRNLREAGVELFAIGVKNA-DLSELQEIASEPDST 313
Cdd:smart00327   81 LGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVGNDvDEEELKKLASAPGGV 160
                           170
                    ....*....|.
gi 1958789605   314 HVYNVAEFDLM 324
Cdd:smart00327  161 YVFLPELLDLL 171
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
1030-1196 6.45e-50

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 174.95  E-value: 6.45e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1030 DLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYK-GGNT 1108
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLD-IGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGT 79
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1109 KTGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQD---DVNKISREMQADGYNIFAIGV-ADADYSELVRIGSKPSS 1184
Cdd:smart00327   80 NLGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVgNDVDEEELKKLASAPGG 159
                           170
                    ....*....|..
gi 1958789605  1185 RHVFFVDDFDAF 1196
Cdd:smart00327  160 VYVFLPELLDLL 171
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
398-763 1.65e-16

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 85.44  E-value: 1.65e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  398 EVVADGRVSSIVLKNLMSSTEYQIAVFAVSAH---TASEGLRGTeTTLALPMA-SDLELYDVTENSMRVKWDAVP--GAT 471
Cdd:COG3401    184 SLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGgesAPSNEVSVT-TPTTPPSApTGLTATADTPGSVTLSWDPVTesDAT 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  472 GYLILYAPlteglAGDEKEMKIGE-THTDIELSGLFPNTEYTVTVYAM--FGEE--ASDPATGQETTLPLTPPRNLRISN 546
Cdd:COG3401    263 GYRVYRSN-----SGDGPFTKVATvTTTSYTDTGLTNGTTYYYRVTAVdaAGNEsaPSNVVSVTTDLTPPAAPSGLTATA 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  547 VGSNSARLTWDPTSGK-ITGYRIVYTSADGTEINEV-EVDPITTFPLKGLTPLTDYSIAIFSIYEEGQSLPLVGEFTTEE 624
Cdd:COG3401    338 VGSSSITLSWTASSDAdVTGYNVYRSTSGGGTYTKIaETVTTTSYTDTGLTPGTTYYYKVTAVDAAGNESAPSEEVSATT 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  625 VPAQQ----YLEIDEVKTDSFRVTWHPLSAEEGQHKLMWIPVYGGKTQEVALKEEQDSYVVEGLDPGTEYEVSLLAVLDD 700
Cdd:COG3401    418 ASAASgeslTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVPFTTTSSTVTATTTDTTTANLSVTTGSLV 497
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958789605  701 GSESEVVTAVGTTLDDFWTEAPTTVEPTSPVTSVLQTGIRNLVVDDEAATSLRVTWDISDSNV 763
Cdd:COG3401    498 GGSGASSVTNSVSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTSASSS 560
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
538-622 8.78e-16

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 74.46  E-value: 8.78e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  538 PPRNLRISNVGSNSARLTWDP---TSGKITGYRIVYTSADGTEINEVEVDPI--TTFPLKGLTPLTDYSIAIFSIYEEGQ 612
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSWTPpedDGGPITGYVVEYREKGSGDWKEVEVTPGseTSYTLTGLKPGTEYEFRVRAVNGGGE 82
                           90
                   ....*....|.
gi 1958789605  613 SLPL-VGEFTT 622
Cdd:cd00063     83 SPPSeSVTVTT 93
fn3 pfam00041
Fibronectin type III domain;
538-615 1.15e-13

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 67.83  E-value: 1.15e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  538 PPRNLRISNVGSNSARLTWDP---TSGKITGYRIVYTSADGTEI-NEVEVDP-ITTFPLKGLTPLTDYSIAIFSIYEEGQ 612
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPppdGNGPITGYEVEYRPKNSGEPwNEITVPGtTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 1958789605  613 SLP 615
Cdd:pfam00041   82 GPP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
538-613 1.44e-13

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 67.64  E-value: 1.44e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   538 PPRNLRISNVGSNSARLTWDP-----TSGKITGYRIVYTSADGTEINEVEVDPITTFPLKGLTPLTDYSIAIFSIYEEGQ 612
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPppddgITGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 1958789605   613 S 613
Cdd:smart00060   83 G 83
fn3 pfam00041
Fibronectin type III domain;
830-906 2.08e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 64.36  E-value: 2.08e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  830 PQNLRVSEEWYNRLRITWDPP---SAPVKGYRIVYKPVSVPGQTLETFVGADVNTIVMTNLLSGMDYNVKIFASQAAGYS 906
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82
fn3 pfam00041
Fibronectin type III domain;
355-433 1.60e-11

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 62.05  E-value: 1.60e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  355 GPPTELITSEVTARSFMVNWTHSP---GKVEKYRVVYYPTRGGKPE-EVVADGRVSSIVLKNLMSSTEYQIAVFAVSAHT 430
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWnEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ...
gi 1958789605  431 ASE 433
Cdd:pfam00041   81 EGP 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
828-910 1.65e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 62.13  E-value: 1.65e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  828 SGPQNLRVSEEWYNRLRITWDPPS---APVKGYRIVYKPVSVPGQTLETFVGADVNTIVMTNLLSGMDYNVKIFASQAAG 904
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                   ....*.
gi 1958789605  905 YSDALT 910
Cdd:cd00063     82 ESPPSE 87
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
158-309 3.56e-11

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 65.73  E-value: 3.56e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFN-FRLVRNFLENLVTAFnvgSEKTRIGLAQYSGdpRIEWHLNAFNTKDEVIDAVRSLPYKGGNT 236
Cdd:COG1240     93 RDVVLVVDASGSMAAENrLEAAKGALLDFLDDY---RPRDRVGLVAFGG--EAEVLLPLTRDREALKRALDELPPGGGTP 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  237 LT---GLALNFIfensfkpeAGSRSGVSKIGILITDGKSQDDIIPP---SRNLREAGVELFAIGV--KNADLSELQEIAS 308
Cdd:COG1240    168 LGdalALALELL--------KRADPARRKVIVLLTDGRDNAGRIDPleaAELAAAAGIRIYTIGVgtEAVDEGLLREIAE 239

                   .
gi 1958789605  309 E 309
Cdd:COG1240    240 A 240
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
740-824 7.03e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.20  E-value: 7.03e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  740 RNLVVDDEAATSLRVTWD---ISDSNVEHFRVTYLTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEGV 816
Cdd:cd00063      5 TNLRVTDVTSTSVTLSWTppeDDGGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESP 84

                   ....*....
gi 1958789605  817 -SVSAPGKT 824
Cdd:cd00063     85 pSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
355-433 7.75e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.20  E-value: 7.75e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  355 GPPTELITSEVTARSFMVNWTHSP---GKVEKYRVVYYPTRGGKPEEV-VADGRVSSIVLKNLMSSTEYQIAVFAVSAHT 430
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVeVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                   ...
gi 1958789605  431 ASE 433
Cdd:cd00063     82 ESP 84
fn3 pfam00041
Fibronectin type III domain;
740-815 8.48e-11

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 59.74  E-value: 8.48e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958789605  740 RNLVVDDEAATSLRVTWDIS---DSNVEHFRVTYLTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEG 815
Cdd:pfam00041    4 SNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
32-112 8.71e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.20  E-value: 8.71e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   32 PPTRLRYNVISHDSIQISWKAPRGKFG---GYKLLVAPASGGKTNQMNLQNG-ATKAIIQGLLPEQNYTVQLIAYYKDKE 107
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSWTPPEDDGGpitGYVVEYREKGSGDWKEVEVTPGsETSYTLTGLKPGTEYEFRVRAVNGGGE 82

                   ....*
gi 1958789605  108 SKPAQ 112
Cdd:cd00063     83 SPPSE 87
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1476-1606 9.91e-11

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 66.08  E-value: 9.91e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1476 QQGEPGPKGPEGPRGETGPagpqgppgpqgpsglsiQGMPGMPGDKGDKGDAGLPGPQGVPGGVGSPGRDGSPGQRGFPG 1555
Cdd:NF038329   166 PQGEAGPQGPAGKDGEAGA-----------------KGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAG 228
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958789605 1556 KDGSSGPPGPPGpigipgapgvPGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   229 PAGDGQQGPDGD----------PGPTGEDGPQGPDGPAGKDGPRGDRGEAG 269
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
1029-1199 4.21e-10

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 62.26  E-value: 4.21e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDN-FNKIINFLYSTVGALDKigadGTQVAMVQFTDDPRT--EFKLDayktKETLLDAIRHISYKG 1105
Cdd:COG1240     93 RDVVLVVDASGSMAAENrLEAAKGALLDFLDDYRP----RDRVGLVAFGGEAEVllPLTRD----REALKRALDELPPGG 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1106 GnTKTGKAIKHVRDTLFTADSGTRrgipKVIVVITDGR---SQDDVNKISREMQADGYNIFAIGVADADYSE--LVRIgS 1180
Cdd:COG1240    165 G-TPLGDALALALELLKRADPARR----KVIVLLTDGRdnaGRIDPLEAAELAAAAGIRIYTIGVGTEAVDEglLREI-A 238
                          170
                   ....*....|....*....
gi 1958789605 1181 KPSSRHVFFVDDFDAFKKI 1199
Cdd:COG1240    239 EATGGRYFRADDLSELAAI 257
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
740-815 6.79e-10

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 57.24  E-value: 6.79e-10
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958789605   740 RNLVVDDEAATSLRVTWDI-SDSNVEHFRVTY--LTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEG 815
Cdd:smart00060    5 SNLRVTDVTSTSVTLSWEPpPDDGITGYIVGYrvEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
fn3 pfam00041
Fibronectin type III domain;
32-110 2.37e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 55.88  E-value: 2.37e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   32 PPTRLRYNVISHDSIQISWKAPR---GKFGGYKLLVAPA-SGGKTNQMNLQNGATKAIIQGLLPEQNYTVQLIAYYKDKE 107
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 1958789605  108 SKP 110
Cdd:pfam00041   82 GPP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
828-906 2.55e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 52.62  E-value: 2.55e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   828 SGPQNLRVSEEWYNRLRITWDPPS-----APVKGYRIVYKPVSVPGQTLEtfVGADVNTIVMTNLLSGMDYNVKIFASQA 902
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREEGSEWKEVN--VTPSSTSYTLTGLKPGTEYEFRVRAVNG 79

                    ....
gi 1958789605   903 AGYS 906
Cdd:smart00060   80 AGEG 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1476-1606 3.93e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 57.99  E-value: 3.93e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1476 QQGEPGPKGPEGPRGEtgpagpqgppgpqgpsglsiQGMPGMPGDKGDKGDAglpgpqgvpggvgspgrdGSPGQRGFPG 1555
Cdd:NF038329   118 EKGEPGPAGPAGPAGE--------------------QGPRGDRGETGPAGPA------------------GPPGPQGERG 159
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958789605 1556 KDGSSGPPGPpgpigipgapgvPGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   160 EKGPAGPQGE------------AGPQGPAGKDGEAGAKGPAGEKGPQGPRG 198
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
355-432 4.90e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.85  E-value: 4.90e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   355 GPPTELITSEVTARSFMVNWTHSPGKVEKYRVVYY----PTRGGKPEEVVADGRVSSIVLKNLMSSTEYQIAVFAVSAHT 430
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDGITGYIVGYrveyREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 1958789605   431 AS 432
Cdd:smart00060   82 EG 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1477-1606 1.11e-07

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 56.45  E-value: 1.11e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1477 QGEPGPKGPEGPRGETGPAGPqgppgpqgpsglsiQGMPGMPGDKGDKGDAGLPGPQGVPGgvgspgrDGSPGQRGFPGK 1556
Cdd:NF038329   185 KGPAGEKGPQGPRGETGPAGE--------------QGPAGPAGPDGEAGPAGEDGPAGPAG-------DGQQGPDGDPGP 243
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1557 DGSSGPPGPPgpigipgapgvpGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   244 TGEDGPQGPD------------GPAGKDGPRGDRGEAGPDGPDGKDGERG 281
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
32-108 1.08e-06

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 47.99  E-value: 1.08e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605    32 PPTRLRYNVISHDSIQISWKAP-RGKFGGYKL---LVAPASGGKTNQMNLQNGATKAIIQGLLPEQNYTVQLIAYYKDKE 107
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPpDDGITGYIVgyrVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 1958789605   108 S 108
Cdd:smart00060   83 G 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1476-1606 2.72e-06

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 51.83  E-value: 2.72e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1476 QQGEPGPKGPEGPRGETGPAGPQGPPGPQGPSGLSIQGMPGMPGDKGDKGDAGLpgpqgvpggvgspgrDGSPGQRGFPG 1555
Cdd:NF038329   193 PQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGE---------------DGPQGPDGPAG 257
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958789605 1556 KDGSSGPPGPpgpigipgapgvPGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   258 KDGPRGDRGE------------AGPDGPDGKDGERGPVGPAGKDGQNGKDG 296
PTZ00441 PTZ00441
sporozoite surface protein 2 (SSP2); Provisional
1021-1194 1.49e-05

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 49.96  E-value: 1.49e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1021 KEVCKAaKADLVFMVDGSWSIGDDNF-NKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKL--DAYKTKETLLDA 1097
Cdd:PTZ00441    36 EEVCNE-EVDLYLLVDGSGSIGYHNWiTHVIPMLMGLIQQLN-LSDDAINLYMSLFSNNTTELIRLgsGASKDKEQALII 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1098 IRHISyKG----GNTKTGKAIKHVRDTLftADSGTRRGIPKVIVVITDG--RSQDDVNKISREMQADGYNIFAIGVA--- 1168
Cdd:PTZ00441   114 VKSLR-KTylpyGKTNMTDALLEVRKHL--NDRVNRENAIQLVILMTDGipNSKYRALEESRKLKDRNVKLAVIGIGqgi 190
                          170       180
                   ....*....|....*....|....*.
gi 1958789605 1169 DADYSELVrIGSKPSSRHVFFVDDFD 1194
Cdd:PTZ00441   191 NHQFNRLL-AGCRPREGKCKFYSDAD 215
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1478-1602 2.20e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 2.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1478 GEPGPKGPEGPRGEtgpagpqgppgpqgpsglsiqgmPGMPGDKGDKGDaglpgpqgvpggvgspgrdgsPGQRGFPgkd 1557
Cdd:pfam01391    1 GPPGPPGPPGPPGP-----------------------PGPPGPPGPPGP---------------------PGPPGEP--- 33
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1958789605 1558 gssgppgppgpigipgapgvpGIAGSMGPQGALGPPGVPGAKGER 1602
Cdd:pfam01391   34 ---------------------GPPGPPGPPGPPGPPGAPGAPGPP 57
fn3 pfam00041
Fibronectin type III domain;
920-995 5.77e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  920 VTDLQANQVEMTSLCARWQI----HRHATAYRIVLESLQDTQA-QESTVGGGVNRHCFYGLQPDSEYKISVYTKLQEIEG 994
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPppdgNGPITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 1958789605  995 P 995
Cdd:pfam00041   83 P 83
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1681-1755 9.79e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 35.93  E-value: 9.79e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958789605 1681 AGVPGTPGERGLTGVKGDKGNPGigtqgprgppgpagpsgesrpgspgppgspgprgppgHLGVPGPQGPSGQPG 1755
Cdd:pfam01391   15 PGPPGPPGPPGPPGPPGEPGPPG-------------------------------------PPGPPGPPGPPGAPG 52
 
Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
1029-1193 1.37e-88

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 285.33  E-value: 1.37e-88
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGNT 1108
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFE-IGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1109 KTGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQDDVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHVF 1188
Cdd:cd01482     80 RTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVF 159

                   ....*
gi 1958789605 1189 FVDDF 1193
Cdd:cd01482    160 NVADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
158-321 1.20e-82

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 268.39  E-value: 1.20e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTL 237
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  238 TGLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTHVYN 317
Cdd:cd01482     81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                   ....
gi 1958789605  318 VAEF 321
Cdd:cd01482    161 VADF 164
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
158-321 4.03e-75

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 246.76  E-value: 4.03e-75
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTL 237
Cdd:cd01472      1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIGGGTN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  238 TGLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTHVYN 317
Cdd:cd01472     81 TGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIASDPKELYVFN 160

                   ....
gi 1958789605  318 VAEF 321
Cdd:cd01472    161 VADF 164
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
1029-1193 8.92e-75

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 245.60  E-value: 8.92e-75
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGNT 1108
Cdd:cd01472      1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVERLD-IGPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIGGGT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1109 KTGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQDDVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHVF 1188
Cdd:cd01472     80 NTGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIASDPKELYVF 159

                   ....*
gi 1958789605 1189 FVDDF 1193
Cdd:cd01472    160 NVADF 164
VWA pfam00092
von Willebrand factor type A domain;
159-329 3.55e-66

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 221.38  E-value: 3.55e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  159 DIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTL- 237
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTTn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  238 TGLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQD-DIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTHVY 316
Cdd:pfam00092   81 TGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHVF 160
                          170
                   ....*....|...
gi 1958789605  317 NVAEFDLMHTVVE 329
Cdd:pfam00092  161 TVSDFEALEDLQD 173
VWA pfam00092
von Willebrand factor type A domain;
1030-1202 1.09e-65

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 220.22  E-value: 1.09e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1030 DLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGNTK 1109
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLD-IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1110 -TGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQD-DVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSRHV 1187
Cdd:pfam00092   80 nTGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHV 159
                          170
                   ....*....|....*
gi 1958789605 1188 FFVDDFDAFKKIEDE 1202
Cdd:pfam00092  160 FTVSDFEALEDLQDQ 174
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
158-316 4.74e-54

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 186.34  E-value: 4.74e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGN-T 236
Cdd:cd01450      1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGGgT 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  237 LTGLALNFIFENSFKPeAGSRSGVSKIGILITDGKSQD--DIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTH 314
Cdd:cd01450     81 NTGKALQYALEQLFSE-SNARENVPKVIIVLTDGRSDDggDPKEAAAKLKDEGIKVFVVGVGPADEEELREIASCPSERH 159

                   ..
gi 1958789605  315 VY 316
Cdd:cd01450    160 VF 161
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
1029-1188 1.74e-53

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 184.80  E-value: 1.74e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGN- 1107
Cdd:cd01450      1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLD-IGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGGg 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1108 TKTGKAIKHVRDTLFTaDSGTRRGIPKVIVVITDGRSQD--DVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSR 1185
Cdd:cd01450     80 TNTGKALQYALEQLFS-ESNARENVPKVIIVLTDGRSDDggDPKEAAAKLKDEGIKVFVVGVGPADEEELREIASCPSER 158

                   ...
gi 1958789605 1186 HVF 1188
Cdd:cd01450    159 HVF 161
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
1227-1422 9.13e-52

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 180.63  E-value: 9.13e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1227 GFKMMEMFGLVEKDFSAVEGVSMEPGtfnlFPCYQIHKDALVSQPTKYLHPEGLPADYTMTFLFRIlpdTPQEPFALWEI 1306
Cdd:smart00210    1 GQDLLQVFDLPSLSFAIRQVVGPEPG----SPAYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQ---TPKSRGVLFAI 73
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1307 LNKKSEPLVGIILDNGGKTLTYFSYDYTGDFQTVTFEGpdiRKMFYGSFHKLHVVVSKTLAKVVVDCKEVGQKAINASA- 1385
Cdd:smart00210   74 YDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSFRN---LPLADGQWHKLALSVSGSSATLYVDCNEIDSRPLDRPGq 150
                           170       180       190
                    ....*....|....*....|....*....|....*...
gi 1958789605  1386 -NITSDGVEVLGRMVRSRGPngnsAPFQLQMFDIVCST 1422
Cdd:smart00210  151 pPIDTDGIEVRGAQAADRKP----FQGDLQQLKIVCDP 184
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
158-321 2.45e-51

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 178.67  E-value: 2.45e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTL 237
Cdd:cd01481      1 KDIVFLIDGSDNVGSGNFPAIRDFIERIVQSLDVGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRGGSQL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  238 -TGLALNFIFENSFKPEAGSR--SGVSKIGILITDGKSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDstH 314
Cdd:cd01481     81 nTGSALDYVVKNLFTKSAGSRieEGVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGARNADLAELQQIAFDPS--F 158

                   ....*..
gi 1958789605  315 VYNVAEF 321
Cdd:cd01481    159 VFQVSDF 165
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
158-336 2.42e-50

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 177.96  E-value: 2.42e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTL 237
Cdd:cd01475      3 TDLVFLIDSSRSVRPENFELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLETGTM 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  238 TGLALNFIFENSFKPEAGSRSG---VSKIGILITDGKSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIASEPDSTH 314
Cdd:cd01475     83 TGLAIQYAMNNAFSEAEGARPGserVPRVGIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRADEEELREIASEPLADH 162
                          170       180
                   ....*....|....*....|..
gi 1958789605  315 VYNVAEFDLMHTVVESLTRTVC 336
Cdd:cd01475    163 VFYVEDFSTIEELTKKFQGKIC 184
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
159-324 5.75e-50

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 174.95  E-value: 5.75e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   159 DIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYK-GGNTL 237
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGTN 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   238 TGLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQD---DIIPPSRNLREAGVELFAIGVKNA-DLSELQEIASEPDST 313
Cdd:smart00327   81 LGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVGNDvDEEELKKLASAPGGV 160
                           170
                    ....*....|.
gi 1958789605   314 HVYNVAEFDLM 324
Cdd:smart00327  161 YVFLPELLDLL 171
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
1030-1196 6.45e-50

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 174.95  E-value: 6.45e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1030 DLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYK-GGNT 1108
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLD-IGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGT 79
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  1109 KTGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQD---DVNKISREMQADGYNIFAIGV-ADADYSELVRIGSKPSS 1184
Cdd:smart00327   80 NLGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVgNDVDEEELKKLASAPGG 159
                           170
                    ....*....|..
gi 1958789605  1185 RHVFFVDDFDAF 1196
Cdd:smart00327  160 VYVFLPELLDLL 171
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
1029-1212 2.62e-47

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 169.49  E-value: 2.62e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGNT 1108
Cdd:cd01475      3 TDLVFLIDSSRSVRPENFELVKQFLNQIIDSLD-VGPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLETGT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1109 KTGKAIKHVRDTLFTADSGTRRG---IPKVIVVITDGRSQDDVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSSR 1185
Cdd:cd01475     82 MTGLAIQYAMNNAFSEAEGARPGserVPRVGIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRADEEELREIASEPLAD 161
                          170       180
                   ....*....|....*....|....*..
gi 1958789605 1186 HVFFVDDFDAFKKIEDELITFVCETAS 1212
Cdd:cd01475    162 HVFYVEDFSTIEELTKKFQGKICVVPD 188
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
1030-1199 1.29e-43

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 157.13  E-value: 1.29e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1030 DLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGNTK 1109
Cdd:cd01469      2 DIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLD-IGPTKTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQLLGLTN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1110 TGKAIKHVRDTLFTADSGTRRGIPKVIVVITDGRSQDDVNKISREMQADGYNI--FAIGVADA-----DYSELVRIGSKP 1182
Cdd:cd01469     81 TATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDPLLKDVIPQAEREGIirYAIGVGGHfqrenSREELKTIASKP 160
                          170
                   ....*....|....*..
gi 1958789605 1183 SSRHVFFVDDFDAFKKI 1199
Cdd:cd01469    161 PEEHFFNVTDFAALKDI 177
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
1029-1193 2.33e-43

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 155.94  E-value: 2.33e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGN- 1107
Cdd:cd01481      1 KDIVFLIDGSDNVGSGNFPAIRDFIERIVQSLD-VGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRGGSq 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1108 TKTGKAIKHVRDTLFTADSGTR--RGIPKVIVVITDGRSQDDVNKISREMQADGYNIFAIGVADADYSELVRIGSKPSsr 1185
Cdd:cd01481     80 LNTGSALDYVVKNLFTKSAGSRieEGVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGARNADLAELQQIAFDPS-- 157

                   ....*...
gi 1958789605 1186 HVFFVDDF 1193
Cdd:cd01481    158 FVFQVSDF 165
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
159-322 2.68e-37

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 139.03  E-value: 2.68e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  159 DIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLT 238
Cdd:cd01469      2 DIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQLLGLTNT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  239 GLALNFIFENSFKPEAGSRSGVSKIGILITDGKSQDD-----IIPPSrnlREAGVELFAIGV-----KNADLSELQEIAS 308
Cdd:cd01469     82 ATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDpllkdVIPQA---EREGIIRYAIGVgghfqRENSREELKTIAS 158
                          170
                   ....*....|....
gi 1958789605  309 EPDSTHVYNVAEFD 322
Cdd:cd01469    159 KPPEEHFFNVTDFA 172
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
158-316 9.94e-33

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 125.37  E-value: 9.94e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYK-GGNT 236
Cdd:cd00198      1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKGlGGGT 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  237 LTGLALNFIFENSFKPEagsRSGVSKIGILITDGKSQDDIIPP---SRNLREAGVELFAIGVKN-ADLSELQEIASEPDS 312
Cdd:cd00198     81 NIGAALRLALELLKSAK---RPNARRVIILLTDGEPNDGPELLaeaARELRKLGITVYTIGIGDdANEDELKEIADKTTG 157

                   ....
gi 1958789605  313 THVY 316
Cdd:cd00198    158 GAVF 161
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
1029-1188 1.29e-29

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 116.13  E-value: 1.29e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYK-GGN 1107
Cdd:cd00198      1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLS-ASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKGlGGG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1108 TKTGKAIKHVRDTLFtadSGTRRGIPKVIVVITDGRSQDD---VNKISREMQADGYNIFAIGV-ADADYSELVRIGSKPS 1183
Cdd:cd00198     80 TNIGAALRLALELLK---SAKRPNARRVIILLTDGEPNDGpelLAEAARELRKLGITVYTIGIgDDANEDELKEIADKTT 156

                   ....*
gi 1958789605 1184 SRHVF 1188
Cdd:cd00198    157 GGAVF 161
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
1029-1189 2.03e-26

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 107.10  E-value: 2.03e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIgDDNFNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRT--EFKLDAYKTKETLLDAIRHISYKGG 1106
Cdd:cd01476      1 LDLLFVLDSSGSV-RGKFEKYKKYIERIVEGLE-IGPTATRVALITYSGRGRQrvRFNLPKHNDGEELLEKVDNLRFIGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1107 NTKTGKAIKhVRDTLFTADSGTRRGIPKVIVVITDGRSQDDVNKISREMQAD-GYNIFAIGVAD---ADYSELVRIGSKP 1182
Cdd:cd01476     79 TTATGAAIE-VALQQLDPSEGRREGIPKVVVVLTDGRSHDDPEKQARILRAVpNIETFAVGTGDpgtVDTEELHSITGNE 157

                   ....*..
gi 1958789605 1183 ssRHVFF 1189
Cdd:cd01476    158 --DHIFT 162
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
158-313 1.10e-24

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 102.09  E-value: 1.10e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIgRFNFRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPR--IEWHLNAFNTKDEVIDAVRSLPYKGGN 235
Cdd:cd01476      1 LDLLFVLDSSGSV-RGKFEKYKKYIERIVEGLEIGPTATRVALITYSGRGRqrVRFNLPKHNDGEELLEKVDNLRFIGGT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  236 TLTGLALNFIFeNSFKPEAGSRSGVSKIGILITDGKSQDDIIPPSRNLRE-AGVELFAIGVKN---ADLSELQEIASEPD 311
Cdd:cd01476     80 TATGAAIEVAL-QQLDPSEGRREGIPKVVVVLTDGRSHDDPEKQARILRAvPNIETFAVGTGDpgtVDTEELHSITGNED 158

                   ..
gi 1958789605  312 ST 313
Cdd:cd01476    159 HI 160
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
159-295 5.54e-21

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 92.45  E-value: 5.54e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  159 DIVILVDGSWSIGRFN-FRLVRNFLENLVTAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDE-----VIDAVRSLPYK 232
Cdd:cd01471      2 DLYLLVDGSGSIGYSNwVTHVVPFLHTFVQNLNISPDEINLYLVTFSTNAKELIRLSSPNSTNKdlalnAIRALLSLYYP 81
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958789605  233 GGNTLTGLALNFIFENSFKPeAGSRSGVSKIGILITDGKSQDD--IIPPSRNLREAGVELFAIGV 295
Cdd:cd01471     82 NGSTNTTSALLVVEKHLFDT-RGNRENAPQLVIIMTDGIPDSKfrTLKEARKLRERGVIIAVLGV 145
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
398-763 1.65e-16

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 85.44  E-value: 1.65e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  398 EVVADGRVSSIVLKNLMSSTEYQIAVFAVSAH---TASEGLRGTeTTLALPMA-SDLELYDVTENSMRVKWDAVP--GAT 471
Cdd:COG3401    184 SLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGgesAPSNEVSVT-TPTTPPSApTGLTATADTPGSVTLSWDPVTesDAT 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  472 GYLILYAPlteglAGDEKEMKIGE-THTDIELSGLFPNTEYTVTVYAM--FGEE--ASDPATGQETTLPLTPPRNLRISN 546
Cdd:COG3401    263 GYRVYRSN-----SGDGPFTKVATvTTTSYTDTGLTNGTTYYYRVTAVdaAGNEsaPSNVVSVTTDLTPPAAPSGLTATA 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  547 VGSNSARLTWDPTSGK-ITGYRIVYTSADGTEINEV-EVDPITTFPLKGLTPLTDYSIAIFSIYEEGQSLPLVGEFTTEE 624
Cdd:COG3401    338 VGSSSITLSWTASSDAdVTGYNVYRSTSGGGTYTKIaETVTTTSYTDTGLTPGTTYYYKVTAVDAAGNESAPSEEVSATT 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  625 VPAQQ----YLEIDEVKTDSFRVTWHPLSAEEGQHKLMWIPVYGGKTQEVALKEEQDSYVVEGLDPGTEYEVSLLAVLDD 700
Cdd:COG3401    418 ASAASgeslTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVPFTTTSSTVTATTTDTTTANLSVTTGSLV 497
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958789605  701 GSESEVVTAVGTTLDDFWTEAPTTVEPTSPVTSVLQTGIRNLVVDDEAATSLRVTWDISDSNV 763
Cdd:COG3401    498 GGSGASSVTNSVSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTSASSS 560
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
1030-1167 1.69e-16

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 79.35  E-value: 1.69e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1030 DLVFMVDGSWSIGDDN-FNKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKLDAYKT--KETLLDAIRH---ISY 1103
Cdd:cd01471      2 DLYLLVDGSGSIGYSNwVTHVVPFLHTFVQNLN-ISPDEINLYLVTFSTNAKELIRLSSPNStnKDLALNAIRAllsLYY 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958789605 1104 KGGNTKTGKAIKHVRDTLFTAdSGTRRGIPKVIVVITDGRSQDDVN--KISREMQADGYNIFAIGV 1167
Cdd:cd01471     81 PNGSTNTTSALLVVEKHLFDT-RGNRENAPQLVIIMTDGIPDSKFRtlKEARKLRERGVIIAVLGV 145
vWA_ATR cd01474
ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, ...
1029-1208 8.05e-16

ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.


Pssm-ID: 238751 [Multi-domain]  Cd Length: 185  Bit Score: 77.55  E-value: 8.05e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDdNFNKIINFLYSTVgalDKIGADGTQVAMVQFTDDPRTEFKLDAYKTKETL-LDAIRHISyKGGN 1107
Cdd:cd01474      5 FDLYFVLDKSGSVAA-NWIEIYDFVEQLV---DRFNSPGLRFSFITFSTRATKILPLTDDSSAIIKgLEVLKKVT-PSGQ 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1108 TKTGKAIKHVRDTLFTADSGTRRgIPKVIVVITDGRSQDDV-------NKISREMqadGYNIFAIGVADADYSELVRIGS 1180
Cdd:cd01474     80 TYIHEGLENANEQIFNRNGGGRE-TVSVIIALTDGQLLLNGhkypeheAKLSRKL---GAIVYCVGVTDFLKSQLINIAD 155
                          170       180
                   ....*....|....*....|....*....
gi 1958789605 1181 kpSSRHVFFVDD-FDAFKKIEDELITFVC 1208
Cdd:cd01474    156 --SKEYVFPVTSgFQALSGIIESVVKKAC 182
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
538-622 8.78e-16

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 74.46  E-value: 8.78e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  538 PPRNLRISNVGSNSARLTWDP---TSGKITGYRIVYTSADGTEINEVEVDPI--TTFPLKGLTPLTDYSIAIFSIYEEGQ 612
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSWTPpedDGGPITGYVVEYREKGSGDWKEVEVTPGseTSYTLTGLKPGTEYEFRVRAVNGGGE 82
                           90
                   ....*....|.
gi 1958789605  613 SLPL-VGEFTT 622
Cdd:cd00063     83 SPPSeSVTVTT 93
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
1029-1192 1.81e-15

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 76.27  E-value: 1.81e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDNFN-------KIINFLYSTvgALDKIGADGTQVAMVQFTDDPRTEF-KLDAYKTKETLLDAIRH 1100
Cdd:cd01480      3 VDITFVLDSSESVGLQNFDitknfvkRVAERFLKD--YYRKDPAGSWRVGVVQYSDQQEVEAgFLRDIRNYTSLKEAVDN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1101 ISYKGGNTKTGKAIKHVRDTLFTADSGTRRgipKVIVVITDGRSQ---DDVNKIS-REMQADGYNIFAIGVADADYSELV 1176
Cdd:cd01480     81 LEYIGGGTFTDCALKYATEQLLEGSHQKEN---KFLLVITDGHSDgspDGGIEKAvNEADHLGIKIFFVAVGSQNEEPLS 157
                          170       180
                   ....*....|....*....|....*.
gi 1958789605 1177 RIGSKPSSRHV----------FFVDD 1192
Cdd:cd01480    158 RIACDGKSALYrenfaellwsFFIDD 183
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
158-320 3.04e-15

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 75.88  E-value: 3.04e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFNFRLVRNF----LENLVTAFNV--GSEKTRIGLAQYSGDPRIEWHLNAF-NTKDEVIDAVRSLP 230
Cdd:cd01480      3 VDITFVLDSSESVGLQNFDITKNFvkrvAERFLKDYYRkdPAGSWRVGVVQYSDQQEVEAGFLRDiRNYTSLKEAVDNLE 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  231 YKGGNTLTGLALNFIFENSFKpeaGSRSGVSKIGILITDGKSqdDIIPPSRNLR------EAGVELFAIGVkNADLSE-L 303
Cdd:cd01480     83 YIGGGTFTDCALKYATEQLLE---GSHQKENKFLLVITDGHS--DGSPDGGIEKavneadHLGIKIFFVAV-GSQNEEpL 156
                          170
                   ....*....|....*...
gi 1958789605  304 QEIASEPDSTHV-YNVAE 320
Cdd:cd01480    157 SRIACDGKSALYrENFAE 174
fn3 pfam00041
Fibronectin type III domain;
538-615 1.15e-13

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 67.83  E-value: 1.15e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  538 PPRNLRISNVGSNSARLTWDP---TSGKITGYRIVYTSADGTEI-NEVEVDP-ITTFPLKGLTPLTDYSIAIFSIYEEGQ 612
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPppdGNGPITGYEVEYRPKNSGEPwNEITVPGtTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 1958789605  613 SLP 615
Cdd:pfam00041   82 GPP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
538-613 1.44e-13

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 67.64  E-value: 1.44e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   538 PPRNLRISNVGSNSARLTWDP-----TSGKITGYRIVYTSADGTEINEVEVDPITTFPLKGLTPLTDYSIAIFSIYEEGQ 612
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPppddgITGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 1958789605   613 S 613
Cdd:smart00060   83 G 83
fn3 pfam00041
Fibronectin type III domain;
830-906 2.08e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 64.36  E-value: 2.08e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  830 PQNLRVSEEWYNRLRITWDPP---SAPVKGYRIVYKPVSVPGQTLETFVGADVNTIVMTNLLSGMDYNVKIFASQAAGYS 906
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82
VWA_2 pfam13519
von Willebrand factor type A domain;
1031-1139 4.58e-12

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 63.85  E-value: 4.58e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1031 LVFMVDGSWSIGDD-----NFNKIINFLYSTVGALDkigadGTQVAMVQFTDDPRTEFKLDayKTKETLLDAIRHISYKG 1105
Cdd:pfam13519    1 LVFVLDTSGSMRNGdygptRLEAAKDAVLALLKSLP-----GDRVGLVTFGDGPEVLIPLT--KDRAKILRALRRLEPKG 73
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1958789605 1106 GNTKTGKAIKHVRDTLFTAdsgtRRGIPKVIVVI 1139
Cdd:pfam13519   74 GGTNLAAALQLARAALKHR----RKNQPRRIVLI 103
fn3 pfam00041
Fibronectin type III domain;
355-433 1.60e-11

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 62.05  E-value: 1.60e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  355 GPPTELITSEVTARSFMVNWTHSP---GKVEKYRVVYYPTRGGKPE-EVVADGRVSSIVLKNLMSSTEYQIAVFAVSAHT 430
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWnEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ...
gi 1958789605  431 ASE 433
Cdd:pfam00041   81 EGP 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
828-910 1.65e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 62.13  E-value: 1.65e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  828 SGPQNLRVSEEWYNRLRITWDPPS---APVKGYRIVYKPVSVPGQTLETFVGADVNTIVMTNLLSGMDYNVKIFASQAAG 904
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                   ....*.
gi 1958789605  905 YSDALT 910
Cdd:cd00063     82 ESPPSE 87
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
158-309 3.56e-11

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 65.73  E-value: 3.56e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  158 ADIVILVDGSWSIGRFN-FRLVRNFLENLVTAFnvgSEKTRIGLAQYSGdpRIEWHLNAFNTKDEVIDAVRSLPYKGGNT 236
Cdd:COG1240     93 RDVVLVVDASGSMAAENrLEAAKGALLDFLDDY---RPRDRVGLVAFGG--EAEVLLPLTRDREALKRALDELPPGGGTP 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  237 LT---GLALNFIfensfkpeAGSRSGVSKIGILITDGKSQDDIIPP---SRNLREAGVELFAIGV--KNADLSELQEIAS 308
Cdd:COG1240    168 LGdalALALELL--------KRADPARRKVIVLLTDGRDNAGRIDPleaAELAAAAGIRIYTIGVgtEAVDEGLLREIAE 239

                   .
gi 1958789605  309 E 309
Cdd:COG1240    240 A 240
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
740-824 7.03e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.20  E-value: 7.03e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  740 RNLVVDDEAATSLRVTWD---ISDSNVEHFRVTYLTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEGV 816
Cdd:cd00063      5 TNLRVTDVTSTSVTLSWTppeDDGGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESP 84

                   ....*....
gi 1958789605  817 -SVSAPGKT 824
Cdd:cd00063     85 pSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
355-433 7.75e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.20  E-value: 7.75e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  355 GPPTELITSEVTARSFMVNWTHSP---GKVEKYRVVYYPTRGGKPEEV-VADGRVSSIVLKNLMSSTEYQIAVFAVSAHT 430
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVeVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                   ...
gi 1958789605  431 ASE 433
Cdd:cd00063     82 ESP 84
fn3 pfam00041
Fibronectin type III domain;
740-815 8.48e-11

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 59.74  E-value: 8.48e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958789605  740 RNLVVDDEAATSLRVTWDIS---DSNVEHFRVTYLTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEG 815
Cdd:pfam00041    4 SNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
32-112 8.71e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.20  E-value: 8.71e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   32 PPTRLRYNVISHDSIQISWKAPRGKFG---GYKLLVAPASGGKTNQMNLQNG-ATKAIIQGLLPEQNYTVQLIAYYKDKE 107
Cdd:cd00063      3 PPTNLRVTDVTSTSVTLSWTPPEDDGGpitGYVVEYREKGSGDWKEVEVTPGsETSYTLTGLKPGTEYEFRVRAVNGGGE 82

                   ....*
gi 1958789605  108 SKPAQ 112
Cdd:cd00063     83 SPPSE 87
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1476-1606 9.91e-11

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 66.08  E-value: 9.91e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1476 QQGEPGPKGPEGPRGETGPagpqgppgpqgpsglsiQGMPGMPGDKGDKGDAGLPGPQGVPGGVGSPGRDGSPGQRGFPG 1555
Cdd:NF038329   166 PQGEAGPQGPAGKDGEAGA-----------------KGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAG 228
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958789605 1556 KDGSSGPPGPPGpigipgapgvPGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   229 PAGDGQQGPDGD----------PGPTGEDGPQGPDGPAGKDGPRGDRGEAG 269
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
356-608 1.00e-10

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 66.56  E-value: 1.00e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  356 PPTELITSEVTARSFMVNWTHSPGK-VEKYRVvYYPTRGGKPEEVVADGRVSSIVLKNLMSSTEYQIAVFAVSAH----T 430
Cdd:COG3401    235 APTGLTATADTPGSVTLSWDPVTESdATGYRV-YRSNSGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAVDAAgnesA 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  431 ASEGLRGTETTLALPMASDLELYDVTENSMRVKWDAVPG--ATGYLILYAPLTEGLAGdekemKIGETHTDIEL--SGLF 506
Cdd:COG3401    314 PSNVVSVTTDLTPPAAPSGLTATAVGSSSITLSWTASSDadVTGYNVYRSTSGGGTYT-----KIAETVTTTSYtdTGLT 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  507 PNTEYTVTVYAM----FGEEASDPATGQETTLP----LTPPRNLRISNVGSNSARLTWDPTSGKITGYrIVYTSADGTEI 578
Cdd:COG3401    389 PGTTYYYKVTAVdaagNESAPSEEVSATTASAAsgesLTASVDAVPLTDVAGATAAASAASNPGVSAA-VLADGGDTGNA 467
                          250       260       270
                   ....*....|....*....|....*....|
gi 1958789605  579 NEVEVDPITTFPLKGLTPLTDYSIAIFSIY 608
Cdd:COG3401    468 VPFTTTSSTVTATTTDTTTANLSVTTGSLV 497
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
1029-1199 4.21e-10

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 62.26  E-value: 4.21e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1029 ADLVFMVDGSWSIGDDN-FNKIINFLYSTVGALDKigadGTQVAMVQFTDDPRT--EFKLDayktKETLLDAIRHISYKG 1105
Cdd:COG1240     93 RDVVLVVDASGSMAAENrLEAAKGALLDFLDDYRP----RDRVGLVAFGGEAEVllPLTRD----REALKRALDELPPGG 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1106 GnTKTGKAIKHVRDTLFTADSGTRrgipKVIVVITDGR---SQDDVNKISREMQADGYNIFAIGVADADYSE--LVRIgS 1180
Cdd:COG1240    165 G-TPLGDALALALELLKRADPARR----KVIVLLTDGRdnaGRIDPLEAAELAAAAGIRIYTIGVGTEAVDEglLREI-A 238
                          170
                   ....*....|....*....
gi 1958789605 1181 KPSSRHVFFVDDFDAFKKI 1199
Cdd:COG1240    239 EATGGRYFRADDLSELAAI 257
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
448-517 5.32e-10

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 57.62  E-value: 5.32e-10
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958789605   448 SDLELYDVTENSMRVKWDAV--PGATGYLILYAPLTEGLAGDEKEMKIGETHTDIELSGLFPNTEYTVTVYA 517
Cdd:smart00060    5 SNLRVTDVTSTSVTLSWEPPpdDGITGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRA 76
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
740-815 6.79e-10

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 57.24  E-value: 6.79e-10
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1958789605   740 RNLVVDDEAATSLRVTWDI-SDSNVEHFRVTY--LTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEG 815
Cdd:smart00060    5 SNLRVTDVTSTSVTLSWEPpPDDGITGYIVGYrvEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
vWA_ATR cd01474
ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, ...
157-336 1.44e-09

ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.


Pssm-ID: 238751 [Multi-domain]  Cd Length: 185  Bit Score: 59.45  E-value: 1.44e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  157 IADIVILVDGSWSIGRfNFRLVRNFLENLVTAFNvgSEKTRIGLAQYSGDPRIEWHLNAFNTKD----EVIDAVrsLPyk 232
Cdd:cd01474      4 HFDLYFVLDKSGSVAA-NWIEIYDFVEQLVDRFN--SPGLRFSFITFSTRATKILPLTDDSSAIikglEVLKKV--TP-- 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  233 GGNTLTGLALNFIFENSFKPEAGSRSgVSKIGILITDGKSQDDiiPPSRNLREA------GVELFAIGVKNADLSELQEI 306
Cdd:cd01474     77 SGQTYIHEGLENANEQIFNRNGGGRE-TVSVIIALTDGQLLLN--GHKYPEHEAklsrklGAIVYCVGVTDFLKSQLINI 153
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1958789605  307 ASEPDstHVYNVAE-FDLMHTVVESLTRTVC 336
Cdd:cd01474    154 ADSKE--YVFPVTSgFQALSGIIESVVKKAC 182
fn3 pfam00041
Fibronectin type III domain;
32-110 2.37e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 55.88  E-value: 2.37e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   32 PPTRLRYNVISHDSIQISWKAPR---GKFGGYKLLVAPA-SGGKTNQMNLQNGATKAIIQGLLPEQNYTVQLIAYYKDKE 107
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 1958789605  108 SKP 110
Cdd:pfam00041   82 GPP 84
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
448-533 4.70e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 55.20  E-value: 4.70e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  448 SDLELYDVTENSMRVKWDAVPGA----TGYLILYAPLTEGlaGDEKEMKIGETHTDIELSGLFPNTEYTVTVYAMFGEEA 523
Cdd:cd00063      5 TNLRVTDVTSTSVTLSWTPPEDDggpiTGYVVEYREKGSG--DWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGE 82
                           90
                   ....*....|
gi 1958789605  524 SDPATGQETT 533
Cdd:cd00063     83 SPPSESVTVT 92
fn3 pfam00041
Fibronectin type III domain;
448-517 1.18e-08

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 53.57  E-value: 1.18e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958789605  448 SDLELYDVTENSMRVKWDAVPGA----TGYLILYAPLTEGlaGDEKEMKIGETHTDIELSGLFPNTEYTVTVYA 517
Cdd:pfam00041    4 SNLTVTDVTSTSLTVSWTPPPDGngpiTGYEVEYRPKNSG--EPWNEITVPGTTTSVTLTGLKPGTEYEVRVQA 75
COG4733 COG4733
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
447-770 1.55e-08

Phage-related protein, tail protein J [Mobilome: prophages, transposons];


Pssm-ID: 443767 [Multi-domain]  Cd Length: 978  Bit Score: 59.96  E-value: 1.55e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  447 ASDLELYDVTENS----MRVKWDAVPGATGYLILYapltegLAGDEKEMKIGETH-TDIELSGLFPNTeYTVTVYAM-FG 520
Cdd:COG4733    537 TTSESLSVVAQGTavttLTVSWDAPAGAVAYEVEW------RRDDGNWVSVPRTSgTSFEVPGIYAGD-YEVRVRAInAL 609
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  521 EEASDPATGQETTL-----PLTPPRNLRISNvGSNSARLTWDPTSG-KITGYRIVYTSADGTEINEVEVD--PITTFPLK 592
Cdd:COG4733    610 GVSSAWAASSETTVtgktaPPPAPTGLTATG-GLGGITLSWSFPVDaDTLRTEIRYSTTGDWASATVAQAlyPGNTYTLA 688
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  593 GLTPLTDYSIAIFSIYEEGQSLPLVGEFTTEEVPAQQYLEIDEVKTDSfrVTWHPLSAEEGQHKLMWIpVYGGKTQEVAL 672
Cdd:COG4733    689 GLKAGQTYYYRARAVDRSGNVSAWWVSGQASADAAGILDAITGQILET--ELGQELDAIIQNATVAEV-VAATVTDVTAQ 765
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  673 KEEQDSYVVEGLDPGTEYEVSLLAVLDDGSESEVVTAVGTTLDD------FWTEAPTTVEPTSPVTSVLQTGIRNLVVDD 746
Cdd:COG4733    766 IDTAVLFAGVATAAAIGAEARVAATVAESATAAAATGTAADAAGdasggvTAGTSGTTGAGDTAASTTRVAAAVVLAGVV 845
                          330       340
                   ....*....|....*....|....
gi 1958789605  747 EAATSLRVTWDISDSNVEHFRVTY 770
Cdd:COG4733    846 VYGDAIIESGNTGDIVATGDIASA 869
vWA_CTRP cd01473
CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an ...
159-336 2.23e-08

CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.


Pssm-ID: 238750 [Multi-domain]  Cd Length: 192  Bit Score: 55.79  E-value: 2.23e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  159 DIVILVDGSWSIGRFNFRL-VRNFLENLVTAFNVGSEKTRIGLAQYSGDPR--IEWHLNAFNTKDEVIDAVRSLP--YK- 232
Cdd:cd01473      2 DLTLILDESASIGYSNWRKdVIPFTEKIINNLNISKDKVHVGILLFAEKNRdvVPFSDEERYDKNELLKKINDLKnsYRs 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  233 GGNTLTGLALNFIFENSFKPEaGSRSGVSKIGILITDG----KSQDDIIPPSRNLREAGVELFAIGVKNADLSELQEIA- 307
Cdd:cd01473     82 GGETYIVEALKYGLKNYTKHG-NRRKDAPKVTMLFTDGndtsASKKELQDISLLYKEENVKLLVVGVGAASENKLKLLAg 160
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1958789605  308 -SEPDSTHVYNV-AEFDLMHTVVESLTRTVC 336
Cdd:cd01473    161 cDINNDNCPNVIkTEWNNLNGISKFLTDKIC 191
ViaA COG2425
Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain ...
1030-1178 2.42e-08

Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain [Function unknown];


Pssm-ID: 441973 [Multi-domain]  Cd Length: 263  Bit Score: 57.00  E-value: 2.42e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1030 DLVFMVDGSWSIGDDNFNKIINFLYSTVGALdkigADGTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHISYKGGnTK 1109
Cdd:COG2425    120 PVVLCVDTSGSMAGSKEAAAKAAALALLRAL----RPNRRFGVILFDTEVVEDLPLTADDGLEDAIEFLSGLFAGGG-TD 194
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958789605 1110 TGKAIKHVRDTLftADSGTRRgipKVIVVITDGRSQDDVNKISREMQADGYN--IFAIGVADADYSELVRI 1178
Cdd:COG2425    195 IAPALRAALELL--EEPDYRN---ADIVLITDGEAGVSPEELLREVRAKESGvrLFTVAIGDAGNPGLLEA 260
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
828-906 2.55e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 52.62  E-value: 2.55e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   828 SGPQNLRVSEEWYNRLRITWDPPS-----APVKGYRIVYKPVSVPGQTLEtfVGADVNTIVMTNLLSGMDYNVKIFASQA 902
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREEGSEWKEVN--VTPSSTSYTLTGLKPGTEYEFRVRAVNG 79

                    ....
gi 1958789605   903 AGYS 906
Cdd:smart00060   80 AGEG 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1476-1606 3.93e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 57.99  E-value: 3.93e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1476 QQGEPGPKGPEGPRGEtgpagpqgppgpqgpsglsiQGMPGMPGDKGDKGDAglpgpqgvpggvgspgrdGSPGQRGFPG 1555
Cdd:NF038329   118 EKGEPGPAGPAGPAGE--------------------QGPRGDRGETGPAGPA------------------GPPGPQGERG 159
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958789605 1556 KDGSSGPPGPpgpigipgapgvPGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   160 EKGPAGPQGE------------AGPQGPAGKDGEAGAKGPAGEKGPQGPRG 198
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
631-702 4.36e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 52.23  E-value: 4.36e-08
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958789605   631 LEIDEVKTDSFRVTW-HPLSAEEGQHKLMWIPVY---GGKTQEVALKEEQDSYVVEGLDPGTEYEVSLLAVLDDGS 702
Cdd:smart00060    7 LRVTDVTSTSVTLSWePPPDDGITGYIVGYRVEYreeGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
631-708 4.74e-08

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 52.50  E-value: 4.74e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  631 LEIDEVKTDSFRVTWHPLSAEEGQH---KLMWIPVYGGKTQEVALKEEQD-SYVVEGLDPGTEYEVSLLAVLDDG----S 702
Cdd:cd00063      7 LRVTDVTSTSVTLSWTPPEDDGGPItgyVVEYREKGSGDWKEVEVTPGSEtSYTLTGLKPGTEYEFRVRAVNGGGesppS 86

                   ....*.
gi 1958789605  703 ESEVVT 708
Cdd:cd00063     87 ESVTVT 92
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
355-432 4.90e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.85  E-value: 4.90e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605   355 GPPTELITSEVTARSFMVNWTHSPGKVEKYRVVYY----PTRGGKPEEVVADGRVSSIVLKNLMSSTEYQIAVFAVSAHT 430
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDGITGYIVGYrveyREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 1958789605   431 AS 432
Cdd:smart00060   82 EG 83
vWA_complement_factors cd01470
Complement factors B and C2 are two critical proteases for complement activation. They both ...
159-329 7.88e-08

Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.


Pssm-ID: 238747 [Multi-domain]  Cd Length: 198  Bit Score: 54.60  E-value: 7.88e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  159 DIVILVDGSWSIGRFNFRLVRNFLENLVT---AFNVgseKTRIGLAQYSGDPRIEWHLNAFNT--KDEVIDAVRSLPYK- 232
Cdd:cd01470      2 NIYIALDASDSIGEEDFDEAKNAIKTLIEkisSYEV---SPRYEIISYASDPKEIVSIRDFNSndADDVIKRLEDFNYDd 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  233 ---GGNTLTGLALNFIFENSFKPEAGSRSGVSKIG---ILITDGKSQ---------DDII------PPSRNLREAGVELF 291
Cdd:cd01470     79 hgdKTGTNTAAALKKVYERMALEKVRNKEAFNETRhviILFTDGKSNmggsplptvDKIKnlvyknNKSDNPREDYLDVY 158
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1958789605  292 AIGV-KNADLSELQEIASE-PDSTHVYNVAEFDLMHTVVE 329
Cdd:cd01470    159 VFGVgDDVNKEELNDLASKkDNERHFFKLKDYEDLQEVFD 198
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1477-1606 1.11e-07

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 56.45  E-value: 1.11e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1477 QGEPGPKGPEGPRGETGPAGPqgppgpqgpsglsiQGMPGMPGDKGDKGDAGLPGPQGVPGgvgspgrDGSPGQRGFPGK 1556
Cdd:NF038329   185 KGPAGEKGPQGPRGETGPAGE--------------QGPAGPAGPDGEAGPAGEDGPAGPAG-------DGQQGPDGDPGP 243
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1557 DGSSGPPGPPgpigipgapgvpGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   244 TGEDGPQGPD------------GPAGKDGPRGDRGEAGPDGPDGKDGERG 281
fn3 pfam00041
Fibronectin type III domain;
626-701 4.62e-07

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 49.34  E-value: 4.62e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  626 PAQQYLEIDEVKTDSFRVTWHPLSAEEGQ---HKLMWIPVYGGK-TQEVALKEEQDSYVVEGLDPGTEYEVSLLAVLDDG 701
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGNGPitgYEVEYRPKNSGEpWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80
VWA_2 pfam13519
von Willebrand factor type A domain;
160-267 5.63e-07

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 49.60  E-value: 5.63e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  160 IVILVDGSWSI-----GRFNFRLVRNFLENLVTAFNvgseKTRIGLAQYSGDPRIEWHLNAfnTKDEVIDAVRSLPYKGG 234
Cdd:pfam13519    1 LVFVLDTSGSMrngdyGPTRLEAAKDAVLALLKSLP----GDRVGLVTFGDGPEVLIPLTK--DRAKILRALRRLEPKGG 74
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1958789605  235 NTLTGLALNFIFENSFKPeagsRSGVSKIGILI 267
Cdd:pfam13519   75 GTNLAAALQLARAALKHR----RKNQPRRIVLI 103
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
32-108 1.08e-06

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 47.99  E-value: 1.08e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605    32 PPTRLRYNVISHDSIQISWKAP-RGKFGGYKL---LVAPASGGKTNQMNLQNGATKAIIQGLLPEQNYTVQLIAYYKDKE 107
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPpDDGITGYIVgyrVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 1958789605   108 S 108
Cdd:smart00060   83 G 83
COG4733 COG4733
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
679-915 2.09e-06

Phage-related protein, tail protein J [Mobilome: prophages, transposons];


Pssm-ID: 443767 [Multi-domain]  Cd Length: 978  Bit Score: 53.03  E-value: 2.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  679 YVVEGLDPGTEYEVSLLAVLDDGSESEVVTAvgttlddfwtEAPTTVEPTSPVTSVLQTGIRNLVVDDEAATSLRVTWDI 758
Cdd:COG4733    491 FRVVSIEENEDGTYTITAVQHAPEKYAAIDA----------GAFDDVPPQWPPVNVTTSESLSVVAQGTAVTTLTVSWDA 560
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  759 SDSNVeHFRVTYLTAQGDpkeeVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVG------EGVSVSAPGKTLPTSGPQN 832
Cdd:COG4733    561 PAGAV-AYEVEWRRDDGN----WVSVPRTSGTSFEVPGIYAGDYEVRVRAINALGvssawaASSETTVTGKTAPPPAPTG 635
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  833 LRVSEEwYNRLRITWDPPS-APVKGYRIVYKPVSVPGQTLETFVGADVNTIVMTNLLSGMDYNVKIFASQAAGYSDALTG 911
Cdd:COG4733    636 LTATGG-LGGITLSWSFPVdADTLRTEIRYSTTGDWASATVAQALYPGNTYTLAGLKAGQTYYYRARAVDRSGNVSAWWV 714

                   ....
gi 1958789605  912 LVQT 915
Cdd:COG4733    715 SGQA 718
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1476-1606 2.72e-06

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 51.83  E-value: 2.72e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1476 QQGEPGPKGPEGPRGETGPAGPQGPPGPQGPSGLSIQGMPGMPGDKGDKGDAGLpgpqgvpggvgspgrDGSPGQRGFPG 1555
Cdd:NF038329   193 PQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGE---------------DGPQGPDGPAG 257
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1958789605 1556 KDGSSGPPGPpgpigipgapgvPGIAGSMGPQGALGPPGVPGAKGERGERG 1606
Cdd:NF038329   258 KDGPRGDRGE------------AGPDGPDGKDGERGPVGPAGKDGQNGKDG 296
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
684-938 5.25e-06

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 51.16  E-value: 5.25e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  684 LDPGTEYEVSLLAVLDDGSESEVVTAVGTTLDDFWTEAPTTVEPTSPVTSVLQTGIRNLVVDDEAATSLRVTWDISDSNV 763
Cdd:COG3401     87 APPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGA 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  764 EHFRVTYLTAQGDPKEEVVMVPGVQNSLLLKNLLPDTEYKVTVTPIYTVGEGV---SVSAPGKTLPTSGPQNLRVSEEWY 840
Cdd:COG3401    167 GVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESApsnEVSVTTPTTPPSAPTGLTATADTP 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  841 NRLRITWDPPSAP-VKGYRIVYKpvSVPGQTLETFVGADVNTIVMTNLLSGMDYNVKIFASQAAG----YSDALTGLVQT 915
Cdd:COG3401    247 GSVTLSWDPVTESdATGYRVYRS--NSGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAVDAAGnesaPSNVVSVTTDL 324
                          250       260
                   ....*....|....*....|....
gi 1958789605  916 LFLGV-TDLQANQVEMTSLCARWQ 938
Cdd:COG3401    325 TPPAApSGLTATAVGSSSITLSWT 348
ViaA COG2425
Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain ...
155-309 9.07e-06

Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain [Function unknown];


Pssm-ID: 441973 [Multi-domain]  Cd Length: 263  Bit Score: 49.29  E-value: 9.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  155 PAIADIVILVDGSWSIGRFNFRLVRNFLENLVTAFNvgsEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGG 234
Cdd:COG2425    116 LLEGPVVLCVDTSGSMAGSKEAAAKAAALALLRALR---PNRRFGVILFDTEVVEDLPLTADDGLEDAIEFLSGLFAGGG 192
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958789605  235 nTLTGLALNFIFEnSFKPEAGSRSGVskigILITDGKSQDDIIPPSRNLR--EAGVELFAIGVKNADLSELQEIASE 309
Cdd:COG2425    193 -TDIAPALRAALE-LLEEPDYRNADI----VLITDGEAGVSPEELLREVRakESGVRLFTVAIGDAGNPGLLEALAD 263
PTZ00441 PTZ00441
sporozoite surface protein 2 (SSP2); Provisional
1021-1194 1.49e-05

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 49.96  E-value: 1.49e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1021 KEVCKAaKADLVFMVDGSWSIGDDNF-NKIINFLYSTVGALDkIGADGTQVAMVQFTDDPRTEFKL--DAYKTKETLLDA 1097
Cdd:PTZ00441    36 EEVCNE-EVDLYLLVDGSGSIGYHNWiTHVIPMLMGLIQQLN-LSDDAINLYMSLFSNNTTELIRLgsGASKDKEQALII 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1098 IRHISyKG----GNTKTGKAIKHVRDTLftADSGTRRGIPKVIVVITDG--RSQDDVNKISREMQADGYNIFAIGVA--- 1168
Cdd:PTZ00441   114 VKSLR-KTylpyGKTNMTDALLEVRKHL--NDRVNRENAIQLVILMTDGipNSKYRALEESRKLKDRNVKLAVIGIGqgi 190
                          170       180
                   ....*....|....*....|....*.
gi 1958789605 1169 DADYSELVrIGSKPSSRHVFFVDDFD 1194
Cdd:PTZ00441   191 NHQFNRLL-AGCRPREGKCKFYSDAD 215
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1478-1602 2.20e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 2.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1478 GEPGPKGPEGPRGEtgpagpqgppgpqgpsglsiqgmPGMPGDKGDKGDaglpgpqgvpggvgspgrdgsPGQRGFPgkd 1557
Cdd:pfam01391    1 GPPGPPGPPGPPGP-----------------------PGPPGPPGPPGP---------------------PGPPGEP--- 33
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1958789605 1558 gssgppgppgpigipgapgvpGIAGSMGPQGALGPPGVPGAKGER 1602
Cdd:pfam01391   34 ---------------------GPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1513-1606 2.52e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 2.52e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1513 GMPGMPGDKGDKGDAglpgpqgvpggvgspgrdGSPGQRGFPgkdgssgppGPpgpigipgapgvPGIAGSMGPQGALGP 1592
Cdd:pfam01391    1 GPPGPPGPPGPPGPP------------------GPPGPPGPP---------GP------------PGPPGEPGPPGPPGP 41
                           90
                   ....*....|....
gi 1958789605 1593 PGVPGAKGERGERG 1606
Cdd:pfam01391   42 PGPPGPPGAPGAPG 55
TerY COG4245
Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];
1032-1199 3.92e-05

Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];


Pssm-ID: 443387 [Multi-domain]  Cd Length: 196  Bit Score: 46.46  E-value: 3.92e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1032 VFMVDGSWSIGDDNFNKI---INFLYSTVGAlDKIGADGTQVAMVQFTDDPRTEFKLdayktkeTLLDAIR--HISYkGG 1106
Cdd:COG4245      9 YLLLDTSGSMSGEPIEALnegLQALIDELRQ-DPYALETVEVSVITFDGEAKVLLPL-------TDLEDFQppDLSA-SG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1107 NTKTGKAIKHVRDTLfTADSGTRRGIPK-----VIVVITDGRSQDD-----VNKISREMQADGYNIFAIGV-ADADYSEL 1175
Cdd:COG4245     80 GTPLGAALELLLDLI-ERRVQKYTAEGKgdwrpVVFLITDGEPTDSdweaaLQRLKDGEAAKKANIFAIGVgPDADTEVL 158
                          170       180
                   ....*....|....*....|....
gi 1958789605 1176 VRIGskPSSRhVFFVDDFDAFKKI 1199
Cdd:COG4245    159 KQLT--DPVR-ALDALDGLDFREF 179
vWA_F09G8-8_type cd01477
VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood ...
1030-1188 4.75e-05

VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. The members of this subgroup lack the MIDAS motif. This subgroup is found only in C. elegans and the members identified thus far are always found fused to a C-Lectin type domain. Biochemical function thus far has not be attributed to any of the members of this subgroup.


Pssm-ID: 238754 [Multi-domain]  Cd Length: 193  Bit Score: 46.26  E-value: 4.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1030 DLVFMVDGSWSIGDDNFNKIINFLYSTVGALDKIGAD-----GTQVAMVQFTDDPRTEFKLDAYKTKETLLDAIRHI--- 1101
Cdd:cd01477     21 DIVFVVDNSKGMTQGGLWQVRATISSLFGSSSQIGTDyddprSTRVGLVTYNSNATVVADLNDLQSFDDLYSQIQGSltd 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1102 --SYKGGNTKTG--KAIKhvrdtLFTADSGT-RRGIPKVIVVIT---DGRSQDDVNKISREMQADGYNIfaIGVA---DA 1170
Cdd:cd01477    101 vsSTNASYLDTGlqAAEQ-----MLAAGKRTsRENYKKVVIVFAsdyNDEGSNDPRPIAARLKSTGIAI--ITVAftqDE 173
                          170
                   ....*....|....*...
gi 1958789605 1171 DYSELVRIGSKPSSRHVF 1188
Cdd:cd01477    174 SSNLLDKLGKIASPGMNF 191
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1578-1607 5.76e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 5.76e-05
                           10        20        30
                   ....*....|....*....|....*....|
gi 1958789605 1578 PGIAGSMGPQGALGPPGVPGAKGERGERGD 1607
Cdd:pfam01391    3 PGPPGPPGPPGPPGPPGPPGPPGPPGPPGE 32
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1476-1598 2.69e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 2.69e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605 1476 QQGEPGPKGPEGPRGETgpagpqgppgpqgpsglsiqGMPGMPGDKGDKGDAglpgpqgvpggvgspgrdGSPGQRGFPg 1555
Cdd:pfam01391    2 PPGPPGPPGPPGPPGPP--------------------GPPGPPGPPGPPGEP------------------GPPGPPGPP- 42
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1958789605 1556 kdgssgppgppgpigipgapgvpgiagsmGPQGALGPPGVPGA 1598
Cdd:pfam01391   43 -----------------------------GPPGPPGAPGAPGP 56
fn3 pfam00041
Fibronectin type III domain;
920-995 5.77e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.48  E-value: 5.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958789605  920 VTDLQANQVEMTSLCARWQI----HRHATAYRIVLESLQDTQA-QESTVGGGVNRHCFYGLQPDSEYKISVYTKLQEIEG 994
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPppdgNGPITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 1958789605  995 P 995
Cdd:pfam00041   83 P 83
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1681-1755 9.79e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 35.93  E-value: 9.79e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958789605 1681 AGVPGTPGERGLTGVKGDKGNPGigtqgprgppgpagpsgesrpgspgppgspgprgppgHLGVPGPQGPSGQPG 1755
Cdd:pfam01391   15 PGPPGPPGPPGPPGPPGEPGPPG-------------------------------------PPGPPGPPGPPGAPG 52
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH