Conserved domains on [gi|3834380|gb|AAC71661|]
intrinsic factor-B12 receptor precursor [Rattus norvegicus]

List of domain hits

Name Accession Description Interval E-value
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1620-1733 2.37e-38

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 140.24  E-value: 2.37e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1620 CGGRIMTDSSDTIFSPLYPHNYLHNQNCSWIIEAqPPFNHITLSFTHFQLQNSTDCTRDFVEILDGNDYDAPVQGRYCGF 1699
Cdd:cd00041    1 CGGTLTASTSGTISSPNYPNNYPNNLNCVWTIEA-PPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  1700 SLPHPIISFGNALTVRFVTDSTRSFEGFRAIYSA 1733
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2689-2800 6.23e-37

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 136.39  E-value: 6.23e-37
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2689 CGG-IRTGDNGVISSPNYPNLYSAWTHCSWLLKAPEGHTITLTFSDFLLEAHPTCTSDSVTVRNGDSPGSPVIGRYCGQS 2767
Cdd:cd00041    1 CGGtLTASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  2768 VPRPIQSGSNQLIVTFNTNNQGQTRGFYATWTT 2800
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1391-1505 1.65e-36

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 135.23  E-value: 1.65e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1391 CGGEMSGTA-GSFSSPGYPNSYPHNKECIWNIRVAPGSSIQLTIHDFDVEYHTSCNYDSLEIYAGLDFNSPRIAQLCSQS 1469
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 3834380  1470 PsanPMQVSSTGNELAIRFKTDSTLNGRGFNASWRA 1505
Cdd:cd00041   81 L---PPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
590-699 4.82e-35

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 131.00  E-value: 4.82e-35
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   590 CGGILTDN-YGSITSPGYPGNYPPGRDCVWQVLVNPNSLITFTFGTLSLESHNDCSKDYLEIRDGPFHQDPVLGKFCTSL 668
Cdd:cd00041    1 CGGTLTAStSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|.
gi 3834380   669 STPPLKTTGPAARIHFHSDSETSDKGFHITY 699
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1165-1275 9.69e-35

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 129.84  E-value: 9.69e-35
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1165 CGGNLTTPT-GVLTSPNYPMPYYHSSECYWRLEASHGSPFELEFQDFHLEHHPSCSLDYLAVFDGPTTNSRLIDKLCGDT 1243
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  1244 TPAPIRSNKDVVLLKLRTDAGQQGRGFEINFR 1275
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYS 112
CUB pfam00431
CUB domain;
1048-1157 1.28e-34

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 129.34  E-value: 1.28e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1048 CLYDYTDNFGMLSSPNFPNNYPSNWECIYRITVGLNQQIALHFTDFTLEDYFGsqCV-DFVEIRDGGYETSPLVGIYCGS 1126
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDE--CGyDYVEIRDGPSASSPLLGRFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    1127 VLPPTIISHSNKLWLKFKSDAALTAKGFSAY 1157
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKAT 109
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2217-2333 1.64e-34

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 129.45  E-value: 1.64e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2217 CGGTVYihdADSDGYLTSPNYPANYPQHAECIWILEAPPGRSIQLQFEDqFNIEDTPNCSVSYLELRDGANSNARLVSKL 2296
Cdd:cd00041    1 CGGTLT---ASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFED-FDLESSPNCSYDYLEIYDGPSTSSPLLGRF 76
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 3834380  2297 CGHTLPHSWVSSRERIYLKFHTDGGSSYMGFKAKYSI 2333
Cdd:cd00041   77 CGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
932-1041 4.76e-34

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 127.91  E-value: 4.76e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   932 CGEVLTAST-GIIESPGHPNVYPRGVNCTWHVVVQRGQLIRLEFSSFYLEFHYNCTNDYLEIYD--TAAQTFLGRYCGKS 1008
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDgpSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  1009 IPPSLTSNSNSIKLIFVSDSALAHEGFSINYEA 1041
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1738-1847 1.44e-33

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 126.76  E-value: 1.44e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1738 CGGSFY-TLDGIFNSPDYPADYHPNAECVWNIASSPGNRLQLSFLSFNLENSLNCNKDFVEIREGNAT-GHLIGRYCGNS 1815
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTsSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  1816 LPGNYSSaEGHSLWVRFVSDGSGTGMGFQARF 1847
Cdd:cd00041   81 LPPPIIS-SGNSLTVRFRSDSSVTGRGFKATY 111
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3395-3506 4.98e-33

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 125.22  E-value: 4.98e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3395 CNREYN-QTFGNLKSPGWPQNYDNNLDCTIILRAPQNHSISLFFYWFQLEDSRQCMNDFLEVRNGGSSTSPLLDKYCSNL 3473
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  3474 LPNPVFSQSNELYLHFHSDHSVTNNGYEIIWTS 3506
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2336-2447 3.50e-32

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 122.52  E-value: 3.50e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2336 CGGTVSG-DSGVIESIGYPTlPYANNVFCQWFIRGLPGHYLTLSFEDFNLQSSPGCTKDFVEIWE-NHTSGRVLGRYCGN 2413
Cdd:cd00041    1 CGGTLTAsTSGTISSPNYPN-NYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDgPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  2414 STPSSVDTSSNVASVKFVTDGSVTASGFRLQFKS 2447
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1978-2088 1.43e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.98  E-value: 1.43e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1978 CGGFmVTGDTPVHIFSPGWPREYANGADCIWIIYAPD-STVELNILSLDIEPQQSCNYDKLIVKDGDSDLSPELAVLCGV 2056
Cdd:cd00041    1 CGGT-LTASTSGTISSPNYPNNYPNNLNCVWTIEAPPgYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  2057 SPPGPIRSTGEYMYIRFTSDTSVAGTGFNASF 2088
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3037-3148 1.75e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.59  E-value: 1.75e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3037 CGGIYNES-SGILRSPSYsYSNYPNNLYCVYSLHVRSSRVIIIRFNDFDVAPSNLCAHDFLEVFDGPSIGNRSLGKFCGS 3115
Cdd:cd00041    1 CGGTLTAStSGTISSPNY-PNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  3116 TRPQTVKSTNSSLTLLFKTDSSQTARGWKIFFR 3148
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYS 112
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2452-2564 2.77e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.21  E-value: 2.77e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2452 CGGDLHGPT-GTFTSPNYPNPNPHARICEWTITVQEGRRIVLTFTNLRLSTQPSCNSEHLIVFNGIRSNSPLLQKLCSRv 2530
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS- 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  2531 NVTNEFKSSGNTMKVVFFTDGSRPYGGFTASYTS 2564
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2092-2212 5.89e-30

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 116.36  E-value: 5.89e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2092 CGGYLHADR-GVITSPKYPDTYLPNLNCSWHVLVQTGLTIAVHFEQpFQIQNrDSFCSQgDYLVLRNGPDNHSPPLGpsg 2170
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFED-FDLES-SPNCSY-DYLEIYDGPSTSSPLLG--- 74
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 3834380  2171 rngRFCGMYAPSTLFTSGNEMFVQFISDSSNGGQGFKIRYEA 2212
Cdd:cd00041   75 ---RFCGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3157-3273 2.02e-29

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 114.82  E-value: 2.02e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3157 CGGYLT-EDNQSFVSPDSDSNgrYDKGLSCIWYIVAPENKLVKLTFNVFTLEgpsSAGSCVYDYVQIADGASINSYLGGK 3235
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNN--YPNNLNCVWTIEAPPGYRIRLTFEDFDLE---SSPNCSYDYLEIYDGPSTSSPLLGR 75
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 3834380  3236 FCGSRMPAPFISSGNFLTFQFVSDVTVEMRGFNATYTF 3273
Cdd:cd00041   76 FCGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1510-1617 2.68e-29

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 114.43  E-value: 2.68e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1510 CGGIIQLSR-GEIHSPNYPNNYRANTECSWIIQVERHHRVLLNITDFDLEAPDSC----LRLMDGSSSTNARVASVCGRQ 1584
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCsydyLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  1585 QPPnSIIASGNSLFVRFRSGSSSQNRGFRAEFR 1617
Cdd:cd00041   81 LPP-PIISSGNSLTVRFRSDSSVTGRGFKATYS 112
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3511-3623 1.78e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 112.12  E-value: 1.78e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3511 CGGTLLG-DEGIFTNPGFPDSYPNNTHCEWTIVAPSGRPVSVGFPFLSIDSSGGCDQNYLIVFNGPDANSPPFGPLCGiN 3589
Cdd:cd00041    1 CGGTLTAsTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCG-S 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  3590 TGIAPFYASSNRVFIRFHAEYTTRLSGFEIMWSS 3623
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1278-1388 2.52e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 111.74  E-value: 2.52e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1278 CDNVVIVNkTSGILESINYPNPYDKNQRCNWTIQATTGNTVNYTFLGFDVESYMNCSTDYVELYDGP----QWMGRYCGN 1353
Cdd:cd00041    1 CGGTLTAS-TSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPstssPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 3834380  1354 NMPPPGATTGSQLHVLFHTDGINSGeKGFKMQWFT 1388
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTG-RGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1852-1962 2.65e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 111.74  E-value: 2.65e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1852 GNNNIVGTHGKIASPFWPGKYPYNSNYKWVVNVDAYHIIHGRILEMDIEPTTNCFYDSLKIYDGFDTHSRLIGTYCGTQT 1931
Cdd:cd00041    2 GGTLTASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGSTL 81
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  1932 -ESFSSSRNSLTFQFSSDSSVSGRGFLLEWFA 1962
Cdd:cd00041   82 pPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
474-585 5.22e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 110.96  E-value: 5.22e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   474 CGGILSGT-QGTFayHSPN--DTYIHNVNCFWIVRTDEEKVLHVTFTFFDLESASNCPREYLQIHDGDSSADFPLGRYCG 550
Cdd:cd00041    1 CGGTLTAStSGTI--SSPNypNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCG 78
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 3834380   551 SRPPQGIHSSANALYFHLYSEYIRSGRGFTARWEA 585
Cdd:cd00041   79 STLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
817-927 7.03e-26

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 104.80  E-value: 7.03e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   817 CGGMLRG--EGFFRSPFYPNAYPGRRTCRWTISQPQRQVVLLNFTDFQIGSSASCDTDYIEIGPSSVLGSPGNEKFCSSN 894
Cdd:cd00041    1 CGGTLTAstSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380   895 IPSFITSVYNILYVTFVKSSSMENRGFTAKFSS 927
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2920-3034 1.31e-24

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 100.95  E-value: 1.31e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2920 CGRTFN-TSPGDIISPNFPKQYDNNMNCTYLIDADPQSLVILTFVSFHLEDrsaiTGTCDHDGLHIIKGRNLSSTPLVTI 2998
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLES----SPNCSYDYLEIYDGPSTSSPLLGRF 76
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 3834380  2999 CGSETLRPLTVDGP-VLLNFYSDAYTTDFGFKISYRA 3034
Cdd:cd00041   77 CGSTLPPPIISSGNsLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2570-2686 6.18e-24

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 99.02  E-value: 6.18e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2570 CGGFLPSVSGGNFSSPGYNGirDYARNLDCEWTLSNPnrENSSISIYFLELSIESHQDCTFDVLEFRVG-DADGPLIEKF 2648
Cdd:cd00041    1 CGGTLTASTSGTISSPNYPN--NYPNNLNCVWTIEAP--PGYRIRLTFEDFDLESSPNCSYDYLEIYDGpSTSSPLLGRF 76
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 3834380  2649 CSLSAPtAPLVIPYPQVWIHFVSNERVEYTGFYIEYSF 2686
Cdd:cd00041   77 CGSTLP-PPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
cubilin_NTD cd22201
N-terminal domain of cubilin and similar proteins; Cubilin (CUBN, also called 460 kDa receptor, ...
38-132 7.80e-21

N-terminal domain of cubilin and similar proteins; Cubilin (CUBN, also called 460 kDa receptor, intestinal intrinsic factor receptor, intrinsic factor-cobalamin receptor, or intrinsic factor-vitamin B12 receptor) is an endocytic receptor which plays a role in lipoprotein, vitamin and iron metabolism by facilitating their uptake. It acts together with the 45-kDa transmembrane protein amnionless (AMN) to mediate endocytosis of the cobalamin (vitamin B12) binding intrinsic factor (CBLIF)-cobalamin complex. This model corresponds to the N-terminal domain of cubilin, which is responsible for the interaction with AMN. The cubilin interface with AMN is formed by the N-terminal strands of three cubilin chains.


Pssm-ID: 412063  Cd Length: 129  Bit Score: 90.85  E-value: 7.80e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    38 QPRMTTEEGNLVFLTSSTQNIEFRTGSLGKIKLNDEDLGECLHQIQRNKDDIIDLRKN-----------TTGLPQNILSQ 106
Cdd:cd22201   13 QPRIITEDGHLIFEAAYDKNISFRTSGNGRININDEDLLELLQQAKNNKSDIENLKQSelptfeqqlseLVGGPQGLLRR 92
                         90       100
                 ....*....|....*....|....*.
gi 3834380   107 VHQLNSKLVDLERDFQNLQQNVERKV 132
Cdd:cd22201   93 LALLENRTSGLSSTLNNNIRRLRRRL 118
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
708-815 3.03e-20

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 88.62  E-value: 3.03e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   708 CGGNYTDTD-GELLLPPLSGPFSHSRQCVYLITQAQGEQIVINFTHVELESQMGCSHTYIEVGDHDS----LLRKICGNE 782
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPStsspLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380   783 TLFPIRSVSNKVWIRLRIDALVQKASFRADYQV 815
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2805-2918 7.34e-20

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 87.47  E-value: 7.34e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2805 CGGTFHSA-NGTIKSPHWPQTFPENSRCSWTVITHESKHWEISFDSNFRIPSSDsqCQNSFVKVWEGRLmINKTLLATSC 2883
Cdd:cd00041    1 CGGTLTAStSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPN--CSYDYLEIYDGPS-TSSPLLGRFC 77
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 3834380  2884 GDVAPSPIVTSGNIFTAVFQSEEM-AAQGFSASFIS 2918
Cdd:cd00041   78 GSTLPPPIISSGNSLTVRFRSDSSvTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3278-3392 3.18e-16

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 77.07  E-value: 3.18e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3278 CGGTYNATsTPQNASSPHLSNIGRPYSTCTWVIAAPPQQQVQITVWDLQL-PSQDCSQSYLELQDSVQTGGNRVTQFCGa 3356
Cdd:cd00041    1 CGGTLTAS-TSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLeSSPNCSYDYLEIYDGPSTSSPLLGRFCG- 78
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 3834380  3357 nYTTLPVFYSSMSTAVVVFKSGVLNRNSQVQFSYQI 3392
Cdd:cd00041   79 -STLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
432-468 8.18e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 53.41  E-value: 8.18e-09
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 3834380   432 NINDCSS-NPCLNGGTCIDGINGFTCDCTSSWTGYYCQ 468
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
167-207 3.58e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.79  E-value: 3.58e-07
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|.
gi 3834380   167 DVNECVvySGTPfgCQSGSTCVNTVGSFRCDCTPDTYGPQC 207
Cdd:cd00054    1 DIDECA--SGNP--CQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
350-387 4.64e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 4.64e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 3834380     350 CSIHNGGCHPEATCSSSPvlGSFlpVCTCPPGYTGNGY 387
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG--GSF--TCTCNDGYTGDGV 34
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
133-164 1.48e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 1.48e-06
                         10        20        30
                 ....*....|....*....|....*....|...
gi 3834380   133 CSS-NPCLNGGTCVNLHDSFVCICPSQWKGLFC 164
Cdd:cd00054    5 CASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
306-344 6.23e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.28  E-value: 6.23e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 3834380     306 CEINNGGCSQAPLvpCLNTPGSFSCgNCPAGFSGDGRVC 344
Cdd:pfam12947    1 CSDNNGGCHPNAT--CTNTGGSFTC-TCNDGYTGDGVTC 36
EGF_CA smart00179
Calcium-binding EGF-like domain;
260-301 6.56e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 45.32  E-value: 6.56e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 3834380      260 DKDECSlQPSPCSEHAQCFNTQGSFYCgACPKGWQgNGYECQ 301
Cdd:smart00179    1 DIDECA-SGNPCQNGGTCVNTVGSYRC-ECPPGYT-DGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
400-430 4.92e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 4.92e-05
                         10        20        30
                 ....*....|....*....|....*....|..
gi 3834380   400 SRHPCVN-GQCIETVSSYFCKCDSGWSGQNCT 430
Cdd:cd00054    7 SGNPCQNgGTCVNTVGSYRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1620-1733 2.37e-38

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 140.24  E-value: 2.37e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1620 CGGRIMTDSSDTIFSPLYPHNYLHNQNCSWIIEAqPPFNHITLSFTHFQLQNSTDCTRDFVEILDGNDYDAPVQGRYCGF 1699
Cdd:cd00041    1 CGGTLTASTSGTISSPNYPNNYPNNLNCVWTIEA-PPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  1700 SLPHPIISFGNALTVRFVTDSTRSFEGFRAIYSA 1733
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2689-2800 6.23e-37

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 136.39  E-value: 6.23e-37
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2689 CGG-IRTGDNGVISSPNYPNLYSAWTHCSWLLKAPEGHTITLTFSDFLLEAHPTCTSDSVTVRNGDSPGSPVIGRYCGQS 2767
Cdd:cd00041    1 CGGtLTASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  2768 VPRPIQSGSNQLIVTFNTNNQGQTRGFYATWTT 2800
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1391-1505 1.65e-36

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 135.23  E-value: 1.65e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1391 CGGEMSGTA-GSFSSPGYPNSYPHNKECIWNIRVAPGSSIQLTIHDFDVEYHTSCNYDSLEIYAGLDFNSPRIAQLCSQS 1469
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 3834380  1470 PsanPMQVSSTGNELAIRFKTDSTLNGRGFNASWRA 1505
Cdd:cd00041   81 L---PPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
590-699 4.82e-35

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 131.00  E-value: 4.82e-35
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   590 CGGILTDN-YGSITSPGYPGNYPPGRDCVWQVLVNPNSLITFTFGTLSLESHNDCSKDYLEIRDGPFHQDPVLGKFCTSL 668
Cdd:cd00041    1 CGGTLTAStSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|.
gi 3834380   669 STPPLKTTGPAARIHFHSDSETSDKGFHITY 699
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1165-1275 9.69e-35

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 129.84  E-value: 9.69e-35
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1165 CGGNLTTPT-GVLTSPNYPMPYYHSSECYWRLEASHGSPFELEFQDFHLEHHPSCSLDYLAVFDGPTTNSRLIDKLCGDT 1243
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  1244 TPAPIRSNKDVVLLKLRTDAGQQGRGFEINFR 1275
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYS 112
CUB pfam00431
CUB domain;
1391-1502 1.00e-34

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 129.72  E-value: 1.00e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1391 CGGEMSGTAGSFSSPGYPNSYPHNKECIWNIRVAPGSSIQLTIHDFDVEYHTSCNYDSLEIYAGLDFNSPRIAQLCSqsp 1470
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCG--- 77
                           90       100       110
                   ....*....|....*....|....*....|..
gi 3834380    1471 SANPMQVSSTGNELAIRFKTDSTLNGRGFNAS 1502
Cdd:pfam00431   78 SGIPEDIVSSSNQMTIKFVSDASVQKRGFKAT 109
CUB pfam00431
CUB domain;
1048-1157 1.28e-34

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 129.34  E-value: 1.28e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1048 CLYDYTDNFGMLSSPNFPNNYPSNWECIYRITVGLNQQIALHFTDFTLEDYFGsqCV-DFVEIRDGGYETSPLVGIYCGS 1126
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDE--CGyDYVEIRDGPSASSPLLGRFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    1127 VLPPTIISHSNKLWLKFKSDAALTAKGFSAY 1157
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKAT 109
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2217-2333 1.64e-34

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 129.45  E-value: 1.64e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2217 CGGTVYihdADSDGYLTSPNYPANYPQHAECIWILEAPPGRSIQLQFEDqFNIEDTPNCSVSYLELRDGANSNARLVSKL 2296
Cdd:cd00041    1 CGGTLT---ASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFED-FDLESSPNCSYDYLEIYDGPSTSSPLLGRF 76
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 3834380  2297 CGHTLPHSWVSSRERIYLKFHTDGGSSYMGFKAKYSI 2333
Cdd:cd00041   77 CGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB pfam00431
CUB domain;
1165-1274 2.31e-34

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 128.95  E-value: 2.31e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1165 CGGNLTTPTGVLTSPNYPMPYYHSSECYWRLEASHGSPFELEFQDFHLEHHPSCSLDYLAVFDGPTTNSRLIDKLCGDTT 1244
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 3834380    1245 PAPIRSNKDVVLLKLRTDAGQQGRGFEINF 1274
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
590-699 2.73e-34

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 128.57  E-value: 2.73e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     590 CGGILTDNYGSITSPGYPGNYPPGRDCVWQVLVNPNSLITFTFGTLSLESHNDCSKDYLEIRDGPFHQDPVLGKFCTSLS 669
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 3834380     670 TPPLKTTGPAARIHFHSDSETSDKGFHITY 699
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
932-1041 4.76e-34

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 127.91  E-value: 4.76e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   932 CGEVLTAST-GIIESPGHPNVYPRGVNCTWHVVVQRGQLIRLEFSSFYLEFHYNCTNDYLEIYD--TAAQTFLGRYCGKS 1008
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDgpSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  1009 IPPSLTSNSNSIKLIFVSDSALAHEGFSINYEA 1041
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1738-1847 1.44e-33

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 126.76  E-value: 1.44e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1738 CGGSFY-TLDGIFNSPDYPADYHPNAECVWNIASSPGNRLQLSFLSFNLENSLNCNKDFVEIREGNAT-GHLIGRYCGNS 1815
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTsSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  1816 LPGNYSSaEGHSLWVRFVSDGSGTGMGFQARF 1847
Cdd:cd00041   81 LPPPIIS-SGNSLTVRFRSDSSVTGRGFKATY 111
CUB pfam00431
CUB domain;
1738-1847 2.62e-33

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 125.87  E-value: 2.62e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1738 CGGSFYTLDGIFNSPDYPADYHPNAECVWNIASSPGNRLQLSFLSFNLENSLNCNKDFVEIREG-NATGHLIGRYCGNSL 1816
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGpSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    1817 PGNYSSaEGHSLWVRFVSDGSGTGMGFQARF 1847
Cdd:pfam00431   81 PEDIVS-SSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3395-3506 4.98e-33

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 125.22  E-value: 4.98e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3395 CNREYN-QTFGNLKSPGWPQNYDNNLDCTIILRAPQNHSISLFFYWFQLEDSRQCMNDFLEVRNGGSSTSPLLDKYCSNL 3473
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  3474 LPNPVFSQSNELYLHFHSDHSVTNNGYEIIWTS 3506
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2336-2447 3.50e-32

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 122.52  E-value: 3.50e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2336 CGGTVSG-DSGVIESIGYPTlPYANNVFCQWFIRGLPGHYLTLSFEDFNLQSSPGCTKDFVEIWE-NHTSGRVLGRYCGN 2413
Cdd:cd00041    1 CGGTLTAsTSGTISSPNYPN-NYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDgPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  2414 STPSSVDTSSNVASVKFVTDGSVTASGFRLQFKS 2447
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2698-2798 3.76e-32

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 122.11  E-value: 3.76e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2698 GVISSPNYPNLYSAWTHCSWLLKAPEGHTITLTFSDFLLEAHPTCTSDSVTVRNGDSPGSPVIGRYCGQSVPRP-IQSGS 2776
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPvISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     2777 NQLIVTFNTNNQGQTRGFYATW 2798
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1747-1847 4.88e-32

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 121.73  E-value: 4.88e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1747 GIFNSPDYPADYHPNAECVWNIASSPGNRLQLSFLSFNLENSLNCNKDFVEIREGNATGH-LIGRYCGNSLPGNYSSAEG 1825
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSpLLGRFCGSEAPPPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1826 HSLWVRFVSDGSGTGMGFQARF 1847
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2689-2797 6.13e-32

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 122.02  E-value: 6.13e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2689 CGGIRTGDNGVISSPNYPNLYSAWTHCSWLLKAPEGHTITLTFSDFLLEAHPTCTSDSVTVRNGDSPGSPVIGRYCGQSV 2768
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGI 80
                           90       100
                   ....*....|....*....|....*....
gi 3834380    2769 PRPIQSGSNQLIVTFNTNNQGQTRGFYAT 2797
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKAT 109
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1052-1158 7.40e-32

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 121.75  E-value: 7.40e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1052 YTDNFGMLSSPNFPNNYPSNWECIYRITVGLNQQIALHFTDFTLEDYfgSQCV-DFVEIRDGGYETSPLVGIYCGSVLPP 1130
Cdd:cd00041    6 TASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESS--PNCSyDYLEIYDGPSTSSPLLGRFCGSTLPP 83
                         90       100
                 ....*....|....*....|....*...
gi 3834380  1131 TIISHSNKLWLKFKSDAALTAKGFSAYW 1158
Cdd:cd00041   84 PIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB pfam00431
CUB domain;
1620-1731 1.36e-31

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 120.86  E-value: 1.36e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1620 CGGRImTDSSDTIFSPLYPHNYLHNQNCSWIIEAQPPFnHITLSFTHFQLQNSTDCTRDFVEILDGNDYDAPVQGRYCGF 1699
Cdd:pfam00431    1 CGGVL-TDSSGSISSPNYPNPYPPNKDCVWLIRAPPGF-RVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 3834380    1700 SLPHPIISFGNALTVRFVTDSTRSFEGFRAIY 1731
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1978-2088 1.43e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.98  E-value: 1.43e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1978 CGGFmVTGDTPVHIFSPGWPREYANGADCIWIIYAPD-STVELNILSLDIEPQQSCNYDKLIVKDGDSDLSPELAVLCGV 2056
Cdd:cd00041    1 CGGT-LTASTSGTISSPNYPNNYPNNLNCVWTIEAPPgYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  2057 SPPGPIRSTGEYMYIRFTSDTSVAGTGFNASF 2088
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3037-3148 1.75e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.59  E-value: 1.75e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3037 CGGIYNES-SGILRSPSYsYSNYPNNLYCVYSLHVRSSRVIIIRFNDFDVAPSNLCAHDFLEVFDGPSIGNRSLGKFCGS 3115
Cdd:cd00041    1 CGGTLTAStSGTISSPNY-PNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  3116 TRPQTVKSTNSSLTLLFKTDSSQTARGWKIFFR 3148
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYS 112
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2452-2564 2.77e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.21  E-value: 2.77e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2452 CGGDLHGPT-GTFTSPNYPNPNPHARICEWTITVQEGRRIVLTFTNLRLSTQPSCNSEHLIVFNGIRSNSPLLQKLCSRv 2530
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS- 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  2531 NVTNEFKSSGNTMKVVFFTDGSRPYGGFTASYTS 2564
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB pfam00431
CUB domain;
2217-2331 4.19e-31

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 119.32  E-value: 4.19e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2217 CGGTVyihdADSDGYLTSPNYPANYPQHAECIWILEAPPGRSIQLQFEDqFNIEDTPNCSVSYLELRDGANSNARLVSKL 2296
Cdd:pfam00431    1 CGGVL----TDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQD-FELEDHDECGYDYVEIRDGPSASSPLLGRF 75
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 3834380    2297 CGHTLPHSWVSSRERIYLKFHTDGGSSYMGFKAKY 2331
Cdd:pfam00431   76 CGSGIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1400-1503 4.56e-31

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 119.03  E-value: 4.56e-31
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1400 GSFSSPGYPNSYPHNKECIWNIRVAPGSSIQLTIHDFDVEYHTSCNYDSLEIYAGLDFNSPRIAQLCSQSPSANPmqVSS 1479
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPV--ISS 78
                            90       100
                    ....*....|....*....|....
gi 3834380     1480 TGNELAIRFKTDSTLNGRGFNASW 1503
Cdd:smart00042   79 SSNSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
932-1039 5.55e-30

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 116.24  E-value: 5.55e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     932 CGEVLTASTGIIESPGHPNVYPRGVNCTWHVVVQRGQLIRLEFSSFYLEFHYNCTNDYLEIYD--TAAQTFLGRYCGKSI 1009
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDgpSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 3834380    1010 PPSLTSNSNSIKLIFVSDSALAHEGFSINY 1039
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2092-2212 5.89e-30

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 116.36  E-value: 5.89e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2092 CGGYLHADR-GVITSPKYPDTYLPNLNCSWHVLVQTGLTIAVHFEQpFQIQNrDSFCSQgDYLVLRNGPDNHSPPLGpsg 2170
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFED-FDLES-SPNCSY-DYLEIYDGPSTSSPLLG--- 74
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 3834380  2171 rngRFCGMYAPSTLFTSGNEMFVQFISDSSNGGQGFKIRYEA 2212
Cdd:cd00041   75 ---RFCGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1057-1158 1.40e-29

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 114.79  E-value: 1.40e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1057 GMLSSPNFPNNYPSNWECIYRITVGLNQQIALHFTDFTLEDyfGSQCV-DFVEIRDGGYETSPLVGIYCGSVLPPTII-S 1134
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLES--SDNCEyDYVEIYDGPSASSPLLGRFCGSEAPPPVIsS 78
                            90       100
                    ....*....|....*....|....
gi 3834380     1135 HSNKLWLKFKSDAALTAKGFSAYW 1158
Cdd:smart00042   79 SSNSLTLTFVSDSSVQKRGFSARY 102
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3157-3273 2.02e-29

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 114.82  E-value: 2.02e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3157 CGGYLT-EDNQSFVSPDSDSNgrYDKGLSCIWYIVAPENKLVKLTFNVFTLEgpsSAGSCVYDYVQIADGASINSYLGGK 3235
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNN--YPNNLNCVWTIEAPPGYRIRLTFEDFDLE---SSPNCSYDYLEIYDGPSTSSPLLGR 75
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 3834380  3236 FCGSRMPAPFISSGNFLTFQFVSDVTVEMRGFNATYTF 3273
Cdd:cd00041   76 FCGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1510-1617 2.68e-29

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 114.43  E-value: 2.68e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1510 CGGIIQLSR-GEIHSPNYPNNYRANTECSWIIQVERHHRVLLNITDFDLEAPDSC----LRLMDGSSSTNARVASVCGRQ 1584
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCsydyLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  1585 QPPnSIIASGNSLFVRFRSGSSSQNRGFRAEFR 1617
Cdd:cd00041   81 LPP-PIISSGNSLTVRFRSDSSVTGRGFKATYS 112
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1631-1731 3.64e-29

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 113.64  E-value: 3.64e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1631 TIFSPLYPHNYLHNQNCSWIIEAqPPFNHITLSFTHFQLQNSTDCTRDFVEILDGNDYDAPVQGRYCGFSLPHPII-SFG 1709
Cdd:smart00042    2 TITSPNYPQSYPNNLDCVWTIRA-PPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVIsSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1710 NALTVRFVTDSTRSFEGFRAIY 1731
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
599-699 5.63e-29

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 113.25  E-value: 5.63e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      599 GSITSPGYPGNYPPGRDCVWQVLVNPNSLITFTFGTLSLESHNDCSKDYLEIRDGPFHQDPVLGKFCTSLSTPPLKTT-G 677
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSsS 80
                            90       100
                    ....*....|....*....|..
gi 3834380      678 PAARIHFHSDSETSDKGFHITY 699
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2336-2445 5.65e-29

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 113.54  E-value: 5.65e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2336 CGGTVSGDSGVIESIGYPTlPYANNVFCQWFIRGLPGHYLTLSFEDFNLQSSPGCTKDFVEIWENHT-SGRVLGRYCGNS 2414
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPN-PYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSaSSPLLGRFCGSG 79
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    2415 TPSSVDTSSNVASVKFVTDGSVTASGFRLQF 2445
Cdd:pfam00431   80 IPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
1510-1616 1.49e-28

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 112.39  E-value: 1.49e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1510 CGGIIQLSRGEIHSPNYPNNYRANTECSWIIQVERHHRVLLNITDFDLEAPDSC----LRLMDGSSSTNARVASVCGRQQ 1585
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECgydyVEIRDGPSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    1586 PPNsIIASGNSLFVRFRSGSSSQNRGFRAEF 1616
Cdd:pfam00431   81 PED-IVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3511-3623 1.78e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 112.12  E-value: 1.78e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3511 CGGTLLG-DEGIFTNPGFPDSYPNNTHCEWTIVAPSGRPVSVGFPFLSIDSSGGCDQNYLIVFNGPDANSPPFGPLCGiN 3589
Cdd:cd00041    1 CGGTLTAsTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCG-S 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  3590 TGIAPFYASSNRVFIRFHAEYTTRLSGFEIMWSS 3623
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB pfam00431
CUB domain;
3037-3147 1.99e-28

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 112.00  E-value: 1.99e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    3037 CGGIYNESSGILRSPSYSySNYPNNLYCVYSLHVRSSRVIIIRFNDFDVAPSNLCAHDFLEVFDGPSIGNRSLGKFCGST 3116
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYP-NPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSG 79
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    3117 RPQTVKSTNSSLTLLFKTDSSQTARGWKIFF 3147
Cdd:pfam00431   80 IPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1278-1388 2.52e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 111.74  E-value: 2.52e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1278 CDNVVIVNkTSGILESINYPNPYDKNQRCNWTIQATTGNTVNYTFLGFDVESYMNCSTDYVELYDGP----QWMGRYCGN 1353
Cdd:cd00041    1 CGGTLTAS-TSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPstssPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 3834380  1354 NMPPPGATTGSQLHVLFHTDGINSGeKGFKMQWFT 1388
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTG-RGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1852-1962 2.65e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 111.74  E-value: 2.65e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1852 GNNNIVGTHGKIASPFWPGKYPYNSNYKWVVNVDAYHIIHGRILEMDIEPTTNCFYDSLKIYDGFDTHSRLIGTYCGTQT 1931
Cdd:cd00041    2 GGTLTASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGSTL 81
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  1932 -ESFSSSRNSLTFQFSSDSSVSGRGFLLEWFA 1962
Cdd:cd00041   82 pPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2230-2331 4.64e-28

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 110.56  E-value: 4.64e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2230 GYLTSPNYPANYPQHAECIWILEAPPGRSIQLQFEDqFNIEDTPNCSVSYLELRDGANSNARLVSKLCGHTLPHSWVSSR 2309
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTD-FDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSS 79
                            90       100
                    ....*....|....*....|...
gi 3834380     2310 -ERIYLKFHTDGGSSYMGFKAKY 2331
Cdd:smart00042   80 sNSLTLTFVSDSSVQKRGFSARY 102
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
474-585 5.22e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 110.96  E-value: 5.22e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   474 CGGILSGT-QGTFayHSPN--DTYIHNVNCFWIVRTDEEKVLHVTFTFFDLESASNCPREYLQIHDGDSSADFPLGRYCG 550
Cdd:cd00041    1 CGGTLTAStSGTI--SSPNypNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCG 78
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 3834380   551 SRPPQGIHSSANALYFHLYSEYIRSGRGFTARWEA 585
Cdd:cd00041   79 STLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1174-1274 6.27e-28

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 110.17  E-value: 6.27e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1174 GVLTSPNYPMPYYHSSECYWRLEASHGSPFELEFQDFHLEHHPSCSLDYLAVFDGPTTNSRLIDKLCGDTTPAP-IRSNK 1252
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPvISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1253 DVVLLKLRTDAGQQGRGFEINF 1274
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
941-1039 9.99e-28

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 109.40  E-value: 9.99e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      941 GIIESPGHPNVYPRGVNCTWHVVVQRGQLIRLEFSSFYLEFHYNCTNDYLEIYD--TAAQTFLGRYCGKSIPPS-LTSNS 1017
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDgpSASSPLLGRFCGSEAPPPvISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1018 NSIKLIFVSDSALAHEGFSINY 1039
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2345-2445 1.79e-27

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 109.02  E-value: 1.79e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2345 GVIESIGYPTlPYANNVFCQWFIRGLPGHYLTLSFEDFNLQSSPGCTKDFVEIWENH-TSGRVLGRYCGNSTPSSV-DTS 2422
Cdd:smart00042    1 GTITSPNYPQ-SYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPsASSPLLGRFCGSEAPPPViSSS 79
                            90       100
                    ....*....|....*....|...
gi 3834380     2423 SNVASVKFVTDGSVTASGFRLQF 2445
Cdd:smart00042   80 SNSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
3157-3271 5.47e-27

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 107.77  E-value: 5.47e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    3157 CGGYLTEDNQSFVSPDSDSNgrYDKGLSCIWYIVAPENKLVKLTFNVFTLEGpssAGSCVYDYVQIADGASINSYLGGKF 3236
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNP--YPPNKDCVWLIRAPPGFRVKLTFQDFELED---HDECGYDYVEIRDGPSASSPLLGRF 75
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 3834380    3237 CGSRMPAPFISSGNFLTFQFVSDVTVEMRGFNATY 3271
Cdd:pfam00431   76 CGSGIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
1978-2088 6.14e-27

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 107.77  E-value: 6.14e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1978 CGGFmVTgDTPVHIFSPGWPREYANGADCIWIIYAPD-STVELNILSLDIEPQQSCNYDKLIVKDGDSDLSPELAVLCGV 2056
Cdd:pfam00431    1 CGGV-LT-DSSGSISSPNYPNPYPPNKDCVWLIRAPPgFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 3834380    2057 SPPGPIRSTGEYMYIRFTSDTSVAGTGFNASF 2088
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3046-3147 7.25e-27

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 107.09  E-value: 7.25e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3046 GILRSPSYSySNYPNNLYCVYSLHVRSSRVIIIRFNDFDVAPSNLCAHDFLEVFDGPSIGNRSLGKFCGSTRPQTVKSTN 3125
Cdd:smart00042    1 GTITSPNYP-QSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSS 79
                            90       100
                    ....*....|....*....|...
gi 3834380     3126 S-SLTLLFKTDSSQTARGWKIFF 3147
Cdd:smart00042   80 SnSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
3395-3502 2.01e-26

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 106.23  E-value: 2.01e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    3395 CNREYNQTFGNLKSPGWPQNYDNNLDCTIILRAPQNHSISLFFYWFQLEDSRQCMNDFLEVRNGGSSTSPLLDKYCSNLL 3474
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGI 80
                           90       100
                   ....*....|....*....|....*...
gi 3834380    3475 PNPVFSQSNELYLHFHSDHSVTNNGYEI 3502
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKA 108
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1519-1616 2.54e-26

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 105.55  E-value: 2.54e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1519 GEIHSPNYPNNYRANTECSWIIQVERHHRVLLNITDFDLEAPDSC----LRLMDGSSSTNARVASVCGRQQPPNSIIASG 1594
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCeydyVEIYDGPSASSPLLGRFCGSEAPPPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1595 NSLFVRFRSGSSSQNRGFRAEF 1616
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2452-2562 3.02e-26

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 105.45  E-value: 3.02e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2452 CGGDLHGPTGTFTSPNYPNPNPHARICEWTITVQEGRRIVLTFTNLRLSTQPSCNSEHLIVFNGIRSNSPLLQKLCSRVN 2531
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    2532 VTNeFKSSGNTMKVVFFTDGSRPYGGFTASY 2562
Cdd:pfam00431   81 PED-IVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
817-927 7.03e-26

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 104.80  E-value: 7.03e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   817 CGGMLRG--EGFFRSPFYPNAYPGRRTCRWTISQPQRQVVLLNFTDFQIGSSASCDTDYIEIGPSSVLGSPGNEKFCSSN 894
Cdd:cd00041    1 CGGTLTAstSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380   895 IPSFITSVYNILYVTFVKSSSMENRGFTAKFSS 927
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3404-3504 9.51e-26

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 104.01  E-value: 9.51e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3404 GNLKSPGWPQNYDNNLDCTIILRAPQNHSISLFFYWFQLEDSRQCMNDFLEVRNGGSSTSPLLDKYCSNLLPNPVF-SQS 3482
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVIsSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     3483 NELYLHFHSDHSVTNNGYEIIW 3504
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1990-2088 1.26e-25

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 103.62  E-value: 1.26e-25
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1990 HIFSPGWPREYANGADCIWIIYAPD-STVELNILSLDIEPQQSCNYDKLIVKDGDSDLSPELAVLCG-VSPPGPIRSTGE 2067
Cdd:smart00042    2 TITSPNYPQSYPNNLDCVWTIRAPPgYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGsEAPPPVISSSSN 81
                            90       100
                    ....*....|....*....|.
gi 3834380     2068 YMYIRFTSDTSVAGTGFNASF 2088
Cdd:smart00042   82 SLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
3511-3619 2.69e-25

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 102.76  E-value: 2.69e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    3511 CGGTLLGDEGIFTNPGFPDSYPNNTHCEWTIVAPSGRPVSVGFPFLSIDSSGGCDQNYLIVFNGPDANSPPFGPLCGinT 3590
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCG--S 78
                           90       100       110
                   ....*....|....*....|....*....|
gi 3834380    3591 GI-APFYASSNRVFIRFHAEYTTRLSGFEI 3619
Cdd:pfam00431   79 GIpEDIVSSSNQMTIKFVSDASVQKRGFKA 108
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2461-2562 2.72e-25

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 102.47  E-value: 2.72e-25
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2461 GTFTSPNYPNPNPHARICEWTITVQEGRRIVLTFTNLRLSTQPSCNSEHLIVFNGIRSNSPLLQKLCSRVNVTNEFKSSG 2540
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     2541 NTMKVVFFTDGSRPYGGFTASY 2562
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2092-2210 1.14e-24

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 101.22  E-value: 1.14e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2092 CGGYLHADRGVITSPKYPDTYLPNLNCSWHVLVQTGLTIAVHFeQPFQIQnRDSFCsQGDYLVLRNGPDNHSPPLgpsgr 2171
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTF-QDFELE-DHDEC-GYDYVEIRDGPSASSPLL----- 72
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 3834380    2172 nGRFCGMYAPSTLFTSGNEMFVQFISDSSNGGQGFKIRY 2210
Cdd:pfam00431   73 -GRFCGSGIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
1286-1383 1.20e-24

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 101.22  E-value: 1.20e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1286 KTSGILESINYPNPYDKNQRCNWTIQATTGNTVNYTFLGFDVESYMNCSTDYVELYDGP----QWMGRYCGNNMPPPGAT 1361
Cdd:pfam00431    7 DSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPsassPLLGRFCGSGIPEDIVS 86
                           90       100
                   ....*....|....*....|..
gi 3834380    1362 TGSQLHVLFHTDGINSGeKGFK 1383
Cdd:pfam00431   87 SSNQMTIKFVSDASVQK-RGFK 107
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2920-3034 1.31e-24

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 100.95  E-value: 1.31e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2920 CGRTFN-TSPGDIISPNFPKQYDNNMNCTYLIDADPQSLVILTFVSFHLEDrsaiTGTCDHDGLHIIKGRNLSSTPLVTI 2998
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLES----SPNCSYDYLEIYDGPSTSSPLLGRF 76
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 3834380  2999 CGSETLRPLTVDGP-VLLNFYSDAYTTDFGFKISYRA 3034
Cdd:cd00041   77 CGSTLPPPIISSGNsLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1289-1386 2.85e-24

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 99.77  E-value: 2.85e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1289 GILESINYPNPYDKNQRCNWTIQATTGNTVNYTFLGFDVESYMNCSTDYVELYDGPQW----MGRYCGNNMPPPGATT-G 1363
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSAssplLGRFCGSEAPPPVISSsS 80
                            90       100
                    ....*....|....*....|...
gi 3834380     1364 SQLHVLFHTDGINSGeKGFKMQW 1386
Cdd:smart00042   81 NSLTLTFVSDSSVQK-RGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2101-2210 5.74e-24

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 99.00  E-value: 5.74e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2101 GVITSPKYPDTYLPNLNCSWHVLVQTGLTIAVHFEqPFQIQNRDSfCSQgDYLVLRNGPDNHSPPLgpsgrnGRFCGMYA 2180
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFT-DFDLESSDN-CEY-DYVEIYDGPSASSPLL------GRFCGSEA 71
                            90       100       110
                    ....*....|....*....|....*....|.
gi 3834380     2181 PSTLFTS-GNEMFVQFISDSSNGGQGFKIRY 2210
Cdd:smart00042   72 PPPVISSsSNSLTLTFVSDSSVQKRGFSARY 102
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2570-2686 6.18e-24

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 99.02  E-value: 6.18e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2570 CGGFLPSVSGGNFSSPGYNGirDYARNLDCEWTLSNPnrENSSISIYFLELSIESHQDCTFDVLEFRVG-DADGPLIEKF 2648
Cdd:cd00041    1 CGGTLTASTSGTISSPNYPN--NYPNNLNCVWTIEAP--PGYRIRLTFEDFDLESSPNCSYDYLEIYDGpSTSSPLLGRF 76
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 3834380  2649 CSLSAPtAPLVIPYPQVWIHFVSNERVEYTGFYIEYSF 2686
Cdd:cd00041   77 CGSTLP-PPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3520-3619 5.86e-23

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 95.92  E-value: 5.86e-23
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3520 GIFTNPGFPDSYPNNTHCEWTIVAPSGRPVSVGFPFLSIDSSGGCDQNYLIVFNGPDANSPPFGPLCGINTGIAPFYASS 3599
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSSS 80
                            90       100
                    ....*....|....*....|
gi 3834380     3600 NRVFIRFHAEYTTRLSGFEI 3619
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSA 100
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1861-1960 1.55e-22

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 94.76  E-value: 1.55e-22
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1861 GKIASPFWPGKYPYNSNYKWVVNVDAYHIIHGRILEMDIEPTTNCFYDSLKIYDGFDTHSRLIGTYCGTQT--ESFSSSR 1938
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAppPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1939 NSLTFQFSSDSSVSGRGFLLEW 1960
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3179-3271 2.09e-22

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 94.38  E-value: 2.09e-22
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3179 YDKGLSCIWYIVAPENKLVKLTFNVFTLEgpsSAGSCVYDYVQIADGASINSYLGGKFCGSRMPAPFISS-GNFLTFQFV 3257
Cdd:smart00042   12 YPNNLDCVWTIRAPPGYRIELQFTDFDLE---SSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSsSNSLTLTFV 88
                            90
                    ....*....|....
gi 3834380     3258 SDVTVEMRGFNATY 3271
Cdd:smart00042   89 SDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
1861-1956 1.82e-21

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 91.97  E-value: 1.82e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1861 GKIASPFWPGKYPYNSNYKWVVNVDAYHIIHGRILEMDIEPTTNCFYDSLKIYDGFDTHSRLIGTYCGTQT-ESFSSSRN 1939
Cdd:pfam00431   10 GSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGIpEDIVSSSN 89
                           90
                   ....*....|....*..
gi 3834380    1940 SLTFQFSSDSSVSGRGF 1956
Cdd:pfam00431   90 QMTIKFVSDASVQKRGF 106
cubilin_NTD cd22201
N-terminal domain of cubilin and similar proteins; Cubilin (CUBN, also called 460 kDa receptor, ...
38-132 7.80e-21

N-terminal domain of cubilin and similar proteins; Cubilin (CUBN, also called 460 kDa receptor, intestinal intrinsic factor receptor, intrinsic factor-cobalamin receptor, or intrinsic factor-vitamin B12 receptor) is an endocytic receptor that plays a role in lipoprotein, vitamin, and iron metabolism by facilitating their uptake. It acts together with the 45 kDa transmembrane protein amnionless (AMN) to mediate endocytosis of the complex formed by cobalamin (vitamin B12) and cobalamin-binding intrinsic factor (CBLIF). This model corresponds to the N-terminal domain of cubilin, which is responsible for the interaction with AMN. The cubilin interface with AMN is formed by the N-terminal strands of three cubilin chains.


Pssm-ID: 412063  Cd Length: 129  Bit Score: 90.85  E-value: 7.80e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    38 QPRMTTEEGNLVFLTSSTQNIEFRTGSLGKIKLNDEDLGECLHQIQRNKDDIIDLRKN-----------TTGLPQNILSQ 106
Cdd:cd22201   13 QPRIITEDGHLIFEAAYDKNISFRTSGNGRININDEDLLELLQQAKNNKSDIENLKQSelptfeqqlseLVGGPQGLLRR 92
                         90       100
                 ....*....|....*....|....*.
gi 3834380   107 VHQLNSKLVDLERDFQNLQQNVERKV 132
Cdd:cd22201   93 LALLENRTSGLSSTLNNNIRRLRRRL 118
CUB pfam00431
CUB domain;
817-925 2.58e-20

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 88.51  E-value: 2.58e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     817 CGGMLRGE-GFFRSPFYPNAYPGRRTCRWTISQPQRQVVLLNFTDFQIGSSASCDTDYIEI--GPSSVLGSPGneKFCSS 893
Cdd:pfam00431    1 CGGVLTDSsGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIrdGPSASSPLLG--RFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 3834380     894 NIPSFITSVYNILYVTFVKSSSMENRGFTAKF 925
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
708-815 3.03e-20

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 88.62  E-value: 3.03e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   708 CGGNYTDTD-GELLLPPLSGPFSHSRQCVYLITQAQGEQIVINFTHVELESQMGCSHTYIEVGDHDS----LLRKICGNE 782
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPStsspLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380   783 TLFPIRSVSNKVWIRLRIDALVQKASFRADYQV 815
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB pfam00431
CUB domain;
474-582 4.99e-20

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 87.74  E-value: 4.99e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     474 CGGILSGTQGTFayHSPN--DTYIHNVNCFWIVRTDEEKVLHVTFTFFDLESASNCPREYLQIHDGDSSADFPLGRYCGS 551
Cdd:pfam00431    1 CGGVLTDSSGSI--SSPNypNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380     552 RPPQGIHSSANALYFHLYSEYIRSGRGFTAR 582
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKAT 109
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
825-925 5.44e-20

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 87.45  E-value: 5.44e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      825 GFFRSPFYPNAYPGRRTCRWTISQPQRQVVLLNFTDFQIGSSASCDTDYIEIGPSSVLGSPGNEKFCSSNIP-SFITSVY 903
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPpPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380      904 NILYVTFVKSSSMENRGFTAKF 925
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2805-2918 7.34e-20

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 87.47  E-value: 7.34e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2805 CGGTFHSA-NGTIKSPHWPQTFPENSRCSWTVITHESKHWEISFDSNFRIPSSDsqCQNSFVKVWEGRLmINKTLLATSC 2883
Cdd:cd00041    1 CGGTLTAStSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPN--CSYDYLEIYDGPS-TSSPLLGRFC 77
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 3834380  2884 GDVAPSPIVTSGNIFTAVFQSEEM-AAQGFSASFIS 2918
Cdd:cd00041   78 GSTLPPPIISSGNSLTVRFRSDSSvTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2580-2684 6.12e-19

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 84.36  E-value: 6.12e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2580 GNFSSPGYngIRDYARNLDCEWTLSNPNreNSSISIYFLELSIESHQDCTFDVLEFRVGD-ADGPLIEKFCSLSAPTAPL 2658
Cdd:smart00042    1 GTITSPNY--PQSYPNNLDCVWTIRAPP--GYRIELQFTDFDLESSDNCEYDYVEIYDGPsASSPLLGRFCGSEAPPPVI 76
                            90       100
                    ....*....|....*....|....*.
gi 3834380     2659 VIPYPQVWIHFVSNERVEYTGFYIEY 2684
Cdd:smart00042   77 SSSSNSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
487-583 1.45e-18

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 83.59  E-value: 1.45e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      487 YHSPN--DTYIHNVNCFWIVRTDEEKVLHVTFTFFDLESASNCPREYLQIHDGDSSADFPLGRYCGSRPPQGIHSSA-NA 563
Cdd:smart00042    3 ITSPNypQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSSsNS 82
                            90       100
                    ....*....|....*....|
gi 3834380      564 LYFHLYSEYIRSGRGFTARW 583
Cdd:smart00042   83 LTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2929-3032 2.79e-18

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 82.44  E-value: 2.79e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2929 GDIISPNFPKQYDNNMNCTYLIDADPQSLVILTFVSFHLEDRSaitgTCDHDGLHIIKGRNLSSTPLVTICGSETLRPL- 3007
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSD----NCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVi 76
                            90       100
                    ....*....|....*....|....*.
gi 3834380     3008 -TVDGPVLLNFYSDAYTTDFGFKISY 3032
Cdd:smart00042   77 sSSSNSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2570-2684 4.44e-18

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 82.34  E-value: 4.44e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2570 CGGFLPSVSGgNFSSPGYNgiRDYARNLDCEWTLSNPnrENSSISIYFLELSIESHQDCTFDVLEFRVGD-ADGPLIEKF 2648
Cdd:pfam00431    1 CGGVLTDSSG-SISSPNYP--NPYPPNKDCVWLIRAP--PGFRVKLTFQDFELEDHDECGYDYVEIRDGPsASSPLLGRF 75
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 3834380    2649 CSLSAPtAPLVIPYPQVWIHFVSNERVEYTGFYIEY 2684
Cdd:pfam00431   76 CGSGIP-EDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
2920-3032 3.52e-17

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 79.65  E-value: 3.52e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2920 CGRTFNTSPGDIISPNFPKQYDNNMNCTYLIDADPQSLVILTFVSFHLEDrsaiTGTCDHDGLHIIKGRNLSSTPLVTIC 2999
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELED----HDECGYDYVEIRDGPSASSPLLGRFC 76
                           90       100       110
                   ....*....|....*....|....*....|....
gi 3834380    3000 GSETLRPLTVDGP-VLLNFYSDAYTTDFGFKISY 3032
Cdd:pfam00431   77 GSGIPEDIVSSSNqMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
708-813 3.84e-17

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 79.65  E-value: 3.84e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     708 CGGNYTDTDGELLLPPLSGPFSHSRQCVYLITQAQGEQIVINFTHVELESQMGCSHTYIEVGDHDS----LLRKICGNET 783
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSasspLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 3834380     784 LFPIRSVSNKVWIRLRIDALVQKASFRADY 813
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
2805-2916 2.27e-16

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 77.34  E-value: 2.27e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2805 CGGTFHSANGTIKSPHWPQTFPENSRCSWTVITHESKHWEISFDSnFRIPSSDSqCQNSFVKVWEGRlMINKTLLATSCG 2884
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQD-FELEDHDE-CGYDYVEIRDGP-SASSPLLGRFCG 77
                           90       100       110
                   ....*....|....*....|....*....|...
gi 3834380    2885 DVAPSPIVTSGNIFTAVFQS-EEMAAQGFSASF 2916
Cdd:pfam00431   78 SGIPEDIVSSSNQMTIKFVSdASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3278-3392 3.18e-16

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 77.07  E-value: 3.18e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3278 CGGTYNATsTPQNASSPHLSNIGRPYSTCTWVIAAPPQQQVQITVWDLQL-PSQDCSQSYLELQDSVQTGGNRVTQFCGa 3356
Cdd:cd00041    1 CGGTLTAS-TSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLeSSPNCSYDYLEIYDGPSTSSPLLGRFCG- 78
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 3834380  3357 nYTTLPVFYSSMSTAVVVFKSGVLNRNSQVQFSYQI 3392
Cdd:cd00041   79 -STLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2814-2916 3.23e-14

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 70.88  E-value: 3.23e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2814 GTIKSPHWPQTFPENSRCSWTVITHESKHWEISFDSnFRIpSSDSQCQNSFVKVWEGRLMINKtLLATSCGDVAPSPIVT 2893
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTD-FDL-ESSDNCEYDYVEIYDGPSASSP-LLGRFCGSEAPPPVIS 77
                            90       100
                    ....*....|....*....|....*
gi 3834380     2894 S-GNIFTAVFQS-EEMAAQGFSASF 2916
Cdd:smart00042   78 SsSNSLTLTFVSdSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
726-813 2.08e-12

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 65.87  E-value: 2.08e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      726 GPFSHSRQCVYLITQAQGEQIVINFTHVELESQMGCSHTYIEVGD----HDSLLRKICGNETLFP-IRSVSNKVWIRLRI 800
Cdd:smart00042   10 QSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDgpsaSSPLLGRFCGSEAPPPvISSSSNSLTLTFVS 89
                            90
                    ....*....|...
gi 3834380      801 DALVQKASFRADY 813
Cdd:smart00042   90 DSSVQKRGFSARY 102
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
432-468 8.18e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 53.41  E-value: 8.18e-09
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 3834380   432 NINDCSS-NPCLNGGTCIDGINGFTCDCTSSWTGYYCQ 468
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
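
The EGF_CA description above mentions six conserved core cysteines, and each alignment row carries its own start and end coordinates. The following is a minimal sketch, not part of the CDD output, that checks both properties for the hit shown above. The helper names `parse_alignment_row` and `ungapped` are hypothetical, and the row layout (label, start, sequence, end, separated by whitespace) is assumed from the rows on this page.

```python
def parse_alignment_row(row: str):
    """Split a row such as 'gi 3834380   432 SEQ 468' into (start, sequence, end).

    Assumes the last three whitespace-separated fields are the start
    coordinate, the aligned sequence, and the end coordinate.
    """
    parts = row.split()
    return int(parts[-3]), parts[-2], int(parts[-1])


def ungapped(seq: str) -> str:
    """Drop gap characters and fold lowercase (unaligned) residues to uppercase."""
    return seq.replace("-", "").upper()


# Rows copied from the EGF_CA (cd00054) hit at residues 432-468.
query_row = "gi 3834380   432 NINDCSS-NPCLNGGTCIDGINGFTCDCTSSWTGYYCQ 468"
model_row = "Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38"

q_start, q_seq, q_end = parse_alignment_row(query_row)
m_start, m_seq, m_end = parse_alignment_row(model_row)

q_res = ungapped(q_seq)
m_res = ungapped(m_seq)

# The number of residues on a row should equal end - start + 1.
query_span_ok = len(q_res) == q_end - q_start + 1
model_span_ok = len(m_res) == m_end - m_start + 1

# Count cysteines in the query segment and in the domain model row.
q_cys = q_res.count("C")
m_cys = m_res.count("C")

print(query_span_ok, model_span_ok, q_cys, m_cys)
```

For this hit, both coordinate checks hold and both rows contain six cysteines, in line with the description. The same check could be applied to any other single-row alignment in the list; multi-row alignments would need their rows concatenated first.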
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3292-3377 2.33e-08

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 54.32  E-value: 2.33e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3292 SSPHLSNIGRPYSTCTWVIAAPPQQQVQITVWDLQL-PSQDCSQSYLELQDSVQTGGNRVTQFCGaNYTTLPVFYSSMST 3370
Cdd:smart00042    4 TSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLeSSDNCEYDYVEIYDGPSASSPLLGRFCG-SEAPPPVISSSSNS 82

                    ....*..
gi 3834380     3371 AVVVFKS 3377
Cdd:smart00042   83 LTLTFVS 89
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
167-207 3.58e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.79  E-value: 3.58e-07
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|.
gi 3834380   167 DVNECVvySGTPfgCQSGSTCVNTVGSFRCDCTPDTYGPQC 207
Cdd:cd00054    1 DIDECA--SGNP--CQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
350-387 4.64e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 4.64e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 3834380     350 CSIHNGGCHPEATCSSSPvlGSFlpVCTCPPGYTGNGY 387
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG--GSF--TCTCNDGYTGDGV 34
EGF_CA smart00179
Calcium-binding EGF-like domain;
167-207 9.78e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 47.63  E-value: 9.78e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 3834380      167 DVNECVVYSGtpfgCQSGSTCVNTVGSFRCDCTPD-TYGPQC 207
Cdd:smart00179    1 DIDECASGNP----CQNGGTCVNTVGSYRCECPPGyTDGRNC 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
133-164 1.48e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 1.48e-06
                         10        20        30
                 ....*....|....*....|....*....|...
gi 3834380   133 CSS-NPCLNGGTCVNLHDSFVCICPSQWKGLFC 164
Cdd:cd00054    5 CASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA smart00179
Calcium-binding EGF-like domain;
432-468 1.60e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 47.24  E-value: 1.60e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 3834380      432 NINDCSS-NPCLNGGTCIDGINGFTCDCTSSWT-GYYCQ 468
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
436-464 2.41e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model because there are many similar but distinct subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 46.22  E-value: 2.41e-06
                           10        20
                   ....*....|....*....|....*....
gi 3834380     436 CSSNPCLNGGTCIDGINGFTCDCTSSWTG 464
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
306-344 6.23e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.28  E-value: 6.23e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 3834380     306 CEINNGGCSQAPLvpCLNTPGSFSCgNCPAGFSGDGRVC 344
Cdd:pfam12947    1 CSDNNGGCHPNAT--CTNTGGSFTC-TCNDGYTGDGVTC 36
EGF_CA smart00179
Calcium-binding EGF-like domain;
260-301 6.56e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 45.32  E-value: 6.56e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 3834380      260 DKDECSlQPSPCSEHAQCFNTQGSFYCgACPKGWQgNGYECQ 301
Cdd:smart00179    1 DIDECA-SGNPCQNGGTCVNTVGSYRC-ECPPGYT-DGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
260-295 1.17e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.55  E-value: 1.17e-05
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 3834380   260 DKDECSlQPSPCSEHAQCFNTQGSFYCgACPKGWQG 295
Cdd:cd00054    1 DIDECA-SGNPCQNGGTCVNTVGSYRC-SCPPGYTG 34
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
400-430 4.92e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 4.92e-05
                         10        20        30
                 ....*....|....*....|....*....|..
gi 3834380   400 SRHPCVN-GQCIETVSSYFCKCDSGWSGQNCT 430
Cdd:cd00054    7 SGNPCQNgGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
302-345 5.36e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 42.62  E-value: 5.36e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 3834380      302 DINECEINNGgCSQAPLvpCLNTPGSFSCgNCPAGFSgDGRVCT 345
Cdd:smart00179    1 DIDECASGNP-CQNGGT--CVNTVGSYRC-ECPPGYT-DGRNCE 39
EGF_CA pfam07645
Calcium-binding EGF domain;
167-198 7.77e-05

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 42.22  E-value: 7.77e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 3834380     167 DVNECVVYsgtPFGCQSGSTCVNTVGSFRCDC 198
Cdd:pfam07645    1 DVDECATG---THNCPANTVCVNTIGSFECRC 29
EGF_CA smart00179
Calcium-binding EGF-like domain;
134-156 1.02e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 41.85  E-value: 1.02e-04
                            10        20
                    ....*....|....*....|...
gi 3834380      134 SSNPCLNGGTCVNLHDSFVCICP 156
Cdd:smart00179    7 SGNPCQNGGTCVNTVGSYRCECP 29
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
133-161 1.44e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model because there are many similar but distinct subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.21  E-value: 1.44e-04
                           10        20
                   ....*....|....*....|....*....
gi 3834380     133 CSSNPCLNGGTCVNLHDSFVCICPSQWKG 161
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF smart00181
Epidermal growth factor-like domain;
400-428 3.14e-03

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 37.88  E-value: 3.14e-03
                            10        20
                    ....*....|....*....|....*....
gi 3834380      400 SRHPCVNGQCIETVSSYFCKCDSGWSGQN 428
Cdd:smart00181    4 SGGPCSNGTCINTPGSYTCSCPPGYTGDK 32
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
302-340 5.15e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 5.15e-03
                         10        20        30
                 ....*....|....*....|....*....|....*....
gi 3834380   302 DINECEINNGgCSQAPLvpCLNTPGSFSCgNCPAGFSGD 340
Cdd:cd00054    1 DIDECASGNP-CQNGGT--CVNTVGSYRC-SCPPGYTGR 35
EGF_CA pfam07645
Calcium-binding EGF domain;
260-292 7.95e-03

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 36.45  E-value: 7.95e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 3834380     260 DKDECSLQPSPCSEHAQCFNTQGSFYCgACPKG 292
Cdd:pfam07645    1 DVDECATGTHNCPANTVCVNTIGSFEC-RCPDG 32
 
CUB pfam00431
CUB domain;
1738-1847 2.62e-33

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 125.87  E-value: 2.62e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1738 CGGSFYTLDGIFNSPDYPADYHPNAECVWNIASSPGNRLQLSFLSFNLENSLNCNKDFVEIREG-NATGHLIGRYCGNSL 1816
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGpSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    1817 PGNYSSaEGHSLWVRFVSDGSGTGMGFQARF 1847
Cdd:pfam00431   81 PEDIVS-SSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3395-3506 4.98e-33

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 125.22  E-value: 4.98e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3395 CNREYN-QTFGNLKSPGWPQNYDNNLDCTIILRAPQNHSISLFFYWFQLEDSRQCMNDFLEVRNGGSSTSPLLDKYCSNL 3473
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  3474 LPNPVFSQSNELYLHFHSDHSVTNNGYEIIWTS 3506
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2336-2447 3.50e-32

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 122.52  E-value: 3.50e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2336 CGGTVSG-DSGVIESIGYPTlPYANNVFCQWFIRGLPGHYLTLSFEDFNLQSSPGCTKDFVEIWE-NHTSGRVLGRYCGN 2413
Cdd:cd00041    1 CGGTLTAsTSGTISSPNYPN-NYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDgPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  2414 STPSSVDTSSNVASVKFVTDGSVTASGFRLQFKS 2447
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2698-2798 3.76e-32

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 122.11  E-value: 3.76e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2698 GVISSPNYPNLYSAWTHCSWLLKAPEGHTITLTFSDFLLEAHPTCTSDSVTVRNGDSPGSPVIGRYCGQSVPRP-IQSGS 2776
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPvISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     2777 NQLIVTFNTNNQGQTRGFYATW 2798
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1747-1847 4.88e-32

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 121.73  E-value: 4.88e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1747 GIFNSPDYPADYHPNAECVWNIASSPGNRLQLSFLSFNLENSLNCNKDFVEIREGNATGH-LIGRYCGNSLPGNYSSAEG 1825
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSpLLGRFCGSEAPPPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1826 HSLWVRFVSDGSGTGMGFQARF 1847
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2689-2797 6.13e-32

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 122.02  E-value: 6.13e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2689 CGGIRTGDNGVISSPNYPNLYSAWTHCSWLLKAPEGHTITLTFSDFLLEAHPTCTSDSVTVRNGDSPGSPVIGRYCGQSV 2768
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGI 80
                           90       100
                   ....*....|....*....|....*....
gi 3834380    2769 PRPIQSGSNQLIVTFNTNNQGQTRGFYAT 2797
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKAT 109
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1052-1158 7.40e-32

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 121.75  E-value: 7.40e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1052 YTDNFGMLSSPNFPNNYPSNWECIYRITVGLNQQIALHFTDFTLEDYfgSQCV-DFVEIRDGGYETSPLVGIYCGSVLPP 1130
Cdd:cd00041    6 TASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESS--PNCSyDYLEIYDGPSTSSPLLGRFCGSTLPP 83
                         90       100
                 ....*....|....*....|....*...
gi 3834380  1131 TIISHSNKLWLKFKSDAALTAKGFSAYW 1158
Cdd:cd00041   84 PIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB pfam00431
CUB domain;
1620-1731 1.36e-31

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 120.86  E-value: 1.36e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1620 CGGRImTDSSDTIFSPLYPHNYLHNQNCSWIIEAQPPFnHITLSFTHFQLQNSTDCTRDFVEILDGNDYDAPVQGRYCGF 1699
Cdd:pfam00431    1 CGGVL-TDSSGSISSPNYPNPYPPNKDCVWLIRAPPGF-RVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 3834380    1700 SLPHPIISFGNALTVRFVTDSTRSFEGFRAIY 1731
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1978-2088 1.43e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.98  E-value: 1.43e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1978 CGGFmVTGDTPVHIFSPGWPREYANGADCIWIIYAPD-STVELNILSLDIEPQQSCNYDKLIVKDGDSDLSPELAVLCGV 2056
Cdd:cd00041    1 CGGT-LTASTSGTISSPNYPNNYPNNLNCVWTIEAPPgYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  2057 SPPGPIRSTGEYMYIRFTSDTSVAGTGFNASF 2088
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3037-3148 1.75e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.59  E-value: 1.75e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3037 CGGIYNES-SGILRSPSYsYSNYPNNLYCVYSLHVRSSRVIIIRFNDFDVAPSNLCAHDFLEVFDGPSIGNRSLGKFCGS 3115
Cdd:cd00041    1 CGGTLTAStSGTISSPNY-PNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  3116 TRPQTVKSTNSSLTLLFKTDSSQTARGWKIFFR 3148
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYS 112
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2452-2564 2.77e-31

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 120.21  E-value: 2.77e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2452 CGGDLHGPT-GTFTSPNYPNPNPHARICEWTITVQEGRRIVLTFTNLRLSTQPSCNSEHLIVFNGIRSNSPLLQKLCSRv 2530
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGS- 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  2531 NVTNEFKSSGNTMKVVFFTDGSRPYGGFTASYTS 2564
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB pfam00431
CUB domain;
2217-2331 4.19e-31

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 119.32  E-value: 4.19e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2217 CGGTVyihdADSDGYLTSPNYPANYPQHAECIWILEAPPGRSIQLQFEDqFNIEDTPNCSVSYLELRDGANSNARLVSKL 2296
Cdd:pfam00431    1 CGGVL----TDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQD-FELEDHDECGYDYVEIRDGPSASSPLLGRF 75
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 3834380    2297 CGHTLPHSWVSSRERIYLKFHTDGGSSYMGFKAKY 2331
Cdd:pfam00431   76 CGSGIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1400-1503 4.56e-31

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 119.03  E-value: 4.56e-31
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1400 GSFSSPGYPNSYPHNKECIWNIRVAPGSSIQLTIHDFDVEYHTSCNYDSLEIYAGLDFNSPRIAQLCSQSPSANPmqVSS 1479
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPV--ISS 78
                            90       100
                    ....*....|....*....|....
gi 3834380     1480 TGNELAIRFKTDSTLNGRGFNASW 1503
Cdd:smart00042   79 SSNSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
932-1039 5.55e-30

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 116.24  E-value: 5.55e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     932 CGEVLTASTGIIESPGHPNVYPRGVNCTWHVVVQRGQLIRLEFSSFYLEFHYNCTNDYLEIYD--TAAQTFLGRYCGKSI 1009
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDgpSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 3834380    1010 PPSLTSNSNSIKLIFVSDSALAHEGFSINY 1039
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2092-2212 5.89e-30

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 116.36  E-value: 5.89e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2092 CGGYLHADR-GVITSPKYPDTYLPNLNCSWHVLVQTGLTIAVHFEQpFQIQNrDSFCSQgDYLVLRNGPDNHSPPLGpsg 2170
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFED-FDLES-SPNCSY-DYLEIYDGPSTSSPLLG--- 74
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 3834380  2171 rngRFCGMYAPSTLFTSGNEMFVQFISDSSNGGQGFKIRYEA 2212
Cdd:cd00041   75 ---RFCGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1057-1158 1.40e-29

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 114.79  E-value: 1.40e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1057 GMLSSPNFPNNYPSNWECIYRITVGLNQQIALHFTDFTLEDyfGSQCV-DFVEIRDGGYETSPLVGIYCGSVLPPTII-S 1134
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLES--SDNCEyDYVEIYDGPSASSPLLGRFCGSEAPPPVIsS 78
                            90       100
                    ....*....|....*....|....
gi 3834380     1135 HSNKLWLKFKSDAALTAKGFSAYW 1158
Cdd:smart00042   79 SSNSLTLTFVSDSSVQKRGFSARY 102
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3157-3273 2.02e-29

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 114.82  E-value: 2.02e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3157 CGGYLT-EDNQSFVSPDSDSNgrYDKGLSCIWYIVAPENKLVKLTFNVFTLEgpsSAGSCVYDYVQIADGASINSYLGGK 3235
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNN--YPNNLNCVWTIEAPPGYRIRLTFEDFDLE---SSPNCSYDYLEIYDGPSTSSPLLGR 75
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 3834380  3236 FCGSRMPAPFISSGNFLTFQFVSDVTVEMRGFNATYTF 3273
Cdd:cd00041   76 FCGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1510-1617 2.68e-29

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 114.43  E-value: 2.68e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1510 CGGIIQLSR-GEIHSPNYPNNYRANTECSWIIQVERHHRVLLNITDFDLEAPDSC----LRLMDGSSSTNARVASVCGRQ 1584
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCsydyLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380  1585 QPPnSIIASGNSLFVRFRSGSSSQNRGFRAEFR 1617
Cdd:cd00041   81 LPP-PIISSGNSLTVRFRSDSSVTGRGFKATYS 112
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1631-1731 3.64e-29

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 113.64  E-value: 3.64e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1631 TIFSPLYPHNYLHNQNCSWIIEAqPPFNHITLSFTHFQLQNSTDCTRDFVEILDGNDYDAPVQGRYCGFSLPHPII-SFG 1709
Cdd:smart00042    2 TITSPNYPQSYPNNLDCVWTIRA-PPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVIsSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1710 NALTVRFVTDSTRSFEGFRAIY 1731
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
599-699 5.63e-29

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 113.25  E-value: 5.63e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      599 GSITSPGYPGNYPPGRDCVWQVLVNPNSLITFTFGTLSLESHNDCSKDYLEIRDGPFHQDPVLGKFCTSLSTPPLKTT-G 677
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSsS 80
                            90       100
                    ....*....|....*....|..
gi 3834380      678 PAARIHFHSDSETSDKGFHITY 699
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2336-2445 5.65e-29

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 113.54  E-value: 5.65e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2336 CGGTVSGDSGVIESIGYPTlPYANNVFCQWFIRGLPGHYLTLSFEDFNLQSSPGCTKDFVEIWENHT-SGRVLGRYCGNS 2414
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPN-PYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSaSSPLLGRFCGSG 79
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    2415 TPSSVDTSSNVASVKFVTDGSVTASGFRLQF 2445
Cdd:pfam00431   80 IPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
1510-1616 1.49e-28

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 112.39  E-value: 1.49e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1510 CGGIIQLSRGEIHSPNYPNNYRANTECSWIIQVERHHRVLLNITDFDLEAPDSC----LRLMDGSSSTNARVASVCGRQQ 1585
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECgydyVEIRDGPSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    1586 PPNsIIASGNSLFVRFRSGSSSQNRGFRAEF 1616
Cdd:pfam00431   81 PED-IVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3511-3623 1.78e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 112.12  E-value: 1.78e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3511 CGGTLLG-DEGIFTNPGFPDSYPNNTHCEWTIVAPSGRPVSVGFPFLSIDSSGGCDQNYLIVFNGPDANSPPFGPLCGiN 3589
Cdd:cd00041    1 CGGTLTAsTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCG-S 79
                         90       100       110
                 ....*....|....*....|....*....|....
gi 3834380  3590 TGIAPFYASSNRVFIRFHAEYTTRLSGFEIMWSS 3623
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB pfam00431
CUB domain;
3037-3147 1.99e-28

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 112.00  E-value: 1.99e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    3037 CGGIYNESSGILRSPSYSySNYPNNLYCVYSLHVRSSRVIIIRFNDFDVAPSNLCAHDFLEVFDGPSIGNRSLGKFCGST 3116
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYP-NPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSG 79
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    3117 RPQTVKSTNSSLTLLFKTDSSQTARGWKIFF 3147
Cdd:pfam00431   80 IPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1278-1388 2.52e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 111.74  E-value: 2.52e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1278 CDNVVIVNkTSGILESINYPNPYDKNQRCNWTIQATTGNTVNYTFLGFDVESYMNCSTDYVELYDGP----QWMGRYCGN 1353
Cdd:cd00041    1 CGGTLTAS-TSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPstssPLLGRFCGS 79
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 3834380  1354 NMPPPGATTGSQLHVLFHTDGINSGeKGFKMQWFT 1388
Cdd:cd00041   80 TLPPPIISSGNSLTVRFRSDSSVTG-RGFKATYSA 113
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
1852-1962 2.65e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 111.74  E-value: 2.65e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  1852 GNNNIVGTHGKIASPFWPGKYPYNSNYKWVVNVDAYHIIHGRILEMDIEPTTNCFYDSLKIYDGFDTHSRLIGTYCGTQT 1931
Cdd:cd00041    2 GGTLTASTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGSTL 81
                         90       100       110
                 ....*....|....*....|....*....|..
gi 3834380  1932 -ESFSSSRNSLTFQFSSDSSVSGRGFLLEWFA 1962
Cdd:cd00041   82 pPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2230-2331 4.64e-28

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 110.56  E-value: 4.64e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2230 GYLTSPNYPANYPQHAECIWILEAPPGRSIQLQFEDqFNIEDTPNCSVSYLELRDGANSNARLVSKLCGHTLPHSWVSSR 2309
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTD-FDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSS 79
                            90       100
                    ....*....|....*....|...
gi 3834380     2310 -ERIYLKFHTDGGSSYMGFKAKY 2331
Cdd:smart00042   80 sNSLTLTFVSDSSVQKRGFSARY 102
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
474-585 5.22e-28

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 110.96  E-value: 5.22e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   474 CGGILSGT-QGTFayHSPN--DTYIHNVNCFWIVRTDEEKVLHVTFTFFDLESASNCPREYLQIHDGDSSADFPLGRYCG 550
Cdd:cd00041    1 CGGTLTAStSGTI--SSPNypNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCG 78
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 3834380   551 SRPPQGIHSSANALYFHLYSEYIRSGRGFTARWEA 585
Cdd:cd00041   79 STLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1174-1274 6.27e-28

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 110.17  E-value: 6.27e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1174 GVLTSPNYPMPYYHSSECYWRLEASHGSPFELEFQDFHLEHHPSCSLDYLAVFDGPTTNSRLIDKLCGDTTPAP-IRSNK 1252
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPvISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1253 DVVLLKLRTDAGQQGRGFEINF 1274
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
941-1039 9.99e-28

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 109.40  E-value: 9.99e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      941 GIIESPGHPNVYPRGVNCTWHVVVQRGQLIRLEFSSFYLEFHYNCTNDYLEIYD--TAAQTFLGRYCGKSIPPS-LTSNS 1017
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDgpSASSPLLGRFCGSEAPPPvISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1018 NSIKLIFVSDSALAHEGFSINY 1039
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2345-2445 1.79e-27

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 109.02  E-value: 1.79e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2345 GVIESIGYPTlPYANNVFCQWFIRGLPGHYLTLSFEDFNLQSSPGCTKDFVEIWENH-TSGRVLGRYCGNSTPSSV-DTS 2422
Cdd:smart00042    1 GTITSPNYPQ-SYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPsASSPLLGRFCGSEAPPPViSSS 79
                            90       100
                    ....*....|....*....|...
gi 3834380     2423 SNVASVKFVTDGSVTASGFRLQF 2445
Cdd:smart00042   80 SNSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
3157-3271 5.47e-27

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 107.77  E-value: 5.47e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    3157 CGGYLTEDNQSFVSPDSDSNgrYDKGLSCIWYIVAPENKLVKLTFNVFTLEGpssAGSCVYDYVQIADGASINSYLGGKF 3236
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNP--YPPNKDCVWLIRAPPGFRVKLTFQDFELED---HDECGYDYVEIRDGPSASSPLLGRF 75
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 3834380    3237 CGSRMPAPFISSGNFLTFQFVSDVTVEMRGFNATY 3271
Cdd:pfam00431   76 CGSGIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
1978-2088 6.14e-27

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 107.77  E-value: 6.14e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1978 CGGFmVTgDTPVHIFSPGWPREYANGADCIWIIYAPD-STVELNILSLDIEPQQSCNYDKLIVKDGDSDLSPELAVLCGV 2056
Cdd:pfam00431    1 CGGV-LT-DSSGSISSPNYPNPYPPNKDCVWLIRAPPgFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 3834380    2057 SPPGPIRSTGEYMYIRFTSDTSVAGTGFNASF 2088
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3046-3147 7.25e-27

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 107.09  E-value: 7.25e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3046 GILRSPSYSySNYPNNLYCVYSLHVRSSRVIIIRFNDFDVAPSNLCAHDFLEVFDGPSIGNRSLGKFCGSTRPQTVKSTN 3125
Cdd:smart00042    1 GTITSPNYP-QSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSS 79
                            90       100
                    ....*....|....*....|...
gi 3834380     3126 S-SLTLLFKTDSSQTARGWKIFF 3147
Cdd:smart00042   80 SnSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
3395-3502 2.01e-26

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 106.23  E-value: 2.01e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    3395 CNREYNQTFGNLKSPGWPQNYDNNLDCTIILRAPQNHSISLFFYWFQLEDSRQCMNDFLEVRNGGSSTSPLLDKYCSNLL 3474
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGI 80
                           90       100
                   ....*....|....*....|....*...
gi 3834380    3475 PNPVFSQSNELYLHFHSDHSVTNNGYEI 3502
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKA 108
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1519-1616 2.54e-26

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 105.55  E-value: 2.54e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1519 GEIHSPNYPNNYRANTECSWIIQVERHHRVLLNITDFDLEAPDSC----LRLMDGSSSTNARVASVCGRQQPPNSIIASG 1594
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCeydyVEIYDGPSASSPLLGRFCGSEAPPPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1595 NSLFVRFRSGSSSQNRGFRAEF 1616
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2452-2562 3.02e-26

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 105.45  E-value: 3.02e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2452 CGGDLHGPTGTFTSPNYPNPNPHARICEWTITVQEGRRIVLTFTNLRLSTQPSCNSEHLIVFNGIRSNSPLLQKLCSRVN 2531
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380    2532 VTNeFKSSGNTMKVVFFTDGSRPYGGFTASY 2562
Cdd:pfam00431   81 PED-IVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
817-927 7.03e-26

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 104.80  E-value: 7.03e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   817 CGGMLRG--EGFFRSPFYPNAYPGRRTCRWTISQPQRQVVLLNFTDFQIGSSASCDTDYIEIGPSSVLGSPGNEKFCSSN 894
Cdd:cd00041    1 CGGTLTAstSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380   895 IPSFITSVYNILYVTFVKSSSMENRGFTAKFSS 927
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3404-3504 9.51e-26

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 104.01  E-value: 9.51e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3404 GNLKSPGWPQNYDNNLDCTIILRAPQNHSISLFFYWFQLEDSRQCMNDFLEVRNGGSSTSPLLDKYCSNLLPNPVF-SQS 3482
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVIsSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     3483 NELYLHFHSDHSVTNNGYEIIW 3504
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1990-2088 1.26e-25

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 103.62  E-value: 1.26e-25
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1990 HIFSPGWPREYANGADCIWIIYAPD-STVELNILSLDIEPQQSCNYDKLIVKDGDSDLSPELAVLCG-VSPPGPIRSTGE 2067
Cdd:smart00042    2 TITSPNYPQSYPNNLDCVWTIRAPPgYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGsEAPPPVISSSSN 81
                            90       100
                    ....*....|....*....|.
gi 3834380     2068 YMYIRFTSDTSVAGTGFNASF 2088
Cdd:smart00042   82 SLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
3511-3619 2.69e-25

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 102.76  E-value: 2.69e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    3511 CGGTLLGDEGIFTNPGFPDSYPNNTHCEWTIVAPSGRPVSVGFPFLSIDSSGGCDQNYLIVFNGPDANSPPFGPLCGinT 3590
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCG--S 78
                           90       100       110
                   ....*....|....*....|....*....|
gi 3834380    3591 GI-APFYASSNRVFIRFHAEYTTRLSGFEI 3619
Cdd:pfam00431   79 GIpEDIVSSSNQMTIKFVSDASVQKRGFKA 108
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2461-2562 2.72e-25

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 102.47  E-value: 2.72e-25
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2461 GTFTSPNYPNPNPHARICEWTITVQEGRRIVLTFTNLRLSTQPSCNSEHLIVFNGIRSNSPLLQKLCSRVNVTNEFKSSG 2540
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     2541 NTMKVVFFTDGSRPYGGFTASY 2562
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2092-2210 1.14e-24

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 101.22  E-value: 1.14e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2092 CGGYLHADRGVITSPKYPDTYLPNLNCSWHVLVQTGLTIAVHFeQPFQIQnRDSFCsQGDYLVLRNGPDNHSPPLgpsgr 2171
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTF-QDFELE-DHDEC-GYDYVEIRDGPSASSPLL----- 72
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 3834380    2172 nGRFCGMYAPSTLFTSGNEMFVQFISDSSNGGQGFKIRY 2210
Cdd:pfam00431   73 -GRFCGSGIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
1286-1383 1.20e-24

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 101.22  E-value: 1.20e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1286 KTSGILESINYPNPYDKNQRCNWTIQATTGNTVNYTFLGFDVESYMNCSTDYVELYDGP----QWMGRYCGNNMPPPGAT 1361
Cdd:pfam00431    7 DSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPsassPLLGRFCGSGIPEDIVS 86
                           90       100
                   ....*....|....*....|..
gi 3834380    1362 TGSQLHVLFHTDGINSGeKGFK 1383
Cdd:pfam00431   87 SSNQMTIKFVSDASVQK-RGFK 107
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2920-3034 1.31e-24

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 100.95  E-value: 1.31e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2920 CGRTFN-TSPGDIISPNFPKQYDNNMNCTYLIDADPQSLVILTFVSFHLEDrsaiTGTCDHDGLHIIKGRNLSSTPLVTI 2998
Cdd:cd00041    1 CGGTLTaSTSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLES----SPNCSYDYLEIYDGPSTSSPLLGRF 76
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 3834380  2999 CGSETLRPLTVDGP-VLLNFYSDAYTTDFGFKISYRA 3034
Cdd:cd00041   77 CGSTLPPPIISSGNsLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1289-1386 2.85e-24

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 99.77  E-value: 2.85e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1289 GILESINYPNPYDKNQRCNWTIQATTGNTVNYTFLGFDVESYMNCSTDYVELYDGPQW----MGRYCGNNMPPPGATT-G 1363
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSAssplLGRFCGSEAPPPVISSsS 80
                            90       100
                    ....*....|....*....|...
gi 3834380     1364 SQLHVLFHTDGINSGeKGFKMQW 1386
Cdd:smart00042   81 NSLTLTFVSDSSVQK-RGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2101-2210 5.74e-24

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 99.00  E-value: 5.74e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2101 GVITSPKYPDTYLPNLNCSWHVLVQTGLTIAVHFEqPFQIQNRDSfCSQgDYLVLRNGPDNHSPPLgpsgrnGRFCGMYA 2180
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFT-DFDLESSDN-CEY-DYVEIYDGPSASSPLL------GRFCGSEA 71
                            90       100       110
                    ....*....|....*....|....*....|.
gi 3834380     2181 PSTLFTS-GNEMFVQFISDSSNGGQGFKIRY 2210
Cdd:smart00042   72 PPPVISSsSNSLTLTFVSDSSVQKRGFSARY 102
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2570-2686 6.18e-24

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 99.02  E-value: 6.18e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2570 CGGFLPSVSGGNFSSPGYNGirDYARNLDCEWTLSNPnrENSSISIYFLELSIESHQDCTFDVLEFRVG-DADGPLIEKF 2648
Cdd:cd00041    1 CGGTLTASTSGTISSPNYPN--NYPNNLNCVWTIEAP--PGYRIRLTFEDFDLESSPNCSYDYLEIYDGpSTSSPLLGRF 76
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 3834380  2649 CSLSAPtAPLVIPYPQVWIHFVSNERVEYTGFYIEYSF 2686
Cdd:cd00041   77 CGSTLP-PPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3520-3619 5.86e-23

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 95.92  E-value: 5.86e-23
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3520 GIFTNPGFPDSYPNNTHCEWTIVAPSGRPVSVGFPFLSIDSSGGCDQNYLIVFNGPDANSPPFGPLCGINTGIAPFYASS 3599
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSSS 80
                            90       100
                    ....*....|....*....|
gi 3834380     3600 NRVFIRFHAEYTTRLSGFEI 3619
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSA 100
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
1861-1960 1.55e-22

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 94.76  E-value: 1.55e-22
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     1861 GKIASPFWPGKYPYNSNYKWVVNVDAYHIIHGRILEMDIEPTTNCFYDSLKIYDGFDTHSRLIGTYCGTQT--ESFSSSR 1938
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAppPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380     1939 NSLTFQFSSDSSVSGRGFLLEW 1960
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3179-3271 2.09e-22

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 94.38  E-value: 2.09e-22
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3179 YDKGLSCIWYIVAPENKLVKLTFNVFTLEgpsSAGSCVYDYVQIADGASINSYLGGKFCGSRMPAPFISS-GNFLTFQFV 3257
Cdd:smart00042   12 YPNNLDCVWTIRAPPGYRIELQFTDFDLE---SSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSsSNSLTLTFV 88
                            90
                    ....*....|....
gi 3834380     3258 SDVTVEMRGFNATY 3271
Cdd:smart00042   89 SDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
1861-1956 1.82e-21

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 91.97  E-value: 1.82e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    1861 GKIASPFWPGKYPYNSNYKWVVNVDAYHIIHGRILEMDIEPTTNCFYDSLKIYDGFDTHSRLIGTYCGTQT-ESFSSSRN 1939
Cdd:pfam00431   10 GSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGSGIpEDIVSSSN 89
                           90
                   ....*....|....*..
gi 3834380    1940 SLTFQFSSDSSVSGRGF 1956
Cdd:pfam00431   90 QMTIKFVSDASVQKRGF 106
cubilin_NTD cd22201
N-terminal domain of cubilin and similar proteins; Cubilin (CUBN, also called 460 kDa receptor, ...
38-132 7.80e-21

N-terminal domain of cubilin and similar proteins; Cubilin (CUBN, also called 460 kDa receptor, intestinal intrinsic factor receptor, intrinsic factor-cobalamin receptor, or intrinsic factor-vitamin B12 receptor) is an endocytic receptor that contributes to lipoprotein, vitamin, and iron metabolism by facilitating their uptake. Together with the 45-kDa transmembrane protein amnionless (AMN), it mediates endocytosis of the complex formed between cobalamin (vitamin B12) and cobalamin-binding intrinsic factor (CBLIF). This model corresponds to the N-terminal domain of cubilin, which is responsible for the interaction with AMN. The cubilin interface with AMN is formed by the N-terminal strands of three cubilin chains.


Pssm-ID: 412063  Cd Length: 129  Bit Score: 90.85  E-value: 7.80e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    38 QPRMTTEEGNLVFLTSSTQNIEFRTGSLGKIKLNDEDLGECLHQIQRNKDDIIDLRKN-----------TTGLPQNILSQ 106
Cdd:cd22201   13 QPRIITEDGHLIFEAAYDKNISFRTSGNGRININDEDLLELLQQAKNNKSDIENLKQSelptfeqqlseLVGGPQGLLRR 92
                         90       100
                 ....*....|....*....|....*.
gi 3834380   107 VHQLNSKLVDLERDFQNLQQNVERKV 132
Cdd:cd22201   93 LALLENRTSGLSSTLNNNIRRLRRRL 118
CUB pfam00431
CUB domain;
817-925 2.58e-20

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 88.51  E-value: 2.58e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     817 CGGMLRGE-GFFRSPFYPNAYPGRRTCRWTISQPQRQVVLLNFTDFQIGSSASCDTDYIEI--GPSSVLGSPGneKFCSS 893
Cdd:pfam00431    1 CGGVLTDSsGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIrdGPSASSPLLG--RFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 3834380     894 NIPSFITSVYNILYVTFVKSSSMENRGFTAKF 925
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
708-815 3.03e-20

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 88.62  E-value: 3.03e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380   708 CGGNYTDTD-GELLLPPLSGPFSHSRQCVYLITQAQGEQIVINFTHVELESQMGCSHTYIEVGDHDS----LLRKICGNE 782
Cdd:cd00041    1 CGGTLTASTsGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPStsspLLGRFCGST 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 3834380   783 TLFPIRSVSNKVWIRLRIDALVQKASFRADYQV 815
Cdd:cd00041   81 LPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB pfam00431
CUB domain;
474-582 4.99e-20

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 87.74  E-value: 4.99e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     474 CGGILSGTQGTFayHSPN--DTYIHNVNCFWIVRTDEEKVLHVTFTFFDLESASNCPREYLQIHDGDSSADFPLGRYCGS 551
Cdd:pfam00431    1 CGGVLTDSSGSI--SSPNypNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSPLLGRFCGS 78
                           90       100       110
                   ....*....|....*....|....*....|.
gi 3834380     552 RPPQGIHSSANALYFHLYSEYIRSGRGFTAR 582
Cdd:pfam00431   79 GIPEDIVSSSNQMTIKFVSDASVQKRGFKAT 109
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
825-925 5.44e-20

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 87.45  E-value: 5.44e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      825 GFFRSPFYPNAYPGRRTCRWTISQPQRQVVLLNFTDFQIGSSASCDTDYIEIGPSSVLGSPGNEKFCSSNIP-SFITSVY 903
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPpPVISSSS 80
                            90       100
                    ....*....|....*....|..
gi 3834380      904 NILYVTFVKSSSMENRGFTAKF 925
Cdd:smart00042   81 NSLTLTFVSDSSVQKRGFSARY 102
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2805-2918 7.34e-20

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 87.47  E-value: 7.34e-20
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  2805 CGGTFHSA-NGTIKSPHWPQTFPENSRCSWTVITHESKHWEISFDSNFRIPSSDsqCQNSFVKVWEGRLmINKTLLATSC 2883
Cdd:cd00041    1 CGGTLTAStSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPN--CSYDYLEIYDGPS-TSSPLLGRFC 77
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 3834380  2884 GDVAPSPIVTSGNIFTAVFQSEEM-AAQGFSASFIS 2918
Cdd:cd00041   78 GSTLPPPIISSGNSLTVRFRSDSSvTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2580-2684 6.12e-19

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 84.36  E-value: 6.12e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2580 GNFSSPGYngIRDYARNLDCEWTLSNPNreNSSISIYFLELSIESHQDCTFDVLEFRVGD-ADGPLIEKFCSLSAPTAPL 2658
Cdd:smart00042    1 GTITSPNY--PQSYPNNLDCVWTIRAPP--GYRIELQFTDFDLESSDNCEYDYVEIYDGPsASSPLLGRFCGSEAPPPVI 76
                            90       100
                    ....*....|....*....|....*.
gi 3834380     2659 VIPYPQVWIHFVSNERVEYTGFYIEY 2684
Cdd:smart00042   77 SSSSNSLTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
487-583 1.45e-18

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 83.59  E-value: 1.45e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      487 YHSPN--DTYIHNVNCFWIVRTDEEKVLHVTFTFFDLESASNCPREYLQIHDGDSSADFPLGRYCGSRPPQGIHSSA-NA 563
Cdd:smart00042    3 ITSPNypQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSSsNS 82
                            90       100
                    ....*....|....*....|
gi 3834380      564 LYFHLYSEYIRSGRGFTARW 583
Cdd:smart00042   83 LTLTFVSDSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2929-3032 2.79e-18

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 82.44  E-value: 2.79e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2929 GDIISPNFPKQYDNNMNCTYLIDADPQSLVILTFVSFHLEDRSaitgTCDHDGLHIIKGRNLSSTPLVTICGSETLRPL- 3007
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSD----NCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVi 76
                            90       100
                    ....*....|....*....|....*.
gi 3834380     3008 -TVDGPVLLNFYSDAYTTDFGFKISY 3032
Cdd:smart00042   77 sSSSNSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
2570-2684 4.44e-18

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 82.34  E-value: 4.44e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2570 CGGFLPSVSGgNFSSPGYNgiRDYARNLDCEWTLSNPnrENSSISIYFLELSIESHQDCTFDVLEFRVGD-ADGPLIEKF 2648
Cdd:pfam00431    1 CGGVLTDSSG-SISSPNYP--NPYPPNKDCVWLIRAP--PGFRVKLTFQDFELEDHDECGYDYVEIRDGPsASSPLLGRF 75
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 3834380    2649 CSLSAPtAPLVIPYPQVWIHFVSNERVEYTGFYIEY 2684
Cdd:pfam00431   76 CGSGIP-EDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
2920-3032 3.52e-17

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 79.65  E-value: 3.52e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2920 CGRTFNTSPGDIISPNFPKQYDNNMNCTYLIDADPQSLVILTFVSFHLEDrsaiTGTCDHDGLHIIKGRNLSSTPLVTIC 2999
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELED----HDECGYDYVEIRDGPSASSPLLGRFC 76
                           90       100       110
                   ....*....|....*....|....*....|....
gi 3834380    3000 GSETLRPLTVDGP-VLLNFYSDAYTTDFGFKISY 3032
Cdd:pfam00431   77 GSGIPEDIVSSSNqMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
708-813 3.84e-17

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 79.65  E-value: 3.84e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     708 CGGNYTDTDGELLLPPLSGPFSHSRQCVYLITQAQGEQIVINFTHVELESQMGCSHTYIEVGDHDS----LLRKICGNET 783
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSasspLLGRFCGSGI 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 3834380     784 LFPIRSVSNKVWIRLRIDALVQKASFRADY 813
Cdd:pfam00431   81 PEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB pfam00431
CUB domain;
2805-2916 2.27e-16

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 77.34  E-value: 2.27e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380    2805 CGGTFHSANGTIKSPHWPQTFPENSRCSWTVITHESKHWEISFDSnFRIPSSDSqCQNSFVKVWEGRlMINKTLLATSCG 2884
Cdd:pfam00431    1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQD-FELEDHDE-CGYDYVEIRDGP-SASSPLLGRFCG 77
                           90       100       110
                   ....*....|....*....|....*....|...
gi 3834380    2885 DVAPSPIVTSGNIFTAVFQS-EEMAAQGFSASF 2916
Cdd:pfam00431   78 SGIPEDIVSSSNQMTIKFVSdASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
3278-3392 3.18e-16

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 77.07  E-value: 3.18e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380  3278 CGGTYNATsTPQNASSPHLSNIGRPYSTCTWVIAAPPQQQVQITVWDLQL-PSQDCSQSYLELQDSVQTGGNRVTQFCGa 3356
Cdd:cd00041    1 CGGTLTAS-TSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLeSSPNCSYDYLEIYDGPSTSSPLLGRFCG- 78
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 3834380  3357 nYTTLPVFYSSMSTAVVVFKSGVLNRNSQVQFSYQI 3392
Cdd:cd00041   79 -STLPPPIISSGNSLTVRFRSDSSVTGRGFKATYSA 113
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2814-2916 3.23e-14

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 70.88  E-value: 3.23e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     2814 GTIKSPHWPQTFPENSRCSWTVITHESKHWEISFDSnFRIpSSDSQCQNSFVKVWEGRLMINKtLLATSCGDVAPSPIVT 2893
Cdd:smart00042    1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTD-FDL-ESSDNCEYDYVEIYDGPSASSP-LLGRFCGSEAPPPVIS 77
                            90       100
                    ....*....|....*....|....*
gi 3834380     2894 S-GNIFTAVFQS-EEMAAQGFSASF 2916
Cdd:smart00042   78 SsSNSLTLTFVSdSSVQKRGFSARY 102
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
726-813 2.08e-12

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 65.87  E-value: 2.08e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380      726 GPFSHSRQCVYLITQAQGEQIVINFTHVELESQMGCSHTYIEVGD----HDSLLRKICGNETLFP-IRSVSNKVWIRLRI 800
Cdd:smart00042   10 QSYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDgpsaSSPLLGRFCGSEAPPPvISSSSNSLTLTFVS 89
                            90
                    ....*....|...
gi 3834380      801 DALVQKASFRADY 813
Cdd:smart00042   90 DSSVQKRGFSARY 102
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
432-468 8.18e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 53.41  E-value: 8.18e-09
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 3834380   432 NINDCSS-NPCLNGGTCIDGINGFTCDCTSSWTGYYCQ 468
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
3292-3377 2.33e-08

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 54.32  E-value: 2.33e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3834380     3292 SSPHLSNIGRPYSTCTWVIAAPPQQQVQITVWDLQL-PSQDCSQSYLELQDSVQTGGNRVTQFCGaNYTTLPVFYSSMST 3370
Cdd:smart00042    4 TSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLeSSDNCEYDYVEIYDGPSASSPLLGRFCG-SEAPPPVISSSSNS 82

                    ....*..
gi 3834380     3371 AVVVFKS 3377
Cdd:smart00042   83 LTLTFVS 89
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
167-207 3.58e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.79  E-value: 3.58e-07
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|.
gi 3834380   167 DVNECVvySGTPfgCQSGSTCVNTVGSFRCDCTPDTYGPQC 207
Cdd:cd00054    1 DIDECA--SGNP--CQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
350-387 4.64e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 4.64e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 3834380     350 CSIHNGGCHPEATCSSSPvlGSFlpVCTCPPGYTGNGY 387
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG--GSF--TCTCNDGYTGDGV 34
EGF_CA smart00179
Calcium-binding EGF-like domain;
167-207 9.78e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 47.63  E-value: 9.78e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 3834380      167 DVNECVVYSGtpfgCQSGSTCVNTVGSFRCDCTPD-TYGPQC 207
Cdd:smart00179    1 DIDECASGNP----CQNGGTCVNTVGSYRCECPPGyTDGRNC 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
133-164 1.48e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 1.48e-06
                         10        20        30
                 ....*....|....*....|....*....|...
gi 3834380   133 CSS-NPCLNGGTCVNLHDSFVCICPSQWKGLFC 164
Cdd:cd00054    5 CASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA smart00179
Calcium-binding EGF-like domain;
432-468 1.60e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 47.24  E-value: 1.60e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 3834380      432 NINDCSS-NPCLNGGTCIDGINGFTCDCTSSWT-GYYCQ 468
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
436-464 2.41e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 46.22  E-value: 2.41e-06
                           10        20
                   ....*....|....*....|....*....
gi 3834380     436 CSSNPCLNGGTCIDGINGFTCDCTSSWTG 464
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
306-344 6.23e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.28  E-value: 6.23e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 3834380     306 CEINNGGCSQAPLvpCLNTPGSFSCgNCPAGFSGDGRVC 344
Cdd:pfam12947    1 CSDNNGGCHPNAT--CTNTGGSFTC-TCNDGYTGDGVTC 36
EGF_CA smart00179
Calcium-binding EGF-like domain;
260-301 6.56e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 45.32  E-value: 6.56e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 3834380      260 DKDECSlQPSPCSEHAQCFNTQGSFYCgACPKGWQgNGYECQ 301
Cdd:smart00179    1 DIDECA-SGNPCQNGGTCVNTVGSYRC-ECPPGYT-DGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
260-295 1.17e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.55  E-value: 1.17e-05
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 3834380   260 DKDECSlQPSPCSEHAQCFNTQGSFYCgACPKGWQG 295
Cdd:cd00054    1 DIDECA-SGNPCQNGGTCVNTVGSYRC-SCPPGYTG 34
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
400-430 4.92e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 4.92e-05
                         10        20        30
                 ....*....|....*....|....*....|..
gi 3834380   400 SRHPCVN-GQCIETVSSYFCKCDSGWSGQNCT 430
Cdd:cd00054    7 SGNPCQNgGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
302-345 5.36e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 42.62  E-value: 5.36e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 3834380      302 DINECEINNGgCSQAPLvpCLNTPGSFSCgNCPAGFSgDGRVCT 345
Cdd:smart00179    1 DIDECASGNP-CQNGGT--CVNTVGSYRC-ECPPGYT-DGRNCE 39
EGF_CA pfam07645
Calcium-binding EGF domain;
167-198 7.77e-05

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 42.22  E-value: 7.77e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 3834380     167 DVNECVVYsgtPFGCQSGSTCVNTVGSFRCDC 198
Cdd:pfam07645    1 DVDECATG---THNCPANTVCVNTIGSFECRC 29
EGF_CA smart00179
Calcium-binding EGF-like domain;
134-156 1.02e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 41.85  E-value: 1.02e-04
                            10        20
                    ....*....|....*....|...
gi 3834380      134 SSNPCLNGGTCVNLHDSFVCICP 156
Cdd:smart00179    7 SGNPCQNGGTCVNTVGSYRCECP 29
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
133-161 1.44e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.21  E-value: 1.44e-04
                           10        20
                   ....*....|....*....|....*....
gi 3834380     133 CSSNPCLNGGTCVNLHDSFVCICPSQWKG 161
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
437-468 1.76e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 41.31  E-value: 1.76e-04
                         10        20        30
                 ....*....|....*....|....*....|...
gi 3834380   437 SSNPCLNGGTCIDGINGFTCDCTSSWTG-YYCQ 468
Cdd:cd00053    4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGdRSCE 36
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
134-161 2.10e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 40.92  E-value: 2.10e-04
                         10        20
                 ....*....|....*....|....*...
gi 3834380   134 SSNPCLNGGTCVNLHDSFVCICPSQWKG 161
Cdd:cd00053    4 ASNPCSNGGTCVNTPGSYRCVCPPGYTG 31
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
263-301 7.39e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 39.38  E-value: 7.39e-04
                         10        20        30
                 ....*....|....*....|....*....|....*....
gi 3834380   263 ECSlQPSPCSEHAQCFNTQGSFYCgACPKGWQGNGYeCQ 301
Cdd:cd00053    1 ECA-ASNPCSNGGTCVNTPGSYRC-VCPPGYTGDRS-CE 36
FXa_inhibition pfam14670
Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is ...
306-344 1.23e-03

Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is found to be the target for a potent inhibitor of coagulation, TAK-442.


Pssm-ID: 464251 [Multi-domain]  Cd Length: 36  Bit Score: 38.76  E-value: 1.23e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 3834380     306 CEINNGGCSQAplvpCLNTPGSFSCGnCPAGF--SGDGRVC 344
Cdd:pfam14670    1 CSVNNGGCSHL----CLNTPGGYTCS-CPEGYelQDDGRTC 36
EGF smart00181
Epidermal growth factor-like domain;
400-428 3.14e-03

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 37.88  E-value: 3.14e-03
                            10        20
                    ....*....|....*....|....*....
gi 3834380      400 SRHPCVNGQCIETVSSYFCKCDSGWSGQN 428
Cdd:smart00181    4 SGGPCSNGTCINTPGSYTCSCPPGYTGDK 32
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
441-458 3.29e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteristic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neuregulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin and integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 37.31  E-value: 3.29e-03
                           10
                   ....*....|....*...
gi 3834380     441 CLNGGTCIDGINGFTCDC 458
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQC 18
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
302-340 5.15e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 5.15e-03
                         10        20        30
                 ....*....|....*....|....*....|....*....
gi 3834380   302 DINECEINNGgCSQAPLvpCLNTPGSFSCgNCPAGFSGD 340
Cdd:cd00054    1 DIDECASGNP-CQNGGT--CVNTVGSYRC-SCPPGYTGR 35
EGF_CA smart00179
Calcium-binding EGF-like domain;
400-430 5.72e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.23  E-value: 5.72e-03
                            10        20        30
                    ....*....|....*....|....*....|...
gi 3834380      400 SRHPCVNG-QCIETVSSYFCKCDSGWS-GQNCT 430
Cdd:smart00179    7 SGNPCQNGgTCVNTVGSYRCECPPGYTdGRNCE 39
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
138-159 7.80e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteristic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neuregulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin and integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 36.16  E-value: 7.80e-03
                           10        20
                   ....*....|....*....|..
gi 3834380     138 CLNGGTCVNLHDSFVCICPSQW 159
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
EGF_CA pfam07645
Calcium-binding EGF domain;
260-292 7.95e-03

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 36.45  E-value: 7.95e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 3834380     260 DKDECSLQPSPCSEHAQCFNTQGSFYCgACPKG 292
Cdd:pfam07645    1 DVDECATGTHNCPANTVCVNTIGSFEC-RCPDG 32
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
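
The per-hit summary lines in the listing above ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") can be read programmatically and compared against the E-value threshold of 0.01 given in the search parameters. The following is a minimal sketch in Python, assuming the plain-text layout shown on this page; the names `HIT_LINE`, `parse_hit_line` and `passes_threshold` are hypothetical helpers, not part of any NCBI tool, and treating the threshold as inclusive is an assumption.

```python
import re

# Regular expression for the per-hit summary line as it appears in this
# listing. The "[Multi-domain]" flag is optional.
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return a dict of the fields on one summary line, or None if no match."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (assumed inclusive)."""
    return hit["e_value"] <= threshold


# Two summary lines copied from the listing above.
strong = parse_hit_line(
    "Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 70.88  E-value: 3.23e-14"
)
weak = parse_hit_line(
    "Pssm-ID: 429571  Cd Length: 32  Bit Score: 36.45  E-value: 7.95e-03"
)
```

Both sample hits fall below the 0.01 threshold, which is consistent with their being reported on this page.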

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.