NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|117647249|ref|NP_032507|]
View 

laminin subunit alpha-2 precursor [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Laminin_I pfam06008
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from ...
1589-1848 1.40e-105

Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.


:

Pssm-ID: 310534 [Multi-domain]  Cd Length: 258  Bit Score: 338.62  E-value: 1.40e-105
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1589 TMNIN-LTGPLPAPYKILYGLENTTQELKHLLSPQRAPERLIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERT 1667
Cdd:pfam06008    1 LLSLNsLTGALPAPYKINYNLENLTKQLQEYLSPENAHKIQIEILEKELSSLAQETEELQKKATQTLAKAQQVNAESERT 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1668 NSRAESLEEFIKGLVQDAEAINEKAVQLNEtlgNQDKTAERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAAEGLL 1747
Cdd:pfam06008   81 LGHAKELAEAIKNLIDNIKEINEKVATLGE---NDFALPSSDLSRMLAEAQRMLGEIRSRDFGTQLQNAEAELKAAQDLL 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1748 KRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENT 1827
Cdd:pfam06008  158 SRIQTWFQSPQEENKALANALRDSLAEYEAKLSDLRELLREAAAKTRDANRLNLANQANLREFQRKKEEVSEQKNQLEET 237
                          250       260
                   ....*....|....*....|.
gi 117647249  1828 LKEGNDILDEANRLLGEINSV 1848
Cdd:pfam06008  238 LKTARDSLDAANLLLQEIDDA 258
LamNT smart00136
Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related ...
29-281 5.71e-93

Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins.


:

Pssm-ID: 214532  Cd Length: 238  Bit Score: 301.59  E-value: 5.71e-93
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249     29 AHQQRGLFPAVLNLASNALITTNATCGEKGPEMYCKLVEHVpgqpVRNPQCRICNQNssNPYQRHPITNAIDGKN----T 104
Cdd:smart00136    1 AGRPRSCYPPFVNLAFGREVTATSTCGEPGPERYCKLVGHT----EQGKKCDYCDAR--NPRRSHPAENLTDGNNpnnpT 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    105 WWQSPSIKNGVeyHYVTITLDLQQVFQIAYVIVKAAnSPRPGNWILERSLDDVEYKPWQYHAvtdTECLTLYNIyPRTGP 184
Cdd:smart00136   75 WWQSEPLSNGP--QNVNLTLDLGKEFHVTYVILKFC-SPRPSLWILERSDFGKTWQPWQYFS---SDCRRTFGR-PPRGP 147
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    185 PSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDP--SPELLEFTSARYIRLRFQRIRTLNADLMMfahkdpreI 262
Cdd:smart00136  148 ITKGNEDEVICTSEYSDIVPLEGGEIAFSLLEGRPSATDFdnSPVLQEWVTATNIRVRLTRLRTLGDELMD--------D 219
                           250
                    ....*....|....*....
gi 117647249    263 DPIVTRRYYYSVKDISVGG 281
Cdd:smart00136  220 RPEVTRRYYYAISDIAVGG 238
Laminin_II pfam06009
Laminin Domain II; It has been suggested that the domains I and II from laminin A, B1 and B2 ...
2033-2168 2.95e-54

Laminin Domain II; It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.


:

Pssm-ID: 368703 [Multi-domain]  Cd Length: 138  Bit Score: 186.54  E-value: 2.95e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2033 VKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDpsKNKIIADAGTSVRNLEQEADRLIDKLKPI 2112
Cdd:pfam06009    1 SKELAREANETAKEVLEQLAPLSQNLENTSEKLSGINRSLEETNELVND--ANKALDDAGRSVKKLEELAPDLLDKLKPL 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 117647249  2113 KELEDN---LKKNISEIKELINQARKQANSIKVSVSSGGDCVRTYRPEIKKGSYNNIVV 2168
Cdd:pfam06009   79 KQLEVNsssLSDNISRIKELIAQARKAANSIKVSVSFDGDSIVELRPPISVTDLAAYTS 137
LamB smart00281
Laminin B domain;
1225-1360 5.37e-47

Laminin B domain;


:

Pssm-ID: 214597  Cd Length: 127  Bit Score: 165.51  E-value: 5.37e-47
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1225 HLEPFYWKLPQQFEGKKLMAYGGKLKYAIYFEARDEtGFATYKPQVIIRGGtptHARIITRHMAAPLIGQLTRHEIEMTE 1304
Cdd:smart00281    1 DNEPVYWVAPEQFLGDKVTSYGGKLRYTLSFDGRRG-GTHVSAPDVILEGN---GLRISHPAEGPPLPDELTTVEVRFRE 76
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249   1305 KEWKYYGddpriSRTVTREDFLDILYDIHYILIKATYGNVVRQSRISEISMEVAEP 1360
Cdd:smart00281   77 ENWQYYG-----GRPVTREDLMMVLANLTAILIRATYSQQMAGSRLSDVSLEVAVP 127
Laminin_G_1 pfam00054
Laminin G domain;
2789-2917 2.21e-45

Laminin G domain;


:

Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 160.95  E-value: 2.21e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2789 VRTEAESGLLFYMARINHADFATVQLRNGFPYFSYDLGSGDTSTMIPTKINDGQWHKIKIVRVKQEGILYVDD-ASSQTI 2867
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGKWHSVELERNGRSGTLSVDGeARPTGE 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  2868 SPKKADI-LDVVGILYVGGLPINYTTRRIGPVTYSLDGCVRNLHMEQAPVD 2917
Cdd:pfam00054   81 SPLGATTdLDVDGPLYVGGLPSLGVKKRRLAISPSFDGCIRDVIVNGKPLD 131
Laminin_B pfam00052
Laminin B (Domain IV);
579-718 3.54e-39

Laminin B (Domain IV);


:

Pssm-ID: 459652  Cd Length: 136  Bit Score: 143.56  E-value: 3.54e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   579 YWSAPPPYLGNRLPAVGGQLSFTISYDLEEEEDDTekILQLMIIFEGNDLRISTAYKE-VYLEPSEEHIEEVSLKEEAFT 657
Cdd:pfam00052    1 YWSAPEQFLGNKLTSYGGYLTYTVRYEPLPGGGSL--NSEPDVILEGNGLRLSYSSPDqPPPDPGQEQTYSVRLHEENWR 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 117647249   658 iHGTNLPVTRKDFMIVLTNLERVLMQITYNLGMDAIfRLSSVNLESAVPYPTDRRiATDVE 718
Cdd:pfam00052   79 -DSDGAPVSREDFMMVLANLTAILIRATYSTGSGQV-SLSNVSLDSAVPGGSGPP-ASWVE 136
Laminin_G_1 pfam00054
Laminin G domain;
2364-2503 4.19e-39

Laminin G domain;


:

Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 142.84  E-value: 4.19e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2364 FRTFSSSALLMYLATRDLKDFMSVELSDGHVKVSYDLGSGMTSVVSNQNHNDGKWKAFTLSRIQKQANISIVDIdsnqEE 2443
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGKWHSVELERNGRSGTLSVDGE----AR 76
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2444 NVATSSSGNNFglDLKADDKIYFGGLPtlrNLSMKARPEVNVKKYSGCLKDIEISRTPYN 2503
Cdd:pfam00054   77 PTGESPLGATT--DLDVDGPLYVGGLP---SLGVKKRRLAISPSFDGCIRDVIVNGKPLD 131
Laminin_G_1 pfam00054
Laminin G domain;
2170-2310 4.64e-34

Laminin G domain;


:

Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 128.59  E-value: 4.64e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2170 VKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDlTIDDSYWYRIEASRTGRNGSISVRALDGPKA 2249
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGD-KLNDGKWHSVELERNGRSGTLSVDGEARPTG 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 117647249  2250 SmvpstyhsvSPPGYTIlDVDANAMLFVGGLTGKI-KKADAVRVITFTGCMGETYFDNKPIG 2310
Cdd:pfam00054   80 E---------SPLGATT-DLDVDGPLYVGGLPSLGvKKRRLAISPSFDGCIRDVIVNGKPLD 131
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2937-3089 3.87e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


:

Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 126.76  E-value: 3.87e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2937 GTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRPTGVLLGVSSQ-KMDGMGIEMIDEKLMFHVDNGAGRFTAIYdagiPGH 3015
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQnGGDFLALELEDGRLVLRYDLGSGSLVLSS----KTP 76
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 117647249 3016 MCNGQWHKVTAKKIKNRLELVVDGNQVDAQSPNSASTSADTNDPVFVGGFPGGLNQFGLTTNIRFRGCIRSLKL 3089
Cdd:cd00110    77 LNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKV 150
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2524-2686 1.26e-27

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


:

Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 110.97  E-value: 1.26e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2524 TVSFPKPGFVELAAVSID-VGTEINLSFSTRNESGIILLGSGGTLTPprrkrrqttqaYYAIFLNKGRLEVHLSSGTRTm 2602
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPrTRLSISFSFRTTSPNGLLLYAGSQNGGD-----------FLALELEDGRLVLRYDLGSGS- 68
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2603 rkIVIKPePNLFHDGREHSVHVERTRGIFTVQIDEDR--RHMQNLTEEQPIEVKKLFVGGAPPEFQPSPLRNIPAFQGCV 2680
Cdd:cd00110    69 --LVLSS-KTPLNDGQWHSVSVERNGRSVTLSVDGERvvESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCI 145

                  ....*.
gi 117647249 2681 WNLVIN 2686
Cdd:cd00110   146 RDLKVN 151
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
913-960 5.03e-14

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


:

Pssm-ID: 238012  Cd Length: 50  Bit Score: 68.53  E-value: 5.03e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 117647249  913 PCRCNINGSFSEICHTRTGQCECRPNVQGRHCDECKPETFGLQL-GRGC 960
Cdd:cd00055     1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSqGGGC 49
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1694-2054 3.43e-13

Uncharacterized protein, contains DUF3084 domain [Function unknown];


:

Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 74.17  E-value: 3.43e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1694 QLNETLGNQDKTAERNLEELQKEIDRMLKELRskDLQTQKEVAEDELvaaegllkrvnklfgepraqnedmeKDLQQKLA 1773
Cdd:COG4372    17 GLRPKTGILIAALSEQLRKALFELDKLQEELE--QLREELEQAREEL-------------------------EQLEEELE 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1774 EYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEgskRQIENTLKEGNDILDEANRLLGEINSVIDYVD 1853
Cdd:COG4372    70 QARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQ---EELEELQKERQDLEQQRKQLEAQIAELQSEIA 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1854 DIKTKLppmsEELSDKIDDLAQEIKDrrlaekvFQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDE 1933
Cdd:COG4372   147 EREEEL----KELEEQLESLQEELAA-------LEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRE 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1934 AEKVAREAK---ELAQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLG 2010
Cdd:COG4372   216 LAEELLEAKdslEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALEL 295
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 117647249 2011 ALNDTMDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDL 2054
Cdd:COG4372   296 KLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAEL 339
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
963-1007 3.73e-12

Laminin-type epidermal growth factor-like domai;


:

Pssm-ID: 214543  Cd Length: 46  Bit Score: 63.10  E-value: 3.73e-12
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 117647249    963 CNCNSFGSKSFDCEA-SGQCWCQPGVAGKKCDRCAHGYFNFQEGGC 1007
Cdd:smart00180    1 CDCDPGGSASGTCDPdTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1056-1104 1.74e-11

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


:

Pssm-ID: 395007  Cd Length: 49  Bit Score: 61.22  E-value: 1.74e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249  1056 CNCSTVGSLASQCNVNTGQCSCHPKFSGMKCSECSRGHWNYPLCTLCDC 1104
Cdd:pfam00053    1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1416-1462 4.80e-11

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


:

Pssm-ID: 395007  Cd Length: 49  Bit Score: 60.06  E-value: 4.80e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 117647249  1416 CQCNGHSS---QCDPETSVCQnCQHHTAGDFCERCALGYYGIVRGLPNDC 1462
Cdd:pfam00053    1 CDCNPHGSlsdTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
860-912 4.97e-11

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


:

Pssm-ID: 238012  Cd Length: 50  Bit Score: 60.06  E-value: 4.97e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  860 PCQCNDNldYSIPGSCDSLSGSCLiCKPGTTGRYCELCADGYFGDAVNAKNCQ 912
Cdd:cd00055     1 PCDCNGH--GSLSGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1102-1159 1.09e-10

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


:

Pssm-ID: 395007  Cd Length: 49  Bit Score: 58.90  E-value: 1.09e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  1102 CDCFLPGTDATTCDLETrkcscsdqtGQCSCKVNVEGVHCDRCRPGKFGLDAKNPLGC 1159
Cdd:pfam00053    1 CDCNPHGSLSDTCDPET---------GQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1009-1054 2.94e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


:

Pssm-ID: 238012  Cd Length: 50  Bit Score: 57.75  E-value: 2.94e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 117647249 1009 ACDCSHLG---NNCDPKTGQCICPPNTTGEKCSECLPNTWGH-SIVTGCK 1054
Cdd:cd00055     1 PCDCNGHGslsGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLpSQGGGCQ 50
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
752-801 8.22e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


:

Pssm-ID: 238012  Cd Length: 50  Bit Score: 56.59  E-value: 8.22e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  752 PCQCFAHA---EACDDITGECLnCKDHTGGPYCNECLPGFYGDPTRgsPEDCQ 801
Cdd:cd00055     1 PCDCNGHGslsGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQ--GGGCQ 50
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
465-511 1.07e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


:

Pssm-ID: 395007  Cd Length: 49  Bit Score: 56.21  E-value: 1.07e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249   465 CNCSGLGSTNE--DPCVGPCSCKENVEGEDCSRCKSGFFNLQEDNQKGC 511
Cdd:pfam00053    1 CDCNPHGSLSDtcDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
410-467 1.26e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


:

Pssm-ID: 395007  Cd Length: 49  Bit Score: 55.82  E-value: 1.26e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249   410 CHCDPTGSLSEVCVKDekyaqrglkPGSCHCKTGFGGVNCDRCVRGYHGYPDCQPCNC 467
Cdd:pfam00053    1 CDCNPHGSLSDTCDPE---------TGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1523-1566 3.36e-09

Laminin-type epidermal growth factor-like domai;


:

Pssm-ID: 214543  Cd Length: 46  Bit Score: 54.62  E-value: 3.36e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 117647249   1523 CECDPYGSLPVPCDRVTGLCTCRPGATGRKCDGCEHWHAREGAE 1566
Cdd:smart00180    1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPP 44
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
802-859 2.19e-06

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


:

Pssm-ID: 238012  Cd Length: 50  Bit Score: 46.96  E-value: 2.19e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  802 PCACPLNIpsnNFSPTCHLDrslGLICdECPIGYTGPRCERCAEGYFGQPSIPGGsCQ 859
Cdd:cd00055     1 PCDCNGHG---SLSGQCDPG---TGQC-ECKPNTTGRRCDRCAPGYYGLPSQGGG-CQ 50
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
283-330 2.41e-06

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


:

Pssm-ID: 238012  Cd Length: 50  Bit Score: 46.58  E-value: 2.41e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  283 CICYGHA---RACplDPATnkSRCECEHNTCGESCDRCCPGFHQKPWRAGT 330
Cdd:cd00055     2 CDCNGHGslsGQC--DPGT--GQCECKPNTTGRRCDRCAPGYYGLPSQGGG 48
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1465-1520 1.49e-05

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


:

Pssm-ID: 395007  Cd Length: 49  Bit Score: 44.65  E-value: 1.49e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249  1465 CACPLISPSnnfSPSCVLEGledYRCTaCPRGYEGQYCERCAPGYTGSPSSPGGSC 1520
Cdd:pfam00053    1 CDCNPHGSL---SDTCDPET---GQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1375-1403 2.88e-04

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


:

Pssm-ID: 238012  Cd Length: 50  Bit Score: 40.80  E-value: 2.88e-04
                          10        20
                  ....*....|....*....|....*....
gi 117647249 1375 RCDCPPGYSGLSCETCAPGFYRLRSEPGG 1403
Cdd:cd00055    20 QCECKPNTTGRRCDRCAPGYYGLPSQGGG 48
 
Name Accession Description Interval E-value
Laminin_I pfam06008
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from ...
1589-1848 1.40e-105

Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.


Pssm-ID: 310534 [Multi-domain]  Cd Length: 258  Bit Score: 338.62  E-value: 1.40e-105
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1589 TMNIN-LTGPLPAPYKILYGLENTTQELKHLLSPQRAPERLIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERT 1667
Cdd:pfam06008    1 LLSLNsLTGALPAPYKINYNLENLTKQLQEYLSPENAHKIQIEILEKELSSLAQETEELQKKATQTLAKAQQVNAESERT 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1668 NSRAESLEEFIKGLVQDAEAINEKAVQLNEtlgNQDKTAERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAAEGLL 1747
Cdd:pfam06008   81 LGHAKELAEAIKNLIDNIKEINEKVATLGE---NDFALPSSDLSRMLAEAQRMLGEIRSRDFGTQLQNAEAELKAAQDLL 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1748 KRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENT 1827
Cdd:pfam06008  158 SRIQTWFQSPQEENKALANALRDSLAEYEAKLSDLRELLREAAAKTRDANRLNLANQANLREFQRKKEEVSEQKNQLEET 237
                          250       260
                   ....*....|....*....|.
gi 117647249  1828 LKEGNDILDEANRLLGEINSV 1848
Cdd:pfam06008  238 LKTARDSLDAANLLLQEIDDA 258
LamNT smart00136
Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related ...
29-281 5.71e-93

Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins.


Pssm-ID: 214532  Cd Length: 238  Bit Score: 301.59  E-value: 5.71e-93
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249     29 AHQQRGLFPAVLNLASNALITTNATCGEKGPEMYCKLVEHVpgqpVRNPQCRICNQNssNPYQRHPITNAIDGKN----T 104
Cdd:smart00136    1 AGRPRSCYPPFVNLAFGREVTATSTCGEPGPERYCKLVGHT----EQGKKCDYCDAR--NPRRSHPAENLTDGNNpnnpT 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    105 WWQSPSIKNGVeyHYVTITLDLQQVFQIAYVIVKAAnSPRPGNWILERSLDDVEYKPWQYHAvtdTECLTLYNIyPRTGP 184
Cdd:smart00136   75 WWQSEPLSNGP--QNVNLTLDLGKEFHVTYVILKFC-SPRPSLWILERSDFGKTWQPWQYFS---SDCRRTFGR-PPRGP 147
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    185 PSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDP--SPELLEFTSARYIRLRFQRIRTLNADLMMfahkdpreI 262
Cdd:smart00136  148 ITKGNEDEVICTSEYSDIVPLEGGEIAFSLLEGRPSATDFdnSPVLQEWVTATNIRVRLTRLRTLGDELMD--------D 219
                           250
                    ....*....|....*....
gi 117647249    263 DPIVTRRYYYSVKDISVGG 281
Cdd:smart00136  220 RPEVTRRYYYAISDIAVGG 238
Laminin_N pfam00055
Laminin N-terminal (Domain VI);
35-281 6.13e-88

Laminin N-terminal (Domain VI);


Pssm-ID: 459653  Cd Length: 230  Bit Score: 286.79  E-value: 6.13e-88
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    35 LFPAVLNLASNALITTNATCGEKGPEMYCKLVEHVPGQpvrnpQCRICNqnSSNPYQRHPITNAIDGKN----TWWQSPS 110
Cdd:pfam00055    1 CYPAFGNLAFGREVSATSTCGLNGPERYCILSGLEGGK-----KCFICD--SRDPHNSHPPSNLTDSNNgtneTWWQSET 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   111 IKngVEYHYVTITLDLQQVFQIAYVIVKAAnSPRPGNWILERSLD-DVEYKPWQYHAvtdTECLTLYNIypRTGPPSYAK 189
Cdd:pfam00055   74 GV--IQYENVNLTLDLGKEFHFTYLILKFK-SPRPAAMVLERSTDfGKTWQPYQYFA---SDCRRTFGR--PSGPSRGIK 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   190 DDEVICTSFYSKIHPLENGEIHISLINGRPSA--DDPSPELLEFTSARYIRLRFQRIRTLNadlmmfahkDPREIDPIVT 267
Cdd:pfam00055  146 DDEVICTSEYSDISPLTGGEVIFSTLEGRPSAniFDYSPELQDWLTATNIRIRLLRLHTLG---------DELLDDPSVL 216
                          250
                   ....*....|....
gi 117647249   268 RRYYYSVKDISVGG 281
Cdd:pfam00055  217 RKYYYAISDISVGG 230
Laminin_II pfam06009
Laminin Domain II; It has been suggested that the domains I and II from laminin A, B1 and B2 ...
2033-2168 2.95e-54

Laminin Domain II; It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.


Pssm-ID: 368703 [Multi-domain]  Cd Length: 138  Bit Score: 186.54  E-value: 2.95e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2033 VKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDpsKNKIIADAGTSVRNLEQEADRLIDKLKPI 2112
Cdd:pfam06009    1 SKELAREANETAKEVLEQLAPLSQNLENTSEKLSGINRSLEETNELVND--ANKALDDAGRSVKKLEELAPDLLDKLKPL 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 117647249  2113 KELEDN---LKKNISEIKELINQARKQANSIKVSVSSGGDCVRTYRPEIKKGSYNNIVV 2168
Cdd:pfam06009   79 KQLEVNsssLSDNISRIKELIAQARKAANSIKVSVSFDGDSIVELRPPISVTDLAAYTS 137
LamB smart00281
Laminin B domain;
1225-1360 5.37e-47

Laminin B domain;


Pssm-ID: 214597  Cd Length: 127  Bit Score: 165.51  E-value: 5.37e-47
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1225 HLEPFYWKLPQQFEGKKLMAYGGKLKYAIYFEARDEtGFATYKPQVIIRGGtptHARIITRHMAAPLIGQLTRHEIEMTE 1304
Cdd:smart00281    1 DNEPVYWVAPEQFLGDKVTSYGGKLRYTLSFDGRRG-GTHVSAPDVILEGN---GLRISHPAEGPPLPDELTTVEVRFRE 76
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249   1305 KEWKYYGddpriSRTVTREDFLDILYDIHYILIKATYGNVVRQSRISEISMEVAEP 1360
Cdd:smart00281   77 ENWQYYG-----GRPVTREDLMMVLANLTAILIRATYSQQMAGSRLSDVSLEVAVP 127
Laminin_G_1 pfam00054
Laminin G domain;
2789-2917 2.21e-45

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 160.95  E-value: 2.21e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2789 VRTEAESGLLFYMARINHADFATVQLRNGFPYFSYDLGSGDTSTMIPTKINDGQWHKIKIVRVKQEGILYVDD-ASSQTI 2867
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGKWHSVELERNGRSGTLSVDGeARPTGE 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  2868 SPKKADI-LDVVGILYVGGLPINYTTRRIGPVTYSLDGCVRNLHMEQAPVD 2917
Cdd:pfam00054   81 SPLGATTdLDVDGPLYVGGLPSLGVKKRRLAISPSFDGCIRDVIVNGKPLD 131
Laminin_B pfam00052
Laminin B (Domain IV);
1230-1374 1.18e-44

Laminin B (Domain IV);


Pssm-ID: 459652  Cd Length: 136  Bit Score: 158.97  E-value: 1.18e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1230 YWKLPQQFEGKKLMAYGGKLKYAIYFEARDETGFATYKPQVIIRGGtptHARIITRHMA--APLIGQLTRHEIEMTEKEW 1307
Cdd:pfam00052    1 YWSAPEQFLGNKLTSYGGYLTYTVRYEPLPGGGSLNSEPDVILEGN---GLRLSYSSPDqpPPDPGQEQTYSVRLHEENW 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 117647249  1308 KYygddpRISRTVTREDFLDILYDIHYILIKATYGNVVRQSRISEISMEVAEPGhvlAGSPPAHLIE 1374
Cdd:pfam00052   78 RD-----SDGAPVSREDFMMVLANLTAILIRATYSTGSGQVSLSNVSLDSAVPG---GSGPPASWVE 136
Laminin_B pfam00052
Laminin B (Domain IV);
579-718 3.54e-39

Laminin B (Domain IV);


Pssm-ID: 459652  Cd Length: 136  Bit Score: 143.56  E-value: 3.54e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   579 YWSAPPPYLGNRLPAVGGQLSFTISYDLEEEEDDTekILQLMIIFEGNDLRISTAYKE-VYLEPSEEHIEEVSLKEEAFT 657
Cdd:pfam00052    1 YWSAPEQFLGNKLTSYGGYLTYTVRYEPLPGGGSL--NSEPDVILEGNGLRLSYSSPDqPPPDPGQEQTYSVRLHEENWR 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 117647249   658 iHGTNLPVTRKDFMIVLTNLERVLMQITYNLGMDAIfRLSSVNLESAVPYPTDRRiATDVE 718
Cdd:pfam00052   79 -DSDGAPVSREDFMMVLANLTAILIRATYSTGSGQV-SLSNVSLDSAVPGGSGPP-ASWVE 136
Laminin_G_1 pfam00054
Laminin G domain;
2364-2503 4.19e-39

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 142.84  E-value: 4.19e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2364 FRTFSSSALLMYLATRDLKDFMSVELSDGHVKVSYDLGSGMTSVVSNQNHNDGKWKAFTLSRIQKQANISIVDIdsnqEE 2443
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGKWHSVELERNGRSGTLSVDGE----AR 76
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2444 NVATSSSGNNFglDLKADDKIYFGGLPtlrNLSMKARPEVNVKKYSGCLKDIEISRTPYN 2503
Cdd:pfam00054   77 PTGESPLGATT--DLDVDGPLYVGGLP---SLGVKKRRLAISPSFDGCIRDVIVNGKPLD 131
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2762-2911 2.58e-37

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 138.71  E-value: 2.58e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2762 FGLSRNSHIAIAFDDTKvKNRLTIELEVRTEAESGLLFYMARINHADFATVQLRNGFPYFSYDLGSGDTSTMIPTKINDG 2841
Cdd:cd00110     2 VSFSGSSYVRLPTLPAP-RTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLNDG 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 117647249 2842 QWHKIKIVRVKQEGILYVDDASS-QTISPKKADILDVVGILYVGGLPINYTTRRIgPVTYSLDGCVRNLHM 2911
Cdd:cd00110    81 QWHSVSVERNGRSVTLSVDGERVvESGSPGGSALLNLDGPLYLGGLPEDLKSPGL-PVSPGFVGCIRDLKV 150
LamG smart00282
Laminin G domain;
2784-2911 1.67e-36

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 1.67e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2784 TIELEVRTEAESGLLFYMARINHADFATVQLRNGFPYFSYDLGSGDTSTMI-PTKINDGQWHKIKIVRVKQEGILYVDDA 2862
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSdPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 117647249   2863 SSQTI-SPKKADILDVVGILYVGGLPINYtTRRIGPVTYSLDGCVRNLHM 2911
Cdd:smart00282   81 NRVSGeSPGGLTILNLDGPLYLGGLPEDL-KLPPLPVTPGFRGCIRNLKV 129
Laminin_G_1 pfam00054
Laminin G domain;
2170-2310 4.64e-34

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 128.59  E-value: 4.64e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2170 VKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDlTIDDSYWYRIEASRTGRNGSISVRALDGPKA 2249
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGD-KLNDGKWHSVELERNGRSGTLSVDGEARPTG 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 117647249  2250 SmvpstyhsvSPPGYTIlDVDANAMLFVGGLTGKI-KKADAVRVITFTGCMGETYFDNKPIG 2310
Cdd:pfam00054   80 E---------SPLGATT-DLDVDGPLYVGGLPSLGvKKRRLAISPSFDGCIRDVIVNGKPLD 131
LamB smart00281
Laminin B domain;
578-706 2.82e-33

Laminin B domain;


Pssm-ID: 214597  Cd Length: 127  Bit Score: 126.22  E-value: 2.82e-33
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    578 YYWSAPPPYLGNRLPAVGGQLSFTISYdleEEEDDTEKILQLMIIFEGNDLRISTAYkEVYLEPSEEHIEEVSLKEEAFT 657
Cdd:smart00281    5 VYWVAPEQFLGDKVTSYGGKLRYTLSF---DGRRGGTHVSAPDVILEGNGLRISHPA-EGPPLPDELTTVEVRFREENWQ 80
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*....
gi 117647249    658 IHGTNlPVTRKDFMIVLTNLERVLMQITYNLGMDAIfRLSSVNLESAVP 706
Cdd:smart00281   81 YYGGR-PVTREDLMMVLANLTAILIRATYSQQMAGS-RLSDVSLEVAVP 127
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2937-3089 3.87e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 126.76  E-value: 3.87e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2937 GTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRPTGVLLGVSSQ-KMDGMGIEMIDEKLMFHVDNGAGRFTAIYdagiPGH 3015
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQnGGDFLALELEDGRLVLRYDLGSGSLVLSS----KTP 76
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 117647249 3016 MCNGQWHKVTAKKIKNRLELVVDGNQVDAQSPNSASTSADTNDPVFVGGFPGGLNQFGLTTNIRFRGCIRSLKL 3089
Cdd:cd00110    77 LNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKV 150
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2336-2497 2.28e-32

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.45  E-value: 2.28e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2336 TIQFDGEGYALVSRPIRWYPNiSTVMFKFRTFSSSALLMYLATRDLKDFMSVELSDGHVKVSYDLGSGMTSVVSNQNHND 2415
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPRTR-LSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLND 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2416 GKWKAFTLSRIQKQANISIvdidsNQEENVATSSSGNNFGLDLkaDDKIYFGGLPTlrnlSMKARPEVNVKKYSGCLKDI 2495
Cdd:cd00110    80 GQWHSVSVERNGRSVTLSV-----DGERVVESGSPGGSALLNL--DGPLYLGGLPE----DLKSPGLPVSPGFVGCIRDL 148

                  ..
gi 117647249 2496 EI 2497
Cdd:cd00110   149 KV 150
LamG smart00282
Laminin G domain;
2359-2499 1.38e-30

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 118.60  E-value: 1.38e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2359 TVMFKFRTFSSSALLMYLATRDLKDFMSVELSDGHVKVSYDLGSGMTSVVSNQNH-NDGKWKAFTLSRIQKQANISIvdi 2437
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPlNDGQWHRVAVERNGRSVTLSV--- 77
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 117647249   2438 dsNQEENVATSSSGNNFGLDLkaDDKIYFGGLPTlrnlSMKARPEVNVKKYSGCLKDIEISR 2499
Cdd:smart00282   78 --DGGNRVSGESPGGLTILNL--DGPLYLGGLPE----DLKLPPLPVTPGFRGCIRNLKVNG 131
LamG smart00282
Laminin G domain;
2960-3089 2.96e-30

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 117.83  E-value: 2.96e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2960 VEFEFRTTRPTGVLLGVSS-QKMDGMGIEMIDEKLMFHVDNGAGRFTAIYDagiPGHMCNGQWHKVTAKKIKNRLELVVD 3038
Cdd:smart00282    2 ISFSFRTTSPNGLLLYAGSkGGGDYLALELRDGRLVLRYDLGSGPARLTSD---PTPLNDGQWHRVAVERNGRSVTLSVD 78
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|..
gi 117647249   3039 G-NQVDAQSPnSASTSADTNDPVFVGGFPGGLNQFGLTTNIRFRGCIRSLKL 3089
Cdd:smart00282   79 GgNRVSGESP-GGLTILNLDGPLYLGGLPEDLKLPPLPVTPGFRGCIRNLKV 129
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2524-2686 1.26e-27

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 110.97  E-value: 1.26e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2524 TVSFPKPGFVELAAVSID-VGTEINLSFSTRNESGIILLGSGGTLTPprrkrrqttqaYYAIFLNKGRLEVHLSSGTRTm 2602
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPrTRLSISFSFRTTSPNGLLLYAGSQNGGD-----------FLALELEDGRLVLRYDLGSGS- 68
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2603 rkIVIKPePNLFHDGREHSVHVERTRGIFTVQIDEDR--RHMQNLTEEQPIEVKKLFVGGAPPEFQPSPLRNIPAFQGCV 2680
Cdd:cd00110    69 --LVLSS-KTPLNDGQWHSVSVERNGRSVTLSVDGERvvESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCI 145

                  ....*.
gi 117647249 2681 WNLVIN 2686
Cdd:cd00110   146 RDLKVN 151
LamG smart00282
Laminin G domain;
2165-2307 5.70e-27

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 108.20  E-value: 5.70e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2165 NIVVHVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDLTIDDSYWYRIEASRTGRNGSISVral 2244
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSV--- 77
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 117647249   2245 DGPKASMVpstyhsVSPPGYTILDVDANamLFVGGLTGKIKKADAVRVITFTGCMGETYFDNK 2307
Cdd:smart00282   78 DGGNRVSG------ESPGGLTILNLDGP--LYLGGLPEDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2146-2305 2.85e-25

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 104.04  E-value: 2.85e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2146 SGGDCVRtYRPEIKKGSYNNIVVHVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDlTIDDSYW 2225
Cdd:cd00110     5 SGSSYVR-LPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSSKT-PLNDGQW 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2226 YRIEASRTGRNGSISVralDGPKasmvpsTYHSVSPPGYTILDVDANamLFVGGLTGKIKKADAVRVITFTGCMGETYFD 2305
Cdd:cd00110    83 HSVSVERNGRSVTLSV---DGER------VVESGSPGGSALLNLDGP--LYLGGLPEDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_1 pfam00054
Laminin G domain;
2550-2691 4.58e-25

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 102.78  E-value: 4.58e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2550 FSTRNESGIILlgSGGTLTPPrrkrrqttqAYYAIFLNKGRLEVHLSSGTRtmrKIVIKPEPNLfHDGREHSVHVERTRG 2629
Cdd:pfam00054    1 FRTTEPSGLLL--YNGTQTER---------DFLALELRDGRLEVSYDLGSG---AAVVRSGDKL-NDGKWHSVELERNGR 65
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249  2630 IFTVQIDEDRRHM--QNLTEEQ-PIEVKKLFVGGAPPE-FQPSPLRNIPAFQGCVWNLVINSIPMD 2691
Cdd:pfam00054   66 SGTLSVDGEARPTgeSPLGATTdLDVDGPLYVGGLPSLgVKKRRLAISPSFDGCIRDVIVNGKPLD 131
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
2964-3089 2.41e-24

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 100.57  E-value: 2.41e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2964 FRTTRPTGVLLGVSSQKMDGMGIEMIDEKLMFHVDNGAGRFTAIYdagIPGHMCNGQWHKVTAKKIKNRLELVVDGNQVD 3043
Cdd:pfam02210    1 FRTRQPNGLLLYAGGGGSDFLALELVNGRLVLRYDLGSGPESLLS---SGKNLNDGQWHSVRVERNGNTLTLSVDGQTVV 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 117647249  3044 AQSPNSASTSADTNDPVFVGGFPGGLNQFGLTTNIRFRGCIRSLKL 3089
Cdd:pfam02210   78 SSLPPGESLLLNLNGPLYLGGLPPLLLLPALPVRAGFVGCIRDVRV 123
LamG smart00282
Laminin G domain;
2545-2687 3.76e-23

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 97.41  E-value: 3.76e-23
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2545 EINLSFSTRNESGIILLGSGGTLTPprrkrrqttqaYYAIFLNKGRLEVHLSSGTRTmrkIVIKPEPNLFHDGREHSVHV 2624
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGD-----------YLALELRDGRLVLRYDLGSGP---ARLTSDPTPLNDGQWHRVAV 66
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 117647249   2625 ERTRGIFTVQID-EDRRHMQNLTEEQPIEVK-KLFVGGAPPEFQPSPLRNIPAFQGCVWNLVINS 2687
Cdd:smart00282   67 ERNGRSVTLSVDgGNRVSGESPGGLTILNLDgPLYLGGLPEDLKLPPLPVTPGFRGCIRNLKVNG 131
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1665-2143 3.54e-17

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 88.92  E-value: 3.54e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1665 ERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNQDKTAERnLEELQKEIDR----------MLKELRSK--DLQTQ 1732
Cdd:TIGR04523  225 KQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQ-LSEKQKELEQnnkkikelekQLNQLKSEisDLNNQ 303
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1733 KEvaedelvaaEGLLKRVNKLFGEPRAQNEDMEKDL---QQKLAEYKNKLDDawdLLREATDKTRDanrlsaaNQKnmti 1809
Cdd:TIGR04523  304 KE---------QDWNKELKSELKNQEKKLEEIQNQIsqnNKIISQLNEQISQ---LKKELTNSESE-------NSE---- 360
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1810 letKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKlppmSEELSDKI-------DDLAQEIKdrRL 1882
Cdd:TIGR04523  361 ---KQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKL----NQQKDEQIkklqqekELLEKEIE--RL 431
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1883 AEKVFQAESHAAQLNDSSAVLDGILDEAKNISfnataafraySNIKDYIDEAEKVAREAK-ELAQGATKLATSPQGLLKe 1961
Cdd:TIGR04523  432 KETIIKNNSEIKDLTNQDSVKELIIKNLDNTR----------ESLETQLKVLSRSINKIKqNLEQKQKELKSKEKELKK- 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1962 dakgslqksfrILNEAKKLANDVKgnhndlnDLKTRLETadlrnsgllgaLNDTMDKLSAITNDTAAKLQAVKEKAREAN 2041
Cdd:TIGR04523  501 -----------LNEEKKELEEKVK-------DLTKKISS-----------LKEKIEKLESEKKEKESKISDLEDELNKDD 551
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2042 DT-AKAVLAQVKD-LHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNKI-----IADAGTSVRNLEQEADRLIDKLKPIKE 2114
Cdd:TIGR04523  552 FElKKENLEKEIDeKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKdlikeIEEKEKKISSLEKELEKAKKENEKLSS 631
                          490       500
                   ....*....|....*....|....*....
gi 117647249  2115 LEDNLKKNISEIKELINQARKQANSIKVS 2143
Cdd:TIGR04523  632 IIKNIKSKKNKLKQEVKQIKETIKEIRNK 660
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1609-2145 2.99e-14

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 79.34  E-value: 2.99e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1609 ENTTQELKHLLSPQRAPERLIQlAEGNVNTLVMETNELLTRATKVTADGEQtgqDAERTNSRAESLEEFIKGLVQDAEAI 1688
Cdd:PRK03918  165 KNLGEVIKEIKRRIERLEKFIK-RTENIEELIKEKEKELEEVLREINEISS---ELPELREELEKLEKEVKELEELKEEI 240
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1689 NEKAVQLNETLGNQdktaeRNLEELQKEIDRMLKELRSKDLQTQKEVAE-DELvaaEGLLKRVNKLfGEPRAQNEDMEKD 1767
Cdd:PRK03918  241 EELEKELESLEGSK-----RKLEEKIRELEERIEELKKEIEELEEKVKElKEL---KEKAEEYIKL-SEFYEEYLDELRE 311
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1768 LQQKLAEYKNKLDDAWDLLREATDKTRDANRLSaanqKNMTILETKKEAIEGSKRQIEnTLKEgndILDEANRL------ 1841
Cdd:PRK03918  312 IEKRLSRLEEEINGIEERIKELEEKEERLEELK----KKLKELEKRLEELEERHELYE-EAKA---KKEELERLkkrltg 383
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1842 --LGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKDRRLA-EKVFQAESH----AAQLNDssavldgilDEAKNIs 1914
Cdd:PRK03918  384 ltPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAiEELKKAKGKcpvcGRELTE---------EHRKEL- 453
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1915 fnataafraysnIKDYIDEAEKVAREAKELAQGATKLATSPQGLLKEDAKGS-LQKSFRILNEAKKLANDVKG-NHNDLN 1992
Cdd:PRK03918  454 ------------LEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESeLIKLKELAEQLKELEEKLKKyNLEELE 521
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1993 DLKTRLETADLRNSGLLG---ALNDTMDKLSAITNdtaaKLQAVKEKAREANDTAKAVLAQ--------VKDLHQNLDGL 2061
Cdd:PRK03918  522 KKAEEYEKLKEKLIKLKGeikSLKKELEKLEELKK----KLAELEKKLDELEEELAELLKEleelgfesVEELEERLKEL 597
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2062 KQNYNKLadsvaktnavvkdpsknkiiadagTSVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIK 2141
Cdd:PRK03918  598 EPFYNEY------------------------LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELE 653

                  ....
gi 117647249 2142 VSVS 2145
Cdd:PRK03918  654 KKYS 657
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
913-960 5.03e-14

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 68.53  E-value: 5.03e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 117647249  913 PCRCNINGSFSEICHTRTGQCECRPNVQGRHCDECKPETFGLQL-GRGC 960
Cdd:cd00055     1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSqGGGC 49
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1694-2054 3.43e-13

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 74.17  E-value: 3.43e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1694 QLNETLGNQDKTAERNLEELQKEIDRMLKELRskDLQTQKEVAEDELvaaegllkrvnklfgepraqnedmeKDLQQKLA 1773
Cdd:COG4372    17 GLRPKTGILIAALSEQLRKALFELDKLQEELE--QLREELEQAREEL-------------------------EQLEEELE 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1774 EYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEgskRQIENTLKEGNDILDEANRLLGEINSVIDYVD 1853
Cdd:COG4372    70 QARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQ---EELEELQKERQDLEQQRKQLEAQIAELQSEIA 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1854 DIKTKLppmsEELSDKIDDLAQEIKDrrlaekvFQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDE 1933
Cdd:COG4372   147 EREEEL----KELEEQLESLQEELAA-------LEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRE 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1934 AEKVAREAK---ELAQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLG 2010
Cdd:COG4372   216 LAEELLEAKdslEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALEL 295
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 117647249 2011 ALNDTMDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDL 2054
Cdd:COG4372   296 KLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAEL 339
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
914-960 5.47e-13

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 65.45  E-value: 5.47e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249   914 CRCNINGSFSEICHTRTGQCECRPNVQGRHCDECKPETFGLQ--LGRGC 960
Cdd:pfam00053    1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPsdPPQGC 49
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1642-2129 6.76e-13

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 75.08  E-value: 6.76e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1642 ETNELLTRATKVTADgeQTGQDAERTNSRAESLEEFIKGLVQD-AEAINEKavqlnETLGNQDKTAERNLEELQKEIDRM 1720
Cdd:PRK02224  226 EEQREQARETRDEAD--EVLEEHEERREELETLEAEIEDLRETiAETERER-----EELAEEVRDLRERLEELEEERDDL 298
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1721 LKE------------LRSKDLQTQKEVAEDELV-----------AAEGLLKRVNKLfgEPRAQN-----EDMEKDLQ--- 1769
Cdd:PRK02224  299 LAEaglddadaeaveARREELEDRDEELRDRLEecrvaaqahneEAESLREDADDL--EERAEElreeaAELESELEear 376
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1770 QKLAEYKNKLDDAWDLLREATDKTRDAN-RLSAANQKNMTILETKkEAIEGSKRQIENTLKEGNDILDEANRLLGEIN-- 1846
Cdd:PRK02224  377 EAVEDRREEIEELEEEIEELRERFGDAPvDLGNAEDFLEELREER-DELREREAELEATLRTARERVEEAEALLEAGKcp 455
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1847 ---------SVIDYVDDIKTKLppmsEELSDKIDDLAQEIKDrrLAEKVFQAEShaaqLNDSSAVLDGILDEAKNISfna 1917
Cdd:PRK02224  456 ecgqpvegsPHVETIEEDRERV----EELEAELEDLEEEVEE--VEERLERAED----LVEAEDRIERLEERREDLE--- 522
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1918 taafraySNIKDYIDEAEKVAREAKELAQGATKLATSPQGllKEDAKGSLQKsfrilnEAKKLANDVKGNHNDLNDLKTR 1997
Cdd:PRK02224  523 -------ELIAERRETIEEKRERAEELRERAAELEAEAEE--KREAAAEAEE------EAEEAREEVAELNSKLAELKER 587
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1998 LEtadlrnsgllgALNDTMDKLSAITN--DTAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGlkqnynKLadsvakt 2075
Cdd:PRK02224  588 IE-----------SLERIRTLLAAIADaeDEIERLREKREALAELNDERRERLAEKRERKRELEA------EF------- 643
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249 2076 navvkDPSKnkiIADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNI----SEIKEL 2129
Cdd:PRK02224  644 -----DEAR---IEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIgaveNELEEL 693
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
914-960 9.87e-13

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 64.64  E-value: 9.87e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 117647249    914 CRCNINGSFSEICHTRTGQCECRPNVQGRHCDECKPETFGLQlGRGC 960
Cdd:smart00180    1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDG-PPGC 46
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1644-2146 1.17e-12

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 73.90  E-value: 1.17e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1644 NELLTRATKVTADGEQTGQDAERTNSraesLEEFIKGLVQDAEAINEKAV-------QLNETL---GNQDKTAERNLEEL 1713
Cdd:TIGR04523   47 NELKNKEKELKNLDKNLNKDEEKINN----SNNKIKILEQQIKDLNDKLKknkdkinKLNSDLskiNSEIKNDKEQKNKL 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1714 QKEIDRMLKELRSKDlQTQKEVAeDELVAAEGLLKRVNKLFGEPRAQNEDMEKdlqqklaeYKNKLDDAWDLLREATDKT 1793
Cdd:TIGR04523  123 EVELNKLEKQKKENK-KNIDKFL-TEIKKKEKELEKLNNKYNDLKKQKEELEN--------ELNLLEKEKLNIQKNIDKI 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1794 RdaNRLSAANQKnMTILETKKEAIEGSKRQIENtLKEGNDIL-DEANRLLGEINsvidyvdDIKTKLPPMSEELSDKIDD 1872
Cdd:TIGR04523  193 K--NKLLKLELL-LSNLKKKIQKNKSLESQISE-LKKQNNQLkDNIEKKQQEIN-------EKTTEISNTQTQLNQLKDE 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1873 LaQEIKDRrLAEKVFQAESHAAQLNDSSAVLdgildeaknisfnataafraySNIKDYIDEAEKVareaKElaQGATKLa 1952
Cdd:TIGR04523  262 Q-NKIKKQ-LSEKQKELEQNNKKIKELEKQL---------------------NQLKSEISDLNNQ----KE--QDWNKE- 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1953 tspqglLKEDAKgSLQKSFRI----LNEAKKLANDVKgnhNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAA 2028
Cdd:TIGR04523  312 ------LKSELK-NQEKKLEEiqnqISQNNKIISQLN---EQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQS 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2029 KLQAVKEKAREANDTaKAVLAQVKDLHQNLD----GLKQNYNK-------LADSVAKTNAVVKD-----PSKNKIIADAG 2092
Cdd:TIGR04523  382 YKQEIKNLESQINDL-ESKIQNQEKLNQQKDeqikKLQQEKELlekeierLKETIIKNNSEIKDltnqdSVKELIIKNLD 460
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2093 TSVRNLEQEADRLIDKLKPIK-ELEDN---------------------------LKKNISEIKELINQARKQANSIKVSV 2144
Cdd:TIGR04523  461 NTRESLETQLKVLSRSINKIKqNLEQKqkelkskekelkklneekkeleekvkdLTKKISSLKEKIEKLESEKKEKESKI 540

                   ..
gi 117647249  2145 SS 2146
Cdd:TIGR04523  541 SD 542
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
963-1007 3.73e-12

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 63.10  E-value: 3.73e-12
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 117647249    963 CNCNSFGSKSFDCEA-SGQCWCQPGVAGKKCDRCAHGYFNFQEGGC 1007
Cdd:smart00180    1 CDCDPGGSASGTCDPdTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1056-1104 1.74e-11

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 61.22  E-value: 1.74e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249  1056 CNCSTVGSLASQCNVNTGQCSCHPKFSGMKCSECSRGHWNYPLCTLCDC 1104
Cdd:pfam00053    1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
962-1007 4.55e-11

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 60.06  E-value: 4.55e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 117647249  962 PCNCNSFGSKSFDCEA-SGQCWCQPGVAGKKCDRCAHGYFNF--QEGGC 1007
Cdd:cd00055     1 PCDCNGHGSLSGQCDPgTGQCECKPNTTGRRCDRCAPGYYGLpsQGGGC 49
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1416-1462 4.80e-11

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 60.06  E-value: 4.80e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 117647249  1416 CQCNGHSS---QCDPETSVCQnCQHHTAGDFCERCALGYYGIVRGLPNDC 1462
Cdd:pfam00053    1 CDCNPHGSlsdTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
860-912 4.97e-11

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 60.06  E-value: 4.97e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  860 PCQCNDNldYSIPGSCDSLSGSCLiCKPGTTGRYCELCADGYFGDAVNAKNCQ 912
Cdd:cd00055     1 PCDCNGH--GSLSGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1635-1912 6.44e-11

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 66.09  E-value: 6.44e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1635 NVNTLVMETNELLTRATKVTadgEQTGQDAERTNSRAESLEEF----------IKGLVQDAEAINEKAVQLNETLgnQDK 1704
Cdd:COG1340     2 KTDELSSSLEELEEKIEELR---EEIEELKEKRDELNEELKELaekrdelnaqVKELREEAQELREKRDELNEKV--KEL 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1705 TAERNleELQKEIDRMLKELRSKDLQTQKEVAEDELVAAegLLKRVNKLfgEPRAQNEDM----EKDLQQKLAEYKNKLD 1780
Cdd:COG1340    77 KEERD--ELNEKLNELREELDELRKELAELNKAGGSIDK--LRKEIERL--EWRQQTEVLspeeEKELVEKIKELEKELE 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1781 DAwdllREATDKTRDANRLSAAnqknmtiLETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLP 1860
Cdd:COG1340   151 KA----KKALEKNEKLKELRAE-------LKELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEIV 219
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 117647249 1861 PMSE---ELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLNDSSAVLDGILDEAKN 1912
Cdd:COG1340   220 EAQEkadELHEEIIELQKELRELRKELKKLRKKQRALKREKEKEELEEKAEEIFE 274
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1102-1159 1.09e-10

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 58.90  E-value: 1.09e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  1102 CDCFLPGTDATTCDLETrkcscsdqtGQCSCKVNVEGVHCDRCRPGKFGLDAKNPLGC 1159
Cdd:pfam00053    1 CDCNPHGSLSDTCDPET---------GQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1415-1463 1.51e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 58.52  E-value: 1.51e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 117647249 1415 PCQCNGHSS---QCDPETSVCQnCQHHTAGDFCERCALGYYGIvRGLPNDCQ 1463
Cdd:cd00055     1 PCDCNGHGSlsgQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGL-PSQGGGCQ 50
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
963-1010 2.41e-10

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 58.13  E-value: 2.41e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249   963 CNCNSFGSKSFDCEAS-GQCWCQPGVAGKKCDRCAHGYFNFQEGGCIAC 1010
Cdd:pfam00053    1 CDCNPHGSLSDTCDPEtGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1009-1054 2.94e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 57.75  E-value: 2.94e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 117647249 1009 ACDCSHLG---NNCDPKTGQCICPPNTTGEKCSECLPNTWGH-SIVTGCK 1054
Cdd:cd00055     1 PCDCNGHGslsGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLpSQGGGCQ 50
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1056-1098 3.51e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 57.75  E-value: 3.51e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 117647249 1056 CNCSTVGSLASQCNVNTGQCSCHPKFSGMKCSECSRGHWNYPL 1098
Cdd:cd00055     2 CDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPS 44
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1010-1048 5.56e-10

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 56.94  E-value: 5.56e-10
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 117647249   1010 CDCS---HLGNNCDPKTGQCICPPNTTGEKCSECLPNTWGHS 1048
Cdd:smart00180    1 CDCDpggSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDG 42
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
752-801 8.22e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 56.59  E-value: 8.22e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  752 PCQCFAHA---EACDDITGECLnCKDHTGGPYCNECLPGFYGDPTRgsPEDCQ 801
Cdd:cd00055     1 PCDCNGHGslsGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQ--GGGCQ 50
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
1653-2119 1.04e-09

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 63.94  E-value: 1.04e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1653 VTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLnETLGNQDKTAERNLEELQKEIDRMLKELrskdlqTQ 1732
Cdd:pfam05622    5 AQEEKDELAQRCHELDQQVSLLQEEKNSLQQENKKLQERLDQL-ESGDDSGTPGGKKYLLLQKQLEQLQEEN------FR 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1733 KEVAEDEL-VAAEGLLKRVnklfGEPRAQNEDMekdlqQKLAEYKNKLDDAWDLLREATDKtrdANRLSAAnqknmtiLE 1811
Cdd:pfam05622   78 LETARDDYrIKCEELEKEV----LELQHRNEEL-----TSLAEEAQALKDEMDILRESSDK---VKKLEAT-------VE 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1812 TKKEAIEGS---KRQIEnTLKEGN-----------DILDEANRLLGEINSVIDYVDDIKTKLppmSEE------------ 1865
Cdd:pfam05622  139 TYKKKLEDLgdlRRQVK-LLEERNaeymqrtlqleEELKKANALRGQLETYKRQVQELHGKL---SEEskkadklefeyk 214
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1866 -LSDKIDDLAQEiKDRRLAE----KVFQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYID--EAE-KV 1937
Cdd:pfam05622  215 kLEEKLEALQKE-KERLIIErdtlRETNEELRCAQLQQAELSQADALLSPSSDPGDNLAAEIMPAEIREKLIrlQHEnKM 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1938 AREAKElAQGATKLATSpQGLLkEDAKGSLQKsfriLNEAKKLAND-VKGNHNDLNDLKTRLETADLRN--SGLLGA-LN 2013
Cdd:pfam05622  294 LRLGQE-GSYRERLTEL-QQLL-EDANRRKNE----LETQNRLANQrILELQQQVEELQKALQEQGSKAedSSLLKQkLE 366
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2014 DTMDKLSaitndtaaKLQAVKEKAREA-NDTAKAVLAQvkdLHQNLDGLKQNYNKLADS-----------VAKTNAVVK- 2080
Cdd:pfam05622  367 EHLEKLH--------EAQSELQKKKEQiEELEPKQDSN---LAQKIDELQEALRKKDEDmkameerykkyVEKAKSVIKt 435
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*..
gi 117647249  2081 -DPSKNKIIADAGTSVRNLEQEADRLID-------KLKPIKELEDNL 2119
Cdd:pfam05622  436 lDPKQNPASPPEIQALKNQLLEKDKKIEhlerdfeKSKLQREQEEKL 482
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
465-511 1.07e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 56.21  E-value: 1.07e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249   465 CNCSGLGSTNE--DPCVGPCSCKENVEGEDCSRCKSGFFNLQEDNQKGC 511
Cdd:pfam00053    1 CDCNPHGSLSDtcDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1010-1053 1.23e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 56.21  E-value: 1.23e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 117647249  1010 CDCSHLG---NNCDPKTGQCICPPNTTGEKCSECLPNTWGHSIVTGC 1053
Cdd:pfam00053    1 CDCNPHGslsDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQ 47
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
410-467 1.26e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 55.82  E-value: 1.26e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249   410 CHCDPTGSLSEVCVKDekyaqrglkPGSCHCKTGFGGVNCDRCVRGYHGYPDCQPCNC 467
Cdd:pfam00053    1 CDCNPHGSLSDTCDPE---------TGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
753-800 2.33e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 55.05  E-value: 2.33e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 117647249   753 CQCFAHA---EACDDITGECLnCKDHTGGPYCNECLPGFYGDPTrGSPEDC 800
Cdd:pfam00053    1 CDCNPHGslsDTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPS-DPPQGC 49
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1102-1159 2.99e-09

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 55.01  E-value: 2.99e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249   1102 CDCFLPGTDATTCDLETrkcscsdqtGQCSCKVNVEGVHCDRCRPGKFGldaKNPLGC 1159
Cdd:smart00180    1 CDCDPGGSASGTCDPDT---------GQCECKPNVTGRRCDRCAPGYYG---DGPPGC 46
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1523-1566 3.36e-09

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 54.62  E-value: 3.36e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 117647249   1523 CECDPYGSLPVPCDRVTGLCTCRPGATGRKCDGCEHWHAREGAE 1566
Cdd:smart00180    1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPP 44
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1523-1560 3.43e-09

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 54.67  E-value: 3.43e-09
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 117647249 1523 CECDPYGSLPVPCDRVTGLCTCRPGATGRKCDGCEHWH 1560
Cdd:cd00055     2 CDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGY 39
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1056-1097 3.64e-09

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 54.62  E-value: 3.64e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 117647249   1056 CNCSTVGSLASQCNVNTGQCSCHPKFSGMKCSECSRGHWNYP 1097
Cdd:smart00180    1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDG 42
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1523-1558 6.02e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 53.90  E-value: 6.02e-09
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 117647249  1523 CECDPYGSLPVPCDRVTGLCTCRPGATGRKCDGCEH 1558
Cdd:pfam00053    1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKP 36
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1102-1159 6.29e-09

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 53.90  E-value: 6.29e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249 1102 CDCFLPGTDATTCDLETrkcscsdqtGQCSCKVNVEGVHCDRCRPGKFGlDAKNPLGC 1159
Cdd:cd00055     2 CDCNGHGSLSGQCDPGT---------GQCECKPNTTGRRCDRCAPGYYG-LPSQGGGC 49
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
1853-2142 6.33e-09

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 60.46  E-value: 6.33e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1853 DDIKTKLPpmseELSDKIDDLAQEIKDRRLAEKVFQaeSHAAQLNDSsaVLDGILDEAKNISFNATAA----FRAYSNIK 1928
Cdd:cd22656    26 EEYRKRLG----ISSDIDDKLSSDFDPLLDAYKSIK--DHCTDFKDD--TYPSIVSLAGDIYNYAQNAggtiDSYYAEIL 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1929 DYIDEA-----EKVAREAKELAQGATKLatspqglLKEDAKgslqksfRILNEAKKLANDVKGNHNDLNDLKTRLETADL 2003
Cdd:cd22656    98 ELIDDLadatdDEELEEAKKTIKALLDD-------LLKEAK-------KYQDKAAKVVDKLTDFENQTEKDQTALETLEK 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2004 RNSGLLGalndtmDKLSAITNDTAAKLQAVKEKAREandtakavlAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDps 2083
Cdd:cd22656   164 ALKDLLT------DEGGAIARKEIKDLQKELEKLNE---------EYAAKLKAKIDELKALIADDEAKLAAALRLIAD-- 226
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 117647249 2084 knkiIADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKV 2142
Cdd:cd22656   227 ----LTAADTDLDNLLALIGPAIPALEKLQGAWQAIATDLDSLKDLLEDDISKIPAAIL 281
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
861-904 7.36e-09

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 53.85  E-value: 7.36e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 117647249    861 CQCNdnLDYSIPGSCDSLSGSCLiCKPGTTGRYCELCADGYFGD 904
Cdd:smart00180    1 CDCD--PGGSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGD 41
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
409-461 2.76e-08

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 52.36  E-value: 2.76e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  409 PCHCDPTGSLSEVCVKDEkyaqrglkpGSCHCKTGFGGVNCDRCVRGYHGYPD 461
Cdd:cd00055     1 PCDCNGHGSLSGQCDPGT---------GQCECKPNTTGRRCDRCAPGYYGLPS 44
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
861-907 3.95e-08

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 51.59  E-value: 3.95e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 117647249   861 CQCNDNLDYSipGSCDSLSGSCLiCKPGTTGRYCELCADGYFGDAVN 907
Cdd:pfam00053    1 CDCNPHGSLS--DTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPSD 44
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1416-1462 6.47e-08

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 51.16  E-value: 6.47e-08
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 117647249   1416 CQCN--GH-SSQCDPETSVCQnCQHHTAGDFCERCALGYYGIVrglPNDC 1462
Cdd:smart00180    1 CDCDpgGSaSGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGDG---PPGC 46
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
410-462 2.24e-07

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 49.62  E-value: 2.24e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*
gi 117647249    410 CHCDPTGSLSEVCVKDEkyaqrglkpGSCHCKTGFGGVNCDRCVRGYHG--YPDC 462
Cdd:smart00180    1 CDCDPGGSASGTCDPDT---------GQCECKPNVTGRRCDRCAPGYYGdgPPGC 46
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
464-512 2.84e-07

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 49.27  E-value: 2.84e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  464 PCNCSGLGSTNE--DPCVGPCSCKENVEGEDCSRCKSGFFNLQEDNQkGCE 512
Cdd:cd00055     1 PCDCNGHGSLSGqcDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGG-GCQ 50
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
1682-1881 3.36e-07

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 53.60  E-value: 3.36e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1682 VQDAEA-INEKAVQLNETLGNQDktaERNLEELQKEIDRMLKEL-----RSKDLQTQ-KEVAEDELVAAEGLLKRVNKLf 1754
Cdd:cd00176     9 ADELEAwLSEKEELLSSTDYGDD---LESVEALLKKHEALEAELaaheeRVEALNELgEQLIEEGHPDAEEIQERLEEL- 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1755 gepraqnEDMEKDLQQKLAEYKNKLDDAWDLLRE----------ATDKTRDANRLSAAnqKNMTILETKKEAIEGSKRQI 1824
Cdd:cd00176    85 -------NQRWEELRELAEERRQRLEEALDLQQFfrdaddleqwLEEKEAALASEDLG--KDLESVEELLKKHKELEEEL 155
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 117647249 1825 ENTLKEGNDILDEANRLLGEINSviDYVDDIKTKLppmsEELSDKIDDLAQEIKDRR 1881
Cdd:cd00176   156 EAHEPRLKSLNELAEELLEEGHP--DADEEIEEKL----EELNERWEELLELAEERQ 206
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1929-2142 3.79e-07

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 55.29  E-value: 3.79e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1929 DYIDEAEKVAREAKELAQGATKLATSpqgLLKEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETAD---LRN 2005
Cdd:COG4372     2 DRLGEKVGKARLSLFGLRPKTGILIA---ALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARselEQL 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2006 SGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAktnavvkdpSKN 2085
Cdd:COG4372    79 EEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIA---------ERE 149
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 117647249 2086 KIIADAGTSVRNLEQEADRLIDKLKPIKElednlKKNISEIKELINQARKQANSIKV 2142
Cdd:COG4372   150 EELKELEEQLESLQEELAALEQELQALSE-----AEAEQALDELLKEANRNAEKEEE 201
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
753-795 4.16e-07

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 48.85  E-value: 4.16e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 117647249    753 CQCF---AHAEACDDITGECLnCKDHTGGPYCNECLPGFYGDPTRG 795
Cdd:smart00180    1 CDCDpggSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGDGPPG 45
growth_prot_Scy NF041483
polarized growth protein Scy;
1613-1980 5.70e-07

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 55.60  E-value: 5.70e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1613 QELKHLLSPQRaperliQLAEGNVNTLVMETNELLTRatkvtadgeqTGQDAER--TNSRAESlEEFIKGLVQDAEAINE 1690
Cdd:NF041483  130 QQLDQELAERR------QTVESHVNENVAWAEQLRAR----------TESQARRllDESRAEA-EQALAAARAEAERLAE 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1691 KAVQlneTLGNQDKTAERNLEEL----QKEIDRMLKELrskdlQTQKEVAEDElvaAEGLLKRVNKLFGEPRAQNEDMEK 1766
Cdd:NF041483  193 EARQ---RLGSEAESARAEAEAIlrraRKDAERLLNAA-----STQAQEATDH---AEQLRSSTAAESDQARRQAAELSR 261
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1767 DLQQKLAEYKNKLD----DAWDLLREAtdKTRDANRLSAANQKNMTILETKKEAIegsKRQIENTLKEGNDILDEANRLL 1842
Cdd:NF041483  262 AAEQRMQEAEEALRearaEAEKVVAEA--KEAAAKQLASAESANEQRTRTAKEEI---ARLVGEATKEAEALKAEAEQAL 336
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1843 GeinsvidyvdDIKTKLPPMSEELSDKIDDLAqeikdrrlaekvfqAESHAAQLNDSSAVLDGILDEAkniSFNATAAFR 1922
Cdd:NF041483  337 A----------DARAEAEKLVAEAAEKARTVA--------------AEDTAAQLAKAARTAEEVLTKA---SEDAKATTR 389
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 117647249 1923 AYSnikdyiDEAEKVAREAK----ELAQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKKL 1980
Cdd:NF041483  390 AAA------EEAERIRREAEaeadRLRGEAADQAEQLKGAAKDDTKEYRAKTVELQEEARRL 445
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
802-859 2.19e-06

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 46.96  E-value: 2.19e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  802 PCACPLNIpsnNFSPTCHLDrslGLICdECPIGYTGPRCERCAEGYFGQPSIPGGsCQ 859
Cdd:cd00055     1 PCDCNGHG---SLSGQCDPG---TGQC-ECKPNTTGRRCDRCAPGYYGLPSQGGG-CQ 50
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
283-330 2.41e-06

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 46.58  E-value: 2.41e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  283 CICYGHA---RACplDPATnkSRCECEHNTCGESCDRCCPGFHQKPWRAGT 330
Cdd:cd00055     2 CDCNGHGslsGQC--DPGT--GQCECKPNTTGRRCDRCAPGYYGLPSQGGG 48
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
803-858 4.63e-06

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 45.81  E-value: 4.63e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249   803 CACPLNIPSnnfSPTCHldrSLGLICDeCPIGYTGPRCERCAEGYFGQPSIPGGSC 858
Cdd:pfam00053    1 CDCNPHGSL---SDTCD---PETGQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
alph_xenorhab_A NF033928
alpha-xenorhabdolysin family binary toxin subunit A; Alpha-xenorhabdolysin was the founding ...
1664-1944 5.07e-06

alpha-xenorhabdolysin family binary toxin subunit A; Alpha-xenorhabdolysin was the founding member of a family of alpha-helical pore-forming binary toxins. YaxAB from Yersinia enterocolitica has been studied structurally. This HMM represents subunit A proteins such as XaxA and YaxA, capable of binding to the membrane even in the absence of the B subunit. This family is related to the Bacillus haemolytic enterotoxin family (see PF05791.9), although thresholds for this HMM are set to exclude that family.


Pssm-ID: 468250 [Multi-domain]  Cd Length: 340  Bit Score: 51.53  E-value: 5.07e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1664 AERTNSRAESLEEFIKGLvQDAEAINEKAVQLNE----TLGNQDKTAERN----LEELQKEIDRMLKELR--SKDLQT-Q 1732
Cdd:NF033928   79 ARNIVVTGNPIIDLINEM-PIIKRGDLTEEELSElppiPLSSDDKEIVKElkeiLEDLKNDIKDYQQKADdvKKELDDfE 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1733 KEVAEDELVAAEGLLKRVNKLFGEP-----RAQNEDMEKDLQQKLAEYKNKLDDAWdllreatdktrdanrlSAANQKN- 1806
Cdd:NF033928  158 NDLREELLPQLKLKKKLYDDNLGSDsieelREKIDQLEKEIEQLNKEYDDYVKLSF----------------TGLAGGPi 221
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1807 -----MTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKlppMSEELS--DKIDDLAQEIkd 1879
Cdd:NF033928  222 glaitGGIFGSKAEKIRKEKNALIQEIDELQEQLKKKNALLGSLERLQTSLDDILTR---MEDALPalKKLKGVWQSL-- 296
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 117647249 1880 rrlaekvfqaeshAAQLNDSSAVLDGILDEAKNISFNAtaafraysNIKDYIDEAEKVAREAKEL 1944
Cdd:NF033928  297 -------------LTDIDSSINALKEIDDADSLRLFKL--------EFEQVIAPWKEIQDYAKQL 340
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
803-851 8.45e-06

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 44.99  E-value: 8.45e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 117647249    803 CACPlniPSNNFSPTCHLDrslGLICdECPIGYTGPRCERCAEGYFGQP 851
Cdd:smart00180    1 CDCD---PGGSASGTCDPD---TGQC-ECKPNVTGRRCDRCAPGYYGDG 42
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
465-511 1.03e-05

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 44.99  E-value: 1.03e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 117647249    465 CNCSGLGSTNE--DPCVGPCSCKENVEGEDCSRCKSGFFNlqeDNQKGC 511
Cdd:smart00180    1 CDCDPGGSASGtcDPDTGQCECKPNVTGRRCDRCAPGYYG---DGPPGC 46
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1465-1520 1.49e-05

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 44.65  E-value: 1.49e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249  1465 CACPLISPSnnfSPSCVLEGledYRCTaCPRGYEGQYCERCAPGYTGSPSSPGGSC 1520
Cdd:pfam00053    1 CDCNPHGSL---SDTCDPET---GQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
MA smart00283
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo ...
1822-2062 2.17e-05

Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo reversible methylation in response to attractants or repellants during bacterial chemotaxis.


Pssm-ID: 214599 [Multi-domain]  Cd Length: 262  Bit Score: 48.82  E-value: 2.17e-05
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1822 RQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLPPMSE------ELSDKIDDLAQEIKDR--RLAEKVFQAESHA 1893
Cdd:smart00283    4 EAVEEIAAGAEEQAEELEELAERMEELSASIEEVAANADEIAAtaqsaaEAAEEGREAVEDAITAmdQIREVVEEAVSAV 83
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1894 AQLNDSSAVLDGILDEAKNISF-------NAT--AAfRAysnikdyiDEAEK----VAREAKELAQGATKLATSPQGLLK 1960
Cdd:smart00283   84 EELEESSDEIGEIVSVIDDIADqtnllalNAAieAA-RA--------GEAGRgfavVADEVRKLAERSAESAKEIESLIK 154
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1961 E------DAKGSLQKSFRILNEAKKLANDVKGNHNDLNDL---------KTRLETADLRNSglLGALNDTMDKLSAITND 2025
Cdd:smart00283  155 EiqeetnEAVAAMEESSSEVEEGVELVEETGDALEEIVDSveeiadlvqEIAAATDEQAAG--SEEVNAAIDEIAQVTQE 232
                           250       260       270
                    ....*....|....*....|....*....|....*..
gi 117647249   2026 TAAKLQavkekarEANDTAKAVLAQVKDLHQNLDGLK 2062
Cdd:smart00283  233 TAAMSE-------EISAAAEELSGLAEELDELVERFK 262
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1464-1521 2.41e-04

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 41.19  E-value: 2.41e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249 1464 PCACPLISpsnNFSPSCVLEGLedyRCTaCPRGYEGQYCERCAPGYTGSPSSPGGsCQ 1521
Cdd:cd00055     1 PCDCNGHG---SLSGQCDPGTG---QCE-CKPNTTGRRCDRCAPGYYGLPSQGGG-CQ 50
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1375-1403 2.88e-04

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 40.80  E-value: 2.88e-04
                          10        20
                  ....*....|....*....|....*....
gi 117647249 1375 RCDCPPGYSGLSCETCAPGFYRLRSEPGG 1403
Cdd:cd00055    20 QCECKPNTTGRRCDRCAPGYYGLPSQGGG 48
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1465-1513 3.30e-04

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 40.76  E-value: 3.30e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 117647249   1465 CACPlisPSNNFSPSCVLEGLedyRCTaCPRGYEGQYCERCAPGYTGSP 1513
Cdd:smart00180    1 CDCD---PGGSASGTCDPDTG---QCE-CKPNVTGRRCDRCAPGYYGDG 42
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
302-326 3.61e-04

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 40.37  E-value: 3.61e-04
                            10        20
                    ....*....|....*....|....*
gi 117647249    302 RCECEHNTCGESCDRCCPGFHQKPW 326
Cdd:smart00180   19 QCECKPNVTGRRCDRCAPGYYGDGP 43
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1375-1403 3.93e-04

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 40.41  E-value: 3.93e-04
                           10        20
                   ....*....|....*....|....*....
gi 117647249  1375 RCDCPPGYSGLSCETCAPGFYRLRSEPGG 1403
Cdd:pfam00053   19 QCLCKPGVTGRHCDRCKPGYYGLPSDPPQ 47
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1375-1396 5.00e-03

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 37.29  E-value: 5.00e-03
                            10        20
                    ....*....|....*....|..
gi 117647249   1375 RCDCPPGYSGLSCETCAPGFYR 1396
Cdd:smart00180   19 QCECKPNVTGRRCDRCAPGYYG 40
Surf_Exclu_PgrA TIGR04320
SEC10/PgrA surface exclusion domain; This model describes a conserved domain found in surface ...
1972-2099 5.09e-03

SEC10/PgrA surface exclusion domain; This model describes a conserved domain found in surface proteins of a number of Firmutes. Many members have LPXTG C-terminal anchoring motifs and a substantial number have the KxYKxGKxW putative sorting signal at the N-terminus. The tetracycline resistance plasmid pCF10 in Enterococcus faecalis promotes conjugal plasmid transfer in response to sex pheromones, but PgrA/Sec10 encoded by that plasmid, a member of this family, specifically inhibits the ability of cells to receive homologous plasmids. The phenomenon is called surface exclusion.


Pssm-ID: 275124 [Multi-domain]  Cd Length: 356  Bit Score: 42.02  E-value: 5.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1972 RILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVKekareandTAKAVLAQV 2051
Cdd:TIGR04320  237 YIADGNKFDKTPIPNPPNSLAALQAKLATAQADLAAAQTALNTAQAALTSAQTAYAAAQAALA--------TAQKELANA 308
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 117647249  2052 KDlhQNLDGLKQNYNKLADSVAKTNAVVKDPSKNK--IIADAGTSVRNLE 2099
Cdd:TIGR04320  309 QA--QALQTAQNNLATAQAALANAEARLAKAKEALanLNADLAKKQAALD 356
ClyA-like cd21116
family of the cytolysin A (ClyA) family alpha pore-forming toxins (alpha-PFT) including ...
1983-2143 8.55e-03

family of the cytolysin A (ClyA) family alpha pore-forming toxins (alpha-PFT) including Bacillus cereus HblB, Aeromonas hydrophila AhlB, Bacillus thuringiensis Cry6Aa and similar proteins; This family belongs to the ClyA family of alpha-PFT bacterial toxins. PFTs form the major group of virulence factors in many pathogenic bacteria and in general are critical components of the molecular offensive and defensive machinery of cells in all kingdoms of life. Bacterial PFTs facilitate the takeover of host resources by puncturing holes in the membrane. PFTs can be classified as alpha-PFTs and beta-PFTs depending on the secondary structures of their membrane component. Alpha-PFTs use a ring of amphipathic helices while beta-PFTs use a beta-barrel to construct the pore. Members of this family include the toxins: Bacillus cereus hemolysin binding component B (HblB or HBL-B) of the diarrheal enterotoxin hemolysin BL, Aeromonas hydrophila hemolytic (Ahl) component B (AhlB) of the tripartite AhlABC toxin, Vibrio cholerae cytotoxin motility associated killing factor A (MakA) cytotoxin, Xenorhabdus nematophila alpha-xenorhabdolysin (XaxA), Bacillus thuringiensis crystal 6Aa (Cry6Aa) parasporal crystal (Cry) toxin, and Bacillus cereus non-hemolytic enterotoxin (Nhe) component A (NheA) of the non-hemolytic enterotoxin Nhe, which, despite its name, is hemolytic, among others. In solution, ClyA proteins have an elongated, almost entirely alpha-helical structure, except for a short hydrophobic beta-hairpin known as the beta-tongue. Pore formation by ClyA requires circular oligomerization of the toxin by a sequential mechanism. This, in turn, concentrates the amphipathic helices in the center of the ring-like structure, forming a helical barrel that inserts into the membrane by a wedge-like mechanism. Compared with ClyA, NheA is almost entirely alpha-helical with an enlarged "head" domain, and an enlarged beta-tongue; it has been proposed that NheA could even form beta-barrel pores. Alpha-PFTs with similar structures are increasingly being found in eukaryotes, in particular as components of the immune systems of animals. This family may be distantly related to Escherichia coli alpha-PFT hemolysin E (HlyE, also known as ClyA or SheA).


Pssm-ID: 439149 [Multi-domain]  Cd Length: 224  Bit Score: 40.47  E-value: 8.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1983 DVKGNHND-LNDLKTRLE---TADLRNSGLLGALNDTMDKL--SAITNDTAAK------LQAVKEKAREANDTAKAVLAQ 2050
Cdd:cd21116    41 LARAHALEwLNEIKPKLLslpNDIIGYNNTFQSYYPDLIELadNLIKGDQGAKqqllqgLEALQSQVTKKQTSVTSFINE 120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2051 VKDLHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNkiiadagtsVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELI 2130
Cdd:cd21116   121 LTTFKNDLDDDSRNLQTDATKAQAQVAVLNALKNQ---------LNSLAEQIDAAIDALEKLSNDWQTLDSDIKELITDL 191
                         170
                  ....*....|...
gi 117647249 2131 NQARKQANSIKVS 2143
Cdd:cd21116   192 EDAESSIDAAFLQ 204
 
Name Accession Description Interval E-value
Laminin_I pfam06008
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from ...
1589-1848 1.40e-105

Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.


Pssm-ID: 310534 [Multi-domain]  Cd Length: 258  Bit Score: 338.62  E-value: 1.40e-105
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1589 TMNIN-LTGPLPAPYKILYGLENTTQELKHLLSPQRAPERLIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERT 1667
Cdd:pfam06008    1 LLSLNsLTGALPAPYKINYNLENLTKQLQEYLSPENAHKIQIEILEKELSSLAQETEELQKKATQTLAKAQQVNAESERT 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1668 NSRAESLEEFIKGLVQDAEAINEKAVQLNEtlgNQDKTAERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAAEGLL 1747
Cdd:pfam06008   81 LGHAKELAEAIKNLIDNIKEINEKVATLGE---NDFALPSSDLSRMLAEAQRMLGEIRSRDFGTQLQNAEAELKAAQDLL 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1748 KRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENT 1827
Cdd:pfam06008  158 SRIQTWFQSPQEENKALANALRDSLAEYEAKLSDLRELLREAAAKTRDANRLNLANQANLREFQRKKEEVSEQKNQLEET 237
                          250       260
                   ....*....|....*....|.
gi 117647249  1828 LKEGNDILDEANRLLGEINSV 1848
Cdd:pfam06008  238 LKTARDSLDAANLLLQEIDDA 258
LamNT smart00136
Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related ...
29-281 5.71e-93

Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins.


Pssm-ID: 214532  Cd Length: 238  Bit Score: 301.59  E-value: 5.71e-93
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249     29 AHQQRGLFPAVLNLASNALITTNATCGEKGPEMYCKLVEHVpgqpVRNPQCRICNQNssNPYQRHPITNAIDGKN----T 104
Cdd:smart00136    1 AGRPRSCYPPFVNLAFGREVTATSTCGEPGPERYCKLVGHT----EQGKKCDYCDAR--NPRRSHPAENLTDGNNpnnpT 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    105 WWQSPSIKNGVeyHYVTITLDLQQVFQIAYVIVKAAnSPRPGNWILERSLDDVEYKPWQYHAvtdTECLTLYNIyPRTGP 184
Cdd:smart00136   75 WWQSEPLSNGP--QNVNLTLDLGKEFHVTYVILKFC-SPRPSLWILERSDFGKTWQPWQYFS---SDCRRTFGR-PPRGP 147
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    185 PSYAKDDEVICTSFYSKIHPLENGEIHISLINGRPSADDP--SPELLEFTSARYIRLRFQRIRTLNADLMMfahkdpreI 262
Cdd:smart00136  148 ITKGNEDEVICTSEYSDIVPLEGGEIAFSLLEGRPSATDFdnSPVLQEWVTATNIRVRLTRLRTLGDELMD--------D 219
                           250
                    ....*....|....*....
gi 117647249    263 DPIVTRRYYYSVKDISVGG 281
Cdd:smart00136  220 RPEVTRRYYYAISDIAVGG 238
Laminin_N pfam00055
Laminin N-terminal (Domain VI);
35-281 6.13e-88

Laminin N-terminal (Domain VI);


Pssm-ID: 459653  Cd Length: 230  Bit Score: 286.79  E-value: 6.13e-88
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    35 LFPAVLNLASNALITTNATCGEKGPEMYCKLVEHVPGQpvrnpQCRICNqnSSNPYQRHPITNAIDGKN----TWWQSPS 110
Cdd:pfam00055    1 CYPAFGNLAFGREVSATSTCGLNGPERYCILSGLEGGK-----KCFICD--SRDPHNSHPPSNLTDSNNgtneTWWQSET 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   111 IKngVEYHYVTITLDLQQVFQIAYVIVKAAnSPRPGNWILERSLD-DVEYKPWQYHAvtdTECLTLYNIypRTGPPSYAK 189
Cdd:pfam00055   74 GV--IQYENVNLTLDLGKEFHFTYLILKFK-SPRPAAMVLERSTDfGKTWQPYQYFA---SDCRRTFGR--PSGPSRGIK 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   190 DDEVICTSFYSKIHPLENGEIHISLINGRPSA--DDPSPELLEFTSARYIRLRFQRIRTLNadlmmfahkDPREIDPIVT 267
Cdd:pfam00055  146 DDEVICTSEYSDISPLTGGEVIFSTLEGRPSAniFDYSPELQDWLTATNIRIRLLRLHTLG---------DELLDDPSVL 216
                          250
                   ....*....|....
gi 117647249   268 RRYYYSVKDISVGG 281
Cdd:pfam00055  217 RKYYYAISDISVGG 230
Laminin_II pfam06009
Laminin Domain II; It has been suggested that the domains I and II from laminin A, B1 and B2 ...
2033-2168 2.95e-54

Laminin Domain II; It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.


Pssm-ID: 368703 [Multi-domain]  Cd Length: 138  Bit Score: 186.54  E-value: 2.95e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2033 VKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDpsKNKIIADAGTSVRNLEQEADRLIDKLKPI 2112
Cdd:pfam06009    1 SKELAREANETAKEVLEQLAPLSQNLENTSEKLSGINRSLEETNELVND--ANKALDDAGRSVKKLEELAPDLLDKLKPL 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 117647249  2113 KELEDN---LKKNISEIKELINQARKQANSIKVSVSSGGDCVRTYRPEIKKGSYNNIVV 2168
Cdd:pfam06009   79 KQLEVNsssLSDNISRIKELIAQARKAANSIKVSVSFDGDSIVELRPPISVTDLAAYTS 137
LamB smart00281
Laminin B domain;
1225-1360 5.37e-47

Laminin B domain;


Pssm-ID: 214597  Cd Length: 127  Bit Score: 165.51  E-value: 5.37e-47
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1225 HLEPFYWKLPQQFEGKKLMAYGGKLKYAIYFEARDEtGFATYKPQVIIRGGtptHARIITRHMAAPLIGQLTRHEIEMTE 1304
Cdd:smart00281    1 DNEPVYWVAPEQFLGDKVTSYGGKLRYTLSFDGRRG-GTHVSAPDVILEGN---GLRISHPAEGPPLPDELTTVEVRFRE 76
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249   1305 KEWKYYGddpriSRTVTREDFLDILYDIHYILIKATYGNVVRQSRISEISMEVAEP 1360
Cdd:smart00281   77 ENWQYYG-----GRPVTREDLMMVLANLTAILIRATYSQQMAGSRLSDVSLEVAVP 127
Laminin_G_1 pfam00054
Laminin G domain;
2789-2917 2.21e-45

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 160.95  E-value: 2.21e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2789 VRTEAESGLLFYMARINHADFATVQLRNGFPYFSYDLGSGDTSTMIPTKINDGQWHKIKIVRVKQEGILYVDD-ASSQTI 2867
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGKWHSVELERNGRSGTLSVDGeARPTGE 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  2868 SPKKADI-LDVVGILYVGGLPINYTTRRIGPVTYSLDGCVRNLHMEQAPVD 2917
Cdd:pfam00054   81 SPLGATTdLDVDGPLYVGGLPSLGVKKRRLAISPSFDGCIRDVIVNGKPLD 131
Laminin_B pfam00052
Laminin B (Domain IV);
1230-1374 1.18e-44

Laminin B (Domain IV);


Pssm-ID: 459652  Cd Length: 136  Bit Score: 158.97  E-value: 1.18e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1230 YWKLPQQFEGKKLMAYGGKLKYAIYFEARDETGFATYKPQVIIRGGtptHARIITRHMA--APLIGQLTRHEIEMTEKEW 1307
Cdd:pfam00052    1 YWSAPEQFLGNKLTSYGGYLTYTVRYEPLPGGGSLNSEPDVILEGN---GLRLSYSSPDqpPPDPGQEQTYSVRLHEENW 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 117647249  1308 KYygddpRISRTVTREDFLDILYDIHYILIKATYGNVVRQSRISEISMEVAEPGhvlAGSPPAHLIE 1374
Cdd:pfam00052   78 RD-----SDGAPVSREDFMMVLANLTAILIRATYSTGSGQVSLSNVSLDSAVPG---GSGPPASWVE 136
Laminin_B pfam00052
Laminin B (Domain IV);
579-718 3.54e-39

Laminin B (Domain IV);


Pssm-ID: 459652  Cd Length: 136  Bit Score: 143.56  E-value: 3.54e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   579 YWSAPPPYLGNRLPAVGGQLSFTISYDLEEEEDDTekILQLMIIFEGNDLRISTAYKE-VYLEPSEEHIEEVSLKEEAFT 657
Cdd:pfam00052    1 YWSAPEQFLGNKLTSYGGYLTYTVRYEPLPGGGSL--NSEPDVILEGNGLRLSYSSPDqPPPDPGQEQTYSVRLHEENWR 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 117647249   658 iHGTNLPVTRKDFMIVLTNLERVLMQITYNLGMDAIfRLSSVNLESAVPYPTDRRiATDVE 718
Cdd:pfam00052   79 -DSDGAPVSREDFMMVLANLTAILIRATYSTGSGQV-SLSNVSLDSAVPGGSGPP-ASWVE 136
Laminin_G_1 pfam00054
Laminin G domain;
2364-2503 4.19e-39

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 142.84  E-value: 4.19e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2364 FRTFSSSALLMYLATRDLKDFMSVELSDGHVKVSYDLGSGMTSVVSNQNHNDGKWKAFTLSRIQKQANISIVDIdsnqEE 2443
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGDKLNDGKWHSVELERNGRSGTLSVDGE----AR 76
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2444 NVATSSSGNNFglDLKADDKIYFGGLPtlrNLSMKARPEVNVKKYSGCLKDIEISRTPYN 2503
Cdd:pfam00054   77 PTGESPLGATT--DLDVDGPLYVGGLP---SLGVKKRRLAISPSFDGCIRDVIVNGKPLD 131
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2762-2911 2.58e-37

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 138.71  E-value: 2.58e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2762 FGLSRNSHIAIAFDDTKvKNRLTIELEVRTEAESGLLFYMARINHADFATVQLRNGFPYFSYDLGSGDTSTMIPTKINDG 2841
Cdd:cd00110     2 VSFSGSSYVRLPTLPAP-RTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLNDG 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 117647249 2842 QWHKIKIVRVKQEGILYVDDASS-QTISPKKADILDVVGILYVGGLPINYTTRRIgPVTYSLDGCVRNLHM 2911
Cdd:cd00110    81 QWHSVSVERNGRSVTLSVDGERVvESGSPGGSALLNLDGPLYLGGLPEDLKSPGL-PVSPGFVGCIRDLKV 150
LamG smart00282
Laminin G domain;
2784-2911 1.67e-36

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 1.67e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2784 TIELEVRTEAESGLLFYMARINHADFATVQLRNGFPYFSYDLGSGDTSTMI-PTKINDGQWHKIKIVRVKQEGILYVDDA 2862
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSdPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 117647249   2863 SSQTI-SPKKADILDVVGILYVGGLPINYtTRRIGPVTYSLDGCVRNLHM 2911
Cdd:smart00282   81 NRVSGeSPGGLTILNLDGPLYLGGLPEDL-KLPPLPVTPGFRGCIRNLKV 129
Laminin_G_1 pfam00054
Laminin G domain;
2170-2310 4.64e-34

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 128.59  E-value: 4.64e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2170 VKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDlTIDDSYWYRIEASRTGRNGSISVRALDGPKA 2249
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAVVRSGD-KLNDGKWHSVELERNGRSGTLSVDGEARPTG 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 117647249  2250 SmvpstyhsvSPPGYTIlDVDANAMLFVGGLTGKI-KKADAVRVITFTGCMGETYFDNKPIG 2310
Cdd:pfam00054   80 E---------SPLGATT-DLDVDGPLYVGGLPSLGvKKRRLAISPSFDGCIRDVIVNGKPLD 131
LamB smart00281
Laminin B domain;
578-706 2.82e-33

Laminin B domain;


Pssm-ID: 214597  Cd Length: 127  Bit Score: 126.22  E-value: 2.82e-33
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    578 YYWSAPPPYLGNRLPAVGGQLSFTISYdleEEEDDTEKILQLMIIFEGNDLRISTAYkEVYLEPSEEHIEEVSLKEEAFT 657
Cdd:smart00281    5 VYWVAPEQFLGDKVTSYGGKLRYTLSF---DGRRGGTHVSAPDVILEGNGLRISHPA-EGPPLPDELTTVEVRFREENWQ 80
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*....
gi 117647249    658 IHGTNlPVTRKDFMIVLTNLERVLMQITYNLGMDAIfRLSSVNLESAVP 706
Cdd:smart00281   81 YYGGR-PVTREDLMMVLANLTAILIRATYSQQMAGS-RLSDVSLEVAVP 127
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2937-3089 3.87e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 126.76  E-value: 3.87e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2937 GTYFDGTGFAKAVGGFKVGLDLLVEFEFRTTRPTGVLLGVSSQ-KMDGMGIEMIDEKLMFHVDNGAGRFTAIYdagiPGH 3015
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQnGGDFLALELEDGRLVLRYDLGSGSLVLSS----KTP 76
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 117647249 3016 MCNGQWHKVTAKKIKNRLELVVDGNQVDAQSPNSASTSADTNDPVFVGGFPGGLNQFGLTTNIRFRGCIRSLKL 3089
Cdd:cd00110    77 LNDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCIRDLKV 150
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2336-2497 2.28e-32

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.45  E-value: 2.28e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2336 TIQFDGEGYALVSRPIRWYPNiSTVMFKFRTFSSSALLMYLATRDLKDFMSVELSDGHVKVSYDLGSGMTSVVSNQNHND 2415
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPRTR-LSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSSKTPLND 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2416 GKWKAFTLSRIQKQANISIvdidsNQEENVATSSSGNNFGLDLkaDDKIYFGGLPTlrnlSMKARPEVNVKKYSGCLKDI 2495
Cdd:cd00110    80 GQWHSVSVERNGRSVTLSV-----DGERVVESGSPGGSALLNL--DGPLYLGGLPE----DLKSPGLPVSPGFVGCIRDL 148

                  ..
gi 117647249 2496 EI 2497
Cdd:cd00110   149 KV 150
LamG smart00282
Laminin G domain;
2359-2499 1.38e-30

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 118.60  E-value: 1.38e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2359 TVMFKFRTFSSSALLMYLATRDLKDFMSVELSDGHVKVSYDLGSGMTSVVSNQNH-NDGKWKAFTLSRIQKQANISIvdi 2437
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPlNDGQWHRVAVERNGRSVTLSV--- 77
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 117647249   2438 dsNQEENVATSSSGNNFGLDLkaDDKIYFGGLPTlrnlSMKARPEVNVKKYSGCLKDIEISR 2499
Cdd:smart00282   78 --DGGNRVSGESPGGLTILNL--DGPLYLGGLPE----DLKLPPLPVTPGFRGCIRNLKVNG 131
LamG smart00282
Laminin G domain;
2960-3089 2.96e-30

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 117.83  E-value: 2.96e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2960 VEFEFRTTRPTGVLLGVSS-QKMDGMGIEMIDEKLMFHVDNGAGRFTAIYDagiPGHMCNGQWHKVTAKKIKNRLELVVD 3038
Cdd:smart00282    2 ISFSFRTTSPNGLLLYAGSkGGGDYLALELRDGRLVLRYDLGSGPARLTSD---PTPLNDGQWHRVAVERNGRSVTLSVD 78
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|..
gi 117647249   3039 G-NQVDAQSPnSASTSADTNDPVFVGGFPGGLNQFGLTTNIRFRGCIRSLKL 3089
Cdd:smart00282   79 GgNRVSGESP-GGLTILNLDGPLYLGGLPEDLKLPPLPVTPGFRGCIRNLKV 129
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2524-2686 1.26e-27

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 110.97  E-value: 1.26e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2524 TVSFPKPGFVELAAVSID-VGTEINLSFSTRNESGIILLGSGGTLTPprrkrrqttqaYYAIFLNKGRLEVHLSSGTRTm 2602
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPrTRLSISFSFRTTSPNGLLLYAGSQNGGD-----------FLALELEDGRLVLRYDLGSGS- 68
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2603 rkIVIKPePNLFHDGREHSVHVERTRGIFTVQIDEDR--RHMQNLTEEQPIEVKKLFVGGAPPEFQPSPLRNIPAFQGCV 2680
Cdd:cd00110    69 --LVLSS-KTPLNDGQWHSVSVERNGRSVTLSVDGERvvESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSPGFVGCI 145

                  ....*.
gi 117647249 2681 WNLVIN 2686
Cdd:cd00110   146 RDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
2789-2911 1.40e-27

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 109.82  E-value: 1.40e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2789 VRTEAESGLLFYMARINHaDFATVQLRNGFPYFSYDLGSGDTSTM-IPTKINDGQWHKIKIVRVKQEGILYVDDASSQTI 2867
Cdd:pfam02210    1 FRTRQPNGLLLYAGGGGS-DFLALELVNGRLVLRYDLGSGPESLLsSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSS 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 117647249  2868 S-PKKADILDVVGILYVGGLPINYTTRRIgPVTYSLDGCVRNLHM 2911
Cdd:pfam02210   80 LpPGESLLLNLNGPLYLGGLPPLLLLPAL-PVRAGFVGCIRDVRV 123
LamG smart00282
Laminin G domain;
2165-2307 5.70e-27

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 108.20  E-value: 5.70e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2165 NIVVHVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDLTIDDSYWYRIEASRTGRNGSISVral 2244
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSV--- 77
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 117647249   2245 DGPKASMVpstyhsVSPPGYTILDVDANamLFVGGLTGKIKKADAVRVITFTGCMGETYFDNK 2307
Cdd:smart00282   78 DGGNRVSG------ESPGGLTILNLDGP--LYLGGLPEDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
2146-2305 2.85e-25

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 104.04  E-value: 2.85e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2146 SGGDCVRtYRPEIKKGSYNNIVVHVKTAVADNLLFYLGSAKFIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDlTIDDSYW 2225
Cdd:cd00110     5 SGSSYVR-LPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGSGSLVLSSKT-PLNDGQW 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2226 YRIEASRTGRNGSISVralDGPKasmvpsTYHSVSPPGYTILDVDANamLFVGGLTGKIKKADAVRVITFTGCMGETYFD 2305
Cdd:cd00110    83 HSVSVERNGRSVTLSV---DGER------VVESGSPGGSALLNLDGP--LYLGGLPEDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_1 pfam00054
Laminin G domain;
2550-2691 4.58e-25

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 102.78  E-value: 4.58e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2550 FSTRNESGIILlgSGGTLTPPrrkrrqttqAYYAIFLNKGRLEVHLSSGTRtmrKIVIKPEPNLfHDGREHSVHVERTRG 2629
Cdd:pfam00054    1 FRTTEPSGLLL--YNGTQTER---------DFLALELRDGRLEVSYDLGSG---AAVVRSGDKL-NDGKWHSVELERNGR 65
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249  2630 IFTVQIDEDRRHM--QNLTEEQ-PIEVKKLFVGGAPPE-FQPSPLRNIPAFQGCVWNLVINSIPMD 2691
Cdd:pfam00054   66 SGTLSVDGEARPTgeSPLGATTdLDVDGPLYVGGLPSLgVKKRRLAISPSFDGCIRDVIVNGKPLD 131
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
2964-3089 2.41e-24

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 100.57  E-value: 2.41e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2964 FRTTRPTGVLLGVSSQKMDGMGIEMIDEKLMFHVDNGAGRFTAIYdagIPGHMCNGQWHKVTAKKIKNRLELVVDGNQVD 3043
Cdd:pfam02210    1 FRTRQPNGLLLYAGGGGSDFLALELVNGRLVLRYDLGSGPESLLS---SGKNLNDGQWHSVRVERNGNTLTLSVDGQTVV 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 117647249  3044 AQSPNSASTSADTNDPVFVGGFPGGLNQFGLTTNIRFRGCIRSLKL 3089
Cdd:pfam02210   78 SSLPPGESLLLNLNGPLYLGGLPPLLLLPALPVRAGFVGCIRDVRV 123
LamG smart00282
Laminin G domain;
2545-2687 3.76e-23

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 97.41  E-value: 3.76e-23
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   2545 EINLSFSTRNESGIILLGSGGTLTPprrkrrqttqaYYAIFLNKGRLEVHLSSGTRTmrkIVIKPEPNLFHDGREHSVHV 2624
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGD-----------YLALELRDGRLVLRYDLGSGP---ARLTSDPTPLNDGQWHRVAV 66
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 117647249   2625 ERTRGIFTVQID-EDRRHMQNLTEEQPIEVK-KLFVGGAPPEFQPSPLRNIPAFQGCVWNLVINS 2687
Cdd:smart00282   67 ERNGRSVTLSVDgGNRVSGESPGGLTILNLDgPLYLGGLPEDLKLPPLPVTPGFRGCIRNLKVNG 131
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
2364-2497 1.97e-21

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 92.48  E-value: 1.97e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2364 FRTFSSSALLMYlATRDLKDFMSVELSDGHVKVSYDLGSGMTSVVSNQNH-NDGKWKAFTLSRIQKQANISIvdidsNQE 2442
Cdd:pfam02210    1 FRTRQPNGLLLY-AGGGGSDFLALELVNGRLVLRYDLGSGPESLLSSGKNlNDGQWHSVRVERNGNTLTLSV-----DGQ 74
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 117647249  2443 ENVATSSSGNNFGLDLKADdkIYFGGLPTLRNLSMKARPEVnvkkYSGCLKDIEI 2497
Cdd:pfam02210   75 TVVSSLPPGESLLLNLNGP--LYLGGLPPLLLLPALPVRAG----FVGCIRDVRV 123
Laminin_G_1 pfam00054
Laminin G domain;
2964-3098 1.31e-19

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 87.37  E-value: 1.31e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2964 FRTTRPTGVLLGVSSQ-KMDGMGIEMIDEKLMFHVDNGAGRFTAiydaGIPGHMCNGQWHKVTAKKIKNRLELVVDGNQV 3042
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQtERDFLALELRDGRLEVSYDLGSGAAVV----RSGDKLNDGKWHSVELERNGRSGTLSVDGEAR 76
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  3043 -DAQSPNSASTSADTNDPVFVGGFPG-GLNQFGLTTNIRFRGCIRSLKLtkgTGKPLE 3098
Cdd:pfam00054   77 pTGESPLGATTDLDVDGPLYVGGLPSlGVKKRRLAISPSFDGCIRDVIV---NGKPLD 131
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
2550-2688 4.57e-18

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 82.85  E-value: 4.57e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2550 FSTRNESGIILLGSGGTltpprrkrrqttQAYYAIFLNKGRLEVHLSSGTRTMRKIVIKpepNLFHDGREHSVHVERTRG 2629
Cdd:pfam02210    1 FRTRQPNGLLLYAGGGG------------SDFLALELVNGRLVLRYDLGSGPESLLSSG---KNLNDGQWHSVRVERNGN 65
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 117647249  2630 IFTVQIDEDRRHMQNLTEEQPI--EVKKLFVGGAPPEFQPSPLRNIPAFQGCVWNLVINSI 2688
Cdd:pfam02210   66 TLTLSVDGQTVVSSLPPGESLLlnLNGPLYLGGLPPLLLLPALPVRAGFVGCIRDVRVNGE 126
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1665-2143 3.54e-17

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 88.92  E-value: 3.54e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1665 ERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNQDKTAERnLEELQKEIDR----------MLKELRSK--DLQTQ 1732
Cdd:TIGR04523  225 KQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQ-LSEKQKELEQnnkkikelekQLNQLKSEisDLNNQ 303
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1733 KEvaedelvaaEGLLKRVNKLFGEPRAQNEDMEKDL---QQKLAEYKNKLDDawdLLREATDKTRDanrlsaaNQKnmti 1809
Cdd:TIGR04523  304 KE---------QDWNKELKSELKNQEKKLEEIQNQIsqnNKIISQLNEQISQ---LKKELTNSESE-------NSE---- 360
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1810 letKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKlppmSEELSDKI-------DDLAQEIKdrRL 1882
Cdd:TIGR04523  361 ---KQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKL----NQQKDEQIkklqqekELLEKEIE--RL 431
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1883 AEKVFQAESHAAQLNDSSAVLDGILDEAKNISfnataafraySNIKDYIDEAEKVAREAK-ELAQGATKLATSPQGLLKe 1961
Cdd:TIGR04523  432 KETIIKNNSEIKDLTNQDSVKELIIKNLDNTR----------ESLETQLKVLSRSINKIKqNLEQKQKELKSKEKELKK- 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1962 dakgslqksfrILNEAKKLANDVKgnhndlnDLKTRLETadlrnsgllgaLNDTMDKLSAITNDTAAKLQAVKEKAREAN 2041
Cdd:TIGR04523  501 -----------LNEEKKELEEKVK-------DLTKKISS-----------LKEKIEKLESEKKEKESKISDLEDELNKDD 551
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2042 DT-AKAVLAQVKD-LHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNKI-----IADAGTSVRNLEQEADRLIDKLKPIKE 2114
Cdd:TIGR04523  552 FElKKENLEKEIDeKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKdlikeIEEKEKKISSLEKELEKAKKENEKLSS 631
                          490       500
                   ....*....|....*....|....*....
gi 117647249  2115 LEDNLKKNISEIKELINQARKQANSIKVS 2143
Cdd:TIGR04523  632 IIKNIKSKKNKLKQEVKQIKETIKEIRNK 660
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1674-2141 1.33e-16

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 87.00  E-value: 1.33e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1674 LEEFIKgLVQDAEAINEKAVQLN---ETLGNQDKTAERNLEELQKEIDRMLKELRSKDL---------QTQKEvaedelv 1741
Cdd:TIGR04523  144 LTEIKK-KEKELEKLNNKYNDLKkqkEELENELNLLEKEKLNIQKNIDKIKNKLLKLELllsnlkkkiQKNKS------- 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1742 aaegLLKRVNKLfgepRAQNEDMEKD---LQQKLAEYKNKLDDAWDLLREATDK-TRDANRLSAANQ---KNMTILETKK 1814
Cdd:TIGR04523  216 ----LESQISEL----KKQNNQLKDNiekKQQEINEKTTEISNTQTQLNQLKDEqNKIKKQLSEKQKeleQNNKKIKELE 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1815 EAIEGSKRQIENTLKEGNDILDeaNRLLGEINSVIDYVDDIKTKLPPMSE---ELSDKIDDLAQEI---------KDRRL 1882
Cdd:TIGR04523  288 KQLNQLKSEISDLNNQKEQDWN--KELKSELKNQEKKLEEIQNQISQNNKiisQLNEQISQLKKELtnsesenseKQREL 365
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1883 AEKVFQAESHAAQlNDSSavldgiLDEAKNISfnataafraySNIKDYIDEAEKVAREAKELAQGATKLATSPQGLLKEd 1962
Cdd:TIGR04523  366 EEKQNEIEKLKKE-NQSY------KQEIKNLE----------SQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKE- 427
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1963 akgslQKsfRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREA-- 2040
Cdd:TIGR04523  428 -----IE--RLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELkk 500
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2041 -NDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNKIIADAGTSVRNLEQEADRLIDKLKPIKELEDNL 2119
Cdd:TIGR04523  501 lNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEKEIDEKNKEIEELKQTQKSL 580
                          490       500
                   ....*....|....*....|..
gi 117647249  2120 KKNISEIKELINQARKQANSIK 2141
Cdd:TIGR04523  581 KKKQEEKQELIDQKEKEKKDLI 602
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
2170-2307 8.68e-16

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 76.30  E-value: 8.68e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2170 VKTAVADNLLFYLGSAKfIDFLAIEMRKGKVSFLWDVGSGVGRVEYPDLTIDDSYWYRIEASRTGRNGSISVralDGpka 2249
Cdd:pfam02210    1 FRTRQPNGLLLYAGGGG-SDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSV---DG--- 73
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  2250 smvpSTYHSVSPPGYTiLDVDANAMLFVGGLTGKIKKADAVRVITFTGCMGETYFDNK 2307
Cdd:pfam02210   74 ----QTVVSSLPPGES-LLLNLNGPLYLGGLPPLLLLPALPVRAGFVGCIRDVRVNGE 126
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1609-2145 2.99e-14

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 79.34  E-value: 2.99e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1609 ENTTQELKHLLSPQRAPERLIQlAEGNVNTLVMETNELLTRATKVTADGEQtgqDAERTNSRAESLEEFIKGLVQDAEAI 1688
Cdd:PRK03918  165 KNLGEVIKEIKRRIERLEKFIK-RTENIEELIKEKEKELEEVLREINEISS---ELPELREELEKLEKEVKELEELKEEI 240
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1689 NEKAVQLNETLGNQdktaeRNLEELQKEIDRMLKELRSKDLQTQKEVAE-DELvaaEGLLKRVNKLfGEPRAQNEDMEKD 1767
Cdd:PRK03918  241 EELEKELESLEGSK-----RKLEEKIRELEERIEELKKEIEELEEKVKElKEL---KEKAEEYIKL-SEFYEEYLDELRE 311
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1768 LQQKLAEYKNKLDDAWDLLREATDKTRDANRLSaanqKNMTILETKKEAIEGSKRQIEnTLKEgndILDEANRL------ 1841
Cdd:PRK03918  312 IEKRLSRLEEEINGIEERIKELEEKEERLEELK----KKLKELEKRLEELEERHELYE-EAKA---KKEELERLkkrltg 383
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1842 --LGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKDRRLA-EKVFQAESH----AAQLNDssavldgilDEAKNIs 1914
Cdd:PRK03918  384 ltPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAiEELKKAKGKcpvcGRELTE---------EHRKEL- 453
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1915 fnataafraysnIKDYIDEAEKVAREAKELAQGATKLATSPQGLLKEDAKGS-LQKSFRILNEAKKLANDVKG-NHNDLN 1992
Cdd:PRK03918  454 ------------LEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESeLIKLKELAEQLKELEEKLKKyNLEELE 521
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1993 DLKTRLETADLRNSGLLG---ALNDTMDKLSAITNdtaaKLQAVKEKAREANDTAKAVLAQ--------VKDLHQNLDGL 2061
Cdd:PRK03918  522 KKAEEYEKLKEKLIKLKGeikSLKKELEKLEELKK----KLAELEKKLDELEEELAELLKEleelgfesVEELEERLKEL 597
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2062 KQNYNKLadsvaktnavvkdpsknkiiadagTSVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIK 2141
Cdd:PRK03918  598 EPFYNEY------------------------LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELE 653

                  ....
gi 117647249 2142 VSVS 2145
Cdd:PRK03918  654 KKYS 657
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1665-2164 4.72e-14

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 78.86  E-value: 4.72e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1665 ERTNSRAESLE--EFIKGlvqdaEAINEKAVQLNETLGNQDKTAERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELva 1742
Cdd:pfam02463  154 RRLEIEEEAAGsrLKRKK-----KEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYL-- 226
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1743 aegLLKRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKtrdaNRLSAANQKNMTILETKKEAIEGSKR 1822
Cdd:pfam02463  227 ---LYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKE----NKEEEKEKKLQEEELKLLAKEEEELK 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1823 QIENTLKEGNDILDEANRLLGEINSVIdyvdDIKTKLPPMSEELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLNDSSAV 1902
Cdd:pfam02463  300 SELLKLERRKVDDEEKLKESEKEKKKA----EKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEEL 375
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1903 LDGILDEAKNISfnatAAFRAYSNIKDYIDEAEKVAREAKELAQgatklatspqgLLKEDAKGSLQKSFRILNEAKKLAN 1982
Cdd:pfam02463  376 LAKKKLESERLS----SAAKLKEEELELKSEEEKEAQLLLELAR-----------QLEDLLKEEKKEELEILEEEEESIE 440
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1983 DVKGNHN-------DLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDLH 2055
Cdd:pfam02463  441 LKQGKLTeekeeleKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGV 520
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2056 QNLDGLKQNYNKL------ADSVAKTNAVVKDPSKNKIIADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNIS--EIK 2127
Cdd:pfam02463  521 GGRIISAHGRLGDlgvaveNYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAvlEID 600
                          490       500       510
                   ....*....|....*....|....*....|....*..
gi 117647249  2128 ELINQARKQANSIKVSVSSGGDCVRTYRPEIKKGSYN 2164
Cdd:pfam02463  601 PILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKL 637
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
913-960 5.03e-14

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 68.53  E-value: 5.03e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 117647249  913 PCRCNINGSFSEICHTRTGQCECRPNVQGRHCDECKPETFGLQL-GRGC 960
Cdd:cd00055     1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSqGGGC 49
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1608-2031 7.54e-14

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 78.16  E-value: 7.54e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1608 LENTTQELKHLLSPQRAPerlIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEA 1687
Cdd:PRK02224  319 LEDRDEELRDRLEECRVA---AQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEE 395
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1688 INEKAVQLNETLGNqdktAERNLEELQKEIDRM---LKELRSkDLQTQKE-VAEDELVAAEGllK------------RVN 1751
Cdd:PRK02224  396 LRERFGDAPVDLGN----AEDFLEELREERDELrerEAELEA-TLRTARErVEEAEALLEAG--KcpecgqpvegspHVE 468
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1752 KLfGEPRAQNEDMEKDLQQ---KLAEYKNKLDDAWDLLREATDKTRDANRLSAANQknmtILETKKEAIEGSKRQIENTL 1828
Cdd:PRK02224  469 TI-EEDRERVEELEAELEDleeEVEEVEERLERAEDLVEAEDRIERLEERREDLEE----LIAERRETIEEKRERAEELR 543
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1829 KEGNDILDEA------------------------NRLLGEINSVIDYVDDIktklppmsEELSDKIDDLAQEIKDRR--- 1881
Cdd:PRK02224  544 ERAAELEAEAeekreaaaeaeeeaeeareevaelNSKLAELKERIESLERI--------RTLLAAIADAEDEIERLRekr 615
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1882 --LAEKVFQAESHAAQLNDSSAVLDGILDEAKnisfnataafraysnikdyIDEA-EKVAREAKELAQGATKLATspqgl 1958
Cdd:PRK02224  616 eaLAELNDERRERLAEKRERKRELEAEFDEAR-------------------IEEArEDKERAEEYLEQVEEKLDE----- 671
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1959 lKEDAKGSLQKSF-RILNEAKKLaNDVKGNHNDLNDLKTRLET----------------ADLR--NSGLLGA-LNDTMDK 2018
Cdd:PRK02224  672 -LREERDDLQAEIgAVENELEEL-EELRERREALENRVEALEAlydeaeelesmygdlrAELRqrNVETLERmLNETFDL 749
                         490
                  ....*....|...
gi 117647249 2019 LsaITNDTAAKLQ 2031
Cdd:PRK02224  750 V--YQNDAYSHIE 760
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1602-2141 2.55e-13

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 76.55  E-value: 2.55e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1602 YKILYGLENTTQELKHLLSPQRAPERliqlaegnvntlvmETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGL 1681
Cdd:pfam02463  225 YLLYLDYLKLNEERIDLLQELLRDEQ--------------EEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKL 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1682 VQDAEAINEKAvqlNETLGNQDKTAERNLEELQKE--------------IDRMLKELRSKDLQTQ-----KEVAEDELVA 1742
Cdd:pfam02463  291 LAKEEEELKSE---LLKLERRKVDDEEKLKESEKEkkkaekelkkekeeIEELEKELKELEIKREaeeeeEEELEKLQEK 367
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1743 AEGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKlddawdlLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKR 1822
Cdd:pfam02463  368 LEQLEEELLAKKKLESERLSSAAKLKEEELELKSEE-------EKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIE 440
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1823 QIENTLKEGNDILDEANRLLGEINSVIDYVDDI-----KTKLPPMSEELSDKID---DLAQEIKDRRLAEKVFQAESHAA 1894
Cdd:pfam02463  441 LKQGKLTEEKEELEKQELKLLKDELELKKSEDLlketqLVKLQEQLELLLSRQKleeRSQKESKARSGLKVLLALIKDGV 520
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1895 QLNDSSAVLDGILDE---AKNISFNATAAFRAYSNIKDYIDEAEKVAREAKELAQGATKLATSPQGLLKEDAKGSLQKSF 1971
Cdd:pfam02463  521 GGRIISAHGRLGDLGvavENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEID 600
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1972 RILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQV 2051
Cdd:pfam02463  601 PILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQE 680
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2052 K--DLHQNLDGLK----QNYNKLADSVAKTNAVVKDPSKNKIIAD--------AGTSVRNLEQEADRLIDKLKPIKELED 2117
Cdd:pfam02463  681 LqeKAESELAKEEilrrQLEIKKKEQREKEELKKLKLEAEELLADrvqeaqdkINEELKLLKQKIDEEEEEEEKSRLKKE 760
                          570       580
                   ....*....|....*....|....
gi 117647249  2118 NLKKNISEIKELINQARKQANSIK 2141
Cdd:pfam02463  761 EKEEEKSELSLKEKELAEEREKTE 784
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1646-2132 2.60e-13

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 76.23  E-value: 2.60e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1646 LLTRATKVTADGEQTGQ----DAERTNSRAESLEEFIKGLVQ-DAEA-INEKAVQLNETlgnqDKTAERNLEELQKEIDR 1719
Cdd:PRK02224   82 HIERRVRLSGDRATTAKcvleTPEGTIDGARDVREEVTELLRmDAEAfVNCAYVRQGEV----NKLINATPSDRQDMIDD 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1720 MLKELRskdLQTQKEVAEDELVAAEGLLKRVNKLFGEPRAQNEDME-KDLQQKLAEYKNKLDDAWDLL------REATDK 1792
Cdd:PRK02224  158 LLQLGK---LEEYRERASDARLGVERVLSDQRGSLDQLKAQIEEKEeKDLHERLNGLESELAELDEEIeryeeqREQARE 234
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1793 TRDA--NRLSAANQK--------------NMTILETKKE------AIEGSKRQIENTLKEGNDILDEANRLLGEINSVID 1850
Cdd:PRK02224  235 TRDEadEVLEEHEERreeletleaeiedlRETIAETEREreelaeEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEA 314
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1851 YVDDIKTKLppmsEELSDKIDDLAQEIKD-----RRLAEKVFQAESHAAQLNDSSAVLDGILDEAKN------------- 1912
Cdd:PRK02224  315 RREELEDRD----EELRDRLEECRVAAQAhneeaESLREDADDLEERAEELREEAAELESELEEAREavedrreeieele 390
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1913 --------------ISFNATAAF---------RAYSNIKDY---IDEAEKVAREAKELaQGATK-------LATSPQGLL 1959
Cdd:PRK02224  391 eeieelrerfgdapVDLGNAEDFleelreerdELREREAELeatLRTARERVEEAEAL-LEAGKcpecgqpVEGSPHVET 469
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1960 KEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLK---TRLETADLRNSGL----------LGALNDTMDKLSAITNDT 2026
Cdd:PRK02224  470 IEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVeaeDRIERLEERREDLeeliaerretIEEKRERAEELRERAAEL 549
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2027 AAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLK---QNYNKLADSVAKtnavvkdpsknkiiadagtsVRNLEQEAD 2103
Cdd:PRK02224  550 EAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKeriESLERIRTLLAA--------------------IADAEDEIE 609
                         570       580
                  ....*....|....*....|....*....
gi 117647249 2104 RLIDKLKPIKELEDNLKKNISEIKELINQ 2132
Cdd:PRK02224  610 RLREKREALAELNDERRERLAEKRERKRE 638
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1694-2054 3.43e-13

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 74.17  E-value: 3.43e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1694 QLNETLGNQDKTAERNLEELQKEIDRMLKELRskDLQTQKEVAEDELvaaegllkrvnklfgepraqnedmeKDLQQKLA 1773
Cdd:COG4372    17 GLRPKTGILIAALSEQLRKALFELDKLQEELE--QLREELEQAREEL-------------------------EQLEEELE 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1774 EYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEgskRQIENTLKEGNDILDEANRLLGEINSVIDYVD 1853
Cdd:COG4372    70 QARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQ---EELEELQKERQDLEQQRKQLEAQIAELQSEIA 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1854 DIKTKLppmsEELSDKIDDLAQEIKDrrlaekvFQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDE 1933
Cdd:COG4372   147 EREEEL----KELEEQLESLQEELAA-------LEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRE 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1934 AEKVAREAK---ELAQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLG 2010
Cdd:COG4372   216 LAEELLEAKdslEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALEL 295
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 117647249 2011 ALNDTMDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDL 2054
Cdd:COG4372   296 KLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAEL 339
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
914-960 5.47e-13

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 65.45  E-value: 5.47e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249   914 CRCNINGSFSEICHTRTGQCECRPNVQGRHCDECKPETFGLQ--LGRGC 960
Cdd:pfam00053    1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPsdPPQGC 49
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1642-2129 6.76e-13

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 75.08  E-value: 6.76e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1642 ETNELLTRATKVTADgeQTGQDAERTNSRAESLEEFIKGLVQD-AEAINEKavqlnETLGNQDKTAERNLEELQKEIDRM 1720
Cdd:PRK02224  226 EEQREQARETRDEAD--EVLEEHEERREELETLEAEIEDLRETiAETERER-----EELAEEVRDLRERLEELEEERDDL 298
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1721 LKE------------LRSKDLQTQKEVAEDELV-----------AAEGLLKRVNKLfgEPRAQN-----EDMEKDLQ--- 1769
Cdd:PRK02224  299 LAEaglddadaeaveARREELEDRDEELRDRLEecrvaaqahneEAESLREDADDL--EERAEElreeaAELESELEear 376
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1770 QKLAEYKNKLDDAWDLLREATDKTRDAN-RLSAANQKNMTILETKkEAIEGSKRQIENTLKEGNDILDEANRLLGEIN-- 1846
Cdd:PRK02224  377 EAVEDRREEIEELEEEIEELRERFGDAPvDLGNAEDFLEELREER-DELREREAELEATLRTARERVEEAEALLEAGKcp 455
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1847 ---------SVIDYVDDIKTKLppmsEELSDKIDDLAQEIKDrrLAEKVFQAEShaaqLNDSSAVLDGILDEAKNISfna 1917
Cdd:PRK02224  456 ecgqpvegsPHVETIEEDRERV----EELEAELEDLEEEVEE--VEERLERAED----LVEAEDRIERLEERREDLE--- 522
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1918 taafraySNIKDYIDEAEKVAREAKELAQGATKLATSPQGllKEDAKGSLQKsfrilnEAKKLANDVKGNHNDLNDLKTR 1997
Cdd:PRK02224  523 -------ELIAERRETIEEKRERAEELRERAAELEAEAEE--KREAAAEAEE------EAEEAREEVAELNSKLAELKER 587
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1998 LEtadlrnsgllgALNDTMDKLSAITN--DTAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGlkqnynKLadsvakt 2075
Cdd:PRK02224  588 IE-----------SLERIRTLLAAIADaeDEIERLREKREALAELNDERRERLAEKRERKRELEA------EF------- 643
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249 2076 navvkDPSKnkiIADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNI----SEIKEL 2129
Cdd:PRK02224  644 -----DEAR---IEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIgaveNELEEL 693
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
914-960 9.87e-13

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 64.64  E-value: 9.87e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 117647249    914 CRCNINGSFSEICHTRTGQCECRPNVQGRHCDECKPETFGLQlGRGC 960
Cdd:smart00180    1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDG-PPGC 46
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1644-2146 1.17e-12

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 73.90  E-value: 1.17e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1644 NELLTRATKVTADGEQTGQDAERTNSraesLEEFIKGLVQDAEAINEKAV-------QLNETL---GNQDKTAERNLEEL 1713
Cdd:TIGR04523   47 NELKNKEKELKNLDKNLNKDEEKINN----SNNKIKILEQQIKDLNDKLKknkdkinKLNSDLskiNSEIKNDKEQKNKL 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1714 QKEIDRMLKELRSKDlQTQKEVAeDELVAAEGLLKRVNKLFGEPRAQNEDMEKdlqqklaeYKNKLDDAWDLLREATDKT 1793
Cdd:TIGR04523  123 EVELNKLEKQKKENK-KNIDKFL-TEIKKKEKELEKLNNKYNDLKKQKEELEN--------ELNLLEKEKLNIQKNIDKI 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1794 RdaNRLSAANQKnMTILETKKEAIEGSKRQIENtLKEGNDIL-DEANRLLGEINsvidyvdDIKTKLPPMSEELSDKIDD 1872
Cdd:TIGR04523  193 K--NKLLKLELL-LSNLKKKIQKNKSLESQISE-LKKQNNQLkDNIEKKQQEIN-------EKTTEISNTQTQLNQLKDE 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1873 LaQEIKDRrLAEKVFQAESHAAQLNDSSAVLdgildeaknisfnataafraySNIKDYIDEAEKVareaKElaQGATKLa 1952
Cdd:TIGR04523  262 Q-NKIKKQ-LSEKQKELEQNNKKIKELEKQL---------------------NQLKSEISDLNNQ----KE--QDWNKE- 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1953 tspqglLKEDAKgSLQKSFRI----LNEAKKLANDVKgnhNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAA 2028
Cdd:TIGR04523  312 ------LKSELK-NQEKKLEEiqnqISQNNKIISQLN---EQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQS 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2029 KLQAVKEKAREANDTaKAVLAQVKDLHQNLD----GLKQNYNK-------LADSVAKTNAVVKD-----PSKNKIIADAG 2092
Cdd:TIGR04523  382 YKQEIKNLESQINDL-ESKIQNQEKLNQQKDeqikKLQQEKELlekeierLKETIIKNNSEIKDltnqdSVKELIIKNLD 460
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2093 TSVRNLEQEADRLIDKLKPIK-ELEDN---------------------------LKKNISEIKELINQARKQANSIKVSV 2144
Cdd:TIGR04523  461 NTRESLETQLKVLSRSINKIKqNLEQKqkelkskekelkklneekkeleekvkdLTKKISSLKEKIEKLESEKKEKESKI 540

                   ..
gi 117647249  2145 SS 2146
Cdd:TIGR04523  541 SD 542
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1678-2141 3.16e-12

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 72.79  E-value: 3.16e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1678 IKGLVQDAEAiNEKAVQlnETLGNQD-KTAERNLEELQKEIDRMLKEL-----RSKDLQTQKEVAEDELvaaEGLLKRVN 1751
Cdd:PRK03918  137 IDAILESDES-REKVVR--QILGLDDyENAYKNLGEVIKEIKRRIERLekfikRTENIEELIKEKEKEL---EEVLREIN 210
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1752 KLfgepraqnEDMEKDLQQKLAEYKNKLddawdllrEATDKTRdaNRLSAanqknmtiLETKKEAIEGSKRQIENTLKEg 1831
Cdd:PRK03918  211 EI--------SSELPELREELEKLEKEV--------KELEELK--EEIEE--------LEKELESLEGSKRKLEEKIRE- 263
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1832 ndildeanrllgeinsvidyvddiktkLPPMSEELSDKIDDLaqEIKDRRLAEKVFQAESHAAQlndsSAVLDGILDEAK 1911
Cdd:PRK03918  264 ---------------------------LEERIEELKKEIEEL--EEKVKELKELKEKAEEYIKL----SEFYEEYLDELR 310
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1912 NISFNATAAFRAYSNIKDYIDEAEKVAREAKELaqgatklatspQGLLKEdakgsLQKSFRILNEAKKLANDVKGNHNDL 1991
Cdd:PRK03918  311 EIEKRLSRLEEEINGIEERIKELEEKEERLEEL-----------KKKLKE-----LEKRLEELEERHELYEEAKAKKEEL 374
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1992 NDLKTRLEtadlrnsgllgalNDTMDKLsaitndtAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADS 2071
Cdd:PRK03918  375 ERLKKRLT-------------GLTPEKL-------EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKA 434
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 117647249 2072 VAK---TNAVVKDPSKNKIIADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQaRKQANSIK 2141
Cdd:PRK03918  435 KGKcpvCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKL-KELAEQLK 506
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
963-1007 3.73e-12

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 63.10  E-value: 3.73e-12
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 117647249    963 CNCNSFGSKSFDCEA-SGQCWCQPGVAGKKCDRCAHGYFNFQEGGC 1007
Cdd:smart00180    1 CDCDPGGSASGTCDPdTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1610-1943 3.93e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.78  E-value: 3.93e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1610 NTTQELKHLLSPQRAPERLIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQD----- 1684
Cdd:TIGR02168  674 ERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERiaqls 753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1685 ---AEAINEKAVQLNETLGNQD--KTAERNLEELQKEID----------RMLKELRsKDLQTQKEVAEDELVAAEGLLKR 1749
Cdd:TIGR02168  754 kelTELEAEIEELEERLEEAEEelAEAEAEIEELEAQIEqlkeelkalrEALDELR-AELTLLNEEAANLRERLESLERR 832
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1750 VnklfGEPRAQNEDME---KDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIEN 1826
Cdd:TIGR02168  833 I----AATERRLEDLEeqiEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELES 908
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1827 TLKEGNDILDEANRLLGEINSVIDyvdDIKTKLPPMSEELSDKIDDLAQEIkdrrlAEKVFQAESHAAQLNDSSAVLDGI 1906
Cdd:TIGR02168  909 KRSELRRELEELREKLAQLELRLE---GLEVRIDNLQERLSEEYSLTLEEA-----EALENKIEDDEEEARRRLKRLENK 980
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 117647249  1907 LDEAKNISFNATAAFRAYSNIKDYIDEAEKVAREAKE 1943
Cdd:TIGR02168  981 IKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKE 1017
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1608-1879 9.38e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 71.24  E-value: 9.38e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1608 LENTTQELKHLLSPQRAPERLIQLAEGNVNTLVMETNELLTRATK-------VTADGEQTGQDAERTNSRAESLEEFIKG 1680
Cdd:TIGR02168  728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEaeeelaeAEAEIEELEAQIEQLKEELKALREALDE 807
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1681 LVQDAEAINEKAVQLNETLG---NQDKTAERNLEELQKEIDRMLKELRS-----KDLQTQKEVAEDELvaaEGLLKRVNK 1752
Cdd:TIGR02168  808 LRAELTLLNEEAANLRERLEsleRRIAATERRLEDLEEQIEELSEDIESlaaeiEELEELIEELESEL---EALLNERAS 884
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1753 LFGEPRAQNEDMEkDLQQKLAEYKNKLDDAWDLLREATDKTRDAN-RLSAANQKNMTILE--------------TKKEAI 1817
Cdd:TIGR02168  885 LEEALALLRSELE-ELSEELRELESKRSELRRELEELREKLAQLElRLEGLEVRIDNLQErlseeysltleeaeALENKI 963
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 117647249  1818 EGSKRQIENTLKEGNDILDEanrlLGEIN-SVIDYVDDIKTKLppmsEELSDKIDDLAQEIKD 1879
Cdd:TIGR02168  964 EDDEEEARRRLKRLENKIKE----LGPVNlAAIEEYEELKERY----DFLTAQKEDLTEAKET 1018
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1056-1104 1.74e-11

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 61.22  E-value: 1.74e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249  1056 CNCSTVGSLASQCNVNTGQCSCHPKFSGMKCSECSRGHWNYPLCTLCDC 1104
Cdd:pfam00053    1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1658-2073 2.56e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 70.09  E-value: 2.56e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1658 EQtGQDAERTNSRAESLEEFIkglvQDAEAINEKAVQLNETLGNQDKTAErNLEELQ---KEIDRMLKEL--------RS 1726
Cdd:TIGR02168  142 EQ-GKISEIIEAKPEERRAIF----EEAAGISKYKERRKETERKLERTRE-NLDRLEdilNELERQLKSLerqaekaeRY 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1727 KDLQTQKEVAE-----DELVAAEGLLKRVNKLFgeprAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANR--L 1799
Cdd:TIGR02168  216 KELKAELRELElallvLRLEELREELEELQEEL----KEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKelY 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1800 SAANQKNmtILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINsvidyvdDIKTKLPPMSEELSDKIDDLAQEIKd 1879
Cdd:TIGR02168  292 ALANEIS--RLEQQKQILRERLANLERQLEELEAQLEELESKLDELA-------EELAELEEKLEELKEELESLEAELE- 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1880 rRLAEKVFQAESHAAQLNDSsavldgiLDEAKNisfnataafraysnikDYIDEAEKVAREAKELAQGATKLatspqgll 1959
Cdd:TIGR02168  362 -ELEAELEELESRLEELEEQ-------LETLRS----------------KVAQLELQIASLNNEIERLEARL-------- 409
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1960 kEDAKGSLQKSFRILNEAKKlandvKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSaitndtaaKLQAVKEKARE 2039
Cdd:TIGR02168  410 -ERLEDRRERLQQEIEELLK-----KLEEAELKELQAELEELEEELEELQEELERLEEALE--------ELREELEEAEQ 475
                          410       420       430
                   ....*....|....*....|....*....|....
gi 117647249  2040 ANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVA 2073
Cdd:TIGR02168  476 ALDAAERELAQLQARLDSLERLQENLEGFSEGVK 509
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1673-2136 3.42e-11

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 69.03  E-value: 3.42e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1673 SLEEFIKGLVqdAEAINEKAVQLNETLGNQDKTAERNLEELQKEIDRMLKELRS--------KDLQTQKEVAEDELVAAE 1744
Cdd:COG4717    38 TLLAFIRAML--LERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEyaelqeelEELEEELEELEAELEELR 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1745 GLLKRVNKLFGEPRAQNEdmEKDLQQKLAEYKNKLDDawdLLREATDKTRDANRLSAANQKnmtiLETKKEAIEGSKRQI 1824
Cdd:COG4717   116 EELEKLEKLLQLLPLYQE--LEALEAELAELPERLEE---LEERLEELRELEEELEELEAE----LAELQEELEELLEQL 186
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1825 EN-TLKEGNDILDEANRLLGEINSVIDYVDDIKTKLppmsEELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLNdSSAVL 1903
Cdd:COG4717   187 SLaTEEELQDLAEELEELQQRLAELEEELEEAQEEL----EELEEELEQLENELEAAALEERLKEARLLLLIAA-ALLAL 261
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1904 DGILDEAKNISFNATAAF---------------RAYSNIKDYIDEAEKVAREaKELAQGATKLATSPQGLLKEDAKGSLQ 1968
Cdd:COG4717   262 LGLGGSLLSLILTIAGVLflvlgllallflllaREKASLGKEAEELQALPAL-EELEEEELEELLAALGLPPDLSPEELL 340
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1969 KSFRILNEAKKLandvkgnHNDLNDLKTRLETADLRNsgllgALNDTMDKLSAITNDTAAKLQAVKEKAREAndtakavL 2048
Cdd:COG4717   341 ELLDRIEELQEL-------LREAEELEEELQLEELEQ-----EIAALLAEAGVEDEEELRAALEQAEEYQEL-------K 401
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2049 AQVKDLHQNLDGLKQNYNKLADSVAKTNavvkdpsknkiiadagtsvrnLEQEADRLIDKLKPIKELEDNLKKNISEIKE 2128
Cdd:COG4717   402 EELEELEEQLEELLGELEELLEALDEEE---------------------LEEELEELEEELEELEEELEELREELAELEA 460

                  ....*...
gi 117647249 2129 LINQARKQ 2136
Cdd:COG4717   461 ELEQLEED 468
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
962-1007 4.55e-11

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 60.06  E-value: 4.55e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 117647249  962 PCNCNSFGSKSFDCEA-SGQCWCQPGVAGKKCDRCAHGYFNF--QEGGC 1007
Cdd:cd00055     1 PCDCNGHGSLSGQCDPgTGQCECKPNTTGRRCDRCAPGYYGLpsQGGGC 49
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1416-1462 4.80e-11

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 60.06  E-value: 4.80e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 117647249  1416 CQCNGHSS---QCDPETSVCQnCQHHTAGDFCERCALGYYGIVRGLPNDC 1462
Cdd:pfam00053    1 CDCNPHGSlsdTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
860-912 4.97e-11

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 60.06  E-value: 4.97e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  860 PCQCNDNldYSIPGSCDSLSGSCLiCKPGTTGRYCELCADGYFGDAVNAKNCQ 912
Cdd:cd00055     1 PCDCNGH--GSLSGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1660-2137 5.20e-11

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 68.64  E-value: 5.20e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1660 TGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNEtlgnqdktAERNLEELQKEIDRMLKELRSKDLQTQKEVAEDE 1739
Cdd:COG4717    62 QGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEE--------LEEELEELEAELEELREELEKLEKLLQLLPLYQE 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1740 LVAAEGLLK----RVNKLFG--EPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREAT-----DKTRDANRLSAANQKNMT 1808
Cdd:COG4717   134 LEALEAELAelpeRLEELEErlEELRELEEELEELEAELAELQEELEELLEQLSLATeeelqDLAEELEELQQRLAELEE 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1809 ILETKKEAIEGSKRQIENTLKEGNDI-----LDEANRLLGEINSVI-------DYVDDIKTKLPPMSEELSDKIDDLAQE 1876
Cdd:COG4717   214 ELEEAQEELEELEEELEQLENELEAAaleerLKEARLLLLIAAALLallglggSLLSLILTIAGVLFLVLGLLALLFLLL 293
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1877 IKDRRLAEKVFQAESHAAQLND-SSAVLDGILDEAK-NISFNATAAFRAYSNIKDYIDEAEKVAREAKELAQGAtkLATS 1954
Cdd:COG4717   294 AREKASLGKEAEELQALPALEElEEEELEELLAALGlPPDLSPEELLELLDRIEELQELLREAEELEEELQLEE--LEQE 371
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1955 PQGLLKEDAKGSLQKSFRILNEAKKlandvkgnhndLNDLKTRLETADLRnsglLGALNDTMDKLSAITNDTaaklqAVK 2034
Cdd:COG4717   372 IAALLAEAGVEDEEELRAALEQAEE-----------YQELKEELEELEEQ----LEELLGELEELLEALDEE-----ELE 431
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2035 EKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSvaktnavvkdpsknkiiadagTSVRNLEQEADRLIDKLKpiKE 2114
Cdd:COG4717   432 EELEELEEELEELEEELEELREELAELEAELEQLEED---------------------GELAELLQELEELKAELR--EL 488
                         490       500
                  ....*....|....*....|...
gi 117647249 2115 LEDNLKKNIseIKELINQARKQA 2137
Cdd:COG4717   489 AEEWAALKL--ALELLEEAREEY 509
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1707-2114 5.85e-11

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 68.45  E-value: 5.85e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1707 ERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAA-EGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLDDAWDL 1785
Cdd:COG5185   218 ESTLLEKAKEIINIEEALKGFQDPESELEDLAQTSDKlEKLVEQNTDLRLEKLGENAESSKRLNENANNLIKQFENTKEK 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1786 LREATDK------TRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDdiktkl 1859
Cdd:COG5185   298 IAEYTKSidikkaTESLEEQLAAAEAEQELEESKRETETGIQNLTAEIEQGQESLTENLEAIKEEIENIVGEVE------ 371
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1860 ppmSEELSDKIDDLAQEIkdRRLAEKVFQAesHAAQLNDSSAVLDGILDEAKNISfnataafraySNIKDYIDEAEKVAR 1939
Cdd:COG5185   372 ---LSKSSEELDSFKDTI--ESTKESLDEI--PQNQRGYAQEILATLEDTLKAAD----------RQIEELQRQIEQATS 434
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1940 EAKELAQGATKLATSpqgLLKEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETAdlrnsglLGALNDTMDKL 2019
Cdd:COG5185   435 SNEEVSKLLNELISE---LNKVMREADEESQSRLEEAYDEINRSVRSKKEDLNEELTQIESR-------VSTLKATLEKL 504
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2020 SAITNdtaAKLQAVKEKAREANDTAKAVLAQVKDLHqnldgLKQNYNKLADSVAKTNavvkDPSKNKIIADAGTSVRNLE 2099
Cdd:COG5185   505 RAKLE---RQLEGVRSKLDQVAESLKDFMRARGYAH-----ILALENLIPASELIQA----SNAKTDGQAANLRTAVIDE 572
                         410
                  ....*....|....*
gi 117647249 2100 QEADRLIDKLKPIKE 2114
Cdd:COG5185   573 LTQYLSTIESQQARE 587
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1635-1912 6.44e-11

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 66.09  E-value: 6.44e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1635 NVNTLVMETNELLTRATKVTadgEQTGQDAERTNSRAESLEEF----------IKGLVQDAEAINEKAVQLNETLgnQDK 1704
Cdd:COG1340     2 KTDELSSSLEELEEKIEELR---EEIEELKEKRDELNEELKELaekrdelnaqVKELREEAQELREKRDELNEKV--KEL 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1705 TAERNleELQKEIDRMLKELRSKDLQTQKEVAEDELVAAegLLKRVNKLfgEPRAQNEDM----EKDLQQKLAEYKNKLD 1780
Cdd:COG1340    77 KEERD--ELNEKLNELREELDELRKELAELNKAGGSIDK--LRKEIERL--EWRQQTEVLspeeEKELVEKIKELEKELE 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1781 DAwdllREATDKTRDANRLSAAnqknmtiLETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLP 1860
Cdd:COG1340   151 KA----KKALEKNEKLKELRAE-------LKELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEIV 219
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 117647249 1861 PMSE---ELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLNDSSAVLDGILDEAKN 1912
Cdd:COG1340   220 EAQEkadELHEEIIELQKELRELRKELKKLRKKQRALKREKEKEELEEKAEEIFE 274
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1657-2030 8.42e-11

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 66.85  E-value: 8.42e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1657 GEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQlnetlgnQDKTAERNLEELQKEIDRMLKELRSkdLQTQKEVA 1736
Cdd:COG4372     1 GDRLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALF-------ELDKLQEELEQLREELEQAREELEQ--LEEELEQA 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1737 EDELVAAEGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEA 1816
Cdd:COG4372    72 RSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEE 151
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1817 IEGSKRQIENT------LKEGNDILDEA------NRLLGEINSVIDYVdDIKTKLPPMSEELSDKIDDLAQEIKDRRLAE 1884
Cdd:COG4372   152 LKELEEQLESLqeelaaLEQELQALSEAeaeqalDELLKEANRNAEKE-EELAEAEKLIESLPRELAEELLEAKDSLEAK 230
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1885 KVFQaesHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDEAEKVAREAKELAQGATKLATSPQGLLKEDAk 1964
Cdd:COG4372   231 LGLA---LSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAA- 306
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249 1965 gslQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKL 2030
Cdd:COG4372   307 ---LSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVAD 369
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1670-2146 1.06e-10

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 68.32  E-value: 1.06e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1670 RAESLEEFIKglvQDAEAINE---KAVQ-LNETLGNQDKTAE------RNLEELQKEIDRMLKELRSKDLQ--TQKEVAE 1737
Cdd:PTZ00440 1085 KVEALLKKID---ENKNKLIEiknKSHEhVVNADKEKNKQTEhynkkkKSLEKIYKQMEKTLKELENMNLEdiTLNEVNE 1161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1738 DELVAAEGLLKRVNKLFGEPRAQNE------------------DMEKDLQQKLA--EYKNKLDDAWDLLREATDKTRDAN 1797
Cdd:PTZ00440 1162 IEIEYERILIDHIVEQINNEAKKSKtimeeiesykkdidqvkkNMSKERNDHLTtfEYNAYYDKATASYENIEELTTEAK 1241
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1798 RL--SAANQKNMTILETKKEAIEGSKRQ-------IENTLKEGND-----ILDEANRLLGEI----NSVIDYVDDIKTKL 1859
Cdd:PTZ00440 1242 GLkgEANRSTNVDELKEIKLQVFSYLQQvikennkMENALHEIKNmyeflISIDSEKILKEIlnstKKAEEFSNDAKKEL 1321
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1860 ppmseelsDKIDDLAQEIKdrrlaEKVFQAESHAAQlNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDEAEkvar 1939
Cdd:PTZ00440 1322 --------EKTDNLIKQVE-----AKIEQAKEHKNK-IYGSLEDKQIDDEIKKIEQIKEEISNKRKEINKYLSNIK---- 1383
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1940 EAKElaQGATKLATSPQGLLKED--------------------AKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLE 1999
Cdd:PTZ00440 1384 SNKE--KCDLHVRNASRGKDKIDflnkheaiepsnskevniikITDNINKCKQYSNEAMETENKADENNDSIIKYEKEIT 1461
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2000 TAdLRNSGLLG----------ALNDTMDKLSAITNDTAAKLQAVKEKARE--------------ANDTAKAV-------L 2048
Cdd:PTZ00440 1462 NI-LNNSSILGkktklekkkkEATNIMDDINGEHSIIKTKLTKSSEKLNQlneqpnikregdvlNNDKSTIAyetiqynL 1540
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2049 AQVKDLHQNLDGLKQN----YNKLADSVAKTNAVVK-DPSKN--KIIADAGTSVRNLEQ----------EADRLIDKLKP 2111
Cdd:PTZ00440 1541 GRVKHNLLNILNIKDEietiLNKAQDLMRDISKISKiVENKNleNLNDKEADYVKYLDNilkekqlmeaEYKKLNEIYSD 1620
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|...
gi 117647249 2112 IKELEDNLKK--------NISEIKELINQARKQANSIKVSVSS 2146
Cdd:PTZ00440 1621 VDNIEKELKKhkknyeigLLEKVIEINKNIKLYMDSTKESLNS 1663
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1102-1159 1.09e-10

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 58.90  E-value: 1.09e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  1102 CDCFLPGTDATTCDLETrkcscsdqtGQCSCKVNVEGVHCDRCRPGKFGLDAKNPLGC 1159
Cdd:pfam00053    1 CDCNPHGSLSDTCDPET---------GQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1674-2122 1.36e-10

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 67.77  E-value: 1.36e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1674 LEEFIKGLVQDAEAInEKAVQLNETLGNQD-------KTAERNLEELQKEIDRMLKELRSKdlqtQKEVAEDELvaaEGL 1746
Cdd:TIGR01612  605 LKEKIKNISDKNEYI-KKAIDLKKIIENNNayidelaKISPYQVPEHLKNKDKIYSTIKSE----LSKIYEDDI---DAL 676
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1747 LKRVNKLFGEPRAQN-EDmekdlQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNM---TILETKKEaiegskr 1822
Cdd:TIGR01612  677 YNELSSIVKENAIDNtED-----KAKLDDLKSKIDKEYDKIQNMETATVELHLSNIENKKNElldIIVEIKKH------- 744
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1823 qientlkegndILDEANRLLGEInsvidyVDDIKTKlppmSEELSDKIDDLAQEiKDR--RLAEKVFQAESHaaqLNDSS 1900
Cdd:TIGR01612  745 -----------IHGEINKDLNKI------LEDFKNK----EKELSNKINDYAKE-KDElnKYKSKISEIKNH---YNDQI 799
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1901 AVlDGILDEAKNISFNATAAFRAYSNIKDyiDEAEKVAREAKelaqgatklatspqgLLKEDAKGSLQKSFRILNEAKKl 1980
Cdd:TIGR01612  800 NI-DNIKDEDAKQNYDKSKEYIKTISIKE--DEIFKIINEMK---------------FMKDDFLNKVDKFINFENNCKE- 860
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1981 anDVKGNHNDLNDL--KTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVK--EKAREANDTAKAVLAQVKDLHQ 2056
Cdd:TIGR01612  861 --KIDSEHEQFAELtnKIKAEISDDKLNDYEKKFNDSKSLINEINKSIEEEYQNINtlKKVDEYIKICENTKESIEKFHN 938
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 117647249  2057 NLDGLKQNYNKLADSVAKTNAVVK---DPSKN----------KIIADAgtSVRNLEQEADRLIdklKPIKELEDNLKKN 2122
Cdd:TIGR01612  939 KQNILKEILNKNIDTIKESNLIEKsykDKFDNtlidkineldKAFKDA--SLNDYEAKNNELI---KYFNDLKANLGKN 1012
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1634-2141 1.41e-10

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 67.63  E-value: 1.41e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1634 GNVNTLV---M-ETNELLTRATKVTA---DGEQTGQDAERTNSRAESLEEfikgLVQDAEAINEKAVQLN--ETLGN--Q 1702
Cdd:COG4913   207 GDLDDFVreyMlEEPDTFEAADALVEhfdDLERAHEALEDAREQIELLEP----IRELAERYAAARERLAelEYLRAalR 282
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1703 DKTAERNLEELQKEIDRMLKELRS-----KDLQTQKEVAEDELVAAEGLL-----KRVNKLfgepRAQNEDMEKDL---Q 1769
Cdd:COG4913   283 LWFAQRRLELLEAELEELRAELARleaelERLEARLDALREELDELEAQIrgnggDRLEQL----EREIERLERELeerE 358
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1770 QKLAEYKNKLDDA-----------WDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKE----GNDI 1834
Cdd:COG4913   359 RRRARLEALLAALglplpasaeefAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASlerrKSNI 438
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1835 ----------------LDE----------------------ANRLLG-----------EINSVIDYVDDIKTKL------ 1859
Cdd:COG4913   439 parllalrdalaealgLDEaelpfvgelievrpeeerwrgaIERVLGgfaltllvppeHYAAALRWVNRLHLRGrlvyer 518
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1860 ------PPMSEELSDK--------------------------------IDDLAQ------------------EIKDRRL- 1882
Cdd:COG4913   519 vrtglpDPERPRLDPDslagkldfkphpfrawleaelgrrfdyvcvdsPEELRRhpraitragqvkgngtrhEKDDRRRi 598
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1883 ----------AEKVFQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDEAEKVAREAKELAQgatkla 1952
Cdd:COG4913   599 rsryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAE------ 672
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1953 tspqgllKEDAKGSLQKSFRILNEAKKLANDVKGNHNDL----NDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAA 2028
Cdd:COG4913   673 -------LEAELERLDASSDDLAALEEQLEELEAELEELeeelDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARL 745
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2029 KLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKT----NAVVKDPSknkiiADAGTSVRNLEQ---- 2100
Cdd:COG4913   746 ELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAmrafNREWPAET-----ADLDADLESLPEylal 820
                         650       660       670       680
                  ....*....|....*....|....*....|....*....|....*.
gi 117647249 2101 ----EADRLIDKLKPIKELednLKKN-ISEIKELINQARKQANSIK 2141
Cdd:COG4913   821 ldrlEEDGLPEYEERFKEL---LNENsIEFVADLLSKLRRAIREIK 863
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1415-1463 1.51e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 58.52  E-value: 1.51e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|..
gi 117647249 1415 PCQCNGHSS---QCDPETSVCQnCQHHTAGDFCERCALGYYGIvRGLPNDCQ 1463
Cdd:cd00055     1 PCDCNGHGSlsgQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGL-PSQGGGCQ 50
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1638-1909 1.57e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 67.40  E-value: 1.57e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1638 TLVMETNE----LLTRATKVTADGE-------QTG----------------QDAERTNSRAESLEEFIKGLVQDAEAINE 1690
Cdd:TIGR02169  623 TLVVEDIEaarrLMGKYRMVTLEGElfeksgaMTGgsraprggilfsrsepAELQRLRERLEGLKRELSSLQSELRRIEN 702
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1691 KAVQLNETLgnqdKTAERNLEELQKEIDRMLKEL-----RSKDLQTQKEVAEDELVAAEGLLKRVNKLFGEP-------R 1758
Cdd:TIGR02169  703 RLDELSQEL----SDASRKIGEIEKEIEQLEQEEeklkeRLEELEEDLSSLEQEIENVKSELKELEARIEELeedlhklE 778
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1759 AQNEDMEKDL-QQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDE 1837
Cdd:TIGR02169  779 EALNDLEARLsHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIEN 858
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1838 ANRLLGEINSVIdyvDDIKTKLPPMSEELSD---KIDDL-----AQEIKDRRLAEKVFQAESHAAQLNDSSAVLDGILDE 1909
Cdd:TIGR02169  859 LNGKKEELEEEL---EELEAALRDLESRLGDlkkERDELeaqlrELERKIEELEAQIEKKRKRLSELKAKLEALEEELSE 935
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
963-1010 2.41e-10

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 58.13  E-value: 2.41e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249   963 CNCNSFGSKSFDCEAS-GQCWCQPGVAGKKCDRCAHGYFNFQEGGCIAC 1010
Cdd:pfam00053    1 CDCNPHGSLSDTCDPEtGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1009-1054 2.94e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 57.75  E-value: 2.94e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 117647249 1009 ACDCSHLG---NNCDPKTGQCICPPNTTGEKCSECLPNTWGH-SIVTGCK 1054
Cdd:cd00055     1 PCDCNGHGslsGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLpSQGGGCQ 50
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1056-1098 3.51e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 57.75  E-value: 3.51e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 117647249 1056 CNCSTVGSLASQCNVNTGQCSCHPKFSGMKCSECSRGHWNYPL 1098
Cdd:cd00055     2 CDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPS 44
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1628-1978 4.42e-10

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 64.54  E-value: 4.42e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1628 LIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERT-------NSRAESLEEFIKGLVQDAEAINEKAVQLNETLg 1700
Cdd:COG4372    25 LIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLeeeleqaRSELEQLEEELEELNEQLQAAQAELAQAQEEL- 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1701 nqdKTAERNLEELQKEIDRMLKELrsKDLQTQKEVAEDElvaaeglLKRVNKLFGEPRAQNEDMEKDLQQKLAEYKnKLD 1780
Cdd:COG4372   104 ---ESLQEEAEELQEELEELQKER--QDLEQQRKQLEAQ-------IAELQSEIAEREEELKELEEQLESLQEELA-ALE 170
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1781 DAWDLLREATDKTRDANRLSAANQKNMTILETKK--EAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTK 1858
Cdd:COG4372   171 QELQALSEAEAEQALDELLKEANRNAEKEEELAEaeKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEE 250
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1859 LPPMSEELSDKIDDLAQEIKDRRLAEKV---FQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDEAE 1935
Cdd:COG4372   251 LLEEVILKEIEELELAILVEKDTEEEELeiaALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLEL 330
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 117647249 1936 KVAREAKELAQGATKLAtspQGLLKEDAKGSLQKSFRILNEAK 1978
Cdd:COG4372   331 ALAILLAELADLLQLLL---VGLLDNDVLELLSKGAEAGVADG 370
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1010-1048 5.56e-10

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 56.94  E-value: 5.56e-10
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 117647249   1010 CDCS---HLGNNCDPKTGQCICPPNTTGEKCSECLPNTWGHS 1048
Cdd:smart00180    1 CDCDpggSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDG 42
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1646-1839 6.22e-10

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 64.08  E-value: 6.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1646 LLTRATKVTADGEQTGQDAE--RTNSRAESLEEFIKGLVQDAEAINEKAVQLN---ETLGNQDKTAERNLEELQKEIDRM 1720
Cdd:COG3883     5 ALAAPTPAFADPQIQAKQKElsELQAELEAAQAELDALQAELEELNEEYNELQaelEALQAEIDKLQAEIAEAEAEIEER 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1721 LKEL--RSKDLQTQKEVAE--DELVAAEG---LLKRVNKLFGEPRAQNEDME--KDLQQKLAEYKNKLDDAWDLLR---- 1787
Cdd:COG3883    85 REELgeRARALYRSGGSVSylDVLLGSESfsdFLDRLSALSKIADADADLLEelKADKAELEAKKAELEAKLAELEalka 164
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 117647249 1788 EATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEAN 1839
Cdd:COG3883   165 ELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAA 216
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1603-2177 7.19e-10

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 65.62  E-value: 7.19e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1603 KILYGLENTTQeLKHLLSPQRAPERLIQLAEGNVNTLVMETNELLTRATKVtadgeQTGQDAERTNSRAESLEEFI---- 1678
Cdd:PTZ00440  572 KIKYIEENVDH-IKDIISLNDEIDNIIQQIEELINEALFNKEKFINEKNDL-----QEKVKYILNKFYKGDLQELLdels 645
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1679 ------KGLVQDAEAINEKAVQLNETLGNQDKtaernLEELQKE-IDRMLKELR--SKDLQTQKEVAEDELVaaegllkr 1749
Cdd:PTZ00440  646 hflddhKYLYHEAKSKEDLQTLLNTSKNEYEK-----LEFMKSDnIDNIIKNLKkeLQNLLSLKENIIKKQL-------- 712
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1750 vNKLfgepraqNEDMEKDLQQKLAEYKnklddawdllreatDKTRDANRLSAANQKnmtiLETKKEAIEGSKRQIENTLK 1829
Cdd:PTZ00440  713 -NNI-------EQDISNSLNQYTIKYN--------------DLKSSIEEYKEEEEK----LEVYKHQIINRKNEFILHLY 766
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1830 EGNDILDEANRLLGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKDRRLAEKVFQA--ESHAAQLNDSsavLDGIL 1907
Cdd:PTZ00440  767 ENDKDLPDGKNTYEEFLQYKDTILNKENKISNDINILKENKKNNQDLLNSYNILIQKLEAhtEKNDEELKQL---LQKFP 843
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1908 DEAKNIS-------FNatAAFRAYSNIKDYIDEAEKVAREAKELAQGATKLATSPQGLL-----KEDAKGSLQKSFRILN 1975
Cdd:PTZ00440  844 TEDENLNlkelekeFN--ENNQIVDNIIKDIENMNKNINIIKTLNIAINRSNSNKQLVEhllnnKIDLKNKLEQHMKIIN 921
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1976 EAKKLANDVKGN-HNDLNDLKTRLETadlrnsgllgALNDT-MDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKD 2053
Cdd:PTZ00440  922 TDNIIQKNEKLNlLNNLNKEKEKIEK----------QLSDTkINNLKMQIEKTLEYYDKSKENINGNDGTHLEKLDKEKD 991
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2054 ----LHQNLDGLKQNYNKLAdsvAKTNAVVKDPsKNKIIADAGTSVRNLEQEADRLIDKLkpIKELEDnLKKNISEIKEL 2129
Cdd:PTZ00440  992 ewehFKSEIDKLNVNYNILN---KKIDDLIKKQ-HDDIIELIDKLIKEKGKEIEEKVDQY--ISLLEK-MKTKLSSFHFN 1064
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....*...
gi 117647249 2130 INQARKQANSIKVSVSSGGDCVRTYRPEIKKGsyNNIVVHVKTAVADN 2177
Cdd:PTZ00440 1065 IDIKKYKNPKIKEEIKLLEEKVEALLKKIDEN--KNKLIEIKNKSHEH 1110
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
752-801 8.22e-10

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 56.59  E-value: 8.22e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  752 PCQCFAHA---EACDDITGECLnCKDHTGGPYCNECLPGFYGDPTRgsPEDCQ 801
Cdd:cd00055     1 PCDCNGHGslsGQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQ--GGGCQ 50
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1688-2136 8.43e-10

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 65.07  E-value: 8.43e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1688 INEKAVQLNETLGNQD---KTAERNL-EELQKEIDRMLKELRSKDLQTQK-------EVAE-------DELVAAEGL--L 1747
Cdd:TIGR01612 1031 IEQKIEDANKNIPNIEiaiHTSIYNIiDEIEKEIGKNIELLNKEILEEAEinitnfnEIKEklkhynfDDFGKEENIkyA 1110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1748 KRVNKLFGEPRAQNEDMEKDLQQkLAEYKNKLDDAWDLLREATDKTRDANrlsaanqkNMTILETKKEAIEGSKRQIENT 1827
Cdd:TIGR01612 1111 DEINKIKDDIKNLDQKIDHHIKA-LEEIKKKSENYIDEIKAQINDLEDVA--------DKAISNDDPEEIEKKIENIVTK 1181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1828 LKEGNDILDEANRLLGEINSV-------------------------IDYVDDIKTKLPPMSEELSDKIDDL------AQE 1876
Cdd:TIGR01612 1182 IDKKKNIYDEIKKLLNEIAEIekdktsleevkginlsygknlgklfLEKIDEEKKKSEHMIKAMEAYIEDLdeikekSPE 1261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1877 IKDRRLAEKVFQAESHAAQLNDS--------SAVLDGILDEAKNISFNATAAFRAYSNIKDYIDEAEKVAREAK----EL 1944
Cdd:TIGR01612 1262 IENEMGIEMDIKAEMETFNISHDddkdhhiiSKKHDENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQkhnsDI 1341
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1945 AQGATKLaTSPQGLLKedakgsLQKSFRILNEAKKLANDVKGNHNDLND--------LKTRLETADLRN--SGLLGALND 2014
Cdd:TIGR01612 1342 NLYLNEI-ANIYNILK------LNKIKKIIDEVKEYTKEIEENNKNIKDeldkseklIKKIKDDINLEEckSKIESTLDD 1414
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2015 TmDKLSAITNDTAAKLQAVKEKAReaNDTakavlaQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDpSKNKIIADAGTS 2094
Cdd:TIGR01612 1415 K-DIDECIKKIKELKNHILSEESN--IDT------YFKNADENNENVLLLFKNIEMADNKSQHILKI-KKDNATNDHDFN 1484
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|..
gi 117647249  2095 VRNLEQEadrlIDKLKPIKELEDNLKKNISEIKELINQARKQ 2136
Cdd:TIGR01612 1485 INELKEH----IDKSKGCKDEADKNAKAIEKNKELFEQYKKD 1522
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1699-1946 9.81e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 64.70  E-value: 9.81e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1699 LGNQDKTAERNLEELQKEIDRMLKELrsKDLQTQKEVAEDELVAAEGLLKRVNKlfgEPRAQNEDMEKDLQQKLAEYKNK 1778
Cdd:TIGR02169  228 LLKEKEALERQKEAIERQLASLEEEL--EKLTEEISELEKRLEEIEQLLEELNK---KIKDLGEEEQLRVKEKIGELEAE 302
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1779 LDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGS-------KRQIENTLKEGNDILDEANRLLGEINS---- 1847
Cdd:TIGR02169  303 IASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREieeerkrRDKLTEEYAELKEELEDLRAELEEVDKefae 382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1848 VIDYVDDIKTKLppmsEELSDKIDDLAQEIkdRRLAEKVFQAESHAAQLNDSSAVLdgildEAKNISFNATAafraysni 1927
Cdd:TIGR02169  383 TRDELKDYREKL----EKLKREINELKREL--DRLQEELQRLSEELADLNAAIAGI-----EAKINELEEEK-------- 443
                          250
                   ....*....|....*....
gi 117647249  1928 KDYIDEAEKVAREAKELAQ 1946
Cdd:TIGR02169  444 EDKALEIKKQEWKLEQLAA 462
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
1653-2119 1.04e-09

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 63.94  E-value: 1.04e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1653 VTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLnETLGNQDKTAERNLEELQKEIDRMLKELrskdlqTQ 1732
Cdd:pfam05622    5 AQEEKDELAQRCHELDQQVSLLQEEKNSLQQENKKLQERLDQL-ESGDDSGTPGGKKYLLLQKQLEQLQEEN------FR 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1733 KEVAEDEL-VAAEGLLKRVnklfGEPRAQNEDMekdlqQKLAEYKNKLDDAWDLLREATDKtrdANRLSAAnqknmtiLE 1811
Cdd:pfam05622   78 LETARDDYrIKCEELEKEV----LELQHRNEEL-----TSLAEEAQALKDEMDILRESSDK---VKKLEAT-------VE 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1812 TKKEAIEGS---KRQIEnTLKEGN-----------DILDEANRLLGEINSVIDYVDDIKTKLppmSEE------------ 1865
Cdd:pfam05622  139 TYKKKLEDLgdlRRQVK-LLEERNaeymqrtlqleEELKKANALRGQLETYKRQVQELHGKL---SEEskkadklefeyk 214
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1866 -LSDKIDDLAQEiKDRRLAE----KVFQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYID--EAE-KV 1937
Cdd:pfam05622  215 kLEEKLEALQKE-KERLIIErdtlRETNEELRCAQLQQAELSQADALLSPSSDPGDNLAAEIMPAEIREKLIrlQHEnKM 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1938 AREAKElAQGATKLATSpQGLLkEDAKGSLQKsfriLNEAKKLAND-VKGNHNDLNDLKTRLETADLRN--SGLLGA-LN 2013
Cdd:pfam05622  294 LRLGQE-GSYRERLTEL-QQLL-EDANRRKNE----LETQNRLANQrILELQQQVEELQKALQEQGSKAedSSLLKQkLE 366
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2014 DTMDKLSaitndtaaKLQAVKEKAREA-NDTAKAVLAQvkdLHQNLDGLKQNYNKLADS-----------VAKTNAVVK- 2080
Cdd:pfam05622  367 EHLEKLH--------EAQSELQKKKEQiEELEPKQDSN---LAQKIDELQEALRKKDEDmkameerykkyVEKAKSVIKt 435
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*..
gi 117647249  2081 -DPSKNKIIADAGTSVRNLEQEADRLID-------KLKPIKELEDNL 2119
Cdd:pfam05622  436 lDPKQNPASPPEIQALKNQLLEKDKKIEhlerdfeKSKLQREQEEKL 482
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
465-511 1.07e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 56.21  E-value: 1.07e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 117647249   465 CNCSGLGSTNE--DPCVGPCSCKENVEGEDCSRCKSGFFNLQEDNQKGC 511
Cdd:pfam00053    1 CDCNPHGSLSDtcDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1010-1053 1.23e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 56.21  E-value: 1.23e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 117647249  1010 CDCSHLG---NNCDPKTGQCICPPNTTGEKCSECLPNTWGHSIVTGC 1053
Cdd:pfam00053    1 CDCNPHGslsDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQ 47
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
410-467 1.26e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 55.82  E-value: 1.26e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249   410 CHCDPTGSLSEVCVKDekyaqrglkPGSCHCKTGFGGVNCDRCVRGYHGYPDCQPCNC 467
Cdd:pfam00053    1 CDCNPHGSLSDTCDPE---------TGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1645-2136 1.34e-09

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 64.30  E-value: 1.34e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1645 ELLTRATKVTADGEQTGQDAERTNSRAESLEefIKGLVQDAEAINekavQLNETLGNQDKTAERNLEELQKEIDRMLKEL 1724
Cdd:TIGR00606  369 LIQSLATRLELDGFERGPFSERQIKNFHTLV--IERQEDEAKTAA----QLCADLQSKERLKQEQADEIRDEKKGLGRTI 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1725 RSKDLQTQKEVAE-----DELVAAEGLLKRVNKLFGEPRAQNEDMEK-----DLQQKLAEYKNKLDDAWDL---LREATD 1791
Cdd:TIGR00606  443 ELKKEILEKKQEElkfviKELQQLEGSSDRILELDQELRKAERELSKaeknsLTETLKKEVKSLQNEKADLdrkLRKLDQ 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1792 KTRDANRLSAANQKNMTILETKKEAIE--------GS------------KRQIENTL----KEGNDILDEANRLLGEINS 1847
Cdd:TIGR00606  523 EMEQLNHHTTTRTQMEMLTKDKMDKDEqirkiksrHSdeltsllgyfpnKKQLEDWLhsksKEINQTRDRLAKLNKELAS 602
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1848 VIDYVDDIKTKLPPMSEELS---DKIDDL----AQEIKDRRLAEKVFQAESHAAQLNDSSAVLDGILDEAKNISFNATAA 1920
Cdd:TIGR00606  603 LEQNKNHINNELESKEEQLSsyeDKLFDVcgsqDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPV 682
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1921 ----FRAYSNIKDYIDEAEKVAR----EAKELAQGATKLATSPQGLLkedAKGSLQKSF--RILNEAKKLANDVKGNHND 1990
Cdd:TIGR00606  683 cqrvFQTEAELQEFISDLQSKLRlapdKLKSTESELKKKEKRRDEML---GLAPGRQSIidLKEKEIPELRNKLQKVNRD 759
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1991 LNDLKTRLEtadlRNSGLLGALNDTMDKLSAITNDTAAkLQAVKEKAREANDTAKAVLAQ---------VKDLHQNLDGL 2061
Cdd:TIGR00606  760 IQRLKNDIE----EQETLLGTIMPEEESAKVCLTDVTI-MERFQMELKDVERKIAQQAAKlqgsdldrtVQQVNQEKQEK 834
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  2062 KQNYNKLADSVAKTNAVVKDpsKNKIIADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKEL---INQARKQ 2136
Cdd:TIGR00606  835 QHELDTVVSKIELNRKLIQD--QQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLireIKDAKEQ 910
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1701-1901 1.72e-09

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 62.86  E-value: 1.72e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1701 NQDKTAERNLEELQKEIDRMLKELRS-----KDLQTQKEVAEDELVAAEGLLKRVNKLFGEPR---AQNEDMEKDLQQKL 1772
Cdd:COG4942    20 DAAAEAEAELEQLQQEIAELEKELAAlkkeeKALLKQLAALERRIAALARRIRALEQELAALEaelAELEKEIAELRAEL 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1773 AE------------YKNKLDDAWDLL---REATDKTRDA---NRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDI 1834
Cdd:COG4942   100 EAqkeelaellralYRLGRQPPLALLlspEDFLDAVRRLqylKYLAPARREQAEELRADLAELAALRAELEAERAELEAL 179
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249 1835 LDEANRLLGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKD-RRLAEKVFQAESHAAQLNDSSA 1901
Cdd:COG4942   180 LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEElEALIARLEAEAAAAAERTPAAG 247
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1674-2175 2.03e-09

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 63.59  E-value: 2.03e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1674 LEEFI---KGLVQDAEAINEKAVQLNETLG-NQDKTAERNLE---------ELQKEIDRMLkeLRSKDLQTQKEVAEDEL 1740
Cdd:pfam05483  136 LEEEIqenKDLIKENNATRHLCNLLKETCArSAEKTKKYEYEreetrqvymDLNNNIEKMI--LAFEELRVQAENARLEM 213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1741 -VAAEGLLKRVNKLFGEPRAQNEDMEKD---LQQKLAEYKNKLDDAWDLLREATDKtrdanrlsaANQknmtiLETKkea 1816
Cdd:pfam05483  214 hFKLKEDHEKIQHLEEEYKKEINDKEKQvslLLIQITEKENKMKDLTFLLEESRDK---------ANQ-----LEEK--- 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1817 iegSKRQIENtLKEGNDildEANRLLGEInsvidyvDDIKTKLP-PMSEELSDKiDDLaqEIKDRRLAEKVFQAESHAAQ 1895
Cdd:pfam05483  277 ---TKLQDEN-LKELIE---KKDHLTKEL-------EDIKMSLQrSMSTQKALE-EDL--QIATKTICQLTEEKEAQMEE 339
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1896 LNDSSAVLDGILDEAKNISFNATAAFRA----YSNIKDYID----EAEKVAREAKELaqgaTKLATSPQGLLkEDAKGSL 1967
Cdd:pfam05483  340 LNKAKAAHSFVVTEFEATTCSLEELLRTeqqrLEKNEDQLKiitmELQKKSSELEEM----TKFKNNKEVEL-EELKKIL 414
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1968 QKSFRILNEAK---KLANDVKGNHNDL--------------------------------NDLKTRLETADLRNSGLLGAL 2012
Cdd:pfam05483  415 AEDEKLLDEKKqfeKIAEELKGKEQELifllqarekeihdleiqltaiktseehylkevEDLKTELEKEKLKNIELTAHC 494
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2013 NDTM---DKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDLHQN-------LDGLKQNYNKLADSV------AKTN 2076
Cdd:pfam05483  495 DKLLlenKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKemnlrdeLESVREEFIQKGDEVkckldkSEEN 574
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2077 AVVKDPS---KNKIIADAGTSVRNLEQEADrliDKLKPIKELEDN---LKKNISEIKELINQARKQANSIKVSVSSG--- 2147
Cdd:pfam05483  575 ARSIEYEvlkKEKQMKILENKCNNLKKQIE---NKNKNIEELHQEnkaLKKKGSAENKQLNAYEIKVNKLELELASAkqk 651
                          570       580       590
                   ....*....|....*....|....*....|.
gi 117647249  2148 -GDCVRTYRPEI--KKGSYNNIVVHVKTAVA 2175
Cdd:pfam05483  652 fEEIIDNYQKEIedKKISEEKLLEEVEKAKA 682
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
753-800 2.33e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 55.05  E-value: 2.33e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 117647249   753 CQCFAHA---EACDDITGECLnCKDHTGGPYCNECLPGFYGDPTrGSPEDC 800
Cdd:pfam00053    1 CDCNPHGslsDTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPS-DPPQGC 49
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1701-1979 2.37e-09

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 61.47  E-value: 2.37e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1701 NQDKTAERNLEELQKEIDRMLKELRS-KDLQTQ-----KEVAE--DELVAaegllkRVNKLFGEPRAQNEDMeKDLQQKL 1772
Cdd:COG1340     1 SKTDELSSSLEELEEKIEELREEIEElKEKRDElneelKELAEkrDELNA------QVKELREEAQELREKR-DELNEKV 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1773 AEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIEN---TLKEGNDILDEANRL---LGEIN 1846
Cdd:COG1340    74 KELKEERDELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRQQTevlSPEEEKELVEKIKELekeLEKAK 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1847 SVIDYVDDIKTKLpPMSEELSDKIDDLAQEIKDrrLAEkvfQAESHAAQLNDSSAVLDGILDEAKnisfnataafRAYSN 1926
Cdd:COG1340   154 KALEKNEKLKELR-AELKELRKEAEEIHKKIKE--LAE---EAQELHEEMIELYKEADELRKEAD----------ELHKE 217
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249 1927 IKDYIDEAEKVAREAKELAQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKK 1979
Cdd:COG1340   218 IVEAQEKADELHEEIIELQKELRELRKELKKLRKKQRALKREKEKEELEEKAE 270
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1102-1159 2.99e-09

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 55.01  E-value: 2.99e-09
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249   1102 CDCFLPGTDATTCDLETrkcscsdqtGQCSCKVNVEGVHCDRCRPGKFGldaKNPLGC 1159
Cdd:smart00180    1 CDCDPGGSASGTCDPDT---------GQCECKPNVTGRRCDRCAPGYYG---DGPPGC 46
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1626-2137 3.17e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 63.03  E-value: 3.17e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1626 ERLIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNqdkt 1705
Cdd:COG1196   238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRE---- 313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1706 AERNLEELQKEIDRMLKELRskDLQTQKEVAEDELVAAEGLLKRVNKLfgepRAQNEDMEKDLQQKLAEYKNKLDDAWDL 1785
Cdd:COG1196   314 LEERLEELEEELAELEEELE--ELEEELEELEEELEEAEEELEEAEAE----LAEAEEALLEAEAELAEAEEELEELAEE 387
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1786 LREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLPPMSEE 1865
Cdd:COG1196   388 LLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAE 467
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1866 LSDKIDDLAQEIKDRR--LAEKVFQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDEAEKVAREAKE 1943
Cdd:COG1196   468 LLEEAALLEAALAELLeeLAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAA 547
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1944 LAQGATKLATSPQ---GLLKEDAKGSLqkSFRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNsgLLGALNDTMDKLS 2020
Cdd:COG1196   548 LQNIVVEDDEVAAaaiEYLKAAKAGRA--TFLPLDKIRARAALAAALARGAIGAAVDLVASDLRE--ADARYYVLGDTLL 623
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2021 AITNDTAAKLQAVKEkAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNKIIADAGTSVRNLEQ 2100
Cdd:COG1196   624 GRTLVAARLEAALRR-AVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAE 702
                         490       500       510
                  ....*....|....*....|....*....|....*..
gi 117647249 2101 EADRLIDKLKPIKELEDNLKKNISEIKELINQARKQA 2137
Cdd:COG1196   703 EEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1523-1566 3.36e-09

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 54.62  E-value: 3.36e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 117647249   1523 CECDPYGSLPVPCDRVTGLCTCRPGATGRKCDGCEHWHAREGAE 1566
Cdd:smart00180    1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPP 44
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1523-1560 3.43e-09

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 54.67  E-value: 3.43e-09
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 117647249 1523 CECDPYGSLPVPCDRVTGLCTCRPGATGRKCDGCEHWH 1560
Cdd:cd00055     2 CDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGY 39
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1056-1097 3.64e-09

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 54.62  E-value: 3.64e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 117647249   1056 CNCSTVGSLASQCNVNTGQCSCHPKFSGMKCSECSRGHWNYP 1097
Cdd:smart00180    1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDG 42
PRK01156 PRK01156
chromosome segregation protein; Provisional
1673-2229 4.09e-09

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 62.61  E-value: 4.09e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1673 SLEEFIKGLVQDAEaINEKAVQLNETLGNQDKTAERNLEELQKEI--------DRMLKELRSKDLQTQKEVAeDELVAAE 1744
Cdd:PRK01156   85 SIERRGKGSRREAY-IKKDGSIIAEGFDDTTKYIEKNILGISKDVflnsifvgQGEMDSLISGDPAQRKKIL-DEILEIN 162
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1745 GLLKRVNKLfgepraqnEDMEKDLQQKLAEYknklDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQI 1824
Cdd:PRK01156  163 SLERNYDKL--------KDVIDMLRAEISNI----DYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNA 230
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1825 EN-------TLKEGNDILDEANRL---LGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAqeIKDRrlaEKVFQAESHAA 1894
Cdd:PRK01156  231 MDdynnlksALNELSSLEDMKNRYeseIKTAESDLSMELEKNNYYKELEERHMKIINDPV--YKNR---NYINDYFKYKN 305
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1895 QLNDSSAVLDGILDEAKNISfnatAAFRAYSNIKDYIDEAEKVAREAKELAQGATKLATSpqgllkEDAKGSLQKSFRIL 1974
Cdd:PRK01156  306 DIENKKQILSNIDAEINKYH----AIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGY------EMDYNSYLKSIESL 375
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1975 NeaKKLANDVKgNHNDLNDLKTRletadlrnsgLLGALNDTMDKLSAITNDTAAKLQAVKEKAREANdtakavlAQVKDL 2054
Cdd:PRK01156  376 K--KKIEEYSK-NIERMSAFISE----------ILKIQEIDPDAIKKELNEINVKLQDISSKVSSLN-------QRIRAL 435
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2055 HQNLDGLKQNYNKL-ADSVA----------KTNAVVKDPS--KNKIIADagtsVRNLEQEADRLIDKLKPIKELEDNLKK 2121
Cdd:PRK01156  436 RENLDELSRNMEMLnGQSVCpvcgttlgeeKSNHIINHYNekKSRLEEK----IREIEIEVKDIDEKIVDLKKRKEYLES 511
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2122 niSEIKELINQ------ARKQANSIKVSVSSGGDCVRTYrpEIKKGSYNNIVVHVKTAVADNLLFYLGSAKFIDFLAIEM 2195
Cdd:PRK01156  512 --EEINKSINEynkiesARADLEDIKIKINELKDKHDKY--EEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIETNRS 587
                         570       580       590
                  ....*....|....*....|....*....|....*...
gi 117647249 2196 RKGKVSF-LWDVGSGVGRVE--YPDL-TIDDSYWYRIE 2229
Cdd:PRK01156  588 RSNEIKKqLNDLESRLQEIEigFPDDkSYIDKSIREIE 625
Tropomyosin_1 pfam12718
Tropomyosin like; This family is a set of eukaryotic tropomyosins. Within the yeast Tpm1 and ...
1684-1830 5.19e-09

Tropomyosin like; This family is a set of eukaryotic tropomyosins. Within the yeast Tpm1 and Tpm2, biochemical and sequence analyses indicate that Tpm2p spans four actin monomers along a filament, whereas Tpm1p spans five. Despite its shorter length, Tpm2p can compete with Tpm1p for binding to F-actin. Over-expression of Tpm2p in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1p. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis. The family also contains higher eukaryote Tpm3 members.


Pssm-ID: 403808 [Multi-domain]  Cd Length: 142  Bit Score: 57.31  E-value: 5.19e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1684 DAEAINEKAVQLNEtlgnQDKTAERNLEELQKEIdrmlkelrsKDLQTQKEVAEDELVAAEGLLKRVNKLFGEP---RAQ 1760
Cdd:pfam12718    8 EAENAQERAEELEE----KVKELEQENLEKEQEI---------KSLTHKNQQLEEEVEKLEEQLKEAKEKAEESeklKTN 74
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1761 NEDMEKDLQQkLAEyknKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKE 1830
Cdd:pfam12718   75 NENLTRKIQL-LEE---ELEESDKRLKETTEKLRETDVKAEHLERKVQALEQERDEWEKKYEELEEKYKE 140
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1665-2146 5.33e-09

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 62.76  E-value: 5.33e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1665 ERTNSRAESLEEFIKGLVQDAEAINEKAVQlNETLGNQDKTAERNLEELQK------EIDRMLKELR--SKDLQTQKEVA 1736
Cdd:TIGR01612 1135 EEIKKKSENYIDEIKAQINDLEDVADKAIS-NDDPEEIEKKIENIVTKIDKkkniydEIKKLLNEIAeiEKDKTSLEEVK 1213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1737 EDELVAAEGLlkrvNKLF----GEPRAQNEDMEK----------DLQQKLAEYKNKLDDAWDLLRE------ATDKTRDA 1796
Cdd:TIGR01612 1214 GINLSYGKNL----GKLFlekiDEEKKKSEHMIKameayiedldEIKEKSPEIENEMGIEMDIKAEmetfniSHDDDKDH 1289
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1797 NRLSAANQKNMTILETK----------KEAIEGSKRQIENTLKEGNDILDEANRLLGEINSV--IDYVDDIKtKLPPMSE 1864
Cdd:TIGR01612 1290 HIISKKHDENISDIREKslkiiedfseESDINDIKKELQKNLLDAQKHNSDINLYLNEIANIynILKLNKIK-KIIDEVK 1368
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1865 ELSDKIDDLAQEIKDR-RLAEKVFQAESHAAQLNDSSAVLDGILDEA------KNISFNATAAFRAYSNIKDYIDEAEKV 1937
Cdd:TIGR01612 1369 EYTKEIEENNKNIKDElDKSEKLIKKIKDDINLEECKSKIESTLDDKdideciKKIKELKNHILSEESNIDTYFKNADEN 1448
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1938 AREAKELAQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMD 2017
Cdd:TIGR01612 1449 NENVLLLFKNIEMADNKSQHILKIKKDNATNDHDFNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLN 1528
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2018 KLSAItndtaaklqAVKEKAREANDTAKAVLAQVKDLH-----------QNLDGLKQNYNKLADSVAKTNavvkdpSKNK 2086
Cdd:TIGR01612 1529 KYSAL---------AIKNKFAKTKKDSEIIIKEIKDAHkkfileaekseQKIKEIKKEKFRIEDDAAKND------KSNK 1593
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2087 IIADAGTSVRNLEqeadrliDKLkpikelednLKknISEIKELINQARKQANSIKVSVSS 2146
Cdd:TIGR01612 1594 AAIDIQLSLENFE-------NKF---------LK--ISDIKKKINDCLKETESIEKKISS 1635
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1630-2172 5.85e-09

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 62.54  E-value: 5.85e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1630 QLAEGNVNTLVMETNELLTRATKVTADGEqtgqDAERTNsrAESLEEFIKGLVQDAEAINEKAVQLNETLGNQDKTAERN 1709
Cdd:PTZ00440  948 QLSDTKINNLKMQIEKTLEYYDKSKENIN----GNDGTH--LEKLDKEKDEWEHFKSEIDKLNVNYNILNKKIDDLIKKQ 1021
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1710 LEELQKEIDRMLKELRSK-DLQTQKEVAEDELVAAEGLLKRVN---KLFGEPRAQN-----EDMEKDLQQKLAEYKNKLD 1780
Cdd:PTZ00440 1022 HDDIIELIDKLIKEKGKEiEEKVDQYISLLEKMKTKLSSFHFNidiKKYKNPKIKEeikllEEKVEALLKKIDENKNKLI 1101
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1781 DAWDLLREatdktrdaNRLSAANQKNMT--ILETKKEAIEGSKRQIENTLKEGNDILDEANRlLGEINSV-IDY----VD 1853
Cdd:PTZ00440 1102 EIKNKSHE--------HVVNADKEKNKQteHYNKKKKSLEKIYKQMEKTLKELENMNLEDIT-LNEVNEIeIEYerilID 1172
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1854 DIKTKlppMSEElSDKIDDLAQEIKDRRlaEKVFQAEshaaqlNDSSAVLDGILDEaknisFNATAAFRAYSNIKDYIDE 1933
Cdd:PTZ00440 1173 HIVEQ---INNE-AKKSKTIMEEIESYK--KDIDQVK------KNMSKERNDHLTT-----FEYNAYYDKATASYENIEE 1235
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1934 AEKVAREAKELAQGATKlaTSPQGLLKEDAKGSLQKSfrilneaKKLANDVKGNHNDLNDLKTRLETADLRNsgLLGALN 2013
Cdd:PTZ00440 1236 LTTEAKGLKGEANRSTN--VDELKEIKLQVFSYLQQV-------IKENNKMENALHEIKNMYEFLISIDSEK--ILKEIL 1304
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2014 DTMDKLSAITNDTAAKL-------QAVKEKAREANDTAKAVLAQVKD--LHQNLDGLKQNYNKLADSVAKTNAVVKDPSK 2084
Cdd:PTZ00440 1305 NSTKKAEEFSNDAKKELektdnliKQVEAKIEQAKEHKNKIYGSLEDkqIDDEIKKIEQIKEEISNKRKEINKYLSNIKS 1384
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2085 NKIIADagTSVRNLEQEADrLIDKLKPIKELEDNLKK--NISEIKELINQARK---QANSIKVSVSSGGDCVRTYRPEIK 2159
Cdd:PTZ00440 1385 NKEKCD--LHVRNASRGKD-KIDFLNKHEAIEPSNSKevNIIKITDNINKCKQysnEAMETENKADENNDSIIKYEKEIT 1461
                         570
                  ....*....|...
gi 117647249 2160 KGSYNNIVVHVKT 2172
Cdd:PTZ00440 1462 NILNNSSILGKKT 1474
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1523-1558 6.02e-09

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 53.90  E-value: 6.02e-09
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 117647249  1523 CECDPYGSLPVPCDRVTGLCTCRPGATGRKCDGCEH 1558
Cdd:pfam00053    1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKP 36
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1102-1159 6.29e-09

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 53.90  E-value: 6.29e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249 1102 CDCFLPGTDATTCDLETrkcscsdqtGQCSCKVNVEGVHCDRCRPGKFGlDAKNPLGC 1159
Cdd:cd00055     2 CDCNGHGSLSGQCDPGT---------GQCECKPNTTGRRCDRCAPGYYG-LPSQGGGC 49
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
1853-2142 6.33e-09

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 60.46  E-value: 6.33e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1853 DDIKTKLPpmseELSDKIDDLAQEIKDRRLAEKVFQaeSHAAQLNDSsaVLDGILDEAKNISFNATAA----FRAYSNIK 1928
Cdd:cd22656    26 EEYRKRLG----ISSDIDDKLSSDFDPLLDAYKSIK--DHCTDFKDD--TYPSIVSLAGDIYNYAQNAggtiDSYYAEIL 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1929 DYIDEA-----EKVAREAKELAQGATKLatspqglLKEDAKgslqksfRILNEAKKLANDVKGNHNDLNDLKTRLETADL 2003
Cdd:cd22656    98 ELIDDLadatdDEELEEAKKTIKALLDD-------LLKEAK-------KYQDKAAKVVDKLTDFENQTEKDQTALETLEK 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2004 RNSGLLGalndtmDKLSAITNDTAAKLQAVKEKAREandtakavlAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDps 2083
Cdd:cd22656   164 ALKDLLT------DEGGAIARKEIKDLQKELEKLNE---------EYAAKLKAKIDELKALIADDEAKLAAALRLIAD-- 226
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 117647249 2084 knkiIADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKV 2142
Cdd:cd22656   227 ----LTAADTDLDNLLALIGPAIPALEKLQGAWQAIATDLDSLKDLLEDDISKIPAAIL 281
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
861-904 7.36e-09

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 53.85  E-value: 7.36e-09
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....
gi 117647249    861 CQCNdnLDYSIPGSCDSLSGSCLiCKPGTTGRYCELCADGYFGD 904
Cdd:smart00180    1 CDCD--PGGSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGD 41
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1711-2137 7.55e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 62.01  E-value: 7.55e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1711 EELQKEIDRMLKELRskdlQTQKEVAEDELVAAEgLLKRVNKLFGEpRAQNEDMeKDLQQKLAEYKnklddAWDLLREAT 1790
Cdd:TIGR02169  166 AEFDRKKEKALEELE----EVEENIERLDLIIDE-KRQQLERLRRE-REKAERY-QALLKEKREYE-----GYELLKEKE 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1791 DKTRDANRLSAAnqknmtiLETKKEAIEGSKRQIENTLKEgndiLDEANRLLGEINSVIDYVDD-----IKTKLPPMSEE 1865
Cdd:TIGR02169  234 ALERQKEAIERQ-------LASLEEELEKLTEEISELEKR----LEEIEQLLEELNKKIKDLGEeeqlrVKEKIGELEAE 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1866 LS---DKIDDLAQEIKDrrLAEKVFQAESHaaqlndssavLDGILDEAKNISfnataafraySNIKDYIDEAEKVAREAK 1942
Cdd:TIGR02169  303 IAsleRSIAEKERELED--AEERLAKLEAE----------IDKLLAEIEELE----------REIEEERKRRDKLTEEYA 360
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1943 ELAQgatKLATSPQGLLKEDAKgsLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLEtadlrnsgllgalnDTMDKLSAI 2022
Cdd:TIGR02169  361 ELKE---ELEDLRAELEEVDKE--FAETRDELKDYREKLEKLKREINELKRELDRLQ--------------EELQRLSEE 421
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2023 TNDTAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKqnynkladsvaktnavvkdpsknKIIADAGTSVRNLEQEA 2102
Cdd:TIGR02169  422 LADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLA-----------------------ADLSKYEQELYDLKEEY 478
                          410       420       430
                   ....*....|....*....|....*....|....*
gi 117647249  2103 DRLidklkpikelEDNLKKNISEIKELinQARKQA 2137
Cdd:TIGR02169  479 DRV----------EKELSKLQRELAEA--EAQARA 501
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1658-2131 8.98e-09

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 61.77  E-value: 8.98e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1658 EQTGQDAERTNSRAESLEEFIKGLVQ-DAEAINEKAVQLNE----TLGNQDKTAERNLEELQKEIDRMLKEL-RSKDLQT 1731
Cdd:PTZ00440 1176 EQINNEAKKSKTIMEEIESYKKDIDQvKKNMSKERNDHLTTfeynAYYDKATASYENIEELTTEAKGLKGEAnRSTNVDE 1255
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1732 QKEVAEDelvaaegllkrVNKLFGEPRAQNEDMEKDLQQK---------------LAEYKNKLDDAWDLLREATDKTRDA 1796
Cdd:PTZ00440 1256 LKEIKLQ-----------VFSYLQQVIKENNKMENALHEIknmyeflisidsekiLKEILNSTKKAEEFSNDAKKELEKT 1324
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1797 NRLSAANQKNMTILETKKEAIEGS--KRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLppMSEELS-----DK 1869
Cdd:PTZ00440 1325 DNLIKQVEAKIEQAKEHKNKIYGSleDKQIDDEIKKIEQIKEEISNKRKEINKYLSNIKSNKEKC--DLHVRNasrgkDK 1402
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1870 IDDL----------AQEIKDRRLAEKVFQAESHAAQLNDS----SAVLDGILDEAKNIS--FNataafraYSNIKDYIDE 1933
Cdd:PTZ00440 1403 IDFLnkheaiepsnSKEVNIIKITDNINKCKQYSNEAMETenkaDENNDSIIKYEKEITniLN-------NSSILGKKTK 1475
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1934 AEKVAREAKELAQGATKLATSPQGLLKedakgSLQKSFRILNEAKK-------LAND---------------VKGNHNDL 1991
Cdd:PTZ00440 1476 LEKKKKEATNIMDDINGEHSIIKTKLT-----KSSEKLNQLNEQPNikregdvLNNDkstiayetiqynlgrVKHNLLNI 1550
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1992 NDLKTRLETadlrnsgLLGALNDTMDKLSAITNDTAAK-LQAVKEKarEANDTAKavLAQVKDLHQNLdglKQNYNKLAD 2070
Cdd:PTZ00440 1551 LNIKDEIET-------ILNKAQDLMRDISKISKIVENKnLENLNDK--EADYVKY--LDNILKEKQLM---EAEYKKLNE 1616
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 117647249 2071 SVAKTNAVVKDPSKNKIIADagtsVRNLEQeadrlidklkpIKELEDNLKKNISEIKELIN 2131
Cdd:PTZ00440 1617 IYSDVDNIEKELKKHKKNYE----IGLLEK-----------VIEINKNIKLYMDSTKESLN 1662
PRK01156 PRK01156
chromosome segregation protein; Provisional
1665-2160 9.93e-09

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 61.46  E-value: 9.93e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1665 ERTNSRAESLEEFIKGL---VQDAEAINEKAVQLNETLGNQDKtaerNLEELQKEIDRMLKELrsKDLQTQKEVAEDELV 1741
Cdd:PRK01156  162 NSLERNYDKLKDVIDMLraeISNIDYLEEKLKSSNLELENIKK----QIADDEKSHSITLKEI--ERLSIEYNNAMDDYN 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1742 AAEGLLKRVNKLfgepraqnEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRL--SAANQKNMTI---------L 1810
Cdd:PRK01156  236 NLKSALNELSSL--------EDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIinDPVYKNRNYIndyfkykndI 307
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1811 ETKKEAIEGSKRQI---ENTLKEGNDI-------------LDEANRLLGEI-------NSVIDYVDDIKTKLppmsEELS 1867
Cdd:PRK01156  308 ENKKQILSNIDAEInkyHAIIKKLSVLqkdyndyikkksrYDDLNNQILELegyemdyNSYLKSIESLKKKI----EEYS 383
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1868 DKIDDLAQEIKdRRLAEKVFQAESHAAQLNDSSAVLDGIldEAKNISFNATaafraysnIKDYIDEAEKVAREAKEL-AQ 1946
Cdd:PRK01156  384 KNIERMSAFIS-EILKIQEIDPDAIKKELNEINVKLQDI--SSKVSSLNQR--------IRALRENLDELSRNMEMLnGQ 452
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1947 G-----ATKLATSPQGLLKEDAKgslQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSA 2021
Cdd:PRK01156  453 SvcpvcGTTLGEEKSNHIINHYN---EKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIESARA 529
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2022 ITNDTAAKLQAVKEKareaNDTAKAVLAQVKDLH-QNLDGLKQNYNKLA---DSVAKTNAVVKDPSKNKIIADA------ 2091
Cdd:PRK01156  530 DLEDIKIKINELKDK----HDKYEEIKNRYKSLKlEDLDSKRTSWLNALaviSLIDIETNRSRSNEIKKQLNDLesrlqe 605
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2092 ------------GTSVRNLEQEADRLIDKLKPIKELE----------DNLKKNISEIKELI-----------------NQ 2132
Cdd:PRK01156  606 ieigfpddksyiDKSIREIENEANNLNNKYNEIQENKilieklrgkiDNYKKQIAEIDSIIpdlkeitsrindiednlKK 685
                         570       580
                  ....*....|....*....|....*...
gi 117647249 2133 ARKQANSIKVSVSSGGDCVRTYRPEIKK 2160
Cdd:PRK01156  686 SRKALDDAKANRARLESTIEILRTRINE 713
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1665-1909 1.08e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 61.24  E-value: 1.08e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1665 ERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLgnqdktaernlEELQKEIDRM-----------LKELRSK--DLQT 1731
Cdd:TIGR02169  240 EAIERQLASLEEELEKLTEEISELEKRLEEIEQLL-----------EELNKKIKDLgeeeqlrvkekIGELEAEiaSLER 308
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1732 QKEVAEDELVAAEGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEyKNKLDDAWDLLREATDKTR-DANRLSAANQKNMTIL 1810
Cdd:TIGR02169  309 SIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKR-RDKLTEEYAELKEELEDLRaELEEVDKEFAETRDEL 387
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1811 ETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLPPMSEELSDKIDDL-AQEIKDRRLAEKVFQA 1889
Cdd:TIGR02169  388 KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIkKQEWKLEQLAADLSKY 467
                          250       260
                   ....*....|....*....|
gi 117647249  1890 EShaaQLNDSSAVLDGILDE 1909
Cdd:TIGR02169  468 EQ---ELYDLKEEYDRVEKE 484
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1633-2166 1.46e-08

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 61.22  E-value: 1.46e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1633 EGNVNTLVMETNELLTRATKVtadgeQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNQDKTAE-RNLE 1711
Cdd:TIGR01612  721 ELHLSNIENKKNELLDIIVEI-----KKHIHGEINKDLNKILEDFKNKEKELSNKINDYAKEKDELNKYKSKISEiKNHY 795
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1712 ELQKEIDRMLKELRSKDLQTQKE------VAEDELVAA--------EGLLKRVNKLFGEPRAQNEDMEKDLQQ------- 1770
Cdd:TIGR01612  796 NDQINIDNIKDEDAKQNYDKSKEyiktisIKEDEIFKIinemkfmkDDFLNKVDKFINFENNCKEKIDSEHEQfaeltnk 875
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1771 --------KLAEYKNKLDDAWDLLREAT----DKTRDANRLSAANQKnMTILETKKEAIEgSKRQIENTLKEgndILDEA 1838
Cdd:TIGR01612  876 ikaeisddKLNDYEKKFNDSKSLINEINksieEEYQNINTLKKVDEY-IKICENTKESIE-KFHNKQNILKE---ILNKN 950
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1839 NRLLGEINSV-IDYVDDIKTKLppmseelSDKIDDLaqeikdrrlaEKVFQAEShaaqLNDSsavldgildEAKNisfna 1917
Cdd:TIGR01612  951 IDTIKESNLIeKSYKDKFDNTL-------IDKINEL----------DKAFKDAS----LNDY---------EAKN----- 995
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1918 TAAFRAYSNIKDYI------------DEAEKVAREAKELAQGATK--------LATSPQGLLKEDAKgSLQKSFRILNea 1977
Cdd:TIGR01612  996 NELIKYFNDLKANLgknkenmlyhqfDEKEKATNDIEQKIEDANKnipnieiaIHTSIYNIIDEIEK-EIGKNIELLN-- 1072
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1978 KKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTmDKLSAITNDTAAKLQAVKEKAREandtakavLAQVKDLHQN 2057
Cdd:TIGR01612 1073 KEILEEAEINITNFNEIKEKLKHYNFDDFGKEENIKYA-DEINKIKDDIKNLDQKIDHHIKA--------LEEIKKKSEN 1143
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2058 -LDGLKQNYNKLADsVAKTNAVVKDPsknkiiadagtsvRNLEQEADRLIDKLKPIKELEDNLKKNISEIKElINQARKQ 2136
Cdd:TIGR01612 1144 yIDEIKAQINDLED-VADKAISNDDP-------------EEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAE-IEKDKTS 1208
                          570       580       590
                   ....*....|....*....|....*....|....
gi 117647249  2137 ANSIK-VSVSSGGDCVRTYRPEI---KKGSYNNI 2166
Cdd:TIGR01612 1209 LEEVKgINLSYGKNLGKLFLEKIdeeKKKSEHMI 1242
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1708-1884 1.57e-08

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 58.01  E-value: 1.57e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1708 RNLEELQkEIDRMLKELRS--KDLQTQKEVAEDELVAAEGLLKRVNKLFGEPRAQNEDME---KDLQQKLAEYKNKLDDA 1782
Cdd:COG1579     7 RALLDLQ-ELDSELDRLEHrlKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLEleiEEVEARIKKYEEQLGNV 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1783 WD------LLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIdyvddik 1856
Cdd:COG1579    86 RNnkeyeaLQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAEL------- 158
                         170       180
                  ....*....|....*....|....*...
gi 117647249 1857 tklppmsEELSDKIDDLAQEIKDRRLAE 1884
Cdd:COG1579   159 -------EELEAEREELAAKIPPELLAL 179
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1857-2141 1.96e-08

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 58.77  E-value: 1.96e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1857 TKLPPMSEELSDKIDDLAQEIKD--RRLAEKVFQAESHAAQLNDSSAVLDGILDEAKNI-----SFNAtaafraysNIKD 1929
Cdd:COG1340     4 DELSSSLEELEEKIEELREEIEElkEKRDELNEELKELAEKRDELNAQVKELREEAQELrekrdELNE--------KVKE 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1930 YIDEAEKVAREAKELAQGATKLatspqgllKEDAKgSLQKSFRILNEAKKlandvkgnhnDLNDLKTRLETADLRnsglL 2009
Cdd:COG1340    76 LKEERDELNEKLNELREELDEL--------RKELA-ELNKAGGSIDKLRK----------EIERLEWRQQTEVLS----P 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2010 GALNDTMDKLSAITN--DTAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTnavvkdpsKNKi 2087
Cdd:COG1340   133 EEEKELVEKIKELEKelEKAKKALEKNEKLKELRAELKELRKEAEEIHKKIKELAEEAQELHEEMIEL--------YKE- 203
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....
gi 117647249 2088 iADAgtsvrnLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIK 2141
Cdd:COG1340   204 -ADE------LRKEADELHKEIVEAQEKADELHEEIIELQKELRELRKELKKLR 250
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
409-461 2.76e-08

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 52.36  E-value: 2.76e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  409 PCHCDPTGSLSEVCVKDEkyaqrglkpGSCHCKTGFGGVNCDRCVRGYHGYPD 461
Cdd:cd00055     1 PCDCNGHGSLSGQCDPGT---------GQCECKPNTTGRRCDRCAPGYYGLPS 44
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
861-907 3.95e-08

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 51.59  E-value: 3.95e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 117647249   861 CQCNDNLDYSipGSCDSLSGSCLiCKPGTTGRYCELCADGYFGDAVN 907
Cdd:pfam00053    1 CDCNPHGSLS--DTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPSD 44
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1622-2158 4.65e-08

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 59.42  E-value: 4.65e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1622 QRAPERLIQLA------EGNVNTLVMETNEL---LTRATKVTADGEQTgqdAERTNSRAESLEEfikgLVQDAEAINEKA 1692
Cdd:pfam01576   15 QKVKERQQKAEselkelEKKHQQLCEEKNALqeqLQAETELCAEAEEM---RARLAARKQELEE----ILHELESRLEEE 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1693 VQLNETLGNQDKTAERNLEELQKEIDRmlKELRSKDLQTQKevaedelVAAEGLLKRVNKLFGEPRAQNEDMEKdlQQKL 1772
Cdd:pfam01576   88 EERSQQLQNEKKKMQQHIQDLEEQLDE--EEAARQKLQLEK-------VTTEAKIKKLEEDILLLEDQNSKLSK--ERKL 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1773 AEykNKLDDAWDLLREATDKTRDANRLSAANQKNMTILET--KKEA-----IEGSKRQIEntlKEGNDILDEANRLLGEI 1845
Cdd:pfam01576  157 LE--ERISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEErlKKEEkgrqeLEKAKRKLE---GESTDLQEQIAELQAQI 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1846 nsvidyvDDIKTKLPPMSEELSDKIDDLAQEIKDRRLAEK-VFQAESHAAQLN------------------DSSAVLDGI 1906
Cdd:pfam01576  232 -------AELRAQLAKKEEELQAALARLEEETAQKNNALKkIRELEAQISELQedleseraarnkaekqrrDLGEELEAL 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1907 LDEAKNiSFNATAA---FRA-----YSNIKDYIDEaEKVAREAK--ELAQGATKLATSPQGLLKE--DAKGSLQKSFRIL 1974
Cdd:pfam01576  305 KTELED-TLDTTAAqqeLRSkreqeVTELKKALEE-ETRSHEAQlqEMRQKHTQALEELTEQLEQakRNKANLEKAKQAL 382
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1975 -NEAKKLANDVKgnhnDLNDLKTRLETadlRNSGLLGALNDTMDKLSaitnDTaaklqavkEKAR-EANDTAKAVLAQVK 2052
Cdd:pfam01576  383 eSENAELQAELR----TLQQAKQDSEH---KRKKLEGQLQELQARLS----ES--------ERQRaELAEKLSKLQSELE 443
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2053 DLHQNLDGLKQNYNKLADSVAKTNAVVKD-------PSKNKIiaDAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNISE 2125
Cdd:pfam01576  444 SVSSLLNEAEGKNIKLSKDVSSLESQLQDtqellqeETRQKL--NLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLST 521
                          570       580       590
                   ....*....|....*....|....*....|...
gi 117647249  2126 IKELINQARKQANSIKVSVSSGGDCVRTYRPEI 2158
Cdd:pfam01576  522 LQAQLSDMKKKLEEDAGTLEALEEGKKRLQREL 554
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1607-1962 6.08e-08

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 58.43  E-value: 6.08e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1607 GLENTTQELKHLLSPQRAPERLIQLAEgNVNTLVMETNELLTratkvtadgEQTGQDAERTNSRAESLEEFIKGLVQDAE 1686
Cdd:COG5185   227 EIINIEEALKGFQDPESELEDLAQTSD-KLEKLVEQNTDLRL---------EKLGENAESSKRLNENANNLIKQFENTKE 296
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1687 AINEK-----AVQLNETLGNQDKTAERN--LEELQKEIDRMLKELRSKDLQTQKEVAEDElvaaEGLLKRVNKLFGEPR- 1758
Cdd:COG5185   297 KIAEYtksidIKKATESLEEQLAAAEAEqeLEESKRETETGIQNLTAEIEQGQESLTENL----EAIKEEIENIVGEVEl 372
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1759 AQNEDMEKDLQQKLAEYKNKLDDAwdlLREATDKTRDANrlsAANQKNMTILETKKEAIEGSKRQIENTlkegndiLDEA 1838
Cdd:COG5185   373 SKSSEELDSFKDTIESTKESLDEI---PQNQRGYAQEIL---ATLEDTLKAADRQIEELQRQIEQATSS-------NEEV 439
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1839 NRLLGEINSVIDYVDDiktklpPMSEELSDKIDDLAQEIKdRRLAEKVFQAESHAAQLNDSsavLDGILDEAKNISFNAT 1918
Cdd:COG5185   440 SKLLNELISELNKVMR------EADEESQSRLEEAYDEIN-RSVRSKKEDLNEELTQIESR---VSTLKATLEKLRAKLE 509
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 117647249 1919 aafRAYSNIKDYIDEAEKVAREAKELAQGATKLATSPQGLLKED 1962
Cdd:COG5185   510 ---RQLEGVRSKLDQVAESLKDFMRARGYAHILALENLIPASEL 550
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1688-1957 6.13e-08

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 57.92  E-value: 6.13e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1688 INEKAVQLNEtLGNQDKTAERNLEELQKEIDRMLKELrsKDLQTQKEVAEDELVAAEGLLKRVNKLFgepRAQNEDMEKD 1767
Cdd:COG3883    18 IQAKQKELSE-LQAELEAAQAELDALQAELEELNEEY--NELQAELEALQAEIDKLQAEIAEAEAEI---EERREELGER 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1768 LQqklAEYKNKLD-DAWDLLREATDKTRDANRLSAANQknmtiletkkeaiegskrqientlkegndILDEANRLLGEIN 1846
Cdd:COG3883    92 AR---ALYRSGGSvSYLDVLLGSESFSDFLDRLSALSK-----------------------------IADADADLLEELK 139
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1847 SVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKDrrLAEKVFQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSN 1926
Cdd:COG3883   140 ADKAELEAKKAELEAKLAELEALKAELEAAKAE--LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAA 217
                         250       260       270
                  ....*....|....*....|....*....|.
gi 117647249 1927 IKDYIDEAEKVAREAKELAQGATKLATSPQG 1957
Cdd:COG3883   218 AAAAAAAAAAAAAAAAAAAAAAAAAAASAAG 248
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1416-1462 6.47e-08

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 51.16  E-value: 6.47e-08
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 117647249   1416 CQCN--GH-SSQCDPETSVCQnCQHHTAGDFCERCALGYYGIVrglPNDC 1462
Cdd:smart00180    1 CDCDpgGSaSGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGDG---PPGC 46
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1658-2144 8.22e-08

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 58.52  E-value: 8.22e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1658 EQTGQDAERTNSRAESLEEFIKglvqDAEAINEKAVQL-NETLGNQDKTAERNLEELQ-----------KEIDRMLKELR 1725
Cdd:TIGR01612 1229 EKIDEEKKKSEHMIKAMEAYIE----DLDEIKEKSPEIeNEMGIEMDIKAEMETFNIShdddkdhhiisKKHDENISDIR 1304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1726 SKDLQTQKEVAEDELVaaEGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEYK----NKLDDAWDLLREATDKTRDANR-LS 1800
Cdd:TIGR01612 1305 EKSLKIIEDFSEESDI--NDIKKELQKNLLDAQKHNSDINLYLNEIANIYNilklNKIKKIIDEVKEYTKEIEENNKnIK 1382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1801 AANQKNMTILETKKEAI--EGSKRQIENTLkEGNDIldeanrllgeiNSVIDYVDDIKTKLppMSEElsDKIDDLAQEIK 1878
Cdd:TIGR01612 1383 DELDKSEKLIKKIKDDInlEECKSKIESTL-DDKDI-----------DECIKKIKELKNHI--LSEE--SNIDTYFKNAD 1446
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1879 DRRlaEKVFQAESHAAQLNDSSAVldgILDEAKNisfNATAAFRAYSN-IKDYIDEAEKVAREAKElaqgaTKLATSPQG 1957
Cdd:TIGR01612 1447 ENN--ENVLLLFKNIEMADNKSQH---ILKIKKD---NATNDHDFNINeLKEHIDKSKGCKDEADK-----NAKAIEKNK 1513
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1958 LLKEDAKgslQKSFRILNEAKKLA--NDVKGNHNDLNDLKTrlETADLRNSGLLGAlNDTMDKLSAITNDTAAklqaVKE 2035
Cdd:TIGR01612 1514 ELFEQYK---KDVTELLNKYSALAikNKFAKTKKDSEIIIK--EIKDAHKKFILEA-EKSEQKIKEIKKEKFR----IED 1583
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2036 KAREANDTAKAVLaqvkDLHQNLDGLKQNYNKLADSVAKTNAVVKDP-SKNKIIADagTSVRNLEQEADRLIDKLKPIKE 2114
Cdd:TIGR01612 1584 DAAKNDKSNKAAI----DIQLSLENFENKFLKISDIKKKINDCLKETeSIEKKISS--FSIDSQDTELKENGDNLNSLQE 1657
                          490       500       510
                   ....*....|....*....|....*....|...
gi 117647249  2115 LEDNLK---KNISEIKELINQARKQANSIKVSV 2144
Cdd:TIGR01612 1658 FLESLKdqkKNIEDKKKELDELDSEIEKIEIDV 1690
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1622-2131 9.49e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 58.14  E-value: 9.49e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1622 QRAPERLIQLAEgNVNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGN 1701
Cdd:TIGR02168  326 EELESKLDELAE-ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIER 404
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1702 QDKTAER---NLEELQKEI---DRMLKELRSKDLQTQKEVAEDELVAAEGLLKRVN---KLFGEPRAQNEDMEKDLQQKL 1772
Cdd:TIGR02168  405 LEARLERledRRERLQQEIeelLKKLEEAELKELQAELEELEEELEELQEELERLEealEELREELEEAEQALDAAEREL 484
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1773 AEYKNKLDDAWDLLREATDKTRDA-----NR-----------------------LSAANQKNMTILETK-----KEAIEG 1819
Cdd:TIGR02168  485 AQLQARLDSLERLQENLEGFSEGVkallkNQsglsgilgvlselisvdegyeaaIEAALGGRLQAVVVEnlnaaKKAIAF 564
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1820 SK-----------------RQIE----NTLKEGNDILDEANRLLG---EINSVIDY-------VDDIKTKLppmseELSD 1868
Cdd:TIGR02168  565 LKqnelgrvtflpldsikgTEIQgndrEILKNIEGFLGVAKDLVKfdpKLRKALSYllggvlvVDDLDNAL-----ELAK 639
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1869 KID-----------------------------DLAQEIKDRRLAEKVFQAESHAAQLndsSAVLDGILDEAKNISFNATA 1919
Cdd:TIGR02168  640 KLRpgyrivtldgdlvrpggvitggsaktnssILERRREIEELEEKIEELEEKIAEL---EKALAELRKELEELEEELEQ 716
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1920 AFRAYSNIKDYIDEAEKVAREAKELAQGATKLATSPQGLLKE------DAKGSLQKSFRILNEAKKLANDVKGnhnDLND 1993
Cdd:TIGR02168  717 LRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTEleaeieELEERLEEAEEELAEAEAEIEELEA---QIEQ 793
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1994 LKTRLETADLRNSGLLGALNDT----------MDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQ 2063
Cdd:TIGR02168  794 LKEELKALREALDELRAELTLLneeaanlrerLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELES 873
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 117647249  2064 NYNKLADSVAKTNAVVKDPSKNKI-----IADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELIN 2131
Cdd:TIGR02168  874 ELEALLNERASLEEALALLRSELEelseeLRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLS 946
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1665-2141 1.01e-07

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 58.26  E-value: 1.01e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1665 ERTNSRAESLEEFIKGLVQDAEAIN----EKAVQ--------------LNETLGNQDKTAE--RNLEELQKEIDRMLKEL 1724
Cdd:pfam01576  534 EEDAGTLEALEEGKKRLQRELEALTqqleEKAAAydklektknrlqqeLDDLLVDLDHQRQlvSNLEKKQKKFDQMLAEE 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1725 R--SKDLQTQKEVAE-----------------DELVAAEGLLKRVNKLFgepRAQNEDM--EKD---------------L 1768
Cdd:pfam01576  614 KaiSARYAEERDRAEaeareketralslaralEEALEAKEELERTNKQL---RAEMEDLvsSKDdvgknvhelerskraL 690
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1769 QQKLAEYKNKLDDAWDLLREATD-KTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINS 1847
Cdd:pfam01576  691 EQQVEEMKTQLEELEDELQATEDaKLRLEVNMQALKAQFERDLQARDEQGEEKRRQLVKQVRELEAELEDERKQRAQAVA 770
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1848 VidyvddiKTKLPPMSEELSDKIDDlAQEIKDRRLAE-KVFQAE--SHAAQLNDSSAVLDGILDEAKnisfnataafray 1924
Cdd:pfam01576  771 A-------KKKLELDLKELEAQIDA-ANKGREEAVKQlKKLQAQmkDLQRELEEARASRDEILAQSK------------- 829
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1925 snikdyidEAEKVAR--EAkELAQGATKLATSpqgllkEDAKGSLQKSFRILneAKKLANDVKGNhNDLNDLKTRLET-- 2000
Cdd:pfam01576  830 --------ESEKKLKnlEA-ELLQLQEDLAAS------ERARRQAQQERDEL--ADEIASGASGK-SALQDEKRRLEAri 891
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2001 ADLRN-----SGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREaNDTAKAVLA-QVKDLH---QNLDG-----LKQNYN 2066
Cdd:pfam01576  892 AQLEEeleeeQSNTELLNDRLRKSTLQVEQLTTELAAERSTSQK-SESARQQLErQNKELKaklQEMEGtvkskFKSSIA 970
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 117647249  2067 KLADSVAKTNAVVKDPSKNKIIadAGTSVRNLEQeadrlidKLKPIKELEDNLKKNISEIKElinQARKQANSIK 2141
Cdd:pfam01576  971 ALEAKIAQLEEQLEQESRERQA--ANKLVRRTEK-------KLKEVLLQVEDERRHADQYKD---QAEKGNSRMK 1033
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1645-2141 1.41e-07

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 57.27  E-value: 1.41e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1645 ELLTRATKVTADGEQTgQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNQDKTAERNLEELQKEIDRMLKEL 1724
Cdd:COG5185    17 EGNANKELIEILLESS-KSEGKTLVFITILFFPLGISRDSLRVTLRSVINVLDGLNYQNDVKKSESSVKARKFLKEKKLD 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1725 RSKDLQTQKEVA-----EDELVAAEGLLKRVNKLfgeprAQNEDMEKDLQQKLAEYKNklDDAwDLLREATDKTRDANRL 1799
Cdd:COG5185    96 TKILQEYVNSLIklpnyEWSADILISLLYLYKSE-----IVALKDELIKVEKLDEIAD--IEA-SYGEVETGIIKDIFGK 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1800 SAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDY------------------VDDIKTKLPP 1861
Cdd:COG5185   168 LTQELNQNLKKLEIFGLTLGLLKGISELKKAEPSGTVNSIKESETGNLGSEStllekakeiinieealkgFQDPESELED 247
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1862 MsEELSDKIDDLAQEIKDRRLAEKVFQAEShaaqlndssavldgildeAKNISFNATAAFRAYSNIKDYIDEAEKVAREA 1941
Cdd:COG5185   248 L-AQTSDKLEKLVEQNTDLRLEKLGENAES------------------SKRLNENANNLIKQFENTKEKIAEYTKSIDIK 308
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1942 KELAQGAtklatspqgllKEDAKGSLQKSF--RILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGA-------- 2011
Cdd:COG5185   309 KATESLE-----------EQLAAAEAEQELeeSKRETETGIQNLTAEIEQGQESLTENLEAIKEEIENIVGEvelsksse 377
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2012 -LNDTMDKLSAITNDTAAKLQAVKEKAREA----NDTAKAVLAQVKDLHQNLDGLKQNY-------NKLADSVAKTNAVV 2079
Cdd:COG5185   378 eLDSFKDTIESTKESLDEIPQNQRGYAQEIlatlEDTLKAADRQIEELQRQIEQATSSNeevskllNELISELNKVMREA 457
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 117647249 2080 KDPSKNKIIADAGTSVRNLEQEADRLIDKLKPIK-----------ELEDNLKKNISEIKELINQarkQANSIK 2141
Cdd:COG5185   458 DEESQSRLEEAYDEINRSVRSKKEDLNEELTQIEsrvstlkatleKLRAKLERQLEGVRSKLDQ---VAESLK 527
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1688-2166 2.00e-07

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 57.53  E-value: 2.00e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1688 INEKAVQLNETLGNQDKTAERNLEELQKEIDR------------MLKELRSKDLQTQKEVAEDELVAAEGLLKRVNKL-- 1753
Cdd:PTZ00440 1764 MKEKLDELSKMCKQQYNIVDEGYNYIKKKIEYiktlndennlsdSLNQAEDKNKEVANLTHYTNKNEAKNLLGHVVKSan 1843
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1754 ---------FGEPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEaIEGSKRQI 1824
Cdd:PTZ00440 1844 figikimtgLQPTELTPDASLETAPELTFESENNSDLELDHLSSNKNELDVYKNIQDAYKSSLQILKYSDD-IDKKQRDC 1922
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1825 ENTLKEGNDI---LDEANRLLGEINSVIDYVDDIktklppmseelSDKIDDLAQEIKdrrlaekvfqaESHAAQLNDSSa 1901
Cdd:PTZ00440 1923 NKLVEDGNEIylkSTAINELKNMINSVKNKESAI-----------SNKIDNVSNKLS-----------ELNKITCNDES- 1979
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1902 vLDGILDEAKNISFNataafRAYSNikdYIDEAEKVAREAKelAQGATKLATSPQGLLKEdakgsLQKSFRILNEAKKLA 1981
Cdd:PTZ00440 1980 -YDEILEKEEYEELK-----DLRNS---FNQEKAETLNNLK--LNKIKEDFNSYKNLLDE-----LEKSVKTLKASENIK 2043
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1982 NDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNdtaAKLQAVKEK-AREANDTAKAVLAQVKDLHQNLDG 2060
Cdd:PTZ00440 2044 KIVENKKTSIDAINTNIEDIEKEIESINPSLDELLKKGHKIEI---SRYTSIIDNvQTKISNDSKNINDIEKKAQIYLAY 2120
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2061 LKQNYNKLADSVAKTNAVVK-------DPSK----NKIIAD---AGTSVRNL-------------EQEADRLIDKLKPIK 2113
Cdd:PTZ00440 2121 IKNNYNSIKKDISTLNEYFDekqvsnyILTNidkaNKLSSElseAVTNSEEIienikkeiieineNTEMNTLENTADKLK 2200
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 117647249 2114 ELEDNLKKNiseiKELINQARKQANSIKVSvssggdcvrtyrpEIKKGS--YNNI 2166
Cdd:PTZ00440 2201 ELYENLKKK----KNIINNIYKKINFIKLQ-------------EIENSSekYNDI 2238
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1672-2108 2.02e-07

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 56.77  E-value: 2.02e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1672 ESLEEFikglvqdaEAINEKAVQL-NETLGNqdktAERNLEELQKEIDRMlkelRSKDLQTQKEVAEDELVAAEGLLKRV 1750
Cdd:PRK04778   61 QSEEKF--------EEWRQKWDEIvTNSLPD----IEEQLFEAEELNDKF----RFRKAKHEINEIESLLDLIEEDIEQI 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1751 NKLFGEPRAQNEDMEKDLQQKLAEYKNklddawdlLReatdKTRDANRLS--AANQKnmtiLETKKEAIEGSKRQIENTL 1828
Cdd:PRK04778  125 LEELQELLESEEKNREEVEQLKDLYRE--------LR----KSLLANRFSfgPALDE----LEKQLENLEEEFSQFVELT 188
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1829 KEGN-----DILDEANRLLGEINsviDYVDDIktklPPMSEELSDKIDDLAQEIKD--RRLAEK-----VFQAESHAAQL 1896
Cdd:PRK04778  189 ESGDyvearEILDQLEEELAALE---QIMEEI----PELLKELQTELPDQLQELKAgyRELVEEgyhldHLDIEKEIQDL 261
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1897 NDS-SAVLDGI----LDEAKNIsfNAtaafraysNIKDYID------EAE-----KVAREAKELAQGATKLATSPQGLLK 1960
Cdd:PRK04778  262 KEQiDENLALLeeldLDEAEEK--NE--------EIQERIDqlydilEREvkarkYVEKNSDTLPDFLEHAKEQNKELKE 331
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1961 EDAkgSLQKSFRI----LNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITN---DTAAKLQAV 2033
Cdd:PRK04778  332 EID--RVKQSYTLneseLESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKeqeKLSEMLQGL 409
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2034 KEKAREANDTAKAVlaqVKDLH--------QNLDGLKQNY-NKLADSVAKTNAVVKDPSKNKI--------IADAGTSVR 2096
Cdd:PRK04778  410 RKDELEAREKLERY---RNKLHeikrylekSNLPGLPEDYlEMFFEVSDEIEALAEELEEKPInmeavnrlLEEATEDVE 486
                         490
                  ....*....|..
gi 117647249 2097 NLEQEADRLIDK 2108
Cdd:PRK04778  487 TLEEETEELVEN 498
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1659-2110 2.04e-07

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 57.14  E-value: 2.04e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1659 QTGQDAERTnsRAESLEEFI-KGLVQDAEAINEKAVQLNETLGNQD--KTAERNLEELQKEIDRMLKELRSKD------- 1728
Cdd:pfam10174  344 QTEVDALRL--RLEEKESFLnKKTKQLQDLTEEKSTLAGEIRDLKDmlDVKERKINVLQKKIENLQEQLRDKDkqlaglk 421
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1729 -----LQTQKEVAEDELVAAEGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLDDAWD----LLREATDKTRDANRL 1799
Cdd:pfam10174  422 ervksLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELESLKKENKDLKEkvsaLQPELTEKESSLIDL 501
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1800 --SAANQ-----------KNMTI-LETKKEaiEGSKrqIENTLKEGNDILDEANrllgeinsvidyvddiktklppMSEE 1865
Cdd:pfam10174  502 keHASSLassglkkdsklKSLEIaVEQKKE--ECSK--LENQLKKAHNAEEAVR----------------------TNPE 555
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1866 LSDKIDDLAQEIKdrRLAEkvfqaESHAAQlndsSAV--LDGILDEAKNisfnataafraYSNIKD-YIDEAEKVA-REA 1941
Cdd:pfam10174  556 INDRIRLLEQEVA--RYKE-----ESGKAQ----AEVerLLGILREVEN-----------EKNDKDkKIAELESLTlRQM 613
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1942 KELAQGATKLATSPQGLLKEDAKgslqksfrILNEAKKLANDVKGNHndlndLKTRLETadlrnsgLLGALNDTMDKLSA 2021
Cdd:pfam10174  614 KEQNKKVANIKHGQQEMKKKGAQ--------LLEEARRREDNLADNS-----QQLQLEE-------LMGALEKTRQELDA 673
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2022 ITNDTAAKLQAVKEKAREANdtaKAVLAQVKDLHQNLDgLKQNYNKLADSVAKTN-AVVK-DPSKNKIIADagtSVRNLE 2099
Cdd:pfam10174  674 TKARLSSTQQSLAEKDGHLT---NLRAERRKQLEEILE-MKQEALLAAISEKDANiALLElSSSKKKKTQE---EVMALK 746
                          490
                   ....*....|.
gi 117647249  2100 QEADRLIDKLK 2110
Cdd:pfam10174  747 REKDRLVHQLK 757
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
410-462 2.24e-07

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 49.62  E-value: 2.24e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*
gi 117647249    410 CHCDPTGSLSEVCVKDEkyaqrglkpGSCHCKTGFGGVNCDRCVRGYHG--YPDC 462
Cdd:smart00180    1 CDCDPGGSASGTCDPDT---------GQCECKPNVTGRRCDRCAPGYYGdgPPGC 46
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
464-512 2.84e-07

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 49.27  E-value: 2.84e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  464 PCNCSGLGSTNE--DPCVGPCSCKENVEGEDCSRCKSGFFNLQEDNQkGCE 512
Cdd:cd00055     1 PCDCNGHGSLSGqcDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGG-GCQ 50
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1762-2110 3.31e-07

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 54.92  E-value: 3.31e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1762 EDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRD-ANRLSAANQKNMTILETKKEAIEgSKRQIENTLKEGNDILDEANR 1840
Cdd:COG1340     7 SSSLEELEEKIEELREEIEELKEKRDELNEELKElAEKRDELNAQVKELREEAQELRE-KRDELNEKVKELKEERDELNE 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1841 llgEINSVIDYVDDIKTKLPPMSEELSDkIDDLAQEIkdRRLaEKVFQaeshaaqlndsSAVLDgiLDEAKNIsfnataa 1920
Cdd:COG1340    86 ---KLNELREELDELRKELAELNKAGGS-IDKLRKEI--ERL-EWRQQ-----------TEVLS--PEEEKEL------- 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1921 fraysnIKDyIDEAEKVAREAKELAQGATKLATspqgLLKEdakgslqksfriLNEAKKLANDVkgnHNDLNDLktrlet 2000
Cdd:COG1340   139 ------VEK-IKELEKELEKAKKALEKNEKLKE----LRAE------------LKELRKEAEEI---HKKIKEL------ 186
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2001 adlrnSGLLGALNDTMDKLSAitndtaaKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVK 2080
Cdd:COG1340   187 -----AEEAQELHEEMIELYK-------EADELRKEADELHKEIVEAQEKADELHEEIIELQKELRELRKELKKLRKKQR 254
                         330       340       350
                  ....*....|....*....|....*....|
gi 117647249 2081 DPSKNKIIADagtsvrnLEQEADRLIDKLK 2110
Cdd:COG1340   255 ALKREKEKEE-------LEEKAEEIFEKLK 277
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
1682-1881 3.36e-07

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 53.60  E-value: 3.36e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1682 VQDAEA-INEKAVQLNETLGNQDktaERNLEELQKEIDRMLKEL-----RSKDLQTQ-KEVAEDELVAAEGLLKRVNKLf 1754
Cdd:cd00176     9 ADELEAwLSEKEELLSSTDYGDD---LESVEALLKKHEALEAELaaheeRVEALNELgEQLIEEGHPDAEEIQERLEEL- 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1755 gepraqnEDMEKDLQQKLAEYKNKLDDAWDLLRE----------ATDKTRDANRLSAAnqKNMTILETKKEAIEGSKRQI 1824
Cdd:cd00176    85 -------NQRWEELRELAEERRQRLEEALDLQQFfrdaddleqwLEEKEAALASEDLG--KDLESVEELLKKHKELEEEL 155
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 117647249 1825 ENTLKEGNDILDEANRLLGEINSviDYVDDIKTKLppmsEELSDKIDDLAQEIKDRR 1881
Cdd:cd00176   156 EAHEPRLKSLNELAEELLEEGHP--DADEEIEEKL----EELNERWEELLELAEERQ 206
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1929-2142 3.79e-07

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 55.29  E-value: 3.79e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1929 DYIDEAEKVAREAKELAQGATKLATSpqgLLKEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETAD---LRN 2005
Cdd:COG4372     2 DRLGEKVGKARLSLFGLRPKTGILIA---ALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARselEQL 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2006 SGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAktnavvkdpSKN 2085
Cdd:COG4372    79 EEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIA---------ERE 149
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 117647249 2086 KIIADAGTSVRNLEQEADRLIDKLKPIKElednlKKNISEIKELINQARKQANSIKV 2142
Cdd:COG4372   150 EELKELEEQLESLQEELAALEQELQALSE-----AEAEQALDELLKEANRNAEKEEE 201
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
753-795 4.16e-07

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 48.85  E-value: 4.16e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 117647249    753 CQCF---AHAEACDDITGECLnCKDHTGGPYCNECLPGFYGDPTRG 795
Cdd:smart00180    1 CDCDpggSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGDGPPG 45
growth_prot_Scy NF041483
polarized growth protein Scy;
1613-1980 5.70e-07

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 55.60  E-value: 5.70e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1613 QELKHLLSPQRaperliQLAEGNVNTLVMETNELLTRatkvtadgeqTGQDAER--TNSRAESlEEFIKGLVQDAEAINE 1690
Cdd:NF041483  130 QQLDQELAERR------QTVESHVNENVAWAEQLRAR----------TESQARRllDESRAEA-EQALAAARAEAERLAE 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1691 KAVQlneTLGNQDKTAERNLEEL----QKEIDRMLKELrskdlQTQKEVAEDElvaAEGLLKRVNKLFGEPRAQNEDMEK 1766
Cdd:NF041483  193 EARQ---RLGSEAESARAEAEAIlrraRKDAERLLNAA-----STQAQEATDH---AEQLRSSTAAESDQARRQAAELSR 261
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1767 DLQQKLAEYKNKLD----DAWDLLREAtdKTRDANRLSAANQKNMTILETKKEAIegsKRQIENTLKEGNDILDEANRLL 1842
Cdd:NF041483  262 AAEQRMQEAEEALRearaEAEKVVAEA--KEAAAKQLASAESANEQRTRTAKEEI---ARLVGEATKEAEALKAEAEQAL 336
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1843 GeinsvidyvdDIKTKLPPMSEELSDKIDDLAqeikdrrlaekvfqAESHAAQLNDSSAVLDGILDEAkniSFNATAAFR 1922
Cdd:NF041483  337 A----------DARAEAEKLVAEAAEKARTVA--------------AEDTAAQLAKAARTAEEVLTKA---SEDAKATTR 389
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 117647249 1923 AYSnikdyiDEAEKVAREAK----ELAQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKKL 1980
Cdd:NF041483  390 AAA------EEAERIRREAEaeadRLRGEAADQAEQLKGAAKDDTKEYRAKTVELQEEARRL 445
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1810-2144 6.35e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 55.46  E-value: 6.35e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1810 LETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKD---------R 1880
Cdd:TIGR02169  686 LKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENvkselkeleA 765
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1881 RLAEKVFQAESHAAQLNDSSAVLDGilDEAKNISfnataafRAYSNIKDYIDEAEKVAREAKElaqgatklATSPQGLLK 1960
Cdd:TIGR02169  766 RIEELEEDLHKLEEALNDLEARLSH--SRIPEIQ-------AELSKLEEEVSRIEARLREIEQ--------KLNRLTLEK 828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1961 EDAKGSLQKSFRILNEAKklaNDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTA---AKLQAVKEKA 2037
Cdd:TIGR02169  829 EYLEKEIQELQEQRIDLK---EQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDeleAQLRELERKI 905
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2038 REANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAktnAVVKDPSKNKIIADAGTSVRNLE--------------QEAD 2103
Cdd:TIGR02169  906 EELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKG---EDEEIPEEELSLEDVQAELQRVEeeiralepvnmlaiQEYE 982
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 117647249  2104 RLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSV 2144
Cdd:TIGR02169  983 EVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEA 1023
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1626-2181 6.86e-07

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 55.61  E-value: 6.86e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1626 ERLIQLAEGNVNTLVMETNELLTRATKVtadgeqtgQDAERTNSRAESLEEFIKGLVQDAEAInEKAVQlnetlgNQDKT 1705
Cdd:PTZ00440  149 EPLNEEIIKNIEQCLGNKNDLDNLIIVL--------ENPEKYNVRKTLYDEKFNEYKNKKEAF-YNCLK------NKKED 213
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1706 AERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELvaaEGLLKRVNKLfgepraqnedMEKDLQQKLAEYKNKLDDAWDL 1785
Cdd:PTZ00440  214 YDKKIKKINNEIRKLLKNIKCTGNMCKTDTYVDMV---ELYLLRVNEV----------PSNNYDNYLNRAKELLESGSDL 280
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1786 LREATDKTRDANRLSAanqknmtiletkkeaiegskrqIENTLKEGNDILDEANRLLGEINSVIDYVDDIKT-KLPPmse 1864
Cdd:PTZ00440  281 INKIKKELGDNKTIYS----------------------INFIQEEIGDIIKRYNFHLKKIEKGKEYIKRIQNnNIPP--- 335
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1865 elsdkiddlaqEIKDRRLAEKVFQAESHAAQL---NDSSAVLDGILDEAKNISFNATAAFRAY--SNIKDYID------E 1933
Cdd:PTZ00440  336 -----------QVKKDELKKKYFESAKHYASFkfsLEMLSMLDSLLIKKEKILNNLFNKLFGDlkEKIETLLDseyfisK 404
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1934 AEKVAREAKELAQGATKLATSPQGLLKEDAKGS------LQKSF--------RILNEAKKLANDVKGNHNDLNDLKTRLE 1999
Cdd:PTZ00440  405 YTNIISLSEHTLKAAEDVLKENSQKIADYALYSnleiieIKKKYdekinelkKSINQLKTLISIMKSFYDLIISEKDSMD 484
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2000 TADLRNSgllgALNDTMDKLSAITNdtaaKLQAVKEKAREANDTAKAV---LAQVKDLHQNLDGLKQNYNKLADSVakTN 2076
Cdd:PTZ00440  485 SKEKKES----SDSNYQEKVDELLQ----IINSIKEKNNIVNNNFKNIedyYITIEGLKNEIEGLIELIKYYLQSI--ET 554
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2077 AVVKDPSKNKIIADAGTSVRNLEQEadrlIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGDCVRTYRP 2156
Cdd:PTZ00440  555 LIKDEKLKRSMKNDIKNKIKYIEEN----VDHIKDIISLNDEIDNIIQQIEELINEALFNKEKFINEKNDLQEKVKYILN 630
                         570       580
                  ....*....|....*....|....*
gi 117647249 2157 EIKKGSYNNIVVHVKTAVADNLLFY 2181
Cdd:PTZ00440  631 KFYKGDLQELLDELSHFLDDHKYLY 655
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1626-1913 9.36e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 54.64  E-value: 9.36e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1626 ERLIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNetlgNQDKT 1705
Cdd:TIGR04523  460 DNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLE----SEKKE 535
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1706 AERNLEELQKEIDRMLKELRSKDLQT-----QKEVaeDELVAAEGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLd 1780
Cdd:TIGR04523  536 KESKISDLEDELNKDDFELKKENLEKeidekNKEI--EELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKI- 612
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1781 daWDLLREATdKTRDANRLSAANQKNmtiLETKKEAIEGSKRQIENTLKEgndILDEANRLLGEINSVIDYVDDIkTKLp 1860
Cdd:TIGR04523  613 --SSLEKELE-KAKKENEKLSSIIKN---IKSKKNKLKQEVKQIKETIKE---IRNKWPEIIKKIKESKTKIDDI-IEL- 681
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 117647249  1861 pMSEELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLNDSSAVLDGILDEAKNI 1913
Cdd:TIGR04523  682 -MKDWLKELSLHYKKYITRMIRIKDLPKLEEKYKEIEKELKKLDEFSKELENI 733
COG5022 COG5022
Myosin heavy chain [General function prediction only];
1630-2130 1.23e-06

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 54.70  E-value: 1.23e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1630 QLAEGNVNTLVMETNELLTRATKVTadgeQTGQDAERTNSraESLEEFI---KGLVQDAEAINEKAVQLNET-LGNQDKT 1705
Cdd:COG5022   888 KIDVKSISSLKLVNLELESEIIELK----KSLSSDLIENL--EFKTELIarlKKLLNNIDLEEGPSIEYVKLpELNKLHE 961
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1706 AERNLEELQKEIDRMLKELrskdlqtqkevaedelvaaegllkrvNKLFGEPRAQNEDMEKdLQQKLAEYKNKLDDawdl 1785
Cdd:COG5022   962 VESKLKETSEEYEDLLKKS--------------------------TILVREGNKANSELKN-FKKELAELSKQYGA---- 1010
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1786 LREATDKTRDANRLSAANQKNMTILETkkeaiEGSKRQIENTLKE-GNDILDEANRLLGEINSVIDYVDDIKTKLPPMSE 1864
Cdd:COG5022  1011 LQESTKQLKELPVEVAELQSASKIISS-----ESTELSILKPLQKlKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQ 1085
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1865 ELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLNDSSAVLDGILDEAKniSFNAtaafRAYSNIKDYideaeKVAREAKEL 1944
Cdd:COG5022  1086 LESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEIS--KFLS----QLVNTLEPV-----FQKLSVLQL 1154
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1945 AQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKKLANdvKGNHNDLNDLKTRLET-ADLRNSGLLgaLNDTMDKLSAIT 2023
Cdd:COG5022  1155 ELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKS--KLSSSEVNDLKNELIAlFSKIFSGWP--RGDKLKKLISEG 1230
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2024 NDTAAKLQAVKE-------KAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNKIIADA----G 2092
Cdd:COG5022  1231 WVPTEYSTSLKGfnnlnkkFDTPASMSNEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYINVGLFNALRTKAsslrW 1310
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|...
gi 117647249 2093 TSVRNLEQEADRLIDKLKPIK------ELEDN---------LKKNISEIKELI 2130
Cdd:COG5022  1311 KSATEVNYNSEELDDWCREFEisdvdeELEELiqavkvlqlLKDDLNKLDELL 1363
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
1663-1888 1.80e-06

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 52.76  E-value: 1.80e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1663 DAERTNSRaESLEEFIKGLVQDAEAINEKAVQLNETLG---NQDKTAERNLEELQKEIDRMLKElrSKDLQTQKEVAEde 1739
Cdd:cd22656   109 DEELEEAK-KTIKALLDDLLKEAKKYQDKAAKVVDKLTdfeNQTEKDQTALETLEKALKDLLTD--EGGAIARKEIKD-- 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1740 lvaaegLLKRVnklfgepraqnedmeKDLQQKLA-EYKNKLDDAWDLLREATDKTRDANRLSAanqknmtiletkkeAIE 1818
Cdd:cd22656   184 ------LQKEL---------------EKLNEEYAaKLKAKIDELKALIADDEAKLAAALRLIA--------------DLT 228
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1819 GSKRQIENTLkegnDILDEAnrlLGEINSVIDYVDDIKTKLppmsEELSDKIDDLAQEIKDRRLAEKVFQ 1888
Cdd:cd22656   229 AADTDLDNLL----ALIGPA---IPALEKLQGAWQAIATDL----DSLKDLLEDDISKIPAAILAKLELE 287
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1585-2146 2.09e-06

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 54.07  E-value: 2.09e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1585 LEQMTMNINLTGplpapYKILYGLENTtqELKHLLSPQRAPErLIQLAEGNVN----TLVMETNEL-----LTRATKVTA 1655
Cdd:PTZ00440 1835 LGHVVKSANFIG-----IKIMTGLQPT--ELTPDASLETAPE-LTFESENNSDleldHLSSNKNELdvyknIQDAYKSSL 1906
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1656 DGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNQDKTAER--NLEELQKEIDRMLKELRSKDLQTQK 1733
Cdd:PTZ00440 1907 QILKYSDDIDKKQRDCNKLVEDGNEIYLKSTAINELKNMINSVKNKESAISNKidNVSNKLSELNKITCNDESYDEILEK 1986
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1734 EVAEDelvaaeglLKRVNKLFGEPRAQ--NEDMEKDLQQKLAEYKNKLDD----AWDLLREATDKTRDANRLSAAN--QK 1805
Cdd:PTZ00440 1987 EEYEE--------LKDLRNSFNQEKAEtlNNLKLNKIKEDFNSYKNLLDEleksVKTLKASENIKKIVENKKTSIDaiNT 2058
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1806 NMTILETKKEAIEGSkrqIENTLKEGNDIldEANRLlgeiNSVIDyvddiktklppmseELSDKIDDLAQEIKDRrlaEK 1885
Cdd:PTZ00440 2059 NIEDIEKEIESINPS---LDELLKKGHKI--EISRY----TSIID--------------NVQTKISNDSKNINDI---EK 2112
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1886 VFQAESHAAQLNDSSAVLD------------------GILDEAKNISFNATAAFRAYSNI-------------KDYIDEA 1934
Cdd:PTZ00440 2113 KAQIYLAYIKNNYNSIKKDistlneyfdekqvsnyilTNIDKANKLSSELSEAVTNSEEIienikkeiieineNTEMNTL 2192
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1935 EKVAREAKELAQGATKLATSPQGLLK-------EDAKGSLQKSFRIlneAKKLANDVKGNHNDLNDLKTRLETADLRNSG 2007
Cdd:PTZ00440 2193 ENTADKLKELYENLKKKKNIINNIYKkinfiklQEIENSSEKYNDI---SKLFNNVVETQKKKLLDNKNKINNIKDKIND 2269
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2008 LLGALND-----TMDKLSAITNDTAAKLQAVKEkAREANDTAKAVLAQVKDLHQNLDGLKQNYNKL---------ADSVA 2073
Cdd:PTZ00440 2270 KEKELINvdssfTLESIKTFNEIYDDIKSNIGD-LYKLEDTNNDELKKVKLYIENITHLLNRINTLindldnyqdENYGK 2348
                         570       580       590       600       610       620       630
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 117647249 2074 KTNAVVKDPSKNKIIaDAGTSVRNLEQEADRLIDKLKpikelEDNLKKNISEIKELINQARKQANSIKVSVSS 2146
Cdd:PTZ00440 2349 DKNIELNNENNSYII-KTKEKINNLKEEFSKLLKNIK-----RNNTLCNNNNIKDFISNIGKSVETIKQRFSS 2415
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
802-859 2.19e-06

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 46.96  E-value: 2.19e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  802 PCACPLNIpsnNFSPTCHLDrslGLICdECPIGYTGPRCERCAEGYFGQPSIPGGsCQ 859
Cdd:cd00055     1 PCDCNGHG---SLSGQCDPG---TGQC-ECKPNTTGRRCDRCAPGYYGLPSQGGG-CQ 50
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
283-330 2.41e-06

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 46.58  E-value: 2.41e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  283 CICYGHA---RACplDPATnkSRCECEHNTCGESCDRCCPGFHQKPWRAGT 330
Cdd:cd00055     2 CDCNGHGslsGQC--DPGT--GQCECKPNTTGRRCDRCAPGYYGLPSQGGG 48
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
803-858 4.63e-06

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 45.81  E-value: 4.63e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249   803 CACPLNIPSnnfSPTCHldrSLGLICDeCPIGYTGPRCERCAEGYFGQPSIPGGSC 858
Cdd:pfam00053    1 CDCNPHGSL---SDTCD---PETGQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
alph_xenorhab_A NF033928
alpha-xenorhabdolysin family binary toxin subunit A; Alpha-xenorhabdolysin was the founding ...
1664-1944 5.07e-06

alpha-xenorhabdolysin family binary toxin subunit A; Alpha-xenorhabdolysin was the founding member of a family of alpha-helical pore-forming binary toxins. YaxAB from Yersinia enterocolitica has been studied structurally. This HMM represents subunit A proteins such as XaxA and YaxA, capable of binding to the membrane even in the absence of the B subunit. This family is related to the Bacillus haemolytic enterotoxin family (see PF05791.9), although thresholds for this HMM are set to exclude that family.


Pssm-ID: 468250 [Multi-domain]  Cd Length: 340  Bit Score: 51.53  E-value: 5.07e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1664 AERTNSRAESLEEFIKGLvQDAEAINEKAVQLNE----TLGNQDKTAERN----LEELQKEIDRMLKELR--SKDLQT-Q 1732
Cdd:NF033928   79 ARNIVVTGNPIIDLINEM-PIIKRGDLTEEELSElppiPLSSDDKEIVKElkeiLEDLKNDIKDYQQKADdvKKELDDfE 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1733 KEVAEDELVAAEGLLKRVNKLFGEP-----RAQNEDMEKDLQQKLAEYKNKLDDAWdllreatdktrdanrlSAANQKN- 1806
Cdd:NF033928  158 NDLREELLPQLKLKKKLYDDNLGSDsieelREKIDQLEKEIEQLNKEYDDYVKLSF----------------TGLAGGPi 221
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1807 -----MTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKlppMSEELS--DKIDDLAQEIkd 1879
Cdd:NF033928  222 glaitGGIFGSKAEKIRKEKNALIQEIDELQEQLKKKNALLGSLERLQTSLDDILTR---MEDALPalKKLKGVWQSL-- 296
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 117647249 1880 rrlaekvfqaeshAAQLNDSSAVLDGILDEAKNISFNAtaafraysNIKDYIDEAEKVAREAKEL 1944
Cdd:NF033928  297 -------------LTDIDSSINALKEIDDADSLRLFKL--------EFEQVIAPWKEIQDYAKQL 340
Apolipoprotein pfam01442
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ...
1608-1779 5.81e-06

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.


Pssm-ID: 460211 [Multi-domain]  Cd Length: 175  Bit Score: 49.18  E-value: 5.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1608 LENTTQELKHLLSP--QRAPERLIQLAEGNVNTLVMETNELLTRATKVTADGEQT-GQDAERTNSRAESL-EEFIKGLVQ 1683
Cdd:pfam01442    9 LSTYAEELQEQLGPvaQELVDRLEKETEALRERLQKDLEEVRAKLEPYLEELQAKlGQNVEELRQRLEPYtEELRKRLNA 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1684 DAEAINEKAVQLNETLgnqDKTAERNLEELQKEIDRMLKELRSKdlqtqkeVAEdelvAAEGLLKRVNKLFGEPRAQNED 1763
Cdd:pfam01442   89 DAEELQEKLAPYGEEL---RERLEQNVDALRARLAPYAEELRQK-------LAE----RLEELKESLAPYAEEVQAQLSQ 154
                          170
                   ....*....|....*.
gi 117647249  1764 MEKDLQQKLAEYKNKL 1779
Cdd:pfam01442  155 RLQELREKLEPQAEDL 170
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1608-1887 5.89e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 51.44  E-value: 5.89e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1608 LENTTQELKHLLSPQRAPERLIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEA 1687
Cdd:COG4372    82 LEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLES 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1688 INEKAVQLNETLGNQDKT-AERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVA----------AEGLLKRVNKLFGE 1756
Cdd:COG4372   162 LQEELAALEQELQALSEAeAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAeelleakdslEAKLGLALSALLDA 241
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1757 PRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKnmtILETKKEAIEGSKRQIENTLKEGNDILD 1836
Cdd:COG4372   242 LELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEE---AALELKLLALLLNLAALSLIGALEDALL 318
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 117647249 1837 EANRLLGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKDRRLAEKVF 1887
Cdd:COG4372   319 AALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVAD 369
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1635-2146 6.08e-06

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 52.53  E-value: 6.08e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1635 NVNTLVMETNELLTRATKVTADGEQTGQDAERTNSraESLE-------EFIKGLVQDAEAINEKAVQLNETLGNQDKTaE 1707
Cdd:PTZ00440 1549 NILNIKDEIETILNKAQDLMRDISKISKIVENKNL--ENLNdkeadyvKYLDNILKEKQLMEAEYKKLNEIYSDVDNI-E 1625
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1708 RNLEELQK--EIDRM--LKELrSKDLQTQKEVAEDELVAaegLLKRVNKLFgepraQNEDMEK-DLQQKLAEYKNKLDDA 1782
Cdd:PTZ00440 1626 KELKKHKKnyEIGLLekVIEI-NKNIKLYMDSTKESLNS---LVNNFSSLF-----NNFYLNKyNINENLEKYKKKLNEI 1696
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1783 WDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRLlgEINSVIDYVDDIKTKLPPM 1862
Cdd:PTZ00440 1697 YNEFMESYNIIQEKMKEVSNDDVDYNEAKTLREEAQKEEVNLNNKEEEAKKYLNDIKKQ--ESFRFILYMKEKLDELSKM 1774
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1863 SEELSDKIDD-------LAQEIK----DRRLAEKVFQAE---------SHAAQLNDSSAVLDGILDEAKNISFNATAAFr 1922
Cdd:PTZ00440 1775 CKQQYNIVDEgynyikkKIEYIKtlndENNLSDSLNQAEdknkevanlTHYTNKNEAKNLLGHVVKSANFIGIKIMTGL- 1853
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1923 AYSNIKDyiDEAEKVAREAKELAQGATKLATSPQGLLK------EDAKGSLQKSFRIL-------------NEAKKLAND 1983
Cdd:PTZ00440 1854 QPTELTP--DASLETAPELTFESENNSDLELDHLSSNKneldvyKNIQDAYKSSLQILkysddidkkqrdcNKLVEDGNE 1931
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1984 VKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAItndtaaklqavkeKAREAND-TAKAVLAqvKDLHQNLDGLK 2062
Cdd:PTZ00440 1932 IYLKSTAINELKNMINSVKNKESAISNKIDNVSNKLSEL-------------NKITCNDeSYDEILE--KEEYEELKDLR 1996
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2063 QNYNKladsvaKTNAVVKDPSKNKIIADAGTSVRNLeqeaDRLIDKLKPIKElEDNLKKNISEIKELINQARKQANSIKV 2142
Cdd:PTZ00440 1997 NSFNQ------EKAETLNNLKLNKIKEDFNSYKNLL----DELEKSVKTLKA-SENIKKIVENKKTSIDAINTNIEDIEK 2065

                  ....
gi 117647249 2143 SVSS 2146
Cdd:PTZ00440 2066 EIES 2069
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
1931-2141 6.32e-06

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 51.23  E-value: 6.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1931 IDEAEKVAREAKELAQGATKLATSPQGLLKEdakgsLQKSFRILNEA-KKLANDVKGNHNDLNDLKTRLETADLRNSGLL 2009
Cdd:pfam04108    2 LSSAQDLCRWANELLTDARSLLEELVVLLAK-----IAFLRRGLSVQlANLEKVREGLEKVLNELKKDFKQLLKDLDAAL 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2010 GALNDTMDKLSAITNDTAAKLQAVK----------EKAREANDTAKAVLAQVKDLHQNLDG-LKQNYNKLADSVAKTNAV 2078
Cdd:pfam04108   77 ERLEETLDKLRNTPVEPALPPGEEKqktlldfideDSVEILRDALKELIDELQAAQESLDSdLKRFDDDLRDLQKELESL 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2079 VKDPSKNKIIADAGTSVRNLEQEADRLIDKL--------------------------KPIKELED---NLKKNISEIKEL 2129
Cdd:pfam04108  157 SSPSESISLIPTLLKELESLEEEMASLLESLtnhydqcvtavklteggraemlevleNDARELDDvvpELQDRLDEMENN 236
                          250
                   ....*....|..
gi 117647249  2130 INQARKQANSIK 2141
Cdd:pfam04108  237 YERLQKLLEQKN 248
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1652-2137 7.07e-06

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 51.97  E-value: 7.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1652 KVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETL----GNQDKtaERNLEELQKEIDRMLKELRSK 1727
Cdd:TIGR00606  581 SKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLfdvcGSQDE--ESDLERLKEEIEKSSKQRAML 658
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1728 DLQTQ--KEVAEDELVAAEGLLKRVNKLFgEPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREaTDKTRDANRLSAANQK 1805
Cdd:TIGR00606  659 AGATAvySQFITQLTDENQSCCPVCQRVF-QTEAELQEFISDLQSKLRLAPDKLKSTESELKK-KEKRRDEMLGLAPGRQ 736
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1806 NMTILETK--KEAIEGSKRQIENTLKEGNDIlDEANRLLGEINSVIDYVDDIKTKLPPMsEELSDKIDDLaqeikDRRLA 1883
Cdd:TIGR00606  737 SIIDLKEKeiPELRNKLQKVNRDIQRLKNDI-EEQETLLGTIMPEEESAKVCLTDVTIM-ERFQMELKDV-----ERKIA 809
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1884 EKVfqAESHAAQLNDSSAVLDGILDEAKNisfnataafrAYSNIKDYIDEAEKVAREAKELAQ---------GATKLATS 1954
Cdd:TIGR00606  810 QQA--AKLQGSDLDRTVQQVNQEKQEKQH----------ELDTVVSKIELNRKLIQDQQEQIQhlksktnelKSEKLQIG 877
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1955 PQGllkEDAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKlsaitndTAAKLQAVK 2034
Cdd:TIGR00606  878 TNL---QRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKK-------AQDKVNDIK 947
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2035 EKAREANDTAKAVLAQVKDLHQnlDGLKQNYNKLADSVAKTNAVVKdpSKNKIIADAGTSVRNLEQEADR---LIDKL-- 2109
Cdd:TIGR00606  948 EKVKNIHGYMKDIENKIQDGKD--DYLKQKETELNTVNAQLEECEK--HQEKINEDMRLMRQDIDTQKIQerwLQDNLtl 1023
                          490       500       510
                   ....*....|....*....|....*....|..
gi 117647249  2110 ----KPIKELEDNLKKNISEIKELINQARKQA 2137
Cdd:TIGR00606 1024 rkreNELKEVEEELKQHLKEMGQMQVLQMKQE 1055
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
803-851 8.45e-06

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 44.99  E-value: 8.45e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 117647249    803 CACPlniPSNNFSPTCHLDrslGLICdECPIGYTGPRCERCAEGYFGQP 851
Cdd:smart00180    1 CDCD---PGGSASGTCDPD---TGQC-ECKPNVTGRRCDRCAPGYYGDG 42
Laminin_I pfam06008
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from ...
1793-2053 8.60e-06

Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.


Pssm-ID: 310534 [Multi-domain]  Cd Length: 258  Bit Score: 50.10  E-value: 8.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1793 TRDANRLSAA----NQKNMTILETKKE------AIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDyvddiktklppM 1862
Cdd:pfam06008    1 LLSLNSLTGAlpapYKINYNLENLTKQlqeylsPENAHKIQIEILEKELSSLAQETEELQKKATQTLA-----------K 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1863 SEELSDKIDDLAQEIKDrrLAEKVFQAESHAAQLNDSSAVLDGILDEAKNisfnataafraySNIKDYIDEAEKV----- 1937
Cdd:pfam06008   70 AQQVNAESERTLGHAKE--LAEAIKNLIDNIKEINEKVATLGENDFALPS------------SDLSRMLAEAQRMlgeir 135
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1938 AREAKELAQGATKLATSPQGLLKEdakgsLQKSF-RILNEAKKLANDVK---GNHND-LNDLKTRLETA----------D 2002
Cdd:pfam06008  136 SRDFGTQLQNAEAELKAAQDLLSR-----IQTWFqSPQEENKALANALRdslAEYEAkLSDLRELLREAaaktrdanrlN 210
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 117647249  2003 LRNSGLLGALNDTMDKLSAITNDTAAKLQavkeKAREANDTAKAVLAQVKD 2053
Cdd:pfam06008  211 LANQANLREFQRKKEEVSEQKNQLEETLK----TARDSLDAANLLLQEIDD 257
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1635-2149 9.01e-06

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 51.76  E-value: 9.01e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1635 NVNTLVMET--NELLTRATKVTADGEQTGQDAERT-NSRAESLEEFIKGLVQDAEAINEKAVQLNEtlgNQDKTAERNLE 1711
Cdd:PTZ00440 1286 NMYEFLISIdsEKILKEILNSTKKAEEFSNDAKKElEKTDNLIKQVEAKIEQAKEHKNKIYGSLED---KQIDDEIKKIE 1362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1712 ELQKEIDRMLKELRS--KDLQTQKEVAEDELVAAE------GLLKRVNKLfgEPRAQNE-------DMEKDLQQKLAE-- 1774
Cdd:PTZ00440 1363 QIKEEISNKRKEINKylSNIKSNKEKCDLHVRNASrgkdkiDFLNKHEAI--EPSNSKEvniikitDNINKCKQYSNEam 1440
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1775 -YKNKLDDAWDLLRE-ATDKTRDANRLSAANQKnmTILETKK-------EAIEGSKRQIE----------NTLKEGNDIL 1835
Cdd:PTZ00440 1441 eTENKADENNDSIIKyEKEITNILNNSSILGKK--TKLEKKKkeatnimDDINGEHSIIKtkltksseklNQLNEQPNIK 1518
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1836 DEANRL---------------LGEINSVIDYVDDIKTKLppmsEELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLNDSS 1900
Cdd:PTZ00440 1519 REGDVLnndkstiayetiqynLGRVKHNLLNILNIKDEI----ETILNKAQDLMRDISKISKIVENKNLENLNDKEADYV 1594
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1901 AVLDGILDEAKNISFNATAAfraySNIKDYIDEAEKVAREAKELAQgatklatspQGLLKedakgslqksfRILNEAKKL 1980
Cdd:PTZ00440 1595 KYLDNILKEKQLMEAEYKKL----NEIYSDVDNIEKELKKHKKNYE---------IGLLE-----------KVIEINKNI 1650
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1981 ANDVKGNHNDLNDL----KTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREANDtaKAV-LAQVKDLH 2055
Cdd:PTZ00440 1651 KLYMDSTKESLNSLvnnfSSLFNNFYLNKYNINENLEKYKKKLNEIYNEFMESYNIIQEKMKEVSN--DDVdYNEAKTLR 1728
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2056 QNLDGLKQNYNKLADSVAKTNAVVKdpsknkiiadagtsvrnlEQEADRLIDKlkpIKELEDNLKKNISEIKELINQARK 2135
Cdd:PTZ00440 1729 EEAQKEEVNLNNKEEEAKKYLNDIK------------------KQESFRFILY---MKEKLDELSKMCKQQYNIVDEGYN 1787
                         570
                  ....*....|....
gi 117647249 2136 QANSIKVSVSSGGD 2149
Cdd:PTZ00440 1788 YIKKKIEYIKTLND 1801
Tektin pfam03148
Tektin family; Tektins are cytoskeletal proteins. They have been demonstrated in such cellular ...
1662-2005 9.07e-06

Tektin family; Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers of tektin heterodimers with axial periodicity matching tubulin. Tektin polypeptides consist of several alpha-helical regions that are predicted to form coiled coils. Indeed, tektins share considerable structural similarities with intermediate filament proteins. Possible functional roles for tektins are: stabilization of tubulin protofilaments; attachment of A and B-tubules in ciliary/flagellar microtubule doublets and C-tubules in centrioles; binding of axonemal components.


Pssm-ID: 460827 [Multi-domain]  Cd Length: 383  Bit Score: 51.01  E-value: 9.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1662 QDAERTNSRAESLEEFIKGLVQDAEAINEKAvQL--NETLGNQDKTAERNLEELQKEIDRMLKElrSKDLQTQKEVAEDE 1739
Cdd:pfam03148   10 REAEAQRNDAERLRQESRRLRNETDAKTKWD-QYdsNRRLGERIQDITFWKSELEKELEELDEE--IELLLEEKRRLEKA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1740 L--------VAAEGLLKRvnklfgEPRaQNEDM-----EKDLQQKL-------AEYKNKLDDAWDLLREatdktrdaNRl 1799
Cdd:pfam03148   87 LealeeplhIAQECLTLR------EKR-QGIDLvhdevEKELLKEVeliegiqELLQRTLEQAWEQLRL--------LR- 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1800 saANQKNMTI-LETKKEAIEgskrqientlkegndiLDEANRLLGEINSVIDYVDDIkTKLPPMSEELSDkIDDLAQEIK 1878
Cdd:pfam03148  151 --AARHKLEKdLSDKKEALE----------------IDEKCLSLNNTSPNISYKPGP-TRIPPNSSTPEE-WEKFTQDNI 210
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1879 DRrlAEKVFQAeshAAQLNdssAVLDGILDEAKNI---SFNAT-AAFR--------AYS-------NIKDYIDEAEK--- 1936
Cdd:pfam03148  211 ER--AEKERAA---SAQLR---ELIDSILEQTANDlraQADAVnFALRkrieetedAKNklewqlkKTLQEIAELEKnie 282
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1937 ---VAREAKE----LAQgaTKLAT------------SPQ-GLLKEDAKgsLQKSFRILNEakKLAnDVKGNHNDLNDLKT 1996
Cdd:pfam03148  283 aleKAIRDKEaplkLAQ--TRLENrtyrpnvelcrdEAQyGLVDEVKE--LEETIEALKQ--KLA-EAEASLQALERTRL 355

                   ....*....
gi 117647249  1997 RLETaDLRN 2005
Cdd:pfam03148  356 RLEE-DIAV 363
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
465-511 1.03e-05

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 44.99  E-value: 1.03e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 117647249    465 CNCSGLGSTNE--DPCVGPCSCKENVEGEDCSRCKSGFFNlqeDNQKGC 511
Cdd:smart00180    1 CDCDPGGSASGtcDPDTGQCECKPNVTGRRCDRCAPGYYG---DGPPGC 46
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1465-1520 1.49e-05

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 44.65  E-value: 1.49e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 117647249  1465 CACPLISPSnnfSPSCVLEGledYRCTaCPRGYEGQYCERCAPGYTGSPSSPGGSC 1520
Cdd:pfam00053    1 CDCNPHGSL---SDTCDPET---GQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1605-1911 1.77e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 50.54  E-value: 1.77e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1605 LYGLENTTQELKHLLspQRAPERLIQLAEGnvntlVMETNELLTRATKVTADGEQTGQDAERT-NSRAESLEEFIKGLVQ 1683
Cdd:COG4717   127 LLPLYQELEALEAEL--AELPERLEELEER-----LEELRELEEELEELEAELAELQEELEELlEQLSLATEEELQDLAE 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1684 DAEAINEKAVQLNETLgnqdKTAERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAAEGLL---------------- 1747
Cdd:COG4717   200 ELEELQQRLAELEEEL----EEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALLallglggsllslilti 275
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1748 ------------------KRVNKLFGEP--RAQNEDMEKDLQQK-------------------LAEYKNKLDDAWDLLRE 1788
Cdd:COG4717   276 agvlflvlgllallflllAREKASLGKEaeELQALPALEELEEEeleellaalglppdlspeeLLELLDRIEELQELLRE 355
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1789 ATDKTRDANR----------LSAANQKNMTILETK----------KEAIEGSKRQIENTLKEGNDILDEAN--RLLGEIN 1846
Cdd:COG4717   356 AEELEEELQLeeleqeiaalLAEAGVEDEEELRAAleqaeeyqelKEELEELEEQLEELLGELEELLEALDeeELEEELE 435
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 117647249 1847 SVIDYVDDIKTKLppmsEELSDKIDDLAQEIK----DRRLAEKVFQAESHAAQLND------SSAVLDGILDEAK 1911
Cdd:COG4717   436 ELEEELEELEEEL----EELREELAELEAELEqleeDGELAELLQELEELKAELRElaeewaALKLALELLEEAR 506
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
1727-1845 2.02e-05

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 48.49  E-value: 2.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1727 KDLQTQKEVAEDELVAAEGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEyknkLDDAWDLLREATDKTRDANRLSAANQKN 1806
Cdd:pfam00261    4 QQIKEELDEAEERLKEAMKKLEEAEKRAEKAEAEVAALNRRIQLLEEE----LERTEERLAEALEKLEEAEKAADESERG 79
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 117647249  1807 MTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEI 1845
Cdd:pfam00261   80 RKVLENRALKDEEKMEILEAQLKEAKEIAEEADRKYEEV 118
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1665-1915 2.02e-05

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 50.61  E-value: 2.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1665 ERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNQDKT---AERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELV 1741
Cdd:pfam12128  600 EELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREetfARTALKNARLDLRRLFDEKQSEKDKKNKALAERKDS 679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1742 AAEgllkRVNKLfgepRAQNEDMEKDLQQKLAEYKNK------------------LDDAWDLLREATDKTRDAN--RLSA 1801
Cdd:pfam12128  680 ANE----RLNSL----EAQLKQLDKKHQAWLEEQKEQkreartekqaywqvvegaLDAQLALLKAAIAARRSGAkaELKA 751
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1802 ANQKNMTILETK---KEAIEGSKRQIENTLKEgndiLDEANRLLGEINSVIDYV--------DDIKTKLPPMSEELSDKI 1870
Cdd:pfam12128  752 LETWYKRDLASLgvdPDVIAKLKREIRTLERK----IERIAVRRQEVLRYFDWYqetwlqrrPRLATQLSNIERAISELQ 827
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 117647249  1871 DDLAQEIKDRRLAEKVFQAESHA--AQLNDSSAVLDGILDEAKNISF 1915
Cdd:pfam12128  828 QQLARLIADTKLRRAKLEMERKAseKQQVRLSENLRGLRCEMSKLAT 874
MA smart00283
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo ...
1822-2062 2.17e-05

Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo reversible methylation in response to attractants or repellants during bacterial chemotaxis.


Pssm-ID: 214599 [Multi-domain]  Cd Length: 262  Bit Score: 48.82  E-value: 2.17e-05
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1822 RQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLPPMSE------ELSDKIDDLAQEIKDR--RLAEKVFQAESHA 1893
Cdd:smart00283    4 EAVEEIAAGAEEQAEELEELAERMEELSASIEEVAANADEIAAtaqsaaEAAEEGREAVEDAITAmdQIREVVEEAVSAV 83
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1894 AQLNDSSAVLDGILDEAKNISF-------NAT--AAfRAysnikdyiDEAEK----VAREAKELAQGATKLATSPQGLLK 1960
Cdd:smart00283   84 EELEESSDEIGEIVSVIDDIADqtnllalNAAieAA-RA--------GEAGRgfavVADEVRKLAERSAESAKEIESLIK 154
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   1961 E------DAKGSLQKSFRILNEAKKLANDVKGNHNDLNDL---------KTRLETADLRNSglLGALNDTMDKLSAITND 2025
Cdd:smart00283  155 EiqeetnEAVAAMEESSSEVEEGVELVEETGDALEEIVDSveeiadlvqEIAAATDEQAAG--SEEVNAAIDEIAQVTQE 232
                           250       260       270
                    ....*....|....*....|....*....|....*..
gi 117647249   2026 TAAKLQavkekarEANDTAKAVLAQVKDLHQNLDGLK 2062
Cdd:smart00283  233 TAAMSE-------EISAAAEELSGLAEELDELVERFK 262
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
1740-2149 2.19e-05

Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];


Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 50.02  E-value: 2.19e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1740 LVAAEGLLKRVNKLFGEPRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEG 1819
Cdd:COG0840     4 LLLLLALLLALLLLALSLLALLAAALLILLALLLAALTALALLLLLSLLALLLLLLLLALALLLVLLALLLLLALVVLLA 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1820 SKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLPPMSEELSD---KIDDLAQEIKDRRLAEKVFQAESHAAQL 1896
Cdd:COG0840    84 LLLALLLLLLALLALALAALALLAALAALLALLELLLAALLAALAIALLalaALLALAALALALLALALLAAAAAAAAAL 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1897 NDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDE--AEKVAREAKELAQGATKLA----TSPQGLLKEDAKGSLQKS 1970
Cdd:COG0840   164 AALLEAAALALAAAALALALLAAALLALVALAIILALllSRSITRPLRELLEVLERIAegdlTVRIDVDSKDEIGQLADA 243
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1971 F--------RILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREAND 2042
Cdd:COG0840   244 FnrmienlrELVGQVRESAEQVASASEELAASAEELAAGAEEQAASLEETAAAMEELSATVQEVAENAQQAAELAEEASE 323
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2043 TAK-------AVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDpsknkI--IAD---------------AGTS---- 2094
Cdd:COG0840   324 LAEeggevveEAVEGIEEIRESVEETAETIEELGESSQEIGEIVDV-----IddIAEqtnllalnaaieaarAGEAgrgf 398
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 117647249 2095 ------VRNLeqeADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIKVSVSSGGD 2149
Cdd:COG0840   399 avvadeVRKL---AERSAEATKEIEELIEEIQSETEEAVEAMEEGSEEVEEGVELVEEAGE 456
PTZ00121 PTZ00121
MAEBL; Provisional
1682-2048 2.33e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 50.52  E-value: 2.33e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1682 VQDAEAIN--EKAVQLNETLGNQDKTAE--RNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAAEGLLKR---VNKLF 1754
Cdd:PTZ00121 1557 LKKAEEKKkaEEAKKAEEDKNMALRKAEeaKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAeeeKKKVE 1636
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1755 GEPRAQNEDMEKDLQQKLAEYKNKLDDAwDLLREATDKTRDANRLSAANQKNMTILE-TKKEAIEgsKRQIENTLKEGND 1833
Cdd:PTZ00121 1637 QLKKKEAEEKKKAEELKKAEEENKIKAA-EEAKKAEEDKKKAEEAKKAEEDEKKAAEaLKKEAEE--AKKAEELKKKEAE 1713
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1834 ILDEANRLLGEINSVIDYVDDIKTKlppmSEELSDKIDDLAQEIKDRRLAEKVFQAESHAAQ--LNDSSAVLDGIL---D 1908
Cdd:PTZ00121 1714 EKKKAEELKKAEEENKIKAEEAKKE----AEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEeiRKEKEAVIEEELdeeD 1789
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1909 EAKNISFNATA--AFRAYSNIKDYIDEAEKVAREAKELAQGATKLATSPQGLLKEDAKGSLQKSFRILNEAKKLANDvKG 1986
Cdd:PTZ00121 1790 EKRRMEVDKKIkdIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNK-EA 1868
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 117647249 1987 NHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITN-------DTAAKLQAVKEKAREANDTAKAVL 2048
Cdd:PTZ00121 1869 DFNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNmagknndIIDDKLDKDEYIKRDAEETREEII 1937
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1697-2053 3.37e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 49.94  E-value: 3.37e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1697 ETLGNQDKTAERNLEELQKEIDRMLKELRskDLQTQKEVAEDELVAAEGLLKRVNKLFgepRAQNEDMEKdLQQKLAEYK 1776
Cdd:COG1196   235 RELEAELEELEAELEELEAELEELEAELA--ELEAELEELRLELEELELELEEAQAEE---YELLAELAR-LEQDIARLE 308
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1777 NKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDyvddik 1856
Cdd:COG1196   309 ERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELE------ 382
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1857 tklppmsEELSDKIDDLAQEIKDRRLAEKvfQAESHAAQLNDSSAVLDGILDEAKNISFNATAAFRAYSNIKDYIDEAEK 1936
Cdd:COG1196   383 -------ELAEELLEALRAAAELAAQLEE--LEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAE 453
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1937 VAREAKELAQGATKLATSPQGLLKEDAKGSLQKSfRILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTm 2016
Cdd:COG1196   454 LEEEEEALLELLAELLEEAALLEAALAELLEELA-EAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIG- 531
                         330       340       350
                  ....*....|....*....|....*....|....*..
gi 117647249 2017 DKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKD 2053
Cdd:COG1196   532 VEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKA 568
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1684-2176 3.57e-05

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 49.83  E-value: 3.57e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1684 DAEAINEKAVQLNETLGNQDKTaERNLEELQKEIDRMLKELRSKDLQTQKEVAEDelvaaegllkrVNKLFGEPRAQNED 1763
Cdd:PTZ00440 2042 IKKIVENKKTSIDAINTNIEDI-EKEIESINPSLDELLKKGHKIEISRYTSIIDN-----------VQTKISNDSKNIND 2109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1764 MEKDLQQKLAEYKNKLDDAW---DLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDeanr 1840
Cdd:PTZ00440 2110 IEKKAQIYLAYIKNNYNSIKkdiSTLNEYFDEKQVSNYILTNIDKANKLSSELSEAVTNSEEIIENIKKEIIEINE---- 2185
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1841 llgeiNSVIDYVDDIKTKLPPMSEELSDK---IDDLAQEIKDRRLAEkvfqAESHAAQLNDSSAVLDGILD-EAKNISFN 1916
Cdd:PTZ00440 2186 -----NTEMNTLENTADKLKELYENLKKKkniINNIYKKINFIKLQE----IENSSEKYNDISKLFNNVVEtQKKKLLDN 2256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1917 ATAafraYSNIKDYIDEAEKVareakelaqgatklatspqgLLKEDAKGSLQkSFRILNEakkLANDVKGNHNDLNDLKT 1996
Cdd:PTZ00440 2257 KNK----INNIKDKINDKEKE--------------------LINVDSSFTLE-SIKTFNE---IYDDIKSNIGDLYKLED 2308
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1997 -------RLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVKEKAREANdtakavlaQVKDLHQNLDGLKQNYNKLA 2069
Cdd:PTZ00440 2309 tnndelkKVKLYIENITHLLNRINTLINDLDNYQDENYGKDKNIELNNENNS--------YIIKTKEKINNLKEEFSKLL 2380
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2070 DSVAKTNAVVKDPSKNKIIADAGTSVRNLEQEadrlIDKLKPIKELEDNLKKNISEIKELINQARKQANsIKVSVSSggd 2149
Cdd:PTZ00440 2381 KNIKRNNTLCNNNNIKDFISNIGKSVETIKQR----FSSNLPEKEKLHQIEENLNEIKNIMNETKRISN-VDAFTNK--- 2452
                         490       500
                  ....*....|....*....|....*..
gi 117647249 2150 CVRTYRPEIKKGSYNNIVVHVKTAVAD 2176
Cdd:PTZ00440 2453 ILQDIDNEKNKENNNMNAEKIDDLIEN 2479
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1603-2150 5.24e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 49.27  E-value: 5.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1603 KILYGLENTTQELKhllspqrapERLIQLAE--GNVNTLVMETNELLTRATKVTADGEQTGQDAER----TNSRAESLEE 1676
Cdd:TIGR00606  238 EIVKSYENELDPLK---------NRLKEIEHnlSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKvfqgTDEQLNDLYH 308
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1677 FIKGLVQDAEA----INEKAVQLNET--LGNQDKTA-ERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAAE--GLL 1747
Cdd:TIGR00606  309 NHQRTVREKERelvdCQRELEKLNKErrLLNQEKTElLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELDGFErgPFS 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1748 KRVNKLFGE-PRAQNEDMEKDLQQKLAEYKNKLDdawdLLREATDKTRDanRLSAANQknmtILETKKEAIEGSKRQIEN 1826
Cdd:TIGR00606  389 ERQIKNFHTlVIERQEDEAKTAAQLCADLQSKER----LKQEQADEIRD--EKKGLGR----TIELKKEILEKKQEELKF 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1827 TLKEGNDILDEANRLLGEINSVIDYVDD---------IKTKLppmSEELSDKIDDLAQEIKDRRLAEKVFQAESHAAQLN 1897
Cdd:TIGR00606  459 VIKELQQLEGSSDRILELDQELRKAERElskaeknslTETLK---KEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRT 535
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1898 DSSAVLDGILDEAKNI-------SFNATAAFRAYSNIKDYIDEAEKVAREAKELAQGATKLATSPQGLlkEDAKGSLQKS 1970
Cdd:TIGR00606  536 QMEMLTKDKMDKDEQIrkiksrhSDELTSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASL--EQNKNHINNE 613
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1971 FRILNEAK-KLAN---DVKGNHN---DLNDLKTRLETAdlrnsgllgalNDTMDKLSAITNDTAAKLQAVKEKAREANDT 2043
Cdd:TIGR00606  614 LESKEEQLsSYEDklfDVCGSQDeesDLERLKEEIEKS-----------SKQRAMLAGATAVYSQFITQLTDENQSCCPV 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2044 AKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNKIIADAGTSVRnlEQEADRLIDKLKPIKELEDNLKKNI 2123
Cdd:TIGR00606  683 CQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGR--QSIIDLKEKEIPELRNKLQKVNRDI 760
                          570       580
                   ....*....|....*....|....*..
gi 117647249  2124 SEIKELINQARKQANSIKVSVSSGGDC 2150
Cdd:TIGR00606  761 QRLKNDIEEQETLLGTIMPEEESAKVC 787
46 PHA02562
endonuclease subunit; Provisional
1678-1880 6.66e-05

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 48.47  E-value: 6.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1678 IKGLVQDAEAINEKAVQLNETLGNQDKTAERNLEELQKEIDRMLKElrSKDLQTQKEVAEDELVAAE-------GLLKRV 1750
Cdd:PHA02562  183 IQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEE--AKTIKAEIEELTDELLNLVmdiedpsAALNKL 260
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1751 NKLFGEPRAQNEDMEKDL-------------QQ------KLAEYKNKLDDAWDLLREATDKTRDAN-RLSAANQKNMTIL 1810
Cdd:PHA02562  261 NTAAAKIKSKIEQFQKVIkmyekggvcptctQQisegpdRITKIKDKLKELQHSLEKLDTAIDELEeIMDEFNEQSKKLL 340
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1811 ETKKEaIEGSKRQIENTLKEGNDILDEANRLLGEinsVIDYVDDIKTklppMSEELSDKIDDLAQEIKDR 1880
Cdd:PHA02562  341 ELKNK-ISTNKQSLITLVDKAKKVKAAIEELQAE---FVDNAEELAK----LQDELDKIVKTKSELVKEK 402
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1764-1961 7.07e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.84  E-value: 7.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1764 MEKDLQQ--KLAEYKNKLDDAWDLLREATDKTRDANRLSAANQKNMTILETKKEAIEGSKRQIENTLKEGNDILDEANRL 1841
Cdd:COG1579     2 MPEDLRAllDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQ 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1842 LGEINSVIDYvddiktklppmsEELSDKIDDLAQEIKDR--RLAEKVFQAESHAAQLNDSSAVLDGILDEAKnisfNATA 1919
Cdd:COG1579    82 LGNVRNNKEY------------EALQKEIESLKRRISDLedEILELMERIEELEEELAELEAELAELEAELE----EKKA 145
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 117647249 1920 AFRAysnikdyidEAEKVAREAKELAQGATKLATS-PQGLLKE 1961
Cdd:COG1579   146 ELDE---------ELAELEAELEELEAEREELAAKiPPELLAL 179
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1682-2140 8.53e-05

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 48.67  E-value: 8.53e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1682 VQDAEAINEKAvqLNETLGNQDKTAERNLEELQKEIDRMLKELRSKDL---QTQKEVAEDELVAAEGLLKR---VNKLFG 1755
Cdd:PTZ00440 2166 VTNSEEIIENI--KKEIIEINENTEMNTLENTADKLKELYENLKKKKNiinNIYKKINFIKLQEIENSSEKyndISKLFN 2243
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1756 EpraqnedMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANrlsaanqKNMTiLETKKEAIE---------GSKRQIEN 1826
Cdd:PTZ00440 2244 N-------VVETQKKKLLDNKNKINNIKDKINDKEKELINVD-------SSFT-LESIKTFNEiyddiksniGDLYKLED 2308
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1827 T----LKEGNDILDEANRLLGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKdrrLAEKVfqaeshaaqlNDSSAV 1902
Cdd:PTZ00440 2309 TnndeLKKVKLYIENITHLLNRINTLINDLDNYQDENYGKDKNIELNNENNSYIIK---TKEKI----------NNLKEE 2375
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1903 LDGILdeaKNISFNATaaFRAYSNIKDYIDEAEKVAREAKElaqgatKLATSpqglLKEDAKGSLQKSFriLNEAKKLAN 1982
Cdd:PTZ00440 2376 FSKLL---KNIKRNNT--LCNNNNIKDFISNIGKSVETIKQ------RFSSN----LPEKEKLHQIEEN--LNEIKNIMN 2438
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1983 DVKGNHNdlNDLKTRLETADLRNsgllgalndtmDKLSAITNDTAAKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLK 2062
Cdd:PTZ00440 2439 ETKRISN--VDAFTNKILQDIDN-----------EKNKENNNMNAEKIDDLIENVTSHNEKIKSELLIINDALRRVKEKK 2505
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2063 QNYNKLADSVAKTNAVVKDPSKNKIiadagTSVRNLEQEADRLIDKL--------KPIKELEDNLKKNISEIKelINQAR 2134
Cdd:PTZ00440 2506 DEMNKLFNSLTENNNNNNNSAKNIV-----DNSTYIINELESHVSKLnellsyidNEIKELENEKLKLLEKAK--IEESR 2578

                  ....*.
gi 117647249 2135 KQANSI 2140
Cdd:PTZ00440 2579 KERERI 2584
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1635-1838 9.67e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 48.09  E-value: 9.67e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1635 NVNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNetLGNQDKTAERNLEELQ 1714
Cdd:COG3206   206 GLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPELLQSPVIQQ--LRAQLAELEAELAELS 283
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1715 ----------KEIDRMLKELRSKDLQTQKEVAEDELVAAEGLLKRvnklfgepraqnedmEKDLQQKLAEYKNKLDDAWD 1784
Cdd:COG3206   284 arytpnhpdvIALRAQIAALRAQLQQEAQRILASLEAELEALQAR---------------EASLQAQLAQLEARLAELPE 348
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 117647249 1785 LLREATDKTRDANrlsaANQKNMTILETKKEAIegskrQIENTLKEGN-DILDEA 1838
Cdd:COG3206   349 LEAELRRLEREVE----VARELYESLLQRLEEA-----RLAEALTVGNvRVIDPA 394
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
1686-1946 1.15e-04

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 46.98  E-value: 1.15e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1686 EAINEKAVQLNETlgNQDKTAERNLEELQKEIDRMLKElrSKDLQTQKEVAEDEL-----------VAAEGLLKRVNKLF 1754
Cdd:cd22656    94 AEILELIDDLADA--TDDEELEEAKKTIKALLDDLLKE--AKKYQDKAAKVVDKLtdfenqtekdqTALETLEKALKDLL 169
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1755 GE--PRAQNEDMeKDLQQKLAEYKNKLddawdllreatdktrdANRLsaanqknmtiletkKEAIEGSKRQIENTlkegN 1832
Cdd:cd22656   170 TDegGAIARKEI-KDLQKELEKLNEEY----------------AAKL--------------KAKIDELKALIADD----E 214
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1833 DILDEANRLLGEINSVIDYVDDIKTKLPPMSEELsdkiddlaQEIKDrrlaekVFQAeshaaqLNDSsavLDGILDEAKN 1912
Cdd:cd22656   215 AKLAAALRLIADLTAADTDLDNLLALIGPAIPAL--------EKLQG------AWQA------IATD---LDSLKDLLED 271
                         250       260       270
                  ....*....|....*....|....*....|....
gi 117647249 1913 ISFNATAAFRAYSNIKDYIDEAEKVAREAKELAQ 1946
Cdd:cd22656   272 DISKIPAAILAKLELEKAIEKWNELAEKADKFRQ 305
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1609-1815 1.20e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 48.12  E-value: 1.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1609 ENTTQELKHLLSPQRAPERLIQLAEGNVNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAI 1688
Cdd:TIGR00606  832 QEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQD 911
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1689 N------EKAVQLNETLGN----QDKTAERNLEELQKEIDRMLKELRSKDLQTQkEVAEDELVAAEGLLKRVNKLFGEPR 1758
Cdd:TIGR00606  912 SpletflEKDQQEKEELISsketSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQ-DGKDDYLKQKETELNTVNAQLEECE 990
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 117647249  1759 AQNEDMEKDLQQKLAEY-KNKLDDAW---DL-LREATDKTRDANRLSAANQKNMT---ILETKKE 1815
Cdd:TIGR00606  991 KHQEKINEDMRLMRQDIdTQKIQERWlqdNLtLRKRENELKEVEEELKQHLKEMGqmqVLQMKQE 1055
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1608-1890 1.35e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 47.66  E-value: 1.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1608 LENTTQE--LKHLLSPQRAPERLIQLAEgnvntlvMETNELLTRATKVTADGEQTGQdaerTNSRAESLEEFIKGLVQDA 1685
Cdd:TIGR00618  652 QLTLTQErvREHALSIRVLPKELLASRQ-------LALQKMQSEKEQLTYWKEMLAQ----CQTLLRELETHIEEYDREF 720
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1686 EAINEKAVQLNETLGNQDKTAERNLEELQKEIDRMLKELRSKDlqtqkEVAEDELVAAEGLLKRVNKLFGEPRAQNEDME 1765
Cdd:TIGR00618  721 NEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAH-----FNNNEEVTAALQTGAELSHLAAEIQFFNRLRE 795
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1766 KD---LQQKLAEYKNKLDDaWDLLREATDKT---RDANRLSAANQKNMTILETKKE--AIEGSKRQIENTLKEGNDILDE 1837
Cdd:TIGR00618  796 EDthlLKTLEAEIGQEIPS-DEDILNLQCETlvqEEEQFLSRLEEKSATLGEITHQllKYEECSKQLAQLTQEQAKIIQL 874
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 117647249  1838 ANRLLGeINSVIdyVDDIKTKLPPMSEELS-DKIDDLAQEIKDRRLAEKVFQAE 1890
Cdd:TIGR00618  875 SDKLNG-INQIK--IQFDGDALIKFLHEITlYANVRLANQSEGRFHGRYADSHV 925
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1808-2051 1.51e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 47.13  E-value: 1.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1808 TILETKKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIKTKLppmsEELSDKIDDLAQEIKDRR--LAEK 1885
Cdd:COG3883    16 PQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEI----DKLQAEIAEAEAEIEERReeLGER 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1886 VfqaesHAAQLNDSSAVLDGILDEAKNISfnataafraysnikDYIDEAEKVareakelaqgaTKLATSPQGLLKEdakg 1965
Cdd:COG3883    92 A-----RALYRSGGSVSYLDVLLGSESFS--------------DFLDRLSAL-----------SKIADADADLLEE---- 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1966 sLQKSFRILNEAKKLANDVKgnhNDLNDLKTRLETAdlrnsglLGALNDTMDKLSAITNDTAAKLQAVKEKAREANDTAK 2045
Cdd:COG3883   138 -LKADKAELEAKKAELEAKL---AELEALKAELEAA-------KAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELA 206

                  ....*.
gi 117647249 2046 AVLAQV 2051
Cdd:COG3883   207 AAEAAA 212
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1872-2137 1.58e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 47.62  E-value: 1.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1872 DLAQEIKDRRLAEKVFQAESHAAQLndssAVLDGILDEAKnisfnataafraysnikdyideaEKVAREAKELAQGATKL 1951
Cdd:COG1196   217 ELKEELKELEAELLLLKLRELEAEL----EELEAELEELE-----------------------AELEELEAELAELEAEL 269
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1952 ATSPQGLLKEDAKGS-LQKSFRIL-NEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAAK 2029
Cdd:COG1196   270 EELRLELEELELELEeAQAEEYELlAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEA 349
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2030 LQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDpsknkiIADAGTSVRNLEQEADRLIDKl 2109
Cdd:COG1196   350 EEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQ------LEELEEAEEALLERLERLEEE- 422
                         250       260
                  ....*....|....*....|....*...
gi 117647249 2110 kpIKELEDNLKKNISEIKELINQARKQA 2137
Cdd:COG1196   423 --LEELEEALAELEEEEEEEEEALEEAA 448
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1613-1845 1.97e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 47.07  E-value: 1.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1613 QELKHLLSPQRAPERLIqlaegnvNTLVMETNELLTRATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKA 1692
Cdd:COG4717   319 EELEELLAALGLPPDLS-------PEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEELRAAL 391
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1693 VQLNEtlgNQDKTAErnLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAAEGLLKRVNKLF-------GEPRAQNEDME 1765
Cdd:COG4717   392 EQAEE---YQELKEE--LEELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELeelreelAELEAELEQLE 466
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1766 KD--LQQKLAEYKnklddawDLLREATDKTRDANRLSAAnqknMTILETKKEAIEGSKRQientlkegnDILDEANRLLG 1843
Cdd:COG4717   467 EDgeLAELLQELE-------ELKAELRELAEEWAALKLA----LELLEEAREEYREERLP---------PVLERASEYFS 526

                  ..
gi 117647249 1844 EI 1845
Cdd:COG4717   527 RL 528
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
1827-2130 2.25e-04

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 46.23  E-value: 2.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1827 TLKEGNDILDEANRLLgeiNSVIDYVDDIktklppmsEELSDKIDDLAQEIKD--------RRLAEKVFQ-----AESHA 1893
Cdd:pfam04108    1 SLSSAQDLCRWANELL---TDARSLLEEL--------VVLLAKIAFLRRGLSVqlanlekvREGLEKVLNelkkdFKQLL 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1894 AQLNDSSAVLDGILDEAKNIS--FNATAAFRAYSNIKDYIDEaEKVareakelaqgatklatspqgllkEDAKGSLQKSF 1971
Cdd:pfam04108   70 KDLDAALERLEETLDKLRNTPvePALPPGEEKQKTLLDFIDE-DSV-----------------------EILRDALKELI 125
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1972 RILNEA-KKLANDVKGNHNDLNDLKTRLETADLRN--SGLLGALNDTMDKLSA--------ITNDTAAKLQAVK--EKAR 2038
Cdd:pfam04108  126 DELQAAqESLDSDLKRFDDDLRDLQKELESLSSPSesISLIPTLLKELESLEEemasllesLTNHYDQCVTAVKltEGGR 205
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  2039 E------ANDtAKAVLAQVKDLHQNLDGLKQNYNKLADSVAKTNAVVKDPSKN-KIIADAGT---SVRNLEQEADRLIDK 2108
Cdd:pfam04108  206 AemlevlEND-ARELDDVVPELQDRLDEMENNYERLQKLLEQKNSLIDELLSAlQLIAEIQSrlpEYLAALKEFEERWEE 284
                          330       340
                   ....*....|....*....|..
gi 117647249  2109 LKpikeleDNLKKNISEIKELI 2130
Cdd:pfam04108  285 EK------ETIEDYLSELEDLR 300
Apolipoprotein pfam01442
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ...
1840-2059 2.26e-04

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.


Pssm-ID: 460211 [Multi-domain]  Cd Length: 175  Bit Score: 44.56  E-value: 2.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1840 RLLGEINSVIDYVDDIKTKLPPMSEELSDKI----DDLAQEIkDRRLAEKVFQAESHAAQLNdssAVLDGILDEAKNIsf 1915
Cdd:pfam01442    1 LLEDSLDELSTYAEELQEQLGPVAQELVDRLeketEALRERL-QKDLEEVRAKLEPYLEELQ---AKLGQNVEELRQR-- 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1916 nataafraysnIKDYIDE-AEKVAREAKELAQgatKLATSPQGLLK--EDAKGSLQKSFRILNEakklandvkgnhndln 1992
Cdd:pfam01442   75 -----------LEPYTEElRKRLNADAEELQE---KLAPYGEELRErlEQNVDALRARLAPYAE---------------- 124
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249  1993 DLKTRLETAdlrnsglLGALNDTMdklsaitndtAAKLQAVKEKAREANDTAKAVLA-QVKDLHQNLD 2059
Cdd:pfam01442  125 ELRQKLAER-------LEELKESL----------APYAEEVQAQLSQRLQELREKLEpQAEDLREKLD 175
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1650-1947 2.28e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 47.07  E-value: 2.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1650 ATKVTADGEQTGQDAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNQdKTAERNLEELQKEIDRM-LKELRSKD 1728
Cdd:COG4717   297 KASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEEL-QELLREAEELEEELQLEeLEQEIAAL 375
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1729 LQTQKEVAEDELVAAeglLKRVNKLfgepraqnedmeKDLQQKLAEYKNKLDDAWDLLREATDKTrDANRLsaanqknmt 1808
Cdd:COG4717   376 LAEAGVEDEEELRAA---LEQAEEY------------QELKEELEELEEQLEELLGELEELLEAL-DEEEL--------- 430
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1809 iletkKEAIEGSKRQIENTLKEGNDILDEANRLLGEINSVI--DYVDDIKTKLppmsEELSDKIDDLAQEIKDRRLAEKV 1886
Cdd:COG4717   431 -----EEELEELEEELEELEEELEELREELAELEAELEQLEedGELAELLQEL----EELKAELRELAEEWAALKLALEL 501
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 117647249 1887 FQAESHAAQLNDSSAVLDgildeaknisfNATAAFR-----AYSNIkdYIDEAEKVA--------REAKELAQG 1947
Cdd:COG4717   502 LEEAREEYREERLPPVLE-----------RASEYFSrltdgRYRLI--RIDEDLSLKvdtedgrtRPVEELSRG 562
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1464-1521 2.41e-04

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 41.19  E-value: 2.41e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249 1464 PCACPLISpsnNFSPSCVLEGLedyRCTaCPRGYEGQYCERCAPGYTGSPSSPGGsCQ 1521
Cdd:cd00055     1 PCDCNGHG---SLSGQCDPGTG---QCE-CKPNTTGRRCDRCAPGYYGLPSQGGG-CQ 50
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1603-1940 2.65e-04

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 46.75  E-value: 2.65e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1603 KILYGLENTTQELKHLLspQRAPERLIQLaegnVNTLVMETNELLTRATKVTADG--------EQTGQDAERTNSRAES- 1673
Cdd:PRK04778  198 EILDQLEEELAALEQIM--EEIPELLKEL----QTELPDQLQELKAGYRELVEEGyhldhldiEKEIQDLKEQIDENLAl 271
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1674 LEEF-IKGLVQDAEAINEKAVQLNETL-----------GNQDKT------AERNLEELQKEIDRmLKE---LRSKDLQTQ 1732
Cdd:PRK04778  272 LEELdLDEAEEKNEEIQERIDQLYDILerevkarkyveKNSDTLpdflehAKEQNKELKEEIDR-VKQsytLNESELESV 350
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1733 KEVaEDELVAAEGLLKRVNKLFGEPRA---QNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDAnrlsaanQKNMTI 1809
Cdd:PRK04778  351 RQL-EKQLESLEKQYDEITERIAEQEIaysELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEA-------REKLER 422
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1810 LETKKEAIegsKRQIentlkegndildEANRLLGEINSVIDYVddiktklppmsEELSDKIDDLAQEIKDRRL-AEKVfq 1888
Cdd:PRK04778  423 YRNKLHEI---KRYL------------EKSNLPGLPEDYLEMF-----------FEVSDEIEALAEELEEKPInMEAV-- 474
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 117647249 1889 aeshAAQLNDSSAVLDGILDEAKNISFNATAAFRA--YSN--------IKDYIDEAEKVARE 1940
Cdd:PRK04778  475 ----NRLLEEATEDVETLEEETEELVENATLTEQLiqYANryrsdneeVAEALNEAERLFRE 532
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1375-1403 2.88e-04

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 40.80  E-value: 2.88e-04
                          10        20
                  ....*....|....*....|....*....
gi 117647249 1375 RCDCPPGYSGLSCETCAPGFYRLRSEPGG 1403
Cdd:cd00055    20 QCECKPNTTGRRCDRCAPGYYGLPSQGGG 48
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1465-1513 3.30e-04

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 40.76  E-value: 3.30e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 117647249   1465 CACPlisPSNNFSPSCVLEGLedyRCTaCPRGYEGQYCERCAPGYTGSP 1513
Cdd:smart00180    1 CDCD---PGGSASGTCDPDTG---QCE-CKPNVTGRRCDRCAPGYYGDG 42
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
302-326 3.61e-04

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 40.37  E-value: 3.61e-04
                            10        20
                    ....*....|....*....|....*
gi 117647249    302 RCECEHNTCGESCDRCCPGFHQKPW 326
Cdd:smart00180   19 QCECKPNVTGRRCDRCAPGYYGDGP 43
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1375-1403 3.93e-04

Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.


Pssm-ID: 395007  Cd Length: 49  Bit Score: 40.41  E-value: 3.93e-04
                           10        20
                   ....*....|....*....|....*....
gi 117647249  1375 RCDCPPGYSGLSCETCAPGFYRLRSEPGG 1403
Cdd:pfam00053   19 QCLCKPGVTGRHCDRCKPGYYGLPSDPPQ 47
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
1653-1774 3.95e-04

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 44.63  E-value: 3.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1653 VTADGEQTGQDAERTNSRAESLEEFIKGL---VQDAEAINEKAVQLNETLGNQDKT--------------AERNLEELQK 1715
Cdd:pfam00261  125 VEGDLERAEERAELAESKIVELEEELKVVgnnLKSLEASEEKASEREDKYEEQIRFlteklkeaetraefAERSVQKLEK 204
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 117647249  1716 EIDRMlkelrskdlqtqkevaEDELVAAegllkrvnklfgepRAQNEDMEKDLQQKLAE 1774
Cdd:pfam00261  205 EVDRL----------------EDELEAE--------------KEKYKAISEELDQTLAE 233
46 PHA02562
endonuclease subunit; Provisional
1814-2089 4.19e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 45.78  E-value: 4.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1814 KEAIEGSKRQIentlkegnDILD-EANRLLGEINSVIDYVDDIKTKLPPMSEELSDKIDDLAQEIKD---------RRLA 1883
Cdd:PHA02562  173 KDKIRELNQQI--------QTLDmKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTikaeieeltDELL 244
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1884 EKVFQAESHAAQLNDSSavlDGILDEAKNI-SFNATAAF--------RAYSNIKDYIDEAEKVAREAKELAQGATKLats 1954
Cdd:PHA02562  245 NLVMDIEDPSAALNKLN---TAAAKIKSKIeQFQKVIKMyekggvcpTCTQQISEGPDRITKIKDKLKELQHSLEKL--- 318
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1955 pqgllkEDAKGSLQKsfrILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSgllgALNDTMDKLSAITNDTAAKLQAVK 2034
Cdd:PHA02562  319 ------DTAIDELEE---IMDEFNEQSKKLLELKNKISTNKQSLITLVDKAK----KVKAAIEELQAEFVDNAEELAKLQ 385
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 117647249 2035 EKAREANDTaKAVLAQVKDLHQNLDGLkqnynkLADSVAKTnAVVKD--PSKNKIIA 2089
Cdd:PHA02562  386 DELDKIVKT-KSELVKEKYHRGIVTDL------LKDSGIKA-SIIKKyiPYFNKQIN 434
VSP pfam03302
Giardia variant-specific surface protein;
771-1090 4.71e-04

Giardia variant-specific surface protein;


Pssm-ID: 146106 [Multi-domain]  Cd Length: 397  Bit Score: 45.35  E-value: 4.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   771 NCK--DHTGGPYCNECLPGFYGDPTRGSPEDCQPCAcplnipsNNFSPTCHLDRslgLICDECPIGytgpRCERCAEgyf 848
Cdd:pfam03302   27 NCKacSNDKREVCEECNSNNYLTPTSQCIDDCAKIG-------NYYYTTNANNK---KICKECTVA----NCKTCED--- 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   849 gqpsipGGSCQpcQCNDNLdYSIPGSCDSLSGSCLICKPGTTGRyCELCADG---YFGDAVNAKNCqpcrcningsfSEI 925
Cdd:pfam03302   90 ------QGQCQ--ACNDGF-YKSGDACSPCHESCKTCSGGTASD-CTECLTGkalRYGNDGTKGTC-----------GEG 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249   926 CHTRTGQCECRP---NVQG-RHCDECKPETFGLQLGrgclPCNCNSFGSKSFdCEASgqcwcqpGVAGKKCDRCAHGYFN 1001
Cdd:pfam03302  149 CTTGTGAGACKTcglTIDGtSYCSECATETEYPQNG----VCTSTAARATAT-CKAS-------SVANGMCSSCANGYFR 216
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1002 fQEGGCI---------ACDCSHLGNNCDPKTGQCICPPN---TTGEKCSECLPNTWGHSIVTG-------CKVCN--CST 1060
Cdd:pfam03302  217 -MNGGCYettkfpgksVCEEANSGGTCQKEAPGYKLNNGdlvTCSPGCKTCTSNTVCTTCMDGyvktsdsCTKCDssCET 295
                          330       340       350
                   ....*....|....*....|....*....|
gi 117647249  1061 VGSLASQCNVntgqCSCHPKFSGMKCSECS 1090
Cdd:pfam03302  296 CTGATTTCKT----CATGYYKSGTGCVSCT 321
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1713-2141 5.29e-04

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 45.60  E-value: 5.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1713 LQKEIDRMLKELRSKDLQTQKEVAEDELVAAEGLlkrvnKLFGEPRAQNEDMEKDLQQ----KLAEYKNKLDDAwdllRE 1788
Cdd:PRK04778   23 LRKRNYKRIDELEERKQELENLPVNDELEKVKKL-----NLTGQSEEKFEEWRQKWDEivtnSLPDIEEQLFEA----EE 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1789 ATDKTRdanrLSAAnQKNMTILETKKEAIEGskrQIENTLKEGNDIL--DEANRllGEINSVIDYVDDIKTKLPPMSEEL 1866
Cdd:PRK04778   94 LNDKFR----FRKA-KHEINEIESLLDLIEE---DIEQILEELQELLesEEKNR--EEVEQLKDLYRELRKSLLANRFSF 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1867 SDKIDDLAQEIKDrrlAEKVFQaesHAAQLNDSSAVLD--GILDEAKNisfnATAAFRAYSN-IKDYIDEAEKVAREA-K 1942
Cdd:PRK04778  164 GPALDELEKQLEN---LEEEFS---QFVELTESGDYVEarEILDQLEE----ELAALEQIMEeIPELLKELQTELPDQlQ 233
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1943 ELAQGATKLAtspqgllkedAKGSLQKSFRILNEAKKLANDVKGNHNDLNDLKtrLETAdlrnSGLLGALNDTMDKL-SA 2021
Cdd:PRK04778  234 ELKAGYRELV----------EEGYHLDHLDIEKEIQDLKEQIDENLALLEELD--LDEA----EEKNEEIQERIDQLyDI 297
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2022 ITNDTAAKlQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNY----------NKLADSVAKTNAVVKDPSKNkiIADA 2091
Cdd:PRK04778  298 LEREVKAR-KYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYtlneselesvRQLEKQLESLEKQYDEITER--IAEQ 374
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|
gi 117647249 2092 GTSVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELINQARKQANSIK 2141
Cdd:PRK04778  375 EIAYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYR 424
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1989-2141 6.20e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 45.21  E-value: 6.20e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1989 NDLNDLKTRLETADLRnsglLGALNDTMDKLSAITNDTAAKLQAVKEKAREANdtakavlAQVKDLHQNLDGLKQNYNKL 2068
Cdd:COG3883    23 KELSELQAELEAAQAE----LDALQAELEELNEEYNELQAELEALQAEIDKLQ-------AEIAEAEAEIEERREELGER 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2069 ADSVAKTNAVV---------KDPS---------------KNKIIADAGTSVRNLEQEADRLIDKLKPIKELEDNLKKNIS 2124
Cdd:COG3883    92 ARALYRSGGSVsyldvllgsESFSdfldrlsalskiadaDADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171
                         170
                  ....*....|....*..
gi 117647249 2125 EIKELINQARKQANSIK 2141
Cdd:COG3883   172 ELEAQQAEQEALLAQLS 188
CENP-F_leu_zip pfam10473
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, ...
1666-1833 6.59e-04

Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance.


Pssm-ID: 463102 [Multi-domain]  Cd Length: 140  Bit Score: 42.28  E-value: 6.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1666 RTNSRAESLEEFIKGLVQDAEAINEKavqlNETLGNQDKTAERNLEELQKEIDRMLKELRskDLQTqkevaedELVAAeg 1745
Cdd:pfam10473   14 ESERKADSLKDKVENLERELEMSEEN----QELAILEAENSKAEVETLKAEIEEMAQNLR--DLEL-------DLVTL-- 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1746 llkrvnklfgepRAQNEDMEKDLQQKlaeyKNKLDDAwDLLREATdktrdANRLSAANQKNMTILETKKEAIEgskrQIE 1825
Cdd:pfam10473   79 ------------RSEKENLTKELQKK----QERVSEL-ESLNSSL-----ENLLEEKEQEKVQMKEESKTAVE----MLQ 132

                   ....*...
gi 117647249  1826 NTLKEGND 1833
Cdd:pfam10473  133 TQLKELNE 140
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
1817-2068 6.67e-04

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 44.69  E-value: 6.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1817 IEGSKRQIENTLKEGNDILDEANRLLGEINSVIDYVDDIK--TKLPPMSEE---LSD-----KIDDLAQEIKD--RRLAE 1884
Cdd:pfam04108   51 REGLEKVLNELKKDFKQLLKDLDAALERLEETLDKLRNTPvePALPPGEEKqktLLDfidedSVEILRDALKEliDELQA 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1885 KVFQAESHAAQLNDSSAVLDGILDEAKNISfnataafRAYSNIKDYIDEAEKVAREAKELA--------QGATKLATSPQ 1956
Cdd:pfam04108  131 AQESLDSDLKRFDDDLRDLQKELESLSSPS-------ESISLIPTLLKELESLEEEMASLLesltnhydQCVTAVKLTEG 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1957 G------LLKEDAKgSLQksfRILNEAKKLANDVKGNHNDLNDLKTRLETAdlrnsglLGALNDTMDKLSAITN---DTA 2027
Cdd:pfam04108  204 GraemleVLENDAR-ELD---DVVPELQDRLDEMENNYERLQKLLEQKNSL-------IDELLSALQLIAEIQSrlpEYL 272
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 117647249  2028 AKLQAVKEKAREANDTAKAVLAQVKDLHQNLDGLKQNYNKL 2068
Cdd:pfam04108  273 AALKEFEERWEEEKETIEDYLSELEDLREFYEGFPSAYGSL 313
NtpE COG1390
Archaeal/vacuolar-type H+-ATPase subunit E/Vma4 [Energy production and conversion]; Archaeal ...
1672-1849 1.31e-03

Archaeal/vacuolar-type H+-ATPase subunit E/Vma4 [Energy production and conversion]; Archaeal/vacuolar-type H+-ATPase subunit E/Vma4 is part of the Pathway/BioSystem: A/V-type ATP synthase


Pssm-ID: 441000 [Multi-domain]  Cd Length: 196  Bit Score: 42.62  E-value: 1.31e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1672 ESLEEFIKGLVQDAEAinekavQLNETLGNQDKTAERNLEELQKEIDRMLKELRSKDLQTQKEVAEDELVAAEGLLKR-- 1749
Cdd:COG1390     2 MSLEKIIEEILEEAEA------EAEEILEEAEEEAEKILEEAEEEAEEIKEEILEKAEREAEREKRRIISSAELEARKel 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1750 -------VNKLFGEPRAQNEDMEKDlqqklAEYKNKLDdawDLLREA---------------TDKTRDANRLSAANQKNM 1807
Cdd:COG1390    76 leakeelIEEVFEEALEKLKNLPKD-----PEYKELLK---KLLKEAaeelgsgdlvvyvneKDKELLEELLKELKKKGL 147
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 117647249 1808 TILETKKEAIEG-------SKRQIENTLKEgndILDEA-NRLLGEINSVI 1849
Cdd:COG1390   148 EVSEEDIDILGGvivesedGRIRVDNTFES---LLERLkEELLKEIAEIL 194
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
1745-1949 2.13e-03

Phage shock protein A [Transcription, Signal transduction mechanisms];


Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 42.12  E-value: 2.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1745 GLLKRVNKLFgepRA-------QNEDMEKDLQQKLAEYKNKLDDAwdllREATDKTrdanrlsAANQKNMTI-LETKKEA 1816
Cdd:COG1842     1 GIFKRLSDII---RAninalldKAEDPEKMLDQAIRDMEEDLVEA----RQALAQV-------IANQKRLERqLEELEAE 66
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1817 IEGSKRQIENTLKEGNDilDEANRLLGEINSVIDYVDDIKT---KLPPMSEELSDKIDDLAQEIKDRRLAEKVFQAESHA 1893
Cdd:COG1842    67 AEKWEEKARLALEKGRE--DLAREALERKAELEAQAEALEAqlaQLEEQVEKLKEALRQLESKLEELKAKKDTLKARAKA 144
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1894 A----QLNDSSAvldgildeakniSFNATAAFRAYSNIKDYIDEAEKVAREAKELAQGAT 1949
Cdd:COG1842   145 AkaqeKVNEALS------------GIDSDDATSALERMEEKIEEMEARAEAAAELAAGDS 192
F5_F8_type_C pfam00754
F5/8 type C domain; This domain is also known as the discoidin (DS) domain family.
86-170 2.78e-03

F5/8 type C domain; This domain is also known as the discoidin (DS) domain family.


Pssm-ID: 459925 [Multi-domain]  Cd Length: 127  Bit Score: 40.51  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249    86 SSNPYQRHPITNAIDGK-NTWWQSpsiKNGVEYHYvtITLDLQQVFQIAYVIVKAANSPRPGN---WILERSLDDVEYKP 161
Cdd:pfam00754    5 SSSYSGEGPAAAALDGDpNTAWSA---WSGDDPQW--IQVDLGKPKKITGVVTQGRQDGSNGYvtsYKIEYSLDGENWTT 79

                   ....*....
gi 117647249   162 WQYHAVTDT 170
Cdd:pfam00754   80 VKDEKIPGN 88
TNFRSF16 cd13416
Tumor necrosis factor receptor superfamily member 16 (TNFRSF16), also known as p75 ...
781-915 3.36e-03

Tumor necrosis factor receptor superfamily member 16 (TNFRSF16), also known as p75 neurotrophin receptor (p75NTR) or CD271; TNFRSF16 (also known as nerve growth factor receptor (NGFR) or p75 neurotrophin receptor (p75NTR or p75(NTR)), CD271, Gp80-LNGFR) is a common receptor for both neurotrophins and proneurotrophins, and plays a diverse role in many tissues, including the nervous system. It has been shown to be expressed in various types of stem cells and has been used to prospectively isolate stem cells with different degrees of potency. p75NTR owes its signaling to the recruitment of intracellular binding proteins, leading to the activation of different signaling pathways. It binds nerve growth factor (NGF) and the complex can initiate a signaling cascade which has been associated with both neuronal apoptosis and neuronal survival of discrete populations of neurons, depending on the presence or absence of intracellular signaling molecules downstream of p75NTR (e.g. NF-kB, JNK, or p75NTR intracellular death domain). p75NTR can also bind NGF in concert with the neurotrophic tyrosine kinase receptor type 1 (TrkA) protein where it is thought to modulate the formation of the high-affinity neurotrophin binding complex. On melanoma cell, p75NTR is an immunosuppressive factor, induced by interferon (IFN)-gamma, and mediates down-regulation of melanoma antigens. It can interact with the aggregated form of amyloid beta (Abeta) peptides, and plays an important role in etiopathogenesis of Alzheimer's disease by influencing protein tau hyper-phosphorylation. p75(NTR) is involved in the formation and progression of retina diseases; its expression is induced in retinal pigment epithelium (RPE) cells and its knockdown rescues RPE cell proliferation activity and inhibits RPE apoptosis induced by hypoxia. It can therefore be a potential therapeutic target for RPE hypoxia or oxidative stress diseases.


Pssm-ID: 276921 [Multi-domain]  Cd Length: 159  Bit Score: 40.75  E-value: 3.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  781 CNECLPGFYGDPTRGSPEDCQPCA-CPLNIPsnnfsPTCHLDRSLGLICdecpigytgprceRCAEGYFgqPSIPGGSCQ 859
Cdd:cd13416    35 CEPCLDGVTFSDVVSHTEPCQPCTrCPGLMS-----MRAPCTATHDTVC-------------ECAYGYY--LDEDSGTCE 94
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 117647249  860 PCQcndnldysipgSCDSLSGSCLICKPGTTGRyCELCADGYFGDAVNAKN-CQPCR 915
Cdd:cd13416    95 PCT-----------VCPPGQGVVQSCGPNQDTV-CEACPEGTYSDEDSSTDpCLPCT 139
TNFRSF4 cd13406
Tumor necrosis factor receptor superfamily member 4 (TNFRSF4), also known as CD134 or OXO40; ...
882-969 4.21e-03

Tumor necrosis factor receptor superfamily member 4 (TNFRSF4), also known as CD134 or OXO40; TNFRSF4 (also known as OX40, ACT35, CD134, IMD16, TXGP1L) activates NF-kappaB through its interaction with adaptor proteins TRAF2 and TRAF5. It also promotes the expression of apoptosis inhibitors BCL2 and BCL2lL1/BCL2-XL, and thus suppresses apoptosis. It is primarily expressed on activated CD4+ and CD8+ T cells, where it is transiently expressed and upregulated on the most recently antigen-activated T cells within inflammatory lesions. This makes it an attractive target to modulate immune responses, i.e. TNFRSF4 (OX40) blocking agents to inhibit adverse inflammation or agonists to enhance immune responses. An artificially created biologic fusion protein, OX40-immunoglobulin (OX40-Ig), prevents OX40 from reaching the T-cell receptors, thus reducing the T-cell response. Some single nucleotide polymorphisms (SNPs) of its natural ligand OX40 ligand (OX40L, CD252), which is also found on activated T cells, have been associated with systemic lupus erythematosus.


Pssm-ID: 276911 [Multi-domain]  Cd Length: 142  Bit Score: 40.07  E-value: 4.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  882 CLICKPG---------TTGRYCELCADGYFGDAVNAKNCQPC-RCNINGSFSEI--C-HTRTGQCECRPNVQ-------G 941
Cdd:cd13406    15 CHECPPGegmesrctgTQDTVCSPCEPGFYNEAVNYEPCKPCtQCNQRSGSEEKqkCtKTSDTVCRCRPGTQpldsykpG 94
                          90       100
                  ....*....|....*....|....*....
gi 117647249  942 RHCDECKPETFGLQLGRGCLP-CNCNSFG 969
Cdd:cd13406    95 VDCVPCPPGHFSRGDNQACKPwTNCSLAG 123
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domai;
1375-1396 5.00e-03

Laminin-type epidermal growth factor-like domai;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 37.29  E-value: 5.00e-03
                            10        20
                    ....*....|....*....|..
gi 117647249   1375 RCDCPPGYSGLSCETCAPGFYR 1396
Cdd:smart00180   19 QCECKPNVTGRRCDRCAPGYYG 40
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1663-1879 5.02e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 42.62  E-value: 5.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1663 DAERTNSRAESLEEFIKGLVQDAEAINEKAVQLNETLGNQDKTAERNLEELQKEIDRMLKELRskDLQTQKEVAEDELVA 1742
Cdd:COG1196   631 RLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEEL--ELEEALLAEEEEERE 708
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1743 AEGLLKRvnklfgepRAQNEDMEKDLQQKLAEYKNKLDDAWDLLREATDKTRDANRLSAANqknmtiLETKKEAIEGSKR 1822
Cdd:COG1196   709 LAEAEEE--------RLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD------LEELERELERLER 774
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 117647249 1823 QIENtlkegndildeanrlLGEIN-SVIDYVDDIKTKLppmsEELSDKIDDLAQEIKD 1879
Cdd:COG1196   775 EIEA---------------LGPVNlLAIEEYEELEERY----DFLSEQREDLEEARET 813
Surf_Exclu_PgrA TIGR04320
SEC10/PgrA surface exclusion domain; This model describes a conserved domain found in surface ...
1972-2099 5.09e-03

SEC10/PgrA surface exclusion domain; This model describes a conserved domain found in surface proteins of a number of Firmutes. Many members have LPXTG C-terminal anchoring motifs and a substantial number have the KxYKxGKxW putative sorting signal at the N-terminus. The tetracycline resistance plasmid pCF10 in Enterococcus faecalis promotes conjugal plasmid transfer in response to sex pheromones, but PgrA/Sec10 encoded by that plasmid, a member of this family, specifically inhibits the ability of cells to receive homologous plasmids. The phenomenon is called surface exclusion.


Pssm-ID: 275124 [Multi-domain]  Cd Length: 356  Bit Score: 42.02  E-value: 5.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1972 RILNEAKKLANDVKGNHNDLNDLKTRLETADLRNSGLLGALNDTMDKLSAITNDTAAKLQAVKekareandTAKAVLAQV 2051
Cdd:TIGR04320  237 YIADGNKFDKTPIPNPPNSLAALQAKLATAQADLAAAQTALNTAQAALTSAQTAYAAAQAALA--------TAQKELANA 308
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 117647249  2052 KDlhQNLDGLKQNYNKLADSVAKTNAVVKDPSKNK--IIADAGTSVRNLE 2099
Cdd:TIGR04320  309 QA--QALQTAQNNLATAQAALANAEARLAKAKEALanLNADLAKKQAALD 356
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1694-1782 5.47e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 39.87  E-value: 5.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  1694 QLNETLGNQDKTAERNLEELQKEIDRMLKELRSK--DLQTQKEVAEDELVAAEGLLKRVNklfgepRAQNEDMEKDLQQK 1771
Cdd:pfam03938   19 AAQAQLEKKFKKRQAELEAKQKELQKLYEELQKDgaLLEEEREEKEQELQKKEQELQQLQ------QKAQQELQKKQQEL 92
                           90
                   ....*....|.
gi 117647249  1772 LAEYKNKLDDA 1782
Cdd:pfam03938   93 LQPIQDKINKA 103
TNFRSF4 cd13406
Tumor necrosis factor receptor superfamily member 4 (TNFRSF4), also known as CD134 or OXO40; ...
828-913 7.32e-03

Tumor necrosis factor receptor superfamily member 4 (TNFRSF4), also known as CD134 or OXO40; TNFRSF4 (also known as OX40, ACT35, CD134, IMD16, TXGP1L) activates NF-kappaB through its interaction with adaptor proteins TRAF2 and TRAF5. It also promotes the expression of apoptosis inhibitors BCL2 and BCL2lL1/BCL2-XL, and thus suppresses apoptosis. It is primarily expressed on activated CD4+ and CD8+ T cells, where it is transiently expressed and upregulated on the most recently antigen-activated T cells within inflammatory lesions. This makes it an attractive target to modulate immune responses, i.e. TNFRSF4 (OX40) blocking agents to inhibit adverse inflammation or agonists to enhance immune responses. An artificially created biologic fusion protein, OX40-immunoglobulin (OX40-Ig), prevents OX40 from reaching the T-cell receptors, thus reducing the T-cell response. Some single nucleotide polymorphisms (SNPs) of its natural ligand OX40 ligand (OX40L, CD252), which is also found on activated T cells, have been associated with systemic lupus erythematosus.


Pssm-ID: 276911 [Multi-domain]  Cd Length: 142  Bit Score: 39.30  E-value: 7.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249  828 CDECPIGY---------TGPRCERCAEGYFGQP--SIPggsCQPC-QCNDNLDYSIPGSCDSLSGSCLICKPGTT----- 890
Cdd:cd13406    15 CHECPPGEgmesrctgtQDTVCSPCEPGFYNEAvnYEP---CKPCtQCNQRSGSEEKQKCTKTSDTVCRCRPGTQpldsy 91
                          90       100
                  ....*....|....*....|....*
gi 117647249  891 --GRYCELCADGYFGDAVNaKNCQP 913
Cdd:cd13406    92 kpGVDCVPCPPGHFSRGDN-QACKP 115
ClyA-like cd21116
family of the cytolysin A (ClyA) family alpha pore-forming toxins (alpha-PFT) including ...
1983-2143 8.55e-03

family of the cytolysin A (ClyA) family alpha pore-forming toxins (alpha-PFT) including Bacillus cereus HblB, Aeromonas hydrophila AhlB, Bacillus thuringiensis Cry6Aa and similar proteins; This family belongs to the ClyA family of alpha-PFT bacterial toxins. PFTs form the major group of virulence factors in many pathogenic bacteria and in general are critical components of the molecular offensive and defensive machinery of cells in all kingdoms of life. Bacterial PFTs facilitate the takeover of host resources by puncturing holes in the membrane. PFTs can be classified as alpha-PFTs and beta-PFTs depending on the secondary structures of their membrane component. Alpha-PFTs use a ring of amphipathic helices while beta-PFTs use a beta-barrel to construct the pore. Members of this family include the toxins: Bacillus cereus hemolysin binding component B (HblB or HBL-B) of the diarrheal enterotoxin hemolysin BL, Aeromonas hydrophila hemolytic (Ahl) component B (AhlB) of the tripartite AhlABC toxin, Vibrio cholerae cytotoxin motility associated killing factor A (MakA) cytotoxin, Xenorhabdus nematophila alpha-xenorhabdolysin (XaxA), Bacillus thuringiensis crystal 6Aa (Cry6Aa) parasporal crystal (Cry) toxin, and Bacillus cereus non-hemolytic enterotoxin (Nhe) component A (NheA) of the non-hemolytic enterotoxin Nhe, which, despite its name, is hemolytic, among others. In solution, ClyA proteins have an elongated, almost entirely alpha-helical structure, except for a short hydrophobic beta-hairpin known as the beta-tongue. Pore formation by ClyA requires circular oligomerization of the toxin by a sequential mechanism. This, in turn, concentrates the amphipathic helices in the center of the ring-like structure, forming a helical barrel that inserts into the membrane by a wedge-like mechanism. Compared with ClyA, NheA is almost entirely alpha-helical with an enlarged "head" domain, and an enlarged beta-tongue; it has been proposed that NheA could even form beta-barrel pores. Alpha-PFTs with similar structures are increasingly being found in eukaryotes, in particular as components of the immune systems of animals. This family may be distantly related to Escherichia coli alpha-PFT hemolysin E (HlyE, also known as ClyA or SheA).


Pssm-ID: 439149 [Multi-domain]  Cd Length: 224  Bit Score: 40.47  E-value: 8.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 1983 DVKGNHND-LNDLKTRLE---TADLRNSGLLGALNDTMDKL--SAITNDTAAK------LQAVKEKAREANDTAKAVLAQ 2050
Cdd:cd21116    41 LARAHALEwLNEIKPKLLslpNDIIGYNNTFQSYYPDLIELadNLIKGDQGAKqqllqgLEALQSQVTKKQTSVTSFINE 120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 117647249 2051 VKDLHQNLDGLKQNYNKLADSVAKTNAVVKDPSKNkiiadagtsVRNLEQEADRLIDKLKPIKELEDNLKKNISEIKELI 2130
Cdd:cd21116   121 LTTFKNDLDDDSRNLQTDATKAQAQVAVLNALKNQ---------LNSLAEQIDAAIDALEKLSNDWQTLDSDIKELITDL 191
                         170
                  ....*....|...
gi 117647249 2131 NQARKQANSIKVS 2143
Cdd:cd21116   192 EDAESSIDAAFLQ 204
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH