Conserved domains on  [gi|1720394819|ref|XP_030106689|]

slit homolog 1 protein isoform X1 [Mus musculus]



List of domain hits

Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1186-1319 3.70e-38


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 139.40  E-value: 3.70e-38
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  1186 NITLQVSTAEDNGILLYNG---DNDHIAVELYQGHVRVSYDPGSYPSSAIYSAETINDGQFHTVELVTFDQMVNLSIDGG 1262
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGskgGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394819  1263 SPMTMDNFGKHYTLNSEAPLYVGGMPVDvnsaaFRLWQILNGTSFHGCIRNLYINNE 1319
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLPED-----LKLPPLPVTPGFRGCIRNLKVNGK 132
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
43-217 2.23e-28


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 119.65  E-value: 2.23e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   43 TTVDCHGTGLQAIPKNIPRNT--ERLELNGNNITRIHKnDFAGLKQLRVLQLMENQIGAVERgAFDDMKELERLRLNRNQ 120
Cdd:COG4886    116 ESLDLSGNQLTDLPEELANLTnlKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQ 193
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  121 LQVLPELLfQNNQALSRLDLSENFLQAVPrKAFRGATDLKNLQLDKNRISCIEEgaFRALRGLEVLTLNNNNITTIPVSS 200
Cdd:COG4886    194 ITDLPEPL-GNLTNLEELDLSGNQLTDLP-EPLANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLTDLPPLA 269
                          170
                   ....*....|....*..
gi 1720394819  201 fnHMPKLRTFRLHSNHL 217
Cdd:COG4886    270 --NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
522-820 1.63e-23


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 105.40  E-value: 1.63e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  522 SVVECSSLKLSKIPERIPQSTTELRLNNNEISILEATGLFKKLSHLKKINLSNNKVSEIEDgTFEGAASVSELHLTANQL 601
Cdd:COG4886     70 SLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQL 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  602 ESIRSGmFRGLDGLRTLMLRNNRISCIhNDSFTGLRNVRLLSLYDNHITTIsPGAFDTLQALSTLNLlanpfncnchlsw 681
Cdd:COG4886    149 TDLPEP-LGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQITDL-PEPLGNLTNLEELDL------------- 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  682 lgdwlrkrkivtgnprcqnpdflrqiplqdvafpdfrceegqeevgclprpqcpqecacldtvvrcSNKHLQALPKGIP- 760
Cdd:COG4886    213 ------------------------------------------------------------------SGNQLTDLPEPLAn 226
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720394819  761 -KNVTELYLDGNQFTLVPgQLSTFKYLQLVDLSNNKISSLSNSSftNMSQLTTLILSYNAL 820
Cdd:COG4886    227 lTNLETLDLSNNQLTDLP-ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
644-730 1.94e-12

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 72.81  E-value: 1.94e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  644 LYDNHITTISPGAFDTLQALSTLNLLANPFNCNCHLSWLGDWLRKRKIVTGNPR---CQNPDFLRQIPLQDVAFPDFRCe 720
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGC- 80
                           90
                   ....*....|
gi 1720394819  721 eGQEEVGCLP 730
Cdd:TIGR00864   81 -DEEYVACLK 89
LRR_8 pfam13855
Leucine rich repeat;
312-369 1.76e-11


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 60.62  E-value: 1.76e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819  312 TEIRLELNGIKSIPPGAFSPYRKLRRIDLSNNQIAEIAPDAFQGLRSLNSLVLYGNKI 369
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
842-1002 2.27e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.27e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  842 NDVSTLQEGIFADVTSLSHLAIGANPLYCDCRLRWLSSWVK---TGYKEPGIARCAGPPEMEGKLLLTTPAKKFECqGPP 918
Cdd:TIGR00864    5 NKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEekgVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC-DEE 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  919 SLAvqakCDPCLSSpcqnQGTCHNDPLEVYRCTCPSGYKGRHCEVSLDGCSSNPCGNGGT----CHAQEGEDAGFT---- 990
Cdd:TIGR00864   84 YVA----CLKDNSS----GGGAARSELVIFSAAHEGLFQPEACNAFCFSAGHGLAALGEQgeclCGAAQPSEANFAcesl 155
                          170
                   ....*....|..
gi 1720394819  991 CSCPSGFEGPTC 1002
Cdd:TIGR00864  156 CSGPPPPPAAAC 167
LRR_8 pfam13855
Leucine rich repeat;
389-441 6.08e-09


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 6.08e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720394819  389 NANKINCIRPDAFQDLQNLSLLSLYDNKIQSLAKGTFTSLRAIQTLHLAQNPF 441
Cdd:pfam13855    9 SNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
188-328 7.31e-08

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 57.78  E-value: 7.31e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  188 LNNNNITTIPVSSFNHMPKLRTFRLHSNHLFCDCHLAWLSQWLRQ------RPTIglfTQCSGPASLRG---LNVAEVQK 258
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGqplLGIPLLDS 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  259 G----EFSC----SGQGEAAGAPACTLSSG--------SCPAMC-SCSSGIVDCRGKGLTAIPANLPETMTEIRLELNGI 321
Cdd:TIGR00864   79 GcdeeYVAClkdnSSGGGAARSELVIFSAAheglfqpeACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACESLCSG 158

                   ....*..
gi 1720394819  322 KSIPPGA 328
Cdd:TIGR00864  159 PPPPPAA 165
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1005-1041 1.19e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.48  E-value: 1.19e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1720394819 1005 DTDDCV-KHACVNGGVCVDGVGNYTCQCPLQYTGRACE 1041
Cdd:cd00054      1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1085-1119 4.29e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.94  E-value: 4.29e-06
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720394819 1085 DDCKD-HKCQNGAQCVDEVNSYACLCVEGYSGQLCE 1119
Cdd:cd00054      3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1044-1080 1.33e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.33e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720394819 1044 VDFCSPDmNPCQHEAQCVGTPDGPRCECMLGYTGDNC 1080
Cdd:cd00054      2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
412-475 2.59e-05

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 49.31  E-value: 2.59e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394819  412 LYDNKIQSLAKGTFTSLRAIQTLHLAQNPFICDCNLKWLADFLRTNPIETT---GARCASPRRLANK 475
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRqpeAALCAGPGALAGQ 68
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
733-760 2.15e-03

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.



Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 36.84  E-value: 2.15e-03
                           10        20
                   ....*....|....*....|....*...
gi 1720394819  733 QCPQECACLDTVVRCSNKHLQALPKGIP 760
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
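
Each hit above carries a two-part summary: an "Interval E-value" line (for example `1186-1319 3.70e-38`) and a `Pssm-ID: ...` line. The following is a minimal sketch of how those two lines could be read programmatically. It assumes the field labels appear exactly as printed on this page; the helper names `parse_interval` and `parse_summary` are hypothetical and are not part of any NCBI tool.

```python
import re

# Matches e.g. "1186-1319 3.70e-38": query start, query end, E-value.
INTERVAL_RE = re.compile(r"^\s*(\d+)-(\d+)\s+([\d.]+[eE][+-]?\d+)\s*$")

# Matches e.g. "Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 139.40  E-value: 3.70e-38".
# The bracketed flag is optional (the EGF_CA hits on this page do not carry it).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_interval(line):
    """Return (start, end, e_value) from an interval line, or None if it does not match."""
    m = INTERVAL_RE.match(line)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


def parse_summary(line):
    """Return a dict of the fields on a 'Pssm-ID:' line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    start, end, e_value = parse_interval("1186-1319 3.70e-38")
    # Number of query residues spanned by the hit (inclusive interval).
    print(end - start + 1)
    print(parse_summary(
        "Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.48  E-value: 1.19e-06"
    ))
```

Returning `None` for non-matching lines lets a caller run both helpers over every line of the listing and keep only the ones that parse, without first having to locate the hit boundaries.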
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1186-1319 3.70e-38



Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 139.40  E-value: 3.70e-38
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  1186 NITLQVSTAEDNGILLYNG---DNDHIAVELYQGHVRVSYDPGSYPSSAIYSAETINDGQFHTVELVTFDQMVNLSIDGG 1262
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGskgGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394819  1263 SPMTMDNFGKHYTLNSEAPLYVGGMPVDvnsaaFRLWQILNGTSFHGCIRNLYINNE 1319
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLPED-----LKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1164-1317 3.92e-37

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 137.16  E-value: 3.92e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819 1164 SVNFvDRDTYLQFTDLQN-WPRANITLQVSTAEDNGILLYNGD---NDHIAVELYQGHVRVSYDPGSypSSAIYSAET-I 1238
Cdd:cd00110      1 GVSF-SGSSYVRLPTLPApRTRLSISFSFRTTSPNGLLLYAGSqngGDFLALELEDGRLVLRYDLGS--GSLVLSSKTpL 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394819 1239 NDGQFHTVELVTFDQMVNLSIDGGSPMTMDNFGKHYTLNSEAPLYVGGMPVDVnsaafRLWQILNGTSFHGCIRNLYIN 1317
Cdd:cd00110     78 NDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDL-----KSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1191-1319 4.24e-34

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 127.54  E-value: 4.24e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819 1191 VSTAEDNGILLYNGD--NDHIAVELYQGHVRVSYDPGSYPSSAIYSAETINDGQFHTVELVTFDQMVNLSIDGGSPMTMD 1268
Cdd:pfam02210    1 FRTRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSL 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720394819 1269 NFGKHYTLNSEAPLYVGGMPVDVnsaafRLWQILNGTSFHGCIRNLYINNE 1319
Cdd:pfam02210   81 PPGESLLLNLNGPLYLGGLPPLL-----LLPALPVRAGFVGCIRDVRVNGE 126
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
43-217 2.23e-28



Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 119.65  E-value: 2.23e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   43 TTVDCHGTGLQAIPKNIPRNT--ERLELNGNNITRIHKnDFAGLKQLRVLQLMENQIGAVERgAFDDMKELERLRLNRNQ 120
Cdd:COG4886    116 ESLDLSGNQLTDLPEELANLTnlKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQ 193
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  121 LQVLPELLfQNNQALSRLDLSENFLQAVPrKAFRGATDLKNLQLDKNRISCIEEgaFRALRGLEVLTLNNNNITTIPVSS 200
Cdd:COG4886    194 ITDLPEPL-GNLTNLEELDLSGNQLTDLP-EPLANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLTDLPPLA 269
                          170
                   ....*....|....*..
gi 1720394819  201 fnHMPKLRTFRLHSNHL 217
Cdd:COG4886    270 --NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
522-820 1.63e-23



Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 105.40  E-value: 1.63e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  522 SVVECSSLKLSKIPERIPQSTTELRLNNNEISILEATGLFKKLSHLKKINLSNNKVSEIEDgTFEGAASVSELHLTANQL 601
Cdd:COG4886     70 SLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQL 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  602 ESIRSGmFRGLDGLRTLMLRNNRISCIhNDSFTGLRNVRLLSLYDNHITTIsPGAFDTLQALSTLNLlanpfncnchlsw 681
Cdd:COG4886    149 TDLPEP-LGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQITDL-PEPLGNLTNLEELDL------------- 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  682 lgdwlrkrkivtgnprcqnpdflrqiplqdvafpdfrceegqeevgclprpqcpqecacldtvvrcSNKHLQALPKGIP- 760
Cdd:COG4886    213 ------------------------------------------------------------------SGNQLTDLPEPLAn 226
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720394819  761 -KNVTELYLDGNQFTLVPgQLSTFKYLQLVDLSNNKISSLSNSSftNMSQLTTLILSYNAL 820
Cdd:COG4886    227 lTNLETLDLSNNQLTDLP-ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR_8 pfam13855
Leucine rich repeat;
590-649 6.78e-14



Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.55  E-value: 6.78e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  590 SVSELHLTANQLESIRSGMFRGLDGLRTLMLRNNRISCIHNDSFTGLRNVRLLSLYDNHI 649
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
159-217 3.78e-13



Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 65.62  E-value: 3.78e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394819  159 LKNLQLDKNRISCIEEGAFRALRGLEVLTLNNNNITTIPVSSFNHMPKLRTFRLHSNHL 217
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
644-730 1.94e-12

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 72.81  E-value: 1.94e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  644 LYDNHITTISPGAFDTLQALSTLNLLANPFNCNCHLSWLGDWLRKRKIVTGNPR---CQNPDFLRQIPLQDVAFPDFRCe 720
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGC- 80
                           90
                   ....*....|
gi 1720394819  721 eGQEEVGCLP 730
Cdd:TIGR00864   81 -DEEYVACLK 89
LRR_8 pfam13855
Leucine rich repeat;
312-369 1.76e-11



Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 60.62  E-value: 1.76e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819  312 TEIRLELNGIKSIPPGAFSPYRKLRRIDLSNNQIAEIAPDAFQGLRSLNSLVLYGNKI 369
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
842-1002 2.27e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.27e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  842 NDVSTLQEGIFADVTSLSHLAIGANPLYCDCRLRWLSSWVK---TGYKEPGIARCAGPPEMEGKLLLTTPAKKFECqGPP 918
Cdd:TIGR00864    5 NKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEekgVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC-DEE 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  919 SLAvqakCDPCLSSpcqnQGTCHNDPLEVYRCTCPSGYKGRHCEVSLDGCSSNPCGNGGT----CHAQEGEDAGFT---- 990
Cdd:TIGR00864   84 YVA----CLKDNSS----GGGAARSELVIFSAAHEGLFQPEACNAFCFSAGHGLAALGEQgeclCGAAQPSEANFAcesl 155
                          170
                   ....*....|..
gi 1720394819  991 CSCPSGFEGPTC 1002
Cdd:TIGR00864  156 CSGPPPPPAAAC 167
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
529-818 1.00e-10



Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 66.64  E-value: 1.00e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  529 LKLSKIPERIPQSTTELRLNNNEISILEATglfkKLSHLKKINLSNNKVSEIEDGTFEgaaSVSELHLTANQLESIRSgm 608
Cdd:PRK15370   188 LGLTTIPACIPEQITTLILDNNELKSLPEN----LQGNIKTLYANSNQLTSIPATLPD---TIQEMELSINRITELPE-- 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  609 fRGLDGLRTLMLRNNRISCIHNDSFTGLRNvrlLSLYDNHITTIsPGAFDTlqALSTLNLLANPfncnchLSWLGDWLrk 688
Cdd:PRK15370   259 -RLPSALQSLDLFHNKISCLPENLPEELRY---LSVYDNSIRTL-PAHLPS--GITHLNVQSNS------LTALPETL-- 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  689 rkivtgnprcqnPDFLRQIplqdvafpdfrcEEGQEEVGCLPRpQCPQECACLDTvvrcSNKHLQALPKGIPKNVTELYL 768
Cdd:PRK15370   324 ------------PPGLKTL------------EAGENALTSLPA-SLPPELQVLDV----SKNQITVLPETLPPTITTLDV 374
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720394819  769 DGNQFTLVPGQLSTfkYLQLVDLSNNKISSLSNS--SFTNMS-QLTTLILSYN 818
Cdd:PRK15370   375 SRNALTNLPENLPA--ALQIMQASRNNLVRLPESlpHFRGEGpQPTRIIVEYN 425
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
66-217 1.46e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.88  E-value: 1.46e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   66 LELNGNNITRIhkNDFAGLKQLRVLQLMENQIGAVErgAFDDMKELERLRLNRNQLQVLPELlfQNNQALSRLDLSENFL 145
Cdd:cd21340      7 LYLNDKNITKI--DNLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIEKIENL--ENLVNLKKLYLGGNRI 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819  146 QAVprKAFRGATDLKNLQLDKNRIS-----CIEEGAFRALRG-LEVLTLNNNNITTIpvSSFNHMPKLRTFRLHSNHL 217
Cdd:cd21340     81 SVV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQI 154
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
530-673 6.33e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 60.96  E-value: 6.33e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  530 KLSKIpERIPQST--TELRLNNNEISILEatGLfKKLSHLKKINLSNNKVSEIEDgtFEGAASVSELHLTANQLES---- 603
Cdd:cd21340     35 KITKI-ENLEFLTnlTHLYLQNNQIEKIE--NL-ENLVNLKKLYLGGNRISVVEG--LENLTNLEELHIENQRLPPgekl 108
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720394819  604 -IRSGMFRGL-DGLRTLMLRNNRISCIhnDSFTGLRNVRLLSLYDNHITTISP--GAFDTLQALSTLNLLANPF 673
Cdd:cd21340    109 tFDPRSLAALsNSLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
LRR_8 pfam13855
Leucine rich repeat;
389-441 6.08e-09



Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 6.08e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720394819  389 NANKINCIRPDAFQDLQNLSLLSLYDNKIQSLAKGTFTSLRAIQTLHLAQNPF 441
Cdd:pfam13855    9 SNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
28-197 3.86e-08

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 58.17  E-value: 3.86e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   28 RLGATACPALCTCTGTTVDCHGTGLQAIPKNIPRNTERLELNGNNITRI-------------HKNDFAGLKQ-----LRV 89
Cdd:PRK15370   187 ILGLTTIPACIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIpatlpdtiqemelSINRITELPErlpsaLQS 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   90 LQLMENQIGAVERGAFDdmkELERLRLNRNQLQVLPELLfqnNQALSRLDLSENFLQAVPRKAFRGatdLKNLQLDKNRI 169
Cdd:PRK15370   267 LDLFHNKISCLPENLPE---ELRYLSVYDNSIRTLPAHL---PSGITHLNVQSNSLTALPETLPPG---LKTLEAGENAL 337
                          170       180
                   ....*....|....*....|....*...
gi 1720394819  170 SCIEEGAFRALRgleVLTLNNNNITTIP 197
Cdd:PRK15370   338 TSLPASLPPELQ---VLDVSKNQITVLP 362
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
188-328 7.31e-08

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 57.78  E-value: 7.31e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  188 LNNNNITTIPVSSFNHMPKLRTFRLHSNHLFCDCHLAWLSQWLRQ------RPTIglfTQCSGPASLRG---LNVAEVQK 258
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGqplLGIPLLDS 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  259 G----EFSC----SGQGEAAGAPACTLSSG--------SCPAMC-SCSSGIVDCRGKGLTAIPANLPETMTEIRLELNGI 321
Cdd:TIGR00864   79 GcdeeYVAClkdnSSGGGAARSELVIFSAAheglfqpeACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACESLCSG 158

                   ....*..
gi 1720394819  322 KSIPPGA 328
Cdd:TIGR00864  159 PPPPPAA 165
LRRCT smart00082
Leucine rich repeat C-terminal domain;
866-915 6.83e-07

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 47.42  E-value: 6.83e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1720394819   866 NPLYCDCRLRWLSSWV--KTGYKEPGIARCAGPPEMEGKLLLTTPAkKFECQ 915
Cdd:smart00082    1 NPFICDCELRWLLRWLqaNEHLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1005-1041 1.19e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.48  E-value: 1.19e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1720394819 1005 DTDDCV-KHACVNGGVCVDGVGNYTCQCPLQYTGRACE 1041
Cdd:cd00054      1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
185-671 2.02e-06

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 52.54  E-value: 2.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  185 VLTLNNNNITTIPVSSFNHMPKLRTFRLHSNHLfcdchlawlSQWLRQrptiGLFTQCSgpaSLRGLNVAevqKGEFSCS 264
Cdd:PLN00113    73 SIDLSGKNISGKISSAIFRLPYIQTINLSNNQL---------SGPIPD----DIFTTSS---SLRYLNLS---NNNFTGS 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  265 -GQGEAAGAPACTLS----SGSCPA-MCSCSS-GIVDCRGKGLTA-IPANLPE--TMTEIRLELNGIKSIPPGAFSPYRK 334
Cdd:PLN00113   134 iPRGSIPNLETLDLSnnmlSGEIPNdIGSFSSlKVLDLGGNVLVGkIPNSLTNltSLEFLTLASNQLVGQIPRELGQMKS 213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  335 LRRIDLSNNQIAEIAPDAFQGLRSLNSLVL-YGNKITDLPRGvFGGLYTLQLLLLNANKINCIRPDAFQDLQNLSLLSLY 413
Cdd:PLN00113   214 LKWIYLGYNNLSGEIPYEIGGLTSLNHLDLvYNNLTGPIPSS-LGNLKNLQYLFLYQNKLSGPIPPSIFSLQKLISLDLS 292
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  414 DNKIQSLAKGTFTSLRAIQTLHLAQNPFicdcnlkwladflrTNPIetTGARCASPRRlankRIGQIKSKKFRCSakeqy 493
Cdd:PLN00113   293 DNSLSGEIPELVIQLQNLEILHLFSNNF--------------TGKI--PVALTSLPRL----QVLQLWSNKFSGE----- 347
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  494 fIPGTEDYHLNsectsdvacphkcrceASVVECSSLKLS-KIPE--------------------RIP------QSTTELR 546
Cdd:PLN00113   348 -IPKNLGKHNN----------------LTVLDLSTNNLTgEIPEglcssgnlfklilfsnslegEIPkslgacRSLRRVR 410
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  547 LNNNEISiLEATGLFKKLSHLKKINLSNNKVS-EIEDGTFE----------------------GAASVSELHLTANQLES 603
Cdd:PLN00113   411 LQDNSFS-GELPSEFTKLPLVYFLDISNNNLQgRINSRKWDmpslqmlslarnkffgglpdsfGSKRLENLDLSRNQFSG 489
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819  604 IRSGMFRGLDGLRTLMLRNNRISCIHNDSFTGLRNVRLLSLYDNHITTISPGAFDTLQALSTLNLLAN 671
Cdd:PLN00113   490 AVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQN 557
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1085-1119 4.29e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.94  E-value: 4.29e-06
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720394819 1085 DDCKD-HKCQNGAQCVDEVNSYACLCVEGYSGQLCE 1119
Cdd:cd00054      3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LRRCT smart00082
Leucine rich repeat C-terminal domain;
671-719 6.40e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 44.73  E-value: 6.40e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 1720394819   671 NPFNCNCHLSWLGDWLRKRKIV--TGNPRCQNPDFLRQiPLQDVAFPDFRC 719
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKC 50
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
927-962 7.22e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.16  E-value: 7.22e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720394819  927 DPCLS-SPCQNQGTCHNDPlEVYRCTCPSGYKGRHCE 962
Cdd:cd00054      3 DECASgNPCQNGGTCVNTV-GSYRCSCPPGYTGRNCE 38
LRRNT smart00013
Leucine rich repeat N-terminal domain;
33-64 1.25e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.25e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1720394819    33 ACPALCTCTGTTVDCHGTGLQAIPKNIPRNTE 64
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1044-1080 1.33e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.33e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720394819 1044 VDFCSPDmNPCQHEAQCVGTPDGPRCECMLGYTGDNC 1080
Cdd:cd00054      2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
296-373 2.48e-05

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 48.92  E-value: 2.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  296 RGKGLTAIPANLPETMTEIRLELNGIKSIP---PGAfspyrkLRRIDLSNNQIAEIAPDAFQGLRSLNslvLYGNKITDL 372
Cdd:PRK15370   228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                   .
gi 1720394819  373 P 373
Cdd:PRK15370   299 P 299
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
412-475 2.59e-05

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 49.31  E-value: 2.59e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394819  412 LYDNKIQSLAKGTFTSLRAIQTLHLAQNPFICDCNLKWLADFLRTNPIETT---GARCASPRRLANK 475
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRqpeAALCAGPGALAGQ 68
EGF_CA smart00179
Calcium-binding EGF-like domain;
1005-1041 3.32e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 42.23  E-value: 3.32e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1720394819  1005 DTDDCV-KHACVNGGVCVDGVGNYTCQCPLQYT-GRACE 1041
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
114-374 6.85e-05

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase whose phosphorylation of centrosomal linker proteins promotes centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 45.93  E-value: 6.85e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  114 LRLNRNQLQVLPEL-LFQNnqaLSRLDLSENFLQAVPrkAFRGATDLKNLQLDKNRISCIEEgaFRALRGLEVLTLNNNN 192
Cdd:cd21340      7 LYLNDKNITKIDNLsLCKN---LKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  193 ITTipVSSFNHMPKLRTfrLHsnhlfcdchlawLSqwlRQRPTIGlFTQCSGPASLRGLnvaevqkgefscsgqgeaaga 272
Cdd:cd21340     80 ISV--VEGLENLTNLEE--LH------------IE---NQRLPPG-EKLTFDPRSLAAL--------------------- 118
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  273 pACTLSsgscpamcscssgIVDCRGkgltaipanlpetmteirlelNGIKSIPPgaFSPYRKLRRIDLSNNQIAEIAP-- 350
Cdd:cd21340    119 -SNSLR-------------VLNISG---------------------NNIDSLEP--LAPLRNLEQLDASNNQISDLEEll 161
                          250       260
                   ....*....|....*....|....
gi 1720394819  351 DAFQGLRSLNSLVLYGNKITDLPR 374
Cdd:cd21340    162 DLLSSWPSLRELDLTGNPVCKKPK 185
LRRNT smart00013
Leucine rich repeat N-terminal domain;
282-313 7.76e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 41.15  E-value: 7.76e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1720394819   282 CPAMCSCSSGIVDCRGKGLTAIPANLPETMTE 313
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
1085-1119 1.08e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 40.69  E-value: 1.08e-04
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1720394819  1085 DDCK-DHKCQNGAQCVDEVNSYACLCVEGYS-GQLCE 1119
Cdd:smart00179    3 DECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRNT smart00013
Leucine rich repeat N-terminal domain;
733-765 1.86e-04

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 39.99  E-value: 1.86e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1720394819   733 QCPQECACLDTVVRCSNKHLQALPKGIPKNVTE 765
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1087-1115 1.91e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.06  E-value: 1.91e-04
                           10        20
                   ....*....|....*....|....*....
gi 1720394819 1087 CKDHKCQNGAQCVDEVNSYACLCVEGYSG 1115
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
282-308 2.00e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats (pfam00560) are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine-rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 2.00e-04
                           10        20
                   ....*....|....*....|....*..
gi 1720394819  282 CPAMCSCSSGIVDCRGKGLTAIPANLP 308
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
LRRCT smart00082
Leucine rich repeat C-terminal domain;
439-469 3.17e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 40.11  E-value: 3.17e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1720394819   439 NPFICDCNLKWLADFLRTNPI--ETTGARCASP 469
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
1044-1080 1.54e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.61  E-value: 1.54e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1720394819  1044 VDFCSpDMNPCQHEAQCVGTPDGPRCECMLGYT-GDNC 1080
Cdd:smart00179    2 IDECA-SGNPCQNGGTCVNTVGSYRCECPPGYTdGRNC 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
929-960 1.74e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 37.36  E-value: 1.74e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1720394819  929 CLSSPCQNQGTCHNDPlEVYRCTCPSGYKGRH 960
Cdd:pfam00008    1 CAPNPCSNGGTCVDTP-GGYTCICPEGYTGKR 31
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
733-760 2.15e-03

Leucine rich repeat N-terminal domain; Leucine Rich Repeats (pfam00560) are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine-rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 36.84  E-value: 2.15e-03
                           10        20
                   ....*....|....*....|....*...
gi 1720394819  733 QCPQECACLDTVVRCSNKHLQALPKGIP 760
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1009-1038 4.65e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.21  E-value: 4.65e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1720394819 1009 CVKHACVNGGVCVDGVGNYTCQCPLQYTGR 1038
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1186-1319 3.70e-38

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 139.40  E-value: 3.70e-38
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  1186 NITLQVSTAEDNGILLYNG---DNDHIAVELYQGHVRVSYDPGSYPSSAIYSAETINDGQFHTVELVTFDQMVNLSIDGG 1262
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGskgGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394819  1263 SPMTMDNFGKHYTLNSEAPLYVGGMPVDvnsaaFRLWQILNGTSFHGCIRNLYINNE 1319
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLPED-----LKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1164-1317 3.92e-37

Laminin G domain; Laminin G-like domains are usually Ca++-mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes, including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 137.16  E-value: 3.92e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819 1164 SVNFvDRDTYLQFTDLQN-WPRANITLQVSTAEDNGILLYNGD---NDHIAVELYQGHVRVSYDPGSypSSAIYSAET-I 1238
Cdd:cd00110      1 GVSF-SGSSYVRLPTLPApRTRLSISFSFRTTSPNGLLLYAGSqngGDFLALELEDGRLVLRYDLGS--GSLVLSSKTpL 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394819 1239 NDGQFHTVELVTFDQMVNLSIDGGSPMTMDNFGKHYTLNSEAPLYVGGMPVDVnsaafRLWQILNGTSFHGCIRNLYIN 1317
Cdd:cd00110     78 NDGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDL-----KSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1191-1319 4.24e-34

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 127.54  E-value: 4.24e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819 1191 VSTAEDNGILLYNGD--NDHIAVELYQGHVRVSYDPGSYPSSAIYSAETINDGQFHTVELVTFDQMVNLSIDGGSPMTMD 1268
Cdd:pfam02210    1 FRTRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSL 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720394819 1269 NFGKHYTLNSEAPLYVGGMPVDVnsaafRLWQILNGTSFHGCIRNLYINNE 1319
Cdd:pfam02210   81 PPGESLLLNLNGPLYLGGLPPLL-----LLPALPVRAGFVGCIRDVRVNGE 126
Laminin_G_1 pfam00054
Laminin G domain;
1191-1322 1.71e-28

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 111.64  E-value: 1.71e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819 1191 VSTAEDNGILLYNGDNDH---IAVELYQGHVRVSYDPGSYPSSaIYSAETINDGQFHTVELVTFDQMVNLSIDGG-SPMT 1266
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTErdfLALELRDGRLEVSYDLGSGAAV-VRSGDKLNDGKWHSVELERNGRSGTLSVDGEaRPTG 79
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720394819 1267 MDNFGKHYTLNSEAPLYVGGMPVDVNSAafrlWQILNGTSFHGCIRNLYINNELQD 1322
Cdd:pfam00054   80 ESPLGATTDLDVDGPLYVGGLPSLGVKK----RRLAISPSFDGCIRDVIVNGKPLD 131
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
43-217 2.23e-28

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 119.65  E-value: 2.23e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   43 TTVDCHGTGLQAIPKNIPRNT--ERLELNGNNITRIHKnDFAGLKQLRVLQLMENQIGAVERgAFDDMKELERLRLNRNQ 120
Cdd:COG4886    116 ESLDLSGNQLTDLPEELANLTnlKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQ 193
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  121 LQVLPELLfQNNQALSRLDLSENFLQAVPrKAFRGATDLKNLQLDKNRISCIEEgaFRALRGLEVLTLNNNNITTIPVSS 200
Cdd:COG4886    194 ITDLPEPL-GNLTNLEELDLSGNQLTDLP-EPLANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLTDLPPLA 269
                          170
                   ....*....|....*..
gi 1720394819  201 fnHMPKLRTFRLHSNHL 217
Cdd:COG4886    270 --NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
522-820 1.63e-23

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 105.40  E-value: 1.63e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  522 SVVECSSLKLSKIPERIPQSTTELRLNNNEISILEATGLFKKLSHLKKINLSNNKVSEIEDgTFEGAASVSELHLTANQL 601
Cdd:COG4886     70 SLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQL 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  602 ESIRSGmFRGLDGLRTLMLRNNRISCIhNDSFTGLRNVRLLSLYDNHITTIsPGAFDTLQALSTLNLlanpfncnchlsw 681
Cdd:COG4886    149 TDLPEP-LGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQITDL-PEPLGNLTNLEELDL------------- 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  682 lgdwlrkrkivtgnprcqnpdflrqiplqdvafpdfrceegqeevgclprpqcpqecacldtvvrcSNKHLQALPKGIP- 760
Cdd:COG4886    213 ------------------------------------------------------------------SGNQLTDLPEPLAn 226
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720394819  761 -KNVTELYLDGNQFTLVPgQLSTFKYLQLVDLSNNKISSLSNSSftNMSQLTTLILSYNAL 820
Cdd:COG4886    227 lTNLETLDLSNNQLTDLP-ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
527-827 5.57e-23

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 103.47  E-value: 5.57e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  527 SSLKLSKIPERIPQSTTELRLNNNEISILEATGLFKKLSHLKKINLSNNKVSEIEDGTFEGAASVSELHLTANQLesirs 606
Cdd:COG4886     34 LLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEE----- 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  607 gmFRGLDGLRTLMLRNNRISCIhNDSFTGLRNVRLLSLYDNHITTIsPGAFDTLQALSTLNLLANPfncnchlswlgdwl 686
Cdd:COG4886    109 --LSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDL-PEPLGNLTNLKSLDLSNNQ-------------- 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  687 rkrkivtgnprcqnpdfLRQIPlqdvafpdfrceegqEEVGCLPRpqcpqecacLdTVVRCSNKHLQALPKGI--PKNVT 764
Cdd:COG4886    171 -----------------LTDLP---------------EELGNLTN---------L-KELDLSNNQITDLPEPLgnLTNLE 208
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720394819  765 ELYLDGNQFTLVPGQLSTFKYLQLVDLSNNKISSLsnSSFTNMSQLTTLILSYNALQCIPPLA 827
Cdd:COG4886    209 ELDLSGNQLTDLPEPLANLTNLETLDLSNNQLTDL--PELGNLTNLEELDLSNNQLTDLPPLA 269
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
55-477 9.30e-23

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 103.09  E-value: 9.30e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   55 IPKNIPRNTERLELNGNNITRIHKNDFAGLKQLRVLQLMENQigavergAFDDMKELERLRLNRNQLQVLPELLfQNNQA 134
Cdd:COG4886     66 LLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDLPEEL-ANLTN 137
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  135 LSRLDLSENFLQAVPrKAFRGATDLKNLQLDKNRISCIEEgAFRALRGLEVLTLNNNNITTIPvSSFNHMPKLRTFRLHS 214
Cdd:COG4886    138 LKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQITDLP-EPLGNLTNLEELDLSG 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  215 NHLFcdchlawlsqwlrqrpTIglftqcsgPASLRGLNvaevqkgefscsgqgeaagapacTLSSgscpamcscssgiVD 294
Cdd:COG4886    215 NQLT----------------DL--------PEPLANLT-----------------------NLET-------------LD 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  295 CRGKGLTAIP--ANLPEtMTEIRLELNGIKSIPPGAFSPyrKLRRIDLSNNQIAEIAPDAFQGLRSLNSLVLYGNKITDL 372
Cdd:COG4886    235 LSNNQLTDLPelGNLTN-LEELDLSNNQLTDLPPLANLT--NLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLL 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  373 PrgVFGGLYTLQLLLLNANKINCIRPDAFQDLQNLSLLSLYDNKIQSLAKGTFTSLRAIQTLHLAQNPFICDCNLKWLAD 452
Cdd:COG4886    312 E--LLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALLLLTL 389
                          410       420
                   ....*....|....*....|....*
gi 1720394819  453 FLRTNPIETTGARCASPRRLANKRI 477
Cdd:COG4886    390 LLLLLTTTAGVLLLTLALLDAVNTE 414
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
106-484 3.27e-21

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 98.08  E-value: 3.27e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  106 DDMKELERLRLNRNQLQVLPELLFQNNQALSRLDLSENflqavprKAFRGATDLKNLQLDKNRISCIEEgAFRALRGLEV 185
Cdd:COG4886     69 LSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGN-------EELSNLTNLESLDLSGNQLTDLPE-ELANLTNLKE 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  186 LTLNNNNITTIPvSSFNHMPKLRTFRLHSNHLfcdchlawlsqwlrqrptIGLftqcsgPASLRGLNvaevqkgefscsg 265
Cdd:COG4886    141 LDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQL------------------TDL------PEELGNLT------------- 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  266 qgeaagapactlssgscpamcscssgivdcrgkgltaipaNLpetmTEIRLELNGIKSIPPgAFSPYRKLRRIDLSNNQI 345
Cdd:COG4886    183 ----------------------------------------NL----KELDLSNNQITDLPE-PLGNLTNLEELDLSGNQL 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  346 AEIaPDAFQGLRSLNSLVLYGNKITDLPrgvfgglytlqllllnankincirpdAFQDLQNLSLLSLYDNKIQSLakGTF 425
Cdd:COG4886    218 TDL-PEPLANLTNLETLDLSNNQLTDLP--------------------------ELGNLTNLEELDLSNNQLTDL--PPL 268
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394819  426 TSLRAIQTLHLAQNPfICDCNLKWLADFLRTNPIETTGARCASPRRLANKRIGQIKSKK 484
Cdd:COG4886    269 ANLTNLKTLDLSNNQ-LTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLL 326
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
303-674 1.09e-20

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 96.54  E-value: 1.09e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  303 IPANLPETMTEIRLELNGIKSIPPGAFSPYRKLRRIDLSNNqiaeiapDAFQGLRSLNSLVLYGNKITDLPrgvfgglyt 382
Cdd:COG4886     66 LLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGN-------EELSNLTNLESLDLSGNQLTDLP--------- 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  383 lqllllnankincirpDAFQDLQNLSLLSLYDNKIQSLAKgTFTSLRAIQTLHLAQNpficdcnlkwladflrtnpiett 462
Cdd:COG4886    130 ----------------EELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNN----------------------- 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  463 garcasprrlankrigQIKSkkfrcsakeqyfIPgtedyhlnsectsdvacphkcrceASVVECSSLKlskiperipqst 542
Cdd:COG4886    170 ----------------QLTD------------LP------------------------EELGNLTNLK------------ 185
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  543 tELRLNNNEISILEATglFKKLSHLKKINLSNNKVSEIEDgTFEGAASVSELHLTANQLESIRSgmFRGLDGLRTLMLRN 622
Cdd:COG4886    186 -ELDLSNNQITDLPEP--LGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSN 259
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1720394819  623 NRISCIhnDSFTGLRNVRLLSLYDNHITTISPGAFDTLQALSTLNLLANPFN 674
Cdd:COG4886    260 NQLTDL--PPLANLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLN 309
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
315-674 5.00e-20

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 94.62  E-value: 5.00e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  315 RLELNGIKSIppgafSPYRKLRRIDLSNNQIAEIaPDAFQGLRSLNSLVLYGNKITDLprgvfgglytlqllllnankin 394
Cdd:COG4886    100 ELDLSGNEEL-----SNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDL---------------------- 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  395 cirPDAFQDLQNLSLLSLYDNKIQSLAKgtftslrAIQTLHlaqnpficdcNLKWLadFLRTNPIETtgarcasprrlan 474
Cdd:COG4886    152 ---PEPLGNLTNLKSLDLSNNQLTDLPE-------ELGNLT----------NLKEL--DLSNNQITD------------- 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  475 krigqikskkfrcsakeqyfIPgtedyhlnsectsdvacphkcrceASVVECSSLKlskiperipqsttELRLNNNEISI 554
Cdd:COG4886    197 --------------------LP------------------------EPLGNLTNLE-------------ELDLSGNQLTD 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  555 LEATglFKKLSHLKKINLSNNKVSEIEDgtFEGAASVSELHLTANQLESIRSGMfrGLDGLRTLMLRNNRISCIHNDSFT 634
Cdd:COG4886    220 LPEP--LANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLKLKELE 293
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1720394819  635 GLRNVRLLSLYDNHITTISPGAFDTLQALSTLNLLANPFN 674
Cdd:COG4886    294 LLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
312-709 2.22e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 89.61  E-value: 2.22e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  312 TEIRLELNGIKSIPPgAFSPYRKLRRIDLSNNQIAEIaPDAFQGLRSLNSLVLYGNKITDLPrgvfgglytlqllllnan 391
Cdd:COG4886    116 ESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDL-PEPLGNLTNLKSLDLSNNQLTDLP------------------ 175
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  392 kincirpDAFQDLQNLSLLSLYDNKIQSLAKgTFTSLRAIQTLHLAQNPF------ICDC-NLKWLadFLRTNPIETtga 464
Cdd:COG4886    176 -------EELGNLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQLtdlpepLANLtNLETL--DLSNNQLTD--- 242
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  465 rcasprrlankrigqikskkfrcsakeqyfIPgtedyhlnsectsdvacphkcrceaSVVECSSLKlskiperipqsttE 544
Cdd:COG4886    243 ------------------------------LP-------------------------ELGNLTNLE-------------E 254
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  545 LRLNNNEISILEATGlfkKLSHLKKINLSNNKVSEIEDGTFEGAASVSELHLTANQLESIrsGMFRGLDGLRTLMLRNNR 624
Cdd:COG4886    255 LDLSNNQLTDLPPLA---NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLL--ELLILLLLLTTLLLLLLL 329
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  625 ISCIHNDSFTGLRNVRLLSLYDNHITTISPGAFDTLQALSTLNLLANPFNCNCHLSWLGDWLRKRKIVTGNPRCQNPDFL 704
Cdd:COG4886    330 LKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALLLLTLLLLLLTTTAGVLLLTLALLD 409

                   ....*
gi 1720394819  705 RQIPL 709
Cdd:COG4886    410 AVNTE 414
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
43-198 5.72e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 88.45  E-value: 5.72e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   43 TTVDCHGTGLQAIPKNIPR--NTERLELNGNNITRIHKnDFAGLKQLRVLQLMENQIGAVERgAFDDMKELERLRLNRNQ 120
Cdd:COG4886    162 KSLDLSNNQLTDLPEELGNltNLKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQ 239
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819  121 LQVLPELlfQNNQALSRLDLSENFLQAVPrkAFRGATDLKNLQLDKNRISCIEEGAFRALRGLEVLTLNNNNITTIPV 198
Cdd:COG4886    240 LTDLPEL--GNLTNLEELDLSNNQLTDLP--PLANLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLEL 313
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
743-868 2.60e-15

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 79.98  E-value: 2.60e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  743 TVVRCSNKHLQALPKGIP--KNVTELYLDGNQFTLVPGQLSTFKYLQLVDLSNNKISSLSnSSFTNMSQLTTLILSYNAL 820
Cdd:COG4886    116 ESLDLSGNQLTDLPEELAnlTNLKELDLSNNQLTDLPEPLGNLTNLKSLDLSNNQLTDLP-EELGNLTNLKELDLSNNQI 194
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1720394819  821 QCIPPlAFQGLRSLRLLSLHGNDVSTLQEGIfADVTSLSHLAIGANPL 868
Cdd:COG4886    195 TDLPE-PLGNLTNLEELDLSGNQLTDLPEPL-ANLTNLETLDLSNNQL 240
LRR_8 pfam13855
Leucine rich repeat;
590-649 6.78e-14

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.55  E-value: 6.78e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  590 SVSELHLTANQLESIRSGMFRGLDGLRTLMLRNNRISCIHNDSFTGLRNVRLLSLYDNHI 649
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
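Each hit in this listing carries a summary line with the same fields (Pssm-ID, hit type, Cd Length, Bit Score, E-value). Below is a minimal sketch of extracting those fields from one such line, assuming only the plain-text layout seen in this listing. The names `SUMMARY_RE` and `parse_hit_summary` are hypothetical and are not part of any NCBI tool.

```python
import re

# Matches the per-hit summary line as it appears in this listing, e.g.
# "Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.55  E-value: 6.78e-14"
# The bracketed hit type is treated as optional (an assumption).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),  # None when no bracketed type is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }
```

Applied to every line of the listing, this keeps only the summary lines and skips alignment rows, which return None.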
LRR_8 pfam13855
Leucine rich repeat;
614-673 2.76e-13

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 66.01  E-value: 2.76e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  614 GLRTLMLRNNRISCIHNDSFTGLRNVRLLSLYDNHITTISPGAFDTLQALSTLNLLANPF 673
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
159-217 3.78e-13

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 65.62  E-value: 3.78e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394819  159 LKNLQLDKNRISCIEEGAFRALRGLEVLTLNNNNITTIPVSSFNHMPKLRTFRLHSNHL 217
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
644-730 1.94e-12

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 aa. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 72.81  E-value: 1.94e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  644 LYDNHITTISPGAFDTLQALSTLNLLANPFNCNCHLSWLGDWLRKRKIVTGNPR---CQNPDFLRQIPLQDVAFPDFRCe 720
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGC- 80
                           90
                   ....*....|
gi 1720394819  721 eGQEEVGCLP 730
Cdd:TIGR00864   81 -DEEYVACLK 89
LRR_8 pfam13855
Leucine rich repeat;
565-625 5.12e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.16  E-value: 5.12e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720394819  565 SHLKKINLSNNKVSEIEDGTFEGAASVSELHLTANQLESIRSGMFRGLDGLRTLMLRNNRI 625
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
135-193 1.07e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 61.39  E-value: 1.07e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394819  135 LSRLDLSENFLQAVPRKAFRGATDLKNLQLDKNRISCIEEGAFRALRGLEVLTLNNNNI 193
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
312-369 1.76e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 60.62  E-value: 1.76e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819  312 TEIRLELNGIKSIPPGAFSPYRKLRRIDLSNNQIAEIAPDAFQGLRSLNSLVLYGNKI 369
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
736-868 1.90e-11

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 68.04  E-value: 1.90e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  736 QECACLDTVVRCSNKHLQALpkgipKNVTELYLDGNQFTLVPGQLSTFKYLQLVDLSNNKISSLSnSSFTNMSQLTTLIL 815
Cdd:COG4886     93 GDLTNLTELDLSGNEELSNL-----TNLESLDLSGNQLTDLPEELANLTNLKELDLSNNQLTDLP-EPLGNLTNLKSLDL 166
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720394819  816 SYNALQCIPPlAFQGLRSLRLLSLHGNDVSTLQEGIfADVTSLSHLAIGANPL 868
Cdd:COG4886    167 SNNQLTDLPE-ELGNLTNLKELDLSNNQITDLPEPL-GNLTNLEELDLSGNQL 217
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
842-1002 2.27e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 aa. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.27e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  842 NDVSTLQEGIFADVTSLSHLAIGANPLYCDCRLRWLSSWVK---TGYKEPGIARCAGPPEMEGKLLLTTPAKKFECqGPP 918
Cdd:TIGR00864    5 NKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEekgVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC-DEE 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  919 SLAvqakCDPCLSSpcqnQGTCHNDPLEVYRCTCPSGYKGRHCEVSLDGCSSNPCGNGGT----CHAQEGEDAGFT---- 990
Cdd:TIGR00864   84 YVA----CLKDNSS----GGGAARSELVIFSAAHEGLFQPEACNAFCFSAGHGLAALGEQgeclCGAAQPSEANFAcesl 155
                          170
                   ....*....|..
gi 1720394819  991 CSCPSGFEGPTC 1002
Cdd:TIGR00864  156 CSGPPPPPAAAC 167
LRR_8 pfam13855
Leucine rich repeat;
61-121 4.07e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 59.85  E-value: 4.07e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720394819   61 RNTERLELNGNNITRIHKNDFAGLKQLRVLQLMENQIGAVERGAFDDMKELERLRLNRNQL 121
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
529-818 1.00e-10

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 66.64  E-value: 1.00e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  529 LKLSKIPERIPQSTTELRLNNNEISILEATglfkKLSHLKKINLSNNKVSEIEDGTFEgaaSVSELHLTANQLESIRSgm 608
Cdd:PRK15370   188 LGLTTIPACIPEQITTLILDNNELKSLPEN----LQGNIKTLYANSNQLTSIPATLPD---TIQEMELSINRITELPE-- 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  609 fRGLDGLRTLMLRNNRISCIHNDSFTGLRNvrlLSLYDNHITTIsPGAFDTlqALSTLNLLANPfncnchLSWLGDWLrk 688
Cdd:PRK15370   259 -RLPSALQSLDLFHNKISCLPENLPEELRY---LSVYDNSIRTL-PAHLPS--GITHLNVQSNS------LTALPETL-- 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  689 rkivtgnprcqnPDFLRQIplqdvafpdfrcEEGQEEVGCLPRpQCPQECACLDTvvrcSNKHLQALPKGIPKNVTELYL 768
Cdd:PRK15370   324 ------------PPGLKTL------------EAGENALTSLPA-SLPPELQVLDV----SKNQITVLPETLPPTITTLDV 374
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720394819  769 DGNQFTLVPGQLSTfkYLQLVDLSNNKISSLSNS--SFTNMS-QLTTLILSYN 818
Cdd:PRK15370   375 SRNALTNLPENLPA--ALQIMQASRNNLVRLPESlpHFRGEGpQPTRIIVEYN 425
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
66-217 1.46e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.88  E-value: 1.46e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   66 LELNGNNITRIhkNDFAGLKQLRVLQLMENQIGAVErgAFDDMKELERLRLNRNQLQVLPELlfQNNQALSRLDLSENFL 145
Cdd:cd21340      7 LYLNDKNITKI--DNLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIEKIENL--ENLVNLKKLYLGGNRI 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819  146 QAVprKAFRGATDLKNLQLDKNRIS-----CIEEGAFRALRG-LEVLTLNNNNITTIpvSSFNHMPKLRTFRLHSNHL 217
Cdd:cd21340     81 SVV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQI 154
LRR_8 pfam13855
Leucine rich repeat;
333-417 4.69e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 56.76  E-value: 4.69e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  333 RKLRRIDLSNNQIAEIAPDAFQGLRSLNSLVLYGNKITdlprgvfgglytlqllllnankinCIRPDAFQDLQNLSLLSL 412
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLT------------------------TLSPGAFSGLPSLRYLDL 56

                   ....*
gi 1720394819  413 YDNKI 417
Cdd:pfam13855   57 SGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
543-601 5.53e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 56.38  E-value: 5.53e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394819  543 TELRLNNNEISILEAtGLFKKLSHLKKINLSNNKVSEIEDGTFEGAASVSELHLTANQL 601
Cdd:pfam13855    4 RSLDLSNNRLTSLDD-GAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
530-673 6.33e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 60.96  E-value: 6.33e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  530 KLSKIpERIPQST--TELRLNNNEISILEatGLfKKLSHLKKINLSNNKVSEIEDgtFEGAASVSELHLTANQLES---- 603
Cdd:cd21340     35 KITKI-ENLEFLTnlTHLYLQNNQIEKIE--NL-ENLVNLKKLYLGGNRISVVEG--LENLTNLEELHIENQRLPPgekl 108
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720394819  604 -IRSGMFRGL-DGLRTLMLRNNRISCIhnDSFTGLRNVRLLSLYDNHITTISP--GAFDTLQALSTLNLLANPF 673
Cdd:cd21340    109 tFDPRSLAALsNSLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
LRR_8 pfam13855
Leucine rich repeat;
762-820 1.67e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 55.22  E-value: 1.67e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  762 NVTELYLDGNQFTLV-PGQLSTFKYLQLVDLSNNKISSLSNSSFTNMSQLTTLILSYNAL 820
Cdd:pfam13855    2 NLRSLDLSNNRLTSLdDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
110-169 5.62e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 5.62e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  110 ELERLRLNRNQLQVLPELLFQNNQALSRLDLSENFLQAVPRKAFRGATDLKNLQLDKNRI 169
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
43-217 5.78e-09

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 57.87  E-value: 5.78e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   43 TTVDCHGTGLQAIPK-NIPRNTERLELNGNNITRIhkNDFAGLKQLRVLQLMENQIGAVERgaFDDMKELERLRLNRNQL 121
Cdd:cd21340      5 THLYLNDKNITKIDNlSLCKNLKVLYLYDNKITKI--ENLEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNRI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  122 QV------LPEL--LFQNNQalsRLDLSENFLQAvPRKAFRGATDLKNLQLDKNRISCIEEgaFRALRGLEVLTLNNNNI 193
Cdd:cd21340     81 SVveglenLTNLeeLHIENQ---RLPPGEKLTFD-PRSLAALSNSLRVLNISGNNIDSLEP--LAPLRNLEQLDASNNQI 154
                          170       180
                   ....*....|....*....|....*.
gi 1720394819  194 TTIP--VSSFNHMPKLRTFRLHSNHL 217
Cdd:cd21340    155 SDLEelLDLLSSWPSLRELDLTGNPV 180
LRR_8 pfam13855
Leucine rich repeat;
389-441 6.08e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 6.08e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720394819  389 NANKINCIRPDAFQDLQNLSLLSLYDNKIQSLAKGTFTSLRAIQTLHLAQNPF 441
Cdd:pfam13855    9 SNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
28-197 3.86e-08

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 58.17  E-value: 3.86e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   28 RLGATACPALCTCTGTTVDCHGTGLQAIPKNIPRNTERLELNGNNITRI-------------HKNDFAGLKQ-----LRV 89
Cdd:PRK15370   187 ILGLTTIPACIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIpatlpdtiqemelSINRITELPErlpsaLQS 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   90 LQLMENQIGAVERGAFDdmkELERLRLNRNQLQVLPELLfqnNQALSRLDLSENFLQAVPRKAFRGatdLKNLQLDKNRI 169
Cdd:PRK15370   267 LDLFHNKISCLPENLPE---ELRYLSVYDNSIRTLPAHL---PSGITHLNVQSNSLTALPETLPPG---LKTLEAGENAL 337
                          170       180
                   ....*....|....*....|....*...
gi 1720394819  170 SCIEEGAFRALRgleVLTLNNNNITTIP 197
Cdd:PRK15370   338 TSLPASLPPELQ---VLDVSKNQITVLP 362
LRR_8 pfam13855
Leucine rich repeat;
786-832 4.90e-08

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 50.99  E-value: 4.90e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1720394819  786 LQLVDLSNNKISSLSNSSFTNMSQLTTLILSYNALQCIPPLAFQGLR 832
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLP 49
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
188-328 7.31e-08

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 aa. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 57.78  E-value: 7.31e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  188 LNNNNITTIPVSSFNHMPKLRTFRLHSNHLFCDCHLAWLSQWLRQ------RPTIglfTQCSGPASLRG---LNVAEVQK 258
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGqplLGIPLLDS 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  259 G----EFSC----SGQGEAAGAPACTLSSG--------SCPAMC-SCSSGIVDCRGKGLTAIPANLPETMTEIRLELNGI 321
Cdd:TIGR00864   79 GcdeeYVAClkdnSSGGGAARSELVIFSAAheglfqpeACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACESLCSG 158

                   ....*..
gi 1720394819  322 KSIPPGA 328
Cdd:TIGR00864  159 PPPPPAA 165
LRR_8 pfam13855
Leucine rich repeat;
86-143 1.78e-07

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 49.45  E-value: 1.78e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819   86 QLRVLQLMENQIGAVERGAFDDMKELERLRLNRNQLQVLPELLFQNNQALSRLDLSEN 143
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGN 59
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
51-217 3.80e-07

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 55.09  E-value: 3.80e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   51 GLQAIPKNIPRNTERLELNGNNITRIHKNDFAGLKQLRVlqlMENQIGAVERGAFDDMKELErlrLNRNQLQVLPELLfq 130
Cdd:PRK15370   189 GLTTIPACIPEQITTLILDNNELKSLPENLQGNIKTLYA---NSNQLTSIPATLPDTIQEME---LSINRITELPERL-- 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  131 nNQALSRLDLSENFLQAVPRKAFRGatdLKNLQLDKNRISCIEE-------------GAFRAL-----RGLEVLTLNNNN 192
Cdd:PRK15370   261 -PSALQSLDLFHNKISCLPENLPEE---LRYLSVYDNSIRTLPAhlpsgithlnvqsNSLTALpetlpPGLKTLEAGENA 336
                          170       180
                   ....*....|....*....|....*
gi 1720394819  193 ITTIPVSSfnhMPKLRTFRLHSNHL 217
Cdd:PRK15370   337 LTSLPASL---PPELQVLDVSKNQI 358
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
543-701 6.46e-07

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 52.10  E-value: 6.46e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  543 TELRLNNNEISILEATGLFKKLSHLKkinLSNNKVSEIEDgtFEGAASVSELHLTANQLESIRsgMFRGLDGLRTLMLRN 622
Cdd:cd21340      5 THLYLNDKNITKIDNLSLCKNLKVLY---LYDNKITKIEN--LEFLTNLTHLYLQNNQIEKIE--NLENLVNLKKLYLGG 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  623 NRISCIHN---------------------------DSFTGLRN-VRLLSLYDNHITTISPgaFDTLQALSTLNLLANPFN 674
Cdd:cd21340     78 NRISVVEGlenltnleelhienqrlppgekltfdpRSLAALSNsLRVLNISGNNIDSLEP--LAPLRNLEQLDASNNQIS 155
                          170       180
                   ....*....|....*....|....*....
gi 1720394819  675 CNCHLSW-LGDWLRKRKI-VTGNPRCQNP 701
Cdd:cd21340    156 DLEELLDlLSSWPSLRELdLTGNPVCKKP 184
LRRCT smart00082
Leucine rich repeat C-terminal domain;
866-915 6.83e-07

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 47.42  E-value: 6.83e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1720394819   866 NPLYCDCRLRWLSSWV--KTGYKEPGIARCAGPPEMEGKLLLTTPAkKFECQ 915
Cdd:smart00082    1 NPFICDCELRWLLRWLqaNEHLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
749-868 9.93e-07

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 53.55  E-value: 9.93e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  749 NKHLQALPKGIPKNVTELYLDGNQFTLVPGQLStfKYLQLVDLSNNKISSLSNSSftnMSQLTTLILSYNALQCIP---P 825
Cdd:PRK15370   208 NNELKSLPENLQGNIKTLYANSNQLTSIPATLP--DTIQEMELSINRITELPERL---PSALQSLDLFHNKISCLPenlP 282
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1720394819  826 LAFQglrslrLLSLHGNDVSTLQEGIfadVTSLSHLAIGANPL 868
Cdd:PRK15370   283 EELR------YLSVYDNSIRTLPAHL---PSGITHLNVQSNSL 316
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1005-1041 1.19e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.48  E-value: 1.19e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1720394819 1005 DTDDCV-KHACVNGGVCVDGVGNYTCQCPLQYTGRACE 1041
Cdd:cd00054      1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
185-671 2.02e-06

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 52.54  E-value: 2.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  185 VLTLNNNNITTIPVSSFNHMPKLRTFRLHSNHLfcdchlawlSQWLRQrptiGLFTQCSgpaSLRGLNVAevqKGEFSCS 264
Cdd:PLN00113    73 SIDLSGKNISGKISSAIFRLPYIQTINLSNNQL---------SGPIPD----DIFTTSS---SLRYLNLS---NNNFTGS 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  265 -GQGEAAGAPACTLS----SGSCPA-MCSCSS-GIVDCRGKGLTA-IPANLPE--TMTEIRLELNGIKSIPPGAFSPYRK 334
Cdd:PLN00113   134 iPRGSIPNLETLDLSnnmlSGEIPNdIGSFSSlKVLDLGGNVLVGkIPNSLTNltSLEFLTLASNQLVGQIPRELGQMKS 213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  335 LRRIDLSNNQIAEIAPDAFQGLRSLNSLVL-YGNKITDLPRGvFGGLYTLQLLLLNANKINCIRPDAFQDLQNLSLLSLY 413
Cdd:PLN00113   214 LKWIYLGYNNLSGEIPYEIGGLTSLNHLDLvYNNLTGPIPSS-LGNLKNLQYLFLYQNKLSGPIPPSIFSLQKLISLDLS 292
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  414 DNKIQSLAKGTFTSLRAIQTLHLAQNPFicdcnlkwladflrTNPIetTGARCASPRRlankRIGQIKSKKFRCSakeqy 493
Cdd:PLN00113   293 DNSLSGEIPELVIQLQNLEILHLFSNNF--------------TGKI--PVALTSLPRL----QVLQLWSNKFSGE----- 347
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  494 fIPGTEDYHLNsectsdvacphkcrceASVVECSSLKLS-KIPE--------------------RIP------QSTTELR 546
Cdd:PLN00113   348 -IPKNLGKHNN----------------LTVLDLSTNNLTgEIPEglcssgnlfklilfsnslegEIPkslgacRSLRRVR 410
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  547 LNNNEISiLEATGLFKKLSHLKKINLSNNKVS-EIEDGTFE----------------------GAASVSELHLTANQLES 603
Cdd:PLN00113   411 LQDNSFS-GELPSEFTKLPLVYFLDISNNNLQgRINSRKWDmpslqmlslarnkffgglpdsfGSKRLENLDLSRNQFSG 489
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720394819  604 IRSGMFRGLDGLRTLMLRNNRISCIHNDSFTGLRNVRLLSLYDNHITTISPGAFDTLQALSTLNLLAN 671
Cdd:PLN00113   490 AVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQN 557
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1085-1119 4.29e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.94  E-value: 4.29e-06
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720394819 1085 DDCKD-HKCQNGAQCVDEVNSYACLCVEGYSGQLCE 1119
Cdd:cd00054      3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
743-823 5.90e-06

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 49.01  E-value: 5.90e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  743 TVVRCSNKHLQALPK-GIPKNVTELYLDGNQFTLVPGqLSTFKYLQLVDLSNNKISSLSNssFTNMSQLTTLILSYNALQ 821
Cdd:cd21340      5 THLYLNDKNITKIDNlSLCKNLKVLYLYDNKITKIEN-LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNRIS 81

                   ..
gi 1720394819  822 CI 823
Cdd:cd21340     82 VV 83
LRRCT smart00082
Leucine rich repeat C-terminal domain;
671-719 6.40e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 44.73  E-value: 6.40e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 1720394819   671 NPFNCNCHLSWLGDWLRKRKIV--TGNPRCQNPDFLRQiPLQDVAFPDFRC 719
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKC 50
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
299-420 6.44e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 50.85  E-value: 6.44e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  299 GLTAIPANLPETMTEIRLELNGIKSIPPGAFSPYRKLRRIDLSNNQIAEIAPDAFQGLRslnslvLYGNKITDLPRGVfg 378
Cdd:PRK15370   189 GLTTIPACIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIPATLPDTIQEME------LSINRITELPERL-- 260
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1720394819  379 gLYTLQLLLLNANKINCIrPDAFQDlqNLSLLSLYDNKIQSL 420
Cdd:PRK15370   261 -PSALQSLDLFHNKISCL-PENLPE--ELRYLSVYDNSIRTL 298
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
927-962 7.22e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.16  E-value: 7.22e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720394819  927 DPCLS-SPCQNQGTCHNDPlEVYRCTCPSGYKGRHCE 962
Cdd:cd00054      3 DECASgNPCQNGGTCVNTV-GSYRCSCPPGYTGRNCE 38
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
561-818 9.99e-06

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 48.24  E-value: 9.99e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  561 FKKLSHLkkiNLSNNKVSEIEDgtfegaasvseLHLTANqlesirsgmfrgldgLRTLMLRNNRISCIHNdsFTGLRNVR 640
Cdd:cd21340      1 LKRITHL---YLNDKNITKIDN-----------LSLCKN---------------LKVLYLYDNKITKIEN--LEFLTNLT 49
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  641 LLSLYDNHITTISPgaFDTLQALSTLNLLanpFNCNCHLSWLgdwlrkrkivtgnprcQNPDFLRQIPLqdvafpdfrce 720
Cdd:cd21340     50 HLYLQNNQIEKIEN--LENLVNLKKLYLG---GNRISVVEGL----------------ENLTNLEELHI----------- 97
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  721 EGQeevgCLPRPQC-PQECACLDTVVRCsnkhLQALpkgipkNVTelyldGNQFTlVPGQLSTFKYLQLVDLSNNKISSL 799
Cdd:cd21340     98 ENQ----RLPPGEKlTFDPRSLAALSNS----LRVL------NIS-----GNNID-SLEPLAPLRNLEQLDASNNQISDL 157
                          250       260
                   ....*....|....*....|.
gi 1720394819  800 SN--SSFTNMSQLTTLILSYN 818
Cdd:cd21340    158 EEllDLLSSWPSLRELDLTGN 178
LRRNT smart00013
Leucine rich repeat N-terminal domain;
33-64 1.25e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.25e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1720394819    33 ACPALCTCTGTTVDCHGTGLQAIPKNIPRNTE 64
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1044-1080 1.33e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.33e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720394819 1044 VDFCSPDmNPCQHEAQCVGTPDGPRCECMLGYTGDNC 1080
Cdd:cd00054      2 IDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
969-1002 1.54e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.54e-05
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1720394819  969 SSNPCGNGGTCHAQEGedaGFTCSCPSGFEGPTC 1002
Cdd:cd00054      7 SGNPCQNGGTCVNTVG---SYRCSCPPGYTGRNC 37
LRR_8 pfam13855
Leucine rich repeat;
181-217 1.94e-05

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 43.67  E-value: 1.94e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720394819  181 RGLEVLTLNNNNITTIPVSSFNHMPKLRTFRLHSNHL 217
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLL 37
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
296-373 2.48e-05

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 48.92  E-value: 2.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  296 RGKGLTAIPANLPETMTEIRLELNGIKSIP---PGAfspyrkLRRIDLSNNQIAEIAPDAFQGLRSLNslvLYGNKITDL 372
Cdd:PRK15370   228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                   .
gi 1720394819  373 P 373
Cdd:PRK15370   299 P 299
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
412-475 2.59e-05

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 aa. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 49.31  E-value: 2.59e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720394819  412 LYDNKIQSLAKGTFTSLRAIQTLHLAQNPFICDCNLKWLADFLRTNPIETT---GARCASPRRLANK 475
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRqpeAALCAGPGALAGQ 68
EGF_CA smart00179
Calcium-binding EGF-like domain;
1005-1041 3.32e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 42.23  E-value: 3.32e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1720394819  1005 DTDDCV-KHACVNGGVCVDGVGNYTCQCPLQYT-GRACE 1041
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
50-217 3.73e-05

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 47.86  E-value: 3.73e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   50 TGLQAIPKNIPRNTE--RLELNGNNITRIHKNDFA----GLKQLRVLQLMENQIGavERGA---FDDMKE---LERLRLN 117
Cdd:COG5238    195 EGIEELAEALTQNTTvtTLWLKRNPIGDEGAEILAealkGNKSLTTLDLSNNQIG--DEGVialAEALKNnttVETLYLS 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  118 RNQLQV-----LPELLfQNNQALSRLDLSENFLQAVPRKAF----RGATDLKNLQLDKNRISciEEGA---FRAL---RG 182
Cdd:COG5238    273 GNQIGAegaiaLAKAL-QGNTTLTSLDLSVNRIGDEGAIALaeglQGNKTLHTLNLAYNGIG--AQGAialAKALqenTT 349
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1720394819  183 LEVLTLNNNNITTIPVSSFNHMPKLRTfRLHSNHL 217
Cdd:COG5238    350 LHSLDLSDNQIGDEGAIALAKYLEGNT-TLRELNL 383
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
114-374 6.85e-05

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 45.93  E-value: 6.85e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  114 LRLNRNQLQVLPEL-LFQNnqaLSRLDLSENFLQAVPrkAFRGATDLKNLQLDKNRISCIEEgaFRALRGLEVLTLNNNN 192
Cdd:cd21340      7 LYLNDKNITKIDNLsLCKN---LKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  193 ITTipVSSFNHMPKLRTfrLHsnhlfcdchlawLSqwlRQRPTIGlFTQCSGPASLRGLnvaevqkgefscsgqgeaaga 272
Cdd:cd21340     80 ISV--VEGLENLTNLEE--LH------------IE---NQRLPPG-EKLTFDPRSLAAL--------------------- 118
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  273 pACTLSsgscpamcscssgIVDCRGkgltaipanlpetmteirlelNGIKSIPPgaFSPYRKLRRIDLSNNQIAEIAP-- 350
Cdd:cd21340    119 -SNSLR-------------VLNISG---------------------NNIDSLEP--LAPLRNLEQLDASNNQISDLEEll 161
                          250       260
                   ....*....|....*....|....
gi 1720394819  351 DAFQGLRSLNSLVLYGNKITDLPR 374
Cdd:cd21340    162 DLLSSWPSLRELDLTGNPVCKKPK 185
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-417 7.08e-05

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 47.54  E-value: 7.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   55 IPKNIPRNTE--RLELNGNNITRIHKNDFAGLKQLRVLQLMENQI-GAVERGAFDdMKELERLRLNRNQLQ-VLPELLFQ 130
Cdd:PLN00113   228 IPYEIGGLTSlnHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLsGPIPPSIFS-LQKLISLDLSDNSLSgEIPELVIQ 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  131 NnQALSRLDL-SENFLQAVPRkAFRGATDLKNLQLDKNRIScieeGAFRALRG----LEVLTLNNNNIT-TIP--VSSFN 202
Cdd:PLN00113   307 L-QNLEILHLfSNNFTGKIPV-ALTSLPRLQVLQLWSNKFS----GEIPKNLGkhnnLTVLDLSTNNLTgEIPegLCSSG 380
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  203 HMPKLRTFrlhSNHLfcdchlawlsqwlrqrptiglftQCSGPASL---RGLNVAEVQKGEFSCSGQGEAAGAPACTLSS 279
Cdd:PLN00113   381 NLFKLILF---SNSL-----------------------EGEIPKSLgacRSLRRVRLQDNSFSGELPSEFTKLPLVYFLD 434
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  280 GSCPAMcscsSGIVDCRGKGLTaipanlpeTMTEIRLELNGIKSIPPGAFSPyRKLRRIDLSNNQIAEIAPDAFQGLRSL 359
Cdd:PLN00113   435 ISNNNL----QGRINSRKWDMP--------SLQMLSLARNKFFGGLPDSFGS-KRLENLDLSRNQFSGAVPRKLGSLSEL 501
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720394819  360 NSLVLYGNKIT-DLPRGVfGGLYTLQLLLLNANKINCIRPDAFQDLQNLSLLSLYDNKI 417
Cdd:PLN00113   502 MQLKLSENKLSgEIPDEL-SSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
LRRNT smart00013
Leucine rich repeat N-terminal domain;
282-313 7.76e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 41.15  E-value: 7.76e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1720394819   282 CPAMCSCSSGIVDCRGKGLTAIPANLPETMTE 313
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
1085-1119 1.08e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 40.69  E-value: 1.08e-04
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1720394819  1085 DDCK-DHKCQNGAQCVDEVNSYACLCVEGYS-GQLCE 1119
Cdd:smart00179    3 DECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
33-60 1.27e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 40.30  E-value: 1.27e-04
                           10        20
                   ....*....|....*....|....*...
gi 1720394819   33 ACPALCTCTGTTVDCHGTGLQAIPKNIP 60
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
87-220 1.51e-04

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 45.81  E-value: 1.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   87 LRVLQLMENQIGA-----VERGAFDDMKELERLRLNRNQLQVLP----ELLFQNNQALSRLDLSENFL--QAVPR--KAF 153
Cdd:cd00116    110 LQELKLNNNGLGDrglrlLAKGLKDLPPALEKLVLGRNRLEGAScealAKALRANRDLKELNLANNGIgdAGIRAlaEGL 189
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720394819  154 RGATDLKNLQLDKNRISCIE----EGAFRALRGLEVLTLNNNNITTIPVSSF-----NHMPKLRTFRLHSNHLFCD 220
Cdd:cd00116    190 KANCNLEVLDLNNNGLTDEGasalAETLASLKSLEVLNLGDNNLTDAGAAALasallSPNISLLTLSLSCNDITDD 265
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
548-673 1.77e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 46.38  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  548 NNNEISILEATG---------LFKKLSHLKKINLSNNKVS-EIEDGTFEGAASVSELHLTANQLE-SIRSGMfrgLDGLR 616
Cdd:PLN00113    67 NSSRVVSIDLSGknisgkissAIFRLPYIQTINLSNNQLSgPIPDDIFTTSSSLRYLNLSNNNFTgSIPRGS---IPNLE 143
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  617 TLMLRNNRISC-IHND--SFTglrNVRLLSLYDNHITTISPGAFDTLQALSTLNLLANPF 673
Cdd:PLN00113   144 TLDLSNNMLSGeIPNDigSFS---SLKVLDLGGNVLVGKIPNSLTNLTSLEFLTLASNQL 200
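Each alignment row in this listing carries a start coordinate, the aligned text, and an end coordinate. The sketch below checks that the end coordinate equals the start plus the number of residues in the row minus one. The row layout in the regular expression and the helper name `check_row` are assumptions inferred from the rows shown here, not a documented format; lowercase letters are counted as residues and `-` is treated as a gap.

```python
import re

# Assumed row layout, inferred from the listing: label, start, aligned text, end.
ROW = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")

def check_row(line: str):
    """Parse one alignment row; return (label, start, end, n_residues, consistent)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, aligned = m.group(1), m.group(3)
    start, end = int(m.group(2)), int(m.group(4))
    # Letters in either case are residues; '-' marks a gap and is not counted.
    n_residues = sum(ch.isalpha() for ch in aligned)
    return label, start, end, n_residues, start + n_residues - 1 == end

# First query row of the PLN00113 hit above, split only to keep the line short.
row = ("gi 1720394819  548 NNNEISILEATG---------LFKKLSHLKKINLSNNKVS-EIEDGTFEGAASVSELHLTANQLE"
       "-SIRSGMfrgLDGLR 616")
print(check_row(row))
```

A row that fails the check usually points to extraction damage (a dropped or merged column) rather than to a problem in the alignment itself.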
LRRNT smart00013
Leucine rich repeat N-terminal domain;
733-765 1.86e-04

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 39.99  E-value: 1.86e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1720394819   733 QCPQECACLDTVVRCSNKHLQALPKGIPKNVTE 765
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1087-1115 1.91e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model because of the many similar but distinct subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.06  E-value: 1.91e-04
                           10        20
                   ....*....|....*....|....*....
gi 1720394819 1087 CKDHKCQNGAQCVDEVNSYACLCVEGYSG 1115
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
398-627 1.98e-04

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 45.42  E-value: 1.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  398 PDAFQDLQNLSLLSLYDNKIQSLAKGTFTSLRAIQTLHLAQnpficdcnlkwladfLRTNPIETTGAR--CASPRRLank 475
Cdd:cd00116     74 LQGLTKGCGLQELDLSDNALGPDGCGVLESLLRSSSLQELK---------------LNNNGLGDRGLRllAKGLKDL--- 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  476 rigQIKSKKFRCSakeqyfipgteDYHLNSECTSDVA--CPHKCRCEasvvecsslklskiperipqsttELRLNNNEIS 553
Cdd:cd00116    136 ---PPALEKLVLG-----------RNRLEGASCEALAkaLRANRDLK-----------------------ELNLANNGIG 178
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  554 ILEATGL---FKKLSHLKKINLSNNKVSEIEDGTFEGA----ASVSELHLTANQLES-----IRSGMFRGLDGLRTLMLR 621
Cdd:cd00116    179 DAGIRALaegLKANCNLEVLDLNNNGLTDEGASALAETlaslKSLEVLNLGDNNLTDagaaaLASALLSPNISLLTLSLS 258

                   ....*.
gi 1720394819  622 NNRISC 627
Cdd:cd00116    259 CNDITD 264
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
282-308 2.00e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 2.00e-04
                           10        20
                   ....*....|....*....|....*..
gi 1720394819  282 CPAMCSCSSGIVDCRGKGLTAIPANLP 308
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
183-220 2.15e-04

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 40.31  E-value: 2.15e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1720394819  183 LEVLTLNNNNITTIPvsSFNHMPKLRTFRLHSNHLFCD 220
Cdd:pfam12799    3 LEVLDLSNNQITDIP--PLAKLPNLETLDLSGNNKITD 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
61-439 2.46e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 45.99  E-value: 2.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   61 RNTERLELNGNNITRIHKNDFAGLKQLRVLQLMENQIGAVERGAFDDMKELERLRLNRNQLQ-VLPELLFqNNQALSRLD 139
Cdd:PLN00113   212 KSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSgPIPPSIF-SLQKLISLD 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  140 LSENFLQA-VPRKAFRgatdlknlqldknriscieegafraLRGLEVLTLNNNNIT-TIPVSsFNHMPKLRTFRLHSNHL 217
Cdd:PLN00113   291 LSDNSLSGeIPELVIQ-------------------------LQNLEILHLFSNNFTgKIPVA-LTSLPRLQVLQLWSNKF 344
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  218 fcdchlawlsqwlrqrptiglftqcSG--PASLRGLNVAEVqkgeFSCSGQGEAAGAPACTLSSGSCPAMCSCSSGIVDC 295
Cdd:PLN00113   345 -------------------------SGeiPKNLGKHNNLTV----LDLSTNNLTGEIPEGLCSSGNLFKLILFSNSLEGE 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  296 RGKGLTAIpanlpETMTEIRLELNGIKSIPPGAFSPYRKLRRIDLSNN----------------QIAEIA--------PD 351
Cdd:PLN00113   396 IPKSLGAC-----RSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNnlqgrinsrkwdmpslQMLSLArnkffgglPD 470
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  352 AFqGLRSLNSLVLYGNKITD-LPRGvFGGLYTLQLLLLNANKINCIRPDAFQDLQNLSLLSLYDNKIQSLAKGTFTSLRA 430
Cdd:PLN00113   471 SF-GSKRLENLDLSRNQFSGaVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPV 548

                   ....*....
gi 1720394819  431 IQTLHLAQN 439
Cdd:PLN00113   549 LSQLDLSQN 557
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
334-373 2.80e-04

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 39.92  E-value: 2.80e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1720394819  334 KLRRIDLSNNQIAEIapDAFQGLRSLNSLVLYGN-KITDLP 373
Cdd:pfam12799    2 NLEVLDLSNNQITDI--PPLAKLPNLETLDLSGNnKITDLS 40
LRRCT smart00082
Leucine rich repeat C-terminal domain;
439-469 3.17e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 40.11  E-value: 3.17e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1720394819   439 NPFICDCNLKWLADFLRTNPI--ETTGARCASP 469
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
LRRNT smart00013
Leucine rich repeat N-terminal domain;
512-544 3.66e-04

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 39.22  E-value: 3.66e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1720394819   512 ACPHKCRCEASVVECSSLKLSKIPERIPQSTTE 544
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
543-582 9.21e-04

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 38.38  E-value: 9.21e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1720394819  543 TELRLNNNEISILEatgLFKKLSHLKKINLS-NNKVSEIED 582
Cdd:pfam12799    4 EVLDLSNNQITDIP---PLAKLPNLETLDLSgNNKITDLSD 41
EGF_CA smart00179
Calcium-binding EGF-like domain;
966-1002 1.06e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 38.00  E-value: 1.06e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1720394819   966 DGCSS-NPCGNGGTCHAQEGedaGFTCSCPSGFE-GPTC 1002
Cdd:smart00179    3 DECASgNPCQNGGTCVNTVG---SYRCECPPGYTdGRNC 38
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
930-962 1.33e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.84  E-value: 1.33e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1720394819  930 LSSPCQNQGTCHNDPlEVYRCTCPSGYKG-RHCE 962
Cdd:cd00053      4 ASNPCSNGGTCVNTP-GSYRCVCPPGYTGdRSCE 36
EGF_CA smart00179
Calcium-binding EGF-like domain;
1044-1080 1.54e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.61  E-value: 1.54e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1720394819  1044 VDFCSpDMNPCQHEAQCVGTPDGPRCECMLGYT-GDNC 1080
Cdd:smart00179    2 IDECA-SGNPCQNGGTCVNTVGSYRCECPPGYTdGRNC 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
927-962 1.65e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.61  E-value: 1.65e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1720394819   927 DPCLS-SPCQNQGTCHNDPLEvYRCTCPSGYK-GRHCE 962
Cdd:smart00179    3 DECASgNPCQNGGTCVNTVGS-YRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
929-960 1.74e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model because of the many similar but distinct subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 37.36  E-value: 1.74e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1720394819  929 CLSSPCQNQGTCHNDPlEVYRCTCPSGYKGRH 960
Cdd:pfam00008    1 CAPNPCSNGGTCVDTP-GGYTCICPEGYTGKR 31
LRRCT smart00082
Leucine rich repeat C-terminal domain;
219-264 1.85e-03

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 37.79  E-value: 1.85e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1720394819   219 CDCHLAWLSQWLRQRPTIGLF--TQCSGPASLRGLnVAEVQKGEFSCS 264
Cdd:smart00082    5 CDCELRWLLRWLQANEHLQDPvdLRCASPSSLRGP-LLELLHSEFKCP 51
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
733-760 2.15e-03

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 36.84  E-value: 2.15e-03
                           10        20
                   ....*....|....*....|....*...
gi 1720394819  733 QCPQECACLDTVVRCSNKHLQALPKGIP 760
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
968-1001 2.60e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model because of the many similar but distinct subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.98  E-value: 2.60e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1720394819  968 CSSNPCGNGGTCHAQEGedaGFTCSCPSGFEGPT 1001
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
969-1001 2.81e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 36.69  E-value: 2.81e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1720394819  969 SSNPCGNGGTCHAQEGedaGFTCSCPSGFEGPT 1001
Cdd:cd00053      4 ASNPCSNGGTCVNTPG---SYRCVCPPGYTGDR 33
LRR_5 pfam13306
BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich ...
567-657 2.99e-03

BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich repeats. It contains a large number of BspA-like surface antigens from Trichomonas vaginalis.


Pssm-ID: 463839 [Multi-domain]  Cd Length: 127  Bit Score: 39.45  E-value: 2.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  567 LKKINLSNNkVSEIEDGTFEGAASVSELHLTaNQLESIRSGMFRGLdGLRTLMLRNNrISCIHNDSFTGLRNVRLLSLYD 646
Cdd:pfam13306   13 LTSITIPSS-LTSIGEYAFSNCTSLKSITLP-SSLTSIGSYAFYNC-SLTSITIPSS-LTSIGEYAFSNCSNLKSITLPS 88
                           90
                   ....*....|.
gi 1720394819  647 NhITTISPGAF 657
Cdd:pfam13306   89 N-LTSIGSYAF 98
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
43-196 3.02e-03

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 41.84  E-value: 3.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819   43 TTVDCHGTGLQAIP--KNIPrNTERLELNGNNITRIhkNDFAGLKQLRVLQLMENQIGAVergafddmkELERLRLNRNQ 120
Cdd:COG4886    231 ETLDLSNNQLTDLPelGNLT-NLEELDLSNNQLTDL--PPLANLTNLKTLDLSNNQLTDL---------KLKELELLLGL 298
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720394819  121 LQVLPELLFQNNQALSRLDLSENFLQAVPRKAFRGATDLKNLQLDKNRISCIEEGAFRALRGLEVLTLNNNNITTI 196
Cdd:COG4886    299 NSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGL 374
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
319-441 3.30e-03

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylating centrosomal linker proteins and thereby promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 40.92  E-value: 3.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720394819  319 NGIKSIPpgAFSPYRKLRRIDLSNNQIAEIAPdaFQGLRSLNSLVLYGNKIT-------------------DLPRGV--- 376
Cdd:cd21340     34 NKITKIE--NLEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNRISvveglenltnleelhienqRLPPGEklt 109
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720394819  377 FGGLYTLQLLLL----NA--NKINCIRPdaFQDLQNLSLLSLYDNKIQSLA--KGTFTSLRAIQTLHLAQNPF 441
Cdd:cd21340    110 FDPRSLAALSNSlrvlNIsgNNIDSLEP--LAPLRNLEQLDASNNQISDLEelLDLLSSWPSLRELDLTGNPV 180
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
615-654 4.16e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 36.45  E-value: 4.16e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1720394819  615 LRTLMLRNNRISCIhnDSFTGLRNVRLLSLYDN-HITTISP 654
Cdd:pfam12799    3 LEVLDLSNNQITDI--PPLAKLPNLETLDLSGNnKITDLSD 41
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
512-539 4.24e-03

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 36.07  E-value: 4.24e-03
                           10        20
                   ....*....|....*....|....*...
gi 1720394819  512 ACPHKCRCEASVVECSSLKLSKIPERIP 539
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
1090-1116 4.55e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 36.30  E-value: 4.55e-03
                           10        20
                   ....*....|....*....|....*..
gi 1720394819 1090 HKCQNGAQCVDEVNSYACLCVEGYSGQ 1116
Cdd:cd00053      6 NPCSNGGTCVNTPGSYRCVCPPGYTGD 32
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1009-1038 4.65e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model because of the many similar but distinct subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.21  E-value: 4.65e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1720394819 1009 CVKHACVNGGVCVDGVGNYTCQCPLQYTGR 1038
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
1052-1079 5.17e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 35.92  E-value: 5.17e-03
                           10        20
                   ....*....|....*....|....*...
gi 1720394819 1052 NPCQHEAQCVGTPDGPRCECMLGYTGDN 1079
Cdd:cd00053      6 NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
LRR_8 pfam13855
Leucine rich repeat;
637-668 5.70e-03

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 36.73  E-value: 5.70e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1720394819  637 RNVRLLSLYDNHITTISPGAFDTLQALSTLNL 668
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDL 32
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1092-1113 6.46e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteristic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neuregulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin and integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 35.39  E-value: 6.46e-03
                           10        20
                   ....*....|....*....|..
gi 1720394819 1092 CQNGAQCVDEVNSYACLCVEGY 1113
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
PLN03150 PLN03150
hypothetical protein; Provisional
776-820 7.04e-03

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 40.95  E-value: 7.04e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1720394819  776 VPGQLSTFKYLQLVDLSNNKISSLSNSSFTNMSQLTTLILSYNAL 820
Cdd:PLN03150   434 IPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSF 478
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
761-825 7.54e-03

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylating centrosomal linker proteins and thereby promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 39.77  E-value: 7.54e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720394819  761 KNVTELYLDGNQFTLVPGqLSTFKYLQLVDLSNNKISSLSNssFTNMSQLTTLILSYnalQCIPP 825
Cdd:cd21340     46 TNLTHLYLQNNQIEKIEN-LENLVNLKKLYLGGNRISVVEG--LENLTNLEELHIEN---QRLPP 104
LRR smart00370
Leucine-rich repeats, outliers;
180-203 8.02e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 35.41  E-value: 8.02e-03
                            10        20
                    ....*....|....*....|....
gi 1720394819   180 LRGLEVLTLNNNNITTIPVSSFNH 203
Cdd:smart00370    1 LPNLRELDLSNNQLSSLPPGAFQG 24
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
180-203 8.02e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 35.41  E-value: 8.02e-03
                            10        20
                    ....*....|....*....|....
gi 1720394819   180 LRGLEVLTLNNNNITTIPVSSFNH 203
Cdd:smart00369    1 LPNLRELDLSNNQLSSLPPGAFQG 24
LRR_8 pfam13855
Leucine rich repeat;
405-439 8.70e-03

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 35.96  E-value: 8.70e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1720394819  405 QNLSLLSLYDNKIQSLAKGTFTSLRAIQTLHLAQN 439
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNN 35
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
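Every hit in the list carries a summary line with its Pssm-ID, Cd Length, Bit Score, and E-value, and the parameters above state an E-value threshold of 0.01. The following is a minimal sketch of parsing such summary lines and checking them against that threshold. The field layout in the regular expression, the helper name `parse_summary`, and the choice of an inclusive comparison are assumptions for illustration rather than documented CD-Search behaviour.

```python
import re

# Assumed layout of a per-hit summary line, inferred from the listing.
# "[Multi-domain]" is optional, so the gap after the Pssm-ID is matched lazily.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)

def parse_summary(line: str) -> dict:
    """Extract the numeric fields from one hit summary line."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),
    }

E_VALUE_THRESHOLD = 0.01  # value stated under "Blast search parameters"

# Two summary lines copied from the hit list.
lines = [
    "Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 46.38  E-value: 1.77e-04",
    "Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 35.96  E-value: 8.70e-03",
]
hits = [parse_summary(line) for line in lines]
# Inclusive comparison is an assumption; the page does not say which is used.
passing = [h for h in hits if h["e_value"] <= E_VALUE_THRESHOLD]
print(passing)
```

Because the page reports precalculated data already filtered at this threshold, every listed hit is expected to pass; the check is mainly useful for catching lines damaged during extraction.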
