NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907123041|ref|XP_036016299|]
View 

polycystin-1 isoform X20 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
97-2725 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 3837.75  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   97 DLSNNRISTLEEGVFANLFNLSEINLSGNPFECNCGLAWLPRWAKEHQVHVVQSEATTCRGPIPLAGQPLLSIPLLDNAC 176
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  177 GEEYVACLPDNSSGAVAA----VPFYFAHEGPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWCSSIS 252
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAArselVIFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  253 LSLNSACGGPTLLQHTFPASPGATLVGPHGPLASGQPADFHITSSLPISSTRWNFGDGSPEVDMASP----AATHFYVLP 328
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  329 GSYHMTVVLALGAGSALLETEVQVEATPTVLELVCPSFVHSNESLELGIRHRGGSALEVTYSILALDKEPAQVVHPLCPL 408
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  409 DTEIFPGNGHCYRLVAEKAPWLQAQEQCRTWAGAALAMVDSPAIQHFLVSKVTRSLD--VWIGFSSVEGTE-GLDPRGEA 485
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEkGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  486 FSLESCQNWLPGEPHPATAEHCVRLGPAGQCNTDLCSAPHSYVCELRPGGPVWDTENFVMGMSGGGLSGPLHPLAQQETV 565
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  566 QGPLR-PVEVMVFPGLSPSREAFLTAAEFSTQKLEEPAQMRLQVYRPSGG--AAAVPEGSSEP-----DNRTEPAPKCVP 637
Cdd:TIGR00864  481 PAPPHeGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLRCRlpGAGGPACGPEAecrppDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  638 EELWCPGANVCIPFDASCNSHVCINGSVSRLGLSRAS----YTLWKEFFFSVPAGPPTQYLVTLHSQDVPMLPGDLIGLQ 713
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVLLQDHGEDLLMLPGDLIALQ 640
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  714 HDAGPGTLLQCPLASSCPG-QALYLSTNASDWM------------------------TNLPVHLEEAWAG----PVCSLQ 764
Cdd:TIGR00864  641 HDAGPAALIHCQPAPGHPGpRAPVFAANASEWFghnntpvppdnlagdgadplpdpeLDLKALLEGTRASwlecAACAIR 720
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  765 LLLVTERLTPLLGLGPNPGLQHPGHYEVRATVGNSVSRQNLSCSFSVVSPIAGLRVIHPIPLDGHIYVPTNGSVLVLQVD 844
Cdd:TIGR00864  721 LLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASCSFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVD 800
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  845 SGANATATAQWFGGNISAPFEDACPPEVDFLKQD-------CTEEANGTLFSVLMLPRLKEGDHT----VEIVAQNGASQ 913
Cdd:TIGR00864  801 SGANAEAKAFWPGGNSSARFENVCPAEFASRLCHpstfeggCAEEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASE 880
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  914 ANLSLRVTAEEPICGLRAVPSPEARVLQGILVRYSPMVEAGSDVAFRWTIDDKQSLTFHNTVFNVIYQSAAIFKLSLwlT 993
Cdd:TIGR00864  881 ANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSDMTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLSL--T 958
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  994 ASNHVSNITVNYNVTVERMNKMHGLWVSAVPTVLPPNATLALTGGVLVDSAVEVAFLWNFGDGEQVLRQFKPPYDESFQV 1073
Cdd:TIGR00864  959 AMNHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPGATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPC 1038
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1074 PDPTVAQVLVEHNTTHIYTTPGEYNLTVLVSNTYENLTQQVTVSVRTVLPNVAIGMSSNVLVAGQPITFSPYPLPSTDGV 1153
Cdd:TIGR00864 1039 PDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENISQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGI 1118
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1154 LYTWDFGDGSPVLIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTAHADIRVFQELHGLTVYLSPSVEQGAPMVVSASVES 1233
Cdd:TIGR00864 1119 HYEWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVNNTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQS 1198
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1234 GDNITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGHLSQSLHVQVFVLEVLHIEPSTCIPTQPSAQLMAHV 1313
Cdd:TIGR00864 1199 GDNITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNIGAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARV 1278
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1314 TGDPVHYLFDWTFGDGSSNVTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHYFTSICVEPEIRNITLQPERQFVKLGDE 1393
Cdd:TIGR00864 1279 SGNAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGTFPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDE 1358
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1394 ARLVAYSWPPFPYRYTWDFGTEDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFVEVQEPVLVTGIRINGSH 1473
Cdd:TIGR00864 1359 AQFQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFIYNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSH 1438
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1474 --VLELQQPYLLSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVKQRVRGLTINA 1551
Cdd:TIGR00864 1439 gnNLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPEILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINA 1518
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1552 SRTVVPLNGSVSFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGL 1631
Cdd:TIGR00864 1519 SLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIFGGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGL 1598
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1632 QVAGGD-----------NGCCFPTNYTLQLQAAVRDGTNISYSWTA--QQEGSLITLFGSGKCFSLTSLKASTYYVHLRA 1698
Cdd:TIGR00864 1599 QILGETaegggggvqelDGCYFETNHTVQFHAGFKDGTNLSFSWNAilDNEPDGPAFAGSGKGAKLNPLEAGPCDIFLQA 1678
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1699 TNMLGSAAANRTIDFVEPVESLILSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLV 1778
Cdd:TIGR00864 1679 ANLLGQATADCTIDFLEPAGNLMLAASDNPAAVNALINLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLV 1758
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1779 RVTAENQLGSVNATVEVAIQVPVGGLSIRTSEP-DSIFVAAGSTLPFWGQLAEGTNVTWCWTLPGGS-KDSQYIAVRFST 1856
Cdd:TIGR00864 1759 TMKAFNELGSANASEEVDVQEPISGLKIRAADAgEQNFFAADSSVCFQGELATGTNVSWCWAIDGGSsKMGKHACMTFPD 1838
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1857 AGSFSLQLNASNAVSWVSAMYNLTVEEPIVNLMLWASSKVVAPGQPVHFEILLAAGSALTFRLQVGGSVPEVL-PSPHFS 1935
Cdd:TIGR00864 1839 AGTFAIRLNASNAVSGKSASREFFAEEPIFGLELKASKKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFS 1918
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1936 HSFFRVGDHLVNVQAENHVSHAQAQVRILVLEAVVGLQVPNCCEPGMATGTEKNFTARVQRGSRVAYAWYFSLQKVQGDS 2015
Cdd:TIGR00864 1919 HSFPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGLQIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDS 1998
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2016 LVILSGRDVTYTPVAAGLLEIHVRAFNELGGVNLTLMVEVQDIIQYVTLQSG--RCFTNRSARFEAATSPSPRRVTYHWD 2093
Cdd:TIGR00864 1999 LVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALMDAALQAGpqDCFTNKMAQFEAATSPKPNFMACHWD 2078
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2094 FGDGTPVQKTEEFWADHYYLRPGDYHVEVNATNLVSFFVAQATVTVQVLACREPEVEVALPLQVLMRRSQRNYLEAHVDL 2173
Cdd:TIGR00864 2079 FGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDL 2158
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2174 RNCVSYQTEYRWEIYRTASCQRPGRMAQMVLPG-------------VDVSRPQLVVPRLALPVGHYCFVFVVSFGDTPLA 2240
Cdd:TIGR00864 2159 KDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLK 2238
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2241 RSIQANVTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVASTQSETGGCVLNFGPRG-SS 2319
Cdd:TIGR00864 2239 KAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGP 2318
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2320 VVTIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGHCHNCSRGYK 2399
Cdd:TIGR00864 2319 TLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFH 2398
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2400 QGCWAARTFSNKTLVLNETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLGGSCRLFPL 2479
Cdd:TIGR00864 2399 RGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPG 2478
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2480 --------------DSVRGLTTKVHFECTGWRDAEDGGAPLVYALLLKRCRQSYCENFCIYKGSLSTYGAVLPPGFQ-PL 2544
Cdd:TIGR00864 2479 getgqehgdkedevWAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAH 2558
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2545 FVVSLAVVVQDQLGAAVVALNRSLTIVLPEPSGNPADLVPWLHSLTASVLPGLLKQADPQHVIEYSLALITVLNEYEQAP 2624
Cdd:TIGR00864 2559 FEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQAL 2638
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2625 D-VSEPNVEQQLRAQMRKNITETLISLRVNTVDDIQQITAALAQCMVSSRELMCRSCLKKMLQKLEGMMRILQAETTEGT 2703
Cdd:TIGR00864 2639 DsAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGI 2718
                         2730      2740
                   ....*....|....*....|..
gi 1907123041 2704 LTPTTIADSILNITGDLIHLAS 2725
Cdd:TIGR00864 2719 VTPTAIADNILNIMGDLIHLAS 2740
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3667-3845 6.11e-66

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


:

Pssm-ID: 466668  Cd Length: 199  Bit Score: 223.07  E-value: 6.11e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3667 AFLAITRSDEFWPWMSHVFLPYVHGNQS------------SPELGPPRLRQVRLQEAFC--PDPSSSEHM-CSAAGSLST 3731
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSSClvHDKFVREINeCHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3732 SDYGI----GWQSVVQNGSETWAYSAPDLL-GAWYWGYCAVYDSGGYIQELGLSLEESRARLGFLQLHNWLDSRSRAVFV 3806
Cdd:pfam20519   81 EDRKLysalPYKPVHYGSKYWFIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1907123041 3807 ELTRYSPAVGLHAAVTLRLEFPVAGHALAAFSVRPFALR 3845
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3846-4067 2.64e-59

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


:

Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 205.20  E-value: 2.64e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3846 RLSTGLSLPLLT-SVCLLLFALYFSMAEVQTWRKDGcACTARpDTW--ARCLLVILTAATGLVRLAQLGIADRQWTHFVq 3922
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHR-PSYLR-SVWnlLDLAIVILSVVLIVLNIYRDFLADRLIKSVE- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3923 DHPRHFTSFDQVAQLGSVARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELMGATLGLVLLGVAYAQMAILLI 4002
Cdd:pfam08016   78 ASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLF 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907123041 4003 SSGADTLYNMARAFLVLCpgarvPTLCPSESWY--------LSPLLCVGLWALRVWGALRLGAILLRWRYHAL 4067
Cdd:pfam08016  158 GTQAPNFSNFVKSILTLF-----RTILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3112-3231 6.86e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


:

Pssm-ID: 238850  Cd Length: 120  Bit Score: 193.65  E-value: 6.86e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3112 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGD--RAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAW 3189
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1907123041 3190 FLQHIIVRDLQSARSTFFLVNDWLSVETEanGGLVEKEVLAA 3231
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
LRR_8 pfam13855
Leucine rich repeat;
61-127 2.26e-13

Leucine rich repeat;


:

Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.55  E-value: 2.26e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907123041   61 PSLRIpadataLDLSHNLLQTLDIGLLVNLSALVELDLSNNRISTLEEGVFANLFNLSEINLSGNPF 127
Cdd:pfam13855    1 PNLRS------LDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
GPS super family cl02559
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
3005-3054 9.61e-10

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for auto-proteolysis, so is thus named, GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. In most, if not all, cell-adhesion GPCRs these undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN domain - GPCR-Autoproteolysis INducing (GAIN). The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


The actual alignment was detected with superfamily member smart00303:

Pssm-ID: 470616  Cd Length: 49  Bit Score: 56.63  E-value: 9.61e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 1907123041  3005 YTSLCQYFSEEMMMWRTEGIVPLEETSpSQAVCLTRHLTAFGASLFVPPS 3054
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
LRRNT smart00013
Leucine rich repeat N-terminal domain;
32-71 4.61e-05

Leucine rich repeat N-terminal domain;


:

Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.08  E-value: 4.61e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 1907123041    32 PCPLPCFCGPapdaaCRVNCSGRWLQTLgpSLRIPADATA 71
Cdd:smart00013    1 ACPAPCNCSG-----TAVDCSGRGLTEV--PLDLPPDTTL 33
 
Name Accession Description Interval E-value
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
97-2725 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 3837.75  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   97 DLSNNRISTLEEGVFANLFNLSEINLSGNPFECNCGLAWLPRWAKEHQVHVVQSEATTCRGPIPLAGQPLLSIPLLDNAC 176
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  177 GEEYVACLPDNSSGAVAA----VPFYFAHEGPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWCSSIS 252
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAArselVIFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  253 LSLNSACGGPTLLQHTFPASPGATLVGPHGPLASGQPADFHITSSLPISSTRWNFGDGSPEVDMASP----AATHFYVLP 328
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  329 GSYHMTVVLALGAGSALLETEVQVEATPTVLELVCPSFVHSNESLELGIRHRGGSALEVTYSILALDKEPAQVVHPLCPL 408
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  409 DTEIFPGNGHCYRLVAEKAPWLQAQEQCRTWAGAALAMVDSPAIQHFLVSKVTRSLD--VWIGFSSVEGTE-GLDPRGEA 485
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEkGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  486 FSLESCQNWLPGEPHPATAEHCVRLGPAGQCNTDLCSAPHSYVCELRPGGPVWDTENFVMGMSGGGLSGPLHPLAQQETV 565
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  566 QGPLR-PVEVMVFPGLSPSREAFLTAAEFSTQKLEEPAQMRLQVYRPSGG--AAAVPEGSSEP-----DNRTEPAPKCVP 637
Cdd:TIGR00864  481 PAPPHeGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLRCRlpGAGGPACGPEAecrppDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  638 EELWCPGANVCIPFDASCNSHVCINGSVSRLGLSRAS----YTLWKEFFFSVPAGPPTQYLVTLHSQDVPMLPGDLIGLQ 713
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVLLQDHGEDLLMLPGDLIALQ 640
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  714 HDAGPGTLLQCPLASSCPG-QALYLSTNASDWM------------------------TNLPVHLEEAWAG----PVCSLQ 764
Cdd:TIGR00864  641 HDAGPAALIHCQPAPGHPGpRAPVFAANASEWFghnntpvppdnlagdgadplpdpeLDLKALLEGTRASwlecAACAIR 720
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  765 LLLVTERLTPLLGLGPNPGLQHPGHYEVRATVGNSVSRQNLSCSFSVVSPIAGLRVIHPIPLDGHIYVPTNGSVLVLQVD 844
Cdd:TIGR00864  721 LLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASCSFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVD 800
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  845 SGANATATAQWFGGNISAPFEDACPPEVDFLKQD-------CTEEANGTLFSVLMLPRLKEGDHT----VEIVAQNGASQ 913
Cdd:TIGR00864  801 SGANAEAKAFWPGGNSSARFENVCPAEFASRLCHpstfeggCAEEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASE 880
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  914 ANLSLRVTAEEPICGLRAVPSPEARVLQGILVRYSPMVEAGSDVAFRWTIDDKQSLTFHNTVFNVIYQSAAIFKLSLwlT 993
Cdd:TIGR00864  881 ANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSDMTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLSL--T 958
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  994 ASNHVSNITVNYNVTVERMNKMHGLWVSAVPTVLPPNATLALTGGVLVDSAVEVAFLWNFGDGEQVLRQFKPPYDESFQV 1073
Cdd:TIGR00864  959 AMNHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPGATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPC 1038
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1074 PDPTVAQVLVEHNTTHIYTTPGEYNLTVLVSNTYENLTQQVTVSVRTVLPNVAIGMSSNVLVAGQPITFSPYPLPSTDGV 1153
Cdd:TIGR00864 1039 PDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENISQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGI 1118
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1154 LYTWDFGDGSPVLIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTAHADIRVFQELHGLTVYLSPSVEQGAPMVVSASVES 1233
Cdd:TIGR00864 1119 HYEWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVNNTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQS 1198
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1234 GDNITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGHLSQSLHVQVFVLEVLHIEPSTCIPTQPSAQLMAHV 1313
Cdd:TIGR00864 1199 GDNITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNIGAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARV 1278
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1314 TGDPVHYLFDWTFGDGSSNVTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHYFTSICVEPEIRNITLQPERQFVKLGDE 1393
Cdd:TIGR00864 1279 SGNAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGTFPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDE 1358
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1394 ARLVAYSWPPFPYRYTWDFGTEDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFVEVQEPVLVTGIRINGSH 1473
Cdd:TIGR00864 1359 AQFQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFIYNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSH 1438
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1474 --VLELQQPYLLSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVKQRVRGLTINA 1551
Cdd:TIGR00864 1439 gnNLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPEILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINA 1518
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1552 SRTVVPLNGSVSFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGL 1631
Cdd:TIGR00864 1519 SLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIFGGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGL 1598
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1632 QVAGGD-----------NGCCFPTNYTLQLQAAVRDGTNISYSWTA--QQEGSLITLFGSGKCFSLTSLKASTYYVHLRA 1698
Cdd:TIGR00864 1599 QILGETaegggggvqelDGCYFETNHTVQFHAGFKDGTNLSFSWNAilDNEPDGPAFAGSGKGAKLNPLEAGPCDIFLQA 1678
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1699 TNMLGSAAANRTIDFVEPVESLILSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLV 1778
Cdd:TIGR00864 1679 ANLLGQATADCTIDFLEPAGNLMLAASDNPAAVNALINLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLV 1758
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1779 RVTAENQLGSVNATVEVAIQVPVGGLSIRTSEP-DSIFVAAGSTLPFWGQLAEGTNVTWCWTLPGGS-KDSQYIAVRFST 1856
Cdd:TIGR00864 1759 TMKAFNELGSANASEEVDVQEPISGLKIRAADAgEQNFFAADSSVCFQGELATGTNVSWCWAIDGGSsKMGKHACMTFPD 1838
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1857 AGSFSLQLNASNAVSWVSAMYNLTVEEPIVNLMLWASSKVVAPGQPVHFEILLAAGSALTFRLQVGGSVPEVL-PSPHFS 1935
Cdd:TIGR00864 1839 AGTFAIRLNASNAVSGKSASREFFAEEPIFGLELKASKKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFS 1918
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1936 HSFFRVGDHLVNVQAENHVSHAQAQVRILVLEAVVGLQVPNCCEPGMATGTEKNFTARVQRGSRVAYAWYFSLQKVQGDS 2015
Cdd:TIGR00864 1919 HSFPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGLQIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDS 1998
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2016 LVILSGRDVTYTPVAAGLLEIHVRAFNELGGVNLTLMVEVQDIIQYVTLQSG--RCFTNRSARFEAATSPSPRRVTYHWD 2093
Cdd:TIGR00864 1999 LVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALMDAALQAGpqDCFTNKMAQFEAATSPKPNFMACHWD 2078
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2094 FGDGTPVQKTEEFWADHYYLRPGDYHVEVNATNLVSFFVAQATVTVQVLACREPEVEVALPLQVLMRRSQRNYLEAHVDL 2173
Cdd:TIGR00864 2079 FGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDL 2158
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2174 RNCVSYQTEYRWEIYRTASCQRPGRMAQMVLPG-------------VDVSRPQLVVPRLALPVGHYCFVFVVSFGDTPLA 2240
Cdd:TIGR00864 2159 KDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLK 2238
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2241 RSIQANVTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVASTQSETGGCVLNFGPRG-SS 2319
Cdd:TIGR00864 2239 KAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGP 2318
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2320 VVTIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGHCHNCSRGYK 2399
Cdd:TIGR00864 2319 TLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFH 2398
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2400 QGCWAARTFSNKTLVLNETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLGGSCRLFPL 2479
Cdd:TIGR00864 2399 RGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPG 2478
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2480 --------------DSVRGLTTKVHFECTGWRDAEDGGAPLVYALLLKRCRQSYCENFCIYKGSLSTYGAVLPPGFQ-PL 2544
Cdd:TIGR00864 2479 getgqehgdkedevWAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAH 2558
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2545 FVVSLAVVVQDQLGAAVVALNRSLTIVLPEPSGNPADLVPWLHSLTASVLPGLLKQADPQHVIEYSLALITVLNEYEQAP 2624
Cdd:TIGR00864 2559 FEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQAL 2638
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2625 D-VSEPNVEQQLRAQMRKNITETLISLRVNTVDDIQQITAALAQCMVSSRELMCRSCLKKMLQKLEGMMRILQAETTEGT 2703
Cdd:TIGR00864 2639 DsAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGI 2718
                         2730      2740
                   ....*....|....*....|..
gi 1907123041 2704 LTPTTIADSILNITGDLIHLAS 2725
Cdd:TIGR00864 2719 VTPTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2169-2612 1.39e-116

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly Swiss:Q26627. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 378.77  E-value: 1.39e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2169 AHVDLRNCVS-YQTEYRWEIYRTASCqrpGRMAQMVLPgVDVSRPQLVVPRLALPVGHYCFVFVVSFGDTP-LARSIQAN 2246
Cdd:pfam02010    1 ASVELNGCFSaYTIDYLWSVFTVSSN---LNLQTISSP-KDLVLPQLTIPSGTLPYGTYVFTLTVSLSSTPsLAGTDIIT 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2247 VTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVAST------QSETGGCV-----LNFGP 2315
Cdd:pfam02010   77 VTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGTLLQ 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2316 RGSSVVTIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGHCHNCS 2395
Cdd:pfam02010  157 STSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCLNCS 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2396 RGYKQGC--WAARTFSNKTLVLN--ETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLG 2471
Cdd:pfam02010  235 SDLSDVTyrWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAPPTG 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2472 GSCRLFPLDSVRgLTTKVHFECTGWRDAEDggaPLVYALLLKRCRQSYCENFCIYKGSLST-YGAVLPPGF-QPLFVVSL 2549
Cdd:pfam02010  315 GTCSVTPTEGTA-LETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQIsISTFLPPGLpANDYQVTV 390
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907123041 2550 AVVVQDQLGAAvVALNRSLTIVLPEPSgnpaDLVPWLHSLTASVLPGLLKQADPQHVIEYSLA 2612
Cdd:pfam02010  391 VVVVYDSLGAA-TSVSLTITVTPPSSS----DELLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3667-3845 6.11e-66

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 223.07  E-value: 6.11e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3667 AFLAITRSDEFWPWMSHVFLPYVHGNQS------------SPELGPPRLRQVRLQEAFC--PDPSSSEHM-CSAAGSLST 3731
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSSClvHDKFVREINeCHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3732 SDYGI----GWQSVVQNGSETWAYSAPDLL-GAWYWGYCAVYDSGGYIQELGLSLEESRARLGFLQLHNWLDSRSRAVFV 3806
Cdd:pfam20519   81 EDRKLysalPYKPVHYGSKYWFIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1907123041 3807 ELTRYSPAVGLHAAVTLRLEFPVAGHALAAFSVRPFALR 3845
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3846-4067 2.64e-59

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 205.20  E-value: 2.64e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3846 RLSTGLSLPLLT-SVCLLLFALYFSMAEVQTWRKDGcACTARpDTW--ARCLLVILTAATGLVRLAQLGIADRQWTHFVq 3922
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHR-PSYLR-SVWnlLDLAIVILSVVLIVLNIYRDFLADRLIKSVE- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3923 DHPRHFTSFDQVAQLGSVARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELMGATLGLVLLGVAYAQMAILLI 4002
Cdd:pfam08016   78 ASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLF 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907123041 4003 SSGADTLYNMARAFLVLCpgarvPTLCPSESWY--------LSPLLCVGLWALRVWGALRLGAILLRWRYHAL 4067
Cdd:pfam08016  158 GTQAPNFSNFVKSILTLF-----RTILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3112-3231 6.86e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 193.65  E-value: 6.86e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3112 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGD--RAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAW 3189
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1907123041 3190 FLQHIIVRDLQSARSTFFLVNDWLSVETEanGGLVEKEVLAA 3231
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3114-3217 1.84e-22

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 95.19  E-value: 1.84e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3114 YEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGDR-AFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3192
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 1907123041 3193 HIIVRDLQSARSTF-FLVNDWLSVET 3217
Cdd:pfam01477   81 SITVEVPGETGGKYtFPCNSWVYGSK 106
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
177-271 3.54e-19

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 85.21  E-value: 3.54e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   177 GEEYVACLPDNSSGAVAAVPFYFAHegPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWC--SSISLS 254
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSVSASDSSQcsTTCSGY 78
                            90
                    ....*....|....*..
gi 1907123041   255 LNSACGGPTLLQHTFPA 271
Cdd:smart00321   79 PAEVCGGPNRLSVYVLA 95
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3112-3214 5.47e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 85.00  E-value: 5.47e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  3112 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLD--GDRAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKglSPAW 3189
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 1907123041  3190 FLQHIIVRDLQSARSTFFLVNDWLS 3214
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
418-530 1.26e-17

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 81.51  E-value: 1.26e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  418 HCYRLVAEKAPWLQAQEQCRTWaGAALAMVDSPAIQHFLVSKVTRSL--DVWIG---------FSSVEGTEGLDPrgeaf 486
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGlndlssegtWKWSDGSPLVDY----- 74
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907123041  487 slescQNWLPGEPHPATAEHCVRL--GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd00037     75 -----TNWAPGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
LRR_8 pfam13855
Leucine rich repeat;
61-127 2.26e-13

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.55  E-value: 2.26e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907123041   61 PSLRIpadataLDLSHNLLQTLDIGLLVNLSALVELDLSNNRISTLEEGVFANLFNLSEINLSGNPF 127
Cdd:pfam13855    1 PNLRS------LDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1483-1770 6.28e-11

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 67.39  E-value: 6.28e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1483 LSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEV-SRSEAQLNITVKQRVRGLTINASRTVVPLNGS 1561
Cdd:COG3291     16 FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLAN 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1562 VSFSTLLEVGSDVHYSWVlcdrCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGLQVAGGDNGCC 1641
Cdd:COG3291     96 TANGGATTVVAGSTVGTG----VATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTTS 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1642 FPTNYTLQLQAAVRDGTNISYSWTAQQEGSLITLFGSGKCFSLTSLKASTYYVHLRATNMLGSAAANRTIDFVEPVESLI 1721
Cdd:COG3291    172 ASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNTV 251
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1907123041 1722 LSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTF 1770
Cdd:COG3291    252 TTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVS 300
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
56-127 1.19e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 64.42  E-value: 1.19e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907123041   56 LQTLGPSLRIpadataLDLSHNLLQTLDIglLVNLSALVELDLSNNRISTLEE--GVFANLFNLSEINLSGNPF 127
Cdd:cd21340    115 LAALSNSLRV------LNISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
3005-3054 9.61e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 56.63  E-value: 9.61e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 1907123041  3005 YTSLCQYFSEEMMMWRTEGIVPLEETSpSQAVCLTRHLTAFGASLFVPPS 3054
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
48-128 5.76e-09

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 5.76e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   48 RVNCSGRWLQTLGPSLripADATAL---DLSHNLLQTLDIGLLvNLSALVELDLSNNRISTLEEgVFANLFNLSEINLSG 124
Cdd:COG4886    140 ELDLSNNQLTDLPEPL---GNLTNLkslDLSNNQLTDLPEELG-NLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSG 214

                   ....
gi 1907123041  125 NPFE 128
Cdd:COG4886    215 NQLT 218
LRRNT smart00013
Leucine rich repeat N-terminal domain;
32-71 4.61e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.08  E-value: 4.61e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 1907123041    32 PCPLPCFCGPapdaaCRVNCSGRWLQTLgpSLRIPADATA 71
Cdd:smart00013    1 ACPAPCNCSG-----TAVDCSGRGLTEV--PLDLPPDTTL 33
myxo_dep_M36 NF038112
myxosortase-dependent M36 family metallopeptidase; Members of this bacterial protein family ...
1553-1881 2.96e-04

myxosortase-dependent M36 family metallopeptidase; Members of this bacterial protein family have an M36 family metallopeptidase domain, like fungalysin (see PF02128), and a C-terminal MYXO-CTERM domain (see TIGR03901), suggesting processing and surface-anchoring by the still-unknown putative transpeptidase, myxosortase. Members of this family include MXAN_3564 (mepA), part of the effector cargo of outer membrane vesicles that the species produces in large numbers during predation on other microbes.


Pssm-ID: 468355 [Multi-domain]  Cd Length: 1597  Bit Score: 47.34  E-value: 2.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1553 RTVVPLNGSVSFStllEVGSDVHYSW--------VLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVgSAQDSIFIYV 1624
Cdd:NF038112  1201 RTTVTLNGSGSFD---PDGDPLTYAWtqvsgpavTLTGADTATPSFTAPEVTADTVLTFQLVVSDGTKT-SAPDTVTVLV 1276
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1625 LQFIEG-LQVAGGDNGCCFPTNYTLQLQAAVRDGTNISYSWTaQQEGSLITLFGSGKC---FSLTSLKASTYYVhLRATN 1700
Cdd:NF038112  1277 RNVNRApVAVAGAPATVDERSTVTLDGSGTDADGDALTYAWT-QTSGPAVTLTGATTAtatFTAPEVTADTQLT-FTLTV 1354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1701 MLGSAAANRTIDFV------EPVESLILSASPNPAAVnMSLTLCAELAGGSGVVYTWYLEEGLSWK-TSMPSTTHTFAAP 1773
Cdd:NF038112  1355 SDGTASATDTVTVTvrnvnrAPVANAGADQTVDERST-VTLSGSATDPDGDALTYAWTQTAGPTVTlTGADTATASFTAP 1433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1774 glhlvRVTAENQLG---------------SVNATVEVAIQVPVGglsirtSEPDSIFVAAGSTLPFWGQL--AEGTNVTW 1836
Cdd:NF038112  1434 -----EVAADTELTfqltvsadgqasadvTVTVTVRNVNRAPVA------HAGESITVDEGSTVTLDASAtdPDGDTLTY 1502
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907123041 1837 CWTLPGGSkdsqyiAVRFSTAGSFSLQLNASNAVSWVSAMYNLTV 1881
Cdd:NF038112  1503 AWTQVAGP------SVTLTGADSAKLTFTAPEVSADTTLTFSLTV 1541
 
Name Accession Description Interval E-value
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
97-2725 0e+00

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 3837.75  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   97 DLSNNRISTLEEGVFANLFNLSEINLSGNPFECNCGLAWLPRWAKEHQVHVVQSEATTCRGPIPLAGQPLLSIPLLDNAC 176
Cdd:TIGR00864    1 DISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEAALCAGPGALAGQPLLGIPLLDSGC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  177 GEEYVACLPDNSSGAVAA----VPFYFAHEGPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWCSSIS 252
Cdd:TIGR00864   81 DEEYVACLKDNSSGGGAArselVIFSAAHEGLFQPEACNAFCFSAGHGLAALGEQGECLCGAAQPSEANFACESLCSGPP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  253 LSLNSACGGPTLLQHTFPASPGATLVGPHGPLASGQPADFHITSSLPISSTRWNFGDGSPEVDMASP----AATHFYVLP 328
Cdd:TIGR00864  161 PPPAAACRGPQLLEHIFPALPGAPIQGPHGPIASGQLAAFHAAAPLAPTAMRWDFGDGSAEVDAAGAggttAASHKYGHP 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  329 GSYHMTVVLALGAGSALLETEVQVEATPTVLELVCPSFVHSNESLELGIRHRGGSALEVTYSILALDKEPAQVVHPLCPL 408
Cdd:TIGR00864  241 GRYHVSAMGALGAGKALAGGDVQVEAAPAALELHCPSLVQADESLDLSIQNRGGSDLDAAWKITAHGEEPAKASHPHCPK 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  409 DTEIFPGNGHCYRLVAEKAPWLQAQEQCRTWAGAALAMVDSPAIQHFLVSKVTRSLD--VWIGFSSVEGTE-GLDPRGEA 485
Cdd:TIGR00864  321 DGEIFEENGHCFQIVPEEAAWLDAQEQCLARAGAALAIVDNDALQNFLARKVTHSLDrgVWIGFSDVNGAEkGPAHQGEA 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  486 FSLESCQNWLPGEPHPATAEHCVRLGPAGQCNTDLCSAPHSYVCELRPGGPVWDTENFVMGMSGGGLSGPLHPLAQQETV 565
Cdd:TIGR00864  401 FEAEECEEGLAGEPHPARAEHCVRLDPRGQCNSDLCNAPHAYVCELNPGGPVPDAENFAMGAASFDLHGLLQALAAMDGL 480
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  566 QGPLR-PVEVMVFPGLSPSREAFLTAAEFSTQKLEEPAQMRLQVYRPSGG--AAAVPEGSSEP-----DNRTEPAPKCVP 637
Cdd:TIGR00864  481 PAPPHeGVEVLLFPALRFSRAAFLSSAEFGTQELRRPAHILFQIYRLRCRlpGAGGPACGPEAecrppDNRSADAPACMK 560
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  638 EELWCPGANVCIPFDASCNSHVCINGSVSRLGLSRAS----YTLWKEFFFSVPAGPPTQYLVTLHSQDVPMLPGDLIGLQ 713
Cdd:TIGR00864  561 GEQWCPFAHICLPLDAPCHPQACANGCSQGHGLPGAArmplYALQREFLFSLPAGPAAHVLLQDHGEDLLMLPGDLIALQ 640
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  714 HDAGPGTLLQCPLASSCPG-QALYLSTNASDWM------------------------TNLPVHLEEAWAG----PVCSLQ 764
Cdd:TIGR00864  641 HDAGPAALIHCQPAPGHPGpRAPVFAANASEWFghnntpvppdnlagdgadplpdpeLDLKALLEGTRASwlecAACAIR 720
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  765 LLLVTERLTPLLGLGPNPGLQHPGHYEVRATVGNSVSRQNLSCSFSVVSPIAGLRVIHPIPLDGHIYVPTNGSVLVLQVD 844
Cdd:TIGR00864  721 LLAAGEQETRLLGAELNAGLPLPGLYELLAESAKGSDLHNASCSFDVLPPLAGLRVIHPAPQDGRLFLESNGSALLLQVD 800
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  845 SGANATATAQWFGGNISAPFEDACPPEVDFLKQD-------CTEEANGTLFSVLMLPRLKEGDHT----VEIVAQNGASQ 913
Cdd:TIGR00864  801 SGANAEAKAFWPGGNSSARFENVCPAEFASRLCHpstfeggCAEEAEDSLFAVLALNWLKEGEHTgpvqVDLMAENNASE 880
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  914 ANLSLRVTAEEPICGLRAVPSPEARVLQGILVRYSPMVEAGSDVAFRWTIDDKQSLTFHNTVFNVIYQSAAIFKLSLwlT 993
Cdd:TIGR00864  881 ANLSLLVQAEEPICGLRAQPHPAARVLMESLVRYSASVEAGSDMTFKWTIDDKPFFTFQNTVFNVIYQHAAVFKLSL--T 958
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  994 ASNHVSNITVNYNVTVERMNKMHGLWVSAVPTVLPPNATLALTGGVLVDSAVEVAFLWNFGDGEQVLRQFKPPYDESFQV 1073
Cdd:TIGR00864  959 AMNHVSNLTEDFNVTVDRLNPMQGLQVKGVPAVLPPGATLALTAGVLIDMAVEAAFLWSFGDGEQALFEFKPPYNESFPC 1038
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1074 PDPTVAQVLVEHNTTHIYTTPGEYNLTVLVSNTYENLTQQVTVSVRTVLPNVAIGMSSNVLVAGQPITFSPYPLPSTDGV 1153
Cdd:TIGR00864 1039 PDPSPAQVLLEHNVMHIYAAPGEYLATVLASNAFENISQQINMSVRAILPRVAIGTEDGLLLAGKPADFEAHPLPSPGGI 1118
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1154 LYTWDFGDGSPVLIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTAHADIRVFQELHGLTVYLSPSVEQGAPMVVSASVES 1233
Cdd:TIGR00864 1119 HYEWDFGDGSALLQGRQPAAAHTFAKRGPFHVCLEVNNTISGAAACADMFAFEEIEGLSADMSLATELGAATTVRAALQS 1198
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1234 GDNITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGHLSQSLHVQVFVLEVLHIEPSTCIPTQPSAQLMAHV 1313
Cdd:TIGR00864 1199 GDNITWTFDMGDGKSLSGPEATVEHKYAKAGNCTVNIGAANAAGHGARIIHVEVFVFEVAGIEPAACIGEHADANFRARV 1278
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1314 TGDPVHYLFDWTFGDGSSNVTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHYFTSICVEPEIRNITLQPERQFVKLGDE 1393
Cdd:TIGR00864 1279 SGNAAHYLFDWSFGDGSPNETHHGCPGISHNFRGNGTFPLALTISSGVNKAHFFTQICVEPELGKISLQAEKQFFALGDE 1358
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1394 ARLVAYSWPPFPYRYTWDFGTEDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFVEVQEPVLVTGIRINGSH 1473
Cdd:TIGR00864 1359 AQFQACAEPEFNYRYEWDFGGEEAAPLPAAGAEVTFIYNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGSH 1438
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1474 --VLELQQPYLLSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVKQRVRGLTINA 1551
Cdd:TIGR00864 1439 gnNLELGQPYLFSAFGRARNASYLWDFGDGGLLEGPEILHAFNSPGDFNIRLAAANEVGKNEATLNVAVKARVRGLTINA 1518
                         1530      1540      1550      1560      1570      1580      1590      1600
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1552 SRTVVPLNGSVSFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGL 1631
Cdd:TIGR00864 1519 SLTNVPLNGSVHFEAHLDAGDDVRFSWILCDHCTPIFGGNTIFYTFRSVGTFNIIVTAENDVGAAQASIFLFVLQEIEGL 1598
                         1610      1620      1630      1640      1650      1660      1670      1680
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1632 QVAGGD-----------NGCCFPTNYTLQLQAAVRDGTNISYSWTA--QQEGSLITLFGSGKCFSLTSLKASTYYVHLRA 1698
Cdd:TIGR00864 1599 QILGETaegggggvqelDGCYFETNHTVQFHAGFKDGTNLSFSWNAilDNEPDGPAFAGSGKGAKLNPLEAGPCDIFLQA 1678
                         1690      1700      1710      1720      1730      1740      1750      1760
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1699 TNMLGSAAANRTIDFVEPVESLILSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLV 1778
Cdd:TIGR00864 1679 ANLLGQATADCTIDFLEPAGNLMLAASDNPAAVNALINLSAELAEGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLV 1758
                         1770      1780      1790      1800      1810      1820      1830      1840
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1779 RVTAENQLGSVNATVEVAIQVPVGGLSIRTSEP-DSIFVAAGSTLPFWGQLAEGTNVTWCWTLPGGS-KDSQYIAVRFST 1856
Cdd:TIGR00864 1759 TMKAFNELGSANASEEVDVQEPISGLKIRAADAgEQNFFAADSSVCFQGELATGTNVSWCWAIDGGSsKMGKHACMTFPD 1838
                         1850      1860      1870      1880      1890      1900      1910      1920
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1857 AGSFSLQLNASNAVSWVSAMYNLTVEEPIVNLMLWASSKVVAPGQPVHFEILLAAGSALTFRLQVGGSVPEVL-PSPHFS 1935
Cdd:TIGR00864 1839 AGTFAIRLNASNAVSGKSASREFFAEEPIFGLELKASKKIAAIGEKVEFQILLAAGSAVNFRLQIGGAAPEVLqPGPRFS 1918
                         1930      1940      1950      1960      1970      1980      1990      2000
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1936 HSFFRVGDHLVNVQAENHVSHAQAQVRILVLEAVVGLQVPNCCEPGMATGTEKNFTARVQRGSRVAYAWYFSLQKVQGDS 2015
Cdd:TIGR00864 1919 HSFPRVDDHMVNLRAKNEVSCAQANLHIEVLEAVRGLQIPDCCAAGIATGEEKNFTANVQRGKPVAFAWTFDLHHLHGDS 1998
                         2010      2020      2030      2040      2050      2060      2070      2080
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2016 LVILSGRDVTYTPVAAGLLEIHVRAFNELGGVNLTLMVEVQDIIQYVTLQSG--RCFTNRSARFEAATSPSPRRVTYHWD 2093
Cdd:TIGR00864 1999 LVIHMGKDVSYTAEAAGLLEIQLGAFNALGAENITLQLEAQDALMDAALQAGpqDCFTNKMAQFEAATSPKPNFMACHWD 2078
                         2090      2100      2110      2120      2130      2140      2150      2160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2094 FGDGTPVQKTEEFWADHYYLRPGDYHVEVNATNLVSFFVAQATVTVQVLACREPEVEVALPLQVLMRRSQRNYLEAHVDL 2173
Cdd:TIGR00864 2079 FGDGSAGQDTDEPRAEHEYLHPGDYRVQVNASNLVSFFSAHAEINVQVLACEEPEVDVVLALQLAIRRSQPNLLEAHVDL 2158
                         2170      2180      2190      2200      2210      2220      2230      2240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2174 RNCVSYQTEYRWEIYRTASCQRPGRMAQMVLPG-------------VDVSRPQLVVPRLALPVGHYCFVFVVSFGDTPLA 2240
Cdd:TIGR00864 2159 KDCLRYGAEYLWEILRAASCDNDGHFARGALNGatrsfpviplpaeVDVQRLQLSLPKLALAAGHYCFVFSLSFEDTPLK 2238
                         2250      2260      2270      2280      2290      2300      2310      2320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2241 RSIQANVTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVASTQSETGGCVLNFGPRG-SS 2319
Cdd:TIGR00864 2239 KAACANLGVAAARLMPIIEGGSYRVWSDTQDLQLDAEESYDPNLDDDDQSLLHFHWACQASSKGEAGCCALNFGLGGkGP 2318
                         2330      2340      2350      2360      2370      2380      2390      2400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2320 VVTIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRSSYVYLEGHCHNCSRGYK 2399
Cdd:TIGR00864 2319 TLGIPGEELAAGIEYTFKLSIGKAGMKEEATNQTVLIQSGHIPIVSLECVSCKAQALYEVSQNSYVYLEGRCLNCQSGFH 2398
                         2410      2420      2430      2440      2450      2460      2470      2480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2400 QGCWAARTFSNKTLVLNETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLGGSCRLFPL 2479
Cdd:TIGR00864 2399 RGRWAARTFQNDTLVLDESSTSTGSAGMNLVLRQGVLHDGEGYNFTLHVLDDSGDEEGAASIRLHHNMPPDGGECHLFPG 2478
                         2490      2500      2510      2520      2530      2540      2550      2560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2480 --------------DSVRGLTTKVHFECTGWRDAEDGGAPLVYALLLKRCRQSYCENFCIYKGSLSTYGAVLPPGFQ-PL 2544
Cdd:TIGR00864 2479 getgqehgdkedevWAIEALLDKVHFECSGWHDAEDAEAPLLYALLLNRCRDDHCEEFCVYKGSLPEHGAFLPPGFRsAH 2558
                         2570      2580      2590      2600      2610      2620      2630      2640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2545 FVVSLAVVVQDQLGAAVVALNRSLTIVLPEPSGNPADLVPWLHSLTASVLPGLLKQADPQHVIEYSLALITVLNEYEQAP 2624
Cdd:TIGR00864 2559 FEVGLAITVEDHLGAAIRALNKSIAITLPDPNGEASGLPHWLHDLIASKLKGLLDQADFQHVIELSLALITVLNEYEQAL 2638
                         2650      2660      2670      2680      2690      2700      2710      2720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2625 D-VSEPNVEQQLRAQMRKNITETLISLRVNTVDDIQQITAALAQCMVSSRELMCRSCLKKMLQKLEGMMRILQAETTEGT 2703
Cdd:TIGR00864 2639 DsAAEPKHERGHRAQIRKNITEALTALDLHTVDDIQQIAAALAQCMAPSREFICEECLKQTLHKLEAMLEILQADTKAGI 2718
                         2730      2740
                   ....*....|....*....|..
gi 1907123041 2704 LTPTTIADSILNITGDLIHLAS 2725
Cdd:TIGR00864 2719 VTPTAIADNILNIMGDLIHLAS 2740
REJ pfam02010
REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor ...
2169-2612 1.39e-116

REJ domain; The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly Swiss:Q26627. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges. This region contains tandem PKD-like domains.


Pssm-ID: 366875 [Multi-domain]  Cd Length: 448  Bit Score: 378.77  E-value: 1.39e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2169 AHVDLRNCVS-YQTEYRWEIYRTASCqrpGRMAQMVLPgVDVSRPQLVVPRLALPVGHYCFVFVVSFGDTP-LARSIQAN 2246
Cdd:pfam02010    1 ASVELNGCFSaYTIDYLWSVFTVSSN---LNLQTISSP-KDLVLPQLTIPSGTLPYGTYVFTLTVSLSSTPsLAGTDIIT 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2247 VTVAAERLVPIIEGGSYRVWSDTQDLVLDGSKSYDPNLEDGDQTPLNFHWACVAST------QSETGGCV-----LNFGP 2315
Cdd:pfam02010   77 VTVQPSPLVAVIDGGSSRVVGYNQDLTLDGSESYDPDVDPGSSSGLTYLWSCRRSSsgdnplLNNDPVCFsdqneGTLLQ 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2316 RGSSVVTIPLERLEAGVEYTFNLIVWKAGRKEEATNQTVLIRSGRVPIVSLECVSCKAQAVYEVSRssYVYLEGHCHNCS 2395
Cdd:pfam02010  157 STSSSLTIPASTLQANVTYTFKLTVSKGSRNSASTTQTILVVDGNPPIIILSCISNCNRKNNPVDR--LVLLASTCLNCS 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2396 RGYKQGC--WAARTFSNKTLVLN--ETTTSTGSTGMNLVVRPGALRDGEGYIFTLTVLGHSGEEEGCASIRLSPNRPPLG 2471
Cdd:pfam02010  235 SDLSDVTyrWLSLGSENTSLVLDqlNSQTSTGRSGPYLVIKAGVLQSGVSYRFTLIVTVYPGLVSGLASISFITNAPPTG 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 2472 GSCRLFPLDSVRgLTTKVHFECTGWRDAEDggaPLVYALLLKRCRQSYCENFCIYKGSLST-YGAVLPPGF-QPLFVVSL 2549
Cdd:pfam02010  315 GTCSVTPTEGTA-LETKFTVTCQGWTDDDL---PLTYQFGDISFREASEEWFLLYEGSSQIsISTFLPPGLpANDYQVTV 390
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907123041 2550 AVVVQDQLGAAvVALNRSLTIVLPEPSgnpaDLVPWLHSLTASVLPGLLKQADPQHVIEYSLA 2612
Cdd:pfam02010  391 VVVVYDSLGAA-TSVSLTITVTPPSSS----DELLYFLLGTTSDLSALLQSGDPQQAAQLILA 448
Polycystin_dom pfam20519
Polycystin domain; This domain represents the polycystin domain from group II of Transient ...
3667-3845 6.11e-66

Polycystin domain; This domain represents the polycystin domain from group II of Transient receptor potential (TRP) channels (TRPP) including PKD1, PKD2, PKD2L and mucolipins. The polycystin domain display a sandwich-like shape with five beta-sheets in the tilted middle layer, three alpha-helices on one side and a large loop with two short antiparallel beta-sheets on the other.


Pssm-ID: 466668  Cd Length: 199  Bit Score: 223.07  E-value: 6.11e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3667 AFLAITRSDEFWPWMSHVFLPYVHGNQS------------SPELGPPRLRQVRLQEAFC--PDPSSSEHM-CSAAGSLST 3731
Cdd:pfam20519    1 GLLTVTDLDDIWDWLSSVLLPALHSNKTpsglpgsfiayeSLLLGVPRLRQLRVRNSSClvHDKFVREINeCHAGYSPPS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3732 SDYGI----GWQSVVQNGSETWAYSAPDLL-GAWYWGYCAVYDSGGYIQELGLSLEESRARLGFLQLHNWLDSRSRAVFV 3806
Cdd:pfam20519   81 EDRKLysalPYKPVHYGSKYWFIYTPPGLLmGYDHWGHLASYPSGGYVVLLPSSREESLKRLAYLQDNNWLDRGTRAVFV 160
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1907123041 3807 ELTRYSPAVGLHAAVTLRLEFPVAGHALAAFSVRPFALR 3845
Cdd:pfam20519  161 DFTLYNADINLFCVVTLRVEFPPTGGVLPSPSVQSVKLL 199
PKD_channel pfam08016
Polycystin cation channel; This family contains the cation channel region from group II of ...
3846-4067 2.64e-59

Polycystin cation channel; This family contains the cation channel region from group II of Transient receptor potential (TRP) channels, the TRPP subfamily, including PKD1, PKD2, PKD2L and mucolipin proteins.


Pssm-ID: 462341 [Multi-domain]  Cd Length: 225  Bit Score: 205.20  E-value: 2.64e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3846 RLSTGLSLPLLT-SVCLLLFALYFSMAEVQTWRKDGcACTARpDTW--ARCLLVILTAATGLVRLAQLGIADRQWTHFVq 3922
Cdd:pfam08016    1 RYVTNRSLFILLcEIVFVVFFLYFVVEEILKIRKHR-PSYLR-SVWnlLDLAIVILSVVLIVLNIYRDFLADRLIKSVE- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3923 DHPRHFTSFDQVAQLGSVARGLAASLLFLLLVKAAQQLRFVRQWSVFGKTLCRALPELMGATLGLVLLGVAYAQMAILLI 4002
Cdd:pfam08016   78 ASPVTFIDFDRVAQLDNLYRIILAFLVFLTWLKLFKVLRFNKTMSLFTKTLSRAWKDLAGFALMFVIFFFAYAQFGYLLF 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907123041 4003 SSGADTLYNMARAFLVLCpgarvPTLCPSESWY--------LSPLLCVGLWALRVWGALRLGAILLRWRYHAL 4067
Cdd:pfam08016  158 GTQAPNFSNFVKSILTLF-----RTILGDFGYNeifsgnrvLGPLLFLTFVFLVIFILLNLFLAIINDSYVEV 225
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
3112-3231 6.86e-57

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 193.65  E-value: 6.86e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3112 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGD--RAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAW 3189
Cdd:cd01752      1 YLYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPekPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSW 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1907123041 3190 FLQHIIVRDLQSARSTFFLVNDWLSVETEanGGLVEKEVLAA 3231
Cdd:cd01752     81 YLSRVIVRDLQTGKKWFFLCNDWLSVEEG--DGTVERTFPVA 120
PLAT_repeat cd01756
PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 ...
3112-3231 2.30e-29

PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 consists of an eight stranded beta-barrel and it's proposed function is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238854  Cd Length: 120  Bit Score: 114.96  E-value: 2.30e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3112 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLD---GDRAFHRNSLDIFQIATPhSLGSVWKIRVWHDNKGLSPA 3188
Cdd:cd01756      1 VTYEVTVKTGDVKGAGTDANVFITLYGENGDTGKRKLKksnNKNKFERGQTDKFTVEAV-DLGKLKKIRIGHDNSGLGAG 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1907123041 3189 WFLQHIIVRDLQSARSTFFLVNDWLSveTEANGGLVEKEVLAA 3231
Cdd:cd01756     80 WFLDKVEIREPGTGDEYTFPCNRWLD--KDEDDGQIVRELYPS 120
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
3114-3217 1.84e-22

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 95.19  E-value: 1.84e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3114 YEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLDGDR-AFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAWFLQ 3192
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESAQLEITLDNpDFERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEWFLK 80
                           90       100
                   ....*....|....*....|....*.
gi 1907123041 3193 HIIVRDLQSARSTF-FLVNDWLSVET 3217
Cdd:pfam01477   81 SITVEVPGETGGKYtFPCNSWVYGSK 106
WSC smart00321
present in yeast cell wall integrity and stress response component proteins; Domain present in ...
177-271 3.54e-19

present in yeast cell wall integrity and stress response component proteins; Domain present in WSC proteins, polycystin and fungal exoglucanase


Pssm-ID: 214616 [Multi-domain]  Cd Length: 95  Bit Score: 85.21  E-value: 3.54e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   177 GEEYVACLPDNSSGAVAAVPFYFAHegPLETEACSAFCFSAGEGLAALSEQNQCLCGAGQASNSSAACSSWC--SSISLS 254
Cdd:smart00321    1 GATYVGCYSDNSSRTLAAVSSYAYH--NMSVEACSNFCFSAGYALAALENGNECYCGDSLPSTSVSASDSSQcsTTCSGY 78
                            90
                    ....*....|....*..
gi 1907123041   255 LNSACGGPTLLQHTFPA 271
Cdd:smart00321   79 PAEVCGGPNRLSVYVLA 95
PLAT cd00113
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. ...
3113-3214 4.02e-19

PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. It consists of an eight stranded beta-barrel. The domain can be found in various domain architectures, in case of lipoxygenases, alpha toxin, lipases and polycystin, but also as a single domain or as repeats.The putative function of this domain is to facilitate access to sequestered membrane or micelle bound substrates.


Pssm-ID: 238061  Cd Length: 116  Bit Score: 85.85  E-value: 4.02e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3113 KYEILVKTGWSRGSGTTAHVGIMLYGED-NRSGHRHLDGDRAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKGLSPAWFL 3191
Cdd:cd00113      2 RYTVTIKTGDKKGAGTDSNISLALYGENgNSSDIPILDGPGSFERGSTDTFQIDLKLDIGDITKVYLRRDGSGLSDGWYC 81
                           90       100
                   ....*....|....*....|...
gi 1907123041 3192 QHIIVRDLQSARSTFFLVNDWLS 3214
Cdd:cd00113     82 ESITVQALGTKKVYTFPVNRWVL 104
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
3112-3214 5.47e-19

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 85.00  E-value: 5.47e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  3112 FKYEILVKTGWSRGSGTTAHVGIMLYGEDNRSGHRHLD--GDRAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKglSPAW 3189
Cdd:smart00308    1 GKYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDylFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEHR--HPEW 78
                            90       100
                    ....*....|....*....|....*
gi 1907123041  3190 FLQHIIVRDLQSARSTFFLVNDWLS 3214
Cdd:smart00308   79 FLKSITVKDLPTGGKYHFPCNSWVY 103
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
418-530 1.26e-17

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 81.51  E-value: 1.26e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  418 HCYRLVAEKAPWLQAQEQCRTWaGAALAMVDSPAIQHFLVSKVTRSL--DVWIG---------FSSVEGTEGLDPrgeaf 486
Cdd:cd00037      1 SCYKFSTEKLTWEEAQEYCRSL-GGHLASIHSEEENDFLASLLKKSSssDVWIGlndlssegtWKWSDGSPLVDY----- 74
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907123041  487 slescQNWLPGEPHPATAEHCVRL--GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd00037     75 -----TNWAPGEPNPGGSEDCVVLssSSDGKWNDVSCSSKLPFICE 115
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1722-1791 1.43e-17

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 79.74  E-value: 1.43e-17
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1722 LSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLVRVTAENQLGSVNA 1791
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1890-1959 3.25e-17

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 78.58  E-value: 3.25e-17
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1890 LWASSKVVAPGQPVHFEILLAAGSALTFRLQVGGSVPEVLPSPHFSHSFFRVGDHLVNVQAENHVSHAQA 1959
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
CLECT smart00034
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function ...
406-530 5.32e-17

C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function as calcium-dependent carbohydrate binding modules.


Pssm-ID: 214480 [Multi-domain]  Cd Length: 124  Bit Score: 79.95  E-value: 5.32e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   406 CPLDTEIFpgNGHCYRLVAEKAPWLQAQEQCRtWAGAALAMVDSPAIQHFL---VSKVTRSLDVWIGFS--SVEGT-EGL 479
Cdd:smart00034    1 CPSGWISY--GGKCYKFSTEKKTWEDAQAFCQ-SLGGHLASIHSEAENDFVaslLKNSGSSDYYWIGLSdpDSNGSwQWS 77
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1907123041   480 DPRGeafsLESCQNWLPGEPhPATAEHCVRLGPAGQC-NTDLCSAPHSYVCE 530
Cdd:smart00034   78 DGSG----PVSYSNWAPGEP-NNSSGDCVVLSTSGGKwNDVSCTSKLPFVCE 124
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1381-1452 1.67e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.66  E-value: 1.67e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907123041 1381 LQPERQFVKLGDEARLVAYSWPPFPYRYTWDFGteDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTND 1452
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFG--DSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1216-1281 2.20e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 76.27  E-value: 2.20e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907123041 1216 SPSVEQGAPMVVSASVESGDNITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGHLSQ 1281
Cdd:pfam00801    5 GTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1127-1198 3.83e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 75.50  E-value: 3.83e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907123041 1127 IGMSSNVLVAGQPITFSPYpLPSTDGVLYTWDFGDgSPVLIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTA 1198
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTAT-LADGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1549-1618 8.35e-16

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 74.73  E-value: 8.35e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1549 INASRTVVPLNGSVSFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQD 1618
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLADGSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
2066-2133 4.13e-15

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 72.80  E-value: 4.13e-15
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907123041 2066 SGRCFTNRSARFEAaTSPSPRRVTYHWDFGDGtPVQKTEEFWADHYYLRPGDYHVEVNATNLVSFFVA 2133
Cdd:pfam00801    5 GTVVAAGQPVTFTA-TLADGSNVTYTWDFGDS-PGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
CLECT_DC-SIGN_like cd03590
C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific ...
406-530 1.47e-13

C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR); CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.


Pssm-ID: 153060 [Multi-domain]  Cd Length: 126  Bit Score: 70.03  E-value: 1.47e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  406 CPLDTEIFpgNGHCYRLVAEKAPWLQAQEQCRTwAGAALAMVDSPAIQHFLVSKVTRSLDVWIGFsSVEGTEG----LDp 481
Cdd:cd03590      1 CPTNWKSF--QSSCYFFSTEKKSWEESRQFCED-MGAHLVIINSQEEQEFISKILSGNRSYWIGL-SDEETEGewkwVD- 75
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907123041  482 rGEAFSLEScQNWLPGEP--HPATAEHCVRL-GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd03590     76 -GTPLNSSK-TFWHPGEPnnWGGGGEDCAELvYDSGGWNDVPCNLEYRWICE 125
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1213-1289 1.93e-13

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 68.29  E-value: 1.93e-13
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907123041 1213 VYLSPSVEQGAPMVVSASVES-GDNITWTFDMGDGTVFTGPEATVQHVYLRAQNFTVTVEAANPAGhLSQSLHVQVFV 1289
Cdd:cd00146      5 VSAPPVAELGASVTFSASDSSgGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVG-SSSTKTTTVVV 81
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1130-1205 2.00e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 68.25  E-value: 2.00e-13
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907123041  1130 SSNVLVAGQPITFSPYPLPSTDGVLYTWDFGDGSpvlIQSQPVLNHTYSMTGAYRITLEVNNTVSSVTAHADIRVF 1205
Cdd:smart00089    7 SPTVGVAGESVTFTATSSDDGSIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
LRR_8 pfam13855
Leucine rich repeat;
61-127 2.26e-13

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.55  E-value: 2.26e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907123041   61 PSLRIpadataLDLSHNLLQTLDIGLLVNLSALVELDLSNNRISTLEEGVFANLFNLSEINLSGNPF 127
Cdd:pfam13855    1 PNLRS------LDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1487-1541 2.36e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 67.86  E-value: 2.36e-13
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*
gi 1907123041  1487 GSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVK 1541
Cdd:smart00089   25 DDGSIVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1210-1288 2.81e-13

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 67.86  E-value: 2.81e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  1210 GLTVYLSPSVEQ-GAPMVVSASVE-SGDNITWTFDMGDGTVFTGPeaTVQHVYLRAQNFTVTVEAANPAGHLSQSLHVQV 1287
Cdd:smart00089    1 VADVSASPTVGVaGESVTFTATSSdDGSIVSYTWDFGDGTSSTGP--TVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78

                    .
gi 1907123041  1288 F 1288
Cdd:smart00089   79 Q 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1638-1707 3.50e-13

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 67.03  E-value: 3.50e-13
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1638 NGCCFPTNYTLQLQAAVRDGTNISYSWTAqqeGSLITLFGSGKCFSLTSLKASTYYVHLRATNMLGSAAA 1707
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLADGSNVTYTWDF---GDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1471-1534 4.74e-13

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 66.64  E-value: 4.74e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907123041 1471 GSHVLELQQPYLLSAMG-SGSPATYLWELGD--GSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEA 1534
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLaDGSNVTYTWDFGDspGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1546-1624 2.10e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 65.17  E-value: 2.10e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  1546 GLTINASRTVVPLNGSVSFS-TLLEVGSDVHYSWVLCDRctPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYV 1624
Cdd:smart00089    1 VADVSASPTVGVAGESVTFTaTSSDDGSIVSYTWDFGDG--TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
277-344 3.86e-12

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 64.33  E-value: 3.86e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123041  277 LVGPHGPLASGQPADFHITS-SLPISSTRWNFGDgSPEVDMASPAATHFYVLPGSYHMTVVLALGAGSA 344
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLaDGSNVTYTWDFGD-SPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSA 68
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
2071-2140 6.38e-12

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 64.01  E-value: 6.38e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  2071 TNRSARFEAATSPSPRRVTYHWDFGDGTPVQKTeefWADHYYLRPGDYHVEVNATNLVSFFVAQATVTVQ 2140
Cdd:smart00089   13 AGESVTFTATSSDDGSIVSYTWDFGDGTSSTGP---TVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1811-1875 1.07e-11

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 62.79  E-value: 1.07e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907123041 1811 PDSIFVAAGSTLPFWGQLAEGTNVTWCWTL---PGGSKDSQYIAVRFSTAGSFSLQLNASNAVSWVSA 1875
Cdd:pfam00801    3 ASGTVVAAGQPVTFTATLADGSNVTYTWDFgdsPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1131-1204 2.77e-11

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 62.13  E-value: 2.77e-11
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907123041 1131 SNVLVAGQPITFSPYPLPSTDGVLYTWDFGDGSpVLIQSQPVLNHTYSMTGAYRITLEV-NNTVSSVTAHADIRV 1204
Cdd:cd00146      8 PPVAELGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVtNAVGSSSTKTTTVVV 81
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1483-1770 6.28e-11

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 67.39  E-value: 6.28e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1483 LSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEV-SRSEAQLNITVKQRVRGLTINASRTVVPLNGS 1561
Cdd:COG3291     16 FTDTSSGNATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLAN 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1562 VSFSTLLEVGSDVHYSWVlcdrCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGLQVAGGDNGCC 1641
Cdd:COG3291     96 TANGGATTVVAGSTVGTG----VATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTTS 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1642 FPTNYTLQLQAAVRDGTNISYSWTAQQEGSLITLFGSGKCFSLTSLKASTYYVHLRATNMLGSAAANRTIDFVEPVESLI 1721
Cdd:COG3291    172 ASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNTV 251
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1907123041 1722 LSASPNPAAVNMSLTLCAELAGGSGVVYTWYLEEGLSWKTSMPSTTHTF 1770
Cdd:COG3291    252 TTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVS 300
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1723-1798 6.64e-11

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 60.93  E-value: 6.64e-11
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907123041  1723 SASPNPAAVNMSLTLCAELAG-GSGVVYTWYLEEGLSWKTsmPSTTHTFAAPGLHLVRVTAENQLGSVNATVEVAIQ 1798
Cdd:smart00089    5 SASPTVGVAGESVTFTATSSDdGSIVSYTWDFGDGTSSTG--PTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
56-127 1.19e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 64.42  E-value: 1.19e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907123041   56 LQTLGPSLRIpadataLDLSHNLLQTLDIglLVNLSALVELDLSNNRISTLEE--GVFANLFNLSEINLSGNPF 127
Cdd:cd21340    115 LAALSNSLRV------LNISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1020-1118 2.21e-10

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 59.39  E-value: 2.21e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  1020 VSAVPTVLPPNATLALTGGVLVDSAVeVAFLWNFGDGeqvlrqfkppydesfqvpdptvaQVLVEHNTTHIYTTPGEYNL 1099
Cdd:smart00089    4 VSASPTVGVAGESVTFTATSSDDGSI-VSYTWDFGDG-----------------------TSSTGPTVTHTYTKPGTYTV 59
                            90
                    ....*....|....*....
gi 1907123041  1100 TVLVSNTYENLTQQVTVSV 1118
Cdd:smart00089   60 TLTVTNAVGSASATVTVVV 78
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1978-2049 2.84e-10

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 58.94  E-value: 2.84e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907123041 1978 CEPGMATGTEKNFTARVQRGSRVAYAWYFslqkvqGDSLV-ILSGRDVTYTPVAAGLLEIHVRAFNELGGVNL 2049
Cdd:pfam00801    4 SGTVVAAGQPVTFTATLADGSNVTYTWDF------GDSPGtSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1547-1624 6.81e-10

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 58.28  E-value: 6.81e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1547 LTINASRTVVPLNG-SVSFS-TLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQ-DSIFIY 1623
Cdd:cd00146      1 PTASVSAPPVAELGaSVTFSaSDSSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVV 80

                   .
gi 1907123041 1624 V 1624
Cdd:cd00146     81 V 81
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
3005-3054 9.61e-10

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 56.63  E-value: 9.61e-10
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|
gi 1907123041  3005 YTSLCQYFSEEMMMWRTEGIVPLEETSpSQAVCLTRHLTAFGASLFVPPS 3054
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNG-THTTCSCNHLTTFAVLMDVPPI 49
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1020-1112 2.17e-09

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 56.24  E-value: 2.17e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1020 VSAVPTVLPPNATLALTGGVLvdSAVEVAFLWNFGDGeqvlrqfkppydesfqvpdptVAQVLVEHNTTHIYTTPGEYNL 1099
Cdd:pfam00801    1 VSASGTVVAAGQPVTFTATLA--DGSNVTYTWDFGDS---------------------PGTSGSGPTVTHTYLSPGTYTV 57
                           90
                   ....*....|...
gi 1907123041 1100 TVLVSNTYENLTQ 1112
Cdd:pfam00801   58 TLTASNAVGSANA 70
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
70-128 4.59e-09

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 4.59e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123041   70 TALDLSHNLLQTLDIGLlVNLSALVELDLSNNRISTLEEgvFANLFNLSEINLSGNPFE 128
Cdd:COG4886    208 EELDLSGNQLTDLPEPL-ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLT 263
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1467-1540 5.00e-09

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 55.58  E-value: 5.00e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123041 1467 IRINGSHVLELQQPYLLSAMGS--GSPATYLWELGDGSQ--SEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQ-LNITV 1540
Cdd:cd00146      3 ASVSAPPVAELGASVTFSASDSsgGSIVSYKWDFGDGEVssSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKtTTVVV 81
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
48-128 5.76e-09

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 5.76e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   48 RVNCSGRWLQTLGPSLripADATAL---DLSHNLLQTLDIGLLvNLSALVELDLSNNRISTLEEgVFANLFNLSEINLSG 124
Cdd:COG4886    140 ELDLSNNQLTDLPEPL---GNLTNLkslDLSNNQLTDLPEELG-NLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSG 214

                   ....
gi 1907123041  125 NPFE 128
Cdd:COG4886    215 NQLT 218
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
2071-2141 1.37e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 54.42  E-value: 1.37e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907123041 2071 TNRSARFEAATSPSPRRVTYHWDFGDGTpVQKTEEFWADHYYLRPGDYHVEVNATNLVSfFVAQATVTVQV 2141
Cdd:cd00146     13 LGASVTFSASDSSGGSIVSYKWDFGDGE-VSSSGEPTVTHTYTKPGTYTVTLTVTNAVG-SSSTKTTTVVV 81
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
72-128 1.73e-08

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 60.33  E-value: 1.73e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907123041   72 LDLSHNLLQTLDIGLlVNLSALVELDLSNNRISTLEEGvFANLFNLSEINLSGNPFE 128
Cdd:COG4886    187 LDLSNNQITDLPEPL-GNLTNLEELDLSGNQLTDLPEP-LANLTNLETLDLSNNQLT 241
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
276-352 1.81e-08

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 54.04  E-value: 1.81e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  276 TLVGPHGPLAS-GQPADFHITSSLP--ISSTRWNFGDGSPEVdMASPAATHFYVLPGSYHMTVVLALGAGSALLET-EVQ 351
Cdd:cd00146      2 TASVSAPPVAElGASVTFSASDSSGgsIVSYKWDFGDGEVSS-SGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKTtTVV 80

                   .
gi 1907123041  352 V 352
Cdd:cd00146     81 V 81
WSC pfam01822
WSC domain; This domain is involved in carbohydrate binding.
180-233 2.26e-08

WSC domain; This domain is involved in carbohydrate binding.


Pssm-ID: 460348  Cd Length: 82  Bit Score: 54.00  E-value: 2.26e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907123041  180 YVACLPDNSSGaVAAVPFYFAHEGPLETEACSAFCFSAGEGLAALSEQNQCLCG 233
Cdd:pfam01822    1 YLGCYSDGTGG-RRLLLGSSGDYDDMTPEKCIAFCSAAGYTYAGLEYGGECYCG 53
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1121-1204 3.43e-08

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 53.43  E-value: 3.43e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1121 VLPNVAIGMSSNVlVAGQPITFSPypLPSTDG----VLYTWDFGDGSPVliqSQPVLNHTYSMTGAYRITLEVNNT--VS 1194
Cdd:pfam18911    2 AAPVADAGGDRIV-AEGETVTFDA--SASDDPdgdiLSYRWDFGDGTTA---TGANVSHTYAAPGTYTVTLTVTDDsgAS 75
                           90
                   ....*....|
gi 1907123041 1195 SVTAHADIRV 1204
Cdd:pfam18911   76 NSTATDTVTV 85
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
286-353 3.84e-08

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 53.22  E-value: 3.84e-08
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041   286 SGQPADFHITSSLP--ISSTRWNFGDGSpevDMASPAATHFYVLPGSYHMTVVLALGAGSALLETEVQVE 353
Cdd:smart00089   13 AGESVTFTATSSDDgsIVSYTWDFGDGT---SSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1487-1540 4.01e-08

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 53.43  E-value: 4.01e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907123041 1487 GSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNE--VSRSEAQLNITV 1540
Cdd:pfam18911   30 PDGDILSYRWDFGDGTTATGANVSHTYAAPGTYTVTLTVTDDsgASNSTATDTVTV 85
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
70-128 6.25e-08

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 58.41  E-value: 6.25e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123041   70 TALDLSHNLLQtlDIGLLVNLSALVELDLSNNRISTLEEgvFANLFNLSEINLSGNPFE 128
Cdd:COG4886    231 ETLDLSNNQLT--DLPELGNLTNLEELDLSNNQLTDLPP--LANLTNLKTLDLSNNQLT 285
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1382-1459 8.55e-08

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 52.07  E-value: 8.55e-08
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123041  1382 QPERQFVKLGDEARLVAYSWP-PFPYRYTWDFGTEDTTHTQTggseVKFIYREPGSYLVIVTVSNNISSTNDSAFVEVQ 1459
Cdd:smart00089    5 SASPTVGVAGESVTFTATSSDdGSIVSYTWDFGDGTSSTGPT----VTHTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1723-1795 1.04e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 52.11  E-value: 1.04e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907123041 1723 SASPNPAAVNMSLTLcaeLAGGSGVVYTWYLEEGLSWKTSMPSTTHTFAAPGLHLVRVTAENQLGSVN---ATVEV 1795
Cdd:cd00146      9 PVAELGASVTFSASD---SSGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSStktTTVVV 81
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1292-1373 1.47e-07

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 51.73  E-value: 1.47e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1292 VLHIEPSTCIPTQPSAQLMAHVTGDPVHYLFDWTFGDGssNVTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHyFTSIC 1371
Cdd:cd00146      2 TASVSAPPVAELGASVTFSASDSSGGSIVSYKWDFGDG--EVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSS-TKTTT 78

                   ..
gi 1907123041 1372 VE 1373
Cdd:cd00146     79 VV 80
CLECT_REG-1_like cd03594
C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and ...
406-530 1.53e-07

C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2); CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is unregulated in pancreatic, gastric, hepatocellular, and prostrate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin, SCA-1 and -2 are found at high concentration in the calcified egg shell layer of goose and ostrich, respectively and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.


Pssm-ID: 153064 [Multi-domain]  Cd Length: 129  Bit Score: 53.14  E-value: 1.53e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  406 CPLDTeiFPGNGHCYRLVAEKAPWLQAQEQCRTW-AGAALAMVDSPAIQHFLVSKV----TRSLDVWIGFSsvegteglD 480
Cdd:cd03594      1 CPKGW--LPYKGNCYGYFRQPLSWSDAELFCQKYgPGAHLASIHSPAEAAAIASLIssyqKAYQPVWIGLH--------D 70
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907123041  481 P---RGEAFSLESCQNW--LPGEPHPATAEHCVRL-GPAG--QCNTDLCSAPHSYVCE 530
Cdd:cd03594     71 PqqsRGWEWSDGSKLDYrsWDRNPPYARGGYCAELsRSTGflKWNDANCEERNPFICK 128
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1292-1373 1.71e-07

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 51.30  E-value: 1.71e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  1292 VLHIEPSTCIPTQP-SAQLMAHVTGDPVHYLFDWTFGDGssnvTVHGHPSVTHNFTRSGIFPLALVLSSHVNKAHYFTSI 1370
Cdd:smart00089    1 VADVSASPTVGVAGeSVTFTATSSDDGSIVSYTWDFGDG----TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76

                    ...
gi 1907123041  1371 CVE 1373
Cdd:smart00089   77 VVQ 79
CLECT_CEL-1_like cd03589
C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and ...
406-531 3.41e-07

C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina; CLECT_CEL-1_like: C-type lectin-like domain (CTLD) of the type found in CEL-1 from Cucumaria echinata and Echinoidin from Anthocidaris crassispina. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CEL-1 CTLD binds three calcium ions and has a high specificity for N-acteylgalactosamine (GalNAc). CEL-1 exhibits strong cytotoxicity which is inhibited by GalNAc. This protein may play a role as a toxin defending against predation. Echinoidin is found in the coelomic fluid of the sea urchin and is specific for GalBeta1-3GalNAc. Echinoidin has a cell adhesive activity towards human cancer cells which is not mediated through the CTLD. Both CEL-1 and Echinoidin are multimeric proteins comprised of multiple dimers linked by disulfide bonds.


Pssm-ID: 153059  Cd Length: 137  Bit Score: 52.36  E-value: 3.41e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  406 CPLDTEIFpgNGHCYRLVAEKAPWLQAQEQCRTWAG----AALAMVDSPAIQHFL------VSKVTRSLDVWIGF--SSV 473
Cdd:cd03589      1 CPTFWTAF--GGYCYRFFGDRLTWEEAELRCRSFSIpgliAHLVSIHSQEENDFVydlfesSRGPDTPYGLWIGLhdRTS 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907123041  474 EGT-EGLDPRGEAFSlescqNWLPGEPHPA-TAEHCV---RLGPAGQCNTDL-CSAPHSYVCEL 531
Cdd:cd03589     79 EGPfEWTDGSPVDFT-----KWAGGQPDNYgGNEDCVqmwRRGDAGQSWNDMpCDAVFPYICKM 137
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
70-128 3.84e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 56.10  E-value: 3.84e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123041   70 TALDLSHNLLQTLDIGLlVNLSALVELDLSNNRISTLEEgVFANLFNLSEINLSGNPFE 128
Cdd:COG4886    116 ESLDLSGNQLTDLPEEL-ANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLT 172
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1970-2056 3.86e-07

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 50.53  E-value: 3.86e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  1970 VGLQVPNCCEPgmATGTEKNFTARVQR-GSRVAYAWYFslqkvqGDSLViLSGRDVTYTPVAAGLLEIHVRAFNELGGVN 2048
Cdd:smart00089    1 VADVSASPTVG--VAGESVTFTATSSDdGSIVSYTWDF------GDGTS-STGPTVTHTYTKPGTYTVTLTVTNAVGSAS 71

                    ....*...
gi 1907123041  2049 LTLMVEVQ 2056
Cdd:smart00089   72 ATVTVVVQ 79
PLAT_plant_stress cd01754
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of ...
3114-3221 5.52e-07

PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of its members are stress induced. In general, PLAT/LH2 consists of an eight stranded beta-barrel and it's proposed function is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238852  Cd Length: 129  Bit Score: 51.39  E-value: 5.52e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 3114 YEILVKTGWSRGSGTTAHVGIMLYGEDNRS---------GHRHLDGDRAFHRNSLDIFQIATPHSLGSVWKIRVWHDNKG 3184
Cdd:cd01754      3 YTIYVQTGSIWKAGTDSRISLQIYDADGPGlrianleawGGLMGAGHDYFERGNLDRFSGRGPCLPSPPCWMNLTSDGTG 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1907123041 3185 LSPAWFLQHIIVRDLQSAR---STFFLVNDWLSVETEANG 3221
Cdd:cd01754     83 NHPGWYVNYVEVTQAGQHApcmQHLFAVEQWLATDESPYM 122
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1021-1277 1.09e-06

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 53.91  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1021 SAVPTVLPPNATLALTGgvlVDSAVEVAFLWNFGDGEqvlrqfkppydesfqvpDPTVAqvlvehNTTHIYTTPGEYNLT 1100
Cdd:COG3291      2 TATPTSGCAPLTVQFTD---TSSGNATSYEWDFGDGT-----------------TSTEA------NPSHTYTTPGTYTVT 55
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1101 VLVSNTY-ENLTQQVTVSVRTVLPNVAIGMSSNVLVAGQPITFSPYPLPSTDGVLYTWDFGDGSPVLIQSQPVLNHTYSM 1179
Cdd:COG3291     56 LTVTDAAgCSDTTTKTITVGAPNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTT 135
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1180 TGayriTLEVNNTVSSVTAHADIRVFQELHGLTVYLSPSVEQGAPMVVSASVESGDNITWTFDMGDGTVFTGPEATVQHV 1259
Cdd:COG3291    136 TG----TDTGLTGSTGTASDTATVTTSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVT 211
                          250
                   ....*....|....*...
gi 1907123041 1260 YLRAQNFTVTVEAANPAG 1277
Cdd:COG3291    212 TGATSGTSGTGSATSGVA 229
LRRCT smart00082
Leucine rich repeat C-terminal domain;
125-177 1.21e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 48.20  E-value: 1.21e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1907123041   125 NPFECNCGLAWLPRWAKEhQVHVVQSEATTCRGPIPLAGqPLLSIPLLDNACG 177
Cdd:smart00082    1 NPFICDCELRWLLRWLQA-NEHLQDPVDLRCASPSSLRG-PLLELLHSEFKCP 51
CLECT_collectin_like cd03591
C-type lectin-like domain (CTLD) of the type found in human collectins including lung ...
420-531 2.27e-06

C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1); CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis. MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway. MBP also acts directly as an opsonin. SP-A and SP-D in addition to functioning as host defense components, are components of pulmonary surfactant which play a role in surfactant homeostasis. Pulmonary surfactant is a phospholipid-protein complex which reduces the surface tension within the lungs. SP-A binds the major surfactant lipid: dipalmitoylphosphatidylcholine (DPPC). SP-D binds two minor components of surfactant that contain sugar moieties: glucosylceramide and phosphatidylinositol (PI). MBP and SP-A, -D monomers are homotrimers with an N-terminal collagen region and three CTLDs. Multiple homotrimeric units associate to form supramolecular complexes. MBL deficiency results in an increased susceptibility to a large number of different infections and to inflammatory disease, such as rheumatoid arthritis.


Pssm-ID: 153061 [Multi-domain]  Cd Length: 114  Bit Score: 49.22  E-value: 2.27e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  420 YRLVAEKAPWLQAQEQCRTwAGAALAMVDSP----AIQHFLVSKVTRsldVWIGFSSVEgTEG----LDPRGEAFSlesc 491
Cdd:cd03591      4 FVTNGEEKNFDDAQKLCSE-AGGTLAMPRNAaenaAIASYVKKGNTY---AFIGITDLE-TEGqfvyLDGGPLTYT---- 74
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1907123041  492 qNWLPGEPHPAT-AEHCVRLGPAGQCNTDLCSAPHSYVCEL 531
Cdd:cd03591     75 -NWKPGEPNNAGgGEDCVEMYTSGKWNDVACNLTRLFVCEF 114
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1388-1458 6.28e-06

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 47.11  E-value: 6.28e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907123041 1388 VKLGDEARLVAYSWPPF----PYRYTWDFGTEDTTHTqtGGSEVKFIYREPGSYLVIVTVSNNISSTN-DSAFVEV 1458
Cdd:cd00146      8 PPVAELGASVTFSASDSsggsIVSYKWDFGDGEVSSS--GEPTVTHTYTKPGTYTVTLTVTNAVGSSStKTTTVVV 81
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1818-1882 1.02e-05

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 46.29  E-value: 1.02e-05
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907123041  1818 AGSTLPFWGQ-LAEGTNVTWCWTLPGGSKDSQYIAV-RFSTAGSFSLQLNASNAVSWVSAMYNLTVE 1882
Cdd:smart00089   13 AGESVTFTATsSDDGSIVSYTWDFGDGTSSTGPTVThTYTKPGTYTVTLTVTNAVGSASATVTVVVQ 79
PKD pfam00801
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ...
1298-1362 1.69e-05

PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.


Pssm-ID: 395646 [Multi-domain]  Cd Length: 70  Bit Score: 45.46  E-value: 1.69e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907123041 1298 STCIPTQPSAQLMAHV-TGDPVHYLfdWTFGDGssNVTVHGHPSVTHNFTRSGIFPLALVLSSHVN 1362
Cdd:pfam00801    5 GTVVAAGQPVTFTATLaDGSNVTYT--WDFGDS--PGTSGSGPTVTHTYLSPGTYTVTLTASNAVG 66
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
55-128 1.71e-05

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 50.70  E-value: 1.71e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907123041   55 WLQTLGPSLRIPADATALDLSHNLLqtldiglLVNLSALVELDLSNNRISTLEEGvFANLFNLSEINLSGNPFE 128
Cdd:COG4886     84 LLLLGLTDLGDLTNLTELDLSGNEE-------LSNLTNLESLDLSGNQLTDLPEE-LANLTNLKELDLSNNQLT 149
PKD cd00146
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ...
1020-1118 4.28e-05

polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.


Pssm-ID: 238084 [Multi-domain]  Cd Length: 81  Bit Score: 44.79  E-value: 4.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1020 VSAVPTVLPPNATLALTGGVLVDSAVeVAFLWNFGDGEQVLRQfkppydesfqvpdptvaqvlvEHNTTHIYTTPGEYNL 1099
Cdd:cd00146      4 SVSAPPVAELGASVTFSASDSSGGSI-VSYKWDFGDGEVSSSG---------------------EPTVTHTYTKPGTYTV 61
                           90       100
                   ....*....|....*....|
gi 1907123041 1100 TVLVSNTY-ENLTQQVTVSV 1118
Cdd:cd00146     62 TLTVTNAVgSSSTKTTTVVV 81
LRRNT smart00013
Leucine rich repeat N-terminal domain;
32-71 4.61e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.08  E-value: 4.61e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 1907123041    32 PCPLPCFCGPapdaaCRVNCSGRWLQTLgpSLRIPADATA 71
Cdd:smart00013    1 ACPAPCNCSG-----TAVDCSGRGLTEV--PLDLPPDTTL 33
CLECT_NK_receptors_like cd03593
C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); ...
417-530 1.59e-04

C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs); CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.


Pssm-ID: 153063  Cd Length: 116  Bit Score: 43.86  E-value: 1.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  417 GHCYRLVAEKAPWLQAQEQCRTwAGAALAMVDSPAIQHFLvSKVTRSLDVWIGFSsVEGTEG----LDprGEAFSlescq 492
Cdd:cd03593     10 NKCYYFSMEKKTWNESKEACSS-KNSSLLKIDDEEELEFL-QSQIGSSSYWIGLS-REKSEKpwkwID--GSPLN----- 79
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1907123041  493 NWLpgEPHPATAE-HCVRLGPAGqCNTDLCSAPHSYVCE 530
Cdd:cd03593     80 NLF--NIRGSTKSgNCAYLSSTG-IYSEDCSTKKRWICE 115
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1723-1878 1.76e-04

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 46.97  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1723 SASPNPAAVNMSLTLCAeLAGGSGVVYTWYLEEGLSwkTSMPSTTHTFAAPGLHLVRVTAENQLGSvNATVEVAIQVPVG 1802
Cdd:COG3291      2 TATPTSGCAPLTVQFTD-TSSGNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGC-SDTTTKTITVGAP 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907123041 1803 GLSIRTSEPDSIFVAAGSTLPFWGQLAEGTNVTWCWTLPGGSKDSQYIAVRFSTAGSFSLQLNASNAVSWVSAMYN 1878
Cdd:COG3291     78 NPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTA 153
CLECT_CSPGs cd03588
C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core ...
417-530 2.11e-04

C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins; CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classification system to be a variety of CTLD, but are omitted from this hierarchical classification based on insignificant sequence similarity.


Pssm-ID: 153058  Cd Length: 124  Bit Score: 44.11  E-value: 2.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  417 GHCYRLVAEKAPWLQAQEQCRTwAGAALAMVDSPAIQHFLVSKVTRSLdvWIGFS--SVEGteglDPR---GEAFSLEsc 491
Cdd:cd03588     10 GHCYRHFPDRETWEDAERRCRE-QQGHLSSIVTPEEQEFVNNNAQDYQ--WIGLNdrTIEG----DFRwsdGHPLQFE-- 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1907123041  492 qNWLPGEPHP--ATAEHCVRL--GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd03588     81 -NWRPNQPDNffATGEDCVVMiwHEEGEWNDVPCNYHLPFTCK 122
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1404-1665 2.29e-04

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 46.59  E-value: 2.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1404 FPYRYTWDFGtEDTTHTQTggsEVKFIYREPGSYLVIVTVSNNI-SSTNDSAFVEVQEPVLVTGIRINGShvlelqqpYL 1482
Cdd:COG3291     23 NATSYEWDFG-DGTTSTEA---NPSHTYTTPGTYTVTLTVTDAAgCSDTTTKTITVGAPNPGVTTVTTST--------TV 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1483 LSAMGSGSPATYLWELGDGSQSEGPEVTHIYSSTGDFTVRVSGWNEVSRSEAQLNITVKQRVRGLTINASRTVVPLNGSV 1562
Cdd:COG3291     91 TTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTSVSTTDVTSDGTT 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1563 SFSTLLEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSAQDSIFIYVLQFIEGLQVAGGDNGCCF 1642
Cdd:COG3291    171 SASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTLTGISTGDAGTPGTNT 250
                          250       260
                   ....*....|....*....|...
gi 1907123041 1643 PTNYTLQLQAAVRDGTNISYSWT 1665
Cdd:COG3291    251 VTTSGANTAGTSTITGGTSGVVT 273
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1227-1289 2.58e-04

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 42.64  E-value: 2.58e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907123041 1227 VSASVES-GDNITWTFDMGDGTVFTGPEATvqHVYLRAQNFTVTVEAANPAGHLSQSLHVQVFV 1289
Cdd:pfam18911   24 ASASDDPdGDILSYRWDFGDGTTATGANVS--HTYAAPGTYTVTLTVTDDSGASNSTATDTVTV 85
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1407-1458 2.93e-04

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 42.26  E-value: 2.93e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907123041 1407 RYTWDFGteDTThtQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFVEV 1458
Cdd:pfam18911   36 SYRWDFG--DGT--TATGANVSHTYAAPGTYTVTLTVTDDSGASNSTATDTV 83
myxo_dep_M36 NF038112
myxosortase-dependent M36 family metallopeptidase; Members of this bacterial protein family ...
1553-1881 2.96e-04

myxosortase-dependent M36 family metallopeptidase; Members of this bacterial protein family have an M36 family metallopeptidase domain, like fungalysin (see PF02128), and a C-terminal MYXO-CTERM domain (see TIGR03901), suggesting processing and surface-anchoring by the still-unknown putative transpeptidase, myxosortase. Members of this family include MXAN_3564 (mepA), part of the effector cargo of outer membrane vesicles that the species produces in large numbers during predation on other microbes.


Pssm-ID: 468355 [Multi-domain]  Cd Length: 1597  Bit Score: 47.34  E-value: 2.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1553 RTVVPLNGSVSFStllEVGSDVHYSW--------VLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVgSAQDSIFIYV 1624
Cdd:NF038112  1201 RTTVTLNGSGSFD---PDGDPLTYAWtqvsgpavTLTGADTATPSFTAPEVTADTVLTFQLVVSDGTKT-SAPDTVTVLV 1276
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1625 LQFIEG-LQVAGGDNGCCFPTNYTLQLQAAVRDGTNISYSWTaQQEGSLITLFGSGKC---FSLTSLKASTYYVhLRATN 1700
Cdd:NF038112  1277 RNVNRApVAVAGAPATVDERSTVTLDGSGTDADGDALTYAWT-QTSGPAVTLTGATTAtatFTAPEVTADTQLT-FTLTV 1354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1701 MLGSAAANRTIDFV------EPVESLILSASPNPAAVnMSLTLCAELAGGSGVVYTWYLEEGLSWK-TSMPSTTHTFAAP 1773
Cdd:NF038112  1355 SDGTASATDTVTVTvrnvnrAPVANAGADQTVDERST-VTLSGSATDPDGDALTYAWTQTAGPTVTlTGADTATASFTAP 1433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1774 glhlvRVTAENQLG---------------SVNATVEVAIQVPVGglsirtSEPDSIFVAAGSTLPFWGQL--AEGTNVTW 1836
Cdd:NF038112  1434 -----EVAADTELTfqltvsadgqasadvTVTVTVRNVNRAPVA------HAGESITVDEGSTVTLDASAtdPDGDTLTY 1502
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907123041 1837 CWTLPGGSkdsqyiAVRFSTAGSFSLQLNASNAVSWVSAMYNLTV 1881
Cdd:NF038112  1503 AWTQVAGP------SVTLTGADSAKLTFTAPEVSADTTLTFSLTV 1541
CLECT_VCBS cd03603
A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein ...
418-501 7.53e-04

A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins; CLECT_VCBS: A bacterial subgroup of the C-type lectin-like (CTLD) domain; a subgroup of bacterial protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces including CaCO3 and ice. Bacterial CTLDs within this group are functionally uncharacterized. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers from which ligand-binding sites project in different orientations. In some CTLDs a loop extends to the adjoining domain to form a loop-swapped dimer.


Pssm-ID: 153073 [Multi-domain]  Cd Length: 118  Bit Score: 42.03  E-value: 7.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  418 HCYRLVAEKAPWLQAQEQCRTwAGAALAMVDSPAIQHFLVSKVTRSLDVWIGFSSVEgTEG--LDPRGEAfslESCQNWL 495
Cdd:cd03603      1 HFYKFVDGGMTWEAAQTLAES-LGGHLVTINSAEENDWLLSNFGGYGASWIGASDAA-TEGtwKWSDGEE---STYTNWG 75

                   ....*.
gi 1907123041  496 PGEPHP 501
Cdd:cd03603     76 SGEPHN 81
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
61-108 1.45e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 39.15  E-value: 1.45e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1907123041   61 PSLRIpadataLDLSHNLLQtlDIGLLVNLSALVELDLS-NNRISTLEE 108
Cdd:pfam12799    1 PNLEV------LDLSNNQIT--DIPPLAKLPNLETLDLSgNNKITDLSD 41
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
271-344 1.49e-03

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 40.33  E-value: 1.49e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907123041  271 ASPGATLVGPHgPLASGQPADFHITSSLP----ISSTRWNFGDGSpEVDMASPaaTHFYVLPGSYHMTVVLALGAGSA 344
Cdd:pfam18911    2 AAPVADAGGDR-IVAEGETVTFDASASDDpdgdILSYRWDFGDGT-TATGANV--SHTYAAPGTYTVTLTVTDDSGAS 75
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
1298-1624 1.59e-03

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 43.89  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1298 STCIPTqpSAQLMAHVTGDPVHYLfdWTFGDGSSNVTvhghPSVTHNFTRSGIFPLALVLSSHVNKAHYFTS-ICVEPEI 1376
Cdd:COG3291      7 SGCAPL--TVQFTDTSSGNATSYE--WDFGDGTTSTE----ANPSHTYTTPGTYTVTLTVTDAAGCSDTTTKtITVGAPN 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1377 RNITLQPERQFVKLGDEARLVAYSWPPFPYRYTWDFGTEDTTHTQTGGSEVKFIYREPGSYLVIVTVSNNISSTNDSAFV 1456
Cdd:COG3291     79 PGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATVTTS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1457 EVQEPVLVTGIRINGSHVLELQQPYLLSAMGSGSPATYLWELGDGSQSEGpevTHIYSSTGDFTVRVSGWNEVSRSEAQL 1536
Cdd:COG3291    159 VSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTA---GVTTGATSGTSGTGSATSGVAVTDVTL 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1537 NITVKQRVRGLTINASRTVVPLNGSVSFSTllEVGSDVHYSWVLCDRCTPIPGGPTISYTFRSVGTFNIIVTAENEVGSA 1616
Cdd:COG3291    236 TGISTGDAGTPGTNTVTTSGANTAGTSTIT--GGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTADVTGGTATLA 313

                   ....*...
gi 1907123041 1617 QDSIFIYV 1624
Cdd:COG3291    314 VSSTLTTN 321
CLECT_TC14_like cd03601
C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 ...
435-530 1.84e-03

C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm; CLECT_TC14_like: C-type lectin-like domain (CTLD) of the type found in lectins TC14, TC14-2, TC14-3, and TC14-4 from the budding tunicate Polyandrocarpa misakiensis and PfG6 from the Acorn worm. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. TC14 is homodimeric. The CTLD of TC14 binds D-galactose and D-fucose. TC14 is expressed constitutively by multipotent epithelial and mesenchymal cells and plays in role during budding, in inducing the aggregation of undifferentiated mesenchymal cells to give rise to epithelial forming tissue. TC14-2 and TC14-3 shows calcium-dependent galactose binding activity. TC14-3 is a cytostatic factor which blocks cell growth and dedifferentiation of the atrial epithelium during asexual reproduction. It may also act as a differentiation inducing factor. Galactose inhibits the cytostatic activity of TC14-3. The gene for Acorn worm PfG6 is gill-specific; PfG6 may be a secreted protein.


Pssm-ID: 153071  Cd Length: 119  Bit Score: 40.98  E-value: 1.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041  435 QCRTWAGAALAMVDSPAIQHFLVSKVTRSLDVWIGFSSVEGTEG--LDPRGEAFSLEScQNWLPGEP-HPATAEHCVRL- 510
Cdd:cd03601     20 RSRGMRLASLAMRDSEMRDAILAFTLVKGHGYWVGADNLQDGEYdfLWNDGVSLPTDS-DLWAPNEPsNPQSRQLCVQLw 98
                           90       100
                   ....*....|....*....|
gi 1907123041  511 GPAGQCNTDLCSAPHSYVCE 530
Cdd:cd03601     99 SKYNLLDDEYCGRAKRVICE 118
LRR smart00370
Leucine-rich repeats, outliers;
90-113 2.02e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 38.10  E-value: 2.02e-03
                            10        20
                    ....*....|....*....|....
gi 1907123041    90 LSALVELDLSNNRISTLEEGVFAN 113
Cdd:smart00370    1 LPNLRELDLSNNQLSSLPPGAFQG 24
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
90-113 2.02e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 38.10  E-value: 2.02e-03
                            10        20
                    ....*....|....*....|....
gi 1907123041    90 LSALVELDLSNNRISTLEEGVFAN 113
Cdd:smart00369    1 LPNLRELDLSNNQLSSLPPGAFQG 24
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
2074-2126 2.11e-03

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 39.95  E-value: 2.11e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907123041 2074 SARFEAATS--PSPRRVTYHWDFGDGTPVQKTEefwADHYYLRPGDYHVEVNATN 2126
Cdd:pfam18911   19 TVTFDASASddPDGDILSYRWDFGDGTTATGAN---VSHTYAAPGTYTVTLTVTD 70
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1892-1965 3.22e-03

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 39.36  E-value: 3.22e-03
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907123041  1892 ASSKVVAPGQPVHFEI-LLAAGSALTFRLQVGGSvpEVLPSPHFSHSFFRVGDHLVNVQAENHVSHAQAQVRILV 1965
Cdd:smart00089    6 ASPTVGVAGESVTFTAtSSDDGSIVSYTWDFGDG--TSSTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTVVV 78
PKD_4 pfam18911
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
1024-1118 3.92e-03

PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.


Pssm-ID: 436824 [Multi-domain]  Cd Length: 85  Bit Score: 39.18  E-value: 3.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123041 1024 PTVLPPNATLALTGGVLVDS-AVEVAFLWNFGDGeqvlrqfkppydesfqvpdpTVAQVLvehNTTHIYTTPGEYNLTVL 1102
Cdd:pfam18911   11 DRIVAEGETVTFDASASDDPdGDILSYRWDFGDG--------------------TTATGA---NVSHTYAAPGTYTVTLT 67
                           90
                   ....*....|....*...
gi 1907123041 1103 VSNTY--ENLTQQVTVSV 1118
Cdd:pfam18911   68 VTDDSgaSNSTATDTVTV 85
PKD smart00089
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ...
1643-1711 4.11e-03

Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.


Pssm-ID: 214510 [Multi-domain]  Cd Length: 79  Bit Score: 38.97  E-value: 4.11e-03
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907123041  1643 PTNYTLQLQAAVR-DGTNISYSWTaqqegslitlFG-----SGKCFSLTSLKASTYYVHLRATNMLGSAAANRTI 1711
Cdd:smart00089   12 VAGESVTFTATSSdDGSIVSYTWD----------FGdgtssTGPTVTHTYTKPGTYTVTLTVTNAVGSASATVTV 76
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
93-126 4.20e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 37.61  E-value: 4.20e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1907123041   93 LVELDLSNNRISTLEEgvFANLFNLSEINLSGNP 126
Cdd:pfam12799    3 LEVLDLSNNQITDIPP--LAKLPNLETLDLSGNN 34
COG3291 COG3291
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
2074-2140 7.93e-03

Uncharacterized conserved protein, PKD repeat domain [Function unknown];


Pssm-ID: 442520 [Multi-domain]  Cd Length: 333  Bit Score: 41.96  E-value: 7.93e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907123041 2074 SARFEAATSPSPrrVTYHWDFGDGT------PVqkteefwadHYYLRPGDYHVEVNATNLV-SFFVAQATVTVQ 2140
Cdd:COG3291     13 TVQFTDTSSGNA--TSYEWDFGDGTtsteanPS---------HTYTTPGTYTVTLTVTDAAgCSDTTTKTITVG 75
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH