Conserved domains on  [gi|922581002|ref|NP_001300071|]

Cadherin EGF LAG seven-pass G-type receptor fmi-1 [Caenorhabditis elegans]

List of domain hits

Name Accession Description Interval E-value
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
327-421 1.93e-32

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 122.42  E-value: 1.93e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  327 PFSVKIPEHSPIGYPVITLKAEDHDRGDNARIVYSI---DSSQFFRIDPSSGDISVSSDLDREDRATFSVIVTASDHASP 403
Cdd:cd11304     1 SYEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIvsgNEDGLFSIDPSTGEITTAKPLDREEQSSYTLTVTATDGGGP 80
                          90
                  ....*....|....*...
gi 922581002  404 PLNTSTQIEVILDDINDN 421
Cdd:cd11304    81 PLSSTATVTITVLDVNDN 98
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
127-218 3.23e-32

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 121.65  E-value: 3.23e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  127 ITLEENVAIGEEVGRVYAIDEDSGPNGIIKYSMEGSED---FIIDEDSGLIKTTKLLDRETTARYSLKVTARDMGTPSLN 203
Cdd:cd11304     4 VSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEdglFSIDPSTGEITTAKPLDREEQSSYTLTVTATDGGGPPLS 83
                          90
                  ....*....|....*
gi 922581002  204 TSTTIAVVLKDINDN 218
Cdd:cd11304    84 STATVTITVLDVNDN 98
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
429-529 2.36e-31

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 118.96  E-value: 2.36e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  429 SYAATISEDIPVGTSFLQVSAIDADIGPNGIVDYFLnesSSSPSIQLFRLDRTSGTLRVSSKLDREQFAVIVLPIFARDR 508
Cdd:cd11304     1 SYEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSI---VSGNEDGLFSIDPSTGEITTAKPLDREEQSSYTLTVTATDG 77
                          90       100
                  ....*....|....*....|.
gi 922581002  509 GTPSLSAASEITLTLSDVNDN 529
Cdd:cd11304    78 GGPPLSSTATVTITVLDVNDN 98
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
20-116 2.25e-28

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 110.48  E-value: 2.25e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   20 SYNVTILENITIPAVIATVKATDEDFGTNGKVHYSMASSSGIGGLTIDYSTGEVTLRERIDA-KNSPITAVIRAKDGAQP 98
Cdd:cd11304     1 SYEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDReEQSSYTLTVTATDGGGP 80
                          90
                  ....*....|....*...
gi 922581002   99 ALSSTVPLTINVIDINDH 116
Cdd:cd11304    81 PLSSTATVTITVLDVNDN 98
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
537-637 1.98e-26

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 105.09  E-value: 1.98e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  537 SYDLYIAENSPVGSTVGTIVARDADEGDNADISFRIFGGADAKLFDIEEDaeqNGVVRilTRAEFDYEAKAnKFFFELQA 616
Cdd:cd11304     1 SYEVSVPENAPPGTVVLTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPS---TGEIT--TAKPLDREEQS-SYTLTVTA 74
                          90       100
                  ....*....|....*....|....
gi 922581002  617 SSG---QLSSTVPVRIHVSDVNDN 637
Cdd:cd11304    75 TDGggpPLSSTATVTITVLDVNDN 98
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
974-1145 2.92e-23

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 97.87  E-value: 2.92e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  974 SVSFDGEGLLNV-NLDLPRTQWTMKFRVSTIAHNGVLVFTGDK-RSDFVEVSVVDRVLKVQFSLGGEKIDAkmenDVENR 1051
Cdd:cd00110     1 GVSFSGSSYVRLpTLPAPRTRLSISFSFRTTSPNGLLLYAGSQnGGDFLALELEDGRLVLRYDLGSGSLVL----SSKTP 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1052 INDGEWHTVALEYSNKQITMSLDDCetnpslLLNTSPNCAIRAKLNLEKKcedptvpcyryldisngLFLGGRPGTSKQ- 1130
Cdd:cd00110    77 LNDGQWHSVSVERNGRSVTLSVDGE------RVVESGSPGGSALLNLDGP-----------------LYLGGLPEDLKSp 133
                         170
                  ....*....|....*...
gi 922581002 1131 ---IEKAFSGCISDLSVD 1145
Cdd:cd00110   134 glpVSPGFVGCIRDLKVN 151
HormR smart00008
Domain present in hormone receptors;
1558-1615 1.39e-12

Domain present in hormone receptors;


Pssm-ID: 214468  Cd Length: 70  Bit Score: 64.84  E-value: 1.39e-12
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 922581002   1558 CPSEIEYSIQWPASQKGSIVRQSCPVGESGL-----ATRKCLETGRWS--DVNAWNCTRPEYSIM 1615
Cdd:smart00008    5 CPATWDGIICWPQTPAGQLVEVPCPKYFSGFsyktgASRNCTENGGWSppFPNYSNCTSNDYEEL 69
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
1811-1859 2.53e-11

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 60.48  E-value: 2.53e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 922581002   1811 KYPECVRFDEKSGTWTARGAALIGLNLTHAACEYNRIGVFTMFVNDQSS 1859
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNGTHTTCSCNHLTTFAVLMDVPPI 49
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
893-928 5.74e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 53.41  E-value: 5.74e-09
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 922581002  893 IDECYRGR-CSNNSTCVAFENTYQCECKPGWIGRHCE 928
Cdd:cd00054     2 IDECASGNpCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1223-1357 9.27e-09

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 56.27  E-value: 9.27e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1223 FEEESFV-LYQPSQVSVPFEVSFEFRTSRADMQVFALEFTQRSVHYNLEVDDGTLK--YNIGDSEVELPAPE-VTSKHWM 1298
Cdd:cd00110     4 FSGSSYVrLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVlrYDLGSGSLVLSSKTpLNDGQWH 83
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 922581002 1299 NVVIKFEADSVATSINGIYSAEAKASISDMNL---ESLYFGIAPGTGHPSR------FEGCIRNVLVD 1357
Cdd:cd00110    84 SVSVERNGRSVTLSVDGERVVESGSPGGSALLnldGPLYLGGLPEDLKSPGlpvspgFVGCIRDLKVN 151
7tm_GPCRs super family cl28897
seven-transmembrane G protein-coupled receptor superfamily; This hierarchical evolutionary ...
1878-2096 1.53e-07

seven-transmembrane G protein-coupled receptor superfamily; This hierarchical evolutionary model represents the seven-transmembrane (7TM) receptors, often referred to as G protein-coupled receptors (GPCRs), which transmit physiological signals from the outside of the cell to the inside via G proteins. GPCRs constitute the largest known superfamily of transmembrane receptors across the three kingdoms of life that respond to a wide variety of extracellular stimuli including peptides, lipids, neurotransmitters, amino acids, hormones, and sensory stimuli such as light, smell and taste. All GPCRs share a common structural architecture comprising seven transmembrane (TM) alpha-helices interconnected by three extracellular and three intracellular loops. A general feature of GPCR signaling is agonist-induced conformational changes in the receptors, leading to activation of the heterotrimeric G proteins, which consist of the guanine nucleotide-binding G-alpha subunit and the dimeric G-beta-gamma subunits. The activated G proteins then bind to and activate numerous downstream effector proteins, which generate second messengers that mediate a broad range of cellular and physiological processes. However, some 7TM receptors, such as the type 1 microbial rhodopsins, do not activate G proteins. Based on sequence similarity, GPCRs can be divided into six major classes: class A (the rhodopsin-like family), class B (the Methuselah-like, adhesion and secretin-like receptor family), class C (the metabotropic glutamate receptor family), class D (the fungal mating pheromone receptors), class E (the cAMP receptor family), and class F (the frizzled/smoothened receptor family). Nearly 800 human GPCR genes have been identified and are involved in essentially all major physiological processes. Approximately 40% of clinically marketed drugs mediate their effects through modulation of GPCR function for the treatment of a variety of human diseases including bacterial infections.


The actual alignment was detected with superfamily member cd15040:

Pssm-ID: 475119 [Multi-domain]  Cd Length: 253  Bit Score: 54.89  E-value: 1.53e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1878 VALFLCFLSILLTLSRRSLKTHSVRIgfILFFAINILNLFFVhkTAINQAY----CPVRNAMLSFTSSAPFAWLFLYGLY 1953
Cdd:cd15040    17 LGLLLTIITYILFRKLRKRKPTKILL--NLCLALLLANLLFL--FGINSTDnpvlCTAVAALLHYFLLASFMWMLVEALL 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1954 IYRML---ADGSSSPSLTTSLLVGIVFPCLISFTTFFV--------TDQCSLSPHLWLFWCIILPIGLFLL--LSFYAAA 2020
Cdd:cd15040    93 LYLRLvkvFGTYPRHFILKYALIGWGLPLIIVIITLAVdpdsygnsSGYCWLSNGNGLYYAFLGPVLLIILvnLVIFVLV 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 2021 TSVLVSLHKKydVFVAKYNVKRAVFQHFILTIFTLGMT-LTGLFAnqlplpmeimEISQSII--YLIAA------LVIFL 2091
Cdd:cd15040   173 LRKLLRLSAK--RNKKKRKKTKAQLRAAVSLFFLLGLTwIFGILA----------IFGARVVfqYLFAIfnslqgFFIFI 240

                  ....*.
gi 922581002 2092 W-CVCD 2096
Cdd:cd15040   241 FhCLRN 246
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1506-1552 1.44e-05

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies.


Pssm-ID: 238012  Cd Length: 50  Bit Score: 44.27  E-value: 1.44e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 922581002 1506 CEC-GFGADSTEC-SADGHCKCNGDAVGRRCDRCSRFDHQLDSKTLKCR 1552
Cdd:cd00055     2 CDCnGHGSLSGQCdPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1415-1449 2.84e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 2.84e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 922581002 1415 CSVANVCS-SGTCVssNTTAGYECICPAGKTGKNCQ 1449
Cdd:cd00054     5 CASGNPCQnGGTCV--NTVGSYRCSCPPGYTGRNCE 38
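
The hit list above is ordered by E-value, so the order of the domains along the protein is not immediately visible. The following is a minimal sketch, assuming the intervals and E-values are transcribed by hand from the list above, that re-sorts the hits from N- to C-terminus and checks whether neighbouring intervals overlap. The `Hit` tuple and the `architecture` and `overlaps` helpers are hypothetical names chosen for illustration; they are not part of any NCBI tool or API.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str        # short domain name from the hit list
    accession: str   # CDD / SMART / superfamily accession
    start: int       # first query residue of the interval
    end: int         # last query residue of the interval
    evalue: float


# Values transcribed from the hit list above (assumption: copied correctly).
HITS = [
    Hit("Cadherin_repeat", "cd11304", 327, 421, 1.93e-32),
    Hit("Cadherin_repeat", "cd11304", 127, 218, 3.23e-32),
    Hit("Cadherin_repeat", "cd11304", 429, 529, 2.36e-31),
    Hit("Cadherin_repeat", "cd11304", 20, 116, 2.25e-28),
    Hit("Cadherin_repeat", "cd11304", 537, 637, 1.98e-26),
    Hit("LamG", "cd00110", 974, 1145, 2.92e-23),
    Hit("HormR", "smart00008", 1558, 1615, 1.39e-12),
    Hit("GPS", "smart00303", 1811, 1859, 2.53e-11),
    Hit("EGF_CA", "cd00054", 893, 928, 5.74e-09),
    Hit("LamG", "cd00110", 1223, 1357, 9.27e-09),
    Hit("7tm_GPCRs", "cl28897", 1878, 2096, 1.53e-07),
    Hit("EGF_Lam", "cd00055", 1506, 1552, 1.44e-05),
    Hit("EGF_CA", "cd00054", 1415, 1449, 2.84e-05),
]


def architecture(hits):
    """Return the hits ordered by start position (N- to C-terminus)."""
    return sorted(hits, key=lambda h: h.start)


def overlaps(hits):
    """Return pairs of positionally adjacent hits whose intervals overlap."""
    ordered = architecture(hits)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b.start <= a.end]


if __name__ == "__main__":
    for h in architecture(HITS):
        print(f"{h.start:>5}-{h.end:<5} {h.name} ({h.accession})")
```

Read in this order, the hits give five cadherin repeats, then an EGF_CA domain, two LamG domains separated from a second EGF_CA, an EGF_Lam domain, HormR, GPS, and finally the 7tm region, which is the arrangement the protein name (cadherin EGF LAG seven-pass G-type receptor) suggests.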
 
Name Accession Description Interval E-value
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
144-220 2.10e-26

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 104.35  E-value: 2.10e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002    144 AIDEDSGPNGIIKYSMEGSED---FIIDEDSGLIKTTKLLDRETTARYSLKVTARDMGTPSLNTSTTIAVVLKDINDNAP 220
Cdd:smart00112    2 ATDADSGENGKVTYSILSGNDdglFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTITVLDVNDNAP 81
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
346-423 4.20e-24

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 97.81  E-value: 4.20e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002    346 KAEDHDRGDNARIVYSIDSS---QFFRIDPSSGDISVSSDLDREDRATFSVIVTASDHASPPLNTSTQIEVILDDINDNS 422
Cdd:smart00112    1 SATDADSGENGKVTYSILSGnddGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTITVLDVNDNA 80

                    .
gi 922581002    423 P 423
Cdd:smart00112   81 P 81
Cadherin pfam00028
Cadherin domain;
328-415 1.64e-23

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 96.60  E-value: 1.64e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   328 FSVKIPEHSPIGYPVITLKAEDHDRGDNARIVYSI---DSSQFFRIDPSSGDISVSSDLDREDRATFSVIVTASDHASPP 404
Cdd:pfam00028    1 YSASVPENAPVGTEVLTVTATDPDLGPNGRIFYSIlggGPGGNFRIDPDTGDISTTKPLDRESIGEYELTVEATDSGGPP 80
                           90
                   ....*....|.
gi 922581002   405 LNTSTQIEVIL 415
Cdd:pfam00028   81 LSSTATVTITV 91
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
974-1145 2.92e-23

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 97.87  E-value: 2.92e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  974 SVSFDGEGLLNV-NLDLPRTQWTMKFRVSTIAHNGVLVFTGDK-RSDFVEVSVVDRVLKVQFSLGGEKIDAkmenDVENR 1051
Cdd:cd00110     1 GVSFSGSSYVRLpTLPAPRTRLSISFSFRTTSPNGLLLYAGSQnGGDFLALELEDGRLVLRYDLGSGSLVL----SSKTP 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1052 INDGEWHTVALEYSNKQITMSLDDCetnpslLLNTSPNCAIRAKLNLEKKcedptvpcyryldisngLFLGGRPGTSKQ- 1130
Cdd:cd00110    77 LNDGQWHSVSVERNGRSVTLSVDGE------RVVESGSPGGSALLNLDGP-----------------LYLGGLPEDLKSp 133
                         170
                  ....*....|....*...
gi 922581002 1131 ---IEKAFSGCISDLSVD 1145
Cdd:cd00110   134 glpVSPGFVGCIRDLKVN 151
LamG smart00282
Laminin G domain;
995-1147 5.06e-23

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 96.64  E-value: 5.06e-23
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002    995 TMKFRVSTIAHNGVLVFTGDK-RSDFVEVSVVDRVLKVQFSLGGEKIDAKMENdveNRINDGEWHTVALEYSNKQITMSL 1073
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKgGGDYLALELRDGRLVLRYDLGSGPARLTSDP---TPLNDGQWHRVAVERNGRSVTLSV 77
                            90       100       110       120       130       140       150
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 922581002   1074 DDCEtnpsLLLNTSPncairaklnlekkcedptvPCYRYLDISNGLFLGGRPGTSKQ----IEKAFSGCISDLSVDKE 1147
Cdd:smart00282   78 DGGN----RVSGESP-------------------GGLTILNLDGPLYLGGLPEDLKLpplpVTPGFRGCIRNLKVNGK 132
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
448-531 1.84e-21

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 90.49  E-value: 1.84e-21
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002    448 SAIDADIGPNGIVDYFLnesSSSPSIQLFRLDRTSGTLRVSSKLDREQFAVIVLPIFARDRGTPSLSAASEITLTLSDVN 527
Cdd:smart00112    1 SATDADSGENGKVTYSI---LSGNDDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTITVLDVN 77

                    ....
gi 922581002    528 DNAP 531
Cdd:smart00112   78 DNAP 81
Cadherin pfam00028
Cadherin domain;
127-212 2.41e-21

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 90.44  E-value: 2.41e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   127 ITLEENVAIGEEVGRVYAIDEDSGPNGIIKYSM-EGSED--FIIDEDSGLIKTTKLLDRETTARYSLKVTARDMGTPSLN 203
Cdd:pfam00028    3 ASVPENAPVGTEVLTVTATDPDLGPNGRIFYSIlGGGPGgnFRIDPDTGDISTTKPLDRESIGEYELTVEATDSGGPPLS 82

                   ....*....
gi 922581002   204 TSTTIAVVL 212
Cdd:pfam00028   83 STATVTITV 91
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
39-118 2.81e-20

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 87.02  E-value: 2.81e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002     39 KATDEDFGTNGKVHYSMASSSGIGGLTIDYSTGEVTLRERIDAKNSPI-TAVIRAKDGAQPALSSTVPLTINVIDINDHA 117
Cdd:smart00112    1 SATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREEQPEyTLTVEATDGGGPPLSSTATVTITVLDVNDNA 80

                    .
gi 922581002    118 P 118
Cdd:smart00112   81 P 81
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1006-1147 1.06e-19

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 87.09  E-value: 1.06e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1006 NGVLVFTGDKRSDFVEVSVVDRVLKVQFSLGGEKIDAKMENdveNRINDGEWHTVALEYSNKQITMSLDDCETNPSLLLN 1085
Cdd:pfam02210    7 NGLLLYAGGGGSDFLALELVNGRLVLRYDLGSGPESLLSSG---KNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPPG 83
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 922581002  1086 TSPNcairaklnlekkcedptvpcyryLDISNGLFLGG----RPGTSKQIEKAFSGCISDLSVDKE 1147
Cdd:pfam02210   84 ESLL-----------------------LNLNGPLYLGGlpplLLLPALPVRAGFVGCIRDVRVNGE 126
Cadherin pfam00028
Cadherin domain;
430-523 3.68e-19

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 84.27  E-value: 3.68e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   430 YAATISEDIPVGTSFLQVSAIDADIGPNGIVDYFLnesSSSPSIQLFRLDRTSGTLRVSSKLDREQFAVIVLPIFARDRG 509
Cdd:pfam00028    1 YSASVPENAPVGTEVLTVTATDPDLGPNGRIFYSI---LGGGPGGNFRIDPDTGDISTTKPLDRESIGEYELTVEATDSG 77
                           90
                   ....*....|....
gi 922581002   510 TPSLSAASEITLTL 523
Cdd:pfam00028   78 GPPLSSTATVTITV 91
Cadherin pfam00028
Cadherin domain;
21-111 1.38e-18

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 82.35  E-value: 1.38e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002    21 YNVTILENITIPAVIATVKATDEDFGTNGKVHYSMASSSGIGGLTIDYSTGEVTLRERIDAKNSPI-TAVIRAKDGAQPA 99
Cdd:pfam00028    1 YSASVPENAPVGTEVLTVTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLDRESIGEyELTVEATDSGGPP 80
                           90
                   ....*....|..
gi 922581002   100 LSSTVPLTINVI 111
Cdd:pfam00028   81 LSSTATVTITVL 92
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
556-639 1.00e-16

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 77.00  E-value: 1.00e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002    556 VARDADEGDNADISFRIFGGADAKLFDIEEDaeqNGVvrILTRAEFDYEAKANkFFFELQASSG---QLSSTVPVRIHVS 632
Cdd:smart00112    1 SATDADSGENGKVTYSILSGNDDGLFSIDPE---TGE--ITTTKPLDREEQPE-YTLTVEATDGggpPLSSTATVTITVL 74

                    ....*..
gi 922581002    633 DVNDNKP 639
Cdd:smart00112   75 DVNDNAP 81
Cadherin pfam00028
Cadherin domain;
538-631 3.58e-13

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 66.94  E-value: 3.58e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   538 YDLYIAENSPVGSTVGTIVARDADEGDNADISFRIFGGADAKLFDIEEDaeqNGVVRilTRAEFDYEAKaNKFFFELQAS 617
Cdd:pfam00028    1 YSASVPENAPVGTEVLTVTATDPDLGPNGRIFYSILGGGPGGNFRIDPD---TGDIS--TTKPLDRESI-GEYELTVEAT 74
                           90
                   ....*....|....*..
gi 922581002   618 SGQ---LSSTVPVRIHV 631
Cdd:pfam00028   75 DSGgppLSSTATVTITV 91
HormR smart00008
Domain present in hormone receptors;
1558-1615 1.39e-12

Domain present in hormone receptors;


Pssm-ID: 214468  Cd Length: 70  Bit Score: 64.84  E-value: 1.39e-12
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 922581002   1558 CPSEIEYSIQWPASQKGSIVRQSCPVGESGL-----ATRKCLETGRWS--DVNAWNCTRPEYSIM 1615
Cdd:smart00008    5 CPATWDGIICWPQTPAGQLVEVPCPKYFSGFsyktgASRNCTENGGWSppFPNYSNCTSNDYEEL 69
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
1811-1859 2.53e-11

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 60.48  E-value: 2.53e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 922581002   1811 KYPECVRFDEKSGTWTARGAALIGLNLTHAACEYNRIGVFTMFVNDQSS 1859
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNGTHTTCSCNHLTTFAVLMDVPPI 49
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
893-928 5.74e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 53.41  E-value: 5.74e-09
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 922581002  893 IDECYRGR-CSNNSTCVAFENTYQCECKPGWIGRHCE 928
Cdd:cd00054     2 IDECASGNpCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1223-1357 9.27e-09

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 56.27  E-value: 9.27e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1223 FEEESFV-LYQPSQVSVPFEVSFEFRTSRADMQVFALEFTQRSVHYNLEVDDGTLK--YNIGDSEVELPAPE-VTSKHWM 1298
Cdd:cd00110     4 FSGSSYVrLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVlrYDLGSGSLVLSSKTpLNDGQWH 83
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 922581002 1299 NVVIKFEADSVATSINGIYSAEAKASISDMNL---ESLYFGIAPGTGHPSR------FEGCIRNVLVD 1357
Cdd:cd00110    84 SVSVERNGRSVTLSVDGERVVESGSPGGSALLnldGPLYLGGLPEDLKSPGlpvspgFVGCIRDLKVN 151
LamG smart00282
Laminin G domain;
1241-1359 1.50e-08

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 55.04  E-value: 1.50e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   1241 EVSFEFRTSRADMQVFALEFTQRSVHYNLEVDDGTLK--YNIGDSEVELPAPEVTSK--HWMNVVIKFEADSVATSINGI 1316
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVlrYDLGSGPARLTSDPTPLNdgQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|..
gi 922581002   1317 YSAEAKASIS--DMNLES-LYFGIAPGTGHPSR------FEGCIRNVLVDGR 1359
Cdd:smart00282   81 NRVSGESPGGltILNLDGpLYLGGLPEDLKLPPlpvtpgFRGCIRNLKVNGK 132
EGF_CA smart00179
Calcium-binding EGF-like domain;
893-928 1.36e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 49.55  E-value: 1.36e-07
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 922581002    893 IDECY-RGRCSNNSTCVAFENTYQCECKPGWI-GRHCE 928
Cdd:smart00179    2 IDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
7tmB2_Adhesion cd15040
adhesion receptors, subfamily B2 of the class B family of seven-transmembrane G ...
1878-2096 1.53e-07

adhesion receptors, subfamily B2 of the class B family of seven-transmembrane G protein-coupled receptors; The B2 subfamily of class B GPCRs consists of cell-adhesion receptors with 33 members in humans and vertebrates. The adhesion receptors are characterized by the presence of large N-terminal extracellular domains containing a variety of structural motifs, which play critical roles in cell-cell adhesion and cell-matrix interactions, linked to a class B seven-transmembrane domain. These include, for example, EGF (epidermal growth factor)-like domains in CD97, Celsr1 (cadherin family member), Celsr2, Celsr3, EMR1 (EGF-module-containing mucin-like hormone receptor-like 1), EMR2, EMR3, and Flamingo; two laminin A G-type repeats and nine cadherin domains in Flamingo and its human orthologs Celsr1, Celsr2 and Celsr3; olfactomedin-like domains in the latrotoxin receptors; and five or four thrombospondin type 1 repeats in BAI1 (brain-specific angiogenesis inhibitor 1), BAI2 and BAI3. Furthermore, almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR-autoproteolysis inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions.


Pssm-ID: 320168 [Multi-domain]  Cd Length: 253  Bit Score: 54.89  E-value: 1.53e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1878 VALFLCFLSILLTLSRRSLKTHSVRIgfILFFAINILNLFFVhkTAINQAY----CPVRNAMLSFTSSAPFAWLFLYGLY 1953
Cdd:cd15040    17 LGLLLTIITYILFRKLRKRKPTKILL--NLCLALLLANLLFL--FGINSTDnpvlCTAVAALLHYFLLASFMWMLVEALL 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1954 IYRML---ADGSSSPSLTTSLLVGIVFPCLISFTTFFV--------TDQCSLSPHLWLFWCIILPIGLFLL--LSFYAAA 2020
Cdd:cd15040    93 LYLRLvkvFGTYPRHFILKYALIGWGLPLIIVIITLAVdpdsygnsSGYCWLSNGNGLYYAFLGPVLLIILvnLVIFVLV 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 2021 TSVLVSLHKKydVFVAKYNVKRAVFQHFILTIFTLGMT-LTGLFAnqlplpmeimEISQSII--YLIAA------LVIFL 2091
Cdd:cd15040   173 LRKLLRLSAK--RNKKKRKKTKAQLRAAVSLFFLLGLTwIFGILA----------IFGARVVfqYLFAIfnslqgFFIFI 240

                  ....*.
gi 922581002 2092 W-CVCD 2096
Cdd:cd15040   241 FhCLRN 246
HRM pfam02793
Hormone receptor domain; This extracellular domain contains four conserved cysteines that ...
1558-1610 2.51e-06

Hormone receptor domain; This extracellular domain contains four conserved cysteines that probably form disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain.


Pssm-ID: 397086  Cd Length: 64  Bit Score: 46.59  E-value: 2.51e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 922581002  1558 CPSEIEYSIQWPASQKGSIVRQSCP-----VGESGLATRKCLETGRWSDV---NAWNCTRP 1610
Cdd:pfam02793    4 CPRTWDGILCWPRTPAGETVEVPCPdyfsgFDPRGNASRNCTEDGTWSEHppsNYSNCTSN 64
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
896-926 8.30e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminal regions of the Ca2+ binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 44.30  E-value: 8.30e-06
                           10        20        30
                   ....*....|....*....|....*....|.
gi 922581002   896 CYRGRCSNNSTCVAFENTYQCECKPGWIGRH 926
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1506-1552 1.44e-05

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 44.27  E-value: 1.44e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 922581002 1506 CEC-GFGADSTEC-SADGHCKCNGDAVGRRCDRCSRFDHQLDSKTLKCR 1552
Cdd:cd00055     2 CDCnGHGSLSGQCdPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1415-1449 2.84e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 2.84e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 922581002 1415 CSVANVCS-SGTCVssNTTAGYECICPAGKTGKNCQ 1449
Cdd:cd00054     5 CASGNPCQnGGTCV--NTVGSYRCSCPPGYTGRNCE 38
7tm_2 pfam00002
7 transmembrane receptor (Secretin family); This family is known as Family B, the ...
1878-2064 3.91e-05

7 transmembrane receptor (Secretin family); This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GPCRs). They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognized. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97, calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.


Pssm-ID: 459625 [Multi-domain]  Cd Length: 248  Bit Score: 47.66  E-value: 3.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1878 VALFLCFLsILLTLsrRSLktHSVRIG-----FILFFAINILnlFFVHKTAI---------NQAYCPVRNAMLSFTSSAP 1943
Cdd:pfam00002   17 VALLLAIA-IFLLF--RKL--HCTRNYihlnlFASFILRALL--FLVGDAVLfnkqdldhcSWVGCKVVAVFLHYFFLAN 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1944 FAWLFLYGLYIYRMLADGSSSPSLTTSLLVGI------VFPCLISFTT---FFVTDQCSLSPHLWLFWCIILPIGLFLLL 2014
Cdd:pfam00002   90 FFWMLVEGLYLYTLLVEVFFSERKYFWWYLLIgwgvpaLVVGIWAGVDpkgYGEDDGCWLSNENGLWWIIRGPILLIILV 169
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 922581002  2015 SFYAAATSVLVSLHKK---YDVFVAKYNVKRAVFQHFILtIFTLGMT-LTGLFA 2064
Cdd:pfam00002  170 NFIIFINIVRILVQKLretNMGKSDLKQYRRLAKSTLLL-LPLLGITwVFGLFA 222
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1246-1359 1.46e-04

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 43.56  E-value: 1.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1246 FRTSRADMQVFALEfTQRSVHYNLEVDDGTLK--YNIGDSEVELPAPE--VTSKHWMNVVIKFEADSVATSINGIYSAEA 1321
Cdd:pfam02210    1 FRTRQPNGLLLYAG-GGGSDFLALELVNGRLVlrYDLGSGPESLLSSGknLNDGQWHSVRVERNGNTLTLSVDGQTVVSS 79
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 922581002  1322 KAS--ISDMNLES-LYFGiapGTGHPSR---------FEGCIRNVLVDGR 1359
Cdd:pfam02210   80 LPPgeSLLLNLNGpLYLG---GLPPLLLlpalpvragFVGCIRDVRVNGE 126
EGF smart00181
Epidermal growth factor-like domain;
1415-1449 9.00e-04

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 38.65  E-value: 9.00e-04
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 922581002   1415 CSVANVCSSGTCVssNTTAGYECICPAGKTG-KNCQ 1449
Cdd:smart00181    2 CASGGPCSNGTCI--NTPGSYTCSCPPGYTGdKRCE 35
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domain;
1506-1537 7.58e-03

Laminin-type epidermal growth factor-like domain;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 36.52  E-value: 7.58e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 922581002   1506 CEC-GFGADSTECSAD-GHCKCNGDAVGRRCDRC 1537
Cdd:smart00180    1 CDCdPGGSASGTCDPDtGQCECKPNVTGRRCDRC 34
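
Each hit in the list above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally tagged "[Multi-domain]". For readers who want to tabulate these hits, the following is a minimal Python sketch that reads one such line. The names HitSummary and parse_summary are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool, and the pattern assumes the field order shown on this page.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the summary lines on this page:
# Pssm-ID: <int> [Multi-domain]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<length>\d+)"
    r"\s+Bit Score:\s*(?P<score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed summary, or None if the line is not a summary line."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m.group("pssm")),
        multi_domain=m.group("multi") is not None,
        cd_length=int(m.group("length")),
        bit_score=float(m.group("score")),
        e_value=float(m.group("evalue")),
    )


if __name__ == "__main__":
    # Summary line of the EGF_Lam smart00180 hit listed above.
    print(parse_summary(
        "Pssm-ID: 214543  Cd Length: 46  Bit Score: 36.52  E-value: 7.58e-03"
    ))
```

Lines that do not match the assumed layout, such as alignment rows or description text, yield None, so the function can be applied to every line of the report and the None results filtered out.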
 
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
39-118 2.81e-20

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 87.02  E-value: 2.81e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002     39 KATDEDFGTNGKVHYSMASSSGIGGLTIDYSTGEVTLRERIDAKNSPI-TAVIRAKDGAQPALSSTVPLTINVIDINDHA 117
Cdd:smart00112    1 SATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREEQPEyTLTVEATDGGGPPLSSTATVTITVLDVNDNA 80

                    .
gi 922581002    118 P 118
Cdd:smart00112   81 P 81
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1006-1147 1.06e-19

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 87.09  E-value: 1.06e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1006 NGVLVFTGDKRSDFVEVSVVDRVLKVQFSLGGEKIDAKMENdveNRINDGEWHTVALEYSNKQITMSLDDCETNPSLLLN 1085
Cdd:pfam02210    7 NGLLLYAGGGGSDFLALELVNGRLVLRYDLGSGPESLLSSG---KNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPPG 83
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 922581002  1086 TSPNcairaklnlekkcedptvpcyryLDISNGLFLGG----RPGTSKQIEKAFSGCISDLSVDKE 1147
Cdd:pfam02210   84 ESLL-----------------------LNLNGPLYLGGlpplLLLPALPVRAGFVGCIRDVRVNGE 126
Cadherin pfam00028
Cadherin domain;
430-523 3.68e-19

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 84.27  E-value: 3.68e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   430 YAATISEDIPVGTSFLQVSAIDADIGPNGIVDYFLnesSSSPSIQLFRLDRTSGTLRVSSKLDREQFAVIVLPIFARDRG 509
Cdd:pfam00028    1 YSASVPENAPVGTEVLTVTATDPDLGPNGRIFYSI---LGGGPGGNFRIDPDTGDISTTKPLDRESIGEYELTVEATDSG 77
                           90
                   ....*....|....
gi 922581002   510 TPSLSAASEITLTL 523
Cdd:pfam00028   78 GPPLSSTATVTITV 91
Cadherin pfam00028
Cadherin domain;
21-111 1.38e-18

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 82.35  E-value: 1.38e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002    21 YNVTILENITIPAVIATVKATDEDFGTNGKVHYSMASSSGIGGLTIDYSTGEVTLRERIDAKNSPI-TAVIRAKDGAQPA 99
Cdd:pfam00028    1 YSASVPENAPVGTEVLTVTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLDRESIGEyELTVEATDSGGPP 80
                           90
                   ....*....|..
gi 922581002   100 LSSTVPLTINVI 111
Cdd:pfam00028   81 LSSTATVTITVL 92
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
556-639 1.00e-16

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 77.00  E-value: 1.00e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002    556 VARDADEGDNADISFRIFGGADAKLFDIEEDaeqNGVvrILTRAEFDYEAKANkFFFELQASSG---QLSSTVPVRIHVS 632
Cdd:smart00112    1 SATDADSGENGKVTYSILSGNDDGLFSIDPE---TGE--ITTTKPLDREEQPE-YTLTVEATDGggpPLSSTATVTITVL 74

                    ....*..
gi 922581002    633 DVNDNKP 639
Cdd:smart00112   75 DVNDNAP 81
Cadherin pfam00028
Cadherin domain;
538-631 3.58e-13

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 66.94  E-value: 3.58e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   538 YDLYIAENSPVGSTVGTIVARDADEGDNADISFRIFGGADAKLFDIEEDaeqNGVVRilTRAEFDYEAKaNKFFFELQAS 617
Cdd:pfam00028    1 YSASVPENAPVGTEVLTVTATDPDLGPNGRIFYSILGGGPGGNFRIDPD---TGDIS--TTKPLDRESI-GEYELTVEAT 74
                           90
                   ....*....|....*..
gi 922581002   618 SGQ---LSSTVPVRIHV 631
Cdd:pfam00028   75 DSGgppLSSTATVTITV 91
HormR smart00008
Domain present in hormone receptors;
1558-1615 1.39e-12

Domain present in hormone receptors;


Pssm-ID: 214468  Cd Length: 70  Bit Score: 64.84  E-value: 1.39e-12
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 922581002   1558 CPSEIEYSIQWPASQKGSIVRQSCPVGESGL-----ATRKCLETGRWS--DVNAWNCTRPEYSIM 1615
Cdd:smart00008    5 CPATWDGIICWPQTPAGQLVEVPCPKYFSGFsyktgASRNCTENGGWSppFPNYSNCTSNDYEEL 69
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
1811-1859 2.53e-11

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 60.48  E-value: 2.53e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 922581002   1811 KYPECVRFDEKSGTWTARGAALIGLNLTHAACEYNRIGVFTMFVNDQSS 1859
Cdd:smart00303    1 FNPICVFWDESSGEWSTRGCELLETNGTHTTCSCNHLTTFAVLMDVPPI 49
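
The paired rows in each alignment above (the query `gi 922581002` against the `Cdd:` model consensus) can be compared column by column. The sketch below counts identical residues for the GPS (smart00303) pair; the function name `alignment_identity` is hypothetical, and the handling of lowercase letters as unaligned residues and `-` as gaps is an assumption about this display format, not something stated on the page.

```python
def alignment_identity(query, subject):
    """Count identical and aligned columns between two equal-length alignment rows.

    Assumptions: '-' marks a gap and lowercase letters mark unaligned
    residues, so both kinds of column are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the GPS (smart00303) alignment above, residues 1811-1859.
query_row = "KYPECVRFDEKSGTWTARGAALIGLNLTHAACEYNRIGVFTMFVNDQSS"
model_row = "FNPICVFWDESSGEWSTRGCELLETNGTHTTCSCNHLTTFAVLMDVPPI"

ident, aligned = alignment_identity(query_row, model_row)
print(f"{ident} identical of {aligned} aligned columns")
```

For alignments that span several blocks, the rows of each block would need to be concatenated before calling the function.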
Laminin_G_1 pfam00054
Laminin G domain;
1000-1150 9.17e-11

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 61.56  E-value: 9.17e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1000 VSTIAHNGVLVFTGDKR-SDFVEVSVVDRVLKVQFSLGGEKIdAKMENDvenRINDGEWHTVALEYSNKQITMSLDDCEt 1078
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTeRDFLALELRDGRLEVSYDLGSGAA-VVRSGD---KLNDGKWHSVELERNGRSGTLSVDGEA- 75
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 922581002  1079 NPSLLLNTSPNcairAKLNLEKKcedptvpcyryldisngLFLGGRPGTSKQ-----IEKAFSGCISDLSVDKEDVD 1150
Cdd:pfam00054   76 RPTGESPLGAT----TDLDVDGP-----------------LYVGGLPSLGVKkrrlaISPSFDGCIRDVIVNGKPLD 131
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
893-928 5.74e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 53.41  E-value: 5.74e-09
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 922581002  893 IDECYRGR-CSNNSTCVAFENTYQCECKPGWIGRHCE 928
Cdd:cd00054     2 IDECASGNpCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1223-1357 9.27e-09

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 56.27  E-value: 9.27e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1223 FEEESFV-LYQPSQVSVPFEVSFEFRTSRADMQVFALEFTQRSVHYNLEVDDGTLK--YNIGDSEVELPAPE-VTSKHWM 1298
Cdd:cd00110     4 FSGSSYVrLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVlrYDLGSGSLVLSSKTpLNDGQWH 83
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 922581002 1299 NVVIKFEADSVATSINGIYSAEAKASISDMNL---ESLYFGIAPGTGHPSR------FEGCIRNVLVD 1357
Cdd:cd00110    84 SVSVERNGRSVTLSVDGERVVESGSPGGSALLnldGPLYLGGLPEDLKSPGlpvspgFVGCIRDLKVN 151
LamG smart00282
Laminin G domain;
1241-1359 1.50e-08

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 55.04  E-value: 1.50e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002   1241 EVSFEFRTSRADMQVFALEFTQRSVHYNLEVDDGTLK--YNIGDSEVELPAPEVTSK--HWMNVVIKFEADSVATSINGI 1316
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVlrYDLGSGPARLTSDPTPLNdgQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|..
gi 922581002   1317 YSAEAKASIS--DMNLES-LYFGIAPGTGHPSR------FEGCIRNVLVDGR 1359
Cdd:smart00282   81 NRVSGESPGGltILNLDGpLYLGGLPEDLKLPPlpvtpgFRGCIRNLKVNGK 132
EGF_CA smart00179
Calcium-binding EGF-like domain;
893-928 1.36e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 49.55  E-value: 1.36e-07
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 922581002    893 IDECY-RGRCSNNSTCVAFENTYQCECKPGWI-GRHCE 928
Cdd:smart00179    2 IDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
7tmB2_Adhesion cd15040
adhesion receptors, subfamily B2 of the class B family of seven-transmembrane G ...
1878-2096 1.53e-07

adhesion receptors, subfamily B2 of the class B family of seven-transmembrane G protein-coupled receptors; The B2 subfamily of class B GPCRs consists of cell-adhesion receptors with 33 members in humans and vertebrates. The adhesion receptors are characterized by the presence of large N-terminal extracellular domains containing a variety of structural motifs, which play critical roles in cell-cell adhesion and cell-matrix interactions, linked to a class B seven-transmembrane domain. These include, for example, EGF (epidermal growth factor)-like domains in CD97, Celsr1 (cadherin family member), Celsr2, Celsr3, EMR1 (EGF-module-containing mucin-like hormone receptor-like 1), EMR2, EMR3, and Flamingo; two laminin A G-type repeats and nine cadherin domains in Flamingo and its human orthologs Celsr1, Celsr2 and Celsr3; olfactomedin-like domains in the latrotoxin receptors; and four or five thrombospondin type 1 repeats in BAI1 (brain-specific angiogenesis inhibitor 1), BAI2 and BAI3. Furthermore, almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR autoproteolysis-inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions.


Pssm-ID: 320168 [Multi-domain]  Cd Length: 253  Bit Score: 54.89  E-value: 1.53e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1878 VALFLCFLSILLTLSRRSLKTHSVRIgfILFFAINILNLFFVhkTAINQAY----CPVRNAMLSFTSSAPFAWLFLYGLY 1953
Cdd:cd15040    17 LGLLLTIITYILFRKLRKRKPTKILL--NLCLALLLANLLFL--FGINSTDnpvlCTAVAALLHYFLLASFMWMLVEALL 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1954 IYRML---ADGSSSPSLTTSLLVGIVFPCLISFTTFFV--------TDQCSLSPHLWLFWCIILPIGLFLL--LSFYAAA 2020
Cdd:cd15040    93 LYLRLvkvFGTYPRHFILKYALIGWGLPLIIVIITLAVdpdsygnsSGYCWLSNGNGLYYAFLGPVLLIILvnLVIFVLV 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 2021 TSVLVSLHKKydVFVAKYNVKRAVFQHFILTIFTLGMT-LTGLFAnqlplpmeimEISQSII--YLIAA------LVIFL 2091
Cdd:cd15040   173 LRKLLRLSAK--RNKKKRKKTKAQLRAAVSLFFLLGLTwIFGILA----------IFGARVVfqYLFAIfnslqgFFIFI 240

                  ....*.
gi 922581002 2092 W-CVCD 2096
Cdd:cd15040   241 FhCLRN 246
7tm_classB cd13952
class B family of seven-transmembrane G protein-coupled receptors; The class B of ...
1877-2095 1.74e-07

class B family of seven-transmembrane G protein-coupled receptors; The class B of seven-transmembrane GPCRs is classified into three major subfamilies: subfamily B1 (secretin-like receptor family), B2 (adhesion family), and B3 (Methuselah-like family). The class B receptors have been identified in all the vertebrates, from fishes to mammals, as well as invertebrates including Caenorhabditis elegans and Drosophila melanogaster, but are not present in plants, fungi or prokaryotes. The B1 subfamily comprises receptors for polypeptide hormones of 27-141 amino-acid residues such as secretin, glucagon, glucagon-like peptide (GLP), calcitonin gene-related peptide, parathyroid hormone (PTH), and corticotropin-releasing factor. These receptors contain the large N-terminal extracellular domain (ECD), which plays a critical role in hormone recognition by binding to the C-terminal portion of the peptide. On the other hand, the N-terminal segment of the hormone induces receptor activation by interacting with the receptor transmembrane domains and connecting extracellular loops, triggering intracellular signaling pathways. All members of the subfamily B1 receptors preferentially couple to G proteins of G(s) family, which positively stimulate adenylate cyclase, leading to increased intracellular cAMP formation and calcium influx. The subfamily B2 consists of cell-adhesion receptors with 33 members in humans and vertebrates. The adhesion receptors are characterized by the presence of large N-terminal extracellular domains containing a variety of structural motifs, which play critical roles in cell-cell adhesion and cell-matrix interactions, linked to a class B seven-transmembrane domain. 
These include, for example, EGF (epidermal growth factor)-like domains in CD97, Celsr1 (cadherin family member), Celsr2, Celsr3, EMR1 (EGF-module-containing mucin-like hormone receptor-like 1), EMR2, EMR3, and Flamingo; two laminin A G-type repeats and nine cadherin domains in Flamingo and its human orthologs Celsr1, Celsr2 and Celsr3; olfactomedin-like domains in the latrotoxin receptors; and four or five thrombospondin type 1 repeats in BAI1 (brain-specific angiogenesis inhibitor 1), BAI2 and BAI3. Almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR autoproteolysis-inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions. Furthermore, subfamily B3 includes the Methuselah (Mth) protein, which was originally identified in Drosophila as a GPCR affecting stress resistance and aging, and its closely related proteins.


Pssm-ID: 410627 [Multi-domain]  Cd Length: 260  Bit Score: 54.91  E-value: 1.74e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1877 GVALFLCFLSILLTLSRRSLKTHSVRIGFILFFAINILNLFFV----HKTAINQAYCPVRNAMLSFTSSAPFAWLFLYGL 1952
Cdd:cd13952    13 SLSLVGLLLTIITYLLFPKLRNLRGKILINLCLSLLLAQLLFLigqlLTSSDRPVLCKALAILLHYFLLASFFWMLVEAF 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1953 YIYRMLADGSSSPSLTTSLL---VGIVFPCLISFTTFFV------------TDQCSLSPHLWLFWCIILPIGLFLL--LS 2015
Cdd:cd13952    93 DLYRTFVKVFGSSERRRFLKyslYGWGLPLLIVIITAIVdfslygpspgygGEYCWLSNGNALLWAFYGPVLLILLvnLV 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 2016 FYAAATSVLVSlHKKYDVFVAKYNVKRAVFQHFILTIFTLGMT-LTGLFANQLPLPMEIMEISqSIIYLIAALVIFLWCV 2094
Cdd:cd13952   173 FFILTVRILLR-KLRETPKQSERKSDRKQLRAYLKLFPLMGLTwIFGILAPFVGGSLVFWYLF-DILNSLQGFFIFLIFC 250

                  .
gi 922581002 2095 C 2095
Cdd:cd13952   251 L 251
7tmB2_CELSR1 cd15991
Cadherin EGF LAG seven-pass G-type receptor 1, member of the class B2 family of ...
1871-2064 9.84e-07

Cadherin EGF LAG seven-pass G-type receptor 1, member of the class B2 family of seven-transmembrane G protein-coupled receptors; The group IV adhesion GPCRs include the cadherin EGF LAG seven-pass G-type receptors (CELSRs) and their Drosophila homolog Flamingo (also known as Starry night). These receptors are also classified as belonging to the EGF-TM7 group of subfamily B2 adhesion GPCRs, because they contain EGF-like domains. Functionally, the group IV receptors act as key regulators of many physiological processes such as endocrine cell differentiation, neuronal migration, dendrite growth, axon guidance, lymphatic vessel and valve formation, and planar cell polarity (PCP) during embryonic development. Three mammalian orthologs of Flamingo, Celsr1-3, are widely expressed in the nervous system from embryonic development until the adult stage. Each Celsr exhibits different expression patterns in the developing brain, suggesting that they serve distinct functions. Mutations of CELSR1 cause neural tube defects in the nervous system, while mutations of CELSR2 are associated with coronary heart disease. Moreover, CELSR1 and several other PCP signaling molecules, such as dishevelled, prickle, and frizzled, have been shown to be upregulated in B lymphocytes of chronic lymphocytic leukemia patients. The adhesion receptors are characterized by the presence of large N-terminal extracellular domains containing multiple adhesion motifs, which play critical roles in cell-cell adhesion and cell-matrix interactions, that are coupled to a class B seven-transmembrane domain. In the case of CELSR/Flamingo/Starry night, their extracellular domains comprise nine cadherin repeats linked to a series of epidermal growth factor (EGF)-like and laminin globular (G)-like domains. The cadherin repeats contain sequence motifs that mediate calcium-dependent cell-cell adhesion by homophilic interactions. 
Moreover, almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR autoproteolysis-inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions.


Pssm-ID: 320657 [Multi-domain]  Cd Length: 254  Bit Score: 52.54  E-value: 9.84e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1871 TSPAIAGVALFLCFLSILLTLSRRSlKTHSVRIGFI--LFFAInilnlfFVHKTAINQA----YCPVRNAMLSFTSSAPF 1944
Cdd:cd15991    10 TTVSLSLVALLITFILLVLIRTLRS-NLHSIHKNLVaaLFFSE------LIFLIGINQTenpfVCTVVAILLHYFYMSTF 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1945 AWLFLYGLYIYRMLAD--GSSSPSLTTSLLVGIVFPCLISFTTFFVTDQ-------CSLSPHLWLFWCIILPIGLFLLLS 2015
Cdd:cd15991    83 AWMFVEGLHIYRMLTEvrNINTGHMRFYYVVGWGIPAIITGLAVGLDPQgygnpdfCWLSVQDTLIWSFAGPIGIVVIIN 162
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 922581002 2016 FYAAATSVLVSLHKKYDVFvAKYNVKRAVFQHFILTIFTLGMTLTGLFA 2064
Cdd:cd15991   163 TVIFVLAAKASCGRRQRYF-EKSGVISMLRTAFLLLLLISATWLLGLMA 210
HRM pfam02793
Hormone receptor domain; This extracellular domain contains four conserved cysteines that ...
1558-1610 2.51e-06

Hormone receptor domain; This extracellular domain contains four conserved cysteines that probably form disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain.


Pssm-ID: 397086  Cd Length: 64  Bit Score: 46.59  E-value: 2.51e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 922581002  1558 CPSEIEYSIQWPASQKGSIVRQSCP-----VGESGLATRKCLETGRWSDV---NAWNCTRP 1610
Cdd:pfam02793    4 CPRTWDGILCWPRTPAGETVEVPCPdyfsgFDPRGNASRNCTEDGTWSEHppsNYSNCTSN 64
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
896-926 8.30e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 44.30  E-value: 8.30e-06
                           10        20        30
                   ....*....|....*....|....*....|.
gi 922581002   896 CYRGRCSNNSTCVAFENTYQCECKPGWIGRH 926
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
895-928 1.40e-05

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 43.62  E-value: 1.40e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 922581002  895 EC-YRGRCSNNSTCVAFENTYQCECKPGWIG-RHCE 928
Cdd:cd00053     1 ECaASNPCSNGGTCVNTPGSYRCVCPPGYTGdRSCE 36
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
1506-1552 1.44e-05

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies.


Pssm-ID: 238012  Cd Length: 50  Bit Score: 44.27  E-value: 1.44e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 922581002 1506 CEC-GFGADSTEC-SADGHCKCNGDAVGRRCDRCSRFDHQLDSKTLKCR 1552
Cdd:cd00055     2 CDCnGHGSLSGQCdPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGCQ 50
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1415-1449 2.84e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 2.84e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 922581002 1415 CSVANVCS-SGTCVssNTTAGYECICPAGKTGKNCQ 1449
Cdd:cd00054     5 CASGNPCQnGGTCV--NTVGSYRCSCPPGYTGRNCE 38
7tmB2_CELSR_Adhesion_IV cd15441
cadherin EGF LAG seven-pass G-type receptors, group IV adhesion GPCRs, member of the class B2 ...
1874-2091 3.59e-05

cadherin EGF LAG seven-pass G-type receptors, group IV adhesion GPCRs, member of the class B2 family of seven-transmembrane G protein-coupled receptors; The group IV adhesion GPCRs include the cadherin EGF LAG seven-pass G-type receptors (CELSRs) and their Drosophila homolog Flamingo (also known as Starry night). These receptors are also classified as belonging to the EGF-TM7 group of subfamily B2 adhesion GPCRs, because they contain EGF-like domains. Functionally, the group IV receptors act as key regulators of many physiological processes such as endocrine cell differentiation, neuronal migration, dendrite growth, axon guidance, lymphatic vessel and valve formation, and planar cell polarity (PCP) during embryonic development. The adhesion receptors are characterized by the presence of large N-terminal extracellular domains containing multiple adhesion motifs, which play critical roles in cell-cell adhesion and cell-matrix interactions, that are coupled to a class B seven-transmembrane domain. In the case of CELSR/Flamingo/Starry night, their extracellular domains comprise nine cadherin repeats linked to a series of epidermal growth factor (EGF)-like and laminin globular (G)-like domains. The cadherin repeats contain sequence motifs that mediate calcium-dependent cell-cell adhesion by homophilic interactions. Moreover, almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR autoproteolysis-inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions. Three mammalian orthologs of Flamingo, Celsr1-3, are widely expressed in the nervous system from embryonic development until the adult stage. Each Celsr exhibits different expression patterns in the developing brain, suggesting that they serve distinct functions. 
Mutations of CELSR1 cause neural tube defects in the nervous system, while mutations of CELSR2 are associated with coronary heart disease. Moreover, CELSR1 and several other PCP signaling molecules, such as dishevelled, prickle, and frizzled, have been shown to be upregulated in B lymphocytes of chronic lymphocytic leukemia patients. Celsr3 is expressed in both the developing and adult mouse brain. It has been functionally implicated in proper neuron migration and axon guidance in the CNS.


Pssm-ID: 320557 [Multi-domain]  Cd Length: 254  Bit Score: 47.63  E-value: 3.59e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1874 AIAGVALFLCFLsILLTLSRRSLKTHSVRIGFIL-FFAINILNLFFVHKTAiNQAYCPVRNAMLSFTSSAPFAWLFLYGL 1952
Cdd:cd15441    13 GISLVLLVIAFL-VLSCLRGLQSNSNSIHKNLVAcLLLAELLFLLGINQTE-NLFPCKLIAILLHYFYLSAFSWLLVESL 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1953 YIYRMLADGSSSPSLTTS--LLVGIVFPCLISFTTFFV-TDQ------CSLSPHLWLFWCIILPIGLFLLLSFYAAATSV 2023
Cdd:cd15441    91 HLYRMLTEPRDINHGHMRfyYLLGYGIPAIIVGLSVGLrPDGygnpdfCWLSVNETLIWSFAGPIAFVIVITLIIFILAL 170
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 922581002 2024 LVSLHKKYDVFvAKYNVKRAVFQHFILTIFTLGMTLTGLFAnqLPLPMEIMEISQSIIYLIAALVIFL 2091
Cdd:cd15441   171 RASCTLKRHVL-EKASVRTDLRSSFLLLPLLGATWVFGLLA--VNEDSELLHYLFAGLNFLQGLFIFL 235
7tm_2 pfam00002
7 transmembrane receptor (Secretin family); This family is known as Family B, the ...
1878-2064 3.91e-05

7 transmembrane receptor (Secretin family); This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GPCRs). They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognized. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97, calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors, amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.


Pssm-ID: 459625 [Multi-domain]  Cd Length: 248  Bit Score: 47.66  E-value: 3.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1878 VALFLCFLsILLTLsrRSLktHSVRIG-----FILFFAINILnlFFVHKTAI---------NQAYCPVRNAMLSFTSSAP 1943
Cdd:pfam00002   17 VALLLAIA-IFLLF--RKL--HCTRNYihlnlFASFILRALL--FLVGDAVLfnkqdldhcSWVGCKVVAVFLHYFFLAN 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1944 FAWLFLYGLYIYRMLADGSSSPSLTTSLLVGI------VFPCLISFTT---FFVTDQCSLSPHLWLFWCIILPIGLFLLL 2014
Cdd:pfam00002   90 FFWMLVEGLYLYTLLVEVFFSERKYFWWYLLIgwgvpaLVVGIWAGVDpkgYGEDDGCWLSNENGLWWIIRGPILLIILV 169
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 922581002  2015 SFYAAATSVLVSLHKK---YDVFVAKYNVKRAVFQHFILtIFTLGMT-LTGLFA 2064
Cdd:pfam00002  170 NFIIFINIVRILVQKLretNMGKSDLKQYRRLAKSTLLL-LPLLGITwVFGLFA 222
CA_like cd00031
Cadherin repeat-like domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
433-529 4.89e-05

Cadherin repeat-like domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers. This family also includes the cadherin-like repeats of extracellular alpha-dystroglycan.


Pssm-ID: 206635  Cd Length: 98  Bit Score: 44.26  E-value: 4.89e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  433 TISEDIPVGTSFLQ-VSAIDADIGPNGIVDYFLNESSSSPSIQLFRLDRTSGTLRVSSKLDREQFAVIVLPIFARDRGTP 511
Cdd:cd00031     1 IPDGSAVEGRSRGSfRVSIPTDLIASSGEIIKISAAGKEALPSWLHWEPHSGILEGLEKLDREDKGVHYISVSAASLGAN 80
                          90
                  ....*....|....*...
gi 922581002  512 SLSAASEITLTLSDVNDN 529
Cdd:cd00031    81 VPQTSSVFSIEVYDENDN 98
7tmB2_CELSR2 cd15992
Cadherin EGF LAG seven-pass G-type receptor 2, member of the class B2 family of ...
1877-1960 1.10e-04

Cadherin EGF LAG seven-pass G-type receptor 2, member of the class B2 family of seven-transmembrane G protein-coupled receptors; The group IV adhesion GPCRs include the cadherin EGF LAG seven-pass G-type receptors (CELSRs) and their Drosophila homolog Flamingo (also known as Starry night). These receptors are also classified as belonging to the EGF-TM7 group of subfamily B2 adhesion GPCRs, because they contain EGF-like domains. Functionally, the group IV receptors act as key regulators of many physiological processes such as endocrine cell differentiation, neuronal migration, dendrite growth, axon guidance, lymphatic vessel and valve formation, and planar cell polarity (PCP) during embryonic development. Three mammalian orthologs of Flamingo, Celsr1-3, are widely expressed in the nervous system from embryonic development until the adult stage. Each Celsr exhibits different expression patterns in the developing brain, suggesting that they serve distinct functions. Mutations of CELSR1 cause neural tube defects in the nervous system, while mutations of CELSR2 are associated with coronary heart disease. Moreover, CELSR1 and several other PCP signaling molecules, such as dishevelled, prickle, and frizzled, have been shown to be upregulated in B lymphocytes of chronic lymphocytic leukemia patients. The adhesion receptors are characterized by the presence of large N-terminal extracellular domains containing multiple adhesion motifs, which play critical roles in cell-cell adhesion and cell-matrix interactions, that are coupled to a class B seven-transmembrane domain. In the case of CELSR/Flamingo/Starry night, their extracellular domains comprise nine cadherin repeats linked to a series of epidermal growth factor (EGF)-like and laminin globular (G)-like domains. The cadherin repeats contain sequence motifs that mediate calcium-dependent cell-cell adhesion by homophilic interactions. 
Moreover, almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR-autoproteolysis inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions.


Pssm-ID: 320658  Cd Length: 255  Bit Score: 46.35  E-value: 1.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1877 GVALFLCFLSILLTLSRRSLKTHSVRIGFILFFAINILNLFFVhkTAINQA----YCPVRNAMLSFTSSAPFAWLFLYGL 1952
Cdd:cd15992    13 GVTLGFLLLTFLFLLCLRALRSNKTSIRKNGATALFLSELVFI--LGINQAdnpfACTVIAILLHFFYLCTFSWLFLEGL 90

                  ....*...
gi 922581002 1953 YIYRMLAD 1960
Cdd:cd15992    91 HIYRMLSE 98
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
901-922 1.28e-04

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteristic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neuregulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin and integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 40.78  E-value: 1.28e-04
                           10        20
                   ....*....|....*....|..
gi 922581002   901 CSNNSTCVAFENTYQCECKPGW 922
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
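
As a rough check on an alignment like the hEGF hit above, the fraction of identical columns can be counted directly from the two aligned rows. The following is a minimal sketch, not part of CD-Search itself: the function name `percent_identity` is hypothetical, gap columns (`-`) are skipped, and lowercase residues get no special treatment.

```python
def percent_identity(query_row: str, subject_row: str) -> float:
    """Percent of non-gap alignment columns where both rows carry the same residue.

    Hypothetical helper for illustration; both rows must be copied from the
    same alignment block so that they have equal length.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same length")

    compared = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        # Columns with a gap in either row are not counted.
        if q == "-" or s == "-":
            continue
        compared += 1
        if q == s:
            identical += 1

    return 100.0 * identical / compared if compared else 0.0


# Rows copied from the hEGF (pfam12661) alignment, query residues 901-922.
query = "CSNNSTCVAFENTYQCECKPGW"
consensus = "CQNGGTCVDGVNGYKCQCPPGY"
print(f"{percent_identity(query, consensus):.1f}% identity")
```

Note that identity alone says little about significance for a 22-residue domain; the bit score and E-value reported for the hit remain the relevant statistics.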
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1246-1359 1.46e-04

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 43.56  E-value: 1.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  1246 FRTSRADMQVFALEfTQRSVHYNLEVDDGTLK--YNIGDSEVELPAPE--VTSKHWMNVVIKFEADSVATSINGIYSAEA 1321
Cdd:pfam02210    1 FRTRQPNGLLLYAG-GGGSDFLALELVNGRLVlrYDLGSGPESLLSSGknLNDGQWHSVRVERNGNTLTLSVDGQTVVSS 79
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 922581002  1322 KAS--ISDMNLES-LYFGiapGTGHPSR---------FEGCIRNVLVDGR 1359
Cdd:pfam02210   80 LPPgeSLLLNLNGpLYLG---GLPPLLLlpalpvragFVGCIRDVRVNGE 126
7tmB2_CELSR3 cd15993
Cadherin EGF LAG seven-pass G-type receptor 3, member of the class B2 family of ...
1871-2090 3.99e-04

Cadherin EGF LAG seven-pass G-type receptor 3, member of the class B2 family of seven-transmembrane G protein-coupled receptors; The group IV adhesion GPCRs include the cadherin EGF LAG seven-pass G-type receptors (CELSRs) and their Drosophila homolog Flamingo (also known as Starry night). These receptors are also classified as belonging to the EGF-TM7 group of subfamily B2 adhesion GPCRs, because they contain EGF-like domains. Functionally, the group IV receptors act as key regulators of many physiological processes such as endocrine cell differentiation, neuronal migration, dendrite growth, axon guidance, lymphatic vessel and valve formation, and planar cell polarity (PCP) during embryonic development. Three mammalian orthologs of Flamingo, Celsr1-3, are widely expressed in the nervous system from embryonic development until the adult stage. Each Celsr exhibits different expression patterns in the developing brain, suggesting that they serve distinct functions. Mutations of CELSR1 cause neural tube defects in the nervous system, while mutations of CELSR2 are associated with coronary heart disease. Moreover, CELSR1 and several other PCP signaling molecules, such as dishevelled, prickle, and frizzled, have been shown to be upregulated in B lymphocytes of chronic lymphocytic leukemia patients. Celsr3 is expressed in both the developing and adult mouse brain. It has been functionally implicated in proper neuronal migration and axon guidance in the CNS. The adhesion receptors are characterized by the presence of large N-terminal extracellular domains containing multiple adhesion motifs, which play critical roles in cell-cell adhesion and cell-matrix interactions, that are coupled to a class B seven-transmembrane domain. In the case of CELSR/Flamingo/Starry night, their extracellular domains comprise nine cadherin repeats linked to a series of epidermal growth factor (EGF)-like and laminin globular (G)-like domains. 
The cadherin repeats contain sequence motifs that mediate calcium-dependent cell-cell adhesion by homophilic interactions. Moreover, almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR-autoproteolysis inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions.


Pssm-ID: 320659 [Multi-domain]  Cd Length: 254  Bit Score: 44.45  E-value: 3.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1871 TSPAIAGVALFLCFlSILLTLsrRSLKT-----HSvRIGFILFFAiNILNLFFVHKTAiNQAYCPVRNAMLSFTSSAPFA 1945
Cdd:cd15993    10 SSVSASLAALVLTF-SVLTCL--RGLKSntrgiHS-NIAAALFLS-ELLFLLGINRTE-NQFLCTVVAILLHYFFLSTFA 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1946 WLFLYGLYIYRMLADGSSSPSLTTS--LLVGIVFPCLISFTT-------FFVTDQCSLSPHLWLFWCIILPIGLFLLLS- 2015
Cdd:cd15993    84 WLFVQGLHIYRMQTEARNVNFGAMRfyYAIGWGVPAIITGLAvgldpegYGNPDFCWISIHDKLVWSFAGPIVVVIVMNg 163
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 922581002 2016 --FYAAATSVLVSLHKKydvfVAKYNVKRAVFQHFILTIFTLGMTLTGLFA-NQLPLPMEIMEISQSIIYLIAALVIF 2090
Cdd:cd15993   164 vmFLLVARMSCSPGQKE----TKKTSVLMTLRSSFLLLLLISATWLFGLLAvNNSVLAFHYLHAILCCLQGLAVLLLF 237
7tmB2_Latrophilin-1 cd16007
Latrophilin-1, member of the class B2 family of seven-transmembrane G protein-coupled ...
1906-2029 4.60e-04

Latrophilin-1, member of the class B2 family of seven-transmembrane G protein-coupled receptors; Latrophilins (also called lectomedins or latrotoxin receptors) belong to Group I adhesion GPCRs, which also include ETL (EGF-TM7-latrophilin-related protein). These receptors are members of the adhesion family (subclass B2) that belongs to the class B GPCRs. Three subtypes of latrophilins have been identified: LPH1 (latrophilin-1), LPH2, and LPH3. Latrophilin-1 is a brain-specific calcium-independent receptor of alpha-latrotoxin, a potent presynaptic neurotoxin from the venom of the black widow spider that induces massive neurotransmitter release from sensory and motor neurons as well as endocrine cells, leading to nerve-terminal degeneration. Latrophilin-2 and -3, although sharing strong sequence homology to latrophilin-1, do not bind alpha-latrotoxin. While latrophilin-3 is also brain specific, latrophilin-2 is ubiquitously distributed. The endogenous ligands for these two receptors are unknown. ETL, a seven-transmembrane receptor containing EGF-like repeats, is highly expressed in heart, where it is developmentally regulated, as well as in normal smooth muscle cells. The function of ETL is unknown. All adhesion GPCRs possess large N-terminal extracellular domains containing multiple structural motifs, which play critical roles in cell-cell adhesion and cell-matrix interactions, coupled to a seven-transmembrane domain. In addition, almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR-autoproteolysis inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions.


Pssm-ID: 320673 [Multi-domain]  Cd Length: 258  Bit Score: 44.14  E-value: 4.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1906 ILFFAINILNLFFVHKTAiNQAYCPVRNAMLSFTSSAPFAWLFLYGLYIYRMLAD--GSSSPSLTTSLLVGIVFPCL--- 1980
Cdd:cd16007    45 INLFLAELLFLIGIDKTQ-YQIACPIFAGLLHFFFLAAFSWLCLEGVQLYLMLVEvfESEYSRKKYYYLCGYCFPALvvg 123
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 922581002 1981 ----ISFTTFFVTDQCSLSPHLWLFWCIILPIGLFLLLSFyaaaTSVLVSLHK 2029
Cdd:cd16007   124 isaaIDYRSYGTEKACWLRVDNYFIWSFIGPVSFVIVVNL----VFLMVTLHK 172
CA_like cd00031
Cadherin repeat-like domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
127-218 6.11e-04

Cadherin repeat-like domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers. This family also includes the cadherin-like repeats of extracellular alpha-dystroglycan.


Pssm-ID: 206635  Cd Length: 98  Bit Score: 41.18  E-value: 6.11e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002  127 ITLEENVAIGEEVGRVyAIDEDSGPNGIIKYSMEGSED----FIIDEDSGLIKTTKLLDRETTARYSLKVTARDMGTPSL 202
Cdd:cd00031     4 GSAVEGRSRGSFRVSI-PTDLIASSGEIIKISAAGKEAlpswLHWEPHSGILEGLEKLDREDKGVHYISVSAASLGANVP 82
                          90
                  ....*....|....*.
gi 922581002  203 NTSTTIAVVLKDINDN 218
Cdd:cd00031    83 QTSSVFSIEVYDENDN 98
CA_like cd00031
Cadherin repeat-like domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
351-421 6.60e-04

Cadherin repeat-like domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers. This family also includes the cadherin-like repeats of extracellular alpha-dystroglycan.


Pssm-ID: 206635  Cd Length: 98  Bit Score: 41.18  E-value: 6.60e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 922581002  351 DRGDNARIVYSIDSS------QFFRIDPSSGDISVSSDLDREDRATFSVIVTASDHASPPLNTSTQIEVILDDINDN 421
Cdd:cd00031    22 DLIASSGEIIKISAAgkealpSWLHWEPHSGILEGLEKLDREDKGVHYISVSAASLGANVPQTSSVFSIEVYDENDN 98
EGF smart00181
Epidermal growth factor-like domain;
1415-1449 9.00e-04

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 38.65  E-value: 9.00e-04
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 922581002   1415 CSVANVCSSGTCVssNTTAGYECICPAGKTG-KNCQ 1449
Cdd:smart00181    2 CASGGPCSNGTCI--NTPGSYTCSCPPGYTGdKRCE 35
Cadherin_4 pfam17803
Bacterial cadherin-like domain; This entry contains numerous bacterial cadherin-like domains ...
2-74 9.32e-04

Bacterial cadherin-like domain; This entry contains numerous bacterial cadherin-like domains found in extracellular proteins.


Pssm-ID: 465514 [Multi-domain]  Cd Length: 71  Bit Score: 39.86  E-value: 9.32e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 922581002     2 VEITVVDVNDnAPVFASDSYNVTILENITIPA-VIATvkATDEDfgtNGKVHYSMASSSGIGG-LTIDySTGEVT 74
Cdd:pfam17803    3 VTITVTGVND-APVAVDDTATVDEDTPVTITGnVLAN--DTDVD---GDTLTVTSVTGAPAYGtLTLN-ADGSFT 70
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domain;
902-940 9.96e-04

Laminin-type epidermal growth factor-like domain;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 38.83  E-value: 9.96e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 922581002    902 SNNSTCVAFenTYQCECKPGWIGRHCEisvhalTCVPGY 940
Cdd:smart00180    8 SASGTCDPD--TGQCECKPNVTGRRCD------RCAPGY 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
1415-1449 1.26e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 38.38  E-value: 1.26e-03
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 922581002   1415 CSVANVCSSG-TCVssNTTAGYECICPAG-KTGKNCQ 1449
Cdd:smart00179    5 CASGNPCQNGgTCV--NTVGSYRCECPPGyTDGRNCE 39
EGF smart00181
Epidermal growth factor-like domain;
895-928 1.86e-03

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 37.88  E-value: 1.86e-03
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 922581002    895 EC-YRGRCSNNsTCVAFENTYQCECKPGWIG-RHCE 928
Cdd:smart00181    1 ECaSGGPCSNG-TCINTPGSYTCSCPPGYTGdKRCE 35
7tmB2_GPR133-like_Adhesion_V cd15933
orphan GPR133 and related proteins, group V adhesion GPCRs, member of class B2 family of ...
1874-1957 2.04e-03

orphan GPR133 and related proteins, group V adhesion GPCRs, member of class B2 family of seven-transmembrane G protein-coupled receptors; group V adhesion GPCRs include orphan receptors GPR133, GPR144, and closely related proteins. The function of GPR144 has not yet been characterized, whereas GPR133 is highly expressed in the pituitary gland and is coupled to the G(s) protein, leading to activation of the adenylate cyclase pathway. Moreover, genetic variations in GPR133 have been reported to be associated with adult height and heart rate. The adhesion receptors are characterized by the presence of large N-terminal extracellular domains containing multiple adhesion motifs, which play critical roles in ligand recognition as well as cell-cell adhesion and cell-matrix interactions, linked by a stalk region to a class B seven-transmembrane domain. In addition, almost all adhesion receptors, except GPR123, contain an evolutionarily conserved GPCR-autoproteolysis inducing (GAIN) domain that undergoes autoproteolytic processing at the GPCR proteolysis site (GPS) motif located immediately N-terminal to the first transmembrane region, to generate N- and C-terminal fragments (NTF and CTF), which may serve important biological functions. However, several adhesion GPCRs, including GPR111, GPR115, and CELSR1, are predicted to be non-cleavable at the GAIN domain because of the lack of a consensus catalytic triad sequence (His-Leu-Ser/Thr) within their GPS.


Pssm-ID: 320599 [Multi-domain]  Cd Length: 252  Bit Score: 42.31  E-value: 2.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1874 AIAGVALFLCFL--SILLTLSRRSLKTHSVRIGFILFFAINILNLFFV--HKTAINQAYCPVRNAMLSFTSSAPFAWLFL 1949
Cdd:cd15933     8 SYIGCGISIACLalTLIIFLVLRVLSSDRFQIHKNLCVALLLAQILLLagEWAEGNKVACKVVAILLHFFFMAAFSWMLV 87

                  ....*...
gi 922581002 1950 YGLYIYRM 1957
Cdd:cd15933    88 EGLHLYLM 95
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1415-1449 2.85e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.46  E-value: 2.85e-03
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 922581002 1415 CSVANVCSS-GTCVssNTTAGYECICPAGKTG-KNCQ 1449
Cdd:cd00053     2 CAASNPCSNgGTCV--NTPGSYRCVCPPGYTGdRSCE 36
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ...
902-940 3.12e-03

Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies


Pssm-ID: 238012  Cd Length: 50  Bit Score: 37.72  E-value: 3.12e-03
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 922581002  902 SNNSTCvaFENTYQCECKPGWIGRHCEisvhalTCVPGY 940
Cdd:cd00055     9 SLSGQC--DPGTGQCECKPNTTGRRCD------RCAPGY 39
7tmB1_hormone_R cd15041
The subfamily B1 of hormone receptors (secretin-like), member of the class B family ...
1877-2017 3.30e-03

The subfamily B1 of hormone receptors (secretin-like), member of the class B family of seven-transmembrane G protein-coupled receptors; The B1 subfamily of class B GPCRs, also referred to as secretin-like receptor family, includes receptors for polypeptide hormones of 27-141 amino-acid residues such as secretin, glucagon, glucagon-like peptide (GLP), calcitonin gene-related peptide, parathyroid hormone (PTH), and corticotropin-releasing factor. These receptors contain the large N-terminal extracellular domain (ECD), which plays a critical role in hormone recognition by binding to the C-terminal portion of the peptide. On the other hand, the N-terminal segment of the hormone induces receptor activation by interacting with the receptor transmembrane domains and connecting extracellular loops, triggering intracellular signaling pathways. All members of this subfamily preferentially couple to G proteins of G(s) family, which positively stimulate adenylate cyclase, leading to increased intracellular cAMP formation and calcium influx. Moreover, the B1 subfamily receptors play key roles in hormone homeostasis and are promising drug targets in various human diseases including diabetes, osteoporosis, obesity, neurodegenerative conditions (Alzheimer's and Parkinson's), cardiovascular disease, migraine, and psychiatric disorders (anxiety, depression). Furthermore, the subfamilies B2 and B3 consist of receptors that are capable of interacting with epidermal growth factors (EGF) and the Drosophila melanogaster Methuselah gene product (Mth), respectively. The class B GPCRs have been identified in all the vertebrates, from fishes to mammals, as well as invertebrates including Caenorhabditis elegans and Drosophila melanogaster, but are not present in plants, fungi, or prokaryotes.


Pssm-ID: 341321 [Multi-domain]  Cd Length: 273  Bit Score: 41.83  E-value: 3.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1877 GVALFLCFLSILLTLSRRSLKTHSVRI------GFILFFAINIL--NLFFVHK-------TAINQ--AYCPVRNAMLSFT 1939
Cdd:cd15041    13 SLSLVALLPAIVIFLYFRSLRCTRIRLhinlflSFILRAVFWIIwdLLVVYDRltssgveTVLMQnpVGCKLLSVLKRYF 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 922581002 1940 SSAPFAWLFLYGLYIYRMLADGSSSPSLTTSL--LVGIVFPCLISFTTFFV-----TDQCSLS-PHLWLFWCIILPIGLF 2011
Cdd:cd15041    93 KSANYFWMLCEGLYLHRLIVVAFFSEPSSLKLyyAIGWGLPLVIVVIWAIVrallsNESCWISyNNGHYEWILYGPNLLA 172

                  ....*.
gi 922581002 2012 LLLSFY 2017
Cdd:cd15041   173 LLVNLF 178
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
901-927 3.99e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.56  E-value: 3.99e-03
                           10        20
                   ....*....|....*....|....*..
gi 922581002   901 CSNNSTCVAFenTYQCECKPGWIGRHC 927
Cdd:pfam07974    2 CSGRGTCVNQ--CGKCVCDSGYQGATC 26
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domain;
1506-1537 7.58e-03

Laminin-type epidermal growth factor-like domain;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 36.52  E-value: 7.58e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 922581002   1506 CEC-GFGADSTECSAD-GHCKCNGDAVGRRCDRC 1537
Cdd:smart00180    1 CDCdPGGSASGTCDPDtGQCECKPNVTGRRCDRC 34
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
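
Each hit in the list above carries a summary line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, and the search used an E-value threshold of 0.01. The sketch below shows one way such lines could be parsed and checked against that threshold when working from a saved copy of this page. The names `parse_hit_stats` and `passes_threshold` are hypothetical, the regular expression assumes the exact field layout shown on this page, and treating the cutoff as inclusive is an assumption.

```python
import re

# Assumed layout of the per-hit summary line as it appears on this page.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line):
    """Return the summary fields of one hit as a dict, or None if the line does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (inclusive cutoff assumed)."""
    return hit["e_value"] <= threshold


line = "Pssm-ID: 214543  Cd Length: 46  Bit Score: 36.52  E-value: 7.58e-03"
hit = parse_hit_stats(line)
print(hit, passes_threshold(hit))
```

For bulk work, a structured download of the results would be more robust than scraping the rendered text; this sketch only covers the plain-text form shown here.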

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.