NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|153792725|ref|NP_780465|]
View 

protein HEG homolog 1 precursor [Mus musculus]

Protein Classification

calcium-binding EGF-like domain-containing protein; wall-associated receptor kinase family protein( domain architecture ID 10043351)

calcium-binding epidermal growth factor (EGF)-like domain-containing protein may play a crucial role in numerous protein-protein interactions| wall-associated receptor kinase (WAK) family protein containing the calcium-binding EGF and serine/threonine kinase domains but lacking the WAK galacturonan-binding domain, catalyzes the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates, and may function as a signaling receptor of extracellular matrix component

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
EGF_CA smart00179
Calcium-binding EGF-like domain;
726-757 2.81e-10

Calcium-binding EGF-like domain;


:

Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 56.10  E-value: 2.81e-10
                            10        20        30
                    ....*....|....*....|....*....|...
gi 153792725    726 DVNECLS-SPCPPLATCNNTQGSFTCRCPVGYQ 757
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYT 33
PHA03247 super family cl33720
large tegument protein UL36; Provisional
319-686 4.22e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 4.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  319 PSQAQPKQSSMSSDDDEPAQSSTESPVLHTSNLPTYTSTVNMPNTLVLDTGTKPVEDPSDSRVPSTQPSPSqPQPFSSAL 398
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP-PPPSPSPA 2634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  399 PSTRSPGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTlvPHRPREPRVTSvqmstaisaiALIPSNQTANPk 478
Cdd:PHA03247 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP--PQRPRRRAARP----------TVGSLTSLADP- 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  479 nqstPQQEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPALTGFSTGPALPATSTSLAqmSPALTSAMPQTT--HSP 556
Cdd:PHA03247 2702 ----PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA--RPPTTAGPPAPAppAAP 2775
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  557 VTSPSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAITTEGNREHTDPTTQPIPLTTSTTSAGERTTELGR 636
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|
gi 153792725  637 AEESSPSHFLTPSSPQTTDVSTAEMLTSRYITFAAQSTSQSPTALPPLTP 686
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
687-723 3.52e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.55  E-value: 3.52e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 153792725  687 VNSC-TVNPCLHDGKCIvDLTGrGYRCVCPPAWQGENC 723
Cdd:cd00054     2 IDECaSGNPCQNGGTCV-NTVG-SYRCSCPPGYTGRNC 37
 
Name Accession Description Interval E-value
EGF_CA smart00179
Calcium-binding EGF-like domain;
726-757 2.81e-10

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 56.10  E-value: 2.81e-10
                            10        20        30
                    ....*....|....*....|....*....|...
gi 153792725    726 DVNECLS-SPCPPLATCNNTQGSFTCRCPVGYQ 757
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYT 33
PHA03247 PHA03247
large tegument protein UL36; Provisional
319-686 4.22e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 4.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  319 PSQAQPKQSSMSSDDDEPAQSSTESPVLHTSNLPTYTSTVNMPNTLVLDTGTKPVEDPSDSRVPSTQPSPSqPQPFSSAL 398
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP-PPPSPSPA 2634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  399 PSTRSPGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTlvPHRPREPRVTSvqmstaisaiALIPSNQTANPk 478
Cdd:PHA03247 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP--PQRPRRRAARP----------TVGSLTSLADP- 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  479 nqstPQQEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPALTGFSTGPALPATSTSLAqmSPALTSAMPQTT--HSP 556
Cdd:PHA03247 2702 ----PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA--RPPTTAGPPAPAppAAP 2775
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  557 VTSPSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAITTEGNREHTDPTTQPIPLTTSTTSAGERTTELGR 636
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|
gi 153792725  637 AEESSPSHFLTPSSPQTTDVSTAEMLTSRYITFAAQSTSQSPTALPPLTP 686
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
726-758 2.70e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 53.41  E-value: 2.70e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 153792725  726 DVNECLS-SPCPPLATCNNTQGSFTCRCPVGYQL 758
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTG 34
EGF_CA pfam07645
Calcium-binding EGF domain;
726-755 2.55e-08

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 50.70  E-value: 2.55e-08
                           10        20        30
                   ....*....|....*....|....*....|..
gi 153792725   726 DVNECLSSP--CPPLATCNNTQGSFTCRCPVG 755
Cdd:pfam07645    1 DVDECATGThnCPANTVCVNTIGSFECRCPDG 32
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
687-723 3.52e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.55  E-value: 3.52e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 153792725  687 VNSC-TVNPCLHDGKCIvDLTGrGYRCVCPPAWQGENC 723
Cdd:cd00054     2 IDECaSGNPCQNGGTCV-NTVG-SYRCSCPPGYTGRNC 37
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
325-661 6.82e-06

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 49.96  E-value: 6.82e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   325 KQSSMSSDDDEPAQSSTESPVLHTSNLPTYTSTVnmPNTLVLDTGTKPVEDPSDSRVPSTQPSPSQPQPFSSALPSTRSP 404
Cdd:pfam17823   89 EHTPHGTDLSEPATREGAADGAASRALAAAASSS--PSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAAS 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   405 GSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTLVPHRPREPRVTSVQMSTAISAIALIPSNQTANPKNQSTPQ 484
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   485 QEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPA-------LTGFSTGPALPATSTSLAQMS---PALTSAMPQTTH 554
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAkhmpsdtMARNPAAPMGAQAQGPIIQVStdqPVHNTAGEPTPS 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   555 SPVTSPSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAIttegnrEHTDPTTQPIPLTTSTTSAGE---RT 631
Cdd:pfam17823  327 PSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEV------EATSPTTQPSPLLPTQGAAGPgilLA 400
                          330       340       350
                   ....*....|....*....|....*....|
gi 153792725   632 TELGRAEESSPSHFLTPSSPQTTDVSTAEM 661
Cdd:pfam17823  401 PEQVATEATAGTASAGPTPRSSGDPKTLAM 430
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
690-722 1.65e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 42.76  E-value: 1.65e-05
                           10        20        30
                   ....*....|....*....|....*....|...
gi 153792725   690 CTVNPCLHDGKCIVdlTGRGYRCVCPPAWQGEN 722
Cdd:pfam00008    1 CAPNPCSNGGTCVD--TPGGYTCICPEGYTGKR 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
687-723 2.66e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 39.15  E-value: 2.66e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 153792725    687 VNSC-TVNPCLHDGKCIvDLTGrGYRCVCPPAWQ-GENC 723
Cdd:smart00179    2 IDECaSGNPCQNGGTCV-NTVG-SYRCECPPGYTdGRNC 38
 
Name Accession Description Interval E-value
EGF_CA smart00179
Calcium-binding EGF-like domain;
726-757 2.81e-10

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 56.10  E-value: 2.81e-10
                            10        20        30
                    ....*....|....*....|....*....|...
gi 153792725    726 DVNECLS-SPCPPLATCNNTQGSFTCRCPVGYQ 757
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYT 33
PHA03247 PHA03247
large tegument protein UL36; Provisional
319-686 4.22e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 4.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  319 PSQAQPKQSSMSSDDDEPAQSSTESPVLHTSNLPTYTSTVNMPNTLVLDTGTKPVEDPSDSRVPSTQPSPSqPQPFSSAL 398
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP-PPPSPSPA 2634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  399 PSTRSPGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTlvPHRPREPRVTSvqmstaisaiALIPSNQTANPk 478
Cdd:PHA03247 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP--PQRPRRRAARP----------TVGSLTSLADP- 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  479 nqstPQQEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPALTGFSTGPALPATSTSLAqmSPALTSAMPQTT--HSP 556
Cdd:PHA03247 2702 ----PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA--RPPTTAGPPAPAppAAP 2775
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  557 VTSPSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAITTEGNREHTDPTTQPIPLTTSTTSAGERTTELGR 636
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|
gi 153792725  637 AEESSPSHFLTPSSPQTTDVSTAEMLTSRYITFAAQSTSQSPTALPPLTP 686
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
726-758 2.70e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 53.41  E-value: 2.70e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 153792725  726 DVNECLS-SPCPPLATCNNTQGSFTCRCPVGYQL 758
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTG 34
EGF_CA pfam07645
Calcium-binding EGF domain;
726-755 2.55e-08

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 50.70  E-value: 2.55e-08
                           10        20        30
                   ....*....|....*....|....*....|..
gi 153792725   726 DVNECLSSP--CPPLATCNNTQGSFTCRCPVG 755
Cdd:pfam07645    1 DVDECATGThnCPANTVCVNTIGSFECRCPDG 32
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
687-723 3.52e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.55  E-value: 3.52e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 153792725  687 VNSC-TVNPCLHDGKCIvDLTGrGYRCVCPPAWQGENC 723
Cdd:cd00054     2 IDECaSGNPCQNGGTCV-NTVG-SYRCSCPPGYTGRNC 37
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
729-760 4.03e-06

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 44.39  E-value: 4.03e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 153792725  729 EC-LSSPCPPLATCNNTQGSFTCRCPVGYQLEK 760
Cdd:cd00053     1 ECaASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
325-661 6.82e-06

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 49.96  E-value: 6.82e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   325 KQSSMSSDDDEPAQSSTESPVLHTSNLPTYTSTVnmPNTLVLDTGTKPVEDPSDSRVPSTQPSPSQPQPFSSALPSTRSP 404
Cdd:pfam17823   89 EHTPHGTDLSEPATREGAADGAASRALAAAASSS--PSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAAS 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   405 GSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTLVPHRPREPRVTSVQMSTAISAIALIPSNQTANPKNQSTPQ 484
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   485 QEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPA-------LTGFSTGPALPATSTSLAQMS---PALTSAMPQTTH 554
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAkhmpsdtMARNPAAPMGAQAQGPIIQVStdqPVHNTAGEPTPS 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   555 SPVTSPSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAIttegnrEHTDPTTQPIPLTTSTTSAGE---RT 631
Cdd:pfam17823  327 PSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEV------EATSPTTQPSPLLPTQGAAGPgilLA 400
                          330       340       350
                   ....*....|....*....|....*....|
gi 153792725   632 TELGRAEESSPSHFLTPSSPQTTDVSTAEM 661
Cdd:pfam17823  401 PEQVATEATAGTASAGPTPRSSGDPKTLAM 430
PHA03247 PHA03247
large tegument protein UL36; Provisional
332-664 7.29e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 7.29e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  332 DDDEPAQSSTESP--VLHTSNLPTYTSTVNMPNTLVLDTGTKPVEDPSDSRVPSTQPSPSqPQPFSSALP------STRS 403
Cdd:PHA03247 2652 PRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPA-PHALVSATPlppgpaAARQ 2730
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  404 PGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTF--PHPSSTLVPHRPREPRVTSVQMSTAISAIALIPSNQTANPKNQS 481
Cdd:PHA03247 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA 2810
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  482 TPQQEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPALTGFSTGPALPATSTSLAQMSPAL--TSAMPQTTHSPVTS 559
Cdd:PHA03247 2811 VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKpaAPARPPVRRLARPA 2890
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  560 PSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAITTEGnREHTDPTTQPIPLTTSTTSAGERTTELG---- 635
Cdd:PHA03247 2891 VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP-RPQPPLAPTTDPAGAGEPSGAVPQPWLGalvp 2969
                         330       340       350
                  ....*....|....*....|....*....|....*..
gi 153792725  636 --------RAEESSPSHFLTPSSPQTTDVSTAEMLTS 664
Cdd:PHA03247 2970 grvavprfRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
690-722 1.65e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 42.76  E-value: 1.65e-05
                           10        20        30
                   ....*....|....*....|....*....|...
gi 153792725   690 CTVNPCLHDGKCIVdlTGRGYRCVCPPAWQGEN 722
Cdd:pfam00008    1 CAPNPCSNGGTCVD--TPGGYTCICPEGYTGKR 31
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
734-757 1.68e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.59  E-value: 1.68e-05
                           10        20
                   ....*....|....*....|....
gi 153792725   734 PCPPLATCNNTQGSFTCRCPVGYQ 757
Cdd:pfam12947    7 GCHPNATCTNTGGSFTCTCNDGYT 30
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
430-680 2.40e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 48.37  E-value: 2.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   430 YSVSQTTFPHPSSTLVPHRPREPRVTSVQMSTA-ISAIALIPSNQTANPKNQStPQQEKPITEAKSPSLVSPptdsTKAV 508
Cdd:pfam05109  439 FAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdVTSPTPAGTTSGASPVTPS-PSPRDNGTESKAPDMTSP----TSAV 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   509 TVSLPPGAPWSPALTG---FSTGPALPATSTSLA---------QMSPALTSAMPQTTHSPVTSPSTLSHVEALTSGAVVV 576
Cdd:pfam05109  514 TTPTPNATSPTPAVTTptpNATSPTLGKTSPTSAvttptpnatSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSP 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725   577 HTTPKKPHLPTNPEILVPHISTEGAITTEGNREHTDPTTQPIPLTTSTTSAGERttelgraeessPSHFLTPSSPQTTDV 656
Cdd:pfam05109  594 TVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLR-----------PSSISETLSPSTSDN 662
                          250       260
                   ....*....|....*....|....*.
gi 153792725   657 STAEM--LTSRYITFAAQSTSQSPTA 680
Cdd:pfam05109  663 STSHMplLTSAHPTGGENITQVTPAS 688
EGF_CA smart00179
Calcium-binding EGF-like domain;
687-723 2.66e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 39.15  E-value: 2.66e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 153792725    687 VNSC-TVNPCLHDGKCIvDLTGrGYRCVCPPAWQ-GENC 723
Cdd:smart00179    2 IDECaSGNPCQNGGTCV-NTVG-SYRCECPPGYTdGRNC 38
EGF smart00181
Epidermal growth factor-like domain;
729-760 3.38e-04

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 39.04  E-value: 3.38e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 153792725    729 ECLS-SPCPPlATCNNTQGSFTCRCPVGYQLEK 760
Cdd:smart00181    1 ECASgGPCSN-GTCINTPGSYTCSCPPGYTGDK 32
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
730-756 9.64e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 37.75  E-value: 9.64e-04
                           10        20
                   ....*....|....*....|....*..
gi 153792725   730 CLSSPCPPLATCNNTQGSFTCRCPVGY 756
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGY 27
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
693-723 9.77e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.84  E-value: 9.77e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 153792725  693 NPCLHDGKCIVdlTGRGYRCVCPPAWQGE-NC 723
Cdd:cd00053     6 NPCSNGGTCVN--TPGSYRCVCPPGYTGDrSC 35
EB pfam01683
EB module; This domain has no known function. It is found in several C. elegans proteins. The ...
690-763 1.20e-03

EB module; This domain has no known function. It is found in several C. elegans proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges. This domain is found associated with kunitz domains pfam00014.


Pssm-ID: 460294  Cd Length: 52  Bit Score: 37.79  E-value: 1.20e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 153792725   690 CTVNPCLHDGKCIvdltgrgyrcvcPPAWQGENCSVDVNeclsspCPPLATCNNTqgsfTCRCPVGYQLEKGIC 763
Cdd:pfam01683    1 CPPGQVLVNGQCV------------PKVAPGESCEADEQ------CPGGSVCVNG----VCQCPPGFTPVNGRC 52
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
379-624 2.42e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 41.84  E-value: 2.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  379 SRVPSTQPSPSQPQPFS--SALPSTRSPGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTLVPHRPREPRVTS 456
Cdd:PLN03209  321 AKIPSQRVPPKESDAADgpKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSPIPTPPSSSPASSKS 400
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  457 VQMSTAISAIALIPSNQTANPKNQSTP-----QQEKPIT------EAKSPSLVSPPTDSTKAVTVSLPPGAPWSPaltgf 525
Cdd:PLN03209  401 VDAVAKPAEPDVVPSPGSASNVPEVEPaqveaKKTRPLSpyaryeDLKPPTSPSPTAPTGVSPSVSSTSSVPAVP----- 475
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 153792725  526 STGPALPATSTSL---AQMSPaLTSAMPQTTHSPVTSPSTLShvealtsgavvvhTTPKKPHLPTNPEILVPHISTEGAI 602
Cdd:PLN03209  476 DTAPATAATDAAApppANMRP-LSPYAVYDDLKPPTSPSPAA-------------PVGKVAPSSTNEVVKVGNSAPPTAL 541
                         250       260
                  ....*....|....*....|..
gi 153792725  603 TTEGNreHTDPttQPIPLTTST 624
Cdd:PLN03209  542 ADEQH--HAQP--KPRPLSPYT 559
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH