Conserved domains on [gi|2462498115|ref|XP_054188329|]

capping protein, Arp2/3 and myosin-I linker protein 3 isoform X3 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
CARMIL_C pfam16000
CARMIL C-terminus; This domain is found near to the C-terminus of leucine-rich ...
778-1067 1.93e-82

CARMIL C-terminus; This domain is found near the C-terminus of leucine-rich repeat-containing proteins in the CARMIL family. In leucine-rich repeat-containing protein 16A (LRRC16A), it includes the region responsible for interaction with F-actin-capping protein subunit alpha-2 (CAPZA2).


Pssm-ID: 464966 [Multi-domain]  Cd Length: 299  Bit Score: 272.42  E-value: 1.93e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  778 TQELCPVAMRVAEGHNKMLSNVAERVTVPRNFIRGALLEQAGQDIQNKLDEVKLSVVTYLTSSIVDEILQELYHSHKSLA 857
Cdd:pfam16000    1 AESLCPHVMQKAGVRQDLEKALSEKMTLPEEFVKSTLLEQAGVDIFNKLSEVKLSVASFLSDRIVDEVLEALSRSHHKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  858 RHLTQL-RTLSDPPGCP-GQGQDLSSRGRGRNHDHEETTDDELGTNIDTMAIKKQKR-CRKIRPVSAFISGSPQDMESQL 934
Cdd:pfam16000   81 RHLSQRgRTLLEPESLPdGDRPESSPLGPGKRHEGEIERLEELETPMATLKSKRKSIhSRKLRPVSVAFSVSELDLDKAP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  935 GNLGI--------PPGWFSGLGGSQPTASGSWEGLSELPTHGYKLRHQTQgRPRPPRTTPPGPGRPSQMPAPGTRQENGM 1006
Cdd:pfam16000  161 EEVPIhvedassgPPLPSSSPSEPELSASESLDSLSELPTEGQKLQHLTK-GRPKRNKTRAPTRPPGKVGPAQDGEQNGL 239
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462498115 1007 ATRLDEGLEDFFSRRVLEEssSYPRTLRTVRPGLSEAP-LPPLQKKRRRGLFHFRRPRSFKG 1067
Cdd:pfam16000  240 SGRVDEGLEDFFSKKVIKL--STPTSPTSEPSSSSLFPdSPKKRKKRKSGFFNFIKPRSSKG 299
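
Each alignment on this page is introduced by a summary line giving the Pssm-ID, an optional [Multi-domain] flag, the Cd Length, the Bit Score, and the E-value. The following is a minimal Python sketch for reading such a line into typed fields. The function name `parse_summary` and the dictionary keys are illustrative choices, not part of any NCBI tool, and the regular expression assumes the field order shown on this page.

```python
import re

# Pattern for the per-hit summary line, in the field order used on this page.
# The bracketed flag (e.g. "[Multi-domain]") is optional.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict (hypothetical helper)."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError("not a Pssm-ID summary line: %r" % line)
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


if __name__ == "__main__":
    line = ("Pssm-ID: 464966 [Multi-domain]  Cd Length: 299  "
            "Bit Score: 272.42  E-value: 1.93e-82")
    print(parse_summary(line))
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance.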
Carm_PH pfam17888
Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain ...
31-118 5.36e-33

Carmil pleckstrin homology domain; This is a non-canonical pleckstrin homology (PH) domain connected to a 16-leucine-rich repeat domain found in CARMIL (CP Arp2/3 complex myosin-I linker) proteins. The PH domain is interconnected with an N-terminal helix (N-helix), residues 10-20, and a C-terminal linker (Linker), residues 129-147, in Swiss:Q6EDY6. Structural and functional studies indicate that the PH domain is involved in direct binding to the plasma membrane (PM) and that a helical domain (HD) is responsible for antiparallel dimerization and enhancement of CARMIL's membrane-binding activity. Furthermore, it appears that CARMIL's PH domain mediates non-specific binding to the membrane, in contrast to other PH domains that bind polyphosphorylated phosphatidylinositides, which are thought to function as signalling lipids.


Pssm-ID: 436119  Cd Length: 94  Bit Score: 123.16  E-value: 5.36e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115   31 VKLETKPKKFEDRVLALTSWRLHLFLLKVPAKVESSFNVLEIRAFNTLSQNQILVETERGMVSMRLPSAESVDQVTRHVS 110
Cdd:pfam17888    7 VKLETKGDKVEDRILVLTPWRLFLLSAKVPTKVERTFHFLEIRAINSRNPNQVIVETDKSNYSLKLASEEDVDHVVGHIL 86

                   ....*...
gi 2462498115  111 SALSKVCP 118
Cdd:pfam17888   87 TALKKIFP 94
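
The paired rows above (the query `gi 2462498115` over the `Cdd:` profile sequence) can be compared column by column. The sketch below counts identical residues in one such pair of rows. `percent_identity` is a hypothetical helper, and it assumes that `-` marks a gap and that lowercase letters mark unaligned residues, both of which are skipped. It is not the measure CD-Search uses to score a hit, only a quick way to summarise a block.

```python
def percent_identity(query_row, subject_row):
    """Percent of compared columns in which both rows carry the same residue.

    Assumptions (not taken from NCBI documentation): columns containing a
    gap ('-') or a lowercase residue in either row are excluded.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    compared = 0
    matches = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # skip gap and unaligned columns
        compared += 1
        matches += q == s  # True counts as 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    # Final block of the Carm_PH alignment: 4 of 8 columns are identical.
    print(percent_identity("SALSKVCP", "TALKKIFP"))  # 50.0
```

To cover a whole hit, the rows of each block can be concatenated before calling the function, provided the query and profile rows are joined in the same order.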
RNA1 super family cl34950
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
274-657 2.06e-16

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


The actual alignment was detected with superfamily member COG5238:

Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 83.69  E-value: 2.06e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  274 VLHALTLSHNPIEDKGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPA---FASSLRYLDLSKNPGLLAT 350
Cdd:COG5238     88 QLLVVDWEGAEEVSPVALAETATAVATPPPDLRRIMAKTLEDSLILYLALPRRINLIQvlkDPLGGNAVHLLGLAARLGL 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  351 DEANALYSFLaQPNALVHLDLSGT---DCVIDLLLGALLHGccSHLTYLNLARNSCshrkGREAPPAFKQFFSSAYTLSH 427
Cdd:COG5238    168 LAAISMAKAL-QNNSVETVYLGCNqigDEGIEELAEALTQN--TTVTTLWLKRNPI----GDEGAEILAEALKGNKSLTT 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  428 VNLSATKLPLEALRALLQGLSLNSHLSdlHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGF-DSDLLTLVPALGK 506
Cdd:COG5238    241 LDLSNNQIGDEGVIALAEALKNNTTVE--TLYLSGNQIGAEGAIALAKALQGNTTLTSLDLSVNRIgDEGAIALAEGLQG 318
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  507 NKSLKHLFLGKNfNVKAKTLEeilhklvqliqeedcslqslsvadsrlklrtsILINALGSNTCLAKVDLSGNGMEDIGA 586
Cdd:COG5238    319 NKTLHTLNLAYN-GIGAQGAI--------------------------------ALAKALQENTTLHSLDLSDNQIGDEGA 365
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462498115  587 KMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNhTLRFMSFPVSDISQAYRSapeRTEDVWQKIQW 657
Cdd:COG5238    366 IALAKYLEGNTTLRELNLGKNNIGKQGAEALIDALQTN-RLHTLILDGNLIGAEAQQ---RLEQLLERIKS 432
PRK07764 super family cl35613
DNA polymerase III subunits gamma and tau; Validated
1122-1338 5.81e-06

DNA polymerase III subunits gamma and tau; Validated


The actual alignment was detected with superfamily member PRK07764:

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 5.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1122 GGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGlerakGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPG 1201
Cdd:PRK07764   597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA-----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1202 SWKPPPPPQSTKPSFSAMRRAEATwHIAEESAPNHSCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPPkP 1281
Cdd:PRK07764   672 KAGGAAPAAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-P 749
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462498115 1282 VAVPRGRQPPQEPGVREEAEAGDAAPgvnkprlrlSSQQDQEEPEV----QGPPDPGRRTA 1338
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPP---------PSPPSEEEEMAeddaPSMDDEDRRDA 801
 
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
339-600 6.89e-13

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 71.23  E-value: 6.89e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  339 LDLSKNpgLLATDEANALYSFLAQPNALVHLDLS-----GTDCVIDLLLGALLHGCCshLTYLNLARNSCShrkgrEAPP 413
Cdd:cd00116     28 LRLEGN--TLGEEAAKALASALRPQPSLKELCLSlnetgRIPRGLQSLLQGLTKGCG--LQELDLSDNALG-----PDGC 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  414 AFKQFFSSAYTLSHVNLSATKLPLEALRALLQGL-SLNSHLSDLhlDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNG 492
Cdd:cd00116     99 GVLESLLRSSSLQELKLNNNGLGDRGLRLLAKGLkDLPPALEKL--VLGRNRLEGASCEALAKALRANRDLKELNLANNG 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  493 F-DSDLLTLVPALGKNKSLKHLFLGKNF--NVKAKTLEEILHKLVQL--IQEEDCSLQSLSVADsrlklrtsiLINALGS 567
Cdd:cd00116    177 IgDAGIRALAEGLKANCNLEVLDLNNNGltDEGASALAETLASLKSLevLNLGDNNLTDAGAAA---------LASALLS 247
                          250       260       270
                   ....*....|....*....|....*....|....
gi 2462498115  568 -NTCLAKVDLSGNGMEDIGAKMLSKALQINSSLR 600
Cdd:cd00116    248 pNISLLTLSLSCNDITDDGAKDLAEVLAEKESLL 281
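
The description of this hit quotes the beta-strand consensus LxxLxLxxN/CxL. As a rough illustration, the pattern can be written as a regular expression and run over a sequence. This is a sketch only: `find_lrr_strands` is a hypothetical helper, the strict pattern will miss real repeats that deviate from the consensus, and it is no substitute for the profile-based search that produced the alignment above.

```python
import re

# LxxLxLxxN/CxL: 'x' is any residue, 'N/C' is asparagine or cysteine.
# The lookahead lets overlapping matches be reported as well.
LRR_STRAND = re.compile(r"(?=(L..L.L..[NC].L))")


def find_lrr_strands(seq):
    """Return (1-based start, matched text) for each strict consensus match.

    Gap characters are removed and letters are upper-cased first, so the
    positions refer to the ungapped sequence that was passed in.
    """
    ungapped = seq.replace("-", "").upper()
    return [(m.start() + 1, m.group(1)) for m in LRR_STRAND.finditer(ungapped)]


if __name__ == "__main__":
    # Fragment of the cd00116 profile row from the first alignment block.
    print(find_lrr_strands("GCG--LQELDLSDNALG"))  # [(4, 'LQELDLSDNAL')]
```

Because the positions are relative to the string supplied, the interval start of the hit has to be added to map a match back to query coordinates.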
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1239-1370 1.18e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.30  E-value: 1.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1239 QSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPpqEPGVREEAEAG--DAAPGVN 1310
Cdd:NF033839   359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPEKP--KPEVKPQPEKPkpEVKPQPE 436
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462498115 1311 KPRLRLSSQQDQEEPEVQ---GPPDPGRRTAPLKPKRTRRAQScDKLEPDRRRP-PDPTGTSEP 1370
Cdd:NF033839   437 KPKPEVKPQPEKPKPEVKpqpETPKPEVKPQPEKPKPEVKPQP-EKPKPDNSKPqADDKKPSTP 499
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1237-1372 5.97e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 40.91  E-value: 5.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1237 SCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVN 1310
Cdd:NF033839   324 QLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPE 403
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462498115 1311 KPRLRLSSQQDQEEPEVQGPPDPGR---RTAPLKPKRTRRAQscdklePDRRRP--------PDPTGTSEPGT 1372
Cdd:NF033839   404 KPKPEVKPQPEKPKPEVKPQPEKPKpevKPQPEKPKPEVKPQ------PEKPKPevkpqpetPKPEVKPQPEK 470
 
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
189-477 8.56e-14

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 75.21  E-value: 8.56e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  189 REFNLLDFSHLESRDLALMVAALAYNQWFTKLYCKDLRLGSEVLEQVLHTLSKSGSLEELVLDNAGLKTDFVQKLAGVFG 268
Cdd:COG5238    154 NAVHLLGLAARLGLLAAISMAKALQNNSVETVYLGCNQIGDEGIEELAEALTQNTTVTTLWLKRNPIGDEGAEILAEALK 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  269 ENGScvLHALTLSHNPIEDKGFLSLSqQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPafasSLRYLDLSKNPgll 348
Cdd:COG5238    234 GNKS--LTTLDLSNNQIGDEGVIALA-EALKNNTTVETLYLSGNQIGAEGAIALAKALQGNT----TLTSLDLSVNR--- 303
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  349 ATDE-ANALYSFLAQPNALVHLDLS----GTDCVIdLLLGALLHGccSHLTYLNLARNscshRKGREAPPAFKQFFSSAY 423
Cdd:COG5238    304 IGDEgAIALAEGLQGNKTLHTLNLAyngiGAQGAI-ALAKALQEN--TTLHSLDLSDN----QIGDEGAIALAKYLEGNT 376
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462498115  424 TLSHVNLSATKLPLEALRALLQGLSLNShlsdLH-LDLSSCELRSAGAQALQEQL 477
Cdd:COG5238    377 TLRELNLGKNNIGKQGAEALIDALQTNR----LHtLILDGNLIGAEAQQRLEQLL 427
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
428-628 7.65e-11

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 65.07  E-value: 7.65e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  428 VNLSATKLPLEALRALlqGLSLNSHLSDLHLDLSSCELRS--AGAQALQEQLGAVTCVGSLDLSDNGFDSDLLTLVPALG 505
Cdd:cd00116     28 LRLEGNTLGEEAAKAL--ASALRPQPSLKELCLSLNETGRipRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLL 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  506 KNKSLKHLFLGKN------FNVKAKTLEEILHKLVQLIQE-----------------EDCSLQSLSVADSRLKLR-TSIL 561
Cdd:cd00116    106 RSSSLQELKLNNNglgdrgLRLLAKGLKDLPPALEKLVLGrnrlegascealakalrANRDLKELNLANNGIGDAgIRAL 185
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462498115  562 INALGSNTCLAKVDLSGNGMEDIGAKMLSKALQINSSLRTILWDRNNTSALGFLDIARALES-NHTLR 628
Cdd:cd00116    186 AEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSpNISLL 253
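
The cd00116 description gives the beta-strand consensus as LxxLxLxxN/CxL. A minimal Python sketch of scanning an aligned row for that pattern follows. The names find_lrr_motifs, STRICT and RELAXED are hypothetical, and the relaxed variant, which accepts I, V, M or F in place of L, is an assumption added here for illustration rather than part of the CDD description.

```python
import re

# Literal reading of the consensus: L-x-x-L-x-L-x-x-(N or C)-x-L
STRICT = "L..L.L..[NC].L"

# Assumption: allow other hydrophobic residues at the leucine positions.
HYDROPHOBIC = "[LIVMF]"
RELAXED = f"{HYDROPHOBIC}..{HYDROPHOBIC}.{HYDROPHOBIC}..[NC].{HYDROPHOBIC}"


def find_lrr_motifs(row: str, pattern: str = STRICT):
    """Return (1-based offset, matched text) pairs, including overlapping matches.

    Gap characters are removed first, so offsets refer to the ungapped segment,
    not to the residue numbering of the full protein.
    """
    seq = row.replace("-", "").upper()
    # A lookahead with a capture group lets finditer report overlapping hits.
    return [(m.start() + 1, m.group(1)) for m in re.finditer(f"(?=({pattern}))", seq)]


# Segments taken from the first alignment block above.
model_segment = "GLQELDLSDNALGPDG"   # Cdd:cd00116 row
query_segment = "CVGSLDLSDNGFDSD"    # gi 2462498115 row

print(find_lrr_motifs(model_segment))            # strict pattern on the model row
print(find_lrr_motifs(query_segment))            # strict pattern on the query row
print(find_lrr_motifs(query_segment, RELAXED))   # relaxed pattern on the query row
```

The query segment only matches under the relaxed pattern, which is one reason profile-based searches such as the one on this page are used instead of a fixed regular expression.
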
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
334-663 1.30e-09


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 1.30e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  334 SSLRYLDLSKNPGLlatdeanalysflAQPNALVHLDLSGTDCV-IDLLLGALlhgccSHLTYLNLARNSCShrkgrEAP 412
Cdd:COG4886     96 TNLTELDLSGNEEL-------------SNLTNLESLDLSGNQLTdLPEELANL-----TNLKELDLSNNQLT-----DLP 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  413 PAFKQFFSsaytLSHVNLSATKL-----PLEALRALlQGLSL-NSHLSDLHLDLSSCelrsagaQALQEqlgavtcvgsL 486
Cdd:COG4886    153 EPLGNLTN----LKSLDLSNNQLtdlpeELGNLTNL-KELDLsNNQITDLPEPLGNL-------TNLEE----------L 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  487 DLSDNGFDSdlltLVPALGKNKSLKHLFLGKNfnvKAKTLEEILhklvQLIqeedcSLQSLSVADSRLKLrtsilINALG 566
Cdd:COG4886    211 DLSGNQLTD----LPEPLANLTNLETLDLSNN---QLTDLPELG----NLT-----NLEELDLSNNQLTD-----LPPLA 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  567 SNTCLAKVDLSGNGMEDIGAKMLSKALQINSSLRTILWDRNNTSALGFLDIARALESNHTLRFMSFPVSDISQAYRSAPE 646
Cdd:COG4886    270 NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLAL 349
                          330
                   ....*....|....*..
gi 2462498115  647 RTEDVWQKIQWCLVRNN 663
Cdd:COG4886    350 LTLLLLLNLLSLLLTLL 366
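
Every hit carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The Python sketch below parses one such line and estimates how much of the domain model the alignment spans, using the first and last model positions shown in the COG4886 alignment above (96 and 366). The names parse_stats and model_coverage are hypothetical, and the regular expression only assumes the field layout visible on this page.

```python
import re

STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]*)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_stats(line: str) -> dict:
    """Extract the numeric fields from a 'Pssm-ID: ...' statistics line."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "evalue": float(m.group("evalue")),
    }


def model_coverage(first: int, last: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by positions first..last (inclusive)."""
    return (last - first + 1) / cd_length


stats = parse_stats(
    "Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 61.87  E-value: 1.30e-09"
)
coverage = model_coverage(96, 366, stats["cd_length"])
print(stats)
print(f"model coverage: {coverage:.2f}")
```
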
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
210-493 1.60e-07

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 54.67  E-value: 1.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  210 ALAYNQWFTKLYCKDLRLGS--EVLEQVLHTLSKSGSLEELVLDNAGLKTDFVQKLAGVFGengSCVLHALTLSHNPIED 287
Cdd:cd00116     46 ALRPQPSLKELCLSLNETGRipRGLQSLLQGLTKGCGLQELDLSDNALGPDGCGVLESLLR---SSSLQELKLNNNGLGD 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  288 KGFLSLSQQLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANPafasSLRYLDLSKNPglLATDEANALYSFLAQPNALV 367
Cdd:cd00116    123 RGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANR----DLKELNLANNG--IGDAGIRALAEGLKANCNLE 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  368 HLDLSgtDCVID-----LLLGALLHGCCshLTYLNLARNSCSHRKGREAPPAFKqffSSAYTLSHVNLSATKLPLEALRA 442
Cdd:cd00116    197 VLDLN--NNGLTdegasALAETLASLKS--LEVLNLGDNNLTDAGAAALASALL---SPNISLLTLSLSCNDITDDGAKD 269
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462498115  443 LLQGLSLNSHLsdLHLDLSSCELRSAGAQALQEQLGAVTC-VGSLDLSDNGF 493
Cdd:cd00116    270 LAEVLAEKESL--LELDLRGNKFGEEGAQLLAESLLEPGNeLESLWVKDDSF 319
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
185-501 3.53e-07

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 53.90  E-value: 3.53e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  185 AEDNREFNLLDFS--HLESRDLALMVAALAYnQWFTKLYCKDLR------LGSEVLEQVLHtlskSGSLEELVLDNAGLK 256
Cdd:cd00116     47 LRPQPSLKELCLSlnETGRIPRGLQSLLQGL-TKGCGLQELDLSdnalgpDGCGVLESLLR----SSSLQELKLNNNGLG 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  257 TDFVQKLAGVFGENgSCVLHALTLSHNPIEDKGFLSLSQqLLCFPSGLTKLCLAKTAISPRGLQALGQTFGANpafaSSL 336
Cdd:cd00116    122 DRGLRLLAKGLKDL-PPALEKLVLGRNRLEGASCEALAK-ALRANRDLKELNLANNGIGDAGIRALAEGLKAN----CNL 195
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  337 RYLDLSKNpgLLATDEANALYSFLAQPNALVHLDLSgtDCVIDLLlgallhgCCSHLtylnlarnscshrkgreappafk 416
Cdd:cd00116    196 EVLDLNNN--GLTDEGASALAETLASLKSLEVLNLG--DNNLTDA-------GAAAL----------------------- 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  417 qffssaytlshvnlsatklpLEALRALLQGLslnshlsdLHLDLSSCELRSAGAQALQEQLGAVTCVGSLDLSDNGFDSD 496
Cdd:cd00116    242 --------------------ASALLSPNISL--------LTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRGNKFGEE 293

                   ....*
gi 2462498115  497 LLTLV 501
Cdd:cd00116    294 GAQLL 298
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1122-1338 5.81e-06


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 5.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1122 GGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGlerakGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPG 1201
Cdd:PRK07764   597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA-----EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1202 SWKPPPPPQSTKPSFSAMRRAEATwHIAEESAPNHSCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPPkP 1281
Cdd:PRK07764   672 KAGGAAPAAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-P 749
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462498115 1282 VAVPRGRQPPQEPGVREEAEAGDAAPgvnkprlrlSSQQDQEEPEV----QGPPDPGRRTA 1338
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPP---------PSPPSEEEEMAeddaPSMDDEDRRDA 801
PHA03169 PHA03169
hypothetical protein; Provisional
1123-1337 3.95e-05


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 47.66  E-value: 3.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1123 GGRGPSFRRKMGTEGSEPGEGGPAPGTA--QQPRVHGVALPGLERAKGWSF----DGKREGPGPDQEGSTQAWQKRRSSD 1196
Cdd:PHA03169    34 GRRRGTAARAAKPAPPAPTTSGPQVRAVaeQGHRQTESDTETAEESRHGEKeergQGGPSGSGSESVGSPTPSPSGSAEE 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1197 DAGPGSWKPPPPPQSTKPSFSAMRRAEATWHIAEESAPNHScQSPSPASQ-DGEEEKEGTLFPERTLPARNAKLQDPALA 1275
Cdd:PHA03169   114 LASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES-HNPSPNQQpSSFLQPSHEDSPEEPEPPTSEPEPDSPGP 192
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462498115 1276 PWPPKPVAVPRGRQPPQEPGvREEAEAGDAAPGVNKPRlRLSSQQDQEEPEVQGPPDPGRRT 1337
Cdd:PHA03169   193 PQSETPTSSPPPQSPPDEPG-EPQSPTPQQAPSPNTQQ-AVEHEDEPTEPEREGPPFPGHRS 252
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
239-576 7.51e-05


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 46.85  E-value: 7.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  239 LSKSGSLEELVLDNAGLKT--DFVQKLAGvfgengscvLHALTLSHNPIEDkgflsLSQQLLCFPSgLTKLCLAKTAIS- 315
Cdd:COG4886    109 LSNLTNLESLDLSGNQLTDlpEELANLTN---------LKELDLSNNQLTD-----LPEPLGNLTN-LKSLDLSNNQLTd 173
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  316 -PRGLQALgqtfganpafaSSLRYLDLSKNPgllATDEANAlysfLAQPNALVHLDLSGTD-CVIDLLLGALlhgccSHL 393
Cdd:COG4886    174 lPEELGNL-----------TNLKELDLSNNQ---ITDLPEP----LGNLTNLEELDLSGNQlTDLPEPLANL-----TNL 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  394 TYLNLARNscshrKGREAPpafkqFFSSAYTLSHVNLSATKlplealralLQGLSLNSHLSDL-HLDLSSCELRSAGAQA 472
Cdd:COG4886    231 ETLDLSNN-----QLTDLP-----ELGNLTNLEELDLSNNQ---------LTDLPPLANLTNLkTLDLSNNQLTDLKLKE 291
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115  473 LQE-------QLGAVTCVGSLDLSDNGFDSDLLTLVPALGKNKSLKHLFLGKNFNVKAKTLEEILHKLVQLIQEEDCSLQ 545
Cdd:COG4886    292 LELllglnslLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGL 371
                          330       340       350
                   ....*....|....*....|....*....|.
gi 2462498115  546 SLSVADSRLKLRTSILINALGSNTCLAKVDL 576
Cdd:COG4886    372 LGLLEATLLTLALLLLTLLLLLLTTTAGVLL 402
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1239-1370 1.18e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.30  E-value: 1.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1239 QSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPpqEPGVREEAEAG--DAAPGVN 1310
Cdd:NF033839   359 EKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPEKP--KPEVKPQPEKPkpEVKPQPE 436
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462498115 1311 KPRLRLSSQQDQEEPEVQ---GPPDPGRRTAPLKPKRTRRAQScDKLEPDRRRP-PDPTGTSEP 1370
Cdd:NF033839   437 KPKPEVKPQPEKPKPEVKpqpETPKPEVKPQPEKPKPEVKPQP-EKPKPDNSKPqADDKKPSTP 499
PHA03247 PHA03247
large tegument protein UL36; Provisional
1120-1368 1.78e-04


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 1.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1120 GFGGGRGPSFRRKMGTEGSEPGEGGPAPGTAQQPRVHGVALPGLERAKgwsfdgkreGPGPDQEGSTQAWQKRRSSDDAG 1199
Cdd:PHA03247   262 GEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLAL---------PAPPDPPPPAPAGDAEEEDDEDG 332
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1200 PGSWKPPPPPQSTKPSFSAMRRAEATW----HIAEESAPNHSCQSPSPASQDGEEEKEG-TLFPERTLPARNAKLQDPAL 1274
Cdd:PHA03247   333 AMEVVSPLPRPRQHYPLGFPKRRRPTWtppsSLEDLSAGRHHPKRASLPTRKRRSARHAaTPFARGPGGDDQTRPAAPVP 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1275 APWP-PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVnkprlrlssqqdqeEPEVQGPPDPGRRTAPLKPKRTRRAqscdk 1353
Cdd:PHA03247   413 ASVPtPAPTPVPASAPPPPATPLPSAEPGSDDGPAP--------------PPERQPPAPATEPAPDDPDDATRKA----- 473
                          250
                   ....*....|....*.
gi 2462498115 1354 LEPDR-RRPPDPTGTS 1368
Cdd:PHA03247   474 LDALReRRPPEPPGAD 489
PRK11633 PRK11633
cell division protein DedD; Provisional
1281-1370 7.70e-04


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 42.68  E-value: 7.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1281 PVAVPRGRQPPqePGVREEAEAGDAAPGVNKP-RLRLSSQQDQEEPEVQGPPDPGRRTAPlKPKRTRRAQSCDKLEPDRR 1359
Cdd:PRK11633    58 AATQALPTQPP--EGAAEAVRAGDAAAPSLDPaTVAPPNTPVEPEPAPVEPPKPKPVEKP-KPKPKPQQKVEAPPAPKPE 134
                           90
                   ....*....|.
gi 2462498115 1360 RPPDPTGTSEP 1370
Cdd:PRK11633   135 PKPVVEEKAAP 145
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1105-1373 2.93e-03


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 2.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1105 SSPCWSPEEESSLLPGFGGGRGPSFRRK---MGTEGSEPGEGGPAP-GTAQQPRVHGVALPGLER-AKGWSFDGKREGPG 1179
Cdd:PHA03307   206 PPRRSSPISASASSPAPAPGRSAADDAGassSDSSSSESSGCGWGPeNECPLPRPAPITLPTRIWeASGWNGPSSRPGPA 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1180 PDQEGStqawqkRRSSDDAGPGSwkpppppqSTKPSFSAMRRAeatwhiAEESAPNHSCQSPSPASQDgeeekegtlFPE 1259
Cdd:PHA03307   286 SSSSSP------RERSPSPSPSS--------PGSGPAPSSPRA------SSSSSSSRESSSSSTSSSS---------ESS 336
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1260 RTLPARNAKLQDPALAPWPPKPVAV---PRGRQPPQEPGVREEAEAGDAAP-GVNKPRLRLSSQQDQeepevqgppdPGR 1335
Cdd:PHA03307   337 RGAAVSPGPSPSRSPSPSRPPPPADpssPRKRPRPSRAPSSPAASAGRPTRrRARAAVAGRARRRDA----------TGR 406
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 2462498115 1336 RTAPLKPKRTRRAQSCDKLEPDRRRPPDPTGTSEPGTD 1373
Cdd:PHA03307   407 FPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPWPGSP 444
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1135-1373 4.03e-03


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 4.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1135 TEGSEPGEGGPA--PGTAQQPRVHGVALPGLERAKGWSFDGKREGPGPDQEGSTQAWQKRRSSDDAGPGSWKPPPPPQST 1212
Cdd:PRK07003   363 TGGGAPGGGVPArvAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDA 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1213 KPSFSAMRRAEATWHIAEESAPNHSCQSPS-------------PASQDGEEEKEGTLFPERTLPARNAKLQDPALAPWPP 1279
Cdd:PRK07003   443 ADGDAPVPAKANARASADSRCDERDAQPPAdsgsasapasdapPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAP 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1280 KPVAVPRGRQPPQEPGV-REEAEAGDAAPGVNKPR---LRLSSQQDQE---------EPEVQGPPDPGRRTAPLKPKRTR 1346
Cdd:PRK07003   523 AAAAPPAPEARPPTPAAaAPAARAGGAAAALDVLRnagMRVSSDRGARaaaaakpaaAPAAAPKPAAPRVAVQVPTPRAR 602
                          250       260
                   ....*....|....*....|....*....
gi 2462498115 1347 RAQSCDKLEPDRRRPP--DPTGTSEPGTD 1373
Cdd:PRK07003   603 AATGDAPPNGAARAEQaaESRGAPPPWED 631
PHA03378 PHA03378
EBNA-3B; Provisional
1146-1370 4.63e-03


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.59  E-value: 4.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1146 APGTAQQP--RVHGVALPGlerakgwsfDGKREGPGPDQEGSTQaWQKRRSSDDAGPGSWKPPPPPQSTKPSFSAMRRAE 1223
Cdd:PHA03378   589 APSYAQTPwpVPHPSQTPE---------PPTTQSHIPETSAPRQ-WPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVE 658
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1224 ATWHIAEESAPNHSCQSPSPASQDgeeekegTLFPERTLPARnakLQDPALAPWPPKPVAVPRGR-QPPQEPGVREEAEA 1302
Cdd:PHA03378   659 ITPYKPTWTQIGHIPYQPSPTGAN-------TMLPIQWAPGT---MQPPPRAPTPMRPPAAPPGRaQRPAAATGRARPPA 728
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462498115 1303 GdaAPGVNKPRlrlssqqdqeepevQGPPDPGRRTAPlKPKRTRRAQSCdklePDRRRPPD-----PTGTSEP 1370
Cdd:PHA03378   729 A--APGRARPP--------------AAAPGRARPPAA-APGRARPPAAA----PGRARPPAaapgaPTPQPPP 780
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1237-1372 5.97e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 40.91  E-value: 5.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462498115 1237 SCQSPSPASQDGEEEKEGTLFPERTLPARNAKLQ----DPALAPWP--PKPVAVPRGRQPPQEPGVREEAEAGDAAPGVN 1310
Cdd:NF033839   324 QLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQpekpKPEVKPQPekPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPE 403
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462498115 1311 KPRLRLSSQQDQEEPEVQGPPDPGR---RTAPLKPKRTRRAQscdklePDRRRP--------PDPTGTSEPGT 1372
Cdd:NF033839   404 KPKPEVKPQPEKPKPEVKPQPEKPKpevKPQPEKPKPEVKPQ------PEKPKPevkpqpetPKPEVKPQPEK 470
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01
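
The E-value threshold of 0.01 listed above explains why every hit on this page reports an E-value below that figure. As a hedged illustration, the Python sketch below parses the "interval E-value" rows from the hit list and re-filters them at a chosen cutoff. The helper names parse_hit and filter_hits are hypothetical, and treating the cutoff as inclusive is an assumption, since the page does not say.

```python
def parse_hit(row: str):
    """Split a row such as '428-628 7.65e-11' into (start, end, evalue)."""
    interval, evalue = row.split()
    start, end = (int(part) for part in interval.split("-"))
    return start, end, float(evalue)


def filter_hits(rows, threshold=0.01):
    """Keep rows whose E-value is at or below the threshold (inclusive by assumption)."""
    return [row for row in rows if parse_hit(row)[2] <= threshold]


# Interval and E-value rows copied from the hit list on this page.
rows = [
    "428-628 7.65e-11",
    "1122-1338 5.81e-06",
    "1105-1373 2.93e-03",
    "1237-1372 5.97e-03",
]

print(filter_hits(rows))        # page default: all four rows pass
print(filter_hits(rows, 1e-5))  # a stricter cutoff keeps fewer rows
```
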
