Conserved domains on  [gi|442619634|ref|NP_001262675|]

uncharacterized protein Dmel_CG14322, isoform B [Drosophila melanogaster]

Protein Classification

WD40 repeat domain-containing protein (domain architecture ID 11455410)

WD40 repeat domain-containing protein similar to proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
PubMed:  10322433|8090199
SCOP:  4002744

List of domain hits

Name  Accession  Description  Interval  E-value
WD40  COG2319  WD40 repeat [General function prediction only]  1327-1586  6.79e-08

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 56.84  E-value: 6.79e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634 1327 VIAAAEDGDIYVFHLVTHKLEQKITKHSEAITNMFLSEKDSILYTTSADGffkkssllnlervfeTVYL-----KEPLQS 1401
Cdd:COG2319    93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG---------------TVRLwdlatGKLLRT 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634 1402 M----DVAWGLAF--------IGSRWGQISTFNVVTNKVVeKPLVSTGQSIIAIKATKEGvrKILVLGCKGNFVQMHDAG 1469
Cdd:COG2319   158 LtghsGAVTSVAFspdgkllaSGSDDGTVRLWDLATGKLL-RTLTGHTGAVRSVAFSPDG--KLLASGSADGTVRLWDLA 234
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634 1470 NGLLLRhVFIAEGLNIYSLLL--DEGHIYCGTQKNELYQLEFVSGNLVTKFSCGNGAVAVAAY--GERYLLVGCYDGYIY 1545
Cdd:COG2319   235 TGKLLR-TLTGHSGSVRSVAFspDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFspDGKLLASGSDDGTVR 313
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|...
gi 442619634 1546 VLNKITGTQTGRFAGAGRMV--LALSVVGDKIVTSSKDNSLAI 1586
Cdd:COG2319   314 LWDLATGKLLRTLTGHTGAVrsVAFSPDGKTLASGSDDGTVRL 356
PTZ00395 super family  cl33180  Sec24-related protein; Provisional  5-277  9.78e-06

The actual alignment was detected with superfamily member PTZ00395:

Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 50.46  E-value: 9.78e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634    5 QQQRGESGGGAPPDTGRDSA--SQPAKSSTA-SGSGHSATCSHSPSSKNRRQNNSmKYNSRPWQRGRSDSGHFNNRVHSN 81
Cdd:PTZ00395  373 PDARGAWAGGPHSNASYNCAaySNAAQSNAAqSNAGFSNAGYSNPGNSNPGYNNA-PNSNTPYNNPPNSNTPYSNPPNSN 451
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634   82 HNQRQTPYPKSNYENRGRSDTRSSNQERQERDYTKRTDFRERNTRFSDERHSGRYSdyhrRNHYHrfkswGSEGRSYRRD 161
Cdd:PTZ00395  452 PPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQPA----ANNFH-----GAAGNSVGNP 522
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634  162 KGARDLSKSRYDNDASGSSlirgaseirrknghncDPNSDASKTENSEDIQSCINHYQTEENKIDTEQSKDQGANRTSPT 241
Cdd:PTZ00395  523 FASRPFGSAPYGGNAATTA----------------DPNGIAKREDHPEGGTNRQKYEQSDEESVESSSSENSSENENEVT 586
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 442619634  242 EQLS-------------DISKQSNPIALEGDQNKKTNELKESSCS--SKPS 277
Cdd:PTZ00395  587 DKGEeiysllkktinriDMNKIPRPIINTQEKKKKKNLKVFETCKyiSPPS 637
 
assembly_YfgL  TIGR03300  outer membrane assembly lipoprotein YfgL; Members of this protein family are YfgL, a ...  1488-1588  1.76e-04

outer membrane assembly lipoprotein YfgL; Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ. [Protein fate, Protein and peptide secretion and trafficking]


Pssm-ID: 274511 [Multi-domain]  Cd Length: 377  Bit Score: 45.69  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634  1488 LLLDEGHIYCGTQKNELYQLEFVSGNLVTKfscgNGAV------AVAAYGeRYLLVGCYDGYIYVLNkitgTQTGRFagA 1561
Cdd:TIGR03300  275 PAVDDNRLYVTDADGVVVALDRRSGSELWK----NDELkyrqltAPAVLG-GYLVVGDFEGYLHWLD----RDDGSF--V 343
                           90       100       110
                   ....*....|....*....|....*....|....
gi 442619634  1562 GRM-------VLALSVVGDKIVTSSKDNSLAILE 1588
Cdd:TIGR03300  344 ARLktdgsgiASPPVVVGDGLLVQTRDGDLYAFR 377
MSCRAMM_ClfA  NF033609  MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...  10-318  2.37e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 2.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634   10 ESGGGAPPDTGRDSASQPAKSSTASGSGHSATCSHSPSSKNRRQNNSMKYNSRPWQRGRSDSGHfNNRVHSNHNQRQTPY 89
Cdd:NF033609  581 DSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDS-DSDSDSDSDSDSDSD 659
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634   90 PKSNYENRGRSDTRSSNQERQERDYTKRTDFRERNTRFSDerhSGRYSDYHRRNHYHRFKSWGSEGRSYRRDKGARDlSK 169
Cdd:NF033609  660 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SD 735
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634  170 SRYDNDASGSSLIRGASEIRRKNGHNCDPNSDA-SKTENSEDIQSCINHYQTEENKIDTEQSKDQGANRTSPTEQLSDIS 248
Cdd:NF033609  736 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 815
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634  249 KQSNPIALEGDQNKKTNELKESSCSSKPSREINETSKASQFPDQRSKEQSSGEELNVSESSDNHSGAQAS 318
Cdd:NF033609  816 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNAS 885
WD40  cd00200  WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...  1324-1379  4.33e-04

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 44.25  E-value: 4.33e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 442619634 1324 RDNVIAAAEDGDIYVFHLVTHKLEQKITKHSEAITNMFLSEKDSILYTTSADGFFK 1379
Cdd:cd00200   231 GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIR 286
U2AF_lg  TIGR01642  U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ...  74-171  1.68e-03

U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 42.96  E-value: 1.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634    74 FNNRVHSNHNQRQTPYPKSNYENRGRSDTRSSNQERQERDYTKRTDFRERNTRFSDERHSGRYSDYHRRNHYHRFKSwgs 153
Cdd:TIGR01642    4 EPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRR--- 80
                           90
                   ....*....|....*...
gi 442619634   154 egRSYRRDKGARDLSKSR 171
Cdd:TIGR01642   81 --RSRSVRSIEQHRRRLR 96
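
The alignments above can be summarised numerically. The following is a minimal Python sketch, not part of the CD-Search output, that counts identical columns in one query/Cdd row pair, using the cd00200 rows (query residues 1324-1379) as input. The function name `alignment_identity` is hypothetical, and the sketch assumes the display convention that uppercase letters are aligned residues, lowercase letters are unaligned residues, and `-` marks a gap.

```python
def alignment_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical columns, aligned columns) for one alignment row pair.

    Assumption: uppercase letters are aligned residues, lowercase letters are
    unaligned residues, and '-' is a gap. Only columns where both rows carry
    an uppercase letter are counted as aligned.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the cd00200 hit (query 1324-1379, Cdd 231-286).
query = "RDNVIAAAEDGDIYVFHLVTHKLEQKITKHSEAITNMFLSEKDSILYTTSADGFFK"
subject = "GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIR"

identical, aligned = alignment_identity(query, subject)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

For multi-row alignments such as the COG2319 hit, the same function could be applied to the concatenated rows, since lowercase and gap columns are skipped.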
 
WD40  COG2319  WD40 repeat [General function prediction only]  1327-1586  2.86e-07

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 54.92  E-value: 2.86e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634 1327 VIAAAEDGDIYVFHLVTHKLEQKITKHSEAITNMFLSEKDSILYTTSADGffkkssllnlervfeTVYL-----KEPLQS 1401
Cdd:COG2319   135 LASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDG---------------TVRLwdlatGKLLRT 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634 1402 M----DVAWGLAF--------IGSRWGQISTFNVVTNKVVeKPLVSTGQSIIAIKATKEGvrKILVLGCKGNFVQMHDAG 1469
Cdd:COG2319   200 LtghtGAVRSVAFspdgkllaSGSADGTVRLWDLATGKLL-RTLTGHSGSVRSVAFSPDG--RLLASGSADGTVRLWDLA 276
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634 1470 NGLLLRhVFIAEGLNIYSLLL--DEGHIYCGTQKNELYQLEFVSGNLVTKFSCGNG---AVAVAAYGeRYLLVGCYDGYI 1544
Cdd:COG2319   277 TGELLR-TLTGHSGGVNSVAFspDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGavrSVAFSPDG-KTLASGSDDGTV 354
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....
gi 442619634 1545 YVLNKITGTQTGRFAGAGRMV--LALSVVGDKIVTSSKDNSLAI 1586
Cdd:COG2319   355 RLWDLATGELLRTLTGHTGAVtsVAFSPDGRTLASGSADGTVRL 398
PRK12678  PRK12678  transcription termination factor Rho; Provisional  2-210  1.45e-03

Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.35  E-value: 1.45e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634    2 SAPQQQRGESGGGAPPDTGRDSASQPAKSSTASGSGHSATCSHSPSSKNRRQNNSMKYNSRPWQRGRSDSGHFNNRVHSN 81
Cdd:PRK12678   99 AAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDR 178
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 442619634   82 HNQRQTpypksnyENRGRSDTRSSNQERQERDYtkRTDFRERNTRFSDERHSGRYSDYHRRNhyhrfkswGSEGRSYRRD 161
Cdd:PRK12678  179 EDRQAE-------AERGERGRREERGRDGDDRD--RRDRREQGDRREERGRRDGGDRRGRRR--------RRDRRDARGD 241
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 442619634  162 KGARDLSKSRYDNDASGSSLIRGASEIRRKNGHNCDPNSDASKTENSED 210
Cdd:PRK12678  242 DNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNEREPELRED 290
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01
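
Each hit above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal Python sketch for pulling those fields out of the text and comparing them against the 0.01 E-value threshold listed in the search parameters. The names `SUMMARY`, `parse_summary` and `E_VALUE_THRESHOLD` are hypothetical, and treating the threshold as inclusive (`<=`) is an assumption; the page does not say how CD-Search applies it.

```python
import re

# Pattern for the per-hit summary line as it appears in this page dump.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one 'Pssm-ID: ...' summary line into typed fields."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "kind": match["kind"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Three summary lines copied from the hit list above.
lines = [
    "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 56.84  E-value: 6.79e-08",
    "Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 50.46  E-value: 9.78e-06",
    "Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 42.96  E-value: 1.68e-03",
]
hits = [parse_summary(line) for line in lines]

# Assumption: the threshold is inclusive.
E_VALUE_THRESHOLD = 0.01
passing = [hit for hit in hits if hit["e_value"] <= E_VALUE_THRESHOLD]
strongest = min(hits, key=lambda hit: hit["e_value"])

print(f"{len(passing)} of {len(hits)} hits pass the threshold")
print(f"strongest hit: Pssm-ID {strongest['pssm_id']} (E-value {strongest['e_value']:.2e})")
```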
