NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|189458847|ref|NP_081501|]
View 

cordon-bleu protein-like 1 isoform 2 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Cobl pfam09469
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ...
156-234 3.61e-42

Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.


:

Pssm-ID: 462810  Cd Length: 79  Bit Score: 148.50  E-value: 3.61e-42
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 189458847   156 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 234
Cdd:pfam09469    1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
PHA03247 super family cl33720
large tegument protein UL36; Provisional
835-1106 5.66e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 5.66e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  835 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 914
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  915 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 981
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  982 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1061
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*
gi 189458847 1062 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1106
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
WH2 super family cl41728
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This ...
1152-1177 7.42e-08

Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This family contains the Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2) as well as thymosin-beta (Tbeta; also called beta-thymosin or betaT) domains that are small, widespread intrinsically disordered actin-binding peptides displaying significant sequence variability and different regulations of actin self-assembly in motile and morphogenetic processes. These WH2/betaT peptides are identified by a central consensus actin-binding motif LKKT/V flanked by variable N-terminal and C-terminal extensions; the betaT shares a more extended and conserved C-terminal half than WH2. These single or repeated domains are found in actin-binding proteins (ABPs) such as the hematopoietic-specific protein WASP, its ubiquitously expressed ortholog neural-WASP (N-WASP), WASP-interacting protein (WAS/WASL-interacting protein family members 1 and 2), and WASP-family verprolin homologous protein (WAVE/SCAR) isoforms: WAVE1, WAVE2, and WAVE3. Also included are the WH2 domains found in inverted formin FH2 domain-containing protein (INF2), Cordon bleu (Cobl) protein, vasodilator-stimulated phosphoprotein (VASP) homology protein and actobindin (found in amoebae). These ABPs are commonly multidomain proteins that contain signaling domains and structurally conserved actin-binding motifs, the most important being the WH2 domain motif through which they bind actin in order to direct the location, rate, and timing for actin assembly in the cell into different structures, such as filopodia, lamellipodia, stress fibers, and focal adhesions. The WH2 domain motif is one of the most abundant actin-binding motifs in Wiskott-Aldrich syndrome proteins (WASPs) where they activate Arp2/3-dependent actin nucleation and branching in response to signals mediated by Rho-family GTPases. The thymosin beta (Tbeta) domains in metazoans act in cells as major actin-sequestering peptides; their complex with monomeric ATP-actin (G-ATP-actin) cannot polymerize at either filament (F-actin) end.


The actual alignment was detected with superfamily member cd21801:

Pssm-ID: 425359  Cd Length: 26  Bit Score: 49.23  E-value: 7.42e-08
                          10        20
                  ....*....|....*....|....*.
gi 189458847 1152 DPEHVRQSLLTAIRSGEAAAKLKRVT 1177
Cdd:cd21801     1 NPEQARQALLEAIRSGEGAARLKKVP 26
RBD super family cl46342
Raf-like Ras-binding domain;
72-138 9.80e-03

Raf-like Ras-binding domain;


The actual alignment was detected with superfamily member pfam02196:

Pssm-ID: 460485  Cd Length: 69  Bit Score: 35.96  E-value: 9.80e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 189458847    72 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 138
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
 
Name Accession Description Interval E-value
Cobl pfam09469
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ...
156-234 3.61e-42

Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.


Pssm-ID: 462810  Cd Length: 79  Bit Score: 148.50  E-value: 3.61e-42
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 189458847   156 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 234
Cdd:pfam09469    1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
PHA03247 PHA03247
large tegument protein UL36; Provisional
835-1106 5.66e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 5.66e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  835 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 914
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  915 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 981
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  982 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1061
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*
gi 189458847 1062 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1106
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
WH2_Wc_Cobl cd21801
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ...
1152-1177 7.42e-08

third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.


Pssm-ID: 409199  Cd Length: 26  Bit Score: 49.23  E-value: 7.42e-08
                          10        20
                  ....*....|....*....|....*.
gi 189458847 1152 DPEHVRQSLLTAIRSGEAAAKLKRVT 1177
Cdd:cd21801     1 NPEQARQALLEAIRSGEGAARLKKVP 26
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
796-1093 4.22e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.84  E-value: 4.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847   796 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 873
Cdd:pfam03154  212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847   874 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 953
Cdd:pfam03154  284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847   954 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 1031
Cdd:pfam03154  357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 189458847  1032 ALPGSQGTSLNLKTLKTFGAPRPYSSSGP---SPFALAVVKRSQSFSKACPESASEGSSALPPAA 1093
Cdd:pfam03154  433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
954-1034 1.45e-03

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 42.58  E-value: 1.45e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  954 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 1033
Cdd:NF040983   86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163

                  .
gi 189458847 1034 P 1034
Cdd:NF040983  164 P 164
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
904-1034 3.98e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 41.29  E-value: 3.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  904 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 983
Cdd:NF033839  345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 189458847  984 TLPA------ETSPPPVFPKPMTLPA-ETSLPLVFPKPMTLRAETSPPPVAAKPVALP 1034
Cdd:NF033839  420 VKPQpekpkpEVKPQPEKPKPEVKPQpEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
951-1036 9.40e-03

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 39.89  E-value: 9.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  951 IQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVfPKPMTLRAETSPPPVAAKP 1030
Cdd:NF040983   89 VPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST-TTPTPSMHPIQPTQLPSIP 167

                  ....*.
gi 189458847 1031 VALPGS 1036
Cdd:NF040983  168 NATPTS 173
RBD pfam02196
Raf-like Ras-binding domain;
72-138 9.80e-03

Raf-like Ras-binding domain;


Pssm-ID: 460485  Cd Length: 69  Bit Score: 35.96  E-value: 9.80e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 189458847    72 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 138
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
 
Name Accession Description Interval E-value
Cobl pfam09469
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ...
156-234 3.61e-42

Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.


Pssm-ID: 462810  Cd Length: 79  Bit Score: 148.50  E-value: 3.61e-42
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 189458847   156 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 234
Cdd:pfam09469    1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
PHA03247 PHA03247
large tegument protein UL36; Provisional
835-1106 5.66e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 5.66e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  835 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 914
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  915 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 981
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  982 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1061
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*
gi 189458847 1062 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1106
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
WH2_Wc_Cobl cd21801
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ...
1152-1177 7.42e-08

third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.


Pssm-ID: 409199  Cd Length: 26  Bit Score: 49.23  E-value: 7.42e-08
                          10        20
                  ....*....|....*....|....*.
gi 189458847 1152 DPEHVRQSLLTAIRSGEAAAKLKRVT 1177
Cdd:cd21801     1 NPEQARQALLEAIRSGEGAARLKKVP 26
PHA03247 PHA03247
large tegument protein UL36; Provisional
796-1142 7.15e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 7.15e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  796 PATSKSSQQPQPdlKPKPSSGTERHLHRTLSSPTgtETNPPKAPRVTTDTGTIPFAPnledinnileSKFRSRASNPQAK 875
Cdd:PHA03247 2562 AAPDRSVPPPRP--APRPSEPAVTSRARRPDAPP--QSARPRAPVDDRGDPRGPAPP----------SPLPPDTHAPDPP 2627
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  876 PSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPA 955
Cdd:PHA03247 2628 PPS----PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  956 ETSPPPVAPKPMTlraetSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP-------PVAA 1028
Cdd:PHA03247 2704 PPPTPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapappaaPAAG 2778
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847 1029 KPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSPFALAVVKRSQSFSK--ACPESASEGSSALPPAATQDekthtvNKPT 1106
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplPPPTSAQPTAPPPPPGPPPP------SLPL 2852
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 189458847 1107 VGSQHGDGDKQNNPVQNEHSSQVLTPADGPSFTLKR 1142
Cdd:PHA03247 2853 GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR 2888
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
865-1096 3.10e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 51.42  E-value: 3.10e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  865 FRSRASNPQAKPSSfflqmqkRASGHYVTSAAAKSVHTAPGPAPKEPTiKEVQRDPQLSPEQHPSSLSERTHSAPLPNIS 944
Cdd:PRK12323  363 FRPGQSGGGAGPAT-------AAAAPVAQPAPAAAAPAAAAPAPAAPP-AAPAAAPAAAAAARAVAAAPARRSPAPEALA 434
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  945 KADDDIIQKPAETSPPPVAPKPMTLrAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETS------LPLVFPKPMTLR 1018
Cdd:PRK12323  435 AARQASARGPGGAPAPAPAPAAAPA-AAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpppweeLPPEFASPAPAQ 513
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 189458847 1019 AETSPPPVAAKPVALPGSQGTSlnlktlktfgAPRPYSSSGPSPfALAVVKRSQSFSKACPESASEGSSALPPAATQD 1096
Cdd:PRK12323  514 PDAAPAGWVAESIPDPATADPD----------DAFETLAPAPAA-APAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
796-1093 4.22e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.84  E-value: 4.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847   796 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 873
Cdd:pfam03154  212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847   874 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 953
Cdd:pfam03154  284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847   954 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 1031
Cdd:pfam03154  357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 189458847  1032 ALPGSQGTSLNLKTLKTFGAPRPYSSSGP---SPFALAVVKRSQSFSKACPESASEGSSALPPAA 1093
Cdd:pfam03154  433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
PHA03247 PHA03247
large tegument protein UL36; Provisional
796-1091 5.07e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 5.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  796 PATSKSSQQPQPDLKPKPSSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIpfapnledinnilESKFRSRASNPQAK 875
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-------------SRPRRARRLGRAAQ 2675
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  876 PSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQlspeqhPSSLSERTHSAPLPNISKADDDIIQKPA 955
Cdd:PHA03247 2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPL------PPGPAAARQASPALPAAPAPPAVPAGPA 2749
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  956 ETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFP------------KPMTLPAETSLPLVFPKPMTLRAETSP 1023
Cdd:PHA03247 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlsesreslpspwDPADPPAAVLAPAAALPPAASPAGPLP 2829
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 189458847 1024 PPVAAKPVALPGSQGTSLNLKTLKTFGAP------RPYSSSGPSPFALAVVKRSQSFSKACPESASEgSSALPP 1091
Cdd:PHA03247 2830 PPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE-SFALPP 2902
PHA03378 PHA03378
EBNA-3B; Provisional
773-1058 5.48e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.75  E-value: 5.48e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  773 KMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSsgterhlhrTLSSPTGTETNPPK--APR-VTTDTGTIP 849
Cdd:PHA03378  562 QLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPS---------QTPEPPTTQSHIPEtsAPRqWPMPLRPIP 632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  850 FAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKepTIKEVQRDP-QLSPEQHP 928
Cdd:PHA03378  633 MRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPG--TMQPPPRAPtPMRPPAAP 710
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  929 SSLSERTHSAPLPNISKADDDIIQKPAETSPPPvAPKPMTLRAETSPPPVFPKPMTLPAETSPPPV-FPKPMTLPAETSL 1007
Cdd:PHA03378  711 PGRAQRPAAATGRARPPAAAPGRARPPAAAPGR-ARPPAAAPGRARPPAAAPGRARPPAAAPGAPTpQPPPQAPPAPQQR 789
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 189458847 1008 PLVFPKPMTlRAETSPPPVAAKPVALPGSQG-TSLNLKTLKTFGAPRPYSSS 1058
Cdd:PHA03378  790 PRGAPTPQP-PPQAGPTSMQLMPRAAPGQQGpTKQILRQLLTGGVKRGRPSL 840
PHA03247 PHA03247
large tegument protein UL36; Provisional
686-1062 6.64e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 6.64e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  686 EKEPACTYGNNVPLSPVDGSNKNPAASylKNFPLYRQDSNPKPKPSNEITREYIPKIGMTTYKIVPPKSLEMAKDWESeA 765
Cdd:PHA03247 2586 ARRPDAPPQSARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-S 2662
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  766 MGRKDDQKMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSSgteRHLHRTLSSPTGTETNPPKAPRVTTDT 845
Cdd:PHA03247 2663 RPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP---HALVSATPLPPGPAAARQASPALPAAP 2739
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  846 GT--IPFAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLS 923
Cdd:PHA03247 2740 APpaVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  924 PEQHPSSLSerthsAPLPNISKADDDIIQKPAETSPPP---VAPK-PMTLRAETSPPPvfpkpmTLPAETSPPPVfpKPM 999
Cdd:PHA03247 2820 PAASPAGPL-----PPPTSAQPTAPPPPPGPPPPSLPLggsVAPGgDVRRRPPSRSPA------AKPAAPARPPV--RRL 2886
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 189458847 1000 TLPAETSLPLVFPKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSP 1062
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
PRK10263 PRK10263
DNA translocase FtsK; Provisional
788-1062 1.54e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 1.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  788 ETSMQT-EVPATSKSSQQPQPDLKPKPssgterhlhrtlssptGTETNPPK-APRVTTDTGTIPFAPNLEDINNILESKF 865
Cdd:PRK10263  338 EPVTQTpPVASVDVPPAQPTVAWQPVP----------------GPQTGEPViAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  866 rsrasnPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEV-QRDPQLSPEQH-PSSLSERTHSAPLPni 943
Cdd:PRK10263  402 ------QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAwQAEEQQSTFAPqSTYQTEQTYQQPAA-- 473
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  944 skADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVF-----------------------PKPMTLPAETSPPPVFPKPMT 1000
Cdd:PRK10263  474 --QEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYyfeeveekrarereqlaawyqpiPEPVKEPEPIKSSLKAPSVAA 551
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 189458847 1001 LPAETSLPLVFPKPMTLRAETSPPPVAAKpVALPgsqgtslnLKTLKTFGAPRPYSSSGPSP 1062
Cdd:PRK10263  552 VPPVEAAAAVSPLASGVKKATLATGAAAT-VAAP--------VFSLANSGGPRPQVKEGIGP 604
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
920-1039 2.51e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.15  E-value: 2.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  920 PQLSPEQHPSSLSERTHSAPLPNiSKADDDIIQKPAETSPPPVAPKPMTlRAETSPPPVfPKPMTLPAETSPPPVFPKPM 999
Cdd:PRK14971  371 GGRGPKQHIKPVFTQPAAAPQPS-AAAAASPSPSQSSAAAQPSAPQSAT-QPAGTPPTV-SVDPPAAVPVNPPSTAPQAV 447
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 189458847 1000 TLPAETSlplvfPKPMTLRAETSPPPVAAKPVALPGSQGT 1039
Cdd:PRK14971  448 RPAQFKE-----EKKIPVSKVSSLGPSTLRPIQEKAEQAT 482
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
953-1042 1.11e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.26  E-value: 1.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  953 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPmtlPAETSLPLVFPKPMTLRAETSPPPVAAK--P 1030
Cdd:PRK14950  370 KPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR---PVAPPVPHTPESAPKLTRAAIPVDEKPKytP 446
                          90
                  ....*....|..
gi 189458847 1031 VALPGSQGTSLN 1042
Cdd:PRK14950  447 PAPPKEEEKALI 458
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
823-1024 1.30e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 1.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  823 RTLSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDiNNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHT 902
Cdd:PRK12323  376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAA-PAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  903 APGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKP 982
Cdd:PRK12323  455 AAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADP 534
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 189458847  983 mTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP 1024
Cdd:PRK12323  535 -DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
954-1034 1.45e-03

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 42.58  E-value: 1.45e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  954 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 1033
Cdd:NF040983   86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163

                  .
gi 189458847 1034 P 1034
Cdd:NF040983  164 P 164
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
892-1203 1.72e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 1.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  892 VTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSL---SERTHSAPLPNISKADDDI--IQKPAETSPPPVAPKP 966
Cdd:PHA03307   56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLapaSPAREGSPTPPGPSSPDPPppTPPPASPPPSPAPDLS 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  967 MTLRAETSPPPVfPKPMTLPAETSPPPVfpkpmTLPAETSLPLVFPKPMTLRAETSPPPVAAKPVALPGSqgtslnlktl 1046
Cdd:PHA03307  136 EMLRPVGSPGPP-PAASPPAAGASPAAV-----ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPP---------- 199
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847 1047 ktfGAPRPYSSSGPSPFALAVVkrsqsfskaCPESASEGSSALPPAATQDEKTHtvnkpTVGSQHGDGDKQNNPVQNEHS 1126
Cdd:PHA03307  200 ---AAASPRPPRRSSPISASAS---------SPAPAPGRSAADDAGASSSDSSS-----SESSGCGWGPENECPLPRPAP 262
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 189458847 1127 SQVLTPADGPSFTLKRqssltfqSSDPEHVRQSLLTAIRSGEAAAKLKRVTVPSNTISVNGKSGLSQSMSIDAQDSR 1203
Cdd:PHA03307  263 ITLPTRIWEASGWNGP-------SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS 332
PHA03378 PHA03378
EBNA-3B; Provisional
905-1066 2.17e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.36  E-value: 2.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  905 GPAPKEPtikevqrdPQLSPEQHPSSLSERTH--SAPLPNISKADDDIIQKPAETSPPPVAPKPMtlRAETSPPPVFPKP 982
Cdd:PHA03378  648 FPTPHQP--------PQVEITPYKPTWTQIGHipYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM--RPPAAPPGRAQRP 717
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  983 MTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAET-------SPPPVAA--KPVALPGSQGTSLNLKTLKTFGAPR 1053
Cdd:PHA03378  718 AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPpaaapgrARPPAAApgAPTPQPPPQAPPAPQQRPRGAPTPQ 797
                         170
                  ....*....|...
gi 189458847 1054 PYSSSGPSPFALA 1066
Cdd:PHA03378  798 PPPQAGPTSMQLM 810
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
906-1139 2.70e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 2.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847   906 PAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTL 985
Cdd:pfam03154  221 TQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPL 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847   986 PAETSPPPVFPKPMT-LPAETSLPLVFPKPMTLRAETSPP---PVAAKPVALPgsqgtslNLKTLKTFGAPR-------- 1053
Cdd:pfam03154  301 TPQSSQSQVPPGPSPaAPGQSQQRIHTPPSQSQLQSQQPPreqPLPPAPLSMP-------HIKPPPTTPIPQlpnpqshk 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  1054 -PYSSSGPSPFAL-------AVVKRSQSFSKACPESA--------SEGSSALPPAATQDEKTHTVNKPTVGSQH-GDGDK 1116
Cdd:pfam03154  374 hPPHLSGPSPFQMnsnlpppPALKPLSSLSTHHPPSAhppplqlmPQSQQLPPPPAQPPVLTQSQSLPPPAASHpPTSGL 453
                          250       260
                   ....*....|....*....|...
gi 189458847  1117 QNNPVQNEHSSQVLTPADGPSFT 1139
Cdd:pfam03154  454 HQVPSQSPFPQHPFVPGGPPPIT 476
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
952-1047 2.97e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 2.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  952 QKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPlvfpkpmtlrAETSPPPVAAKPV 1031
Cdd:PRK12270   36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA----------AAAPAAPPAAAAA 105
                          90
                  ....*....|....*.
gi 189458847 1032 ALPGSQGTSLNLKTLK 1047
Cdd:PRK12270  106 AAPAAAAVEDEVTPLR 121
PHA03247 PHA03247
large tegument protein UL36; Provisional
796-1056 3.04e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  796 PATSKSSQQPQPDLKPKPSSGTERHLHRTlSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDINNILESKFRSRASNPQAK 875
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPA-ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP 2871
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  876 PSSfflqmqkrasghyVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPniskadddiiQKPA 955
Cdd:PHA03247 2872 AAK-------------PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP----------PPPQ 2928
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  956 ETSPPPVAPKPmtlraetsPPPVFPKPMTLPAeTSPPPVFPKP---MTLPAETSLPLVF---PKPMTLRAETSPPPVAAK 1029
Cdd:PHA03247 2929 PQPPPPPPPRP--------QPPLAPTTDPAGA-GEPSGAVPQPwlgALVPGRVAVPRFRvpqPAPSREAPASSTPPLTGH 2999
                         250       260
                  ....*....|....*....|....*..
gi 189458847 1030 PVALPGSQGTSLnlkTLKTFGAPRPYS 1056
Cdd:PHA03247 3000 SLSRVSSWASSL---ALHEETDPPPVS 3023
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
904-1034 3.98e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 41.29  E-value: 3.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  904 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 983
Cdd:NF033839  345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 189458847  984 TLPA------ETSPPPVFPKPMTLPA-ETSLPLVFPKPMTLRAETSPPPVAAKPVALP 1034
Cdd:NF033839  420 VKPQpekpkpEVKPQPEKPKPEVKPQpEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
953-1099 7.46e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.62  E-value: 7.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  953 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPM------TLPAETSLPLVFPKPMTLRAETSPPPV 1026
Cdd:PRK07994  360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASApqqapaVPLPETTSQLLAARQQLQRAQGATKAK 439
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 189458847 1027 AAKPVALPGSQGTSLNLKTLKTFgAPRPYSSSGPSPFALAVVKRSQSfskacPESASEGSSALPPAATQ---DEKT 1099
Cdd:PRK07994  440 KSEPAAASRARPVNSALERLASV-RPAPSALEKAPAKKEAYRWKATN-----PVEVKKEPVATPKALKKaleHEKT 509
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
951-1036 9.40e-03

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 39.89  E-value: 9.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 189458847  951 IQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVfPKPMTLRAETSPPPVAAKP 1030
Cdd:NF040983   89 VPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST-TTPTPSMHPIQPTQLPSIP 167

                  ....*.
gi 189458847 1031 VALPGS 1036
Cdd:NF040983  168 NATPTS 173
RBD pfam02196
Raf-like Ras-binding domain;
72-138 9.80e-03

Raf-like Ras-binding domain;


Pssm-ID: 460485  Cd Length: 69  Bit Score: 35.96  E-value: 9.80e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 189458847    72 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 138
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH