NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907140485|ref|XP_036018184|]
View 

cordon-bleu protein-like 1 isoform X10 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Cobl pfam09469
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ...
156-234 2.97e-42

Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.


:

Pssm-ID: 462810  Cd Length: 79  Bit Score: 148.89  E-value: 2.97e-42
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907140485  156 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 234
Cdd:pfam09469    1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
PHA03247 super family cl33720
large tegument protein UL36; Provisional
873-1144 6.22e-08

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 6.22e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  873 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 952
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  953 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1019
Cdd:PHA03247  2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485 1020 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1099
Cdd:PHA03247  2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907140485 1100 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1144
Cdd:PHA03247  2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
WH2 super family cl41728
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This ...
1190-1214 4.40e-07

Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This family contains the Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2) as well as thymosin-beta (Tbeta; also called beta-thymosin or betaT) domains that are small, widespread intrinsically disordered actin-binding peptides displaying significant sequence variability and different regulations of actin self-assembly in motile and morphogenetic processes. These WH2/betaT peptides are identified by a central consensus actin-binding motif LKKT/V flanked by variable N-terminal and C-terminal extensions; the betaT shares a more extended and conserved C-terminal half than WH2. These single or repeated domains are found in actin-binding proteins (ABPs) such as the hematopoietic-specific protein WASP, its ubiquitously expressed ortholog neural-WASP (N-WASP), WASP-interacting protein (WAS/WASL-interacting protein family members 1 and 2), and WASP-family verprolin homologous protein (WAVE/SCAR) isoforms: WAVE1, WAVE2, and WAVE3. Also included are the WH2 domains found in inverted formin FH2 domain-containing protein (INF2), Cordon bleu (Cobl) protein, vasodilator-stimulated phosphoprotein (VASP) homology protein and actobindin (found in amoebae). These ABPs are commonly multidomain proteins that contain signaling domains and structurally conserved actin-binding motifs, the most important being the WH2 domain motif through which they bind actin in order to direct the location, rate, and timing for actin assembly in the cell into different structures, such as filopodia, lamellipodia, stress fibers, and focal adhesions. The WH2 domain motif is one of the most abundant actin-binding motifs in Wiskott-Aldrich syndrome proteins (WASPs) where they activate Arp2/3-dependent actin nucleation and branching in response to signals mediated by Rho-family GTPases. The thymosin beta (Tbeta) domains in metazoans act in cells as major actin-sequestering peptides; their complex with monomeric ATP-actin (G-ATP-actin) cannot polymerize at either filament (F-actin) end.


The actual alignment was detected with superfamily member cd21801:

Pssm-ID: 425359  Cd Length: 26  Bit Score: 46.91  E-value: 4.40e-07
                           10        20
                   ....*....|....*....|....*
gi 1907140485 1190 DPEHVRQSLLTAIRSGEAAAKLKRA 1214
Cdd:cd21801      1 NPEQARQALLEAIRSGEGAARLKKV 25
RBD super family cl46342
Raf-like Ras-binding domain;
72-138 9.51e-03

Raf-like Ras-binding domain;


The actual alignment was detected with superfamily member pfam02196:

Pssm-ID: 460485  Cd Length: 69  Bit Score: 35.96  E-value: 9.51e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907140485   72 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 138
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
 
Name Accession Description Interval E-value
Cobl pfam09469
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ...
156-234 2.97e-42

Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.


Pssm-ID: 462810  Cd Length: 79  Bit Score: 148.89  E-value: 2.97e-42
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907140485  156 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 234
Cdd:pfam09469    1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
PHA03247 PHA03247
large tegument protein UL36; Provisional
873-1144 6.22e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 6.22e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  873 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 952
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  953 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1019
Cdd:PHA03247  2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485 1020 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1099
Cdd:PHA03247  2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907140485 1100 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1144
Cdd:PHA03247  2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
WH2_Wc_Cobl cd21801
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ...
1190-1214 4.40e-07

third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.


Pssm-ID: 409199  Cd Length: 26  Bit Score: 46.91  E-value: 4.40e-07
                           10        20
                   ....*....|....*....|....*
gi 1907140485 1190 DPEHVRQSLLTAIRSGEAAAKLKRA 1214
Cdd:cd21801      1 NPEQARQALLEAIRSGEGAARLKKV 25
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
834-1131 3.35e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 3.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  834 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 911
Cdd:pfam03154  212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  912 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 991
Cdd:pfam03154  284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  992 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 1069
Cdd:pfam03154  357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907140485 1070 ALPGSQGTSLNLKTLKTFGAPRPYSSSGP---SPFALAVVKRSQSFSKACPESASEGSSALPPAA 1131
Cdd:pfam03154  433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
992-1072 1.25e-03

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 42.58  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  992 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 1071
Cdd:NF040983    86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163

                   .
gi 1907140485 1072 P 1072
Cdd:NF040983   164 P 164
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
942-1072 4.11e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 41.29  E-value: 4.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  942 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 1021
Cdd:NF033839   345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907140485 1022 TLPA------ETSPPPVFPKPMTLPA-ETSLPLVFPKPMTLRAETSPPPVAAKPVALP 1072
Cdd:NF033839   420 VKPQpekpkpEVKPQPEKPKPEVKPQpEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
989-1074 8.45e-03

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 39.89  E-value: 8.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  989 IQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVfPKPMTLRAETSPPPVAAKP 1068
Cdd:NF040983    89 VPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST-TTPTPSMHPIQPTQLPSIP 167

                   ....*.
gi 1907140485 1069 VALPGS 1074
Cdd:NF040983   168 NATPTS 173
RBD pfam02196
Raf-like Ras-binding domain;
72-138 9.51e-03

Raf-like Ras-binding domain;


Pssm-ID: 460485  Cd Length: 69  Bit Score: 35.96  E-value: 9.51e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907140485   72 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 138
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
 
Name Accession Description Interval E-value
Cobl pfam09469
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ...
156-234 2.97e-42

Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.


Pssm-ID: 462810  Cd Length: 79  Bit Score: 148.89  E-value: 2.97e-42
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907140485  156 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 234
Cdd:pfam09469    1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
PHA03247 PHA03247
large tegument protein UL36; Provisional
873-1144 6.22e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 6.22e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  873 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 952
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  953 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1019
Cdd:PHA03247  2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485 1020 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1099
Cdd:PHA03247  2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907140485 1100 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1144
Cdd:PHA03247  2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
WH2_Wc_Cobl cd21801
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ...
1190-1214 4.40e-07

third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.


Pssm-ID: 409199  Cd Length: 26  Bit Score: 46.91  E-value: 4.40e-07
                           10        20
                   ....*....|....*....|....*
gi 1907140485 1190 DPEHVRQSLLTAIRSGEAAAKLKRA 1214
Cdd:cd21801      1 NPEQARQALLEAIRSGEGAARLKKV 25
PHA03247 PHA03247
large tegument protein UL36; Provisional
834-1180 7.59e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 7.59e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  834 PATSKSSQQPQPdlKPKPSSGTERHLHRTLSSPTgtETNPPKAPRVTTDTGTIPFAPnledinnileSKFRSRASNPQAK 913
Cdd:PHA03247  2562 AAPDRSVPPPRP--APRPSEPAVTSRARRPDAPP--QSARPRAPVDDRGDPRGPAPP----------SPLPPDTHAPDPP 2627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  914 PSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPA 993
Cdd:PHA03247  2628 PPS----PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  994 ETSPPPVAPKPMTlraetSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP-------PVAA 1066
Cdd:PHA03247  2704 PPPTPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapappaaPAAG 2778
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485 1067 KPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSPFALAVVKRSQSFSK--ACPESASEGSSALPPAATQDekthtvNKPT 1144
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplPPPTSAQPTAPPPPPGPPPP------SLPL 2852
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1907140485 1145 VGSQHGDGDKQNNPVQNEHSSQVLTPADGPSFTLKR 1180
Cdd:PHA03247  2853 GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR 2888
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
903-1134 3.19e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 51.42  E-value: 3.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  903 FRSRASNPQAKPSSfflqmqkRASGHYVTSAAAKSVHTAPGPAPKEPTiKEVQRDPQLSPEQHPSSLSERTHSAPLPNIS 982
Cdd:PRK12323   363 FRPGQSGGGAGPAT-------AAAAPVAQPAPAAAAPAAAAPAPAAPP-AAPAAAPAAAAAARAVAAAPARRSPAPEALA 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  983 KADDDIIQKPAETSPPPVAPKPMTLrAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETS------LPLVFPKPMTLR 1056
Cdd:PRK12323   435 AARQASARGPGGAPAPAPAPAAAPA-AAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpppweeLPPEFASPAPAQ 513
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907140485 1057 AETSPPPVAAKPVALPGSQGTSlnlktlktfgAPRPYSSSGPSPfALAVVKRSQSFSKACPESASEGSSALPPAATQD 1134
Cdd:PRK12323   514 PDAAPAGWVAESIPDPATADPD----------DAFETLAPAPAA-APAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
834-1131 3.35e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 3.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  834 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 911
Cdd:pfam03154  212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  912 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 991
Cdd:pfam03154  284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  992 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 1069
Cdd:pfam03154  357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907140485 1070 ALPGSQGTSLNLKTLKTFGAPRPYSSSGP---SPFALAVVKRSQSFSKACPESASEGSSALPPAA 1131
Cdd:pfam03154  433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
PHA03247 PHA03247
large tegument protein UL36; Provisional
834-1129 5.42e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 5.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  834 PATSKSSQQPQPDLKPKPSSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIpfapnledinnilESKFRSRASNPQAK 913
Cdd:PHA03247  2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-------------SRPRRARRLGRAAQ 2675
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  914 PSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQlspeqhPSSLSERTHSAPLPNISKADDDIIQKPA 993
Cdd:PHA03247  2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPL------PPGPAAARQASPALPAAPAPPAVPAGPA 2749
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  994 ETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFP------------KPMTLPAETSLPLVFPKPMTLRAETSP 1061
Cdd:PHA03247  2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlsesreslpspwDPADPPAAVLAPAAALPPAASPAGPLP 2829
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907140485 1062 PPVAAKPVALPGSQGTSLNLKTLKTFGAP------RPYSSSGPSPFALAVVKRSQSFSKACPESASEgSSALPP 1129
Cdd:PHA03247  2830 PPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE-SFALPP 2902
PHA03378 PHA03378
EBNA-3B; Provisional
811-1096 5.57e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.75  E-value: 5.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  811 KMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSsgterhlhrTLSSPTGTETNPPK--APR-VTTDTGTIP 887
Cdd:PHA03378   562 QLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPS---------QTPEPPTTQSHIPEtsAPRqWPMPLRPIP 632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  888 FAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKepTIKEVQRDP-QLSPEQHP 966
Cdd:PHA03378   633 MRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPG--TMQPPPRAPtPMRPPAAP 710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  967 SSLSERTHSAPLPNISKADDDIIQKPAETSPPPvAPKPMTLRAETSPPPVFPKPMTLPAETSPPPV-FPKPMTLPAETSL 1045
Cdd:PHA03378   711 PGRAQRPAAATGRARPPAAAPGRARPPAAAPGR-ARPPAAAPGRARPPAAAPGRARPPAAAPGAPTpQPPPQAPPAPQQR 789
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907140485 1046 PLVFPKPMTlRAETSPPPVAAKPVALPGSQG-TSLNLKTLKTFGAPRPYSSS 1096
Cdd:PHA03378   790 PRGAPTPQP-PPQAGPTSMQLMPRAAPGQQGpTKQILRQLLTGGVKRGRPSL 840
PHA03247 PHA03247
large tegument protein UL36; Provisional
724-1100 7.10e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 7.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  724 EKEPACTYGNNVPLSPVDGSNKNPAASylKNFPLYRQDSNPKPKPSNEITREYIPKIGMTTYKIVPPKSLEMAKDWESeA 803
Cdd:PHA03247  2586 ARRPDAPPQSARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-S 2662
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  804 MGRKDDQKMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSSgteRHLHRTLSSPTGTETNPPKAPRVTTDT 883
Cdd:PHA03247  2663 RPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP---HALVSATPLPPGPAAARQASPALPAAP 2739
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  884 GT--IPFAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLS 961
Cdd:PHA03247  2740 APpaVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  962 PEQHPSSLSerthsAPLPNISKADDDIIQKPAETSPPP---VAPK-PMTLRAETSPPPvfpkpmTLPAETSPPPVfpKPM 1037
Cdd:PHA03247  2820 PAASPAGPL-----PPPTSAQPTAPPPPPGPPPPSLPLggsVAPGgDVRRRPPSRSPA------AKPAAPARPPV--RRL 2886
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907140485 1038 TLPAETSLPLVFPKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSP 1100
Cdd:PHA03247  2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
PRK10263 PRK10263
DNA translocase FtsK; Provisional
826-1100 1.65e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 1.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  826 ETSMQT-EVPATSKSSQQPQPDLKPKPssgterhlhrtlssptGTETNPPK-APRVTTDTGTIPFAPNLEDINNILESKF 903
Cdd:PRK10263   338 EPVTQTpPVASVDVPPAQPTVAWQPVP----------------GPQTGEPViAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  904 rsrasnPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEV-QRDPQLSPEQH-PSSLSERTHSAPLPni 981
Cdd:PRK10263   402 ------QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAwQAEEQQSTFAPqSTYQTEQTYQQPAA-- 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  982 skADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVF-----------------------PKPMTLPAETSPPPVFPKPMT 1038
Cdd:PRK10263   474 --QEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYyfeeveekrarereqlaawyqpiPEPVKEPEPIKSSLKAPSVAA 551
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907140485 1039 LPAETSLPLVFPKPMTLRAETSPPPVAAKpVALPgsqgtslnLKTLKTFGAPRPYSSSGPSP 1100
Cdd:PRK10263   552 VPPVEAAAAVSPLASGVKKATLATGAAAT-VAAP--------VFSLANSGGPRPQVKEGIGP 604
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
958-1077 2.34e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.54  E-value: 2.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  958 PQLSPEQHPSSLSERTHSAPLPNiSKADDDIIQKPAETSPPPVAPKPMTlRAETSPPPVfPKPMTLPAETSPPPVFPKPM 1037
Cdd:PRK14971   371 GGRGPKQHIKPVFTQPAAAPQPS-AAAAASPSPSQSSAAAQPSAPQSAT-QPAGTPPTV-SVDPPAAVPVNPPSTAPQAV 447
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1907140485 1038 TLPAETSlplvfPKPMTLRAETSPPPVAAKPVALPGSQGT 1077
Cdd:PRK14971   448 RPAQFKE-----EKKIPVSKVSSLGPSTLRPIQEKAEQAT 482
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
991-1080 1.09e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.26  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  991 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPmtlPAETSLPLVFPKPMTLRAETSPPPVAAK--P 1068
Cdd:PRK14950   370 KPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR---PVAPPVPHTPESAPKLTRAAIPVDEKPKytP 446
                           90
                   ....*....|..
gi 1907140485 1069 VALPGSQGTSLN 1080
Cdd:PRK14950   447 PAPPKEEEKALI 458
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
992-1072 1.25e-03

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 42.58  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  992 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 1071
Cdd:NF040983    86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163

                   .
gi 1907140485 1072 P 1072
Cdd:NF040983   164 P 164
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
861-1062 1.30e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  861 RTLSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDiNNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHT 940
Cdd:PRK12323   376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAA-PAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  941 APGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKP 1020
Cdd:PRK12323   455 AAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADP 534
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1907140485 1021 mTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP 1062
Cdd:PRK12323   535 -DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
930-1137 2.02e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 2.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  930 VTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSL---SERTHSAPLPNISKADDDI--IQKPAETSPPPVAPKP 1004
Cdd:PHA03307    56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLapaSPAREGSPTPPGPSSPDPPppTPPPASPPPSPAPDLS 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485 1005 MTLRAETSPPPVfPKPMTLPAETSPPPVfpkpmTLPAETSLPLVFPKPMTLRAETSPPPVAAKPV---ALPGSQGTSLNL 1081
Cdd:PHA03307   136 EMLRPVGSPGPP-PAASPPAAGASPAAV-----ASDAASSRQAALPLSSPEETARAPSSPPAEPPpstPPAAASPRPPRR 209
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907140485 1082 KTLKTFGAPRPYSSSGPSPFALAVVKRSQSFSKACPESAS--EGSSALPPAATQDEKT 1137
Cdd:PHA03307   210 SSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWgpENECPLPRPAPITLPT 267
PHA03378 PHA03378
EBNA-3B; Provisional
943-1104 2.24e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.36  E-value: 2.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  943 GPAPKEPtikevqrdPQLSPEQHPSSLSERTH--SAPLPNISKADDDIIQKPAETSPPPVAPKPMtlRAETSPPPVFPKP 1020
Cdd:PHA03378   648 FPTPHQP--------PQVEITPYKPTWTQIGHipYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM--RPPAAPPGRAQRP 717
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485 1021 MTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAET-------SPPPVAA--KPVALPGSQGTSLNLKTLKTFGAPR 1091
Cdd:PHA03378   718 AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPpaaapgrARPPAAApgAPTPQPPPQAPPAPQQRPRGAPTPQ 797
                          170
                   ....*....|...
gi 1907140485 1092 PYSSSGPSPFALA 1104
Cdd:PHA03378   798 PPPQAGPTSMQLM 810
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
944-1177 2.38e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 2.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  944 PAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTL 1023
Cdd:pfam03154  221 TQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPL 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485 1024 PAETSPPPVFPKPMT-LPAETSLPLVFPKPMTLRAETSPP---PVAAKPVALPgsqgtslNLKTLKTFGAPR-------- 1091
Cdd:pfam03154  301 TPQSSQSQVPPGPSPaAPGQSQQRIHTPPSQSQLQSQQPPreqPLPPAPLSMP-------HIKPPPTTPIPQlpnpqshk 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485 1092 -PYSSSGPSPFAL-------AVVKRSQSFSKACPESA--------SEGSSALPPAATQDEKTHTVNKPTVGSQH-GDGDK 1154
Cdd:pfam03154  374 hPPHLSGPSPFQMnsnlpppPALKPLSSLSTHHPPSAhppplqlmPQSQQLPPPPAQPPVLTQSQSLPPPAASHpPTSGL 453
                          250       260
                   ....*....|....*....|...
gi 1907140485 1155 QNNPVQNEHSSQVLTPADGPSFT 1177
Cdd:pfam03154  454 HQVPSQSPFPQHPFVPGGPPPIT 476
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
990-1085 3.04e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 3.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  990 QKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPlvfpkpmtlrAETSPPPVAAKPV 1069
Cdd:PRK12270    36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA----------AAAPAAPPAAAAA 105
                           90
                   ....*....|....*.
gi 1907140485 1070 ALPGSQGTSLNLKTLK 1085
Cdd:PRK12270   106 AAPAAAAVEDEVTPLR 121
PHA03247 PHA03247
large tegument protein UL36; Provisional
834-1094 3.09e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  834 PATSKSSQQPQPDLKPKPSSGTERHLHRTlSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDINNILESKFRSRASNPQAK 913
Cdd:PHA03247  2793 ESRESLPSPWDPADPPAAVLAPAAALPPA-ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP 2871
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  914 PSSfflqmqkrasghyVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPniskadddiiQKPA 993
Cdd:PHA03247  2872 AAK-------------PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP----------PPPQ 2928
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  994 ETSPPPVAPKPmtlraetsPPPVFPKPMTLPAeTSPPPVFPKP---MTLPAETSLPLVF---PKPMTLRAETSPPPVAAK 1067
Cdd:PHA03247  2929 PQPPPPPPPRP--------QPPLAPTTDPAGA-GEPSGAVPQPwlgALVPGRVAVPRFRvpqPAPSREAPASSTPPLTGH 2999
                          250       260
                   ....*....|....*....|....*..
gi 1907140485 1068 PVALPGSQGTSLnlkTLKTFGAPRPYS 1094
Cdd:PHA03247  3000 SLSRVSSWASSL---ALHEETDPPPVS 3023
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
942-1072 4.11e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 41.29  E-value: 4.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  942 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 1021
Cdd:NF033839   345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907140485 1022 TLPA------ETSPPPVFPKPMTLPA-ETSLPLVFPKPMTLRAETSPPPVAAKPVALP 1072
Cdd:NF033839   420 VKPQpekpkpEVKPQPEKPKPEVKPQpEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
991-1137 7.71e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.62  E-value: 7.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  991 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPM------TLPAETSLPLVFPKPMTLRAETSPPPV 1064
Cdd:PRK07994   360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASApqqapaVPLPETTSQLLAARQQLQRAQGATKAK 439
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907140485 1065 AAKPVALPGSQGTSLNLKTLKTFgAPRPYSSSGPSPFALAVVKRSQSfskacPESASEGSSALPPAATQ---DEKT 1137
Cdd:PRK07994   440 KSEPAAASRARPVNSALERLASV-RPAPSALEKAPAKKEAYRWKATN-----PVEVKKEPVATPKALKKaleHEKT 509
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
989-1074 8.45e-03

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 39.89  E-value: 8.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140485  989 IQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVfPKPMTLRAETSPPPVAAKP 1068
Cdd:NF040983    89 VPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST-TTPTPSMHPIQPTQLPSIP 167

                   ....*.
gi 1907140485 1069 VALPGS 1074
Cdd:NF040983   168 NATPTS 173
RBD pfam02196
Raf-like Ras-binding domain;
72-138 9.51e-03

Raf-like Ras-binding domain;


Pssm-ID: 460485  Cd Length: 69  Bit Score: 35.96  E-value: 9.51e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907140485   72 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 138
Cdd:pfam02196    2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH