NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217272036|ref|XP_047289424|]
View 

period circadian protein homolog 3 isoform X14 [Homo sapiens]

Protein Classification

Dcp1 family protein; LuxR family transcriptional regulator( domain architecture ID 12888871)

Dcp1 (mRNA-decapping enzyme subunit 1) family protein similar to Arabidopsis thaliana mRNA-decapping enzyme-like protein that acts as a component of the decapping complex and is involved in the degradation of mRNAs| LuxR family transcriptional regulator, may be involved in quorum-sensing

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Period_C super family cl13540
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1075-1175 2.15e-24

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


The actual alignment was detected with superfamily member pfam12114:

Pssm-ID: 463464  Cd Length: 171  Bit Score: 100.94  E-value: 2.15e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036 1075 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1153
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|..
gi 2217272036 1154 EELAKVYNWIQSQTVTQEIDIQ 1175
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLS 169
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-376 1.77e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


:

Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.77e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 363
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 2217272036  364 SRKISFIIGRHKV 376
Cdd:cd00130     91 GEVIGLLGVVRDI 103
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1055 3.87e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.85  E-value: 3.87e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  744 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  824 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 901
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  902 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 978
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217272036  979 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1055
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1075-1175 2.15e-24

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 100.94  E-value: 2.15e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036 1075 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1153
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|..
gi 2217272036 1154 EELAKVYNWIQSQTVTQEIDIQ 1175
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLS 169
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-376 1.77e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.77e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 363
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 2217272036  364 SRKISFIIGRHKV 376
Cdd:cd00130     91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
284-372 6.85e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.85e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  284 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 361
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 2217272036  362 pWSRKISFIIG 372
Cdd:pfam08447   78 -ENGKPVRVIG 87
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1055 3.87e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.85  E-value: 3.87e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  744 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  824 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 901
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  902 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 978
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217272036  979 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1055
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-328 1.23e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.23e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 2217272036   284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 328
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
741-1056 6.42e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.02  E-value: 6.42e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  741 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 806
Cdd:PHA03307    64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  807 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGAtasSAISPSMSSA 886
Cdd:PHA03307   144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAAS---PRPPRRSSPI 213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  887 MSPTLDPPPSvtSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSES 965
Cdd:PHA03307   214 SASASSPAPA--PGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRP 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  966 SPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvl 1045
Cdd:PHA03307   283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP-- 359
                          330
                   ....*....|.
gi 2217272036 1046 STGSPPSESPS 1056
Cdd:PHA03307   360 ADPSSPRKRPR 370
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
273-373 3.20e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  273 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 352
Cdd:COG5805    169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                           90       100
                   ....*....|....*....|..
gi 2217272036  353 DSSWSSFVNP-WSRKISFIIGR 373
Cdd:COG5805    245 EAVIVPLIDTdGSVKGILVILR 266
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1075-1175 2.15e-24

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 100.94  E-value: 2.15e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036 1075 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1153
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|..
gi 2217272036 1154 EELAKVYNWIQSQTVTQEIDIQ 1175
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLS 169
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-376 1.77e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.77e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 363
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 2217272036  364 SRKISFIIGRHKV 376
Cdd:cd00130     91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
284-372 6.85e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.85e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  284 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 361
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 2217272036  362 pWSRKISFIIG 372
Cdd:pfam08447   78 -ENGKPVRVIG 87
PAS_11 pfam14598
PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), ...
274-376 3.27e-09

PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), which binds to an LXXLL motif in the C-terminal region of STAT6 (Signal transducer and activator of transcription 6).


Pssm-ID: 464214 [Multi-domain]  Cd Length: 110  Bit Score: 55.76  E-value: 3.27e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  274 FTTTHTPGCVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHppfEHSPI-RFCTQNGDYIIL 352
Cdd:pfam14598    4 FTTRHDIDGKIISCDTRAPFSLGYEKDELVGRSIYDLVHPQDLRTAKSHLREIIQTRGR---ATSPSyRLRLRDGDFLSV 80
                           90       100
                   ....*....|....*....|....
gi 2217272036  353 DSSWSSFVNPWSRKISFIIGRHKV 376
Cdd:pfam14598   81 HTKSKLFLNQNSNQQPFIMCTHTI 104
PAS pfam00989
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
271-370 3.82e-09

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya. This domain can bind gases (O2, CO and NO), FAD, 4-hydroxycinnamic acid and NAD+ (Matilla et.al., FEMS Microbiology Reviews, fuab043, 45, 2021, 1. https://doi.org/10.1093/femsre/fuab043).


Pssm-ID: 395786 [Multi-domain]  Cd Length: 113  Bit Score: 55.50  E-value: 3.82e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  271 KRIFTTTHTPGCV------FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKyAGHPPFEHSpIRFCT 344
Cdd:pfam00989    4 RAILESLPDGIFVvdedgrILYVNAAAEELLGLSREEVIGKSLLDLIPEEDDAEVAELLRQALL-QGEESRGFE-VSFRV 81
                           90       100
                   ....*....|....*....|....*.
gi 2217272036  345 QNGDYIILDSSWSSFVNPWSRKISFI 370
Cdd:pfam00989   82 PDGRPRHVEVRASPVRDAGGEILGFL 107
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1055 3.87e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.85  E-value: 3.87e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  744 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  824 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 901
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  902 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 978
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217272036  979 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1055
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-328 1.23e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.23e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 2217272036   284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 328
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
748-1058 2.37e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 55.31  E-value: 2.37e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  748 PDSSSSNTGSGPRRGAHQNAQPCCPSAA--------SSPHTSSPTFPPAAMVP-SQAPYLVPAFPLPAATSPGREYAAPG 818
Cdd:pfam05109  466 PTVSTADVTSPTPAGTTSGASPVTPSPSprdngtesKAPDMTSPTSAVTTPTPnATSPTPAVTTPTPNATSPTLGKTSPT 545
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  819 TAPEglhgLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPLLSPS-FLPCPFLGATASSAISPSMS----SAMSPTLDP 893
Cdd:pfam05109  546 SAVT----TPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNATSPTVGETSPQANTTNHTlggtSSTPVVTSP 621
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  894 PPSVTSqrreeekweAQSEGHPFITSRSSSPLQLNLLQ-EEMPRPSESpDQMRRNTCPQTEYQCVTGNNGSESSPATTGA 972
Cdd:pfam05109  622 PKNATS---------AVTTGQHNITSSSTSSMSLRPSSiSETLSPSTS-DNSTSHMPLLTSAHPTGGENITQVTPASTST 691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  973 --LSTGSP-PRenpshPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGS 1049
Cdd:pfam05109  692 hhVSTSSPaPR-----PGTTSQASGPGNSSTSTKPGEVNVTKGTPP-KNATSPQAPSGQKTAVPTVTSTGGKANSTTGGK 765

                   ....*....
gi 2217272036 1050 PPSESPSRT 1058
Cdd:pfam05109  766 HTTGHGART 774
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
741-1056 6.42e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 54.02  E-value: 6.42e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  741 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 806
Cdd:PHA03307    64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  807 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGAtasSAISPSMSSA 886
Cdd:PHA03307   144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAAS---PRPPRRSSPI 213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  887 MSPTLDPPPSvtSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSES 965
Cdd:PHA03307   214 SASASSPAPA--PGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRP 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  966 SPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvl 1045
Cdd:PHA03307   283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP-- 359
                          330
                   ....*....|.
gi 2217272036 1046 STGSPPSESPS 1056
Cdd:PHA03307   360 ADPSSPRKRPR 370
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
740-1057 3.15e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.71  E-value: 3.15e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  740 KRKKLPEPPDSSSSNTGSGPRRGAhqnaqpCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPL-PAATSPGREYAAPG 818
Cdd:PHA03307    82 NESRSTPTWSLSTLAPASPAREGS------PTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPvGSPGPPPAASPPAA 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  819 TAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGAtasSAISPSMSSAMSPTLDPPPSvt 898
Cdd:PHA03307   156 GASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAAS---PRPPRRSSPISASASSPAPA-- 223
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  899 SQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSESSPATTGALSTGS 977
Cdd:PHA03307   224 PGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRPGPASSSSSPRER 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  978 PPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPpmknpSHPTAStlSMGLPPSRTPSHPTAtvLSTGSPPSESPSR 1057
Cdd:PHA03307   295 SPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS-----SSESSR--GAAVSPGPSPSRSPS--PSRPPPPADPSSP 365
PHA03247 PHA03247
large tegument protein UL36; Provisional
731-1056 3.40e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 3.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  731 SAGCRKGKHKRKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSP 810
Cdd:PHA03247  2656 PAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  811 GREYAAPGTaPEGlHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPS-FLPCPflgATASSAISPSMSSAMSP 889
Cdd:PHA03247  2736 PAAPAPPAV-PAG-PATPGGPARPARPPTT-------AGPPAPAPPAAPAAGPPrRLTRP---AVASLSESRESLPSPWD 2803
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  890 TLDPPPSVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQcvtgNNGSESSPAT 969
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP------PGPPPPSLPLGGSVAPGGDVR----RRPPSRSPAA 2873
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  970 TGALSTGSP----PRENPSHPTASaLSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSrtPSHPTATVL 1045
Cdd:PHA03247  2874 KPAAPARPPvrrlARPAVSRSTES-FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP--PLAPTTDPA 2950
                          330
                   ....*....|.
gi 2217272036 1046 STGSPPSESPS 1056
Cdd:PHA03247  2951 GAGEPSGAVPQ 2961
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-1055 3.99e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 3.99e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  747 PPDSSSSNTGSGPRRGAHQNAQPccpsAASSPHTSSPTFPPAAmvPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGLHG 826
Cdd:PHA03247  2592 PPQSARPRAPVDDRGDPRGPAPP----SPLPPDTHAPDPPPPS--PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR 2665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  827 LPLSEGLQPYPAFPfpyldtfmtvflPDPPVCPLLSPSFLPCPFLGatassaispsmssamsptlDPPPSvtsQRREEEK 906
Cdd:PHA03247  2666 RARRLGRAAQASSP------------PQRPRRRAARPTVGSLTSLA-------------------DPPPP---PPTPEPA 2711
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  907 WEAQSEGHPfitsrssSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNNGSESSPATTGALSTGSPPRENPSHP 986
Cdd:PHA03247  2712 PHALVSATP-------LPPGPAAARQASPALPAAP-----APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217272036  987 ----TASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPShPTATVLSTGSPPSESP 1055
Cdd:PHA03247  2780 prrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPPSLP 2851
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-1053 7.69e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 7.69e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  747 PPDSSSSNTGSGPRRGAhQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPgreyaAPGTAPEGLHG 826
Cdd:PHA03247  2742 PAVPAGPATPGGPARPA-RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP-----ADPPAAVLAPA 2815
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  827 LPLSEGLQPYPAFPFPyldtfmTVFLPDPPVCPllsPSFLPCPFlgATASSAISPSMSSAMSPTLDPPPSVTSQRREEEK 906
Cdd:PHA03247  2816 AALPPAASPAGPLPPP------TSAQPTAPPPP---PGPPPPSL--PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  907 WEAQSEghpfiTSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYQCVTGNNG---SESSPATTGALSTGSPPRENP 983
Cdd:PHA03247  2885 RLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprPQPPLAPTTDPAGAGEPSGAV 2959
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217272036  984 SHPTASALSTGSPPMKN----PSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSPPSE 1053
Cdd:PHA03247  2960 PQPWLGALVPGRVAVPRfrvpQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDD 3033
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
745-1058 7.85e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 7.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  745 PEPPDSSSSNTGSGPRRG-AHQNAQPCCPSAASSPHTSSPtFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:PHA03307   114 PDPPPPTPPPASPPPSPApDLSEMLRPVGSPGPPPAASPP-AAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAE 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  824 LHGLPLSEGLQPYPafpfPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPPS---VTSQ 900
Cdd:PHA03307   193 PPPSTPPAAASPRP----PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPApitLPTR 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  901 RREEEKWEAQSEGHPFITSRSSSPlqlnllqEEMPRPSESPDQMRRNTCPQTeyqcVTGNNGSESSPATTGALSTGSPPR 980
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPR-------ERSPSPSPSSPGSGPAPSSPR----ASSSSSSSRESSSSSTSSSSESSR 337
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217272036  981 ENPSHPtasalstGSPPMKNPSHPTASALSTGSPPMKN-PSHPTASTLSMGlPPSRTPSHPTATVLSTGSPPSESPSRT 1058
Cdd:PHA03307   338 GAAVSP-------GPSPSRSPSPSRPPPPADPSSPRKRpRPSRAPSSPAAS-AGRPTRRRARAAVAGRARRRDATGRFP 408
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1056 4.36e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.84  E-value: 4.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  744 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-HTSSPTFP-PAAMVPSQAPYLVPAFPLPAATSPGREYA-APGTA 820
Cdd:pfam03154  252 MTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPqPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQ 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  821 PEGLHGLPLSEglQPYPAFPFPyldtfMTVFLPDP--PVCPLLSPSFLPCPflgatasSAISPSMSSAMSPTLDPPPSVt 898
Cdd:pfam03154  332 SQLQSQQPPRE--QPLPPAPLS-----MPHIKPPPttPIPQLPNPQSHKHP-------PHLSGPSPFQMNSNLPPPPAL- 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  899 sqrreeEKWEAQSEGHPfiTSRSSSPLQLNLLQEEMPRPSESPDQMrrntcpqTEYQCVTGnngSESSPATTGALSTGSP 978
Cdd:pfam03154  397 ------KPLSSLSTHHP--PSAHPPPLQLMPQSQQLPPPPAQPPVL-------TQSQSLPP---PAASHPPTSGLHQVPS 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  979 preNPSHPTASALSTGSPPMKNPSHPTASALSTGS---PPMKNP---SHPTASTLSMGLPPSRTPSHPTATVLSTGS--P 1050
Cdd:pfam03154  459 ---QSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPgiqPPSSASvssSGPVPAAVSCPLPPVQIKEEALDEAEEPESppP 535

                   ....*.
gi 2217272036 1051 PSESPS 1056
Cdd:pfam03154  536 PPRSPS 541
PHA03247 PHA03247
large tegument protein UL36; Provisional
744-1056 5.85e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 5.85e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  744 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-------HTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAA 816
Cdd:PHA03247  2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  817 PGTAPE-----GLHGLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPlLSPSFLPCPFLGATASSAISPSMSSAMSPTL 891
Cdd:PHA03247  2704 PPPTPEpaphaLVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP-GGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  892 DPPPSVTSQrrEEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYQCVTGNNGSESSPATTG 971
Cdd:PHA03247  2783 LTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG 2860
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  972 ALSTGSPPRENPSHPTAsalstgsppmknPSHPTASALStgSPPMKNPSHPTASTlSMGLPPSRTPSHPTATVLSTGSPP 1051
Cdd:PHA03247  2861 DVRRRPPSRSPAAKPAA------------PARPPVRRLA--RPAVSRSTESFALP-PDQPERPPQPQAPPPPQPQPQPPP 2925

                   ....*
gi 2217272036 1052 SESPS 1056
Cdd:PHA03247  2926 PPQPQ 2930
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
745-1056 5.36e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 5.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  745 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLP-------AATSPGREYAAP 817
Cdd:pfam03154  185 SPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplqpmTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  818 GTAPEGLHGL--PLSEGLQ--------PYPAFPFPYLDTFMTVFLPDPPVCPLLSPSflpcpflgatassaispsmssAM 887
Cdd:pfam03154  265 PLPQPSLHGQmpPMPHSLQtgpshmqhPVPPQPFPLTPQSSQSQVPPGPSPAAPGQS---------------------QQ 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  888 SPTLDPPPSVTSQRR--EEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSE----SPDQMRRNTCPQTEYQCVTGNN 961
Cdd:pfam03154  324 RIHTPPSQSQLQSQQppREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHlsgpSPFQMNSNLPPPPALKPLSSLS 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  962 GSESSPATTGALSTGSPPRENPSHPTASALSTGSP--PMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPSH 1039
Cdd:pfam03154  404 THHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQslPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPT 483
                          330
                   ....*....|....*..
gi 2217272036 1040 PTATVLSTGSPPSESPS 1056
Cdd:pfam03154  484 STSSAMPGIQPPSSASV 500
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
745-1055 2.84e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 2.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  745 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGL 824
Cdd:PHA03307   129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  825 HGLPLSEG-LQPYPAFP----FPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPP--SV 897
Cdd:PHA03307   209 RSSPISASaSSPAPAPGrsaaDDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPasSS 288
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  898 TSQRREEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPrPSESPDQMRRNTCPQTeyqcvTGNNGSES-SPATTGALSTG 976
Cdd:PHA03307   289 SSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSS-SSTSSSSESSRGAAVS-----PGPSPSRSpSPSRPPPPADP 362
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217272036  977 SPPRENPshPTASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPtastlsmgLPPSRTPSHPTATVLSTGSPPSESP 1055
Cdd:PHA03307   363 SSPRKRP--RPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR--------FPAGRPRPSPLDAGAASGAFYARYP 431
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
273-373 3.20e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  273 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 352
Cdd:COG5805    169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                           90       100
                   ....*....|....*....|..
gi 2217272036  353 DSSWSSFVNP-WSRKISFIIGR 373
Cdd:COG5805    245 EAVIVPLIDTdGSVKGILVILR 266
PHA03379 PHA03379
EBNA-3A; Provisional
745-1055 9.26e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 40.43  E-value: 9.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  745 PEPPDSSSSNTGSGPRRGAHQN-AQPCCPSAASSPHTSSPTfPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:PHA03379   425 PEVPQSLETATSHGSAQVPEPPpVHDLEPGPLHDQHSMAPC-PVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPAG 503
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  824 LHGLPLSEGLQPYPAFPF-PYLDTFMTV-FLPDP------PVCPLLSPSFLPCPflGATASSAISPSMSSAMSPTLDPPP 895
Cdd:PHA03379   504 PIVRPWEASLSQVPGVAFaPVMPQPMPVePVPVPtvalerPVCPAPPLIAMQGP--GETSGIVRVRERWRPAPWTPNPPR 581
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  896 SVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNLL--QEEMPRPSEsPDQMRRNTCPQTEYQCVTGNNG----------- 962
Cdd:PHA03379   582 SPSQMSVRDRLARLRAEAQPYQASVEVQPPQLTQVspQQPMEYPLE-PEQQMFPGSPFSQVADVMRAGGvpamqpqyfdl 660
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  963 SESSPATTGALST-------GSPPR--ENPSH---PTASALSTGSP--------PMKNPSHPtASALSTGSPPMKNPSHP 1022
Cdd:PHA03379   661 PLQQPISQGAPLAplrasmgPVPPVpaTQPQYfdiPLTEPINQGASaahflpqqPMEGPLVP-ERWMFQGATLSQSVRPG 739
                          330       340       350
                   ....*....|....*....|....*....|...
gi 2217272036 1023 TASTLSMGLPPSRTPSHPTATVLSTGSPPSESP 1055
Cdd:PHA03379   740 VAQSQYFDLPLTQPINHGAPAAHFLHQPPMEGP 772
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
751-1008 9.52e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 40.24  E-value: 9.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  751 SSSNTGSGPRRGAHQN-AQPCCPSAASSPHTSSPTFPPA--AMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGLHGL 827
Cdd:PRK12323   366 GQSGGGAGPATAAAAPvAQPAPAAAAPAAAAPAPAAPPAapAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPG 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  828 PLSEGLQPYPAFPFPyldtfmtvfLPDPPVCPLLSPSFLpcpflgATASSAISPSMSSAMSPTLDPPPsvtsqrreeekW 907
Cdd:PRK12323   446 GAPAPAPAPAAAPAA---------AARPAAAGPRPVAAA------AAAAPARAAPAAAPAPADDDPPP-----------W 499
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  908 EAQSEGHPFITSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQteyqcvtgnngsESSPATTGALSTGSPPRENPSHPT 987
Cdd:PRK12323   500 EELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAP------------APAAAPAPRAAAATEPVVAPRPPR 567
                          250       260
                   ....*....|....*....|.
gi 2217272036  988 ASAlsTGSPPMKNPSHPTASA 1008
Cdd:PRK12323   568 ASA--SGLPDMFDGDWPALAA 586
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
958-1058 9.86e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 40.07  E-value: 9.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217272036  958 TGNNGSESSPATTGALSTGSPprENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHptastlSMGLPPSrTP 1037
Cdd:PLN02217   563 AGNPGSTNSTPTGSAASSNTT--FSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPAS------HLGSPST-TP 633
                           90       100
                   ....*....|....*....|.
gi 2217272036 1038 SHPTATVLSTGSpPSESPSRT 1058
Cdd:PLN02217   634 SSPESSIKVAST-ETASPESS 653
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH