NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907114278|ref|XP_036015493|]
View 

protein FAM186A isoform X1 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1228-1632 6.79e-16

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 85.38  E-value: 6.79e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1228 QALGITPTPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPEQTQAL 1306
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHALV 2716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1307 GITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPqPITLTPEQAQALGITPTPQPITLTPEQAQALGITP 1386
Cdd:PHA03247  2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP-PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1387 TPQPITLTPEQAQALGITPTPqpiTLTPEQTQALGITPTPQPITLTPEQaqalgiTPTPQPITLTPEQVQALGITPTPQP 1466
Cdd:PHA03247  2796 ESLPSPWDPADPPAAVLAPAA---ALPPAASPAGPLPPPTSAQPTAPPP------PPGPPPPSLPLGGSVAPGGDVRRRP 2866
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1467 itlTPEQAQALGITPTPQPITLTPEQAqalgITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLT 1546
Cdd:PHA03247  2867 ---PSRSPAAKPAAPARPPVRRLARPA----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1547 PelvqalgitPTPQPITLTPEQAQALGITPTPQPTTLSPEQaqalgitpTPQPITLTPEQAQALgitPTPQPTTLSPEQA 1626
Cdd:PHA03247  2940 Q---------PPLAPTTDPAGAGEPSGAVPQPWLGALVPGR--------VAVPRFRVPQPAPSR---EAPASSTPPLTGH 2999

                   ....*.
gi 1907114278 1627 QALGIS 1632
Cdd:PHA03247  3000 SLSRVS 3005
PHA03247 super family cl33720
large tegument protein UL36; Provisional
2430-2775 2.71e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 2.71e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2430 QPAQAFPSPFTLEKPATLATSTDRLSQRWKDSYPASIPLQALRPSPTQAPFTPTTSLgigslldsekpwmsptyrqtltd 2509
Cdd:PHA03247  2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL----------------------- 2698
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2510 rgqdvlAQPLAPETPPSLRQLLAPGAPPTPGPPLGPRHFFKPrvpPTSGEVPGLVSGGSAaheeLPMSRTTPLQPPEWQG 2589
Cdd:PHA03247  2699 ------ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA---LPAAPAPPAVPAGPA----TPGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2590 PSRLIPEQGfmPAISSIPLHPFTAEALPTPGRPQRSSKAKPLKPKSARGLPNVTLGFETSQA-PFPIEKTQIPKTPDTSE 2668
Cdd:PHA03247  2766 PPAPAPPAA--PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPP 2843
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2669 QTQALQDALG--VQPFGIF-------QPYGTSSGIARSQSPLIDEKALSREKPGTPLPSLTTQLPQTPQISTSEKGQ--- 2736
Cdd:PHA03247  2844 GPPPPSLPLGgsVAPGGDVrrrppsrSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpqp 2923
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114278 2737 ------KPWLPPIDKPWTPTPVSSTREAKMIVSPTDQHPEDGYVV 2775
Cdd:PHA03247  2924 ppppqpQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV 2968
MRS6 super family cl34879
Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion];
25-255 5.55e-04

Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion];


The actual alignment was detected with superfamily member COG5043:

Pssm-ID: 227376 [Multi-domain]  Cd Length: 2552  Bit Score: 46.03  E-value: 5.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278   25 LATLRKPISNRDNPL-SLQFEIPASV---QSVIHKIEESHIFRAKEEVIWRLTEIMSNVELIMTRYNIDS---MSPGRKG 97
Cdd:COG5043     47 LDKLGLPIEVTSGLIgTLTLEIPWSSlknKPVEIYIEDIYLLISPQAKNSLTREELPQSQQALKQRQLDSweiLRETLEE 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278   98 SSSESQKKKRKAFLEKIATVMT--------NVDLR-ERTLSKILSWLEEWNLILSEVSAINMDDyyHWT-----VKMELI 163
Cdd:COG5043    127 SSSSPNISRKQSFIESLITKLIdniqiyieDIHLRfEDNLSADLEGPYSFGLTLYSLRATSTDA--SWTeyfvsTDSSCI 204
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278  164 -------------PDTLKRISK-NVDSLIQMALLLVEEKKRAKKR-ILA--RGTLWkawkdRAIKRPATAQALRLD-QMI 225
Cdd:COG5043    205 hklitldyfsiywCEISPCITTeDIDSYLENFQPMIAEKSPAYNEyILKpvRGTAK-----VSINKLPTDEIPRLRgQLS 279
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1907114278  226 FDQIGL---NAKVSEIQGMLQEL---IGTAMFSKLE 255
Cdd:COG5043    280 VEEFSIslsDHMYYSLLGVLDYLqvvMKQQKFLKYR 315
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
1228-1632 6.79e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 85.38  E-value: 6.79e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1228 QALGITPTPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPEQTQAL 1306
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHALV 2716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1307 GITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPqPITLTPEQAQALGITPTPQPITLTPEQAQALGITP 1386
Cdd:PHA03247  2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP-PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1387 TPQPITLTPEQAQALGITPTPqpiTLTPEQTQALGITPTPQPITLTPEQaqalgiTPTPQPITLTPEQVQALGITPTPQP 1466
Cdd:PHA03247  2796 ESLPSPWDPADPPAAVLAPAA---ALPPAASPAGPLPPPTSAQPTAPPP------PPGPPPPSLPLGGSVAPGGDVRRRP 2866
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1467 itlTPEQAQALGITPTPQPITLTPEQAqalgITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLT 1546
Cdd:PHA03247  2867 ---PSRSPAAKPAAPARPPVRRLARPA----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1547 PelvqalgitPTPQPITLTPEQAQALGITPTPQPTTLSPEQaqalgitpTPQPITLTPEQAQALgitPTPQPTTLSPEQA 1626
Cdd:PHA03247  2940 Q---------PPLAPTTDPAGAGEPSGAVPQPWLGALVPGR--------VAVPRFRVPQPAPSR---EAPASSTPPLTGH 2999

                   ....*.
gi 1907114278 1627 QALGIS 1632
Cdd:PHA03247  3000 SLSRVS 3005
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1329-1628 2.79e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 66.33  E-value: 2.79e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1329 PTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPItltPEQAQALGITPTPQ 1408
Cdd:pfam03154  180 AASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRL---PSPHPPLQPMTQPP 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1409 PITLTPEQtqalgitPTPQPITLTPeqaqalgITPTPQPITLTPEQVQALGitpTPQPITLTPEQAQALGitpTPQPITL 1488
Cdd:pfam03154  257 PPSQVSPQ-------PLPQPSLHGQ-------MPPMPHSLQTGPSHMQHPV---PPQPFPLTPQSSQSQV---PPGPSPA 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1489 TPEQAQALGITPTPQPITLTPEQTQALGITPTPQPIT-LTPEQAQALGITPTPQPITLTPELVqalGITPTPQPITLTPE 1567
Cdd:pfam03154  317 APGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPhIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQMNSNLPPP 393
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907114278 1568 QA-QALGITPTPQPTTLSPEQAQALgitptPQPITLTPEQAQALGITPTPqptTLSPEQAQA 1628
Cdd:pfam03154  394 PAlKPLSSLSTHHPPSAHPPPLQLM-----PQSQQLPPPPAQPPVLTQSQ---SLPPPAASH 447
PHA03247 PHA03247
large tegument protein UL36; Provisional
2430-2775 2.71e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 2.71e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2430 QPAQAFPSPFTLEKPATLATSTDRLSQRWKDSYPASIPLQALRPSPTQAPFTPTTSLgigslldsekpwmsptyrqtltd 2509
Cdd:PHA03247  2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL----------------------- 2698
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2510 rgqdvlAQPLAPETPPSLRQLLAPGAPPTPGPPLGPRHFFKPrvpPTSGEVPGLVSGGSAaheeLPMSRTTPLQPPEWQG 2589
Cdd:PHA03247  2699 ------ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA---LPAAPAPPAVPAGPA----TPGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2590 PSRLIPEQGfmPAISSIPLHPFTAEALPTPGRPQRSSKAKPLKPKSARGLPNVTLGFETSQA-PFPIEKTQIPKTPDTSE 2668
Cdd:PHA03247  2766 PPAPAPPAA--PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPP 2843
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2669 QTQALQDALG--VQPFGIF-------QPYGTSSGIARSQSPLIDEKALSREKPGTPLPSLTTQLPQTPQISTSEKGQ--- 2736
Cdd:PHA03247  2844 GPPPPSLPLGgsVAPGGDVrrrppsrSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpqp 2923
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114278 2737 ------KPWLPPIDKPWTPTPVSSTREAKMIVSPTDQHPEDGYVV 2775
Cdd:PHA03247  2924 ppppqpQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV 2968
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
1264-1615 7.62e-05

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 48.38  E-value: 7.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1264 QTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQA 1343
Cdd:cd22540    122 TNQQYQISPQIQAAGQINNSGQIQIIPGTNQAIITPVQVLQQPQQAHKPVPIKPAPLQTSNTNSASLQVPGNVIKLQSGG 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1344 LGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPitlTPEQAQALGITPTPQPI-----TLTPEQTQ 1418
Cdd:cd22540    202 NVALTLPVNNLVGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVT---VAEQVETVLIETTADNIiqagnNLLIVQSP 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1419 ALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQVQALGITP-----TPQPITLTPEQAQALGITPTPQPITLTPEQA 1493
Cdd:cd22540    279 GTGQPAVLQQVQVLQPKQEQQVVQIPQQALRVVQAASATLPTVPqkplqNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEA 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1494 QALGITPTPQPITLTPEQTQALGiTPTPQPITLTPEQAQALGITPTPQPITLTPELVQALGITPTPQPITLTPEQAQALG 1573
Cdd:cd22540    359 PAATATPSSSTSTVQQQVTANNG-TGTSKPNYNVRKERTLPKIAPAGGIISLNAAQLAAAAQAIQTININGVQVQGVPVT 437
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114278 1574 ITPTPQPTTLSPEQAQALGIT---PTPQPITLTPEQAQALGITPT 1615
Cdd:cd22540    438 ITNAGGQQQLTVQTVSSNNLTisgLSPTQIQLQMEQALEIETQPG 482
MRS6 COG5043
Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion];
25-255 5.55e-04

Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion];


Pssm-ID: 227376 [Multi-domain]  Cd Length: 2552  Bit Score: 46.03  E-value: 5.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278   25 LATLRKPISNRDNPL-SLQFEIPASV---QSVIHKIEESHIFRAKEEVIWRLTEIMSNVELIMTRYNIDS---MSPGRKG 97
Cdd:COG5043     47 LDKLGLPIEVTSGLIgTLTLEIPWSSlknKPVEIYIEDIYLLISPQAKNSLTREELPQSQQALKQRQLDSweiLRETLEE 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278   98 SSSESQKKKRKAFLEKIATVMT--------NVDLR-ERTLSKILSWLEEWNLILSEVSAINMDDyyHWT-----VKMELI 163
Cdd:COG5043    127 SSSSPNISRKQSFIESLITKLIdniqiyieDIHLRfEDNLSADLEGPYSFGLTLYSLRATSTDA--SWTeyfvsTDSSCI 204
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278  164 -------------PDTLKRISK-NVDSLIQMALLLVEEKKRAKKR-ILA--RGTLWkawkdRAIKRPATAQALRLD-QMI 225
Cdd:COG5043    205 hklitldyfsiywCEISPCITTeDIDSYLENFQPMIAEKSPAYNEyILKpvRGTAK-----VSINKLPTDEIPRLRgQLS 279
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1907114278  226 FDQIGL---NAKVSEIQGMLQEL---IGTAMFSKLE 255
Cdd:COG5043    280 VEEFSIslsDHMYYSLLGVLDYLqvvMKQQKFLKYR 315
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
1228-1632 6.79e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 85.38  E-value: 6.79e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1228 QALGITPTPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPEQTQAL 1306
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHALV 2716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1307 GITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPqPITLTPEQAQALGITPTPQPITLTPEQAQALGITP 1386
Cdd:PHA03247  2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP-PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1387 TPQPITLTPEQAQALGITPTPqpiTLTPEQTQALGITPTPQPITLTPEQaqalgiTPTPQPITLTPEQVQALGITPTPQP 1466
Cdd:PHA03247  2796 ESLPSPWDPADPPAAVLAPAA---ALPPAASPAGPLPPPTSAQPTAPPP------PPGPPPPSLPLGGSVAPGGDVRRRP 2866
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1467 itlTPEQAQALGITPTPQPITLTPEQAqalgITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLT 1546
Cdd:PHA03247  2867 ---PSRSPAAKPAAPARPPVRRLARPA----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1547 PelvqalgitPTPQPITLTPEQAQALGITPTPQPTTLSPEQaqalgitpTPQPITLTPEQAQALgitPTPQPTTLSPEQA 1626
Cdd:PHA03247  2940 Q---------PPLAPTTDPAGAGEPSGAVPQPWLGALVPGR--------VAVPRFRVPQPAPSR---EAPASSTPPLTGH 2999

                   ....*.
gi 1907114278 1627 QALGIS 1632
Cdd:PHA03247  3000 SLSRVS 3005
PHA03247 PHA03247
large tegument protein UL36; Provisional
1234-1635 2.97e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 83.06  E-value: 2.97e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1234 PTPQPITLTPEQAqaLGITPTPQPTTLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQ 1313
Cdd:PHA03247  2569 PPPRPAPRPSEPA--VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1314 PITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPEQAQALGITPTPQPIT 1392
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHALVSATPLPPGPA 2726
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1393 LTPEQAQALGITPTPQPITLTPEQTQALGITPTPqPITLTPEQAQALGITPTPQPITLTPEQVQALGITPTPQPITLTPE 1472
Cdd:PHA03247  2727 AARQASPALPAAPAPPAVPAGPATPGGPARPARP-PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1473 QAQALGITPTPqpiTLTPEQAQALGITPTPQPITLTPEQTqalgitPTPQPITLTPEQAQALG----ITPTPQPITLTPE 1548
Cdd:PHA03247  2806 DPPAAVLAPAA---ALPPAASPAGPLPPPTSAQPTAPPPP------PGPPPPSLPLGGSVAPGgdvrRRPPSRSPAAKPA 2876
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1549 L--------VQALGITPTPQPITLTPEQAQALGITPTPQPTTLSPEQAQAlgitPTPQPITLTPEQAQalgitPTPQPTT 1620
Cdd:PHA03247  2877 AparppvrrLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPP----PQPQPPPPPPPRPQ-----PPLAPTT 2947
                          410
                   ....*....|....*
gi 1907114278 1621 LSPEQAQALGISLIP 1635
Cdd:PHA03247  2948 DPAGAGEPSGAVPQP 2962
PHA03247 PHA03247
large tegument protein UL36; Provisional
1272-1649 1.62e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 1.62e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1272 PTPQPITLTPEQA-QALGITPTPQPITLTPeqtQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTP 1350
Cdd:PHA03247  2569 PPPRPAPRPSEPAvTSRARRPDAPPQSARP---RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1351 QPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPEQTQALGITPTPQPI 1429
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHALVSATPLPPGP 2725
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1430 TLTPEQAQALGITPTPQPITLTPEQVQALGITPTPqPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTP 1509
Cdd:PHA03247  2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARP-PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP 2804
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1510 EQTQALGITPTPqpiTLTPEQAQALGITPTPQPITLTPELvqalgiTPTPQPITLTPEQAQALG---------ITPTPQP 1580
Cdd:PHA03247  2805 ADPPAAVLAPAA---ALPPAASPAGPLPPPTSAQPTAPPP------PPGPPPPSLPLGGSVAPGgdvrrrppsRSPAAKP 2875
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907114278 1581 TTLSPEQAQALG---ITPTPQPITLTPEQAQALGITPTPQPTTLSPEQAQALGISLIPKQQEISLSPEQAQA 1649
Cdd:PHA03247  2876 AAPARPPVRRLArpaVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1329-1628 2.79e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 66.33  E-value: 2.79e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1329 PTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPItltPEQAQALGITPTPQ 1408
Cdd:pfam03154  180 AASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRL---PSPHPPLQPMTQPP 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1409 PITLTPEQtqalgitPTPQPITLTPeqaqalgITPTPQPITLTPEQVQALGitpTPQPITLTPEQAQALGitpTPQPITL 1488
Cdd:pfam03154  257 PPSQVSPQ-------PLPQPSLHGQ-------MPPMPHSLQTGPSHMQHPV---PPQPFPLTPQSSQSQV---PPGPSPA 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1489 TPEQAQALGITPTPQPITLTPEQTQALGITPTPQPIT-LTPEQAQALGITPTPQPITLTPELVqalGITPTPQPITLTPE 1567
Cdd:pfam03154  317 APGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPhIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQMNSNLPPP 393
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907114278 1568 QA-QALGITPTPQPTTLSPEQAQALgitptPQPITLTPEQAQALGITPTPqptTLSPEQAQA 1628
Cdd:pfam03154  394 PAlKPLSSLSTHHPPSAHPPPLQLM-----PQSQQLPPPPAQPPVLTQSQ---SLPPPAASH 447
PHA03247 PHA03247
large tegument protein UL36; Provisional
1227-1549 1.27e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 1.27e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1227 AQALGITPTPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTPQPiTLTPEQAQALGITPTPQPITLTPEQTQAL 1306
Cdd:PHA03247  2732 SPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASLSESRESLPSPWDPADPPAA 2810
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1307 GITPTPqpiTLTPEQAQALGITPTPQPITLTPEQtqalgiTPTPQPITLTPEQAQALGITPTPQPitlTPEQAQALGITP 1386
Cdd:PHA03247  2811 VLAPAA---ALPPAASPAGPLPPPTSAQPTAPPP------PPGPPPPSLPLGGSVAPGGDVRRRP---PSRSPAAKPAAP 2878
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1387 TPQPITLTPEQAqalgITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQAlgitPTPQPITLTPEQVQalgitPTPQP 1466
Cdd:PHA03247  2879 ARPPVRRLARPA----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPP----PQPQPPPPPPPRPQ-----PPLAP 2945
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1467 ITLTPEQAQALGITPTPQPITLTPEQAQAL-GITPTPQPITLTPEQTQAlgiTPTPQPITLTPEQAQ--ALGITPTPQPI 1543
Cdd:PHA03247  2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPASSTP---PLTGHSLSRVSSWASslALHEETDPPPV 3022

                   ....*.
gi 1907114278 1544 TLTPEL 1549
Cdd:PHA03247  3023 SLKQTL 3028
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1367-1656 1.82e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 63.63  E-value: 1.82e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1367 PTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPItltPEQAQALGITPTPQ 1446
Cdd:pfam03154  180 AASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRL---PSPHPPLQPMTQPP 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1447 PITLTPEQvqalgitPTPQPITLTPeqaqalgITPTPQPITLTPEQAQALGitpTPQPITLTPEQTQALGitpTPQPITL 1526
Cdd:pfam03154  257 PPSQVSPQ-------PLPQPSLHGQ-------MPPMPHSLQTGPSHMQHPV---PPQPFPLTPQSSQSQV---PPGPSPA 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1527 TPEQAQALGITPTPQPITLTPELVQALGITPTPQPIT-LTPEQAQALGITPTPQPTTLSPEQAqalGITPTPQPITLTPE 1605
Cdd:pfam03154  317 APGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPhIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQMNSNLPPP 393
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907114278 1606 QAqalgITPTPQPTTLSPEQAQALGISLIPKQQEISLSPeqAQALGLTLTP 1656
Cdd:pfam03154  394 PA----LKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPP--AQPPVLTQSQ 438
PHA03247 PHA03247
large tegument protein UL36; Provisional
1158-1507 3.19e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.42  E-value: 3.19e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1158 PTPQPITFTPEQTQALGITPTPqlitLTPEQAKALANTLTAEQVSLSPQQAEALGITPTPQPTTLTP---EQAQALGITP 1234
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLP----PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTagpPAPAPPAAPA 2776
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1235 TPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTP-QPITLTPEQAQALGITPTPQPITLTPEqtqalgitptPQ 1313
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAaLPPAASPAGPLPPPTSAQPTAPPPPPG----------PP 2846
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1314 PITLTPEQAQALGITPTPQPitlTPEQTQALGITPTPQPITLTPEQAqalgITPTPQPITLTPEQAQALGITPTPQPITL 1393
Cdd:PHA03247  2847 PPSLPLGGSVAPGGDVRRRP---PSRSPAAKPAAPARPPVRRLARPA----VSRSTESFALPPDQPERPPQPQAPPPPQP 2919
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1394 TPEQAQAlgitPTPQPITLTPEQTQalgitPTPQPITLTPEQAQALGITPTPQPITLTPEQVQAL-GITPTPQPITLTPE 1472
Cdd:PHA03247  2920 QPQPPPP----PQPQPPPPPPPRPQ-----PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrFRVPQPAPSREAPA 2990
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 1907114278 1473 QAQAlgiTPTPQPITLTPEQAQ--ALGITPTPQPITL 1507
Cdd:PHA03247  2991 SSTP---PLTGHSLSRVSSWASslALHEETDPPPVSL 3024
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1265-1624 5.31e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 62.09  E-value: 5.31e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1265 TQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQAL 1344
Cdd:pfam03154  192 TQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSL 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1345 --GITPTPQPITLTPEQAQALGitpTPQPITLTPEQAQALGitpTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGI 1422
Cdd:pfam03154  272 hgQMPPMPHSLQTGPSHMQHPV---PPQPFPLTPQSSQSQV---PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPL 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1423 TPTPQPIT-LTPEQAQALGITPTPQPITLTPeqvQALGITPTPQPITLTPEQA-QALGITPTPQPITLTPEQAQalgITP 1500
Cdd:pfam03154  346 PPAPLSMPhIKPPPTTPIPQLPNPQSHKHPP---HLSGPSPFQMNSNLPPPPAlKPLSSLSTHHPPSAHPPPLQ---LMP 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1501 TPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPELVQAlgitPTPQPITLTPEQAQALGITPTPQP 1580
Cdd:pfam03154  420 QSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGG----PPPITPPSGPPTSTSSAMPGIQPP 495
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1907114278 1581 TTLSPEQAQALGITPT----PQPITLTPEQAQALGITPTPQPTTLSPE 1624
Cdd:pfam03154  496 SSASVSSSGPVPAAVScplpPVQIKEEALDEAEEPESPPPPPRSPSPE 543
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1263-1649 5.15e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 55.35  E-value: 5.15e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1263 EQTQALGITPTPQPITLTpEQAQALGITPTPQPITLTPEQTQAlgITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQ 1342
Cdd:pfam17823   56 EQ*NFCAATAAPAPVTLT-KGTSAAHLNSTEVTAEHTPHGTDL--SEPATREGAADGAASRALAAAASSSPSSAAQSLPA 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1343 ALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGI 1422
Cdd:pfam17823  133 AIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGI 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1423 TpTPQPITLTPEQAQALGITPTPQPITLTPEQVqalgiTPTPQPITLTPEQAQALGITPTPQPITLTPEQAQalgitpTP 1502
Cdd:pfam17823  213 S-TAATATGHPAAGTALAAVGNSSPAAGTVTAA-----VGTVTPAALATLAAAAGTVASAAGTINMGDPHAR------RL 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1503 QPITLTPEQTQALGITPTPQPitltpeQAQAlgitPTPQPITLTPELVQALGITPTPQPITLTPEQAQALGITPTPQPTT 1582
Cdd:pfam17823  281 SPAKHMPSDTMARNPAAPMGA------QAQG----PIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTT 350
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907114278 1583 lspEQAQALGITPTPQPITLTPEQAQALGITPTPQPTTLSPEQ-AQALGISLIPKQQEISLSPEQAQA 1649
Cdd:pfam17823  351 ---TKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQgAAGPGILLAPEQVATEATAGTASA 415
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1158-1531 4.33e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.85  E-value: 4.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1158 PTPQPITFTPEQTQALGITPTPQLITLTPEQAKALANTLTAEQVSLSPQQAEALGITPTPQPTTLTPEQAQALGITPTPQ 1237
Cdd:pfam03154  180 AASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPS 259
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1238 PITLTPEQAQALGITPTPQPTTLTPEQTQALGITPtPQPITLTPEQAQALGitpTPQPITLTPEQTQALGITPTPQPITL 1317
Cdd:pfam03154  260 QVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVP-PQPFPLTPQSSQSQV---PPGPSPAAPGQSQQRIHTPPSQSQLQ 335
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1318 TPEQAQALGITPTPQPIT-LTPEQTQALGITPTPQPITLTPEQAqalGITPTPQPITLTPEQA-QALGITPT-------P 1388
Cdd:pfam03154  336 SQQPPREQPLPPAPLSMPhIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQMNSNLPPPPAlKPLSSLSThhppsahP 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1389 QPITLTPeQAQALGITPTpQPITLTPEQTQALGITPTPQPITLTPEQAQA----LGITPTPQPITLTPEQVQAlgITPTP 1464
Cdd:pfam03154  413 PPLQLMP-QSQQLPPPPA-QPPVLTQSQSLPPPAASHPPTSGLHQVPSQSpfpqHPFVPGGPPPITPPSGPPT--STSSA 488
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1465 QPITLTPEQAQALGITPTPQPIT--LTPEQAQALGITPTPQPITLTPEQTqalgiTPTPQP-ITLTPEQA 1531
Cdd:pfam03154  489 MPGIQPPSSASVSSSGPVPAAVScpLPPVQIKEEALDEAEEPESPPPPPR-----SPSPEPtVVNTPSHA 553
PHA03247 PHA03247
large tegument protein UL36; Provisional
2430-2775 2.71e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 2.71e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2430 QPAQAFPSPFTLEKPATLATSTDRLSQRWKDSYPASIPLQALRPSPTQAPFTPTTSLgigslldsekpwmsptyrqtltd 2509
Cdd:PHA03247  2642 PPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL----------------------- 2698
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2510 rgqdvlAQPLAPETPPSLRQLLAPGAPPTPGPPLGPRHFFKPrvpPTSGEVPGLVSGGSAaheeLPMSRTTPLQPPEWQG 2589
Cdd:PHA03247  2699 ------ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA---LPAAPAPPAVPAGPA----TPGGPARPARPPTTAG 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2590 PSRLIPEQGfmPAISSIPLHPFTAEALPTPGRPQRSSKAKPLKPKSARGLPNVTLGFETSQA-PFPIEKTQIPKTPDTSE 2668
Cdd:PHA03247  2766 PPAPAPPAA--PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPP 2843
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2669 QTQALQDALG--VQPFGIF-------QPYGTSSGIARSQSPLIDEKALSREKPGTPLPSLTTQLPQTPQISTSEKGQ--- 2736
Cdd:PHA03247  2844 GPPPPSLPLGgsVAPGGDVrrrppsrSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpqp 2923
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114278 2737 ------KPWLPPIDKPWTPTPVSSTREAKMIVSPTDQHPEDGYVV 2775
Cdd:PHA03247  2924 ppppqpQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV 2968
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1381-1647 2.78e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 49.91  E-value: 2.78e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1381 ALGITPTPQPITLTPEQA----QALGITPTPQPITLTPeqtqALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQVQ 1456
Cdd:pfam05109  395 GLGTAPKTLIITRTATNAttttHKVIFSKAPESTTTSP----TLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVST 470
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1457 ALGITPTPQPITltpeqAQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQ---- 1532
Cdd:pfam05109  471 ADVTSPTPAGTT-----SGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKtspt 545
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1533 ALGITPTPQPITLTPELVqalgiTPTPQPITLTPEQAQALGITPTPQPTTLSPEQAQ----------ALGITPTPQPITL 1602
Cdd:pfam05109  546 SAVTTPTPNATSPTPAVT-----TPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGEtspqanttnhTLGGTSSTPVVTS 620
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907114278 1603 TPEQAQALGITPTPQPTTLSPEQaqalgISLIPKQQEISLSPEQA 1647
Cdd:pfam05109  621 PPKNATSAVTTGQHNITSSSTSS-----MSLRPSSISETLSPSTS 660
PHA03378 PHA03378
EBNA-3B; Provisional
1351-1637 3.26e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 49.68  E-value: 3.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1351 QPITLTPEQAQALgitPTPQPitlTPEQAQALGITPTPQPITLTPEQAQalgiTPTPQPitlTPEQTQALGITPTPQPIT 1430
Cdd:PHA03378   552 EPASTEPVHDQLL---PAPGL---GPLQIQPLTSPTTSQLASSAPSYAQ----TPWPVP---HPSQTPEPPTTQSHIPET 618
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1431 LTPEQaqalgiTPTP-QPITLTPEQVQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITpTPQPITLTP 1509
Cdd:PHA03378   619 SAPRQ------WPMPlRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGAN-TMLPIQWAP 691
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1510 EQTQALGITPTP-QPITLTPEQAQALGITPTP-QPITLTPELVQALGITPTPQPitlTPEQAQALGITPTPQPTTLSPEQ 1587
Cdd:PHA03378   692 GTMQPPPRAPTPmRPPAAPPGRAQRPAAATGRaRPPAAAPGRARPPAAAPGRAR---PPAAAPGRARPPAAAPGRARPPA 768
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1588 AQALGITPTPQPITLTPEQAQALGiTPTPQPttlsPEQAQALGISLIPKQ 1637
Cdd:PHA03378   769 AAPGAPTPQPPPQAPPAPQQRPRG-APTPQP----PPQAGPTSMQLMPRA 813
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
1264-1615 7.62e-05

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 48.38  E-value: 7.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1264 QTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQA 1343
Cdd:cd22540    122 TNQQYQISPQIQAAGQINNSGQIQIIPGTNQAIITPVQVLQQPQQAHKPVPIKPAPLQTSNTNSASLQVPGNVIKLQSGG 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1344 LGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGITPTPQPitlTPEQAQALGITPTPQPI-----TLTPEQTQ 1418
Cdd:cd22540    202 NVALTLPVNNLVGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVT---VAEQVETVLIETTADNIiqagnNLLIVQSP 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1419 ALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQVQALGITP-----TPQPITLTPEQAQALGITPTPQPITLTPEQA 1493
Cdd:cd22540    279 GTGQPAVLQQVQVLQPKQEQQVVQIPQQALRVVQAASATLPTVPqkplqNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEA 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1494 QALGITPTPQPITLTPEQTQALGiTPTPQPITLTPEQAQALGITPTPQPITLTPELVQALGITPTPQPITLTPEQAQALG 1573
Cdd:cd22540    359 PAATATPSSSTSTVQQQVTANNG-TGTSKPNYNVRKERTLPKIAPAGGIISLNAAQLAAAAQAIQTININGVQVQGVPVT 437
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1907114278 1574 ITPTPQPTTLSPEQAQALGIT---PTPQPITLTPEQAQALGITPT 1615
Cdd:cd22540    438 ITNAGGQQQLTVQTVSSNNLTisgLSPTQIQLQMEQALEIETQPG 482
PHA03247 PHA03247
large tegument protein UL36; Provisional
1319-1661 5.07e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 5.07e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1319 PEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEqaqALGITPTPQPIT----LTPEQAQALGITPTPQPITLT 1394
Cdd:PHA03247  2484 AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDE---PVGEPVHPRMLTwirgLEELASDDAGDPPPPLPPAAP 2560
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1395 PeqAQALGITPTPQPITLTPE-QTQALGITPTPQPITLTPeqaQALGITPTPQPITLTPEQVQALGITPTPQPITLTPEQ 1473
Cdd:PHA03247  2561 P--AAPDRSVPPPRPAPRPSEpAVTSRARRPDAPPQSARP---RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA 2635
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1474 AQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGIT-PTPQPITLTPELVQA 1552
Cdd:PHA03247  2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAdPPPPPPTPEPAPHAL 2715
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1553 LGITPTPQPITLTPEQAQALGITPTPQPTTLSPeqaqALGITPTPQPITLTPeqAQALGITPTPQPTTLSPEQAQALGIS 1632
Cdd:PHA03247  2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGP----ATPGGPARPARPPTT--AGPPAPAPPAAPAAGPPRRLTRPAVA 2789
                          330       340
                   ....*....|....*....|....*....
gi 1907114278 1633 LIPKQQEISLSPEQAQALGLTLTPQQAQV 1661
Cdd:PHA03247  2790 SLSESRESLPSPWDPADPPAAVLAPAAAL 2818
MRS6 COG5043
Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion];
25-255 5.55e-04

Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion];


Pssm-ID: 227376 [Multi-domain]  Cd Length: 2552  Bit Score: 46.03  E-value: 5.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278   25 LATLRKPISNRDNPL-SLQFEIPASV---QSVIHKIEESHIFRAKEEVIWRLTEIMSNVELIMTRYNIDS---MSPGRKG 97
Cdd:COG5043     47 LDKLGLPIEVTSGLIgTLTLEIPWSSlknKPVEIYIEDIYLLISPQAKNSLTREELPQSQQALKQRQLDSweiLRETLEE 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278   98 SSSESQKKKRKAFLEKIATVMT--------NVDLR-ERTLSKILSWLEEWNLILSEVSAINMDDyyHWT-----VKMELI 163
Cdd:COG5043    127 SSSSPNISRKQSFIESLITKLIdniqiyieDIHLRfEDNLSADLEGPYSFGLTLYSLRATSTDA--SWTeyfvsTDSSCI 204
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278  164 -------------PDTLKRISK-NVDSLIQMALLLVEEKKRAKKR-ILA--RGTLWkawkdRAIKRPATAQALRLD-QMI 225
Cdd:COG5043    205 hklitldyfsiywCEISPCITTeDIDSYLENFQPMIAEKSPAYNEyILKpvRGTAK-----VSINKLPTDEIPRLRgQLS 279
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1907114278  226 FDQIGL---NAKVSEIQGMLQEL---IGTAMFSKLE 255
Cdd:COG5043    280 VEEFSIslsDHMYYSLLGVLDYLqvvMKQQKFLKYR 315
PHA03247 PHA03247
large tegument protein UL36; Provisional
2312-2742 8.17e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 8.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2312 SPLTPKQPQAVEPAKAKL-----PPLTPSQAQPLQKQLAPELTQTLLFTITLQKAQHLGVTFTYEQTQAAAVTLTSEQVA 2386
Cdd:PHA03247  2678 SPPQRPRRRAARPTVGSLtsladPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP 2757
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2387 ALEDALTENLAwrweiSVTPGMAQEAPNITTTKQLQALGITARQPAQAFPSPFTLEKPATLATSTDRLSQRWKDSYPASI 2466
Cdd:PHA03247  2758 ARPPTTAGPPA-----PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPT 2832
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2467 PLQALRPSPTQAPFTPTTSLGiGSLLDSEKPWMSPTYRQTltdrgqdvLAQPLAPETPPSLRqllapgaPPTPGPPLGPR 2546
Cdd:PHA03247  2833 SAQPTAPPPPPGPPPPSLPLG-GSVAPGGDVRRRPPSRSP--------AAKPAAPARPPVRR-------LARPAVSRSTE 2896
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2547 HFFKPRVPPTSGEVPGLVSGGSAAHEELPMSRTTPLQPPEWQGPSRLIPEQGFMPAISSIPLHPFTAEALPTPGR----- 2621
Cdd:PHA03247  2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRvavpr 2976
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2622 -------PQRSSKAKPLKPKSARGLPNVT-----LGFETSQAPFPIEKTQIPKTPDTSEQTQAlqdalgvqpfgifqpyg 2689
Cdd:PHA03247  2977 frvpqpaPSREAPASSTPPLTGHSLSRVSswassLALHEETDPPPVSLKQTLWPPDDTEDSDA----------------- 3039
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907114278 2690 tsSGIARSQSPLIDEKALSREKPGTPLPSLTTQLPQTPQISTSEKGQKPWLPP 2742
Cdd:PHA03247  3040 --DSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPP 3090
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
2575-2772 8.98e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.07  E-value: 8.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2575 PMSRTTPLQPPEWQGPSRL-IPEQGFMPAISSIPLHPFTAEALPTPGRPQRSSKAKPLKP-------------------- 2633
Cdd:PTZ00449   600 PRSAQRPTRPKSPKLPELLdIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPpkspkppfdpkfkekfyddy 679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 2634 --KSARGLPNVTlGFETSQAPFPIEKTQIPKTPDTSEQT-QALQDALGVQPFGIFQPYGTSSGIARSQS----PLIDEKA 2706
Cdd:PTZ00449   680 ldAAAKSKETKT-TVVLDESFESILKETLPETPGTPFTTpRPLPPKLPRDEEFPFEPIGDPDAEQPDDIefftPPEEERT 758
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907114278 2707 LSREKPG-TPLPSLTTQLPQTPQIsTSEKGQKPwlPPIDKPWTPTPVSSTREAKMIVSPTDQHPEDG 2772
Cdd:PTZ00449   759 FFHETPAdTPLPDILAEEFKEEDI-HAETGEPD--EAMKRPDSPSEHEDKPPGDHPSLPKKRHRLDG 822
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
1268-1505 1.80e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 43.80  E-value: 1.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1268 LGITPTPQPitltPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGIT 1347
Cdd:PRK14948   357 LGLLPSAFI----SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANA 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1348 PTPQPITLTPEQAQA-LGITPTPQPITLTPEQAQALGITPTPQPITLTP-------------EQA--QALGitptpQPIT 1411
Cdd:PRK14948   433 ANAPPSLNLEELWQQiLAKLELPSTRMLLSQQAELVSLDSNRAVIAVSPnwlgmvqsrkpllEQAfaKVLG-----RSIK 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1412 LTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPeqvqALGITPTPQPITLTPEQAQALGITPTPQPITLTPE 1491
Cdd:PRK14948   508 LNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAP----PPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEE 583
                          250
                   ....*....|....
gi 1907114278 1492 QAQALGITPTPQPI 1505
Cdd:PRK14948   584 PTPSPTKDSSPEEI 597
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1172-1582 1.96e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.75  E-value: 1.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1172 ALGITPTPQLITLTPEQAkalanTLTAEQVSLSpqqaealgitptpqPTTLTPEQAQALGITPTPQPITLTPEQAQALGI 1251
Cdd:pfam05109  395 GLGTAPKTLIITRTATNA-----TTTTHKVIFS--------------KAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVP 455
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1252 TPTPQPTTLTPEQTQALGITPTPQPITltpeqAQALGITPTPQPI-----TLTPEQTQALGITPTPQPITLTPEQAQAlg 1326
Cdd:pfam05109  456 TNLTAPASTGPTVSTADVTSPTPAGTT-----SGASPVTPSPSPRdngteSKAPDMTSPTSAVTTPTPNATSPTPAVT-- 528
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1327 iTPTPQPITLTPEQTQALGITPTPQPITLTPEQAQAlgiTPTPQPITLTPEQAQALGITPTPQPITLTPEQAQAlgitpT 1406
Cdd:pfam05109  529 -TPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVT---TPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGET-----S 599
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1407 PQPITltpeQTQALGITPTPQPITLTPEQAQAlgiTPTPQPITLTPEQVQALGITPTPQPITLTPEQAQAlgiTPTPQPI 1486
Cdd:pfam05109  600 PQANT----TNHTLGGTSSTPVVTSPPKNATS---AVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDN---STSHMPL 669
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1487 TLTPEQAQALGITptpqpiTLTPEQTQALGI-TPTPQPITLTPEQAQALGITPTpqpiTLTPELVQALGITPtpqPITLT 1565
Cdd:pfam05109  670 LTSAHPTGGENIT------QVTPASTSTHHVsTSSPAPRPGTTSQASGPGNSST----STKPGEVNVTKGTP---PKNAT 736
                          410
                   ....*....|....*..
gi 1907114278 1566 PEQAQALGITPTPQPTT 1582
Cdd:pfam05109  737 SPQAPSGQKTAVPTVTS 753
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1073-1493 2.41e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 2.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1073 GLPLIPPKPITFTreqTQALGITPTHQPITLTSEqvqalGITPTHQPITLTPEQAQALALILTTEQVKTQRInlsPDQTQ 1152
Cdd:pfam03154  179 GAASPPSPPPPGT---TQAATAGPTPSAPSVPPQ-----GSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRL---PSPHP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1153 ALGITPTPQPITFTPEQTqalgiTPTPQLITLTPEQAKALANTLTAEQVSLSPQqaealgitptPQPTTLTPEQAQalgI 1232
Cdd:pfam03154  248 PLQPMTQPPPPSQVSPQP-----LPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQ----------PFPLTPQSSQSQ---V 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1233 TPTPQPITLTPEQAQALGITPTPQPTTLTPEQTQALGITPTPQPiTLTPEQAQALGITPTPQPITLTPEQTqalGITPTP 1312
Cdd:pfam03154  310 PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMP-HIKPPPTTPIPQLPNPQSHKHPPHLS---GPSPFQ 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1313 QPITLTPEQA-QALGITPTPQPITLTPEQTQALgitptPQPITLTPEQAQALGITPTPqpiTLTPEQAQAlgitPTPQPI 1391
Cdd:pfam03154  386 MNSNLPPPPAlKPLSSLSTHHPPSAHPPPLQLM-----PQSQQLPPPPAQPPVLTQSQ---SLPPPAASH----PPTSGL 453
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1392 TLTPEQ---AQALGITPTPQPIT--LTPEQTQALGITPTPQPITLTPEQAQALGITPTpqpITLTPEQVQALGITPTPQP 1466
Cdd:pfam03154  454 HQVPSQspfPQHPFVPGGPPPITppSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVS---CPLPPVQIKEEALDEAEEP 530
                          410       420
                   ....*....|....*....|....*...
gi 1907114278 1467 ITLTPEQAqalgiTPTPQP-ITLTPEQA 1493
Cdd:pfam03154  531 ESPPPPPR-----SPSPEPtVVNTPSHA 553
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
1263-1486 2.78e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 43.41  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1263 EQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQTQ 1342
Cdd:PRK14948   367 EIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSLNLEELWQ 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1343 A-LGITPTPQPITLTPEQAQALGITPTPQPITLTP-------------EQA--QALGitptpQPITLTPEQAQALGITPT 1406
Cdd:PRK14948   447 QiLAKLELPSTRMLLSQQAELVSLDSNRAVIAVSPnwlgmvqsrkpllEQAfaKVLG-----RSIKLNLESQSGSASNTA 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1407 PQPITLTPEQTQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQVQAlgitPTPQPITLTPEQAQALGITPTPQPI 1486
Cdd:PRK14948   522 KTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPA----DSSPPPPIPEEPTPSPTKDSSPEEI 597
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
1420-1646 6.60e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 41.87  E-value: 6.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1420 LGITPTPQPitltPEQAQALGITPTPQPITLTPEQVQALGITPTPQPITLTPEQAQALGITPTPQPITLTPEQAQALGIT 1499
Cdd:PRK14948   357 LGLLPSAFI----SEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANA 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1500 PTPQPITLTPEQTQA-LGITPTPQPITLTPEQAQALGITPTPQPITLTPELV---------------QALGitptpQPIT 1563
Cdd:PRK14948   433 ANAPPSLNLEELWQQiLAKLELPSTRMLLSQQAELVSLDSNRAVIAVSPNWLgmvqsrkplleqafaKVLG-----RSIK 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114278 1564 LTPEQAQALGITPTPQPTTLSPEQAQALGITPTPQPITLTPEQAQALGITPTPQPTTLSPEQAQALGISLIPKQQEISLS 1643
Cdd:PRK14948   508 LNLESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPS 587

                   ...
gi 1907114278 1644 PEQ 1646
Cdd:PRK14948   588 PTK 590
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH