|
Name |
Accession |
Description |
Interval |
E-value |
| PAN_AP_HGF |
cd01099 |
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen ... |
468-560 |
2.11e-14 |
|
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Pssm-ID: 238532 Cd Length: 80 Bit Score: 69.42 E-value: 2.11e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 468 KACFRRVLAGKRIAPHFVRRSISCERVEECMRECGRERRFMCEGFNYRldpsgHGQGDCELVEMPLAQMDlysSPDRRDA 547
Cdd:cd01099 1 LNDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYN-----YKSKECILSDEDRMSSG---VKLLYDS 72
|
90
....*....|...
gi 665399559 548 NllrhpdYDYYER 560
Cdd:cd01099 73 N------VDYYEN 79
|
|
| PAN_AP_HGF |
cd01099 |
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen ... |
984-1070 |
8.01e-14 |
|
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Pssm-ID: 238532 Cd Length: 80 Bit Score: 67.88 E-value: 8.01e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 984 QCFFRAIDATRFF-KSIVRDSLTVRSVGECEMECIRSTKFTCRAFAFRYGQQrhagvidNCQLSDWPVRDMDKERhlILD 1062
Cdd:cd01099 1 LNDFKFVLVLNKIlVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYKSK-------ECILSDEDRMSSGVKL--LYD 71
|
....*...
gi 665399559 1063 AAFDIFER 1070
Cdd:cd01099 72 SNVDYYEN 79
|
|
| PAN_AP_HGF |
cd01099 |
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen ... |
870-953 |
2.67e-09 |
|
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Pssm-ID: 238532 Cd Length: 80 Bit Score: 55.17 E-value: 2.67e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 870 ECSAKTSEGFRLHKTAVRHAYNVPTLTECERLCSDPRpSFVCHTFSYRYNqagRDNCMLCDRPINMLDyyVDIEPDRDYD 949
Cdd:cd01099 2 NDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEET-EFTCRSFNYNYK---SKECILSDEDRMSSG--VKLLYDSNVD 75
|
....
gi 665399559 950 IYSM 953
Cdd:cd01099 76 YYEN 79
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
224-416 |
3.36e-09 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.88 E-value: 3.36e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 224 PSPNEEAYIPGGYAANAPEVDERYRPHYGPIEPAEDEGIRPTPHSPPRPIFgLGYGNGYSYGSPERYTPPTGRPSSGSSE 303
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS-LPLGGSVAPGGDVRRRPPSRSPAAKPAA 2877
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 304 ERPSYAARPPRPTADRPEYPPGPPRPTADRPEYPPRPYEGSTPPSYGPRPSPsydpdrEPRPDYTNRPDPPsLSPQVG-Y 382
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP------QPPPPPPPRPQPP-LAPTTDpA 2950
|
170 180 190
....*....|....*....|....*....|....
gi 665399559 383 GMNGPSARPPPTSYDGPPPPRDEYVRRNGSAMRP 416
Cdd:PHA03247 2951 GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP 2984
|
|
| PAN_AP |
smart00473 |
divergent subfamily of APPLE domains; Apple-like domains present in Plasminogen, C. elegans ... |
44-104 |
8.88e-04 |
|
divergent subfamily of APPLE domains; Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Pssm-ID: 214680 Cd Length: 78 Bit Score: 39.48 E-value: 8.88e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 665399559 44 DCFERIAlEAMLPFEKTFRTeDTNSLKMCKEKCLQAGEKCQAISFgvhRRGNGTCQLSSEQ 104
Cdd:smart00473 4 DCFVRLP-NTKLPGFSRIVI-SVASLEECASKCLNSNCSCRSFTY---NNGTKGCLLWSES 59
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PAN_AP_HGF |
cd01099 |
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen ... |
468-560 |
2.11e-14 |
|
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Pssm-ID: 238532 Cd Length: 80 Bit Score: 69.42 E-value: 2.11e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 468 KACFRRVLAGKRIAPHFVRRSISCERVEECMRECGRERRFMCEGFNYRldpsgHGQGDCELVEMPLAQMDlysSPDRRDA 547
Cdd:cd01099 1 LNDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEETEFTCRSFNYN-----YKSKECILSDEDRMSSG---VKLLYDS 72
|
90
....*....|...
gi 665399559 548 NllrhpdYDYYER 560
Cdd:cd01099 73 N------VDYYEN 79
|
|
| PAN_AP_HGF |
cd01099 |
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen ... |
984-1070 |
8.01e-14 |
|
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Pssm-ID: 238532 Cd Length: 80 Bit Score: 67.88 E-value: 8.01e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 984 QCFFRAIDATRFF-KSIVRDSLTVRSVGECEMECIRSTKFTCRAFAFRYGQQrhagvidNCQLSDWPVRDMDKERhlILD 1062
Cdd:cd01099 1 LNDFKFVLVLNKIlVSEVKTEITVASLEECLRKCLEETEFTCRSFNYNYKSK-------ECILSDEDRMSSGVKL--LYD 71
|
....*...
gi 665399559 1063 AAFDIFER 1070
Cdd:cd01099 72 SNVDYYEN 79
|
|
| PAN_AP_HGF |
cd01099 |
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen ... |
870-953 |
2.67e-09 |
|
Subfamily of PAN/APPLE-like domains; present in N-terminal (N) domains of plasminogen/hepatocyte growth factor proteins, and various proteins found in Bilateria, such as leech anti-platelet proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
Pssm-ID: 238532 Cd Length: 80 Bit Score: 55.17 E-value: 2.67e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 870 ECSAKTSEGFRLHKTAVRHAYNVPTLTECERLCSDPRpSFVCHTFSYRYNqagRDNCMLCDRPINMLDyyVDIEPDRDYD 949
Cdd:cd01099 2 NDFKFVLVLNKILVSEVKTEITVASLEECLRKCLEET-EFTCRSFNYNYK---SKECILSDEDRMSSG--VKLLYDSNVD 75
|
....
gi 665399559 950 IYSM 953
Cdd:cd01099 76 YYEN 79
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
224-416 |
3.36e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.88 E-value: 3.36e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 224 PSPNEEAYIPGGYAANAPEVDERYRPHYGPIEPAEDEGIRPTPHSPPRPIFgLGYGNGYSYGSPERYTPPTGRPSSGSSE 303
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS-LPLGGSVAPGGDVRRRPPSRSPAAKPAA 2877
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 304 ERPSYAARPPRPTADRPEYPPGPPRPTADRPEYPPRPYEGSTPPSYGPRPSPsydpdrEPRPDYTNRPDPPsLSPQVG-Y 382
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP------QPPPPPPPRPQPP-LAPTTDpA 2950
|
170 180 190
....*....|....*....|....*....|....
gi 665399559 383 GMNGPSARPPPTSYDGPPPPRDEYVRRNGSAMRP 416
Cdd:PHA03247 2951 GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP 2984
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
224-404 |
1.44e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.40 E-value: 1.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 224 PSPNEEAYIPGGYAANAPEVDERYRPHYGPIEPAEDEGIRPTPH--SPPRPIFGLGYGNGYSyGSPERYTPPTGRPSSGS 301
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGrvSRPRRARRLGRAAQAS-SPPQRPRRRAARPTVGS 2694
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 302 SEErpsyAARPPRPTADrpeyppgpprptadrPEYPPRPYEGSTPPSygprPSPSYDPDREPRPDYTNRPDPPSLSPQVG 381
Cdd:PHA03247 2695 LTS----LADPPPPPPT---------------PEPAPHALVSATPLP----PGPAAARQASPALPAAPAPPAVPAGPATP 2751
|
170 180
....*....|....*....|...
gi 665399559 382 YGMNGPSARPPPTSYDGPPPPRD 404
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAA 2774
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
224-393 |
6.47e-06 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 50.59 E-value: 6.47e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 224 PSPNEEAY--IPGGYAANAPEVDERY--RPHYGPIEPAEDEGIRPTPHSPPRPIFG-----LGYGNGYSYGSPERYTPPT 294
Cdd:PRK14086 111 PRPGRRPYegYGGPRADDRPPGLPRQdqLPTARPAYPAYQQRPEPGAWPRAADDYGwqqqrLGFPPRAPYASPASYAPEQ 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 295 GRPSSGSSEERPSYAARPPRPTADRPEYPPGPPRPTaDRPEypPRPYEGSTPPSYGPRPSPsydPDREPRPDYTNRPDPP 374
Cdd:PRK14086 191 ERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRT-DRPE--PPPGAGHVHRGGPGPPER---DDAPVVPIRPSAPGPL 264
|
170
....*....|....*....
gi 665399559 375 SLSPQVGYGMNGPSARPPP 393
Cdd:PRK14086 265 AAQPAPAPGPGEPTARLNP 283
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
224-414 |
1.50e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.60 E-value: 1.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 224 PSPNEEAYIPGgyAANAPEVDERYRPHYGPIEPAEDEGIR-----PTPHSPPRPIFGLGYGNGYSYGSPERYTPPTGRPS 298
Cdd:PRK07764 606 SGPPEEAARPA--APAAPAAPAAPAPAGAAAAPAEASAAPapgvaAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 299 SGSSEERPSYAARPPRPTADRPEYPPGPPRPTADRPEYPPRPYEGSTPPSYGPRPSPS-YDPDREPRPDYTNRPDPPSLS 377
Cdd:PRK07764 684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLpPEPDDPPDPAGAPAQPPPPPA 763
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 665399559 378 PQVGYGMNGPSARPPPT----SYDGPPPPRDEYVRRNGSAM 414
Cdd:PRK07764 764 PAPAAAPAAAPPPSPPSeeeeMAEDDAPSMDDEDRRDAEEV 804
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
286-416 |
2.60e-05 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 48.67 E-value: 2.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 286 SPERYTPPTGRPSSGSSEERPSYAARP----PRPTADRPEYPPGPPRPTAD-RPEYPPRpYEGSTPPSYgPRPSPSYDPD 360
Cdd:PRK14086 91 SAGEPAPPPPHARRTSEPELPRPGRRPyegyGGPRADDRPPGLPRQDQLPTaRPAYPAY-QQRPEPGAW-PRAADDYGWQ 168
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 361 rEPRPDYTNRPDPPSlspqvgygmngPSARPPPTSYDGPPP----PRDEYVRRNGSAMRP 416
Cdd:PRK14086 169 -QQRLGFPPRAPYAS-----------PASYAPEQERDREPYdagrPEYDQRRRDYDHPRP 216
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
215-401 |
2.89e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.44 E-value: 2.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 215 YPTLYESNYPS-PNEEAYIPGGYAANAPEVDERYRP-HYGPIEPAE-DEGIRPTPHSPPRPIFGLGYGNGYSYG------ 285
Cdd:PHA03378 597 WPVPHPSQTPEpPTTQSHIPETSAPRQWPMPLRPIPmRPLRMQPITfNVLVFPTPHQPPQVEITPYKPTWTQIGhipyqp 676
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 286 -------------SPERYTPPTGRPSSGSSEERPSYAARPPRPTADRPEYPPGPPRPtADRPEYPPRPyegSTPPSYGPR 352
Cdd:PHA03378 677 sptgantmlpiqwAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGR-ARPPAAAPGR---ARPPAAAPG 752
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 665399559 353 PS--PSYDPDREPRPDYTNRPDPPSLSPQVG-YGMNGPSARPPPTsydgPPP 401
Cdd:PHA03378 753 RArpPAAAPGRARPPAAAPGAPTPQPPPQAPpAPQQRPRGAPTPQ----PPP 800
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
253-403 |
3.91e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.31 E-value: 3.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 253 PIEPAEDEGIrPTPHSPPRPIFGLGYGNGYSYGSP-ERYTPPTGRPSSGSSEERPSYAARPPRPTADRPEYppgpprpTA 331
Cdd:PHA03247 2559 APPAAPDRSV-PPPRPAPRPSEPAVTSRARRPDAPpQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPP-------PS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 332 DRPE--YPPRPYEGSTPPSYGPRPSPS----------YDPDREPRPDY-TNRPDPPSLSPQVGYGMNgpSARPPPTSYDG 398
Cdd:PHA03247 2631 PSPAanEPDPHPPPTVPPPERPRDDPApgrvsrprraRRLGRAAQASSpPQRPRRRAARPTVGSLTS--LADPPPPPPTP 2708
|
....*
gi 665399559 399 PPPPR 403
Cdd:PHA03247 2709 EPAPH 2713
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
305-449 |
4.60e-04 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 44.43 E-value: 4.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 305 RPSYAARPPRPTADRPEYPPGPPRPTA--DRPEYPPRPYEGSTPPSYGPRP--SPSYDPDREPR---PDYTNRPDPPSL- 376
Cdd:PRK14086 80 RPIRIAITVDPSAGEPAPPPPHARRTSepELPRPGRRPYEGYGGPRADDRPpgLPRQDQLPTARpayPAYQQRPEPGAWp 159
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 665399559 377 SPQVGYGMNGPSARPPPTSYDGPP---PPRDEYVRRNGSAMRPGDGYGGYDDDKTIATYFNPDdymQGNNNRPMPS 449
Cdd:PRK14086 160 RAADDYGWQQQRLGFPPRAPYASPasyAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPR---RDRTDRPEPP 232
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
248-412 |
6.98e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 6.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 248 RPHYGPIEPAEdegiRPTPHSPPRPI-FGLGYGNGYSYGSPERYTPP-----TGRPSSGSSEERPSYAARPPRPTADRPE 321
Cdd:PHA03247 2700 DPPPPPPTPEP----APHALVSATPLpPGPAAARQASPALPAAPAPPavpagPATPGGPARPARPPTTAGPPAPAPPAAP 2775
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 322 YppgpprptADRPEYPPRPYEGSTPPSYGPRPSPSYDPDREPRPDYTNRPDPPSLSPQvgyGMNGPSARPPPTSYDGPPP 401
Cdd:PHA03247 2776 A--------AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA---GPLPPPTSAQPTAPPPPPG 2844
|
170
....*....|.
gi 665399559 402 PRDEYVRRNGS 412
Cdd:PHA03247 2845 PPPPSLPLGGS 2855
|
|
| PAN_AP |
smart00473 |
divergent subfamily of APPLE domains; Apple-like domains present in Plasminogen, C. elegans ... |
44-104 |
8.88e-04 |
|
divergent subfamily of APPLE domains; Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
Pssm-ID: 214680 Cd Length: 78 Bit Score: 39.48 E-value: 8.88e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 665399559 44 DCFERIAlEAMLPFEKTFRTeDTNSLKMCKEKCLQAGEKCQAISFgvhRRGNGTCQLSSEQ 104
Cdd:smart00473 4 DCFVRLP-NTKLPGFSRIVI-SVASLEECASKCLNSNCSCRSFTY---NNGTKGCLLWSES 59
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
140-412 |
3.48e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 3.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 140 AAGPQPVPGPSPISPGPTPLDLPNRPLDVGNTPVLVVNPTPhtsgSPPPPLGPSLPPPGvgdrlyfshdlyPLYKYPTLy 219
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA----ALPPAASPAGPLPP------------PTSAQPTA- 2838
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 220 ESNYPSPNEEAYIPGGYAanAPEVDERYRPHYGPIEPAEDEGIRPTPHSPPRPifglgygngysygSPERYTPPTGRPSS 299
Cdd:PHA03247 2839 PPPPPGPPPPSLPLGGSV--APGGDVRRRPPSRSPAAKPAAPARPPVRRLARP-------------AVSRSTESFALPPD 2903
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 300 GSSEERPSYAARPPRPTADrpeyppgpprptadrPEYPPRPyegSTPPSYGPRPSPSYDPDREPRPdyTNRPDPPSLSPQ 379
Cdd:PHA03247 2904 QPERPPQPQAPPPPQPQPQ---------------PPPPPQP---QPPPPPPPRPQPPLAPTTDPAG--AGEPSGAVPQPW 2963
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 665399559 380 VGYGMNG---------PSARPP-PTSYDGPPPPRDEYVRRNGS 412
Cdd:PHA03247 2964 LGALVPGrvavprfrvPQPAPSrEAPASSTPPLTGHSLSRVSS 3006
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
256-409 |
3.68e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.90 E-value: 3.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 256 PAEDEGIRPTPHSPPRPIFGLGYGNGYSYGSPeryTPPTGRPSSGSSEERPsyaARPPRPTADRPEYPPGPPRPTADRPE 335
Cdd:PRK07764 365 PSASDDERGLLARLERLERRLGVAGGAGAPAA---AAPSAAAAAPAAAPAP---AAAAPAAAAAPAPAAAPQPAPAPAPA 438
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 665399559 336 YPPRPYEGSTPPSYGPRPSPSYDPDREPRPDYTNRPDPPSLSPQVGYGMNGPSARPPPTSYDGPPPP--RDEYVRR 409
Cdd:PRK07764 439 PAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGadDAATLRE 514
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
135-402 |
4.19e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 41.68 E-value: 4.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 135 SDNSIAAGPQPVPGPSPISPGPTPLDLPNRPLDVGNTPVLVVNPTPHTSGSPPPPLGPSLPPPGVGDRLYFSHDLypLYK 214
Cdd:pfam03154 156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTL--IQQ 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 215 YPTLYESNYPSPNEeAYIPGGYAANAPEVDERYRP---HYGPIEPAE---DEGIRPTPHSPPRPIFGLGYGNGYSYGSPE 288
Cdd:pfam03154 234 TPTLHPQRLPSPHP-PLQPMTQPPPPSQVSPQPLPqpsLHGQMPPMPhslQTGPSHMQHPVPPQPFPLTPQSSQSQVPPG 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 289 RYTPPTGRPSSGSSEERPSYAARPPRPTADRPEYPPGPPRptadrPEYPPRPyegSTPPSYGPRPS----PSYDPDREPR 364
Cdd:pfam03154 313 PSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSM-----PHIKPPP---TTPIPQLPNPQshkhPPHLSGPSPF 384
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 665399559 365 PDYTNRPDPPSLSPQVGYGMNG-PSARPPP-----TSYDGPPPP 402
Cdd:pfam03154 385 QMNSNLPPPPALKPLSSLSTHHpPSAHPPPlqlmpQSQQLPPPP 428
|
|
| Pro-rich |
pfam15240 |
Proline-rich protein; This family includes several eukaryotic proline-rich proteins. |
296-402 |
4.31e-03 |
|
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
Pssm-ID: 464580 [Multi-domain] Cd Length: 167 Bit Score: 39.64 E-value: 4.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 296 RPSSGSSEERPSYAARPPRPTADRPEYPPGPPRPTADRPEYPPRPYEGSTPPSYGPRPSPSyDPDREPRPDYTNRPDPPS 375
Cdd:pfam15240 39 SQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPPPQGGPRPPPG-KPQGPPPQGGNQQQGPPP 117
|
90 100
....*....|....*....|....*..
gi 665399559 376 LSPQVGYGMNGPSARPPPTSYDGPPPP 402
Cdd:pfam15240 118 PGKPQGPPPQGGGPPPQGGNQQGPPPP 144
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
141-402 |
5.39e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 5.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 141 AGPQPVPGPSPISPGPTPLDLPNRPldvgntpvlvVNPTPHTSGSPPPPLGPSLPPPGVGDRlyfSHDLYPLYKYPTLYE 220
Cdd:PHA03247 2500 GGGPPDPDAPPAPSRLAPAILPDEP----------VGEPVHPRMLTWIRGLEELASDDAGDP---PPPLPPAAPPAAPDR 2566
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 221 SNYPS-----PNEEAYIPGGYAANAPEVDERYR----PHYGPIEPAEDEGIRPTPHSPPRPIFGlgygngysyGSPERYT 291
Cdd:PHA03247 2567 SVPPPrpaprPSEPAVTSRARRPDAPPQSARPRapvdDRGDPRGPAPPSPLPPDTHAPDPPPPS---------PSPAANE 2637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 292 PPTGRPSSGSSEERPSYAARPPRPTADRPEYPPGPPRPTADRPEYPPRPyegSTPPSYGPRPSPSYDPDREPRPDYTNRP 371
Cdd:PHA03247 2638 PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGSLTSLADPPPPPPTPEPAPHA 2714
|
250 260 270
....*....|....*....|....*....|.
gi 665399559 372 DPPSLSPQVGYGMNGPSARPPPTSYDGPPPP 402
Cdd:PHA03247 2715 LVSATPLPPGPAAARQASPALPAAPAPPAVP 2745
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
225-431 |
6.07e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 41.31 E-value: 6.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 225 SPNEEAYIPGGYAANAPEVDERYRPHYGPIE--PAEDEGIRPTPHSPPRPIFGLGYGNGYSYGSP-------ERYTPPTG 295
Cdd:PHA03307 179 PEETARAPSSPPAEPPPSTPPAAASPRPPRRssPISASASSPAPAPGRSAADDAGASSSDSSSSEssgcgwgPENECPLP 258
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 296 RPSSGSSEERPSYAARPPRPTADrpeYPPgPPRPTADRPEYPP--RPYEGSTPPSYGPRPSPSYDPDREPRPDYTNRPDP 373
Cdd:PHA03307 259 RPAPITLPTRIWEASGWNGPSSR---PGP-ASSSSSPRERSPSpsPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE 334
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 665399559 374 PSLSPQVGYgmnGPSARPPPTSYDGPPPPRDEYVRRNGSAMRPGDGYGGYDDDKTIAT 431
Cdd:PHA03307 335 SSRGAAVSP---GPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRR 389
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
240-404 |
8.30e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 40.92 E-value: 8.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 240 APEVDERYRPHYGPIEPAEDEGIrPTPHSPPRPIFGLGYGNGYSYGSPEryTPPTGRPSSGSSEERPSYAARPPRPTADR 319
Cdd:PHA03307 131 APDLSEMLRPVGSPGPPPAASPP-AAGASPAAVASDAASSRQAALPLSS--PEETARAPSSPPAEPPPSTPPAAASPRPP 207
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 665399559 320 PEYPPGPPRPTADRPEYPPRPYEGSTPPSYGPRPSPSYDPDREPRPDYTNRPDPPSLSPQV---GYGMNGPSARPPPTSY 396
Cdd:PHA03307 208 RRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRiweASGWNGPSSRPGPASS 287
|
....*...
gi 665399559 397 DGPPPPRD 404
Cdd:PHA03307 288 SSSPRERS 295
|
|
| FAP |
pfam07174 |
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ... |
331-402 |
9.74e-03 |
|
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
Pssm-ID: 429334 Cd Length: 301 Bit Score: 39.91 E-value: 9.74e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 665399559 331 ADRPEYPPRPYEGSTPPSYGPRPSPSYDPDREPRPDYTNRPDPPslsPQVGyGMNGPSarPPPTSYDGPPPP 402
Cdd:pfam07174 39 ADPEPAPPPPSTATAPPAPPPPPPAPAAPAPPPPPAAPNAPNAP---PPPA-DPNAPP--PPPADPNAPPPP 104
|
|
|