|
Name |
Accession |
Description |
Interval |
E-value |
| DUF4795 |
pfam16043 |
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ... |
709-899 |
1.69e-53 |
|
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.
Pssm-ID: 464990 [Multi-domain] Cd Length: 181 Bit Score: 184.81 E-value: 1.69e-53
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 709 TTVDILQKKIGSLQksrlkeEELERIWGNQIEMMKDryitldkavenLQIRMDEFKTLQAQIKRLEMNKVNKSTMEEELR 788
Cdd:pfam16043 7 ELLDQLQALILDLQ------EELEKLSETTSELSER-----------LQQRQKHLEALYQQIEKLEKVKADKEVVEEELD 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 789 EKADRSALAGKASRVDLETVALELNEMIQGILFKVTIHEDSWKKAMEELSKDVNTKLVHSDLDPLKKEMEEVWKIVRKLL 868
Cdd:pfam16043 70 EKADKEALASKVSRDQFDETLEELNQMLQELLDKLEGQEDAWKKALETLSEELDTKLDRLELDPLKELLERRIKALQKLL 149
|
170 180 190
....*....|....*....|....*....|..
gi 530408016 869 IEGLRLDPD-SAAGFRRKLFKRVKCISCDRPV 899
Cdd:pfam16043 150 QEGSEELDEaEAAGFRKKLLERFHCISCDRPV 181
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
286-436 |
6.91e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.34 E-value: 6.91e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 286 PELLPEGSSAQAVSLSRaQEPAQPPALTP--ESAPGCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVTPGSL 363
Cdd:PHA03247 2848 PSLPLGGSVAPGGDVRR-RPPSRSPAAKPaaPARPPVRRLARPAVSRSTESFA--------LPPDQPERPPQPQAPPPPQ 2918
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 364 PAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRvGSWPLWDLGVLRP----------TQPQPSR---APPPATEF 430
Cdd:PHA03247 2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS-GAVPQPWLGALVPgrvavprfrvPQPAPSReapASSTPPLT 2997
|
....*.
gi 530408016 431 GSLWPR 436
Cdd:PHA03247 2998 GHSLSR 3003
|
|
| MISS |
pfam15822 |
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ... |
283-427 |
3.53e-07 |
|
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII and that specifically regulates MII spindle integrity during the CSF arrest.
Pssm-ID: 318115 [Multi-domain] Cd Length: 238 Bit Score: 52.68 E-value: 3.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 283 YEVPELLPEGSSAQ--AVSLSRAQEPAQ-----PPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLeLEPVPALGPVPG 355
Cdd:pfam15822 1 FSLADALPEQSPAKtsAVSNPKPGQPPQgwpgsNPWNNPSAPPAVPSGLPPSTAPSTVPFGPAPTGM-YPSIPLTGPSPG 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 356 P---------SVTPGSLPAPWPVL-GPVPaPGAQPPPLGDWPALPRRW--------PLPQG-WPRVGSWPlWDLGV---- 412
Cdd:pfam15822 80 PpapfppsgpSCPPPGGPYPAPTVpGPGP-IGPYPTPNMPFPELPRPYgaptdpaaAAPSGpWGSMSSGP-WAPGMggqy 157
|
170
....*....|....*
gi 530408016 413 LRPTQPQPSRAPPPA 427
Cdd:pfam15822 158 PAPNMPYPSPGPYPA 172
|
|
| gly_rich_SclB |
NF038329 |
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ... |
326-538 |
1.06e-04 |
|
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.
Pssm-ID: 468478 [Multi-domain] Cd Length: 440 Bit Score: 46.05 E-value: 1.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 326 PGPA----PGTEPVPGLELGLELEPVPALGPVP----GPSVTPGSLPAPWPV--LGPVPAPGAQPPPLGDWPALPRRWPL 395
Cdd:NF038329 122 PGPAgpagPAGEQGPRGDRGETGPAGPAGPPGPqgerGEKGPAGPQGEAGPQgpAGKDGEAGAKGPAGEKGPQGPRGETG 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 396 PQGwPRVGSWPLWDLGVLRPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGEALQLAAVQVKGE--ENDVPSLRGLRERA 473
Cdd:NF038329 202 PAG-EQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDrgEAGPDGPDGKDGER 280
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530408016 474 RKDGAP-KDRTR-KDGVPkdrgGKDvDPKDRAHKDDVPKDRGgKDGDP-KD----RVGKDGAPKEAQPKAPQ 538
Cdd:NF038329 281 GPVGPAgKDGQNgKDGLP----GKD-GKDGQNGKDGLPGKDG-KDGQPgKDglpgKDGKDGQPGKPAPKTPE 346
|
|
| penta_MxKDx |
TIGR02953 |
pentapeptide MXKDX repeat protein; Members of this protein family are small bacterial proteins, ... |
481-532 |
3.32e-04 |
|
pentapeptide MXKDX repeat protein; Members of this protein family are small bacterial proteins, each with an N-terminal signal sequence followed by up to 11 imperfect repeats of a pentapeptide. The pentapeptide repeat usually follows the form Met-Xaa-Lys-Asp-Xaa.
Pssm-ID: 131998 [Multi-domain] Cd Length: 75 Bit Score: 40.21 E-value: 3.32e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 530408016 481 DRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDGDPKDRVGKDGAPKEA 532
Cdd:TIGR02953 23 DAMKKDTMKKDAMGKDAMAKDAMSKDAMKKDAMKKDAMKKDGMKKDAMKKDA 74
|
|
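The Met-Xaa-Lys-Asp-Xaa form described for TIGR02953 can be checked against the two aligned segments with a short sketch. This is only an illustration, assuming a strict regular expression `M.KD.` stands in for the repeat; the helper name `count_mxkdx` is hypothetical and is not part of any NCBI tool.

```python
import re

# Pentapeptide form taken from the family description: Met-Xaa-Lys-Asp-Xaa.
# "." stands for Xaa (any residue). This strict pattern is an assumption made
# for illustration; the family description says the repeats are imperfect.
MXKDX = re.compile(r"M.KD.")


def count_mxkdx(seq: str) -> int:
    """Count non-overlapping M-x-K-D-x pentapeptides, ignoring alignment gaps."""
    return len(MXKDX.findall(seq.replace("-", "").upper()))


# Aligned segments copied from the TIGR02953 alignment above.
cdd_segment = "DAMKKDTMKKDAMGKDAMAKDAMSKDAMKKDAMKKDAMKKDGMKKDAMKKDA"
query_segment = "DRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDGDPKDRVGKDGAPKEA"

print("model segment, strict repeats:", count_mxkdx(cdd_segment))
print("query segment, strict repeats:", count_mxkdx(query_segment))
print("query segment, Lys-Asp pairs:", query_segment.count("KD"))
```

On these segments the strict pattern matches the model repeatedly but never the query, which carries the Lys-Asp pair without the leading Met. That is consistent with a weak, repeat-driven hit, though the sketch says nothing about how CD-Search itself scores the alignment.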
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
245-541 |
4.61e-04 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 44.28 E-value: 4.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 245 EQLPEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYEVPELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGctTEF 324
Cdd:COG5180 212 EEPPDLTGGADHPRPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAG--SEP 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 325 APGPAPGTEPVPGLELGLELEPvPALGPV-PGPSVTPGSLPAPwpvLGPVPAPGAQPPPLGDWPALPRRWPLPQGwprvg 403
Cdd:COG5180 290 QSDAPEAETARPIDVKGVASAP-PATRPVrPPGGARDPGTPRP---GQPTERPAGVPEAASDAGQPPSAYPPAEE----- 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 404 swplwdlgvLRPTQPQPSRAPPPATEFGSlwPRPLQPY----QSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGAP 479
Cdd:COG5180 361 ---------AVPGKPLEQGAPRPGSSGGD--GAPFQPPngapQPGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAA 429
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530408016 480 KDRTRkdgVPKDRGGKDVdpkdrAHKDDVPKDRGGKDGDPKdrvgkdgAPKEAQPKAPQSAL 541
Cdd:COG5180 430 GGAGQ---GPKADFVPGD-----AESVSGPAGLADQAGAAA-------STAMADFVAPVTDA 476
|
|
| RRM_RBM27 |
cd12517 |
RNA recognition motif (RRM) found in vertebrate RNA-binding protein 27 (RBM27); This subgroup ... |
248-284 |
5.66e-04 |
|
RNA recognition motif (RRM) found in vertebrate RNA-binding protein 27 (RBM27); This subgroup corresponds to the RRM of RBM27 which contains a single RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain). Although the specific function of the RRM in RBM27 remains unclear, it shows high sequence similarity with RRM1 of RBM26, which functions as a cutaneous lymphoma (CL)-associated antigen.
Pssm-ID: 409939 [Multi-domain] Cd Length: 76 Bit Score: 39.65 E-value: 5.66e-04
10 20 30
....*....|....*....|....*....|....*..
gi 530408016 248 PEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYE 284
Cdd:cd12517 40 PEAALIQYTTNEEARRAISSTEAVLNNRFIRVLWHRE 76
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
715-867 |
4.14e-03 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 41.43 E-value: 4.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 715 QKKIGSLQKSRLKEEELERIWGNQIEMMKDRYITLDKAVENLQIRMDEFKTLQAQIKRLEMN-------------KVNK- 780
Cdd:PRK01156 196 NLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEiktaesdlsmeleKNNYy 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 781 STMEEELREKADRSALAGKASRVDLETVA---LELNEMIQGILFKVTIHEDSWKKAmEELSKDvntklvHSDLDPLKKEM 857
Cdd:PRK01156 276 KELEERHMKIINDPVYKNRNYINDYFKYKndiENKKQILSNIDAEINKYHAIIKKL-SVLQKD------YNDYIKKKSRY 348
|
170
....*....|
gi 530408016 858 EEVWKIVRKL 867
Cdd:PRK01156 349 DDLNNQILEL 358
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| DUF4795 |
pfam16043 |
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ... |
709-899 |
1.69e-53 |
|
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.
Pssm-ID: 464990 [Multi-domain] Cd Length: 181 Bit Score: 184.81 E-value: 1.69e-53
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 709 TTVDILQKKIGSLQksrlkeEELERIWGNQIEMMKDryitldkavenLQIRMDEFKTLQAQIKRLEMNKVNKSTMEEELR 788
Cdd:pfam16043 7 ELLDQLQALILDLQ------EELEKLSETTSELSER-----------LQQRQKHLEALYQQIEKLEKVKADKEVVEEELD 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 789 EKADRSALAGKASRVDLETVALELNEMIQGILFKVTIHEDSWKKAMEELSKDVNTKLVHSDLDPLKKEMEEVWKIVRKLL 868
Cdd:pfam16043 70 EKADKEALASKVSRDQFDETLEELNQMLQELLDKLEGQEDAWKKALETLSEELDTKLDRLELDPLKELLERRIKALQKLL 149
|
170 180 190
....*....|....*....|....*....|..
gi 530408016 869 IEGLRLDPD-SAAGFRRKLFKRVKCISCDRPV 899
Cdd:pfam16043 150 QEGSEELDEaEAAGFRKKLLERFHCISCDRPV 181
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
286-436 |
6.91e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 60.34 E-value: 6.91e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 286 PELLPEGSSAQAVSLSRaQEPAQPPALTP--ESAPGCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVTPGSL 363
Cdd:PHA03247 2848 PSLPLGGSVAPGGDVRR-RPPSRSPAAKPaaPARPPVRRLARPAVSRSTESFA--------LPPDQPERPPQPQAPPPPQ 2918
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 364 PAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRvGSWPLWDLGVLRP----------TQPQPSR---APPPATEF 430
Cdd:PHA03247 2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS-GAVPQPWLGALVPgrvavprfrvPQPAPSReapASSTPPLT 2997
|
....*.
gi 530408016 431 GSLWPR 436
Cdd:PHA03247 2998 GHSLSR 3003
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
290-444 |
6.45e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.26 E-value: 6.45e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 290 PEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPV 369
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP 2845
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 370 LGPVPAPGAQPPPLGDW------------PALP-----RRWPLPQGWPRVGSWPLWDLGVLRPTQPQ-PSRAPPPATEFG 431
Cdd:PHA03247 2846 PPPSLPLGGSVAPGGDVrrrppsrspaakPAAParppvRRLARPAVSRSTESFALPPDQPERPPQPQaPPPPQPQPQPPP 2925
|
170
....*....|...
gi 530408016 432 SLWPRPLQPYQSR 444
Cdd:PHA03247 2926 PPQPQPPPPPPPR 2938
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
293-467 |
1.28e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 56.15 E-value: 1.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 293 SSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPgslPAPWPVLGP 372
Cdd:PRK07764 617 APAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA---PAPAAPAAP 693
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 373 VPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWdlgvlrPTQPQPSRAPPPATE-----FGSLWPRPLQPYQSRQGE 447
Cdd:PRK07764 694 AGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGAS------APSPAADDPVPLPPEpddppDPAGAPAQPPPPPAPAPA 767
|
170 180
....*....|....*....|
gi 530408016 448 ALQLAAVQVKGEENDVPSLR 467
Cdd:PRK07764 768 AAPAAAPPPSPPSEEEEMAE 787
|
|
| PHA03201 |
PHA03201 |
uracil DNA glycosylase; Provisional |
301-399 |
2.54e-07 |
|
uracil DNA glycosylase; Provisional
Pssm-ID: 165468 Cd Length: 318 Bit Score: 53.74 E-value: 2.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 301 SRAQEPAQ---PPALTPESAPGCTTEFAPG--PAPGTEPVPGLELGlelEPVPALGPVPGPS-VTPGSLPAPWPVLGPVP 374
Cdd:PHA03201 6 SRSPSPPRrpsPPRPTPPRSPDASPEETPPspPGPGAEPPPGRAAG---PAAPRRRPRGCPAgVTFSSSAPPRPPLGLDD 82
|
90 100
....*....|....*....|....*
gi 530408016 375 APGAQPPPLgDWPALPRRWPLPQGW 399
Cdd:PHA03201 83 APAATPPPL-DWTEFRRRFLVGDAW 106
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
285-487 |
2.76e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 54.88 E-value: 2.76e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 285 VPELLPEGSSAQAVSLSRAQ---EPAQPPALTPESAPGCTTEFAPGPAPgtEPVPGLELGLELEPVPALGPVPGPSVTPG 361
Cdd:PRK12323 382 VAQPAPAAAAPAAAAPAPAAppaAPAAAPAAAAAARAVAAAPARRSPAP--EALAAARQASARGPGGAPAPAPAPAAAPA 459
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 362 SLPAPwpvlgpvPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGVLRPTQPQPSR-APPPATEFGSLWPRPLQP 440
Cdd:PRK12323 460 AAARP-------AAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQpDAAPAGWVAESIPDPATA 532
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 530408016 441 YQSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGAPKDRTRKDG 487
Cdd:PRK12323 533 DPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDG 579
|
|
| MISS |
pfam15822 |
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ... |
283-427 |
3.53e-07 |
|
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII and that specifically regulates MII spindle integrity during the CSF arrest.
Pssm-ID: 318115 [Multi-domain] Cd Length: 238 Bit Score: 52.68 E-value: 3.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 283 YEVPELLPEGSSAQ--AVSLSRAQEPAQ-----PPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLeLEPVPALGPVPG 355
Cdd:pfam15822 1 FSLADALPEQSPAKtsAVSNPKPGQPPQgwpgsNPWNNPSAPPAVPSGLPPSTAPSTVPFGPAPTGM-YPSIPLTGPSPG 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 356 P---------SVTPGSLPAPWPVL-GPVPaPGAQPPPLGDWPALPRRW--------PLPQG-WPRVGSWPlWDLGV---- 412
Cdd:pfam15822 80 PpapfppsgpSCPPPGGPYPAPTVpGPGP-IGPYPTPNMPFPELPRPYgaptdpaaAAPSGpWGSMSSGP-WAPGMggqy 157
|
170
....*....|....*
gi 530408016 413 LRPTQPQPSRAPPPA 427
Cdd:pfam15822 158 PAPNMPYPSPGPYPA 172
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
286-422 |
5.00e-07 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 54.30 E-value: 5.00e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 286 PELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGS-LP 364
Cdd:PHA03378 697 PPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGApTP 776
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530408016 365 APWPVLGPVP------APGAQPPPLGDWPAL---PRRWPLPQGWPRVGSWPLWDLGVL--RPTQPQPSR 422
Cdd:PHA03378 777 QPPPQAPPAPqqrprgAPTPQPPPQAGPTSMqlmPRAAPGQQGPTKQILRQLLTGGVKrgRPSLKKPAA 845
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
288-467 |
5.87e-07 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.92 E-value: 5.87e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 288 LLPEGSSAQAVSLSRAQEPAQPPALTPESAPgcttefAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPW 367
Cdd:PHA03378 685 LPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQ------RPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 758
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 368 PVLGPVPAPGAQPPPLGDWPAlPRRWPLPQGWPRVGswplwdlgvlrPTQPQPSRAPPPATEFGslwprPLQPYQSRQGE 447
Cdd:PHA03378 759 AAPGRARPPAAAPGAPTPQPP-PQAPPAPQQRPRGA-----------PTPQPPPQAGPTSMQLM-----PRAAPGQQGPT 821
|
170 180
....*....|....*....|
gi 530408016 448 ALQLAAVQVKGEENDVPSLR 467
Cdd:PHA03378 822 KQILRQLLTGGVKRGRPSLK 841
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
306-442 |
7.44e-07 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.53 E-value: 7.44e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 306 PAQPPALTPESAPGCTTEFAPGPA------PGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQ 379
Cdd:PHA03378 651 PHQPPQVEITPYKPTWTQIGHIPYqpsptgANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAA 730
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530408016 380 PPPLGDWPALPRRWPLPQGWPrvgswplwdlGVLRPTQPQPSRAPPPATEFGSlwPRPLQPYQ 442
Cdd:PHA03378 731 PGRARPPAAAPGRARPPAAAP----------GRARPPAAAPGRARPPAAAPGA--PTPQPPPQ 781
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
307-482 |
1.31e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 52.76 E-value: 1.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 307 AQPPALTPESAP-GCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPPPLGD 385
Cdd:PHA03378 667 TQIGHIPYQPSPtGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARP 746
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 386 WPALPRRWPLPQGWPRVGSWPLWDLGvlRPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGeALQLAAVQVKGEENDVPS 465
Cdd:PHA03378 747 PAAAPGRARPPAAAPGRARPPAAAPG--APTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPT-SMQLMPRAAPGQQGPTKQ 823
|
170
....*....|....*...
gi 530408016 466 -LRGLRERARKDGAPKDR 482
Cdd:PHA03378 824 iLRQLLTGGVKRGRPSLK 841
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
249-463 |
3.60e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 51.14 E-value: 3.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 249 EAALAQTTKYLEatRAIQVSEPVQNPQllqtvwhyevpellPEGSSAQAVSLSRAQEPAQP-PALTPESAPGCTTEFAPG 327
Cdd:PRK07764 371 ERGLLARLERLE--RRLGVAGGAGAPA--------------AAAPSAAAAAPAAAPAPAAAaPAAAAAPAPAAAPQPAPA 434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 328 PAPGTEPvPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPpplgdwpalprrWPLPQGWPRVgswpl 407
Cdd:PRK07764 435 PAPAPAP-PSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPA------------APAPAAAPAA----- 496
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 530408016 408 wdlgvlrPTQPQPSRAPPPATEFGSLWPRPLQ--PYQSRQGEALQLAAVQVKGEENDV 463
Cdd:PRK07764 497 -------PAAPAAPAGADDAATLRERWPEILAavPKRSRKTWAILLPEATVLGVRGDT 547
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
301-445 |
4.29e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.48 E-value: 4.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 301 SRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVTPGSLPAPWPVLGP-------V 373
Cdd:PHA03247 2584 SRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPP--------DTHAPDPPPPSPSPAANEPDPHPPPTVPpperprdD 2655
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 374 PAPGAQPPP-----LGDWP---ALPRRWPLPQGWPRVGswPLWDLGVLRPTQPQPSRAPPPATefgSLWPRPLQPYQSRQ 445
Cdd:PHA03247 2656 PAPGRVSRPrrarrLGRAAqasSPPQRPRRRAARPTVG--SLTSLADPPPPPPTPEPAPHALV---SATPLPPGPAAARQ 2730
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
226-418 |
4.55e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 4.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 226 DAMFTSEIGSSPLDLWQSVEQLPEAALAQTTKYLEATRAIQVSEPVQNPQLlqtvwHYEV-------PELLPEGSSAQAV 298
Cdd:PHA03247 295 DGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQ-----HYPLgfpkrrrPTWTPPSSLEDLS 369
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 299 SLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGlelglelEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPG- 377
Cdd:PHA03247 370 AGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPA-------APVPASVPTPAPTPVPASAPPPPATPLPSAEPGs 442
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 530408016 378 --AQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGVLRPTQP 418
Cdd:PHA03247 443 ddGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEP 485
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
292-434 |
5.30e-06 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 50.48 E-value: 5.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 292 GSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPgtePVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLG 371
Cdd:PRK14951 367 AAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAA---APAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA 443
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530408016 372 PVPAPGAQPPPLGDWPALPRRWPLPQGWPRvgswplwDLGVLRPTQPQPsrAPPPATEFGSLW 434
Cdd:PRK14951 444 AVALAPAPPAQAAPETVAIPVRVAPEPAVA-------SAAPAPAAAPAA--ARLTPTEEGDVW 497
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
290-447 |
1.04e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.60 E-value: 1.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 290 PEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLElglelEPVPALGPVPGPSVTPGSLPAPWPV 369
Cdd:PRK07764 632 AAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAP-----PPAPAPAAPAAPAGAAPAQPAPAPA 706
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530408016 370 LGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGvLRPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGE 447
Cdd:PRK07764 707 ATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDP-PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
270-446 |
2.44e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 2.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 270 PVQNPQLLQTVWHYEVPeLLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPA 349
Cdd:PHA03247 2704 PPPTPEPAPHALVSATP-LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 350 LGPVPGPSVTPG--SLPAPWpvlGPVPAPGAQPPPLgdwPALPrrwplPQGWPRVGSWPlwdlgvlrPTQPQPSRAPPPa 427
Cdd:PHA03247 2783 LTRPAVASLSESreSLPSPW---DPADPPAAVLAPA---AALP-----PAASPAGPLPP--------PTSAQPTAPPPP- 2842
|
170
....*....|....*....
gi 530408016 428 tefgslwPRPLQPYQSRQG 446
Cdd:PHA03247 2843 -------PGPPPPSLPLGG 2854
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
248-446 |
2.87e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 48.33 E-value: 2.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 248 PEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYEVPELLPEGSSAQAVSLSRAQEPAQPPALTPESApgcttefAPG 327
Cdd:PRK12323 392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAA-------ARP 464
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 328 PAPGTEPVPGLELGLELEPVPALGPVPGPSVTPgslpaPW---PVLGPVPAPGAQPPPLGDWPAlprrwplpQGWPRVGS 404
Cdd:PRK12323 465 AAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP-----PWeelPPEFASPAPAQPDAAPAGWVA--------ESIPDPAT 531
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 530408016 405 WPLWDLGVLRPTQPQPSRAPPPATEFGSLWPrPLQPYQSRQG 446
Cdd:PRK12323 532 ADPDDAFETLAPAPAAAPAPRAAAATEPVVA-PRPPRASASG 572
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
290-395 |
3.97e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 47.95 E-value: 3.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 290 PEGSSAQAVSLSRAQEPAQPPALTPESAP---GCTTEFA-PGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLPA 365
Cdd:PRK12323 469 PRPVAAAAAAAPARAAPAAAPAPADDDPPpweELPPEFAsPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAA 548
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 530408016 366 PWPV--LGPVPAPGAQPPPL----------GDWPALPRRWPL 395
Cdd:PRK12323 549 PAPRaaAATEPVVAPRPPRAsasglpdmfdGDWPALAARLPV 590
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
296-427 |
5.33e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.63 E-value: 5.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 296 QAVSLSRAQEPAQPP-ALTPESAP---GCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVT-PGSLPAPWPVL 370
Cdd:PHA03247 2666 RARRLGRAAQASSPPqRPRRRAARptvGSLTSLADPPPPPPTPEP--------APHALVSATPLPPGPaAARQASPALPA 2737
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 530408016 371 GPVPAPGAQPPPLGDWPALPRRWPLPQGWPRvgSWPLWDlgvlrPTQPQPSRAPPPA 427
Cdd:PHA03247 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPA--PAPPAA-----PAAGPPRRLTRPA 2787
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
306-445 |
9.60e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 46.68 E-value: 9.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 306 PAQPPALTPESA-----PGCTTEFAPGPAPGTEPVPglelgleLEPVPALGPVP----GPSVTPGSLPAPWPVLGPVPAP 376
Cdd:pfam03154 183 PPSPPPPGTTQAatagpTPSAPSVPPQGSPATSQPP-------NQTQSTAAPHTliqqTPTLHPQRLPSPHPPLQPMTQP 255
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 530408016 377 G--------AQPPPLGDWPALPRRWPLPQGwPRVGSWPLWDLGVLRPTQPQPSRAPPPatefgslwPRPLQPYQSRQ 445
Cdd:pfam03154 256 PppsqvspqPLPQPSLHGQMPPMPHSLQTG-PSHMQHPVPPQPFPLTPQSSQSQVPPG--------PSPAAPGQSQQ 323
|
|
| gly_rich_SclB |
NF038329 |
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ... |
326-538 |
1.06e-04 |
|
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.
Pssm-ID: 468478 [Multi-domain] Cd Length: 440 Bit Score: 46.05 E-value: 1.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 326 PGPA----PGTEPVPGLELGLELEPVPALGPVP----GPSVTPGSLPAPWPV--LGPVPAPGAQPPPLGDWPALPRRWPL 395
Cdd:NF038329 122 PGPAgpagPAGEQGPRGDRGETGPAGPAGPPGPqgerGEKGPAGPQGEAGPQgpAGKDGEAGAKGPAGEKGPQGPRGETG 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 396 PQGwPRVGSWPLWDLGVLRPTQPQPSRAPPPATEFGSLWPRPLQPYQSRQGEALQLAAVQVKGE--ENDVPSLRGLRERA 473
Cdd:NF038329 202 PAG-EQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDrgEAGPDGPDGKDGER 280
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530408016 474 RKDGAP-KDRTR-KDGVPkdrgGKDvDPKDRAHKDDVPKDRGgKDGDP-KD----RVGKDGAPKEAQPKAPQ 538
Cdd:NF038329 281 GPVGPAgKDGQNgKDGLP----GKD-GKDGQNGKDGLPGKDG-KDGQPgKDglpgKDGKDGQPGKPAPKTPE 346
|
|
| DUF4813 |
pfam16072 |
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. ... |
291-396 |
1.91e-04 |
|
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 345 and 672 amino acids in length.
Pssm-ID: 435117 [Multi-domain] Cd Length: 288 Bit Score: 44.75 E-value: 1.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 291 EGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEP-VPGLELGlELEPVPALGPVPGPSVTPG--SLPAPW 367
Cdd:pfam16072 153 SAGSGTTVINAGGQQPAAPAAPAYPVAPAAYPAQAPAAAPAPAPgAPQTPLA-PLNPVAAAPAAAAGAAAAPvvAAAAPA 231
|
90 100 110
....*....|....*....|....*....|....*
gi 530408016 368 PVLGPVPAPGAqPPPLGDWPA------LPRRWPLP 396
Cdd:pfam16072 232 AAAPPPPAPAA-PPADAAPPApggiicVPVRVPEP 265
|
|
| FAP |
pfam07174 |
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ... |
292-399 |
2.86e-04 |
|
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
Pssm-ID: 429334 Cd Length: 301 Bit Score: 44.15 E-value: 2.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 292 GSSAQAVSL---SRAQEPAQPPALTPESAPgctteFAPGPAPgtePVPGlelglelEPVPAlgPVPGPSVTPGSLPAPWP 368
Cdd:pfam07174 25 GASAVAVALpavAHADPEPAPPPPSTATAP-----PAPPPPP---PAPA-------APAPP--PPPAAPNAPNAPPPPAD 87
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 530408016 369 VLGPVPAPG--AQPPPLGDWPALPR-----------RWPLPQGW 399
Cdd:pfam07174 88 PNAPPPPPAdpNAPPPPAVDPNAPEpgridnavggfSYVVPAGW 131
|
|
| penta_MxKDx |
TIGR02953 |
pentapeptide MXKDX repeat protein; Members of this protein family are small bacterial proteins, ... |
481-532 |
3.32e-04 |
|
pentapeptide MXKDX repeat protein; Members of this protein family are small bacterial proteins, each with an N-terminal signal sequence followed by up to 11 imperfect repeats of a pentapeptide. The pentapeptide repeat usually follows the form Met-Xaa-Lys-Asp-Xaa.
Pssm-ID: 131998 [Multi-domain] Cd Length: 75 Bit Score: 40.21 E-value: 3.32e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 530408016 481 DRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDGDPKDRVGKDGAPKEA 532
Cdd:TIGR02953 23 DAMKKDTMKKDAMGKDAMAKDAMSKDAMKKDAMKKDAMKKDGMKKDAMKKDA 74
|
|
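The Met-Xaa-Lys-Asp-Xaa form described for TIGR02953 can be checked against the two alignment rows above with a short script. This is only a sketch under a strict reading of the motif: the family description says the repeat "usually" follows this form, so real family members may deviate. The names `MXKDX`, `count_mxkdx`, `CONSENSUS`, and `QUERY` are illustrative and not part of any NCBI tool.

```python
import re

# Strict reading of the Met-Xaa-Lys-Asp-Xaa form; "." stands for any residue (Xaa).
# Assumption: no tolerance for the imperfect repeats the family description mentions.
MXKDX = re.compile(r"M.KD.")


def count_mxkdx(seq: str) -> int:
    """Count non-overlapping strict M-x-K-D-x pentapeptides in a protein sequence."""
    return len(MXKDX.findall(seq.upper()))


# Ungapped rows copied from the TIGR02953 alignment above.
CONSENSUS = "DAMKKDTMKKDAMGKDAMAKDAMSKDAMKKDAMKKDAMKKDGMKKDAMKKDA"  # Cdd row, 23-74
QUERY = "DRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDGDPKDRVGKDGAPKEA"      # gi 530408016, 481-532

print("consensus repeats:", count_mxkdx(CONSENSUS))
print("query repeats:", count_mxkdx(QUERY))
```

Under this strict pattern the query segment yields no matches because it contains no methionine, even though it shares the five-residue Lys-Asp periodicity with the consensus. That is consistent with a weak, composition-driven hit rather than a true member of the family.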
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
301-540 |
3.58e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.78 E-value: 3.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 301 SRAQEPAQPPALTPESAPGCTTEFAPGPAP-GTEPVPGLELGLELEPVPALgpvPGPSVTPGSLPAPWPVLGPVPAPGAQ 379
Cdd:PHA03307 75 PGTEAPANESRSTPTWSLSTLAPASPAREGsPTPPGPSSPDPPPPTPPPAS---PPPSPAPDLSEMLRPVGSPGPPPAAS 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 380 PPPLGDWPALPRRwplpqgwprvGSWPLWDLGVLRPTQPQPSRAP-PPATEFGSLWPRP-LQPYQSRQGEALQLAAVqvk 457
Cdd:PHA03307 152 PPAAGASPAAVAS----------DAASSRQAALPLSSPEETARAPsSPPAEPPPSTPPAaASPRPPRRSSPISASAS--- 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 458 geenDVPSLRGLRERARKDGAPKDRTRKDGVPKDRGGKDVDPKDRAHKDDVP-KDRGGKDGDPKDR----VGKDGAPKEA 532
Cdd:PHA03307 219 ----SPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPtRIWEASGWNGPSSrpgpASSSSSPRER 294
|
....*...
gi 530408016 533 QPKAPQSA 540
Cdd:PHA03307 295 SPSPSPSS 302
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
321-474 |
4.28e-04 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 44.66 E-value: 4.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 321 TTEFAPGPAPGTE----PVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPV---LGPVPAPGAQPPPLGDWPALP-RR 392
Cdd:PHA03379 404 ALEKASEPTYGTPrppvEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLhdqHSMAPCPVAQLPPGPLQDLEPgDQ 483
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 393 WPLPQGWPRVGSWPLWDLG--VLRPTQPQPSRAP--PPATEF-----GSLWPRPLQPYQSRQGEALQLAAVQVKGEENdv 463
Cdd:PHA03379 484 LPGVVQDGRPACAPVPAPAgpIVRPWEASLSQVPgvAFAPVMpqpmpVEPVPVPTVALERPVCPAPPLIAMQGPGETS-- 561
|
170
....*....|.
gi 530408016 464 pSLRGLRERAR 474
Cdd:PHA03379 562 -GIVRVRERWR 571
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
245-541 |
4.61e-04 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 44.28 E-value: 4.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 245 EQLPEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYEVPELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGctTEF 324
Cdd:COG5180 212 EEPPDLTGGADHPRPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAG--SEP 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 325 APGPAPGTEPVPGLELGLELEPvPALGPV-PGPSVTPGSLPAPwpvLGPVPAPGAQPPPLGDWPALPRRWPLPQGwprvg 403
Cdd:COG5180 290 QSDAPEAETARPIDVKGVASAP-PATRPVrPPGGARDPGTPRP---GQPTERPAGVPEAASDAGQPPSAYPPAEE----- 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 404 swplwdlgvLRPTQPQPSRAPPPATEFGSlwPRPLQPY----QSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGAP 479
Cdd:COG5180 361 ---------AVPGKPLEQGAPRPGSSGGD--GAPFQPPngapQPGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAA 429
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530408016 480 KDRTRkdgVPKDRGGKDVdpkdrAHKDDVPKDRGGKDGDPKdrvgkdgAPKEAQPKAPQSAL 541
Cdd:COG5180 430 GGAGQ---GPKADFVPGD-----AESVSGPAGLADQAGAAA-------STAMADFVAPVTDA 476
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
274-428 |
5.28e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 44.37 E-value: 5.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 274 PQLLQTVWHYEVPELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPG-------LELGLELEP 346
Cdd:pfam03154 313 PSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPhlsgpspFQMNSNLPP 392
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 347 VPALGPVPGPSV--TPGSLPAP---WPVLGPVPAPGAQPPPLGDWPALP---RRWPLPQGWPRVGSWPLWDLGVLRPTQP 418
Cdd:pfam03154 393 PPALKPLSSLSThhPPSAHPPPlqlMPQSQQLPPPPAQPPVLTQSQSLPppaASHPPTSGLHQVPSQSPFPQHPFVPGGP 472
|
170
....*....|...
gi 530408016 419 Q---PSRAPPPAT 428
Cdd:pfam03154 473 PpitPPSGPPTST 485
|
|
| RRM_RBM27 |
cd12517 |
RNA recognition motif (RRM) found in vertebrate RNA-binding protein 27 (RBM27); This subgroup ... |
248-284 |
5.66e-04 |
|
RNA recognition motif (RRM) found in vertebrate RNA-binding protein 27 (RBM27); This subgroup corresponds to the RRM of RBM27, which contains a single RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain). Although the specific function of the RRM in RBM27 remains unclear, it shows high sequence similarity with RRM1 of RBM26, which functions as a cutaneous lymphoma (CL)-associated antigen.
Pssm-ID: 409939 [Multi-domain] Cd Length: 76 Bit Score: 39.65 E-value: 5.66e-04
10 20 30
....*....|....*....|....*....|....*..
gi 530408016 248 PEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYE 284
Cdd:cd12517 40 PEAALIQYTTNEEARRAISSTEAVLNNRFIRVLWHRE 76
|
|
| Pro-rich |
pfam15240 |
Proline-rich protein; This family includes several eukaryotic proline-rich proteins. |
293-439 |
6.37e-04 |
|
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
Pssm-ID: 464580 [Multi-domain] Cd Length: 167 Bit Score: 41.95 E-value: 6.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 293 SSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSVTPGSLP-APWPVLG 371
Cdd:pfam15240 14 SSAQSSSEDVSQEDSPSLISEEEGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPqGPPPQGG 93
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530408016 372 PVPAPGAQ---PPPLGDWPALPRRWPLPQGWPRVGSWPLWDLG-VLRPTQPQPSR--APPPATEFGSLWPRPLQ 439
Cdd:pfam15240 94 PRPPPGKPqgpPPQGGNQQQGPPPPGKPQGPPPQGGGPPPQGGnQQGPPPPPPGNpqGPPQRPPQPGNPQGPPQ 167
|
|
| RRM1_RBM26 |
cd12516 |
RNA recognition motif 1 (RRM1) found in vertebrate RNA-binding protein 26 (RBM26); This ... |
248-284 |
9.73e-04 |
|
RNA recognition motif 1 (RRM1) found in vertebrate RNA-binding protein 26 (RBM26); This subgroup corresponds to the RRM1 of RBM26, also known as cutaneous T-cell lymphoma (CTCL) tumor antigen se70-2, which represents a cutaneous lymphoma (CL)-associated antigen. It contains two RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The RRMs may play some functional roles in RNA-binding or protein-protein interactions.
Pssm-ID: 409938 [Multi-domain] Cd Length: 76 Bit Score: 38.84 E-value: 9.73e-04
10 20 30
....*....|....*....|....*....|....*..
gi 530408016 248 PEAALAQTTKYLEATRAIQVSEPVQNPQLLQTVWHYE 284
Cdd:cd12516 40 PEGALIQFATHEEAKRAISSTEAVLNNRFIKVYWHRE 76
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
309-440 |
1.58e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 42.39 E-value: 1.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 309 PPALTPESAPGCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVtPGSLPAPWPVLGPVPAPGAQPPPLGDWPA 388
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPAAAP--------VAQAAAAPAPAAAP-AAAASAPAAPPAAAPPAPVAAPAAAAPAA 436
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 530408016 389 LPRRWPLPQGWPRVGSWPL--WDLGVLRPTQPQPSRAPPPATEFGSLWPRPLQP 440
Cdd:PRK14951 437 APAAAPAAVALAPAPPAQAapETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
272-423 |
1.76e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 42.46 E-value: 1.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 272 QNPQLLQTVWHYEVPELLPEGSSAqavslSRAQEPAQP--PALT-PESAPGCTTEFAPGPAPGTEPVPglelglelEPVP 348
Cdd:PRK14971 346 KNKRLLVELTLIQLAQLTQKGDDA-----SGGRGPKQHikPVFTqPAAAPQPSAAAAASPSPSQSSAA--------AQPS 412
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530408016 349 ALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRVGswplwdLGVLRPTQPQPSRA 423
Cdd:PRK14971 413 APQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLG------PSTLRPIQEKAEQA 481
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
303-537 |
2.58e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 42.14 E-value: 2.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 303 AQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGlELGLELEPVPALGPVPGPSVTPGSLPAP--------WPVLGPVP 374
Cdd:PRK07003 370 GGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTG-AAGAALAPKAAAAAAATRAEAPPAAPAPpatadrgdDAADGDAP 448
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 375 APGAQPPPLGDWPALPRRWPLPQGWPRVGSWPlwdlgvlrptqpqPSRAPPPATEFGSLWPRPLQPYQS-RQGEALQLAA 453
Cdd:PRK07003 449 VPAKANARASADSRCDERDAQPPADSGSASAP-------------ASDAPPDAAFEPAPRAAAPSAATPaAVPDARAPAA 515
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 454 VQvKGEENDVPSLRGLRERARKDGAPKDRTRKDG------VPKDRGGKdvdpkdrahkddVPKDRGGK-DGDPKDRVGKD 526
Cdd:PRK07003 516 AS-REDAPAAAAPPAPEARPPTPAAAAPAARAGGaaaaldVLRNAGMR------------VSSDRGARaAAAAKPAAAPA 582
|
250
....*....|.
gi 530408016 527 GAPKEAQPKAP 537
Cdd:PRK07003 583 AAPKPAAPRVA 593
|
|
| PHA03321 |
PHA03321 |
tegument protein VP11/12; Provisional |
282-543 |
2.78e-03 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 223041 [Multi-domain] Cd Length: 694 Bit Score: 41.87 E-value: 2.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 282 HYE-VPELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPVPALGPVPGPSvTP 360
Cdd:PHA03321 417 HYEaSLRLLSSRQPPGAPAPRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLPAGAAPPPEPA-AA 495
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 361 GSLPAPWPVLGPVPapgaqppplgdwPALPRRWPLPQgwprvgswplwdlgVLRPTQPQPSRAPPPATEFGSLWPRPLQP 440
Cdd:PHA03321 496 PSPATYYTRMGGGP------------PRLPPRNRATE--------------TLRPDWGPPAAAPPEQMEDPYLEPDDDRF 549
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 441 YQSRQGEALQLAAVQVKGEENDVPSLRGLRERARK--DGAPKDRTRKDGVPKDRGGKDVDPKDRA-------HKDDVPKD 511
Cdd:PHA03321 550 DRRDGAAAAATSHPREAPAPDDDPIYEGVSDSEEPvyEEIPTPRVYQNPLPRPMEGAGEPPDLDAptspwveEENPIYGW 629
|
250 260 270
....*....|....*....|....*....|..
gi 530408016 512 RGGKDGDPKDRVGKDGAPKEAQPKAPQSALHR 543
Cdd:PHA03321 630 GDSPLFSPPPAARFPPPDPALSPEPPALPAHR 661
|
|
| PRK05641 |
PRK05641 |
putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated |
332-385 |
3.81e-03 |
|
putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated
Pssm-ID: 235540 [Multi-domain] Cd Length: 153 Bit Score: 39.08 E-value: 3.81e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 530408016 332 TEPVPGLELGLELEPVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGaqPPPLGD 385
Cdd:PRK05641 33 TYEVEAKGLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPA--PASAGE 84
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
715-867 |
4.14e-03 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 41.43 E-value: 4.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 715 QKKIGSLQKSRLKEEELERIWGNQIEMMKDRYITLDKAVENLQIRMDEFKTLQAQIKRLEMN-------------KVNK- 780
Cdd:PRK01156 196 NLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEiktaesdlsmeleKNNYy 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 781 STMEEELREKADRSALAGKASRVDLETVA---LELNEMIQGILFKVTIHEDSWKKAmEELSKDvntklvHSDLDPLKKEM 857
Cdd:PRK01156 276 KELEERHMKIINDPVYKNRNYINDYFKYKndiENKKQILSNIDAEINKYHAIIKKL-SVLQKD------YNDYIKKKSRY 348
|
170
....*....|
gi 530408016 858 EEVWKIVRKL 867
Cdd:PRK01156 349 DDLNNQILEL 358
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
299-543 |
4.19e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 4.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 299 SLSRAQEPAQPPALTPESAPGCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGpSVTPGSLPAPwPVLGPVPapga 378
Cdd:PHA03247 2465 SLSLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGG--------PPDPDAPPAPS-RLAPAILPDE-PVGEPVH---- 2530
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 379 qppplgdwpalPRRWPLPQGWPRVGSWPLWDlgvlrPTQPQPSRAPPPATEFG----SLWPRPLQPyqsrqgealqlaAV 454
Cdd:PHA03247 2531 -----------PRMLTWIRGLEELASDDAGD-----PPPPLPPAAPPAAPDRSvpppRPAPRPSEP------------AV 2582
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 455 QVKGEENDVPSlRGLRERARKD--GAPKDRTRKDGVPKDRGGKDVDPKDRAHKDDVPKDRGGKDGDPKDRVGKDGAPKEA 532
Cdd:PHA03247 2583 TSRARRPDAPP-QSARPRAPVDdrGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV 2661
|
250
....*....|.
gi 530408016 533 QPKAPQSALHR 543
Cdd:PHA03247 2662 SRPRRARRLGR 2672
|
|
| FAP |
pfam07174 |
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ... |
326-435 |
4.62e-03 |
|
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
Pssm-ID: 429334 Cd Length: 301 Bit Score: 40.29 E-value: 4.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 326 PGPAPgtePVPglelglelePVPALGPVPGPSVTPGSLPAPWPVLGPVPAPGAQPPplgdwPALPRRWPLPQGWPRVGSW 405
Cdd:pfam07174 41 PEPAP---PPP---------STATAPPAPPPPPPAPAAPAPPPPPAAPNAPNAPPP-----PADPNAPPPPPADPNAPPP 103
|
90 100 110
....*....|....*....|....*....|
gi 530408016 406 PLWDlgvlrPTQPQPSRAPPPATEFGSLWP 435
Cdd:pfam07174 104 PAVD-----PNAPEPGRIDNAVGGFSYVVP 128
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
274-435 |
5.35e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.08 E-value: 5.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 274 PQLLQTVWHYEVPELLPE---GSSAQAVSLSRAQEPAQP---PALTPESAPGCTTEFAPGPAPGTEPVPGLELGLELEPV 347
Cdd:PHA03247 2889 PAVSRSTESFALPPDQPErppQPQAPPPPQPQPQPPPPPqpqPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALV 2968
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 348 PALGPVPGPSVTPgslPAPwpvlgPVPAPGAQPPPLGDWPAlprrwplpqgwPRVGSWPLwDLGVLRPTqpqpsrAPPPA 427
Cdd:PHA03247 2969 PGRVAVPRFRVPQ---PAP-----SREAPASSTPPLTGHSL-----------SRVSSWAS-SLALHEET------DPPPV 3022
|
....*...
gi 530408016 428 TEFGSLWP 435
Cdd:PHA03247 3023 SLKQTLWP 3030
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
286-379 |
5.57e-03 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 39.60 E-value: 5.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 286 PELLPEGSSAQAVSLSRAQEPAQPPALTPESAPgCTTEFAPGPAPGTEPVPglelglelEPVPALGPVPGPSVTPGSLPA 365
Cdd:PRK11633 64 PTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTP-VEPEPAPVEPPKPKPVE--------KPKPKPKPQQKVEAPPAPKPE 134
|
90
....*....|....
gi 530408016 366 PWPVLGPVPAPGAQ 379
Cdd:PRK11633 135 PKPVVEEKAAPTGK 148
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
272-440 |
6.11e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.91 E-value: 6.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 272 QNPQLLQTVWHYEVPELLPEGSSAQAVSlsrAQEPAQPPALTPESAPGcTTEFAPGPAPGTEPVPGLELGLELEP----- 346
Cdd:pfam03154 169 TQPPVLQAQSGAASPPSPPPPGTTQAAT---AGPTPSAPSVPPQGSPA-TSQPPNQTQSTAAPHTLIQQTPTLHPqrlps 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 347 -----VPALGPVPGPSVTPGSLPAPW--PVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPRVGSWPLWDLGVLRPTQpQ 419
Cdd:pfam03154 245 phpplQPMTQPPPPSQVSPQPLPQPSlhGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQ-Q 323
|
170 180
....*....|....*....|.
gi 530408016 420 PSRAPPPATEFGSLWPRPLQP 440
Cdd:pfam03154 324 RIHTPPSQSQLQSQQPPREQP 344
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
286-484 |
8.15e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.44 E-value: 8.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 286 PELLPEGSSAQAVSLSRAQEPAQPPALTPESAPGCTTEFAPG-PAPGTEPVPGLELG---LELEPVP---ALGPVPG--P 356
Cdd:PHA03378 576 PLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPEtSAPRQWPMPLRPIPmrpLRMQPITfnvLVFPTPHqpP 655
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 357 SVTPGSLPAPWPVLGPVPApgaQPPPLGDWPALPRRWPLpqgwprvgswplwdlGVLRPTQPQPSRAPPPATEFGSLWPR 436
Cdd:PHA03378 656 QVEITPYKPTWTQIGHIPY---QPSPTGANTMLPIQWAP---------------GTMQPPPRAPTPMRPPAAPPGRAQRP 717
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 530408016 437 PLQPYQSRQGEALQLAAVQVKGEENDVPSLRGLRERARKDGAPKDRTR 484
Cdd:PHA03378 718 AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 765
|
|
| Drf_FH1 |
pfam06346 |
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs) ... |
286-431 |
8.45e-03 |
|
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.
Pssm-ID: 461881 [Multi-domain] Cd Length: 157 Bit Score: 38.31 E-value: 8.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530408016 286 PELLPEGSSAQAVSLSRAQEPAQPPALtpesaPGCTTEFAPGPAPGTEPVPglelglELEPVPALGPVPGPSVTPGS--L 363
Cdd:pfam06346 25 PPLPGGGGPPPPPPLPGSAAIPPPPPL-----PGGTSIPPPPPLPGAASIP------PPPPLPGSTGIPPPPPLPGGagI 93
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530408016 364 PAPWPVLGPVPAPGAQPPPLGDWPALPRRWPLPQGWPrvgswplwdlgvLRPTQPQPSRAPPPATEFG 431
Cdd:pfam06346 94 PPPPPPLPGGAGVPPPPPPLPGGPGIPPPPPFPGGPG------------IPPPPPGMGMPPPPPFGFG 149
|
|
|