|
Name |
Accession |
Description |
Interval |
E-value |
| LRRC37AB_C |
pfam14914 |
LRRC37A/B like protein 1 C-terminal domain; This family represents the C-terminal domain of ... |
1468-1613 |
3.13e-78 |
|
LRRC37A/B like protein 1 C-terminal domain; This family represents the C-terminal domain of the putative Leucine Rich Repeat Containing protein 37A or protein 37B (LRRC37A/B) found in eukaryotes. The Leucine Rich Repeats (LRR) lie in the central region. The gene that encodes this protein is found at chromosomal position 17q11.2, and its microdeletion results in the disease neurofibromatosis type-1 (NF1). The function of the protein LRRC37B is unknown; however, experimental data show expression in the aorta, heart, skeletal muscle, liver and brain during gestation.
Pssm-ID: 464370 Cd Length: 147 Bit Score: 254.65 E-value: 3.13e-78
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 1468 SPGDQFEIQLTQQLQSLIPNNNVRRLIAHVIRTLKMDCSGAHVQVTCAKLISRTGHLMKLLSGQQEVKASKIEWDTDQWK 1547
Cdd:pfam14914 1 SPGDQFEIQLNQQLLSLIPNVDVRRLISHVIRTLKMDCSEPQMQLACAKLISRTGLLMKLLSEQQEAKVSKADWDTDQWK 80
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 116325993 1548 IENYINESTEAQSEQKE-KSLELKKEVPGYGYTDKLILALIVTGILTILIILFCLIVICCHRRSLQE 1613
Cdd:pfam14914 81 NENYINESTEAQSKQKKqSSRELTKEVPGYGYNNKLILAISVTVVIMILIIILCLIEICSHRSASGE 147
|
|
| LRRC37 |
pfam15779 |
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ... |
560-629 |
5.96e-23 |
|
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.
Pssm-ID: 434930 [Multi-domain] Cd Length: 73 Bit Score: 93.97 E-value: 5.96e-23
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116325993 560 EVELSPTMKETPTQP---PKKVVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEPTTEVGHST 629
Cdd:pfam15779 1 EVEPSPTQQETPTQPpesPKEVVAQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
|
|
| LRRC37 |
pfam15779 |
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ... |
251-319 |
2.20e-15 |
|
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.
Pssm-ID: 434930 [Multi-domain] Cd Length: 73 Bit Score: 72.40 E-value: 2.20e-15
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 116325993 251 EEEPSSMQQEAPALPPESSMESL--TLPNHEVSVQPPGEDQAYY-HLPNITVKPADVEVTITSEPTNETESS 319
Cdd:pfam15779 1 EVEPSPTQQETPTQPPESPKEVVaqPPVHHEVTVPTPGQGQAQHpTLPNVTVQPLDLELTITPEPTKEAEHS 72
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
206-704 |
2.96e-15 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 82.06 E-value: 2.96e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 206 PPGPSEQVGPSQFHLE----PETQNPETLEDIQSSSLQQEAPAQLPQLLEEEPssMQQEAPALPPESSMESLTLPNHEVS 281
Cdd:PRK10263 344 PPVASVDVPPAQPTVAwqpvPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEP--LQQPVQPQQPYYAPAAEQPAQQPYY 421
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 282 VQPPGEDQAYYHLPNITVKPA--------DVEVTITSEPTNETESSQAQ---QETPIQFPEEVEPSATQQEAP----IEP 346
Cdd:PRK10263 422 APAPEQPAQQPYYAPAPEQPVagnawqaeEQQSTFAPQSTYQTEQTYQQpaaQEPLYQQPQPVEQQPVVEPEPvveeTKP 501
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 347 PVPPMEHELSISEQQ-----------QPV-QPSESPREVESSPTQQETPGQPPEHHEVTVSP--PGHHQTHHLASPSVSV 412
Cdd:PRK10263 502 ARPPLYYFEEVEEKRarereqlaawyQPIpEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPlaSGVKKATLATGAAATV 581
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 413 KPPDVQLTIAAEPSAEVGTSLVHQ-------EATTRLSGSGNDVEPPAIQhggppLLPESSEEAGPLAVQQETSFQSPEP 485
Cdd:PRK10263 582 AAPVFSLANSGGPRPQVKEGIGPQlprpkriRVPTRRELASYGIKLPSQR-----AAEEKAREAQRNQYDSGDQYNDDEI 656
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 486 INNENPSPTQQEAAAEHPQTAEEGESSLTHQ----EAPAQTPEFPNVVVAQPPEHSHLTQATVQPLDLG-FTITP----- 555
Cdd:PRK10263 657 DAMQQDELARQFAQTQQQRYGEQYQHDVPVNaedaDAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDdFEFSPmkall 736
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 556 -ESKTEVELSPT-MKETPTQPPKKVVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEpTTEVGHSTPPKR 633
Cdd:PRK10263 737 dDGPHEPLFTPIvEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ-YQQPQQPVAPQP 815
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 116325993 634 TIVSPKHPEVTLPHPDQVQTQhshltrATVQPLDLgfTITPKSMTEVEPSTALMTTAPPPGHPEVTLPPSD 704
Cdd:PRK10263 816 QYQQPQQPVAPQPQYQQPQQP------VAPQPQDT--LLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSE 878
|
|
| LRR_8 |
pfam13855 |
Leucine rich repeat; |
892-949 |
5.84e-15 |
|
Leucine rich repeat;
Pssm-ID: 404697 [Multi-domain] Cd Length: 61 Bit Score: 70.63 E-value: 5.84e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 116325993 892 EKLILRENNLTELHKDSFEGLLSLQYLDLSCNKIQSIERHTFEPLPFLKFINLSCNVI 949
Cdd:pfam13855 4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
|
|
| LRRC37 |
pfam15779 |
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ... |
352-429 |
1.16e-14 |
|
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.
Pssm-ID: 434930 [Multi-domain] Cd Length: 73 Bit Score: 70.47 E-value: 1.16e-14
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 116325993 352 EHELSISEQQQPVQPSESPREVEssptqqetpGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEV 429
Cdd:pfam15779 1 EVEPSPTQQETPTQPPESPKEVV---------AQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEA 69
|
|
| LRR |
COG4886 |
Leucine-rich repeat (LRR) protein [Transcription]; |
868-1020 |
2.23e-14 |
|
Leucine-rich repeat (LRR) protein [Transcription];
Pssm-ID: 443914 [Multi-domain] Cd Length: 414 Bit Score: 77.28 E-value: 2.23e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 868 TILNFQGNYISYIDGNVwkaYSWT--EKLILRENNLTELHkDSFEGLLSLQYLDLSCNKIQSIERHtFEPLPFLKFINLS 945
Cdd:COG4886 116 ESLDLSGNQLTDLPEEL---ANLTnlKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDLPEE-LGNLTNLKELDLS 190
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 116325993 946 CNVITELSfgtfQAWHGMQFLHKLILNHNPLTTVEDPyLFKLPALKYLDMGTTlvPLTTLKNILMMTvELEKLIL 1020
Cdd:COG4886 191 NNQITDLP----EPLGNLTNLEELDLSGNQLTDLPEP-LANLTNLETLDLSNN--QLTDLPELGNLT-NLEELDL 257
|
|
| LRRC37 |
pfam15779 |
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ... |
689-739 |
2.76e-12 |
|
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.
Pssm-ID: 434930 [Multi-domain] Cd Length: 73 Bit Score: 63.54 E-value: 2.76e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 116325993 689 TAPPPGHPEVTLPPSDKGQAQHSHLTQATVQPLDLELTITTKPTTEVKPSP 739
Cdd:pfam15779 23 VAQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
|
|
| PPP1R42 |
cd21340 |
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ... |
874-978 |
1.29e-05 |
|
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.
Pssm-ID: 411060 [Multi-domain] Cd Length: 220 Bit Score: 48.24 E-value: 1.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 874 GNYISYIDGnvwkayswTEKLilreNNLTELH---------------KDSFEGLL-SLQYLDLSCNKIQSIErhTFEPLP 937
Cdd:cd21340 77 GNRISVVEG--------LENL----TNLEELHienqrlppgekltfdPRSLAALSnSLRVLNISGNNIDSLE--PLAPLR 142
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 116325993 938 FLKFINLSCNVITELS--FGTFQAWHgmqFLHKLILNHNPLTT 978
Cdd:cd21340 143 NLEQLDASNNQISDLEelLDLLSSWP---SLRELDLTGNPVCK 182
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
302-604 |
9.25e-04 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 44.27 E-value: 9.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 302 ADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQQQPVQ--PSESPREVESSPTQ 379
Cdd:COG5665 240 PSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTSNTPTSTAKAQPQPPTKKQPAKepPSDTASGNPSAPSV 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 380 QETPGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPdvqltiaAEPSAEVgTSLVHQEATTRLSGSgndVEPPAIQHGGP 459
Cdd:COG5665 320 LINSDSPTSEDPATASVPTTEETTAFTTPSSVPSTP-------AEKDTPA-TDLATPVSPTPPETS---VDKKVSPDSAT 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 460 PLLPESSEEAGPLAV-QQETSFQSPEPINNENPSPTQQEAAAEHPQTAEegesSLTHQEAPAQTPEFPNVVVAQPPEHSH 538
Cdd:COG5665 389 SSTKSEKEGGTASSPmPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPE----ADSAPESSVRTEASPSAGSDLEPENTT 464
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 116325993 539 LTQAtvqpldlgftiTPESKTEVELSPTMKETPTQPPKKVVPQLRVYQGVTNPTPGQDQAQHPVSP 604
Cdd:COG5665 465 LRDP-----------APNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDSTPSSTADQSIVGVLA 519
|
|
| ftsN |
TIGR02223 |
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ... |
181-414 |
1.41e-03 |
|
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30-residue region tends to be Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]
Pssm-ID: 274041 [Multi-domain] Cd Length: 298 Bit Score: 42.76 E-value: 1.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 181 QNEYSSTDTPYPgslppELRVKSDEPPGPSEQVGPSQFHLEPETQNPETLedIQSSSLQQEAPAQLPQLLEEEPSSMQQE 260
Cdd:TIGR02223 3 QRDYVRRGRGAP-----QKKKKNRRLVRATVLIAAILILLFIGGSSGLYL--LTESKQANEPETLQPKNQTENGETAADL 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 261 APAlPPESSMESLTLPNHEVSVQPPGEDQAyyhlpnitVKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQ 340
Cdd:TIGR02223 76 PPK-PEERWSYIEELEAREVLINDPEEPSN--------GGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVA 146
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 116325993 341 EAPIEPPVPPMEHELSISEQQQ-PVQPSESPREVESSPTQQETPGQPPEHHEVTVSPpghHQTHHLASPSVSVKP 414
Cdd:TIGR02223 147 VEARKQTAEKKPQKARTAEAQKtPVETEKIASKVKEAKQKQKALPKQTAETQSNSKP---IETAPKADKADKTKP 218
|
|
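The summary line under each hit (Pssm-ID, Cd Length, Bit Score, E-value) follows a fixed pattern in this listing, so it can be read programmatically. The following is a minimal sketch, assuming the line layout is exactly as shown above; the names `SUMMARY_RE` and `parse_summary` are hypothetical helpers for illustration and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing, e.g.
# "Pssm-ID: 464370 Cd Length: 147 Bit Score: 254.65 E-value: 3.13e-78".
# The "[Multi-domain]" tag is optional and is reported as a boolean.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 434930 [Multi-domain] Cd Length: 73 Bit Score: 93.97 E-value: 5.96e-23"
    print(parse_summary(line))
```

Parsed this way, hits can be sorted or filtered by `e_value` without depending on the order in which the page lists them.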
|
|
Name |
Accession |
Description |
Interval |
E-value |
| LRR |
COG4886 |
Leucine-rich repeat (LRR) protein [Transcription]; |
868-1006 |
1.68e-11 |
|
Leucine-rich repeat (LRR) protein [Transcription];
Pssm-ID: 443914 [Multi-domain] Cd Length: 414 Bit Score: 68.42 E-value: 1.68e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 868 TILNFQGNYISYIDGNVWKAYSwTEKLILRENNLTELHkDSFEGLLSLQYLDLSCNKIQSIERhTFEPLPFLKFINLSCN 947
Cdd:COG4886 162 KSLDLSNNQLTDLPEELGNLTN-LKELDLSNNQITDLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNN 238
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 116325993 948 VITELSfgtfqAWHGMQFLHKLILNHNPLTTVedPYLFKLPALKYLDMGTTlvPLTTLK 1006
Cdd:COG4886 239 QLTDLP-----ELGNLTNLEELDLSNNQLTDL--PPLANLTNLKTLDLSNN--QLTDLK 288
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
190-752 |
3.09e-11 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 68.81 E-value: 3.09e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 190 PYPGSLPPELRVKS-------DEPPGPSEQVGPSQFHLEPETQNPETLEDIQSSSLQQEAPAQLPqlleeePSSMQQEAP 262
Cdd:PHA03247 2554 PLPPAAPPAAPDRSvppprpaPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLP------PDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 263 alPPESSMESLTLPNHEVSVQPPGEDQAYYHLPNITVKPAdvEVTITSEPTNEteSSQAQQETPIQFPEEVEPSATQQEA 342
Cdd:PHA03247 2628 --PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR--RARRLGRAAQA--SSPPQRPRRRAARPTVGSLTSLADP 2701
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 343 PIEPPVPPMEHELSISEQQQPVQPSESPREVESSPTQQETPGQP-----PEHHEVTVSPPGHHQTHHLASPSVSVKPPDV 417
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 418 QLTIAAEPSAevgtslvhQEATTRLSGSGNDVEPPAIQHGGPPLLPESSEEAGPLAVQQetsfqSPEPINNENPSPTQQE 497
Cdd:PHA03247 2782 RLTRPAVASL--------SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPT-----SAQPTAPPPPPGPPPP 2848
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 498 AAAEHPQTAEEGESSlthQEAPAQTPefPNVVVAQP-PEHSHLTQATVQPLDLGFTITPESKtEVELSPTMKETPTQPPK 576
Cdd:PHA03247 2849 SLPLGGSVAPGGDVR---RRPPSRSP--AAKPAAPArPPVRRLARPAVSRSTESFALPPDQP-ERPPQPQAPPPPQPQPQ 2922
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 577 KVVPQLRVYQgvtNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEPTTEVGHSTPPKRTIVSPKHPEVTLPHPDQVQTQHS 656
Cdd:PHA03247 2923 PPPPPQPQPP---PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 657 HLTRATVQPLDLGFTITPksmtevepstalmttAPPPGHPEVTL-PPSDKGQAQHSHLTQATVQPLDLEltiTTKPTTEV 735
Cdd:PHA03247 3000 SLSRVSSWASSLALHEET---------------DPPPVSLKQTLwPPDDTEDSDADSLFDSDSERSDLE---ALDPLPPE 3061
|
570
....*....|....*..
gi 116325993 736 KPSPTTEETSTQPPDLG 752
Cdd:PHA03247 3062 PHDPFAHEPDPATPEAG 3078
|
|
| LRR_8 |
pfam13855 |
Leucine rich repeat; |
914-976 |
9.14e-11 |
|
Leucine rich repeat;
Pssm-ID: 404697 [Multi-domain] Cd Length: 61 Bit Score: 58.69 E-value: 9.14e-11
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116325993 914 SLQYLDLSCNKIQSIERHTFEPLPFLKFINLSCNVITELSFGTFqawHGMQFLHKLILNHNPL 976
Cdd:pfam13855 2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAF---SGLPSLRYLDLSGNRL 61
|
|
| LRRC37 |
pfam15779 |
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ... |
508-565 |
9.93e-11 |
|
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.
Pssm-ID: 434930 [Multi-domain] Cd Length: 73 Bit Score: 59.30 E-value: 9.93e-11
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116325993 508 EGESSLTHQEAPAQTPEFPNVVVAQPP---------------EHSHLTQATVQPLDLGFTITPESKTEVELSP 565
Cdd:pfam15779 1 EVEPSPTQQETPTQPPESPKEVVAQPPvhhevtvptpgqgqaQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
307-739 |
3.83e-10 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 65.17 E-value: 3.83e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 307 TITSEPTNETES-SQAQQETPIQFPEEVEpsaTQQEAPIEPPVPPMEHELSISEQQQPVQPSESPREVE--SSPTQQETP 383
Cdd:pfam03154 147 SIPSPQDNESDSdSSAQQQILQTQPPVLQ---AQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPatSQPPNQTQS 223
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 384 GQPPeHHEVTVSPPGHHQthHLASPsvsvKPPDVQLTIAAEPSAEVGTSLVHQEATTRLSGSGNDVE--PPAIQHGGPP- 460
Cdd:pfam03154 224 TAAP-HTLIQQTPTLHPQ--RLPSP----HPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQtgPSHMQHPVPPq 296
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 461 ---LLPESSEEAGPLAVQQETSFQSPEPINNENPSPTQQEAAAEHPQTAEEGESSLTHQEAPAQTPeFPNVVVAQPPEH- 536
Cdd:pfam03154 297 pfpLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTP-IPQLPNPQSHKHp 375
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 537 SHLTQATVQPLDLGFTITPESKTEVELSPTMKETPTQPPKKVVPQlrvyqgvtnptpGQDQAQHPVSPSVTVQLLDLGLT 616
Cdd:pfam03154 376 PHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQ------------SQQLPPPPAQPPVLTQSQSLPPP 443
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 617 ITPEPTTEVGHSTPPKRTIvsPKHPEVTLPHPdqvqtqhshltraTVQPLDLGFTITPKSMTEVEP--STALMTTAPPPG 694
Cdd:pfam03154 444 AASHPPTSGLHQVPSQSPF--PQHPFVPGGPP-------------PITPPSGPPTSTSSAMPGIQPpsSASVSSSGPVPA 508
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 116325993 695 HPEVTLPPsdkgqaqhshlTQATVQPLDLELTITTKPTTEVKPSP 739
Cdd:pfam03154 509 AVSCPLPP-----------VQIKEEALDEAEEPESPPPPPRSPSP 542
|
|
| LRR |
COG4886 |
Leucine-rich repeat (LRR) protein [Transcription]; |
900-1020 |
1.36e-08 |
|
Leucine-rich repeat (LRR) protein [Transcription];
Pssm-ID: 443914 [Multi-domain] Cd Length: 414 Bit Score: 59.18 E-value: 1.36e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 900 NLTEL---HKDSFEGLLSLQYLDLSCNKIQSIERHtFEPLPFLKFINLSCNVITEL--SFGTFQAwhgmqfLHKLILNHN 974
Cdd:COG4886 97 NLTELdlsGNEELSNLTNLESLDLSGNQLTDLPEE-LANLTNLKELDLSNNQLTDLpePLGNLTN------LKSLDLSNN 169
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 116325993 975 PLTTVEDPyLFKLPALKYLDMGTTlvPLTTLKNILMMTVELEKLIL 1020
Cdd:COG4886 170 QLTDLPEE-LGNLTNLKELDLSNN--QITDLPEPLGNLTNLEELDL 212
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
223-593 |
1.52e-07 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 56.59 E-value: 1.52e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 223 ETQNPETLEDIQSSSLQQEAPAQLPQ-LLEEEPSSMQQEAPALPPEssmesltlpnHEVSVQPPGEDQAYYHLPNITVKP 301
Cdd:PRK10811 655 ESQQAEVTEKARTQDEQQQAPRRERQrRRNDEKRQAQQEAKALNVE----------EQSVQETEQEERVQQVQPRRKQRQ 724
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 302 ADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPV---PPMEHELSISEQQQ-----PVQPSESPREV 373
Cdd:PRK10811 725 LNQKVRIEQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVKVPLPVvaqTAPEQDEENNAENRdnngmPRRSRRSPRHL 804
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 374 ------------ESSPTQQETP----GQPPE--------HHEVTvsPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEV 429
Cdd:PRK10811 805 rvsgqrrrryrdERYPTQSPMPltvaCASPEmasgkvwiRYPVV--RPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPV 882
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 430 GTSLVHQEATTrlsgsgnDVEPPAIQHGGPPLLPESSEEAGPLAVQqetsfqspEPInNENPSPTQQEAAAEHPQTAEEG 509
Cdd:PRK10811 883 VSAPVVEAVAE-------VVEEPVVVAEPQPEEVVVVETTHPEVIA--------APV-TEQPQVITESDVAVAQEVAEHA 946
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 510 ESSLTHQEAPAQTPEFPNVVVAQPPEHSHLTQATVQPldlgfTITPESKTEVELSPTMKETPTQPPKKVVPQLRVYQGVT 589
Cdd:PRK10811 947 EPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPV-----VAEVAAEVETVTAVEPEVAPAQVPEATVEHNHATAPMT 1021
|
....*
gi 116325993 590 N-PTP 593
Cdd:PRK10811 1022 RaPAP 1026
|
|
| LRR |
COG4886 |
Leucine-rich repeat (LRR) protein [Transcription]; |
823-1020 |
2.70e-07 |
|
Leucine-rich repeat (LRR) protein [Transcription];
Pssm-ID: 443914 [Multi-domain] Cd Length: 414 Bit Score: 54.94 E-value: 2.70e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 823 KASTSTNICELCTCGDEMLSCIDLNPEQRLRQVPVPEPNTHNGTFTILNFQGNYISYIDGNVWKAYSWTEKLILRENNLT 902
Cdd:COG4886 8 LTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLLSLLLL 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 903 ELhkDSFEGLLSLQYLDLSCNKiqsierhTFEPLPFLKFINLSCNVITELSFGTFQawhgMQFLHKLILNHNPLTTVEDP 982
Cdd:COG4886 88 GL--TDLGDLTNLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDLPEELAN----LTNLKELDLSNNQLTDLPEP 154
|
170 180 190
....*....|....*....|....*....|....*...
gi 116325993 983 yLFKLPALKYLDMGTTlvPLTTLKNILMMTVELEKLIL 1020
Cdd:COG4886 155 -LGNLTNLKSLDLSNN--QLTDLPEELGNLTNLKELDL 189
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
315-777 |
7.81e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 7.81e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 315 ETESSQAQQETPIqFPEEVEPSATQQEAPIEPPVPpmehelsiseqqQPVQPSESPRE----VESSPTQQETPGQPPEhh 390
Cdd:PHA03247 2542 ELASDDAGDPPPP-LPPAAPPAAPDRSVPPPRPAP------------RPSEPAVTSRArrpdAPPQSARPRAPVDDRG-- 2606
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 391 evtvSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEVGTSLVHQEATTRLSGSGNDVEPP--------AIQHGGPPLL 462
Cdd:PHA03247 2607 ----DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrarrlgrAAQASSPPQR 2682
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 463 PEssEEAGPLAVQQETSFQSPEPINNE-NPSPTQQEAAAEHPQTAeegesslthQEAPAQTPEFPNVVVAQPPEHSHLTQ 541
Cdd:PHA03247 2683 PR--RRAARPTVGSLTSLADPPPPPPTpEPAPHALVSATPLPPGP---------AAARQASPALPAAPAPPAVPAGPATP 2751
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 542 ATVQPldlgftitpesktevelsPTMKETPTQPPKKVVPQLRvyqgvtnPTPGQDQAQHPVSPSVTVQLLDLGLTITPEP 621
Cdd:PHA03247 2752 GGPAR------------------PARPPTTAGPPAPAPPAAP-------AAGPPRRLTRPAVASLSESRESLPSPWDPAD 2806
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 622 TTEVghSTPPKRTIVSPKHPEVTLPHPDQVQTQHSHLTRATVQP-LDLGFTITPKS-MTEVEPSTALMTTAPPPGHPevt 699
Cdd:PHA03247 2807 PPAA--VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPsLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARP--- 2881
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 116325993 700 lPPSDKGQAQHSHLTQATVQPLDlelTITTKPTTEVKPSPTTEETSTQPPDLGlaiiPEPTTETRHSTALEKTTAPRP 777
Cdd:PHA03247 2882 -PVRRLARPAVSRSTESFALPPD---QPERPPQPQAPPPPQPQPQPPPPPQPQ----PPPPPPPRPQPPLAPTTDPAG 2951
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
68-535 |
1.89e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 1.89e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 68 PRESPHAPTLPADPwdfDHLGPSASSEMPAPPQESTEnlvPFLDTWDSAGEQPLEPEQFLASqqdlkdklSPQERLPVSP 147
Cdd:PHA03247 2676 ASSPPQRPRRRAAR---PTVGSLTSLADPPPPPPTPE---PAPHALVSATPLPPGPAAARQA--------SPALPAAPAP 2741
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 148 KKLKKDPAqrwslaeIIGITRQLSTPQSqkqtlqneyssTDTPyPGSLPPELRVKSDEPPGPSEQVGPSQFHLE--PETQ 225
Cdd:PHA03247 2742 PAVPAGPA-------TPGGPARPARPPT-----------TAGP-PAPAPPAAPAAGPPRRLTRPAVASLSESREslPSPW 2802
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 226 NPETLEDIQSSSLQQEAPAQLPQLLEEEPSSMQQEAPALPPESSMESLTLpnhEVSVQPPGedqayyhlPNITVKPADVE 305
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL---GGSVAPGG--------DVRRRPPSRSP 2871
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 306 VTITSEPTNETESSQAQqetpiqfpeevePSATQQEAPIepPVPPmehelsiSEQQQPVQPSESPREVESSPTQQETPGQ 385
Cdd:PHA03247 2872 AAKPAAPARPPVRRLAR------------PAVSRSTESF--ALPP-------DQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 386 PPEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEVGTSLVHQEATTRLSGSgndvEPPAIQHGGPplLPES 465
Cdd:PHA03247 2931 PPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA----SSTPPLTGHS--LSRV 3004
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116325993 466 SEEAGPLAVQQETsfqSPEPINNEN---PSPTQQEAAAEHPQTAEEGESSLthqEAPAQTPEFPNVVVAQPPE 535
Cdd:PHA03247 3005 SSWASSLALHEET---DPPPVSLKQtlwPPDDTEDSDADSLFDSDSERSDL---EALDPLPPEPHDPFAHEPD 3071
|
|
| LRR |
COG4886 |
Leucine-rich repeat (LRR) protein [Transcription]; |
868-1010 |
3.57e-06 |
|
Leucine-rich repeat (LRR) protein [Transcription];
Pssm-ID: 443914 [Multi-domain] Cd Length: 414 Bit Score: 51.47 E-value: 3.57e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 868 TILNFQGNYISYIDGNVwKAYSWTEKLILRENNLTELhkDSFEGLLSLQYLDLSCNKIQSIErhTFEPLPFLKFINLSCN 947
Cdd:COG4886 208 EELDLSGNQLTDLPEPL-ANLTNLETLDLSNNQLTDL--PELGNLTNLEELDLSNNQLTDLP--PLANLTNLKTLDLSNN 282
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116325993 948 VITELSFGTFQAWHGMQFLHKLILNHNPLTTVEDPYLFKLPALKYLDMGTTLVPLTTLKNILM 1010
Cdd:COG4886 283 QLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLS 345
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
49-383 |
3.95e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 52.08 E-value: 3.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 49 LTSNPLGPPDSWSSHSSHfPRESPHAPtLPADPWDFdHLGPSaSSEMPAPPQestenlvPFLDTWDSAGEQ-PLEPEQFL 127
Cdd:pfam03154 249 LQPMTQPPPPSQVSPQPL-PQPSLHGQ-MPPMPHSL-QTGPS-HMQHPVPPQ-------PFPLTPQSSQSQvPPGPSPAA 317
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 128 ASQQDLKDKLSPQERLPVSPKKLKKDPAQRWSLAeIIGITRQLSTPQSQKQTLQneysSTDTPYPGSLPPELRVKSDEPP 207
Cdd:pfam03154 318 PGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLS-MPHIKPPPTTPIPQLPNPQ----SHKHPPHLSGPSPFQMNSNLPP 392
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 208 GPSEQVGPSQFHLEPETQNPETLEDIQSSSLQQEAPAQLPQLleeepssmqQEAPALPPESSMESLTLPNHEVSVQPPGE 287
Cdd:pfam03154 393 PPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVL---------TQSQSLPPPAASHPPTSGLHQVPSQSPFP 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 288 DQAYYHLPNITVKPAdvevtitSEPTNETESSQaqqeTPIQFPEEVEPSATQQ-EAPIEPPVPPME-HELSISEQQQPVQ 365
Cdd:pfam03154 464 QHPFVPGGPPPITPP-------SGPPTSTSSAM----PGIQPPSSASVSSSGPvPAAVSCPLPPVQiKEEALDEAEEPES 532
|
330
....*....|....*...
gi 116325993 366 PSESPREVESSPTQQETP 383
Cdd:pfam03154 533 PPPPPRSPSPEPTVVNTP 550
|
|
| PPP1R42 |
cd21340 |
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ... |
874-978 |
1.29e-05 |
|
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins, which promotes centrosome separation.
Pssm-ID: 411060 [Multi-domain] Cd Length: 220 Bit Score: 48.24 E-value: 1.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 874 GNYISYIDGnvwkayswTEKLilreNNLTELH---------------KDSFEGLL-SLQYLDLSCNKIQSIErhTFEPLP 937
Cdd:cd21340 77 GNRISVVEG--------LENL----TNLEELHienqrlppgekltfdPRSLAALSnSLRVLNISGNNIDSLE--PLAPLR 142
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 116325993 938 FLKFINLSCNVITELS--FGTFQAWHgmqFLHKLILNHNPLTT 978
Cdd:cd21340 143 NLEQLDASNNQISDLEelLDLLSSWP---SLRELDLTGNPVCK 182
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
299-538 |
1.31e-05 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 50.04 E-value: 1.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 299 VKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHElsiseqqqPVQPSESPREvesspt 378
Cdd:PRK10811 848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAE--------PQPEEVVVVE------ 913
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 379 qqetpgqppEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEVGTSLVHQEATTrlsgsgndveppaiqhgg 458
Cdd:PRK10811 914 ---------TTHPEVIAAPVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETA------------------ 966
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 459 PPLLPESSEEAGPLAVQQETsfqSPEPINNENPSPTQQEAAAEHPQTAEEGESSLTHQEAPAqtPEFpnvvVAQPPEHSH 538
Cdd:PRK10811 967 EVVVAEPEVVAQPAAPVVAE---VAAEVETVTAVEPEVAPAQVPEATVEHNHATAPMTRAPA--PEY----VPEAPRHSD 1037
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
215-418 |
4.35e-05 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 48.50 E-value: 4.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 215 PSQFHLEPETQNPETLEDIQSSSLQQEAPAQlpqllEEEPSSMQQEAPALPPEssmesltlPNHEVSVQPPGEDQAYYHL 294
Cdd:PRK10811 850 PQDVQVEEQREAEEVQVQPVVAEVPVAAAVE-----PVVSAPVVEAVAEVVEE--------PVVVAEPQPEEVVVVETTH 916
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 295 PNITVKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPpmehelsisEQQQPVQPSESPREVE 374
Cdd:PRK10811 917 PEVIAAPVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVA---------EPEVVAQPAAPVVAEV 987
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 116325993 375 SSPTQQETPGQPPEHHEVTVSPPGHHqtHHLASPSVSVKPPDVQ 418
Cdd:PRK10811 988 AAEVETVTAVEPEVAPAQVPEATVEH--NHATAPMTRAPAPEYV 1029
|
|
| PRK14949 |
PRK14949 |
DNA polymerase III subunits gamma and tau; Provisional |
154-602 |
1.83e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 46.26 E-value: 1.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 154 PAQRWSLAEIIGITRQLSTPQSQKQTLQNEYSSTDTPYPGSLPPElrvksdEPPGPSEQVGPSQFHLEPETQNP-ETLED 232
Cdd:PRK14949 362 PVKRWQVDDPAEISLPEGQTPSALAAAVQAPHANEPQFVNAAPAE------KKTALTEQTTAQQQVQAANAEAVaEADAS 435
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 233 IQSSSLQQEAPAQLPQLLEEEPSSMQ---QEAPALPPESSMESLTLPNHEVSVQPPGEDQAYYHlPNITVKPADVEVTIT 309
Cdd:PRK14949 436 AEPADTVEQALDDESELLAALNAEQAvilSQAQSQGFEASSSLDADNSAVPEQIDSTAEQSVVN-PSVTDTQVDDTSASN 514
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 310 SEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHElsISEQQQPVQPSESPREVESSPTQQETPGQPPEH 389
Cdd:PRK14949 515 NSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAF--SSESYNALSDDEQHSANVQSAQSAAEAQPSSQS 592
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 390 HEVTVSPPghhqthhlaSPSVSVKPPDV-QLTIAAEPS--AEVGTSLVHQEATTRLSGSGNDVEPPAIQHGGPPLLPESS 466
Cdd:PRK14949 593 LSPISAVT---------TAAASLADDDIlDAVLAARDSllSDLDALSPKEGDGKKSSADRKPKTPPSRAPPASLSKPASS 663
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 467 EEAGPLAVQQETSFQS-PEPINNENPSPTQ-QEAAAEHPQT----------AEEGESSLTHQEAPAQTPEFPNVV----- 529
Cdd:PRK14949 664 PDASQTSASFDLDPDFeLATHQSVPEAALAsGSAPAPPPVPdpydrppweeAPEVASANDGPNNAAEGNLSESVEdasns 743
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 116325993 530 ----VAQPPEHSHLTQATVQPldlgftitpesktevelsPTMKETPTQPPKKVVPQlrvyQGVTNPTPGQDQAQHPV 602
Cdd:PRK14949 744 elqaVEQQATHQPQVQAEAQS------------------PASTTALTQTSSEVQDT----ELNLVLLSSGSITGHPL 798
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
190-468 |
3.39e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.41 E-value: 3.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 190 PYPGSLPPELRVKSDEPPGPSEQVGPSQFHLEP-----ET-QNPETLEDIQ-SSSLQQEAPAQLPQLLEEEPSSMQQEAP 262
Cdd:pfam09770 107 PAARAAQSSAQPPASSLPQYQYASQQSQQPSKPvrtgyEKyKEPEPIPDLQvDASLWGVAPKKAAAPAPAPQPAAQPASL 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 263 ALPPESSMeSLtlpnHEVsvqppgEDQAYYHLPNITVKPADVEVTITSEPTnetessQAQQETPIQFPEEVEPSATQQEA 342
Cdd:pfam09770 187 PAPSRKMM-SL----EEV------EAAMRAQAKKPAQQPAPAPAQPPAAPP------AQQAQQQQQFPPQIQQQQQPQQQ 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 343 PIEPPvPPMEHELSISEQQQPVQPSESPREVESSPTQQETPGQPPehhevtvsPPGHHQTHHLASPSVsvkPPDVQLTIA 422
Cdd:pfam09770 250 PQQPQ-QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPP--------PVPVQPTQILQNPNR---LSAARVGYP 317
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 116325993 423 AEPSAEVGTSLVHQEATTRLSGSGNdvePPAIQHggPPLLPESSEE 468
Cdd:pfam09770 318 QNPQPGVQPAPAHQAHRQQGSFGRQ---APIITH--PQQLAQLSEE 358
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
295-748 |
3.47e-04 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 45.43 E-value: 3.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 295 PNITVKPADVEVTITSEPTNETESSQAqqetpiqFPEEVEPSaTQQEAPIEPPVP-PMEHELSISEQQQPV--------- 364
Cdd:PHA03377 422 PTPKTHPVKRTLVKTSGRSDEAEQAQS-------TPERPGPS-DQPSVPVEPAHLtPVEHTTVILHQPPQSpptvaikpa 493
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 365 -QPSESPR--------------EVESS--PTQQETPGQPPEHHEVTVSPPGHHQTHHL---ASPSVSVKP--------PD 416
Cdd:PHA03377 494 pPPSRRRRgacvvydddiieviDVETTeeEESVTQPAKPHRKVQDGFQRSGRRQKRATppkVSPSDRGPPkasppvmaPP 573
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 417 VQLTIAAEPSAEVGTSLVHQEATTRLSGSGNDVEPPAIQHGGPPLLPESSEEAGPLAVQQETSFQSPEPINNENPSPTQQ 496
Cdd:PHA03377 574 STGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPSVVRMFLRERLLEQSTGPKPKSFWEM 653
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 497 EAAAEHPQTAEEGESSL--THQEAPAQTPEFPNVVV--------AQPPEHSHL-----TQATVQPLDLGFTiTPESKTEV 561
Cdd:PHA03377 654 RAGRDGSGIQQEPSSRRqpATQSTPPRPSWLPSVFVlpsvdagrAQPSEESHLssmspTQPISHEEQPRYE-DPDDPLDL 732
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 562 ELSPtmkETPTQPPKKvvpqlrvyqgvtNPTPGQDQAQHPVSPSVTVQlldlglTITPEPTTEVGHSTPPKRTIVSPKHP 641
Cdd:PHA03377 733 SLHP---DQAPPPSHQ------------APYSGHEEPQAQQAPYPGYW------EPRPPQAPYLGYQEPQAQGVQVSSYP 791
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 642 EVTLPHPDQVQTQ-HSHLTRATVQPLDLGFTITPKSMTEVEPSTALMTTA---------PPPGHPEVTLPPSDKGQAQHS 711
Cdd:PHA03377 792 GYAGPWGLRAQHPrYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAghgqdqvsqFPHLQSETGPPRLQLSQVPQL 871
|
490 500 510
....*....|....*....|....*....|....*..
gi 116325993 712 HLTQATVQPLDLELTiTTKPTTEVKPSPTTEETSTQP 748
Cdd:PHA03377 872 PYSQTLVSSSAPSWS-SPQPRAPIRPIPTRFPPPPMP 907
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
171-646 |
4.06e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.44 E-value: 4.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 171 STPQSQKQT--LQNEYSSTDTPYPGSLPPELRVKSdEPPGPSEQVGPSQFHLEP--ETQNPETLEDIQSSSLQQEAPAQL 246
Cdd:PHA03378 443 ATPHSQAPTvvLHRPPTQPLEGPTGPLSVQAPLEP-WQPLPHPQVTPVILHQPPaqGVQAHGSMLDLLEKDDEDMEQRVM 521
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 247 PQLLEEEPSsmQQEAPALPPESSMESLTLPNHEVSVQPPGEDQAyyhLPNITVKPADVEvTITSEPTNETES---SQAQQ 323
Cdd:PHA03378 522 ATLLPPSPP--QPRAGRRAPCVYTEDLDIESDEPASTEPVHDQL---LPAPGLGPLQIQ-PLTSPTTSQLASsapSYAQT 595
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 324 ETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQQQPVQPSESPREVESSPTQQETPGQPPEHHEVTVSPPGH---- 399
Cdd:PHA03378 596 PWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHipyq 675
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 400 -HQTHHLASPSVS-----VKPPDVQLTIAAEPSAEVGTSLVHQEATTRL---SGSGNDVEPPAiqhGGPPLLPESSEEAG 470
Cdd:PHA03378 676 pSPTGANTMLPIQwapgtMQPPPRAPTPMRPPAAPPGRAQRPAAATGRArppAAAPGRARPPA---AAPGRARPPAAAPG 752
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 471 PLAVQQETSFQSPEPINNEN-PSPTQQEAAAEHPQTAEEGESslTHQEAPAQTPEFPNVVVAQPPEHSHLTQATVQPLDL 549
Cdd:PHA03378 753 RARPPAAAPGRARPPAAAPGaPTPQPPPQAPPAPQQRPRGAP--TPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLT 830
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 550 GFTIT--PESKTEVELSPTMKETPTQPPK-----KVVPQLRVYQGVTNPT--PGQDQAQHPVSPSvtvqlldlglTITPE 620
Cdd:PHA03378 831 GGVKRgrPSLKKPAALERQAAAGPTPSPGsgtsdKIVQAPVFYPPVLQPIqvMRQLGSVRAAAAS----------TVTQA 900
|
490 500
....*....|....*....|....*.
gi 116325993 621 PTTEVGhstppKRTIVSPKHPEVTLP 646
Cdd:PHA03378 901 PTEYTG-----ERRGVGPMHPTDIPP 921
|
|
| Not5 |
COG5665 |
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription]; |
302-604 |
9.25e-04 |
|
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
Pssm-ID: 444384 [Multi-domain] Cd Length: 874 Bit Score: 44.27 E-value: 9.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 302 ADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQQQPVQ--PSESPREVESSPTQ 379
Cdd:COG5665 240 PSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTSNTPTSTAKAQPQPPTKKQPAKepPSDTASGNPSAPSV 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 380 QETPGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPdvqltiaAEPSAEVgTSLVHQEATTRLSGSgndVEPPAIQHGGP 459
Cdd:COG5665 320 LINSDSPTSEDPATASVPTTEETTAFTTPSSVPSTP-------AEKDTPA-TDLATPVSPTPPETS---VDKKVSPDSAT 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 460 PLLPESSEEAGPLAV-QQETSFQSPEPINNENPSPTQQEAAAEHPQTAEegesSLTHQEAPAQTPEFPNVVVAQPPEHSH 538
Cdd:COG5665 389 SSTKSEKEGGTASSPmPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPE----ADSAPESSVRTEASPSAGSDLEPENTT 464
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 116325993 539 LTQAtvqpldlgftiTPESKTEVELSPTMKETPTQPPKKVVPQLRVYQGVTNPTPGQDQAQHPVSP 604
Cdd:COG5665 465 LRDP-----------APNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDSTPSSTADQSIVGVLA 519
|
|
| PRK14960 |
PRK14960 |
DNA polymerase III subunit gamma/tau; |
301-510 |
1.01e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237868 [Multi-domain] Cd Length: 702 Bit Score: 43.88 E-value: 1.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 301 PADVEVTITSEPTNETE---SSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQQQPvQPSESPrEVESSP 377
Cdd:PRK14960 363 PNEILVSEPVQQNGQAEvglNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEPEP-EPEPEP-EPEPEP 440
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 378 tqqetpgQPPEHHEVTVSPPGHHQTHHLASPSVSvkppdVQLTIAAEPSAEVGTS-LVHQEATTRLsgsgNDVEPPAIQH 456
Cdd:PRK14960 441 -------EPQPNQDLMVFDPNHHELIGLESAVVQ-----ETVSVLEEDFIPVPEQkLVQVQAETQV----KQIEPEPAST 504
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 116325993 457 GGPPLLPESSEEAGPLAVQQETSFQSPEPINNENP----SPTQQEAAAEHPQTAEEGE 510
Cdd:PRK14960 505 AEPIGLFEASSAEFSLAQDTSAYDLVSEPVIEQQSlvqaEIVETVAVVKEPNATDNSQ 562
|
|
| LRR_4 |
pfam12799 |
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ... |
914-954 |
1.10e-03 |
|
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.
Pssm-ID: 463713 [Multi-domain] Cd Length: 44 Bit Score: 38.38 E-value: 1.10e-03
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 116325993 914 SLQYLDLSCNKIQSIErhTFEPLPFLKFINLS-CNVITELSF 954
Cdd:pfam12799 2 NLEVLDLSNNQITDIP--PLAKLPNLETLDLSgNNKITDLSD 41
|
|
| ftsN |
TIGR02223 |
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ... |
181-414 |
1.41e-03 |
|
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30-residue region tends to be Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]
Pssm-ID: 274041 [Multi-domain] Cd Length: 298 Bit Score: 42.76 E-value: 1.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 181 QNEYSSTDTPYPgslppELRVKSDEPPGPSEQVGPSQFHLEPETQNPETLedIQSSSLQQEAPAQLPQLLEEEPSSMQQE 260
Cdd:TIGR02223 3 QRDYVRRGRGAP-----QKKKKNRRLVRATVLIAAILILLFIGGSSGLYL--LTESKQANEPETLQPKNQTENGETAADL 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 261 APAlPPESSMESLTLPNHEVSVQPPGEDQAyyhlpnitVKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQ 340
Cdd:TIGR02223 76 PPK-PEERWSYIEELEAREVLINDPEEPSN--------GGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVA 146
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 116325993 341 EAPIEPPVPPMEHELSISEQQQ-PVQPSESPREVESSPTQQETPGQPPEHHEVTVSPpghHQTHHLASPSVSVKP 414
Cdd:TIGR02223 147 VEARKQTAEKKPQKARTAEAQKtPVETEKIASKVKEAKQKQKALPKQTAETQSNSKP---IETAPKADKADKTKP 218
|
|
| LRR_4 |
pfam12799 |
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ... |
892-929 |
2.27e-03 |
|
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.
Pssm-ID: 463713 [Multi-domain] Cd Length: 44 Bit Score: 37.22 E-value: 2.27e-03
10 20 30
....*....|....*....|....*....|....*....
gi 116325993 892 EKLILRENNLTELhkDSFEGLLSLQYLDLS-CNKIQSIE 929
Cdd:pfam12799 4 EVLDLSNNQITDI--PPLAKLPNLETLDLSgNNKITDLS 40
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
260-799 |
3.86e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.99 E-value: 3.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 260 EAPALPPESSMESLTLPNHEVSVQPPGEDqaYYHLPNitvkPADVEVTITSEPTNETESSQAQQetpiqfPEEVEPSATQ 339
Cdd:PRK10263 331 QSWAAPVEPVTQTPPVASVDVPPAQPTVA--WQPVPG----PQTGEPVIAPAPEGYPQQSQYAQ------PAVQYNEPLQ 398
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 340 QEAPIEPPVPPMEHELSISEQQQPVQPSESPREVESSPTQQETPGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQl 419
Cdd:PRK10263 399 QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPL- 477
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 420 tiaaepsaEVGTSLVHQEATTRLSGSGNDVEPPAiqhggPPLLP-ESSEEAGPLAVQQETSFQSPEPINNENPSPTQQEA 498
Cdd:PRK10263 478 --------YQQPQPVEQQPVVEPEPVVEETKPAR-----PPLYYfEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSL 544
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 499 AAEHPQTAEEGESslthqeAPAQTPEFPNvvVAQPPEHSHLTQATVQP-LDLGFTITPESKTEVELSPTMKetptQPPKK 577
Cdd:PRK10263 545 KAPSVAAVPPVEA------AAAVSPLASG--VKKATLATGAAATVAAPvFSLANSGGPRPQVKEGIGPQLP----RPKRI 612
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 578 VVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQlLDLGLTITPEPTTEVGHSTPPKRTIVSPKH---PEVTLPHPDQV--- 651
Cdd:PRK10263 613 RVPTRRELASYGIKLPSQRAAEEKAREAQRNQ-YDSGDQYNDDEIDAMQQDELARQFAQTQQQrygEQYQHDVPVNAeda 691
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 652 ----------------QTQHSHLTRATVQPL---DLGFT--------------ITPKSMTEVEPSTALMTTAPPPGHPEV 698
Cdd:PRK10263 692 daaaeaelarqfaqtqQQRYSGEQPAGANPFsldDFEFSpmkallddgpheplFTPIVEPVQQPQQPVAPQQQYQQPQQP 771
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 699 TLPPSDKGQAQHSHLTQAtvQPLDLELTITTKPTTEVKPSPTTEETSTQPPDLGLA-----------IIPEPTTETRH-- 765
Cdd:PRK10263 772 VAPQPQYQQPQQPVAPQP--QYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVApqpqyqqpqqpVAPQPQDTLLHpl 849
|
570 580 590 600
....*....|....*....|....*....|....*....|
gi 116325993 766 ------STALEKTTAPRPdrvqtlhrSLTEVTGPPTELEP 799
Cdd:PRK10263 850 lmrngdSRPLHKPTTPLP--------SLDLLTPPPSEVEP 881
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
471-777 |
4.09e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 4.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 471 PLAVQQETSFQSPEpinNENPSPTQQEAAAEHPQTAEEGESSLThqeaPAQTPEfPNVVVAQPPEHSHLTQATVQPLDLG 550
Cdd:pfam05109 449 PSSTHVPTNLTAPA---STGPTVSTADVTSPTPAGTTSGASPVT----PSPSPR-DNGTESKAPDMTSPTSAVTTPTPNA 520
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 551 FTITPESKTEV--ELSPTM-KETPTQppkkvvpqlrvyqGVTNPTPGQDQAQHPVS-PSVTVQLLDLGLT------ITPE 620
Cdd:pfam05109 521 TSPTPAVTTPTpnATSPTLgKTSPTS-------------AVTTPTPNATSPTPAVTtPTPNATIPTLGKTsptsavTTPT 587
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 621 P---TTEVGHSTPPKRT-------------IVSPKHPEVTLPHPDQVQTQHSHLTRATVQPLDLGFTITPKSMTEVEPST 684
Cdd:pfam05109 588 PnatSPTVGETSPQANTtnhtlggtsstpvVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHM 667
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 685 ALMTTAPPPG---------------HPEVTLP---PSDKGQAQHSHLTQATVQPLDLELTITTKPTTEVKPSPTTEETST 746
Cdd:pfam05109 668 PLLTSAHPTGgenitqvtpaststhHVSTSSPaprPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTA 747
|
330 340 350
....*....|....*....|....*....|.
gi 116325993 747 QPPDLGLAIIPEPTTETRHSTALEKTTAPRP 777
Cdd:pfam05109 748 VPTVTSTGGKANSTTGGKHTTGHGARTSTEP 778
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
174-458 |
4.19e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 4.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 174 QSQKQTLQNEYSSTDTPYPGSLPPELRVKSDEPPGPSEQVGPSQFHLEPETQNPEtlediqssslqqeAPAQLPQLLEEE 253
Cdd:pfam05109 500 ESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPN-------------ATSPTPAVTTPT 566
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 254 PSSMqqeAPALPPESSMESLTLPNHEVSVQPPGEDQAYYHLPNITVKPADVEVTITSEPTNETESSQAQQEtpiQFPEEV 333
Cdd:pfam05109 567 PNAT---IPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQH---NITSSS 640
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 334 EPSATQQEAPIEPPVPPMEHELSISeqQQPVQPSESPREVESspTQQETPGQPPEHHEVTVSP-PGHHQTHHLASP---S 409
Cdd:pfam05109 641 TSSMSLRPSSISETLSPSTSDNSTS--HMPLLTSAHPTGGEN--ITQVTPASTSTHHVSTSSPaPRPGTTSQASGPgnsS 716
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 116325993 410 VSVKPPDVQLTIAAEPsaevgtslvhQEATTRLSGSGNDVEPPAIQHGG 458
Cdd:pfam05109 717 TSTKPGEVNVTKGTPP----------KNATSPQAPSGQKTAVPTVTSTG 755
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
221-392 |
5.27e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 41.52 E-value: 5.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 221 EPETQNPETLEDIQSSSLQQEApaqlpqlLEEEPSSMQQEAPALPPESSMESLTLPNHEVSVQPPGED-----QAYYHLP 295
Cdd:PHA03369 491 EQESLAKELEATAHKSEIKKIA-------ESEFKNAGAKTAAANIEPNCSADAAAPATKRARPETKTEleavvRFPYQIR 563
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 296 NITVKPADVEVTITSEPTNETESSQAQqETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQ-QQPVQPSESPREVE 374
Cdd:PHA03369 564 NMESPAFVHSFTSTTLAAAAGQGSDTA-EALAGAIETLLTQASAQPAGLSLPAPAVPVNASTPAStPPPLAPQEPPQPGT 642
|
170
....*....|....*...
gi 116325993 375 SSPTqqeTPGQPPEHHEV 392
Cdd:PHA03369 643 SAPS---LETSLPQQKPV 657
|
|
| LRR_RI |
cd00116 |
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ... |
891-976 |
5.89e-03 |
|
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Pssm-ID: 238064 [Multi-domain] Cd Length: 319 Bit Score: 40.80 E-value: 5.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 891 TEKLILRENNLTELH-KDSFEGL---LSLQYLDLSCNKIQSIER------HTFEPLPFLKFINLSCNVITELSFGTFQAW 960
Cdd:cd00116 25 LQVLRLEGNTLGEEAaKALASALrpqPSLKELCLSLNETGRIPRglqsllQGLTKGCGLQELDLSDNALGPDGCGVLESL 104
|
90
....*....|....*.
gi 116325993 961 HGMQFLHKLILNHNPL 976
Cdd:cd00116 105 LRSSSLQELKLNNNGL 120
|
|
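The cd00116 description above gives the beta-strand consensus as LxxLxLxxN/CxL. As a rough illustration only, the sketch below reads that pattern literally (x as any residue, N/C as either Asn or Cys, both of which are assumptions about the notation) and scans the cd00116 consensus residues copied from the alignment above. The names `LRR_PATTERN` and `find_lrr_motifs` are hypothetical, and a literal regular expression is far cruder than the profile-based scoring that produced this hit.

```python
import re

# Literal reading of "LxxLxLxxN/CxL" (assumption: x = any residue,
# N/C = Asn or Cys). The lookahead lets overlapping hits be reported.
LRR_PATTERN = re.compile(r"(?=(L..L.L..[NC].L))")


def find_lrr_motifs(sequence):
    """Return (1-based position, motif) pairs for literal pattern hits.

    Non-letter characters such as alignment gaps are dropped and the
    lowercase residues used in the alignment display are uppercased.
    """
    residues = re.sub(r"[^A-Za-z]", "", sequence).upper()
    return [(m.start() + 1, m.group(1)) for m in LRR_PATTERN.finditer(residues)]


# cd00116 consensus residues as shown in the two alignment blocks above.
cd00116_aligned = (
    "LQVLRLEGNTLGEEAaKALASALrpqPSLKELCLSLNETGRIPRglqsllQGLTKGCGLQELDLSDNALGPDGCGVLESL"
    "LRSSSLQELKLNNNGL"
)

for position, motif in find_lrr_motifs(cd00116_aligned):
    print(position, motif)
```

Positions are counted from the first residue of the pasted segment, not from the start of the full cd00116 profile. Real LRR strands often carry other hydrophobic residues at the L positions, so a strict literal match will miss repeats that a profile search would still score.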
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
618-831 |
6.50e-03 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 40.80 E-value: 6.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 618 TPEPTTEvghSTPPKRTIVSPKhpevTLPHPDQVQTQHSHLTRATVQPLDLGFTITPKS---MTEVEPSTALMTTAPPpg 694
Cdd:pfam05539 178 TSWPTEV---SHPTYPSQVTPQ----SQPATQGHQTATANQRLSSTEPVGTQGTTTSSNpepQTEPPPSQRGPSGSPQ-- 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 695 HPEVTLPPSDKGQAQHSHLTQATVQPLDLELTITTKPTTevKPSPTTEETSTQPPdlglaiIPEPTTETRHSTALEKTTA 774
Cdd:pfam05539 249 HPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTA--TPPPTTKRQETGRP------TPRPTATTQSGSSPPHSSP 320
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 775 PRPDRVQTlhrslTEVTGPPTELEPAQDSLVQSESYTQNKALTAP---EEHKASTSTNIC 831
Cdd:pfam05539 321 PGVQANPT-----TQNLVDCKELDPPKPNSICYGVGIYNEALPRGcdiVVPLCSTYTIMC 375
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
310-523 |
6.56e-03 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 41.21 E-value: 6.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 310 SEPTNEtESSQAQQETPI-QFPEEVEP----------SATQQEAPIE--PPVPPMEHELSISEQQQPVQPSESPREVESS 376
Cdd:pfam03546 24 SESSSE-EESDSEEETPAaKTPLQAKPsgktpqvraaSAPAKESPRKgaPPVPPGKTGPAAAQAQAGKPEEDSESSSEES 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 377 PTQQETP-GQPPEHHEVTVSPPGhhQTHHLASPSVSVKPPDVQLTIAAEPSAEVGTSLVHQ--------EATTRLSGSGN 447
Cdd:pfam03546 103 DSDGETPaAATLTTSPAQVKPLG--KNSQVRPASTVGKGPSGKGANPAPPGKAGSAAPLVQvgkkeedsESSSEESDSEG 180
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 116325993 448 DVEPPAIQHGGPPLLPESSEEAGPlavqqeTSFQSPEPINNENPSPTQ--QEAAAEHPQTAEEgeSSLTHQEAPAQTP 523
Cdd:pfam03546 181 EAPPAATQAKPSGKILQVRPASGP------AKGAAPAPPQKAGPVATQvkAERSKEDSESSEE--SSDSEEEAPAAAT 250
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
438-703 |
8.68e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 8.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 438 ATTRLSGSGNDVEPpaIQHGGPPLLPESSEEAGPLAVQQETSFQSPEPINNENPsptqqEAAAEHPQTAEEGESSLTHQE 517
Cdd:PRK10263 326 ATTATQSWAAPVEP--VTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAP-----EGYPQQSQYAQPAVQYNEPLQ 398
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 518 APAQTPEFPNVVVAQPPEHSHLTQATVQPLDLGFTITPESKTEVELSPTMKEtPTQPPKKVVPQLRVYQGVTNPTPG--Q 595
Cdd:PRK10263 399 QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAE-EQQSTFAPQSTYQTEQTYQQPAAQepL 477
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116325993 596 DQAQHPVSPSVTVQlldlgltitPEPTTEvghSTPPKRTivspkhPEVTLPHPDQVQTQHSHLTRATVQPLdlgftitPK 675
Cdd:PRK10263 478 YQQPQPVEQQPVVE---------PEPVVE---ETKPARP------PLYYFEEVEEKRAREREQLAAWYQPI-------PE 532
|
250 260
....*....|....*....|....*...
gi 116325993 676 SMTEVEPSTALMTTAPPPGHPEVTLPPS 703
Cdd:PRK10263 533 PVKEPEPIKSSLKAPSVAAVPPVEAAAA 560
|
|
|