|
Name |
Accession |
Description |
Interval |
E-value |
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1794-2126 |
7.54e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 316.43 E-value: 7.54e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1794 AMLRGRTVIYTASpASRAQSKGISGPCSAPKKMgtsgttqpETATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPP 1873
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1874 ARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGP------LPGPGGSPVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1945
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRlpgsggRNKLSPLPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1946 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRVASTRSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2020
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2021 SSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASPRQPLAA--QRVPQAKPGLAPRAPRRTSSESPSRLPVRATPG 2098
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 1958788261 2099 RPETVKRYASLPHISVSRRPDSAVSVPT 2126
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
386-459 |
5.78e-36 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 131.52 E-value: 5.78e-36
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788261 386 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 459
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC |
pfam16689 |
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... |
30-81 |
6.75e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
Pssm-ID: 435517 Cd Length: 52 Bit Score: 101.99 E-value: 6.75e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 1958788261 30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
148-235 |
1.39e-17 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 79.22 E-value: 1.39e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 148 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 227
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74
|
....*...
gi 1958788261 228 MEERFGTS 235
Cdd:pfam11414 75 NRCLGGLI 82
|
|
| Arm_APC_u3 super family |
cl25003 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
725-971 |
5.08e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. The actual alignment was detected with superfamily member pfam16629:
Pssm-ID: 435476 Cd Length: 293 Bit Score: 83.87 E-value: 5.08e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 725 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 804
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 805 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 867
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 868 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 943
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 1958788261 944 AAHTSLSNDSLNSGSTSDGYCTREHMTP 971
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
643-682 |
1.42e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.42e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1958788261 643 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1628-1649 |
7.37e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 41.42 E-value: 7.37e-05
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
1534-1959 |
8.45e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 8.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1534 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1610
Cdd:PHA03247 2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1611 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1679
Cdd:PHA03247 2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1680 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1759
Cdd:PHA03247 2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1760 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1839
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1840 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1918
Cdd:PHA03247 2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788261 1919 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1959
Cdd:PHA03247 2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
|
|
| DUF5585 super family |
cl39316 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2071-2310 |
2.86e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. The actual alignment was detected with superfamily member pfam17823:
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 46.11 E-value: 2.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2071 AKPGLAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVS-VPTTQANATRRGSdgearPLPRVAAP 2149
Cdd:pfam17823 106 AADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPrAAIAAASAPHAAS-----PAPRTAAS 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2150 GTTW--RRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPHRKTSDAVV----------------------------- 2198
Cdd:pfam17823 181 STTAasSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVgnsspaagtvtaavgtvtpaalatlaaaa 260
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2199 QTEDVATSKTNS----STSPSLESRDPPQAPISGPVAPLGSDVDGPVLAkppasapFTHEGLSVVTGGFPTSrhgSPSRA 2274
Cdd:pfam17823 261 GTVASAAGTINMgdphARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQ-------VSTDQPVHNTAGEPTP---SPSNT 330
|
250 260 270
....*....|....*....|....*....|....*.
gi 1958788261 2275 ARVPPFNYVPSPMVVATMTSDSAVEKAPVTSPASLL 2310
Cdd:pfam17823 331 TLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1407-1429 |
6.61e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 6.61e-04
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
684-724 |
3.12e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.43 E-value: 3.12e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1958788261 684 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 724
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1283-1304 |
3.61e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.61e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1171-1194 |
5.84e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.84e-03
|
|
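Each alignment block above pairs a query row (`gi ...`) with a domain-model row (`Cdd:...`), and each row carries a start coordinate, the residues, and an end coordinate. The following is a minimal sketch of how such a pair of rows might be read and compared. The names `AlignmentRow`, `parse_alignment_row`, and `count_identities` are hypothetical, and the rule that lowercase letters and `-` mark unaligned positions is an assumption about this display format, not something stated in the listing.

```python
from typing import NamedTuple, Tuple


class AlignmentRow(NamedTuple):
    """One row of an alignment block (hypothetical helper type)."""
    label: str
    start: int
    residues: str
    end: int


def parse_alignment_row(line: str) -> AlignmentRow:
    # The last three whitespace-separated fields are start, residues, end;
    # everything before them is the row label (e.g. "gi 1958788261").
    label, start, residues, end = line.strip().rsplit(maxsplit=3)
    return AlignmentRow(label, int(start), residues, int(end))


def count_identities(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length rows.

    Assumption: a column counts as aligned only when both rows show an
    uppercase residue; lowercase letters and '-' are treated as unaligned.
    """
    if len(query) != len(subject):
        raise ValueError("rows must have the same number of columns")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    # First ten columns of the APC_N_CC block, shortened for illustration.
    q = parse_alignment_row("gi 1958788261 30 ASYEQLVRQV 39")
    s = parse_alignment_row("Cdd:pfam16689 1 ASYDQLLRQV 10")
    print(count_identities(q.residues, s.residues))
```

Splitting from the right keeps the parser indifferent to whether the label is one token (`Cdd:pfam16689`) or two (`gi 1958788261`).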
|
Name |
Accession |
Description |
Interval |
E-value |
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1794-2126 |
7.54e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 316.43 E-value: 7.54e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1794 AMLRGRTVIYTASpASRAQSKGISGPCSAPKKMgtsgttqpETATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPP 1873
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1874 ARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGP------LPGPGGSPVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1945
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRlpgsggRNKLSPLPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1946 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRVASTRSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2020
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2021 SSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASPRQPLAA--QRVPQAKPGLAPRAPRRTSSESPSRLPVRATPG 2098
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 1958788261 2099 RPETVKRYASLPHISVSRRPDSAVSVPT 2126
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
386-459 |
5.78e-36 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 131.52 E-value: 5.78e-36
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788261 386 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 459
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC |
pfam16689 |
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... |
30-81 |
6.75e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
Pssm-ID: 435517 Cd Length: 52 Bit Score: 101.99 E-value: 6.75e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 1958788261 30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
148-235 |
1.39e-17 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 79.22 E-value: 1.39e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 148 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 227
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74
|
....*...
gi 1958788261 228 MEERFGTS 235
Cdd:pfam11414 75 NRCLGGLI 82
|
|
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
725-971 |
5.08e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 83.87 E-value: 5.08e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 725 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 804
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 805 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 867
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 868 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 943
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 1958788261 944 AAHTSLSNDSLNSGSTSDGYCTREHMTP 971
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
643-682 |
1.42e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.42e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1958788261 643 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1805-2311 |
1.53e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1805 ASPASRAQSKGISGPCSAPKKMGTSGTTQpETATKTPsPEQQRSRSlhrPGKISELAALSHPPrSATPPArlTKTPSSSS 1884
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSR-ARRPDAP-PQSARPRA---PVDDRGDPRGPAPP-SPLPPD--THAPDPPP 2628
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1885 SQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPKSPARALLAKQHKTQKSPvripfMQRPARRVPPP-------LARPSP 1957
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-----PQRPRRRAARPtvgsltsLADPPP 2703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1958 EPGSrgragAEGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTVSTSQTASPCR 2037
Cdd:PHA03247 2704 PPPT-----PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2038 GRPALPAVFLCSSrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPETvkryaslphisvsrr 2117
Cdd:PHA03247 2779 PPRRLTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP--------------- 2841
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2118 PDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRrikdedvPHILRSTLPATALPlRGSSPEDSPAGTPHRKtsdav 2197
Cdd:PHA03247 2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-------ARPPVRRLARPAVS-RSTESFALPPDQPERP----- 2908
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2198 vQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPPASAPFTHEGlSVVTGGFPTSRHGSPSRAARV 2277
Cdd:PHA03247 2909 -PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPGRVAVPRFRVPQPAPSR 2986
|
490 500 510 520
....*....|....*....|....*....|....*....|
gi 1958788261 2278 PPFNYVPSPMV------VATMTSDSAVEKAPVTSPASLLE 2311
Cdd:PHA03247 2987 EAPASSTPPLTghslsrVSSWASSLALHEETDPPPVSLKQ 3026
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
642-682 |
2.30e-06 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 46.27 E-value: 2.30e-06
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1958788261 642 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1628-1649 |
7.37e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 41.42 E-value: 7.37e-05
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1534-1959 |
8.45e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 8.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1534 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1610
Cdd:PHA03247 2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1611 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1679
Cdd:PHA03247 2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1680 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1759
Cdd:PHA03247 2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1760 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1839
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1840 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1918
Cdd:PHA03247 2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788261 1919 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1959
Cdd:PHA03247 2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2071-2310 |
2.86e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 46.11 E-value: 2.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2071 AKPGLAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVS-VPTTQANATRRGSdgearPLPRVAAP 2149
Cdd:pfam17823 106 AADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPrAAIAAASAPHAAS-----PAPRTAAS 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2150 GTTW--RRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPHRKTSDAVV----------------------------- 2198
Cdd:pfam17823 181 STTAasSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVgnsspaagtvtaavgtvtpaalatlaaaa 260
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2199 QTEDVATSKTNS----STSPSLESRDPPQAPISGPVAPLGSDVDGPVLAkppasapFTHEGLSVVTGGFPTSrhgSPSRA 2274
Cdd:pfam17823 261 GTVASAAGTINMgdphARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQ-------VSTDQPVHNTAGEPTP---SPSNT 330
|
250 260 270
....*....|....*....|....*....|....*.
gi 1958788261 2275 ARVPPFNYVPSPMVVATMTSDSAVEKAPVTSPASLL 2310
Cdd:pfam17823 331 TLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1407-1429 |
6.61e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 6.61e-04
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
17-110 |
2.38e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.58 E-value: 2.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 17 NQALQELKMASsvASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVLVSSGQTEV 96
Cdd:COG4372 97 AQAQEELESLQ--EEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQ 174
|
90
....*....|....
gi 1958788261 97 LEQLKALQTDISSL 110
Cdd:COG4372 175 ALSEAEAEQALDEL 188
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
684-724 |
3.12e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.43 E-value: 3.12e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1958788261 684 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 724
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1283-1304 |
3.61e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.61e-03
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
30-265 |
4.25e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 42.35 E-value: 4.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQ--EARVLVSSGQTEVLEQLKALQTDI 107
Cdd:TIGR02168 267 EKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEEleAQLEELESKLDELAEELAELEEKL 346
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 108 SSLYNLKFHAPALGPEPAAQTPEGSpvHGPAPSKDSFGELSRATIRLLEELDQERCFLLSEIekeekeklwyySQLQGLS 187
Cdd:TIGR02168 347 EELKEELESLEAELEELEAELEELE--SRLEELEEQLETLRSKVAQLELQIASLNNEIERLE-----------ARLERLE 413
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958788261 188 KRLDELpHVDTPVPRSQQFSMQMDLIRQQLEFEAQHIRSLMEERfgtsDEMVQRAQIRASRLEQIDKELLEAQDRVQQ 265
Cdd:TIGR02168 414 DRRERL-QQEIEELLKKLEEAELKELQAELEELEEELEELQEEL----ERLEEALEELREELEEAEQALDAAERELAQ 486
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1171-1194 |
5.84e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.84e-03
|
|
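Every hit in the tables above carries an interval on the query (for example `1794-2126`) and a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. As a rough sketch of how those two fields could be turned into structured values, assuming the layout seen in this listing holds for every hit, one might write the following. `HitStats`, `parse_stats_line`, and `parse_interval` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple, Tuple


class HitStats(NamedTuple):
    """Fields of one statistics line (hypothetical helper type)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern derived from the lines in this listing; the "[Multi-domain]"
# flag is optional because some hits omit it.
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_stats_line(line: str) -> HitStats:
    match = _STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return HitStats(
        pssm_id=int(match["pssm"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )


def parse_interval(text: str) -> Tuple[int, int]:
    """Split an interval such as '1794-2126' into (start, end)."""
    start, end = text.strip().split("-")
    return int(start), int(end)


if __name__ == "__main__":
    stats = parse_stats_line(
        "Pssm-ID: 428690 [Multi-domain] Cd Length: 336 "
        "Bit Score: 316.43 E-value: 7.54e-97"
    )
    start, end = parse_interval("1794-2126")
    # Number of query residues covered by the hit (inclusive interval).
    print(stats, end - start + 1)
```

With values in this form, hits can be sorted by E-value or filtered by how much of the domain model (`Cd Length`) the query interval spans.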
|
Name |
Accession |
Description |
Interval |
E-value |
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
1794-2126 |
7.54e-97 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 316.43 E-value: 7.54e-97
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1794 AMLRGRTVIYTASpASRAQSKGISGPCSAPKKMgtsgttqpETATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPP 1873
Cdd:pfam05956 1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1874 ARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGP------LPGPGGSPVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1945
Cdd:pfam05956 72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRlpgsggRNKLSPLPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1946 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRVASTRSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2020
Cdd:pfam05956 152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2021 SSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASPRQPLAA--QRVPQAKPGLAPRAPRRTSSESPSRLPVRATPG 2098
Cdd:pfam05956 229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
|
330 340
....*....|....*....|....*...
gi 1958788261 2099 RPETVKRYASLPHISVSRRPDSAVSVPT 2126
Cdd:pfam05956 309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
386-459 |
5.78e-36 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 131.52 E-value: 5.78e-36
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788261 386 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 459
Cdd:pfam18797 1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_N_CC |
pfam16689 |
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ... |
30-81 |
6.75e-26 |
|
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.
Pssm-ID: 435517 Cd Length: 52 Bit Score: 101.99 E-value: 6.75e-26
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 1958788261 30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689 1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
148-235 |
1.39e-17 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 79.22 E-value: 1.39e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 148 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 227
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74
|
....*...
gi 1958788261 228 MEERFGTS 235
Cdd:pfam11414 75 NRCLGGLI 82
|
|
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
725-971 |
5.08e-17 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region of APC (adenomatous polyposis coli) proteins in higher eukaryotes. It lies immediately downstream of the armadillo fold and before the beta-catenin binding motifs (APC_crr, pfam05923). Its function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 83.87 E-value: 5.08e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 725 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 804
Cdd:pfam16629 1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 805 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 867
Cdd:pfam16629 74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 868 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 943
Cdd:pfam16629 148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
|
250 260
....*....|....*....|....*...
gi 1958788261 944 AAHTSLSNDSLNSGSTSDGYCTREHMTP 971
Cdd:pfam16629 228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
643-682 |
1.42e-06 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 46.68 E-value: 1.42e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1958788261 643 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:pfam00514 2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1805-2311 |
1.53e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 1.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1805 ASPASRAQSKGISGPCSAPKKMGTSGTTQpETATKTPsPEQQRSRSlhrPGKISELAALSHPPrSATPPArlTKTPSSSS 1884
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSR-ARRPDAP-PQSARPRA---PVDDRGDPRGPAPP-SPLPPD--THAPDPPP 2628
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1885 SQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPKSPARALLAKQHKTQKSPvripfMQRPARRVPPP-------LARPSP 1957
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-----PQRPRRRAARPtvgsltsLADPPP 2703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1958 EPGSrgragAEGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTVSTSQTASPCR 2037
Cdd:PHA03247 2704 PPPT-----PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2038 GRPALPAVFLCSSrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPETvkryaslphisvsrr 2117
Cdd:PHA03247 2779 PPRRLTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP--------------- 2841
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2118 PDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRrikdedvPHILRSTLPATALPlRGSSPEDSPAGTPHRKtsdav 2197
Cdd:PHA03247 2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-------ARPPVRRLARPAVS-RSTESFALPPDQPERP----- 2908
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2198 vQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPPASAPFTHEGlSVVTGGFPTSRHGSPSRAARV 2277
Cdd:PHA03247 2909 -PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPGRVAVPRFRVPQPAPSR 2986
|
490 500 510 520
....*....|....*....|....*....|....*....|
gi 1958788261 2278 PPFNYVPSPMV------VATMTSDSAVEKAPVTSPASLLE 2311
Cdd:PHA03247 2987 EAPASSTPPLTghslsrVSSWASSLALHEETDPPPVSLKQ 3026
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
642-682 |
2.30e-06 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form a superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin, arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 46.27 E-value: 2.30e-06
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1958788261 642 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1942-2276 |
6.70e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 51.71 E-value: 6.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1942 QRPARRVPPPLARPSPEPGSRGRAGAeGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTfikESPGLLRRRRSELS 2021
Cdd:PHA03307 80 PANESRSTPTWSLSTLAPASPAREGS-PTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPV---GSPGPPPAASPPAA 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2022 SADSTVSTSQTASPCRGRPALPAVflcssrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPE 2101
Cdd:PHA03307 156 GASPAAVASDAASSRQAALPLSSP-------EETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2102 TVKRYASLPHISVSRRPDSAVSvpttQANATRRGSDGEARpLPRVAAPGTTWrriKDEDVPHILRStlPATALPLRGSSP 2181
Cdd:PHA03307 229 ADDAGASSSDSSSSESSGCGWG----PENECPLPRPAPIT-LPTRIWEASGW---NGPSSRPGPAS--SSSSPRERSPSP 298
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2182 EDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPP----ASAPFTHEGLS 2257
Cdd:PHA03307 299 SPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSsprkRPRPSRAPSSP 378
|
330
....*....|....*....
gi 1958788261 2258 VVTGGFPTSRHGSPSRAAR 2276
Cdd:PHA03307 379 AASAGRPTRRRARAAVAGR 397
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1628-1649 |
7.37e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 41.42 E-value: 7.37e-05
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1534-1959 |
8.45e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 8.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1534 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1610
Cdd:PHA03247 2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1611 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1679
Cdd:PHA03247 2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1680 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1759
Cdd:PHA03247 2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1760 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1839
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1840 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1918
Cdd:PHA03247 2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788261 1919 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1959
Cdd:PHA03247 2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. |
2071-2310 |
2.86e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 46.11 E-value: 2.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2071 AKPGLAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVS-VPTTQANATRRGSdgearPLPRVAAP 2149
Cdd:pfam17823 106 AADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPrAAIAAASAPHAAS-----PAPRTAAS 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2150 GTTW--RRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPHRKTSDAVV----------------------------- 2198
Cdd:pfam17823 181 STTAasSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVgnsspaagtvtaavgtvtpaalatlaaaa 260
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2199 QTEDVATSKTNS----STSPSLESRDPPQAPISGPVAPLGSDVDGPVLAkppasapFTHEGLSVVTGGFPTSrhgSPSRA 2274
Cdd:pfam17823 261 GTVASAAGTINMgdphARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQ-------VSTDQPVHNTAGEPTP---SPSNT 330
|
250 260 270
....*....|....*....|....*....|....*.
gi 1958788261 2275 ARVPPFNYVPSPMVVATMTSDSAVEKAPVTSPASLL 2310
Cdd:pfam17823 331 TLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1758-2232 |
2.95e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 2.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1758 RPEKRGTTSTKGSGSPRFPSGPEKAKGtqKTMAGESAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGT-TQPET 1836
Cdd:PHA03247 2572 RPAPRPSEPAVTSRARRPDAPPQSARP--RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPpTVPPP 2649
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1837 ATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPV 1916
Cdd:PHA03247 2650 ERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR 2729
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1917 PKSPARALLAKQHKTQKSPVrIPFMQRPARRVPPPLARPSPEPGsrgrAGAEGTPGARGSRLGLVRVASTRSSGSESsdr 1996
Cdd:PHA03247 2730 QASPALPAAPAPPAVPAGPA-TPGGPARPARPPTTAGPPAPAPP----AAPAAGPPRRLTRPAVASLSESRESLPSP--- 2801
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1997 sgfrrqltfikespgllrrrrSELSSADSTVSTSQTASPCRGRPALPAVFLCSSrcdeLRASPRQPLAAQRVPQAKPG-L 2075
Cdd:PHA03247 2802 ---------------------WDPADPPAAVLAPAAALPPAASPAGPLPPPTSA----QPTAPPPPPGPPPPSLPLGGsV 2856
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2076 APRAPRRTSSESPSRLPVRATPGRPetvkRYASLPHISVSRRPDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRR 2155
Cdd:PHA03247 2857 APGGDVRRRPPSRSPAAKPAAPARP----PVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788261 2156 IKDEDVPhilrstlPATALPLRGSSPEDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAP 2232
Cdd:PHA03247 2933 PPPPPRP-------QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2009-2241 |
4.20e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.08 E-value: 4.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2009 SPGLLRRRRSELSSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASP-RQPLAAQRVPQAKPGLAPRAPR------ 2081
Cdd:PHA03247 256 APPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPlALPAPPDPPPPAPAGDAEEEDDedgame 335
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2082 ------RTSSESPSRLPVRATPgrpeTVKRYASLPHISVSRRPDSAVSVPTTQANATR-------RGSDGEARPLPRVAA 2148
Cdd:PHA03247 336 vvsplpRPRQHYPLGFPKRRRP----TWTPPSSLEDLSAGRHHPKRASLPTRKRRSARhaatpfaRGPGGDDQTRPAAPV 411
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2149 PGTTwrriKDEDVPHILRSTLPATALPLRGSSP--EDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPI 2226
Cdd:PHA03247 412 PASV----PTPAPTPVPASAPPPPATPLPSAEPgsDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPG 487
|
250
....*....|....*..
gi 1958788261 2227 SGPVAPLGS--DVDGPV 2241
Cdd:PHA03247 488 ADLAELLGRhpDTAGTV 504
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1407-1429 |
6.61e-04 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 38.90 E-value: 6.61e-04
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
17-110 |
2.38e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.58 E-value: 2.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 17 NQALQELKMASsvASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVLVSSGQTEV 96
Cdd:COG4372 97 AQAQEELESLQ--EEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQ 174
|
90
....*....|....
gi 1958788261 97 LEQLKALQTDISSL 110
Cdd:COG4372 175 ALSEAEAEQALDEL 188
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
15-110 |
2.64e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.58 E-value: 2.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 15 FGNQALQELKMASSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVL--VSSG 92
Cdd:COG4372 16 FGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLqaAQAE 95
|
90
....*....|....*...
gi 1958788261 93 QTEVLEQLKALQTDISSL 110
Cdd:COG4372 96 LAQAQEELESLQEEAEEL 113
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
684-724 |
3.12e-03 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 37.43 E-value: 3.12e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1958788261 684 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 724
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1283-1304 |
3.61e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.59 E-value: 3.61e-03
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
30-265 |
4.25e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 42.35 E-value: 4.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQ--EARVLVSSGQTEVLEQLKALQTDI 107
Cdd:TIGR02168 267 EKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEEleAQLEELESKLDELAEELAELEEKL 346
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 108 SSLYNLKFHAPALGPEPAAQTPEGSpvHGPAPSKDSFGELSRATIRLLEELDQERCFLLSEIekeekeklwyySQLQGLS 187
Cdd:TIGR02168 347 EELKEELESLEAELEELEAELEELE--SRLEELEEQLETLRSKVAQLELQIASLNNEIERLE-----------ARLERLE 413
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958788261 188 KRLDELpHVDTPVPRSQQFSMQMDLIRQQLEFEAQHIRSLMEERfgtsDEMVQRAQIRASRLEQIDKELLEAQDRVQQ 265
Cdd:TIGR02168 414 DRRERL-QQEIEELLKKLEEAELKELQAELEELEEELEELQEEL----ERLEEALEELREELEEAEQALDAAERELAQ 486
|
|
| ZapB |
pfam06005 |
Cell division protein ZapB; ZapB is a non-essential, abundant cell division factor that is ... |
22-83 |
4.74e-03 |
|
Cell division protein ZapB; ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation.
Pssm-ID: 428718 [Multi-domain] Cd Length: 71 Bit Score: 37.63 E-value: 4.74e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958788261 22 ELKMASSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQ 83
Cdd:pfam06005 10 ETKIQAAVDTIALLQMENEELKEENEELKEEANELEEENQQLKQERNQWQERIRGLLGKLDE 71
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1171-1194 |
5.84e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 5.84e-03
|
| ZapB |
COG3074 |
Cell division protein ZapB, interacts with FtsZ [Cell cycle control, cell division, chromosome ... |
17-82 |
6.73e-03 |
|
Cell division protein ZapB, interacts with FtsZ [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442308 [Multi-domain] Cd Length: 79 Bit Score: 37.64 E-value: 6.73e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958788261 17 NQALQEL--KMASSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLE 82
Cdd:COG3074 3 LELLEELeaKVQQAVDTIELLQMEVEELKEKNEELEQENEELQSENEELQSENEQLKTENAEWQERIR 70
|
|
|
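Each hit in the listing above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." plus an Interval cell such as 1794-2126. For readers who want to tabulate these values, the following is a minimal sketch of how such lines could be parsed. It assumes the text layout seen in this listing only; the names `SUMMARY_RE`, `HitSummary`, `parse_summary`, and `interval_length` are hypothetical helpers for illustration and are not part of any NCBI tool or API.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed layout of the per-hit summary line, taken from this listing, e.g.
# "Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 316.43 E-value: 7.54e-97"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.]+(?:[eE][+-]?\d+)?)"
)


@dataclass
class HitSummary:
    """Hypothetical container for one parsed summary line."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


def parse_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed fields, or None if the line is not a summary line."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m["pssm_id"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["cd_length"]),
        bit_score=float(m["bit_score"]),
        e_value=float(m["e_value"]),
    )


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(x) for x in interval.split("-"))
    return end - start + 1
```

As a usage example, `interval_length("1794-2126")` gives the number of query residues in the APC_basic hit, which can then be compared with the `cd_length` parsed from the same entry to see how much of the domain model the hit spans.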