Conserved domains on  [gi|1958788261|ref|XP_038934576|]
adenomatous polyposis coli protein 2 isoform X2 [Rattus norvegicus]

List of domain hits

Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1794-2126 7.54e-97

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 316.43  E-value: 7.54e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1794 AMLRGRTVIYTASpASRAQSKGISGPCSAPKKMgtsgttqpETATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPP 1873
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1874 ARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGP------LPGPGGSPVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1945
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRlpgsggRNKLSPLPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1946 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRVASTRSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2020
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2021 SSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASPRQPLAA--QRVPQAKPGLAPRAPRRTSSESPSRLPVRATPG 2098
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 1958788261 2099 RPETVKRYASLPHISVSRRPDSAVSVPT 2126
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
386-459 5.78e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 131.52  E-value: 5.78e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788261  386 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 459
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
30-81 6.75e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.99  E-value: 6.75e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958788261   30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
148-235 1.39e-17

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 79.22  E-value: 1.39e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  148 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 227
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74

                   ....*...
gi 1958788261  228 MEERFGTS 235
Cdd:pfam11414   75 NRCLGGLI 82
Arm_APC_u3 super family cl25003
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
725-971 5.08e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


The actual alignment was detected with superfamily member pfam16629:

Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.08e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  725 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 804
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  805 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 867
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  868 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 943
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1958788261  944 AAHTSLSNDSLNSGSTSDGYCTREHMTP 971
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
643-682 1.42e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.42e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1958788261  643 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1628-1649 7.37e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.42  E-value: 7.37e-05
                           10        20
                   ....*....|....*....|..
gi 1958788261 1628 SPRAEEELLQRCISLAMPRRRT 1649
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1534-1959 8.45e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 8.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1534 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1610
Cdd:PHA03247  2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1611 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1679
Cdd:PHA03247  2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1680 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1759
Cdd:PHA03247  2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1760 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1839
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1840 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1918
Cdd:PHA03247  2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788261 1919 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1959
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
DUF5585 super family cl39316
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2071-2310 2.86e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


The actual alignment was detected with superfamily member pfam17823:

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 46.11  E-value: 2.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2071 AKPGLAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVS-VPTTQANATRRGSdgearPLPRVAAP 2149
Cdd:pfam17823  106 AADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPrAAIAAASAPHAAS-----PAPRTAAS 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2150 GTTW--RRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPHRKTSDAVV----------------------------- 2198
Cdd:pfam17823  181 STTAasSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVgnsspaagtvtaavgtvtpaalatlaaaa 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2199 QTEDVATSKTNS----STSPSLESRDPPQAPISGPVAPLGSDVDGPVLAkppasapFTHEGLSVVTGGFPTSrhgSPSRA 2274
Cdd:pfam17823  261 GTVASAAGTINMgdphARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQ-------VSTDQPVHNTAGEPTP---SPSNT 330
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1958788261 2275 ARVPPFNYVPSPMVVATMTSDSAVEKAPVTSPASLL 2310
Cdd:pfam17823  331 TLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1407-1429 6.61e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 6.61e-04
                           10        20
                   ....*....|....*....|...
gi 1958788261 1407 DDSGTDSAEGTPVNFSSAASLSD 1429
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
684-724 3.12e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.43  E-value: 3.12e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1958788261  684 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 724
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1283-1304 3.61e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.61e-03
                           10        20
                   ....*....|....*....|..
gi 1958788261 1283 SVRFTVEKPDENFSCASSLSAL 1304
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1171-1194 5.84e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.84e-03
                           10        20
                   ....*....|....*....|....
gi 1958788261 1171 SSSSENCVQETPLVLSRCSSVSSL 1194
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
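
Each hit in the list above carries an interval line (for example `1794-2126 7.54e-97`) and a summary line beginning with `Pssm-ID:`. The following is a minimal sketch of how those two line types could be read into structured values when working with a saved text copy of this listing. The names `HitSummary`, `parse_summary`, and `parse_interval` are hypothetical helpers introduced for illustration, and the regular expressions assume only the layout visible in this listing, not any official NCBI output format.

```python
import re
from dataclasses import dataclass

# Assumed layout of the per-hit summary line, taken from this listing only.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional tag such as [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

# Assumed layout of the interval line: "<start>-<end> <E-value>".
INTERVAL_RE = re.compile(
    r"^(?P<start>\d+)-(?P<end>\d+)\s+(?P<e_value>[\d.eE+-]+)$"
)


@dataclass
class HitSummary:
    """Hypothetical container for one 'Pssm-ID: ...' summary line."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


def parse_summary(line: str) -> HitSummary:
    """Parse a summary line; raise ValueError if the layout does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m["pssm_id"]),
        multi_domain=(m["flag"] == "Multi-domain"),
        cd_length=int(m["cd_length"]),
        bit_score=float(m["bit_score"]),
        e_value=float(m["e_value"]),
    )


def parse_interval(line: str) -> tuple[int, int, float]:
    """Parse an interval line into (start, end, e_value) on the query protein."""
    m = INTERVAL_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not an interval line: {line!r}")
    return int(m["start"]), int(m["end"]), float(m["e_value"])


if __name__ == "__main__":
    start, end, e_value = parse_interval("1794-2126 7.54e-97")
    summary = parse_summary(
        "Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  "
        "Bit Score: 316.43  E-value: 7.54e-97"
    )
    # Residues of the query covered by the hit, inclusive of both ends.
    print(end - start + 1, summary.cd_length)
```

Comparing the covered query span with `cd_length` gives a rough sense of how much of the domain model the hit spans, although gaps in the alignment mean the two numbers are not directly equivalent.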
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1794-2126 7.54e-97

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 316.43  E-value: 7.54e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1794 AMLRGRTVIYTASpASRAQSKGISGPCSAPKKMgtsgttqpETATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPP 1873
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1874 ARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGP------LPGPGGSPVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1945
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPGRlpgsggRNKLSPLPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1946 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRVASTRSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 2020
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2021 SSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASPRQPLAA--QRVPQAKPGLAPRAPRRTSSESPSRLPVRATPG 2098
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 1958788261 2099 RPETVKRYASLPHISVSRRPDSAVSVPT 2126
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
386-459 5.78e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 131.52  E-value: 5.78e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788261  386 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 459
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
30-81 6.75e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.99  E-value: 6.75e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958788261   30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
148-235 1.39e-17

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 79.22  E-value: 1.39e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  148 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 227
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74

                   ....*...
gi 1958788261  228 MEERFGTS 235
Cdd:pfam11414   75 NRCLGGLI 82
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
725-971 5.08e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.08e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  725 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 804
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  805 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 867
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  868 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 943
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1958788261  944 AAHTSLSNDSLNSGSTSDGYCTREHMTP 971
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
643-682 1.42e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form a super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.42e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1958788261  643 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
1805-2311 1.53e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 1.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1805 ASPASRAQSKGISGPCSAPKKMGTSGTTQpETATKTPsPEQQRSRSlhrPGKISELAALSHPPrSATPPArlTKTPSSSS 1884
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSR-ARRPDAP-PQSARPRA---PVDDRGDPRGPAPP-SPLPPD--THAPDPPP 2628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1885 SQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPKSPARALLAKQHKTQKSPvripfMQRPARRVPPP-------LARPSP 1957
Cdd:PHA03247  2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-----PQRPRRRAARPtvgsltsLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1958 EPGSrgragAEGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTVSTSQTASPCR 2037
Cdd:PHA03247  2704 PPPT-----PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2038 GRPALPAVFLCSSrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPETvkryaslphisvsrr 2117
Cdd:PHA03247  2779 PPRRLTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP--------------- 2841
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2118 PDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRrikdedvPHILRSTLPATALPlRGSSPEDSPAGTPHRKtsdav 2197
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-------ARPPVRRLARPAVS-RSTESFALPPDQPERP----- 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2198 vQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPPASAPFTHEGlSVVTGGFPTSRHGSPSRAARV 2277
Cdd:PHA03247  2909 -PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPGRVAVPRFRVPQPAPSR 2986
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|
gi 1958788261 2278 PPFNYVPSPMV------VATMTSDSAVEKAPVTSPASLLE 2311
Cdd:PHA03247  2987 EAPASSTPPLTghslsrVSSWASSLALHEETDPPPVSLKQ 3026
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
642-682 2.30e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form a superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin, arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 2.30e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1958788261   642 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1628-1649 7.37e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.42  E-value: 7.37e-05
                           10        20
                   ....*....|....*....|..
gi 1958788261 1628 SPRAEEELLQRCISLAMPRRRT 1649
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PHA03247 PHA03247
large tegument protein UL36; Provisional
1534-1959 8.45e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 8.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1534 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1610
Cdd:PHA03247  2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1611 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1679
Cdd:PHA03247  2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1680 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1759
Cdd:PHA03247  2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1760 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1839
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1840 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1918
Cdd:PHA03247  2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788261 1919 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1959
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2071-2310 2.86e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 46.11  E-value: 2.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2071 AKPGLAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVS-VPTTQANATRRGSdgearPLPRVAAP 2149
Cdd:pfam17823  106 AADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPrAAIAAASAPHAAS-----PAPRTAAS 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2150 GTTW--RRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPHRKTSDAVV----------------------------- 2198
Cdd:pfam17823  181 STTAasSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVgnsspaagtvtaavgtvtpaalatlaaaa 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2199 QTEDVATSKTNS----STSPSLESRDPPQAPISGPVAPLGSDVDGPVLAkppasapFTHEGLSVVTGGFPTSrhgSPSRA 2274
Cdd:pfam17823  261 GTVASAAGTINMgdphARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQ-------VSTDQPVHNTAGEPTP---SPSNT 330
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1958788261 2275 ARVPPFNYVPSPMVVATMTSDSAVEKAPVTSPASLL 2310
Cdd:pfam17823  331 TLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1407-1429 6.61e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 6.61e-04
                           10        20
                   ....*....|....*....|...
gi 1958788261 1407 DDSGTDSAEGTPVNFSSAASLSD 1429
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
17-110 2.38e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.58  E-value: 2.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261   17 NQALQELKMASsvASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVLVSSGQTEV 96
Cdd:COG4372     97 AQAQEELESLQ--EEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQ 174
                           90
                   ....*....|....
gi 1958788261   97 LEQLKALQTDISSL 110
Cdd:COG4372    175 ALSEAEAEQALDEL 188
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
684-724 3.12e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.43  E-value: 3.12e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1958788261  684 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 724
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1283-1304 3.61e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.61e-03
                           10        20
                   ....*....|....*....|..
gi 1958788261 1283 SVRFTVEKPDENFSCASSLSAL 1304
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
30-265 4.25e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 42.35  E-value: 4.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261   30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQ--EARVLVSSGQTEVLEQLKALQTDI 107
Cdd:TIGR02168  267 EKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEEleAQLEELESKLDELAEELAELEEKL 346
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  108 SSLYNLKFHAPALGPEPAAQTPEGSpvHGPAPSKDSFGELSRATIRLLEELDQERCFLLSEIekeekeklwyySQLQGLS 187
Cdd:TIGR02168  347 EELKEELESLEAELEELEAELEELE--SRLEELEEQLETLRSKVAQLELQIASLNNEIERLE-----------ARLERLE 413
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958788261  188 KRLDELpHVDTPVPRSQQFSMQMDLIRQQLEFEAQHIRSLMEERfgtsDEMVQRAQIRASRLEQIDKELLEAQDRVQQ 265
Cdd:TIGR02168  414 DRRERL-QQEIEELLKKLEEAELKELQAELEELEEELEELQEEL----ERLEEALEELREELEEAEQALDAAERELAQ 486
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1171-1194 5.84e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.84e-03
                           10        20
                   ....*....|....*....|....
gi 1958788261 1171 SSSSENCVQETPLVLSRCSSVSSL 1194
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
 
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
386-459 5.78e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 131.52  E-value: 5.78e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788261  386 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 459
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
30-81 6.75e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.99  E-value: 6.75e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958788261   30 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 81
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
148-235 1.39e-17

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 79.22  E-value: 1.39e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  148 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 227
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74

                   ....*...
gi 1958788261  228 MEERFGTS 235
Cdd:pfam11414   75 NRCLGGLI 82
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
725-971 5.08e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.08e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  725 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 804
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  805 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 867
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261  868 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 943
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1958788261  944 AAHTSLSNDSLNSGSTSDGYCTREHMTP 971
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
643-682 1.42e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.42e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1958788261  643 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
1805-2311 1.53e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 1.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1805 ASPASRAQSKGISGPCSAPKKMGTSGTTQpETATKTPsPEQQRSRSlhrPGKISELAALSHPPrSATPPArlTKTPSSSS 1884
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSR-ARRPDAP-PQSARPRA---PVDDRGDPRGPAPP-SPLPPD--THAPDPPP 2628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1885 SQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPKSPARALLAKQHKTQKSPvripfMQRPARRVPPP-------LARPSP 1957
Cdd:PHA03247  2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-----PQRPRRRAARPtvgsltsLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1958 EPGSrgragAEGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTVSTSQTASPCR 2037
Cdd:PHA03247  2704 PPPT-----PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2038 GRPALPAVFLCSSrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPETvkryaslphisvsrr 2117
Cdd:PHA03247  2779 PPRRLTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP--------------- 2841
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2118 PDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRrikdedvPHILRSTLPATALPlRGSSPEDSPAGTPHRKtsdav 2197
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-------ARPPVRRLARPAVS-RSTESFALPPDQPERP----- 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2198 vQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPPASAPFTHEGlSVVTGGFPTSRHGSPSRAARV 2277
Cdd:PHA03247  2909 -PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPGRVAVPRFRVPQPAPSR 2986
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|
gi 1958788261 2278 PPFNYVPSPMV------VATMTSDSAVEKAPVTSPASLLE 2311
Cdd:PHA03247  2987 EAPASSTPPLTghslsrVSSWASSLALHEETDPPPVSLKQ 3026
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
642-682 2.30e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 2.30e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1958788261   642 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 682
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1942-2276 6.70e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.71  E-value: 6.70e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1942 QRPARRVPPPLARPSPEPGSRGRAGAeGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTfikESPGLLRRRRSELS 2021
Cdd:PHA03307    80 PANESRSTPTWSLSTLAPASPAREGS-PTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPV---GSPGPPPAASPPAA 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2022 SADSTVSTSQTASPCRGRPALPAVflcssrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPE 2101
Cdd:PHA03307   156 GASPAAVASDAASSRQAALPLSSP-------EETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2102 TVKRYASLPHISVSRRPDSAVSvpttQANATRRGSDGEARpLPRVAAPGTTWrriKDEDVPHILRStlPATALPLRGSSP 2181
Cdd:PHA03307   229 ADDAGASSSDSSSSESSGCGWG----PENECPLPRPAPIT-LPTRIWEASGW---NGPSSRPGPAS--SSSSPRERSPSP 298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2182 EDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPP----ASAPFTHEGLS 2257
Cdd:PHA03307   299 SPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSsprkRPRPSRAPSSP 378
                          330
                   ....*....|....*....
gi 1958788261 2258 VVTGGFPTSRHGSPSRAAR 2276
Cdd:PHA03307   379 AASAGRPTRRRARAAVAGR 397
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1628-1649 7.37e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.42  E-value: 7.37e-05
                           10        20
                   ....*....|....*....|..
gi 1958788261 1628 SPRAEEELLQRCISLAMPRRRT 1649
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PHA03247 PHA03247
large tegument protein UL36; Provisional
1534-1959 8.45e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 8.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1534 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1610
Cdd:PHA03247  2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1611 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1679
Cdd:PHA03247  2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1680 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1759
Cdd:PHA03247  2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1760 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1839
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1840 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1918
Cdd:PHA03247  2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788261 1919 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1959
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
PHA03247 PHA03247
large tegument protein UL36; Provisional
1758-2232 2.95e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 2.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1758 RPEKRGTTSTKGSGSPRFPSGPEKAKGtqKTMAGESAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGT-TQPET 1836
Cdd:PHA03247  2572 RPAPRPSEPAVTSRARRPDAPPQSARP--RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPpTVPPP 2649
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1837 ATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPV 1916
Cdd:PHA03247  2650 ERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR 2729
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1917 PKSPARALLAKQHKTQKSPVrIPFMQRPARRVPPPLARPSPEPGsrgrAGAEGTPGARGSRLGLVRVASTRSSGSESsdr 1996
Cdd:PHA03247  2730 QASPALPAAPAPPAVPAGPA-TPGGPARPARPPTTAGPPAPAPP----AAPAAGPPRRLTRPAVASLSESRESLPSP--- 2801
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 1997 sgfrrqltfikespgllrrrrSELSSADSTVSTSQTASPCRGRPALPAVFLCSSrcdeLRASPRQPLAAQRVPQAKPG-L 2075
Cdd:PHA03247  2802 ---------------------WDPADPPAAVLAPAAALPPAASPAGPLPPPTSA----QPTAPPPPPGPPPPSLPLGGsV 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2076 APRAPRRTSSESPSRLPVRATPGRPetvkRYASLPHISVSRRPDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRR 2155
Cdd:PHA03247  2857 APGGDVRRRPPSRSPAAKPAAPARP----PVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788261 2156 IKDEDVPhilrstlPATALPLRGSSPEDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAP 2232
Cdd:PHA03247  2933 PPPPPRP-------QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
PHA03247 PHA03247
large tegument protein UL36; Provisional
2009-2241 4.20e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 4.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2009 SPGLLRRRRSELSSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASP-RQPLAAQRVPQAKPGLAPRAPR------ 2081
Cdd:PHA03247   256 APPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPlALPAPPDPPPPAPAGDAEEEDDedgame 335
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2082 ------RTSSESPSRLPVRATPgrpeTVKRYASLPHISVSRRPDSAVSVPTTQANATR-------RGSDGEARPLPRVAA 2148
Cdd:PHA03247   336 vvsplpRPRQHYPLGFPKRRRP----TWTPPSSLEDLSAGRHHPKRASLPTRKRRSARhaatpfaRGPGGDDQTRPAAPV 411
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261 2149 PGTTwrriKDEDVPHILRSTLPATALPLRGSSP--EDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPI 2226
Cdd:PHA03247   412 PASV----PTPAPTPVPASAPPPPATPLPSAEPgsDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPG 487
                          250
                   ....*....|....*..
gi 1958788261 2227 SGPVAPLGS--DVDGPV 2241
Cdd:PHA03247   488 ADLAELLGRhpDTAGTV 504
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
15-110 2.64e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.58  E-value: 2.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788261   15 FGNQALQELKMASSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVL--VSSG 92
Cdd:COG4372     16 FGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLqaAQAE 95
                           90
                   ....*....|....*...
gi 1958788261   93 QTEVLEQLKALQTDISSL 110
Cdd:COG4372     96 LAQAQEELESLQEEAEEL 113
ZapB pfam06005
Cell division protein ZapB; ZapB is a non-essential, abundant cell division factor that is ...
22-83 4.74e-03

Cell division protein ZapB; ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation.


Pssm-ID: 428718 [Multi-domain]  Cd Length: 71  Bit Score: 37.63  E-value: 4.74e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1958788261   22 ELKMASSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQ 83
Cdd:pfam06005   10 ETKIQAAVDTIALLQMENEELKEENEELKEEANELEEENQQLKQERNQWQERIRGLLGKLDE 71
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1171-1194 5.84e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.84e-03
                           10        20
                   ....*....|....*....|....
gi 1958788261 1171 SSSSENCVQETPLVLSRCSSVSSL 1194
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ZapB COG3074
Cell division protein ZapB, interacts with FtsZ [Cell cycle control, cell division, chromosome ...
17-82 6.73e-03

Cell division protein ZapB, interacts with FtsZ [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442308 [Multi-domain]  Cd Length: 79  Bit Score: 37.64  E-value: 6.73e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958788261   17 NQALQEL--KMASSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLE 82
Cdd:COG3074      3 LELLEELeaKVQQAVDTIELLQMEVEELKEKNEELEQENEELQSENEELQSENEQLKTENAEWQERIR 70
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.