Conserved domains on [gi|190165|gb|AAA60353|]

polyposis locus-encoded protein [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
732-1019 3.09e-166

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 514.14  E-value: 3.09e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      732 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 809
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      810 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 886
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      887 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 966
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 190165      967 VSSNDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 1019
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
2224-2569 3.57e-110

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 355.34  E-value: 3.57e-110
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2224 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2302
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2303 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2382
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2383 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2456
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2457 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2536
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 190165     2537 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2569
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2671-2844 1.19e-89

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. The C-terminus of the alignment also contains a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


Pssm-ID: 399141  Cd Length: 174  Bit Score: 289.20  E-value: 1.19e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2671 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLTSFIQVDAPDQKGTEIKPGQNNPVPVS 2750
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2751 ETNESPIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2830
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 190165     2831 SPKRHSGSYLVTSV 2844
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
1036-1135 8.13e-57

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406923  Cd Length: 100  Bit Score: 192.43  E-value: 8.13e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     1036 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 1115
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 190165     1116 GSNHGINQNVSQSLCQEDDY 1135
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1747-1840 2.54e-48

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP, pfam05924, and the fifth cysteine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435479  Cd Length: 94  Bit Score: 167.71  E-value: 2.54e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     1747 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKDFND 1826
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 190165     1827 KLPNNEDRVRGSFA 1840
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
393-466 4.76e-45

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 157.71  E-value: 4.76e-45
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190165      393 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 466
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1874-1948 2.65e-31

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth cysteine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435480  Cd Length: 81  Bit Score: 118.81  E-value: 2.65e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 190165     1874 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1948
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_u9 pfam16633
Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of ...
1283-1369 5.60e-31

Unstructured region on APC between 1st two cysteine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435478  Cd Length: 89  Bit Score: 118.05  E-value: 5.60e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     1283 AEDEI-GCNQTTQEADSANTLQIAEIKGKIGTRSAEDPVSEVPSSVHSTlETKSSRLQGSSLS-SESARHKAVEFPSGAK 1360
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHI-RTKPNRLQASNLSpSDSSRHKAVEFSSGAK 80

                   ....*....
gi 190165     1361 SPSKSGAQT 1369
Cdd:pfam16633   81 SPSKSGAQT 89
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1663-1716 1.23e-24

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.71  E-value: 1.23e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 190165     1663 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1716
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
127-207 4.74e-22

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 92.32  E-value: 4.74e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      127 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDLTRRQLEYEARQIRVAMEEQLG 205
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 190165      206 TC 207
Cdd:pfam11414   81 LI 82
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
4-55 6.06e-22

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 91.20  E-value: 6.06e-22
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 190165        4 ASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI 55
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1638-1661 1.46e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 49.30  E-value: 1.46e-07
                           10        20
                   ....*....|....*....|....
gi 190165     1638 DMPRVYCVEGTPINFSTATSLSDL 1661
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
649-689 4.89e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.89e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 190165       649 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 689
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
2034-2053 3.82e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 3.82e-05
                           10        20
                   ....*....|....*....|
gi 190165     2034 DSEDDLLQECISSAMPKKKK 2053
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
8-272 1.98e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 47.36  E-value: 1.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165        8 QLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDeamassgqidLLERLKELNLDSSNFPGVKL 87
Cdd:TIGR02168  744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ----------LKEELKALREALDELRAELT 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165       88 RSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRGFVNGSRES-TGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLT 166
Cdd:TIGR02168  814 LLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESlAAEIEELEELIEELESELEALLNERASLEEALALLR 893
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      167 KRIDSLpltenfslqtDLTRRQLEYEARQIRVAMEEqlgtCQDMEKRAQRRIARIQQIEKDIL-RIRQLLQSQATEAERs 245
Cdd:TIGR02168  894 SELEEL----------SEELRELESKRSELRRELEE----LREKLAQLELRLEGLEVRIDNLQeRLSEEYSLTLEEAEA- 958
                          250       260       270
                   ....*....|....*....|....*....|..
gi 190165      246 SQNKHETGSHDAERQ-----NEGQGVGEINMA 272
Cdd:TIGR02168  959 LENKIEDDEEEARRRlkrleNKIKELGPVNLA 990
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
512-553 5.63e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.63e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 190165       512 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 553
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
691-731 7.00e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 7.00e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 190165      691 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 731
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1717-1738 1.42e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 37.96  E-value: 1.42e-03
                           10        20
                   ....*....|....*....|..
gi 190165     1717 NKAEEGDILAECINSAMPKGKS 1738
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1372-1394 3.50e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.50e-03
                           10        20
                   ....*....|....*....|....
gi 190165     1372 SPPEHY-VQETPLMFSRCTSVSSL 1394
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1257-1274 6.63e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 6.63e-03
                           10
                   ....*....|....*...
gi 190165     1257 ETIQTYCVEDTPICFSRC 1274
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
340-392 8.69e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 8.69e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 190165       340 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 392
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
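
Each hit in the list above carries two machine-readable lines: an interval line such as "732-1019 3.09e-166" and a summary line beginning with "Pssm-ID:". The following is a minimal Python sketch of how those two lines might be parsed when working from a plain-text copy of this page. The names HitSummary, parse_summary and parse_interval are hypothetical helpers introduced for illustration; they are not part of any NCBI tool or API, and the regular expression only assumes the field layout visible in this listing.

```python
import re
from typing import NamedTuple, Tuple


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float
    multi_domain: bool


# Assumed layout, taken from the lines in this listing:
# "Pssm-ID: <int> [Multi-domain]?  Cd Length: <int>  Bit Score: <float>  E-value: <float>"
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if the layout differs."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not a Pssm-ID summary line: %r" % line)
    return HitSummary(
        pssm_id=int(m.group("pssm")),
        cd_length=int(m.group("length")),
        bit_score=float(m.group("bits")),
        e_value=float(m.group("evalue")),
        multi_domain=m.group("multi") is not None,
    )


def parse_interval(line: str) -> Tuple[int, int, float]:
    """Parse an interval line such as '732-1019 3.09e-166' into (start, end, e_value)."""
    span, evalue = line.split()
    start, end = span.split("-")
    return int(start), int(end), float(evalue)
```

For example, parse_interval("2224-2569 3.57e-110") yields the query start and end residues as integers, which makes it straightforward to sort the hits by position along the protein rather than by E-value.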
 
Name Accession Description Interval E-value
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
732-1019 3.09e-166

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 514.14  E-value: 3.09e-166
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      732 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 809
Cdd:pfam16629    1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      810 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 886
Cdd:pfam16629   81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      887 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 966
Cdd:pfam16629  161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 190165      967 VSSNDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 1019
Cdd:pfam16629  241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
2224-2569 3.57e-110

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 355.34  E-value: 3.57e-110
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2224 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2302
Cdd:pfam05956    1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2303 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2382
Cdd:pfam05956   81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2383 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2456
Cdd:pfam05956  154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2457 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2536
Cdd:pfam05956  226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
                          330       340       350
                   ....*....|....*....|....*....|...
gi 190165     2537 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2569
Cdd:pfam05956  298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
EB1_binding pfam05937
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ...
2671-2844 1.19e-89

EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.


Pssm-ID: 399141  Cd Length: 174  Bit Score: 289.20  E-value: 1.19e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2671 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLTSFIQVDAPDQKGTEIKPGQNNPVPVS 2750
Cdd:pfam05937    1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2751 ETNESPIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2830
Cdd:pfam05937   81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
                          170
                   ....*....|....
gi 190165     2831 SPKRHSGSYLVTSV 2844
Cdd:pfam05937  161 SPKRHSGSYLVTSV 174
APC_u5 pfam16630
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ...
1036-1135 8.13e-57

Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406923  Cd Length: 100  Bit Score: 192.43  E-value: 8.13e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     1036 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 1115
Cdd:pfam16630    1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
                           90       100
                   ....*....|....*....|
gi 190165     1116 GSNHGINQNVSQSLCQEDDY 1135
Cdd:pfam16630   81 GSSHGINQKVSQSLCQVDDY 100
APC_u14 pfam16635
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ...
1747-1840 2.54e-48

Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435479  Cd Length: 94  Bit Score: 167.71  E-value: 2.54e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     1747 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKDFND 1826
Cdd:pfam16635    1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
                           90
                   ....*....|....
gi 190165     1827 KLPNNEDRVRGSFA 1840
Cdd:pfam16635   81 KLPNNEERTRGSFA 94
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
393-466 4.76e-45

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 157.71  E-value: 4.76e-45
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190165      393 SQPDDKRGRREIRVLHLLEQIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 466
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_u15 pfam16636
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ...
1874-1948 2.65e-31

Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435480  Cd Length: 81  Bit Score: 118.81  E-value: 2.65e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 190165     1874 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1948
Cdd:pfam16636    7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
APC_u9 pfam16633
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ...
1283-1369 5.60e-31

Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435478  Cd Length: 89  Bit Score: 118.05  E-value: 5.60e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     1283 AEDEI-GCNQTTQEADSANTLQIAEIKGKIGTRSAEDPVSEVPSSVHSTlETKSSRLQGSSLS-SESARHKAVEFPSGAK 1360
Cdd:pfam16633    2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHI-RTKPNRLQASNLSpSDSSRHKAVEFSSGAK 80

                   ....*....
gi 190165     1361 SPSKSGAQT 1369
Cdd:pfam16633   81 SPSKSGAQT 89
APC_u13 pfam16634
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ...
1663-1716 1.23e-24

Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth cysteine-rich region, APC_crr, pfam05923, and the SAMP, pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.71  E-value: 1.23e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 190165     1663 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1716
Cdd:pfam16634    1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
127-207 4.74e-22

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 92.32  E-value: 4.74e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      127 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDLTRRQLEYEARQIRVAMEEQLG 205
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80

                   ..
gi 190165      206 TC 207
Cdd:pfam11414   81 LI 82
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
4-55 6.06e-22

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatous polyposis coli (APC) tumour-suppressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 91.20  E-value: 6.06e-22
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 190165        4 ASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI 55
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1638-1661 1.46e-07

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 49.30  E-value: 1.46e-07
                           10        20
                   ....*....|....*....|....
gi 190165     1638 DMPRVYCVEGTPINFSTATSLSDL 1661
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
649-689 4.89e-07

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 48.19  E-value: 4.89e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 190165       649 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 689
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
649-689 6.26e-07

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.83  E-value: 6.26e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 190165      649 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 689
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
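
Several hits in this list land on the same stretch of the query; for example, ARM (smart00185) and Arm (pfam00514) both report the interval 649-689. A small sketch of how such overlaps could be detected follows. The function `overlap_fraction` is a hypothetical helper, and measuring overlap relative to the shorter interval is one possible convention among several.

```python
def overlap_fraction(a: tuple[int, int], b: tuple[int, int]) -> float:
    """Fraction of the shorter interval covered by the overlap (1-based, inclusive)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    overlap = max(0, end - start + 1)
    shorter = min(a[1] - a[0] + 1, b[1] - b[0] + 1)
    return overlap / shorter


# Intervals taken from the hit list on this page.
print(overlap_fraction((649, 689), (649, 689)))  # smart00185 vs pfam00514
print(overlap_fraction((649, 689), (691, 731)))  # two neighbouring Arm repeats
```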
PHA03247 PHA03247
large tegument protein UL36; Provisional
2252-2565 9.58e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 9.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2252 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2327
Cdd:PHA03247 2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2328 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2405
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2406 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2484
Cdd:PHA03247 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2485 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2554
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
                         330
                  ....*....|.
gi 190165    2555 HSSSLPRVSTW 2565
Cdd:PHA03247 2997 TGHSLSRVSSW 3007
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
2034-2053 3.82e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 3.82e-05
                           10        20
                   ....*....|....*....|
gi 190165     2034 DSEDDLLQECISSAMPKKKK 2053
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
8-272 1.98e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 47.36  E-value: 1.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165        8 QLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDeamassgqidLLERLKELNLDSSNFPGVKL 87
Cdd:TIGR02168  744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ----------LKEELKALREALDELRAELT 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165       88 RSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRGFVNGSRES-TGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLT 166
Cdd:TIGR02168  814 LLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESlAAEIEELEELIEELESELEALLNERASLEEALALLR 893
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      167 KRIDSLpltenfslqtDLTRRQLEYEARQIRVAMEEqlgtCQDMEKRAQRRIARIQQIEKDIL-RIRQLLQSQATEAERs 245
Cdd:TIGR02168  894 SELEEL----------SEELRELESKRSELRRELEE----LREKLAQLELRLEGLEVRIDNLQeRLSEEYSLTLEEAEA- 958
                          250       260       270
                   ....*....|....*....|....*....|..
gi 190165      246 SQNKHETGSHDAERQ-----NEGQGVGEINMA 272
Cdd:TIGR02168  959 LENKIEDDEEEARRRlkrleNKIKELGPVNLA 990
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
135-262 4.01e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 46.08  E-value: 4.01e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    135 EELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfSLQTDLTRRQLEyEARQIRVAMEEQLgtcQDMEKRA 214
Cdd:COG1196  221 ELKELEAELLLLKLRELEAELEELEAELEELEAELEEL------EAELAELEAELE-ELRLELEELELEL---EEAQAEE 290
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 190165    215 QRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQNE 262
Cdd:COG1196  291 YELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEE 338
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
512-553 5.63e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.63e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 190165       512 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 553
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
512-552 6.04e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.04e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 190165      512 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 552
Cdd:pfam00514    1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
691-731 7.00e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 7.00e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 190165      691 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 731
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1717-1738 1.42e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 37.96  E-value: 1.42e-03
                           10        20
                   ....*....|....*....|..
gi 190165     1717 NKAEEGDILAECINSAMPKGKS 1738
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
11-247 1.44e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.60  E-value: 1.44e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     11 KQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI---EDEAMASSGQIDLLE-RLKELNLDSSnfpgvK 86
Cdd:COG4942   20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIaalARRIRALEQELAALEaELAELEKEIA-----E 94
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     87 LRSKM-SLRSYGSREGSVSSRSGECSPvPMGSFPRRGFVNGSRESTgYLEELEKERSLLLADLDKEEKEKDWYYAQLQNL 165
Cdd:COG4942   95 LRAELeAQKEELAELLRALYRLGRQPP-LALLLSPEDFLDAVRRLQ-YLKYLAPARREQAEELRADLAELAALRAELEAE 172
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    166 TKRIDSLpltenfslqtdltRRQLEYEARQIRVAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERS 245
Cdd:COG4942  173 RAELEAL-------------LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239

                 ..
gi 190165    246 SQ 247
Cdd:COG4942  240 AE 241
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
120-260 2.17e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.89  E-value: 2.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      120 RRGFVNGSRESTGY--------LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdltrRQLEY 191
Cdd:TIGR02168  657 PGGVITGGSAKTNSsilerrreIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL--------------RKELE 722
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 190165      192 EARQIRVAMEEQLGTcqdMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQ 260
Cdd:TIGR02168  723 ELSRQISALRKDLAR---LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1372-1394 3.50e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.50e-03
                           10        20
                   ....*....|....*....|....
gi 190165     1372 SPPEHY-VQETPLMFSRCTSVSSL 1394
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1257-1274 6.63e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 6.63e-03
                           10
                   ....*....|....*...
gi 190165     1257 ETIQTYCVEDTPICFSRC 1274
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
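
Some hits align to only part of the domain model. In the block above, model positions 1 to 18 are aligned while the Cd Length is 24. A minimal sketch of that coverage calculation is given here; `model_coverage` is a hypothetical name, and positions are treated as 1-based and inclusive, as in the alignment rows.

```python
def model_coverage(model_from: int, model_to: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned region."""
    if not (1 <= model_from <= model_to <= cd_length):
        raise ValueError("model positions must lie within 1..cd_length")
    return (model_to - model_from + 1) / cd_length


# APC_r (pfam05923) hit at query residues 1257-1274: model positions 1-18 of 24.
print(model_coverage(1, 18, 24))
```

A low coverage value is one reason to read a weak hit with more caution, alongside its E-value.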
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
340-392 8.69e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 8.69e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 190165       340 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 392
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
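
The hit list is ordered by increasing E-value, from strong APC-specific families down to weak repeat matches. The sketch below shows one way to filter and order such records. The `Hit` type, the `significant_hits` helper, and the cutoff value are all illustrative assumptions; the records themselves are copied from hits listed on this page.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int
    end: int
    e_value: float


# A few records copied from the list of domain hits above.
HITS = [
    Hit("ARM", "smart00185", 340, 392, 8.69e-03),
    Hit("APC_rep", "pfam18797", 393, 466, 4.76e-45),
    Hit("SAMP", "pfam05924", 2034, 2053, 3.82e-05),
    Hit("APC_u15", "pfam16636", 1874, 1948, 2.65e-31),
    Hit("APC_r", "pfam05923", 1257, 1274, 6.63e-03),
]


def significant_hits(hits, max_e_value):
    """Keep hits at or below the cutoff, strongest (smallest E-value) first."""
    kept = [h for h in hits if h.e_value <= max_e_value]
    return sorted(kept, key=lambda h: h.e_value)


# The cutoff of 1e-5 is a hypothetical choice for illustration.
for hit in significant_hits(HITS, max_e_value=1e-5):
    print(f"{hit.name:10s} {hit.accession:12s} {hit.start}-{hit.end}  {hit.e_value:.2e}")
```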
 


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 47.83  E-value: 6.26e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 190165      649 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 689
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
2252-2565 9.58e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 9.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2252 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2327
Cdd:PHA03247 2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2328 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2405
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2406 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2484
Cdd:PHA03247 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2485 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2554
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
                         330
                  ....*....|.
gi 190165    2555 HSSSLPRVSTW 2565
Cdd:PHA03247 2997 TGHSLSRVSSW 3007
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2257-2402 5.70e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.48  E-value: 5.70e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2257 PASKSPSEGQTATTSPRGAKPSVKSELSPVA----RQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISP 2332
Cdd:PHA03307  278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPApsspRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2333 GRNGISPPNKLSQlPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2402
Cdd:PHA03307  358 PPADPSSPRKRPR-PSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAF 426
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
2034-2053 3.82e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 42.58  E-value: 3.82e-05
                           10        20
                   ....*....|....*....|
gi 190165     2034 DSEDDLLQECISSAMPKKKK 2053
Cdd:pfam05924    3 DDEDDLLQECINSAMPKKRR 22
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2250-2545 4.02e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.40  E-value: 4.02e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2250 KGPPLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVAR----------QTSQIGGSSKAPSRSGSRDSTPS-------R 2312
Cdd:PHA03307  101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRpvgspgpppaASPPAAGASPAAVASDAASSRQAalplsspE 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2313 PAQQPLSRPIQSPGRNSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKmSYTSPGRQMSQQNLTKQTGLSKNASSI 2392
Cdd:PHA03307  181 ETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDA-GASSSDSSSSESSGCGWGPENECPLPR 259
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2393 PRSESASKGLNQMNNGNGankkvelSRMSSTKSSGSESDRSERPVLVRQSTFIKEAPSPTlRRKLEESASFESLSPSSRP 2472
Cdd:PHA03307  260 PAPITLPTRIWEASGWNG-------PSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSP-RASSSSSSSRESSSSSTSS 331
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2473 ASPTRSQAQTPVLSPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEY---NDGRP---AKRHDIARSH--SESPSRLPINR 2544
Cdd:PHA03307  332 SSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSpaaSAGRPtrrRARAAVAGRArrRDATGRFPAGR 411

                  .
gi 190165    2545 S 2545
Cdd:PHA03307  412 P 412
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
8-272 1.98e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 47.36  E-value: 1.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165        8 QLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDeamassgqidLLERLKELNLDSSNFPGVKL 87
Cdd:TIGR02168  744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ----------LKEELKALREALDELRAELT 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165       88 RSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRGFVNGSRES-TGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLT 166
Cdd:TIGR02168  814 LLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESlAAEIEELEELIEELESELEALLNERASLEEALALLR 893
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      167 KRIDSLpltenfslqtDLTRRQLEYEARQIRVAMEEqlgtCQDMEKRAQRRIARIQQIEKDIL-RIRQLLQSQATEAERs 245
Cdd:TIGR02168  894 SELEEL----------SEELRELESKRSELRRELEE----LREKLAQLELRLEGLEVRIDNLQeRLSEEYSLTLEEAEA- 958
                          250       260       270
                   ....*....|....*....|....*....|..
gi 190165      246 SQNKHETGSHDAERQ-----NEGQGVGEINMA 272
Cdd:TIGR02168  959 LENKIEDDEEEARRRlkrleNKIKELGPVNLA 990
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
135-262 4.01e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 46.08  E-value: 4.01e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    135 EELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfSLQTDLTRRQLEyEARQIRVAMEEQLgtcQDMEKRA 214
Cdd:COG1196  221 ELKELEAELLLLKLRELEAELEELEAELEELEAELEEL------EAELAELEAELE-ELRLELEELELEL---EEAQAEE 290
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 190165    215 QRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQNE 262
Cdd:COG1196  291 YELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEE 338
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
512-553 5.63e-04

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 39.72  E-value: 5.63e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 190165       512 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 553
Cdd:smart00185    1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
512-552 6.04e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 6.04e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 190165      512 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 552
Cdd:pfam00514    1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2255-2515 6.17e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 45.34  E-value: 6.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2255 KTPASKSPSEGQTATTSPRGAKPsvkselSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISPgr 2334
Cdd:pfam17823  151 RANASAAPRAAIAAASAPHAASP------APRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP-- 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2335 ngiSPPNKLSQLPrTSSPStASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSiPRSESASKGLNQMNNGNGANKk 2414
Cdd:pfam17823  223 ---AAGTALAAVG-NSSPA-AGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGD-PHARRLSPAKHMPSDTMARNP- 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     2415 veLSRMSSTKSSGSESDRSERPVLvrqSTFIKEAPSPTlRRKLEESASFESLSPSSRPASPTRSQAQTPVLSPsLPDMSL 2494
Cdd:pfam17823  296 --AAPMGAQAQGPIIQVSTDQPVH---NTAGEPTPSPS-NTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASP-VPVLHT 368
                          250       260
                   ....*....|....*....|.
gi 190165     2495 STHSSVQAGGWRKLPPNLSPT 2515
Cdd:pfam17823  369 SMIPEVEATSPTTQPSPLLPT 389
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
691-731 7.00e-04

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 39.36  E-value: 7.00e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 190165      691 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 731
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
134-251 1.39e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 44.37  E-value: 1.39e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    134 LEELEKERSLLLADLDKEEKEKDWY--YAQLQNLTKRIDSLP---------LTENFSLQTDL---------TRRQLEYEA 193
Cdd:COG4717  104 LEELEAELEELREELEKLEKLLQLLplYQELEALEAELAELPerleeleerLEELRELEEELeeleaelaeLQEELEELL 183
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 190165    194 RQIRVAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHE 251
Cdd:COG4717  184 EQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAAL 241
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1717-1738 1.42e-03

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 37.96  E-value: 1.42e-03
                           10        20
                   ....*....|....*....|..
gi 190165     1717 NKAEEGDILAECINSAMPKGKS 1738
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
11-247 1.44e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.60  E-value: 1.44e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     11 KQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSI---EDEAMASSGQIDLLE-RLKELNLDSSnfpgvK 86
Cdd:COG4942   20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIaalARRIRALEQELAALEaELAELEKEIA-----E 94
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165     87 LRSKM-SLRSYGSREGSVSSRSGECSPvPMGSFPRRGFVNGSRESTgYLEELEKERSLLLADLDKEEKEKDWYYAQLQNL 165
Cdd:COG4942   95 LRAELeAQKEELAELLRALYRLGRQPP-LALLLSPEDFLDAVRRLQ-YLKYLAPARREQAEELRADLAELAALRAELEAE 172
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    166 TKRIDSLpltenfslqtdltRRQLEYEARQIRVAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERS 245
Cdd:COG4942  173 RAELEAL-------------LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239

                 ..
gi 190165    246 SQ 247
Cdd:COG4942  240 AE 241
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
120-260 2.17e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.89  E-value: 2.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      120 RRGFVNGSRESTGY--------LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdltrRQLEY 191
Cdd:TIGR02168  657 PGGVITGGSAKTNSsilerrreIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL--------------RKELE 722
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 190165      192 EARQIRVAMEEQLGTcqdMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQ 260
Cdd:TIGR02168  723 ELSRQISALRKDLAR---LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
134-259 2.67e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 43.52  E-value: 2.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165      134 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSL------------PLTENFSLQTDLTRRQLEYEARQIRVAME 201
Cdd:TIGR02169  232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIeqlleelnkkikDLGEEEQLRVKEKIGELEAEIASLERSIA 311
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 190165      202 EqlgtCQDMEKRAQRRIARIQ-QIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAER 259
Cdd:TIGR02169  312 E----KERELEDAEERLAKLEaEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2253-2569 3.36e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.24  E-value: 3.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2253 PLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQiGGSSKAPSRSGsrDSTPSRPAQQPLSRPIQSPGRNSISP 2332
Cdd:PHA03307   63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAR-EGSPTPPGPSS--PDPPPPTPPPASPPPSPAPDLSEMLR 139
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2333 GRNGISPPNKLSQLPRTSSPS-TASTKSSGSGKM---------SYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2402
Cdd:PHA03307  140 PVGSPGPPPAASPPAAGASPAaVASDAASSRQAAlplsspeetARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2403 NQMNNGNGANKKVELSRMSSTKSSGSESD---RSERPVLVRQSTfikeaPSPTLRRKLEESASFESLSPSSRPASPTRSQ 2479
Cdd:PHA03307  220 PAPAPGRSAADDAGASSSDSSSSESSGCGwgpENECPLPRPAPI-----TLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2480 AqtPVLSPSLPDMSLSTHSSVQAGGWRKLPP-NLSPTIEYNDGRPAKRHDIARSHSESPS----RLPINRSGTWKREHSK 2554
Cdd:PHA03307  295 S--PSPSPSSPGSGPAPSSPRASSSSSSSREsSSSSTSSSSESSRGAAVSPGPSPSRSPSpsrpPPPADPSSPRKRPRPS 372
                         330
                  ....*....|....*
gi 190165    2555 HSSSLPRVSTWRRTG 2569
Cdd:PHA03307  373 RAPSSPAASAGRPTR 387
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1372-1394 3.50e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.97  E-value: 3.50e-03
                           10        20
                   ....*....|....*....|....
gi 190165     1372 SPPEHY-VQETPLMFSRCTSVSSL 1394
Cdd:pfam05923    1 DSPKRYcVEGTPANFSRASSLSSL 24
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
134-248 5.18e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.06  E-value: 5.18e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    134 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdltrrqleyeARQIRvAMEEQLgtcQDMEKR 213
Cdd:COG4942   29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAAL--------------------ARRIR-ALEQEL---AALEAE 84
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 190165    214 AQRRIARIQQIEKDILRIRQLLQSQATEAERSSQN 248
Cdd:COG4942   85 LAELEKEIAELRAELEAQKEELAELLRALYRLGRQ 119
PHA03247 PHA03247
large tegument protein UL36; Provisional
2251-2541 6.18e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 6.18e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2251 GPPLKTPASKSPSEGQTATTSPRGAKPSvkselsPVARQTSqIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSI 2330
Cdd:PHA03247 2603 DDRGDPRGPAPPSPLPPDTHAPDPPPPS------PSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQ 2675
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2331 SPgrngiSPPNKLSQ--LPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGLNQMNNG 2408
Cdd:PHA03247 2676 AS-----SPPQRPRRraARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAT 2750
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190165    2409 NGANKKVE--------LSRMSSTKSSGSESDRSERPVLVRQSTFIKEAPSPT--LRRKLEESASFESLSPSSRPAS---- 2474
Cdd:PHA03247 2751 PGGPARPArppttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWdpADPPAAVLAPAAALPPAASPAGplpp 2830
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 190165    2475 PTRSQAQTPVLSPSLPDMSLSTHSSVQAGG--WRKLPPNLSPTIEYNDGRPAKRHDIARSHSESPSRLP 2541
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1257-1274 6.63e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 6.63e-03
                           10
                   ....*....|....*...
gi 190165     1257 ETIQTYCVEDTPICFSRC 1274
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRA 18
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
340-392 8.69e-03

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 8.69e-03
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 190165       340 SQDSCISMRQSGCLPLLIQLLHgndkdsvllgnsRGSKEARARASAALHNIIH 392
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLK------------SEDEEVVKEAAWALSNLSS 41
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
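
Every hit block on this page carries a header of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, and the preset options list an E-value threshold of 0.01. As a rough sketch of how those headers could be read and compared against that threshold, assuming the header layout seen on this page (the regular expression and the names `parse_header` and `E_VALUE_THRESHOLD` are hypothetical, not an NCBI API):

```python
import re

E_VALUE_THRESHOLD = 0.01  # value taken from the preset options on this page

HEADER = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"          # optional "[Multi-domain]" marker
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_header(line: str):
    """Return the numeric fields of one hit header, or None if it does not match."""
    match = HEADER.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


headers = [
    "Pssm-ID: 406927  Cd Length: 54  Bit Score: 98.71  E-value: 1.23e-24",
    "Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 36.25  E-value: 8.69e-03",
]
hits = [parse_header(h) for h in headers]
# Hits whose reported E-value falls under the page's threshold.
passing = [h for h in hits if h is not None and h["e_value"] < E_VALUE_THRESHOLD]
```

Both sample headers are copied from hits listed above; the strict `<` comparison is a choice made for this sketch, since the page does not say whether the threshold is inclusive.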
