NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|25150345|ref|NP_493575|]
View 

Mediator of RNA polymerase II transcription subunit 23 [Caenorhabditis elegans]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Med23 pfam11573
Mediator complex subunit 23; Med23 is one of the subunits of the Tail portion of the Mediator ...
25-1347 0e+00

Mediator complex subunit 23; Med23 is one of the subunits of the Tail portion of the Mediator complex that regulates RNA polymerase II activity. Med23 is required for heat-shock-specific gene expression, and has been shown to mediate transcriptional activation of E1A in mice.


:

Pssm-ID: 463299  Cd Length: 1301  Bit Score: 1596.63  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345     25 EVNRIKSQIKSLVEENHTRKFFRPLTSNIGD----ETAILRIQFNNMMSKMEEKEKQSLVKELIKMVHHVAEKNSPDRVF 100
Cdd:pfam11573    1 EVNQVKSIINEILKVEIIEEAFSGFTSNLPDdekeKTAILRKQFNNMMSKMSEEEKESLVKELIKLVHHVAEKNRLDFLF 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    101 vgtnyervvdQLLRYAHQKGAISTNLCAEGLIMTSDFRLCSR-ICQEKWKFINDCIPKIDYKGIRNILRYILEsQLRRLP 179
Cdd:pfam11573   81 ----------QLLEYAVQKGIISARLVCEGLIMSEKLVLCNRlFWQECFKLIRKIIPGVDYKGVREILKYCLE-KARRLP 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    180 YSLSPEQVNEIRIVENVILHIVDRDSNLMPPLITLSEIMRGmPKQALMLP--RLTEKLASLSVHFRPIADLSHVCGRGFV 257
Cdd:pfam11573  150 YSLSPEKVPQLRALENVILYILDRNACLLPAYFIVNEIMKG-YPDAKMWPhwRLAELLSNFVESFRPIAQMVSIIGRSFM 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    258 YPIPLHPSF-YPLTSCWEehgLNASNTIVQPHHTLPYRPEHTSTYLYTLYMILRQPLGKD------SLHPPSKTKTKTNW 330
Cdd:pfam11573  229 LPVVEHSGYaDHLTSPWK---LDPATLKFQLKGNLPYRPELLEPQTYLLRYVLEQPYSRDmvcsmlGLQKQHKQRCKALE 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    331 EMLISVMICeSMAEAESLPETEPIPRYQWDNIVNIVIYGItqhLLVPKTFFNVLKNLIKRCK---YTRARDEVMWIVFQV 407
Cdd:pfam11573  306 EQLVELMIC-AMERSETEPEDEPITHWLWLHLSSQLIYFV---LFQFASFPNIVKSLHEKLAgrdLTRGRDHLMWVLLQF 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    408 VGSLSNLTRLDDAVTeIVELYNELFDGDVvWMGASD-HPALFARFLAAAGTWMILEKDFADK----MPPANETIKSHIKF 482
Cdd:pfam11573  382 ISGSIQKNPLSDFLP-ILKLYDLLYPEKE-PLPVPDyNKPLCTHQMAATCIWIHLLKKAQDEnqklQRPIPDTLKSHHEF 459
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    483 IQDGVDNFDSSNTAM-----LAVLSNIYRTDTKMGKLIVPTLQQMLESIDDTKSLfelsyKRMAVNCFSAFPVEFMEALT 557
Cdd:pfam11573  460 LQHLVLNNDSPNLAMgsdyrIALLCNAYSTNTEYFSRPMPALVETILGNSKSRSP-----GCNAPGPTTPLSMEFLDSLT 534
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    558 FRSKKTLLIQCFQPLRSFSTVRL---PSPAVFETVAKIcesedyeMAVKDMEHLAQRSLhvataADRSSGEHQNVQAKDQ 634
Cdd:pfam11573  535 VHSKMSLIHSIVTHVIKFAQSKSsvaLAPALVETYSRL-------LVYTEIESLGIKGF-----ISQLLPTVFKSQAWGI 602
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    635 CYFLFDFLVYRLPHLHAYSK--YSSTVNALsyftqhiPNNPQNH--QIYRLMEQFLMRRWcwagfhgcitahtQMFGTSY 710
Cdd:pfam11573  603 LHTLLEMFSYRMHHIQPHYRvqLLSHLHSL-------ANVPQTNqtQLHLCVESTALRLI-------------TGLGSSE 662
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    711 KDNTMMRHMAYPKTFSVpDQYPFAINpeifKMAIYSFLRAVKITAQDIAIE---KTMFPTIING--FGWPEKSTSYFPKW 785
Cdd:pfam11573  663 VQPQLSRFLNEPKTFLV-SQESEELN----RALILTLARAIHITGTGTDSEswcKELLETIMQNtpHGWPQHTLSCFPPW 737
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    786 ALDAIKASDTSNAvNTEEILSDVRNTARMHTSLT-PNQFVIRYGEDRDPATSHCMLavlfhfaYNSVDSTYNITSEFYEV 864
Cdd:pfam11573  738 LLEFFKQNNVPKE-NKQQLKKNVEEEYRKWTSMTnENDIIAHFSVPGTPPLFLCLL-------WKMLLETDRITPIAYKV 809
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    865 MEKKTPKEIVVMGNYLVDYIIADAkTQDCNEKTFKNIAKAAALLVFQFQVLRADRLLLSLIMHPATDEDALICIQIANEF 944
Cdd:pfam11573  810 LERIGARALSAHLRKFCDYLVFEF-TNSCGGQHVNKCADALNDMIWKYNIVTIDRLVLCLALRPQEGNEAQVCFFIIQLL 888
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    945 IL-TPEFQERIRWYHQ-NVPKKDHFPTEYIKAIVkYHDAFPEFEACELVRSYDSSSNVHMPTYYGCLIERLLPIMDQYLH 1022
Cdd:pfam11573  889 LLkTTEFRNRVQDFVKeNSPEHWKQPNWHEKHLA-YHRKFPEKFAPEGVAEYDSSSNPYLPTYFGNVCLRFLPVLDIVIH 967
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1023 VALEQ-QGYKLNPQILQSVSMLYKFHAMPIHFMYSVL-FTSHGLMSGPDAK-SFVLAFATQIEECH-----LTEAFEKFn 1094
Cdd:pfam11573  968 RALELpPVSKSLETLLEHLGCLYKFHDRPVTYLYNTLhYYEHKLRDRPDLKrKLVSAIIGSLKDCRppgwaLTEAYEKY- 1046
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1095 HQKSSREQLI-------MELIDRMSASLDFIlTPPPFVAKDWKIAELS-PGAQTLYLACIELMASPHSPETLVAAMINVM 1166
Cdd:pfam11573 1047 HQKSSDEQLIwlpedyyMKLIDRMVDTMDGK-TPPPFPAKDWRFNEFPnPGAHALYVTCVELMALPVSPETVANALIDVV 1125
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1167 QMKPHARPFNI----VNLTALLLTALPAAYSNALHDEFVAVFVNGETANL-----KFEEIVFDNYEESLLlnlPNRARTI 1237
Cdd:pfam11573 1126 QKGYHLIPRNQihlwINAIGLILTALPESYWSVLHDRIVEVLSNPELTNWtypcsPFELFNFDNYHESLL---ENRASYI 1202
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1238 NVISQQYWLHCSLSLLNFFSHEYVPRILEHVKTEKDLWYTLRLVMPYLRRYyenwdtakqmRSQRENfGPLHIVKLVFQK 1317
Cdd:pfam11573 1203 LALAHAVWHHAGIGQLNPFPQFVKEKLLPHVKTEEQLLYLCHLVGPFLQRF----------RAERER-GVLDITKLLYEL 1271
                         1370      1380      1390
                   ....*....|....*....|....*....|
gi 25150345   1318 LGSMAEEGVEIVYEQHLCDLFYNCKYFFAG 1347
Cdd:pfam11573 1272 LEQVDKEGVELRYMDPICDLLYHIKYMFVG 1301
KLF1_2_4_N super family cl41729
N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel ...
1403-1543 1.11e-10

N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF1, KLF2, KLF4, and similar proteins.


The actual alignment was detected with superfamily member cd22056:

Pssm-ID: 425360 [Multi-domain]  Cd Length: 339  Bit Score: 65.06  E-value: 1.11e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345 1403 HVKAHQPLESTPSVSSLPQMQHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHHtsthqmmdtsqHQTIQQQSNHPTQQQL 1482
Cdd:cd22056  203 FMGQQKPKHQMHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHH-----------HHLQYQYMNAPYPPHY 271
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 25150345 1483 QHQIPnmsmhqqmgPQYPGavfhhpsgpvghvpmQYGmghhMQQHPHLPHHQQMPAPMHTM 1543
Cdd:cd22056  272 AHQGA---------PQFHG---------------QYS----VFREPMRVHHQGHPGSMLTP 304
 
Name Accession Description Interval E-value
Med23 pfam11573
Mediator complex subunit 23; Med23 is one of the subunits of the Tail portion of the Mediator ...
25-1347 0e+00

Mediator complex subunit 23; Med23 is one of the subunits of the Tail portion of the Mediator complex that regulates RNA polymerase II activity. Med23 is required for heat-shock-specific gene expression, and has been shown to mediate transcriptional activation of E1A in mice.


Pssm-ID: 463299  Cd Length: 1301  Bit Score: 1596.63  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345     25 EVNRIKSQIKSLVEENHTRKFFRPLTSNIGD----ETAILRIQFNNMMSKMEEKEKQSLVKELIKMVHHVAEKNSPDRVF 100
Cdd:pfam11573    1 EVNQVKSIINEILKVEIIEEAFSGFTSNLPDdekeKTAILRKQFNNMMSKMSEEEKESLVKELIKLVHHVAEKNRLDFLF 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    101 vgtnyervvdQLLRYAHQKGAISTNLCAEGLIMTSDFRLCSR-ICQEKWKFINDCIPKIDYKGIRNILRYILEsQLRRLP 179
Cdd:pfam11573   81 ----------QLLEYAVQKGIISARLVCEGLIMSEKLVLCNRlFWQECFKLIRKIIPGVDYKGVREILKYCLE-KARRLP 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    180 YSLSPEQVNEIRIVENVILHIVDRDSNLMPPLITLSEIMRGmPKQALMLP--RLTEKLASLSVHFRPIADLSHVCGRGFV 257
Cdd:pfam11573  150 YSLSPEKVPQLRALENVILYILDRNACLLPAYFIVNEIMKG-YPDAKMWPhwRLAELLSNFVESFRPIAQMVSIIGRSFM 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    258 YPIPLHPSF-YPLTSCWEehgLNASNTIVQPHHTLPYRPEHTSTYLYTLYMILRQPLGKD------SLHPPSKTKTKTNW 330
Cdd:pfam11573  229 LPVVEHSGYaDHLTSPWK---LDPATLKFQLKGNLPYRPELLEPQTYLLRYVLEQPYSRDmvcsmlGLQKQHKQRCKALE 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    331 EMLISVMICeSMAEAESLPETEPIPRYQWDNIVNIVIYGItqhLLVPKTFFNVLKNLIKRCK---YTRARDEVMWIVFQV 407
Cdd:pfam11573  306 EQLVELMIC-AMERSETEPEDEPITHWLWLHLSSQLIYFV---LFQFASFPNIVKSLHEKLAgrdLTRGRDHLMWVLLQF 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    408 VGSLSNLTRLDDAVTeIVELYNELFDGDVvWMGASD-HPALFARFLAAAGTWMILEKDFADK----MPPANETIKSHIKF 482
Cdd:pfam11573  382 ISGSIQKNPLSDFLP-ILKLYDLLYPEKE-PLPVPDyNKPLCTHQMAATCIWIHLLKKAQDEnqklQRPIPDTLKSHHEF 459
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    483 IQDGVDNFDSSNTAM-----LAVLSNIYRTDTKMGKLIVPTLQQMLESIDDTKSLfelsyKRMAVNCFSAFPVEFMEALT 557
Cdd:pfam11573  460 LQHLVLNNDSPNLAMgsdyrIALLCNAYSTNTEYFSRPMPALVETILGNSKSRSP-----GCNAPGPTTPLSMEFLDSLT 534
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    558 FRSKKTLLIQCFQPLRSFSTVRL---PSPAVFETVAKIcesedyeMAVKDMEHLAQRSLhvataADRSSGEHQNVQAKDQ 634
Cdd:pfam11573  535 VHSKMSLIHSIVTHVIKFAQSKSsvaLAPALVETYSRL-------LVYTEIESLGIKGF-----ISQLLPTVFKSQAWGI 602
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    635 CYFLFDFLVYRLPHLHAYSK--YSSTVNALsyftqhiPNNPQNH--QIYRLMEQFLMRRWcwagfhgcitahtQMFGTSY 710
Cdd:pfam11573  603 LHTLLEMFSYRMHHIQPHYRvqLLSHLHSL-------ANVPQTNqtQLHLCVESTALRLI-------------TGLGSSE 662
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    711 KDNTMMRHMAYPKTFSVpDQYPFAINpeifKMAIYSFLRAVKITAQDIAIE---KTMFPTIING--FGWPEKSTSYFPKW 785
Cdd:pfam11573  663 VQPQLSRFLNEPKTFLV-SQESEELN----RALILTLARAIHITGTGTDSEswcKELLETIMQNtpHGWPQHTLSCFPPW 737
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    786 ALDAIKASDTSNAvNTEEILSDVRNTARMHTSLT-PNQFVIRYGEDRDPATSHCMLavlfhfaYNSVDSTYNITSEFYEV 864
Cdd:pfam11573  738 LLEFFKQNNVPKE-NKQQLKKNVEEEYRKWTSMTnENDIIAHFSVPGTPPLFLCLL-------WKMLLETDRITPIAYKV 809
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    865 MEKKTPKEIVVMGNYLVDYIIADAkTQDCNEKTFKNIAKAAALLVFQFQVLRADRLLLSLIMHPATDEDALICIQIANEF 944
Cdd:pfam11573  810 LERIGARALSAHLRKFCDYLVFEF-TNSCGGQHVNKCADALNDMIWKYNIVTIDRLVLCLALRPQEGNEAQVCFFIIQLL 888
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    945 IL-TPEFQERIRWYHQ-NVPKKDHFPTEYIKAIVkYHDAFPEFEACELVRSYDSSSNVHMPTYYGCLIERLLPIMDQYLH 1022
Cdd:pfam11573  889 LLkTTEFRNRVQDFVKeNSPEHWKQPNWHEKHLA-YHRKFPEKFAPEGVAEYDSSSNPYLPTYFGNVCLRFLPVLDIVIH 967
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1023 VALEQ-QGYKLNPQILQSVSMLYKFHAMPIHFMYSVL-FTSHGLMSGPDAK-SFVLAFATQIEECH-----LTEAFEKFn 1094
Cdd:pfam11573  968 RALELpPVSKSLETLLEHLGCLYKFHDRPVTYLYNTLhYYEHKLRDRPDLKrKLVSAIIGSLKDCRppgwaLTEAYEKY- 1046
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1095 HQKSSREQLI-------MELIDRMSASLDFIlTPPPFVAKDWKIAELS-PGAQTLYLACIELMASPHSPETLVAAMINVM 1166
Cdd:pfam11573 1047 HQKSSDEQLIwlpedyyMKLIDRMVDTMDGK-TPPPFPAKDWRFNEFPnPGAHALYVTCVELMALPVSPETVANALIDVV 1125
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1167 QMKPHARPFNI----VNLTALLLTALPAAYSNALHDEFVAVFVNGETANL-----KFEEIVFDNYEESLLlnlPNRARTI 1237
Cdd:pfam11573 1126 QKGYHLIPRNQihlwINAIGLILTALPESYWSVLHDRIVEVLSNPELTNWtypcsPFELFNFDNYHESLL---ENRASYI 1202
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1238 NVISQQYWLHCSLSLLNFFSHEYVPRILEHVKTEKDLWYTLRLVMPYLRRYyenwdtakqmRSQRENfGPLHIVKLVFQK 1317
Cdd:pfam11573 1203 LALAHAVWHHAGIGQLNPFPQFVKEKLLPHVKTEEQLLYLCHLVGPFLQRF----------RAERER-GVLDITKLLYEL 1271
                         1370      1380      1390
                   ....*....|....*....|....*....|
gi 25150345   1318 LGSMAEEGVEIVYEQHLCDLFYNCKYFFAG 1347
Cdd:pfam11573 1272 LEQVDKEGVELRYMDPICDLLYHIKYMFVG 1301
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
1403-1543 1.11e-10

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 65.06  E-value: 1.11e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345 1403 HVKAHQPLESTPSVSSLPQMQHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHHtsthqmmdtsqHQTIQQQSNHPTQQQL 1482
Cdd:cd22056  203 FMGQQKPKHQMHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHH-----------HHLQYQYMNAPYPPHY 271
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 25150345 1483 QHQIPnmsmhqqmgPQYPGavfhhpsgpvghvpmQYGmghhMQQHPHLPHHQQMPAPMHTM 1543
Cdd:cd22056  272 AHQGA---------PQFHG---------------QYS----VFREPMRVHHQGHPGSMLTP 304
Activator_LAG-3 pfam11498
Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of ...
1421-1527 3.71e-09

Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of growth, differentiation and patterning in animal development, relies on either of the receptors GLP-1 or LIN-12. Both these receptors promote signalling by the recruitment of LAG-3 to target promoters, where it then acts as a transcriptional activator. LAG-3 works as a ternary complex together with the DNA binding protein, LAG-1. Its N-terminal region adopts an elongated kinked helix that is required for complex assembly.


Pssm-ID: 151935 [Multi-domain]  Cd Length: 476  Bit Score: 61.13  E-value: 3.71e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1421 QMQHHLQQAPLLPSHQMMPPPQQH---------SSSLQHHLQHHTSTHQMMDTSQHQTIQQQSNHPTQQQLQHQIPnmSM 1491
Cdd:pfam11498  361 QQQEHQQQQMLLQQQQQMHQLQQHhqmngggqfATQAHQHAAYLQQMQHMRLQEQIQHQQQQAQHHQQAQQQHQQP--AQ 438
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 25150345   1492 HQQMGPQYPGAVFHHPSgpvGHVPmQYGmGHHMQQH 1527
Cdd:pfam11498  439 HGQMGYGIPNGYPAHMH---GHAP-AYG-AHHMPHH 469
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1419-1552 3.59e-06

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 48.63  E-value: 3.59e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    1419 LPQMQHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHHTSTHQMMDTSQHQ-TIQQQSNHPTQQQlqhqipnmsMHQQMGP 1497
Cdd:smart00818   40 IPVSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQpNLPQPAQQPFQPQ---------PLQPPQP 110
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*
gi 25150345    1498 QYPgavfhhpsgpvghvpmqygmghhMQQHPhlPHHQQMPAPMHTMNPMMQNMTP 1552
Cdd:smart00818  111 QQP-----------------------MQPQP--PVHPIPPLPPQPPLPPMFPMQP 140
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1407-1507 3.64e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.46  E-value: 3.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345  1407 HQPL-------ESTPSVSSLPQMQHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHHTSTHQMMDTSQHQTIQQQSNHPTQ 1479
Cdd:PRK10263  741 HEPLftpivepVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 820
                          90       100
                  ....*....|....*....|....*...
gi 25150345  1480 QQLQHQIPNMSMHQQMGPQYPGAVFHHP 1507
Cdd:PRK10263  821 QQPVAPQPQYQQPQQPVAPQPQDTLLHP 848
 
Name Accession Description Interval E-value
Med23 pfam11573
Mediator complex subunit 23; Med23 is one of the subunits of the Tail portion of the Mediator ...
25-1347 0e+00

Mediator complex subunit 23; Med23 is one of the subunits of the Tail portion of the Mediator complex that regulates RNA polymerase II activity. Med23 is required for heat-shock-specific gene expression, and has been shown to mediate transcriptional activation of E1A in mice.


Pssm-ID: 463299  Cd Length: 1301  Bit Score: 1596.63  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345     25 EVNRIKSQIKSLVEENHTRKFFRPLTSNIGD----ETAILRIQFNNMMSKMEEKEKQSLVKELIKMVHHVAEKNSPDRVF 100
Cdd:pfam11573    1 EVNQVKSIINEILKVEIIEEAFSGFTSNLPDdekeKTAILRKQFNNMMSKMSEEEKESLVKELIKLVHHVAEKNRLDFLF 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    101 vgtnyervvdQLLRYAHQKGAISTNLCAEGLIMTSDFRLCSR-ICQEKWKFINDCIPKIDYKGIRNILRYILEsQLRRLP 179
Cdd:pfam11573   81 ----------QLLEYAVQKGIISARLVCEGLIMSEKLVLCNRlFWQECFKLIRKIIPGVDYKGVREILKYCLE-KARRLP 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    180 YSLSPEQVNEIRIVENVILHIVDRDSNLMPPLITLSEIMRGmPKQALMLP--RLTEKLASLSVHFRPIADLSHVCGRGFV 257
Cdd:pfam11573  150 YSLSPEKVPQLRALENVILYILDRNACLLPAYFIVNEIMKG-YPDAKMWPhwRLAELLSNFVESFRPIAQMVSIIGRSFM 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    258 YPIPLHPSF-YPLTSCWEehgLNASNTIVQPHHTLPYRPEHTSTYLYTLYMILRQPLGKD------SLHPPSKTKTKTNW 330
Cdd:pfam11573  229 LPVVEHSGYaDHLTSPWK---LDPATLKFQLKGNLPYRPELLEPQTYLLRYVLEQPYSRDmvcsmlGLQKQHKQRCKALE 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    331 EMLISVMICeSMAEAESLPETEPIPRYQWDNIVNIVIYGItqhLLVPKTFFNVLKNLIKRCK---YTRARDEVMWIVFQV 407
Cdd:pfam11573  306 EQLVELMIC-AMERSETEPEDEPITHWLWLHLSSQLIYFV---LFQFASFPNIVKSLHEKLAgrdLTRGRDHLMWVLLQF 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    408 VGSLSNLTRLDDAVTeIVELYNELFDGDVvWMGASD-HPALFARFLAAAGTWMILEKDFADK----MPPANETIKSHIKF 482
Cdd:pfam11573  382 ISGSIQKNPLSDFLP-ILKLYDLLYPEKE-PLPVPDyNKPLCTHQMAATCIWIHLLKKAQDEnqklQRPIPDTLKSHHEF 459
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    483 IQDGVDNFDSSNTAM-----LAVLSNIYRTDTKMGKLIVPTLQQMLESIDDTKSLfelsyKRMAVNCFSAFPVEFMEALT 557
Cdd:pfam11573  460 LQHLVLNNDSPNLAMgsdyrIALLCNAYSTNTEYFSRPMPALVETILGNSKSRSP-----GCNAPGPTTPLSMEFLDSLT 534
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    558 FRSKKTLLIQCFQPLRSFSTVRL---PSPAVFETVAKIcesedyeMAVKDMEHLAQRSLhvataADRSSGEHQNVQAKDQ 634
Cdd:pfam11573  535 VHSKMSLIHSIVTHVIKFAQSKSsvaLAPALVETYSRL-------LVYTEIESLGIKGF-----ISQLLPTVFKSQAWGI 602
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    635 CYFLFDFLVYRLPHLHAYSK--YSSTVNALsyftqhiPNNPQNH--QIYRLMEQFLMRRWcwagfhgcitahtQMFGTSY 710
Cdd:pfam11573  603 LHTLLEMFSYRMHHIQPHYRvqLLSHLHSL-------ANVPQTNqtQLHLCVESTALRLI-------------TGLGSSE 662
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    711 KDNTMMRHMAYPKTFSVpDQYPFAINpeifKMAIYSFLRAVKITAQDIAIE---KTMFPTIING--FGWPEKSTSYFPKW 785
Cdd:pfam11573  663 VQPQLSRFLNEPKTFLV-SQESEELN----RALILTLARAIHITGTGTDSEswcKELLETIMQNtpHGWPQHTLSCFPPW 737
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    786 ALDAIKASDTSNAvNTEEILSDVRNTARMHTSLT-PNQFVIRYGEDRDPATSHCMLavlfhfaYNSVDSTYNITSEFYEV 864
Cdd:pfam11573  738 LLEFFKQNNVPKE-NKQQLKKNVEEEYRKWTSMTnENDIIAHFSVPGTPPLFLCLL-------WKMLLETDRITPIAYKV 809
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    865 MEKKTPKEIVVMGNYLVDYIIADAkTQDCNEKTFKNIAKAAALLVFQFQVLRADRLLLSLIMHPATDEDALICIQIANEF 944
Cdd:pfam11573  810 LERIGARALSAHLRKFCDYLVFEF-TNSCGGQHVNKCADALNDMIWKYNIVTIDRLVLCLALRPQEGNEAQVCFFIIQLL 888
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    945 IL-TPEFQERIRWYHQ-NVPKKDHFPTEYIKAIVkYHDAFPEFEACELVRSYDSSSNVHMPTYYGCLIERLLPIMDQYLH 1022
Cdd:pfam11573  889 LLkTTEFRNRVQDFVKeNSPEHWKQPNWHEKHLA-YHRKFPEKFAPEGVAEYDSSSNPYLPTYFGNVCLRFLPVLDIVIH 967
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1023 VALEQ-QGYKLNPQILQSVSMLYKFHAMPIHFMYSVL-FTSHGLMSGPDAK-SFVLAFATQIEECH-----LTEAFEKFn 1094
Cdd:pfam11573  968 RALELpPVSKSLETLLEHLGCLYKFHDRPVTYLYNTLhYYEHKLRDRPDLKrKLVSAIIGSLKDCRppgwaLTEAYEKY- 1046
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1095 HQKSSREQLI-------MELIDRMSASLDFIlTPPPFVAKDWKIAELS-PGAQTLYLACIELMASPHSPETLVAAMINVM 1166
Cdd:pfam11573 1047 HQKSSDEQLIwlpedyyMKLIDRMVDTMDGK-TPPPFPAKDWRFNEFPnPGAHALYVTCVELMALPVSPETVANALIDVV 1125
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1167 QMKPHARPFNI----VNLTALLLTALPAAYSNALHDEFVAVFVNGETANL-----KFEEIVFDNYEESLLlnlPNRARTI 1237
Cdd:pfam11573 1126 QKGYHLIPRNQihlwINAIGLILTALPESYWSVLHDRIVEVLSNPELTNWtypcsPFELFNFDNYHESLL---ENRASYI 1202
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1238 NVISQQYWLHCSLSLLNFFSHEYVPRILEHVKTEKDLWYTLRLVMPYLRRYyenwdtakqmRSQRENfGPLHIVKLVFQK 1317
Cdd:pfam11573 1203 LALAHAVWHHAGIGQLNPFPQFVKEKLLPHVKTEEQLLYLCHLVGPFLQRF----------RAERER-GVLDITKLLYEL 1271
                         1370      1380      1390
                   ....*....|....*....|....*....|
gi 25150345   1318 LGSMAEEGVEIVYEQHLCDLFYNCKYFFAG 1347
Cdd:pfam11573 1272 LEQVDKEGVELRYMDPICDLLYHIKYMFVG 1301
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
1403-1543 1.11e-10

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 65.06  E-value: 1.11e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345 1403 HVKAHQPLESTPSVSSLPQMQHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHHtsthqmmdtsqHQTIQQQSNHPTQQQL 1482
Cdd:cd22056  203 FMGQQKPKHQMHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHH-----------HHLQYQYMNAPYPPHY 271
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 25150345 1483 QHQIPnmsmhqqmgPQYPGavfhhpsgpvghvpmQYGmghhMQQHPHLPHHQQMPAPMHTM 1543
Cdd:cd22056  272 AHQGA---------PQFHG---------------QYS----VFREPMRVHHQGHPGSMLTP 304
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
1430-1540 8.58e-10

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 62.37  E-value: 8.58e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345 1430 PLLPSHQMMPPPQQHSSSLQHHLQHHTSTHQMMDTSQHQTIQQQSNHPTQQQLQHQIPNMSMHQQMGPQYPgAVFHhpsg 1509
Cdd:cd22056  206 QQKPKHQMHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQYQYMNAPYPPHYAHQGA-PQFH---- 280
                         90       100       110
                 ....*....|....*....|....*....|..
gi 25150345 1510 pvGhvpmQYGMgHHMQQHPHLPHHQ-QMPAPM 1540
Cdd:cd22056  281 --G----QYSV-FREPMRVHHQGHPgSMLTPP 305
Activator_LAG-3 pfam11498
Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of ...
1421-1527 3.71e-09

Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of growth, differentiation and patterning in animal development, relies on either of the receptors GLP-1 or LIN-12. Both these receptors promote signalling by the recruitment of LAG-3 to target promoters, where it then acts as a transcriptional activator. LAG-3 works as a ternary complex together with the DNA binding protein, LAG-1. Its N-terminal region adopts an elongated kinked helix that is required for complex assembly.


Pssm-ID: 151935 [Multi-domain]  Cd Length: 476  Bit Score: 61.13  E-value: 3.71e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1421 QMQHHLQQAPLLPSHQMMPPPQQH---------SSSLQHHLQHHTSTHQMMDTSQHQTIQQQSNHPTQQQLQHQIPnmSM 1491
Cdd:pfam11498  361 QQQEHQQQQMLLQQQQQMHQLQQHhqmngggqfATQAHQHAAYLQQMQHMRLQEQIQHQQQQAQHHQQAQQQHQQP--AQ 438
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 25150345   1492 HQQMGPQYPGAVFHHPSgpvGHVPmQYGmGHHMQQH 1527
Cdd:pfam11498  439 HGQMGYGIPNGYPAHMH---GHAP-AYG-AHHMPHH 469
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
1405-1537 2.61e-08

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 58.89  E-value: 2.61e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1405 KAHQPLESTPSVSSLPQMQHH--LQQAPLLPSHQMMPPPQQHSSsLQHHLQHHtsthqMMDTSQHQTIQQ----QSNHPT 1478
Cdd:pfam09770  210 PAQQPAPAPAQPPAAPPAQQAqqQQQFPPQIQQQQQPQQQPQQP-QQHPGQGH-----PVTILQRPQSPQpdpaQPSIQP 283
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 25150345   1479 QQQLQH-----------QI---PNMSMHQQMGPQYPGAVFHHPSGPVGHVPMQYGMGHHMQQHPHLPHHQQMP 1537
Cdd:pfam09770  284 QAQQFHqqpppvpvqptQIlqnPNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQQLAQLS 356
Activator_LAG-3 pfam11498
Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of ...
1422-1544 1.70e-07

Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of growth, differentiation and patterning in animal development, relies on either of the receptors GLP-1 or LIN-12. Both these receptors promote signalling by the recruitment of LAG-3 to target promoters, where it then acts as a transcriptional activator. LAG-3 works as a ternary complex together with the DNA binding protein, LAG-1. Its N-terminal region adopts an elongated kinked helix that is required for complex assembly.


Pssm-ID: 151935 [Multi-domain]  Cd Length: 476  Bit Score: 55.74  E-value: 1.70e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1422 MQHHLQQAPLLPSHQMMPPPQQHSsslQHHLQHHtstHQMMDTSQHQT--------IQQQSNHPTQQQLQHQIPNMSMHQ 1493
Cdd:pfam11498  354 MHQHQQQQQQEHQQQQMLLQQQQQ---MHQLQQH---HQMNGGGQFATqahqhaayLQQMQHMRLQEQIQHQQQQAQHHQ 427
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 25150345   1494 QMGPQYPGAVFHHpsgpvghvPMQYGMGHHMQQHPHL------PHHQQMPAPMHTMN 1544
Cdd:pfam11498  428 QAQQQHQQPAQHG--------QMGYGIPNGYPAHMHGhapaygAHHMPHHTAFANIN 476
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1414-1549 3.60e-07

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 55.01  E-value: 3.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1414 PSVSSLPQMQHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHHTSTH--QMMDTSQHqtIQQQSNHPTQQQLQhqipnmsm 1491
Cdd:pfam09606  370 PGMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHpgGMIPSPAL--IPSPSPQMSQQPAQ-------- 439
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 25150345   1492 hQQMGPQY-PGAvfhhPSGPVGHVPMQYGMGHHMQQ--HPHLPHHQQMPAPMHTMNPMMQN 1549
Cdd:pfam09606  440 -QRTIGQDsPGG----SLNTPGQSAVNSPLNPQEEQlyREKYRQLTKYIEPLKRMIAKMEN 495
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1419-1552 3.59e-06

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 48.63  E-value: 3.59e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    1419 LPQMQHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHHTSTHQMMDTSQHQ-TIQQQSNHPTQQQlqhqipnmsMHQQMGP 1497
Cdd:smart00818   40 IPVSQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQpNLPQPAQQPFQPQ---------PLQPPQP 110
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*
gi 25150345    1498 QYPgavfhhpsgpvghvpmqygmghhMQQHPhlPHHQQMPAPMHTMNPMMQNMTP 1552
Cdd:smart00818  111 QQP-----------------------MQPQP--PVHPIPPLPPQPPLPPMFPMQP 140
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1404-1552 3.69e-06

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 51.93  E-value: 3.69e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1404 VKAHQPLESTPSVSSLP----QMQHHLQQAPLLPSH----QMMPPPQQHSSSLQHHLQHHTSTHQMMDTSQHQTIQQQSN 1475
Cdd:pfam09606  242 AMQQQQPQQQGQQSQLGmginQMQQMPQGVGGGAGQggpgQPMGPPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGN 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1476 HPTQQQLQhqiPNMSMHQ------------------------QMGPQYPGAVFHHPSG--------------------PV 1511
Cdd:pfam09606  322 HPAAHQQQ---MNQSVGQggqvvalgglnhletwnpgnfgglGANPMQRGQPGMMSSPspvpgqqvrqvtpnqfmrqsPQ 398
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 25150345   1512 GHVPMQYGMG-HHMQQHPH----LPHHQQMPAPMHTMNPMMQNMTP 1552
Cdd:pfam09606  399 PSVPSPQGPGsQPPQSHPGgmipSPALIPSPSPQMSQQPAQQRTIG 444
KLF4_N cd21582
N-terminal domain of Kruppel-like factor 4; Kruppel-like factor 4 (KLF4; also known as ...
1404-1518 1.18e-05

N-terminal domain of Kruppel-like factor 4; Kruppel-like factor 4 (KLF4; also known as Krueppel-like factor 4 or gut-enriched Kruppel-like factor/GKLF) is a protein that, in humans, is encoded by the KLF4 gene. Evidence also suggests that KLF4 is a tumor suppressor in certain cancers, including colorectal cancer, gastric cancer, esophageal squamous cell carcinoma, intestinal cancer, prostate cancer, bladder cancer and lung cancer. It may act as a tumor promoter where increased KLF4 expression has been reported, such as in oral squamous cell carcinoma and in primary breast ductal carcinoma. KLF4 is one of four key factors that are essential for inducing pluripotent stem cells. KLF4 is highly expressed in non-dividing cells and its overexpression induces cell cycle arrest. KLF proteins KLF1, KLF2, KLF4, KLF5, KLF6, and KLF7 are transcriptional activators. KLF4 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF4, which is related to the N-terminal domains of KLF1 and KLF2.


Pssm-ID: 409228 [Multi-domain]  Cd Length: 335  Bit Score: 49.31  E-value: 1.18e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345 1404 VKAHQPLESTPSV--SSLPQMQHHLQQAPLL------PSHQMMPPPQQHSSSLQHHLQHHTSTHQM---MDTSQHQTIQQ 1472
Cdd:cd21582  185 VSKSAPMTKSSVApsSSLPFMCPRIKQENPStctisrPMDGHLGGNSQHGFSQRAPLPSRTTPSGGpggGNSSTAESLMS 264
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*.
gi 25150345 1473 QSNHPTQQQLQHQIPNMSMHQQMGPQYPGavFHHPSGPvghvPMQY 1518
Cdd:cd21582  265 RDHHPSSQVLSHPPLPLPQGYHPSPGYPP--FPPPPSQ----PQQY 304
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1407-1500 2.51e-05

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 46.32  E-value: 2.51e-05
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    1407 HQPLESTPSVSSLPQMQHHLQQAPLL--PSHQMMPPPQQHSSSLQHHLQHHTSTHQMMDTSQHQTIQQQSNHPTQQQLQH 1484
Cdd:smart00818   49 THTLQPHHHIPVLPAQQPVVPQQPLMpvPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPP 128
                            90
                    ....*....|....*.
gi 25150345    1485 QIPNMSMHqQMGPQYP 1500
Cdd:smart00818  129 QPPLPPMF-PMQPLPP 143
Activator_LAG-3 pfam11498
Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of ...
1411-1552 1.51e-04

Transcriptional activator LAG-3; The C.elegans Notch pathway, involved in the control of growth, differentiation and patterning in animal development, relies on either of the receptors GLP-1 or LIN-12. Both these receptors promote signalling by the recruitment of LAG-3 to target promoters, where it then acts as a transcriptional activator. LAG-3 works as a ternary complex together with the DNA binding protein, LAG-1. Its N-terminal region adopts an elongated kinked helix that is required for complex assembly.


Pssm-ID: 151935 [Multi-domain]  Cd Length: 476  Bit Score: 46.11  E-value: 1.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1411 ESTPSVSSLPQM--QHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHhtsthQMMdtSQHQTiQQQSNHPTQQQL---QHQ 1485
Cdd:pfam11498  306 DRMPQSAPPPAMnpQHIAQLAQQQNKMRLLQQQEMEMQRIEQQRQQ-----QIM--HQHQQ-QQQQEHQQQQMLlqqQQQ 377
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 25150345   1486 IPNMSMHQQM--GPQYPGAVFHHPS--GPVGHVPMQYGMGHHMQQ---HPHLPHHQQMPAPMHTMNPMMQNMTP 1552
Cdd:pfam11498  378 MHQLQQHHQMngGGQFATQAHQHAAylQQMQHMRLQEQIQHQQQQaqhHQQAQQQHQQPAQHGQMGYGIPNGYP 451
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1408-1552 2.72e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 45.77  E-value: 2.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1408 QPLESTPSVSSlPQMQHHLQQAPLLPSHQMMPPPQQHSSSL-----------QHHLQHHTSTH-----QMMDTSQHQTIQ 1471
Cdd:pfam09606  165 QPGSGTPNQMG-PNGGPGQGQAGGMNGGQQGPMGGQMPPQMgvpgmpgpadaGAQMGQQAQANggmnpQQMGGAPNQVAM 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1472 QQSnhPTQQQLQHQIPNMSMHQ--QMGPQYPGAVFHH-PSGPVGHVPMQYG-MGHHMQ-QHPHLPHHQQMPAPMHTMNPM 1546
Cdd:pfam09606  244 QQQ--QPQQQGQQSQLGMGINQmqQMPQGVGGGAGQGgPGQPMGPPGQQPGaMPNVMSiGDQNNYQQQQTRQQQQQQGGN 321

                   ....*.
gi 25150345   1547 MQNMTP 1552
Cdd:pfam09606  322 HPAAHQ 327
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1407-1507 3.64e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.46  E-value: 3.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345  1407 HQPL-------ESTPSVSSLPQMQHHLQQAPLLPSHQMMPPPQQHSSSLQHHLQHHTSTHQMMDTSQHQTIQQQSNHPTQ 1479
Cdd:PRK10263  741 HEPLftpivepVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 820
                          90       100
                  ....*....|....*....|....*...
gi 25150345  1480 QQLQHQIPNMSMHQQMGPQYPGAVFHHP 1507
Cdd:PRK10263  821 QQPVAPQPQYQQPQQPVAPQPQDTLLHP 848
Med12-PQL pfam12144
Eukaryotic Mediator 12 catenin-binding domain; This domain is found in eukaryotes, and is ...
1413-1543 4.90e-04

Eukaryotic Mediator 12 catenin-binding domain; This domain is found in eukaryotes, and is typically between 325 and 354 amino acids in length. Both development and carcinogenesis are driven by signal transduction within the canonical Wnt/beta-catenin pathway through both programmed and unprogrammed changes in gene transcription. Beta-catenin physically and functionally targets this PQL (proline-, glutamine-, leucine-rich) region of the Med12 subunit of Mediator to activate transcription. The beta-catenin transactivation domain binds directly to isolated Med12 and intact Mediator both in vitro and in vivo, and Mediator is recruited to Wnt-responsive genes in a beta-catenin-dependent manner.


Pssm-ID: 463471 [Multi-domain]  Cd Length: 210  Bit Score: 43.29  E-value: 4.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1413 TPSVSSLPQMQHHLQQAPLLPShqMMPPPQQHSSSLQHHL-------QHHTSTHQMMDTSQHQTIQQQSNHPTQQQLQHQ 1485
Cdd:pfam12144   39 PPYRPVRKPLSKMLPTRPNYPG--MMPTMMGSIMSLDQQLykmklykQQPTMPQGQLLRQQLQARLLQQGMLGQQQARQM 116
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1486 IPNMSM-HQQMGPQ-YPGAVFH-----HPSGPVGHVPMQYG-----MGH------------HMQQHPHLPHHQQMPAPMH 1541
Cdd:pfam12144  117 PPNPSYgSLQLMPQgYTMYGSHmglqqHPSQAGGMVPPSYSsqayqGAHpgsnpalvdplrQMQQRPSGYVHQQAPGYLH 196

                   ..
gi 25150345   1542 TM 1543
Cdd:pfam12144  197 PL 198
EIF4E-T pfam10477
Nucleocytoplasmic shuttling protein for mRNA cap-binding EIF4E; EIF4E-T is the transporter ...
1356-1551 9.30e-04

Nucleocytoplasmic shuttling protein for mRNA cap-binding EIF4E; EIF4E-T is the transporter protein for shuttling the mRNA cap-binding protein EIF4E protein, targeting it for nuclear import. EIF4E-T contains several key binding domains including two functional leucine-rich NESs (nuclear export signals) between residues 438-447 and 613-638 in the human protein. The other two binding domains are an EIF4E-binding site, between residues 27-42 in Q9EST3, and a bipartite NLS (nuclear localization signals) between 194-211, and these lie in family EIF4E-T_N. EIF4E is the eukaryotic translation initiation factor 4E that is the rate-limiting factor for cap-dependent translation initiation.


Pssm-ID: 371079  Cd Length: 646  Bit Score: 43.85  E-value: 9.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1356 TEFAKLPEKM--RDRLKFYVSQSEPTAEQETPPEKEKSPEKEKEQEQEQHVKAHQPLESTPSVSSLPQM-QHHLQQAPLL 1432
Cdd:pfam10477  453 EAFNKLLQQLgsQGTQHHSVNDSAPTINDSNAGQLANQPVPYHIQENHQDQQAKFLGVHTPNIFAQPNMeIQHLIQELVR 532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1433 P--SHQMMPPPQQHSSSLQH---------HLQHHTSTHQMMDTSQHQTIQQQSNHptQQQLQHqipnmsMHQQMGPQYPG 1501
Cdd:pfam10477  533 PdiSHEFLEKELSNPSTLGHtkdviaavlRECSNGMRNTMSPKPQQNVMTPQSLH--QAQLQP------LQQPQQLALQQ 604
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 25150345   1502 AVFHHPSGpvghvpmqygmGHHMQQHphlPHHQQMPAPMH-TMNPMMQNMT 1551
Cdd:pfam10477  605 DLFYQNDG-----------NQNRQQH---IRHSNSPTPLAfTPTSVLRKMT 641
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1466-1554 1.18e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 41.31  E-value: 1.18e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345    1466 QHQTIQQQSNHPTQQQLQ--HQIPNMSMHQQMGPQypgavfhHPSGPVghvPMQYGMGHHMQQHPHLPHHQQMPAPMhtm 1543
Cdd:smart00818   36 HHQIIPVSQQHPPTHTLQphHHIPVLPAQQPVVPQ-------QPLMPV---PGQHSMTPTQHHQPNLPQPAQQPFQP--- 102
                            90
                    ....*....|.
gi 25150345    1544 nPMMQNMTPQQ 1554
Cdd:smart00818  103 -QPLQPPQPQQ 112
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
1413-1587 3.01e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 42.33  E-value: 3.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1413 TPSVSSLPQMQHhLQQAPLLPSHQMM--------------PPPQQHSSSLQHHLQHHTSthqmmdtsQHQTIQQQSNHPT 1478
Cdd:pfam09770  170 AAAPAPAPQPAA-QPASLPAPSRKMMsleeveaamraqakKPAQQPAPAPAQPPAAPPA--------QQAQQQQQFPPQI 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1479 QQQLQHQIPNMSMHQQMGPQYPGAVFHHPSGPVGHVPMQygmghHMQQHPHLPHHQQMPAPMHTM----NPMMQNMTPQQ 1554
Cdd:pfam09770  241 QQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQP-----SIQPQAQQFHQQPPPVPVQPTqilqNPNRLSAARVG 315
                          170       180       190
                   ....*....|....*....|....*....|...
gi 25150345   1555 QYLYMQQLQQHQQHQQYMQQQQQHHHQHQQQPH 1587
Cdd:pfam09770  316 YPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITH 348
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1479-1547 6.49e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 39.39  E-value: 6.49e-03
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 25150345    1479 QQQLQHQIPN-----MS--MHQQMGPQYPGAVFHHPSGPVGHVPMQYgmghhmQQHPHLPHHQQMPAPMHtmNPMM 1547
Cdd:smart00818   16 QSMIRHPYPSygyepMGgwLHHQIIPVSQQHPPTHTLQPHHHIPVLP------AQQPVVPQQPLMPVPGQ--HSMT 83
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1413-1552 7.51e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.91  E-value: 7.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345   1413 TPSVSSLPQMqhhlqqapllPSHQMMPPPQQHSSSLQHH--LQHHTSTHQMMDTSQHQTIQQQSNHPTQQQLQHQ-IPNM 1489
Cdd:pfam03154  200 TPSAPSVPPQ----------GSPATSQPPNQTQSTAAPHtlIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQpLPQP 269
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 25150345   1490 SMHQQMGPqypgavfhhpsgpvghvpmqygMGHHMQQHPHLPHHQQMPAPMHTMNPMMQNMTP 1552
Cdd:pfam03154  270 SLHGQMPP----------------------MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVP 310
PRK10350 PRK10350
DUF2756 family protein;
1427-1510 7.59e-03

DUF2756 family protein;


Pssm-ID: 182398 [Multi-domain]  Cd Length: 145  Bit Score: 38.84  E-value: 7.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 25150345  1427 QQAPLLPSHQ-----MMPPPQQHSSSLQHHLQHHTSthqmmdtSQHQTIQQQSNHPTQQQLQHQIPNMSMHQQMGPQYPG 1501
Cdd:PRK10350   28 QPGYQIPSQQrmqtqMQTQQIQQKGMLNQQLKTQTR-------LQQQHLQNQINNNSQRVQQGQPGNNPARQQMLPNTNG 100

                  ....*....
gi 25150345  1502 AVFHHPSGP 1510
Cdd:PRK10350  101 GMLNSNRNP 109
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH