NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|160333362|ref|NP_001103821|]
View 

paired amphipathic helix protein Sin3a isoform 1 [Mus musculus]

Protein Classification

Sin3 family protein( domain architecture ID 1001704)

Sin3 family protein similar to Homo sapiens paired amphipathic helix proteins Sin3a and Sin3b, which act as transcriptional repressors

CATH:  1.20.1160.11
Gene Ontology:  GO:0003714

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Sin3 super family cl35027
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
8-1214 6.17e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


The actual alignment was detected with superfamily member COG5602:

Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 533.73  E-value: 6.17e-169
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362    8 QESPVYAAQQRRIPGSTEAFSHQHRVLAPAPPVYEAVSEtMQSATGIQYSvapnyQVSAVPQSSGSHGPAIAAVHSSHHH 87
Cdd:COG5602    37 RDHDVNTSLQNSEPWPDNGSARKPSYSPESMSSYGGPMD-ERSGTRRYYE-----DVEQILSSLAGDNPDPSTDHGSASN 110
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362   88 ptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISR 167
Cdd:COG5602   111 ------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKSQAIDTPGVIER 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  168 VSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpssqsaptpaqpap 243
Cdd:COG5602   173 VSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP------------------------------- 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  244 qptAAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVNKIKNRFQGQPD 323
Cdd:COG5602   222 ---PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVNKVKVRFQNNPE 288
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  324 IYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSvRND 403
Cdd:COG5602   289 MYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTANAQKPSK-RLP 354
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  404 HGGTVKKPqlnnkPQRPSQNGCQIRRHSGTGATPPVKKKPKLM-SLKESSMADA----SKHGVGTESLFFDKVRKALRSA 478
Cdd:COG5602   355 PIGSFSLP-----TAAPEQNRPSLLWESPRSISNISRYRADLLtSFSRNFVPIRihmpISHLSNESEDFFERIKQYLKDK 429
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  479 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVHlESFPKERategiameIDYASCKRLGSS 558
Cdd:COG5602   430 NLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEK-EPLRETR--------KDLEKFEKCGPS 500
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  559 YRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAI 637
Cdd:COG5602   501 YRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQRTIKALEQI 580
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  638 QKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWREAQRGFNK 716
Cdd:COG5602   581 IDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWRSCKREWNK 660
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  717 VWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQILEDAAALII 796
Cdd:COG5602   661 IWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIFVNILDLKD 736
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  797 HHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEEemdvdeatgapKKHNGVGGSP------------ 862
Cdd:COG5602   737 TLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLNV-----------KGLNIDGLKAdrssrsdesaqr 805
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  863 ------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQILCLRLLRic 911
Cdd:COG5602   806 yskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDSYERLYE-- 882
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  912 sqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIH 991
Cdd:COG5602   883 ----------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKYLRPFYNNT 931
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  992 AYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlnsqtsRSLLESAYQRKAEQLMS-DENCFKLMFI 1070
Cdd:COG5602   932 CYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSK---------RKQDQTKYRLEVESLLNpDEILFRFCWI 1002
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362 1071 QSQGQVQLTVELLD--TEEENSDDpveAERWSDYVERYMSSDTTspELREHLAQKPVFLPRNLRRIRKC-QRGREQQEke 1147
Cdd:COG5602  1003 NKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQLVSRLQT-- 1075
                        1210      1220      1230      1240      1250      1260
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 160333362 1148 gkegnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1214
Cdd:COG5602  1076 -------KLLRSHELVSGLQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
 
Name Accession Description Interval E-value
Sin3 COG5602
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
8-1214 6.17e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 533.73  E-value: 6.17e-169
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362    8 QESPVYAAQQRRIPGSTEAFSHQHRVLAPAPPVYEAVSEtMQSATGIQYSvapnyQVSAVPQSSGSHGPAIAAVHSSHHH 87
Cdd:COG5602    37 RDHDVNTSLQNSEPWPDNGSARKPSYSPESMSSYGGPMD-ERSGTRRYYE-----DVEQILSSLAGDNPDPSTDHGSASN 110
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362   88 ptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISR 167
Cdd:COG5602   111 ------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKSQAIDTPGVIER 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  168 VSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpssqsaptpaqpap 243
Cdd:COG5602   173 VSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP------------------------------- 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  244 qptAAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVNKIKNRFQGQPD 323
Cdd:COG5602   222 ---PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVNKVKVRFQNNPE 288
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  324 IYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSvRND 403
Cdd:COG5602   289 MYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTANAQKPSK-RLP 354
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  404 HGGTVKKPqlnnkPQRPSQNGCQIRRHSGTGATPPVKKKPKLM-SLKESSMADA----SKHGVGTESLFFDKVRKALRSA 478
Cdd:COG5602   355 PIGSFSLP-----TAAPEQNRPSLLWESPRSISNISRYRADLLtSFSRNFVPIRihmpISHLSNESEDFFERIKQYLKDK 429
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  479 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVHlESFPKERategiameIDYASCKRLGSS 558
Cdd:COG5602   430 NLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEK-EPLRETR--------KDLEKFEKCGPS 500
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  559 YRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAI 637
Cdd:COG5602   501 YRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQRTIKALEQI 580
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  638 QKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWREAQRGFNK 716
Cdd:COG5602   581 IDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWRSCKREWNK 660
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  717 VWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQILEDAAALII 796
Cdd:COG5602   661 IWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIFVNILDLKD 736
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  797 HHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEEemdvdeatgapKKHNGVGGSP------------ 862
Cdd:COG5602   737 TLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLNV-----------KGLNIDGLKAdrssrsdesaqr 805
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  863 ------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQILCLRLLRic 911
Cdd:COG5602   806 yskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDSYERLYE-- 882
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  912 sqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIH 991
Cdd:COG5602   883 ----------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKYLRPFYNNT 931
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  992 AYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlnsqtsRSLLESAYQRKAEQLMS-DENCFKLMFI 1070
Cdd:COG5602   932 CYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSK---------RKQDQTKYRLEVESLLNpDEILFRFCWI 1002
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362 1071 QSQGQVQLTVELLD--TEEENSDDpveAERWSDYVERYMSSDTTspELREHLAQKPVFLPRNLRRIRKC-QRGREQQEke 1147
Cdd:COG5602  1003 NKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQLVSRLQT-- 1075
                        1210      1220      1230      1240      1250      1260
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 160333362 1148 gkegnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1214
Cdd:COG5602  1076 -------KLLRSHELVSGLQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
Sin3a_C pfam16879
C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at ...
887-1187 1.70e-103

C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at the C-terminus of the co-repressor Sin3a, and downstream of family Sin3_corepress, pfam08295.


Pssm-ID: 465294  Cd Length: 281  Bit Score: 329.18  E-value: 1.70e-103
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362   887 FYVNNNWYIFMRLHQILCLRLLRIcSQAERQIEEENREREWEREVLGIKRDKSDSPAIQLRLkepmdVDVEDYYPAFLDM 966
Cdd:pfam16879    1 FFANTTIYVFFRLFQILYERLLKI-KQAEEEVAEEIKRRKANKPAKDLGLISKELEDFGLDL-----SDPEDYYPQLLEL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362   967 VRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlNSQTsrsl 1046
Cdd:pfam16879   75 CERLIDGEIDQSQFEDSLRQMYGNKAYKLYTIDKLIQALVKQLQTIVSDSKSQELLELFEKERSKEET----TAQD---- 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  1047 lESAYQRKAEQLM-SDENCFKLMFIQSqgQVQLTVELLDTEEENSDDPV--EAERWSDYVERYMSSDTTspELREHLAQK 1123
Cdd:pfam16879  147 -EILYRKQAEKLLgKDENLFRIEWNPQ--SKTVTIQLLDKDDLTFDDDEltAEERWQYYVSSYVLSHPT--EGVPRSKLR 221
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 160333362  1124 PVFLPRNLRRIRKCQRGREQQEKEGKEgnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDYM 1187
Cdd:pfam16879  222 KPFLKRNLKAEREEEEDEEEEPKADAE----EPEGPLEFESGLEIRICLNSYKLFYVPGTEDYF 281
HDAC_interact smart00761
Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It ...
551-651 2.80e-52

Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 214808 [Multi-domain]  Cd Length: 102  Bit Score: 178.27  E-value: 2.80e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362    551 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 629
Cdd:smart00761    1 NCERCGPSYRLLPKSEKQPKCSGRDELCKEVLNDTWVSHPTWaSEDSGFVAHRKNQYEEALFRCEDERFELDMVIESNSS 80
                            90       100
                    ....*....|....*....|..
gi 160333362    630 TIRVLEAIQKKLSRLSAEEQAK 651
Cdd:smart00761   81 TIKLLEEILNKIEDMSDEERAN 102
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
48-119 1.24e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 42.72  E-value: 1.24e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 160333362   48 MQSATGIQYSVAPNYQVSAVPQSSGSHGPAIAAVHSSHHHPTAVQPHGgqvVQSHAHPAPPVAPvQGQQQFQ 119
Cdd:cd22056   213 MHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQY---QYMNAPYPPHYAH-QGAPQFH 280
 
Name Accession Description Interval E-value
Sin3 COG5602
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
8-1214 6.17e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 533.73  E-value: 6.17e-169
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362    8 QESPVYAAQQRRIPGSTEAFSHQHRVLAPAPPVYEAVSEtMQSATGIQYSvapnyQVSAVPQSSGSHGPAIAAVHSSHHH 87
Cdd:COG5602    37 RDHDVNTSLQNSEPWPDNGSARKPSYSPESMSSYGGPMD-ERSGTRRYYE-----DVEQILSSLAGDNPDPSTDHGSASN 110
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362   88 ptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKSQSIDTPGVISR 167
Cdd:COG5602   111 ------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKSQAIDTPGVIER 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  168 VSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpssqsaptpaqpap 243
Cdd:COG5602   173 VSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP------------------------------- 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  244 qptAAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVNKIKNRFQGQPD 323
Cdd:COG5602   222 ---PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVNKVKVRFQNNPE 288
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  324 IYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKTTAEKVDSvRND 403
Cdd:COG5602   289 MYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTANAQKPSK-RLP 354
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  404 HGGTVKKPqlnnkPQRPSQNGCQIRRHSGTGATPPVKKKPKLM-SLKESSMADA----SKHGVGTESLFFDKVRKALRSA 478
Cdd:COG5602   355 PIGSFSLP-----TAAPEQNRPSLLWESPRSISNISRYRADLLtSFSRNFVPIRihmpISHLSNESEDFFERIKQYLKDK 429
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  479 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVHlESFPKERategiameIDYASCKRLGSS 558
Cdd:COG5602   430 NLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEK-EPLRETR--------KDLEKFEKCGPS 500
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  559 YRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLATIRVLEAI 637
Cdd:COG5602   501 YRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQRTIKALEQI 580
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  638 QKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWREAQRGFNK 716
Cdd:COG5602   581 IDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWRSCKREWNK 660
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  717 VWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQILEDAAALII 796
Cdd:COG5602   661 IWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIFVNILDLKD 736
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  797 HHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEEemdvdeatgapKKHNGVGGSP------------ 862
Cdd:COG5602   737 TLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLNV-----------KGLNIDGLKAdrssrsdesaqr 805
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  863 ------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQILCLRLLRic 911
Cdd:COG5602   806 yskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDSYERLYE-- 882
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  912 sqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDSLREMFTIH 991
Cdd:COG5602   883 ----------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKYLRPFYNNT 931
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  992 AYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlnsqtsRSLLESAYQRKAEQLMS-DENCFKLMFI 1070
Cdd:COG5602   932 CYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSK---------RKQDQTKYRLEVESLLNpDEILFRFCWI 1002
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362 1071 QSQGQVQLTVELLD--TEEENSDDpveAERWSDYVERYMSSDTTspELREHLAQKPVFLPRNLRRIRKC-QRGREQQEke 1147
Cdd:COG5602  1003 NKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQLVSRLQT-- 1075
                        1210      1220      1230      1240      1250      1260
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 160333362 1148 gkegnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1214
Cdd:COG5602  1076 -------KLLRSHELVSGLQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
Sin3a_C pfam16879
C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at ...
887-1187 1.70e-103

C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at the C-terminus of the co-repressor Sin3a, and downstream of family Sin3_corepress, pfam08295.


Pssm-ID: 465294  Cd Length: 281  Bit Score: 329.18  E-value: 1.70e-103
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362   887 FYVNNNWYIFMRLHQILCLRLLRIcSQAERQIEEENREREWEREVLGIKRDKSDSPAIQLRLkepmdVDVEDYYPAFLDM 966
Cdd:pfam16879    1 FFANTTIYVFFRLFQILYERLLKI-KQAEEEVAEEIKRRKANKPAKDLGLISKELEDFGLDL-----SDPEDYYPQLLEL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362   967 VRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEVCVQVTDLYLAENNNGATggqlNSQTsrsl 1046
Cdd:pfam16879   75 CERLIDGEIDQSQFEDSLRQMYGNKAYKLYTIDKLIQALVKQLQTIVSDSKSQELLELFEKERSKEET----TAQD---- 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362  1047 lESAYQRKAEQLM-SDENCFKLMFIQSqgQVQLTVELLDTEEENSDDPV--EAERWSDYVERYMSSDTTspELREHLAQK 1123
Cdd:pfam16879  147 -EILYRKQAEKLLgKDENLFRIEWNPQ--SKTVTIQLLDKDDLTFDDDEltAEERWQYYVSSYVLSHPT--EGVPRSKLR 221
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 160333362  1124 PVFLPRNLRRIRKCQRGREQQEKEGKEgnskKTMENVESLDKLECRFKLNSYKMVYVIKSEDYM 1187
Cdd:pfam16879  222 KPFLKRNLKAEREEEEDEEEEPKADAE----EPEGPLEFESGLEIRICLNSYKLFYVPGTEDYF 281
HDAC_interact smart00761
Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It ...
551-651 2.80e-52

Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 214808 [Multi-domain]  Cd Length: 102  Bit Score: 178.27  E-value: 2.80e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362    551 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 629
Cdd:smart00761    1 NCERCGPSYRLLPKSEKQPKCSGRDELCKEVLNDTWVSHPTWaSEDSGFVAHRKNQYEEALFRCEDERFELDMVIESNSS 80
                            90       100
                    ....*....|....*....|..
gi 160333362    630 TIRVLEAIQKKLSRLSAEEQAK 651
Cdd:smart00761   81 TIKLLEEILNKIEDMSDEERAN 102
Sin3_corepress pfam08295
Sin3 family co-repressor; This domain is found on transcriptional regulators. It forms ...
553-647 5.87e-51

Sin3 family co-repressor; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 462418  Cd Length: 97  Bit Score: 174.34  E-value: 5.87e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 160333362   553 KRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDS-TFVSSKKTQYEEHIYRCEDERFELDVVLETNLAT 630
Cdd:pfam08295    1 KRCGPSYRLLPKSEPQLPCSGRDELCWEVLNDEWVSHPTWaSEDSgSFVAHRKNQYEEALFRCEDERYEYDMHIESNLRT 80
                           90
                   ....*....|....*..
gi 160333362   631 IRVLEAIQKKLSRLSAE 647
Cdd:pfam08295   81 IKLLEPIAEKINNMSPE 97
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
142-186 9.33e-20

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 83.63  E-value: 9.33e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 160333362   142 QVYNDFLDIMKEFKSQSIDTPGVISRVSQLFKGHPDLIMGFNTFL 186
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAEVIARVSELLKGHPDLLEGFNTFL 45
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
479-523 2.04e-13

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 65.52  E-value: 2.04e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 160333362   479 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFL 523
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAEVIARVSELLKGHPDLLEGFNTFL 45
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
323-380 3.78e-12

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 62.05  E-value: 3.78e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 160333362   323 DIYKAFLEILHTYQKEQRNAKEaggnytpalteqeVYAQVARLFKNQEDLLSEFGQFL 380
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAE-------------VIARVSELLKGHPDLLEGFNTFL 45
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
48-119 1.24e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 42.72  E-value: 1.24e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 160333362   48 MQSATGIQYSVAPNYQVSAVPQSSGSHGPAIAAVHSSHHHPTAVQPHGgqvVQSHAHPAPPVAPvQGQQQFQ 119
Cdd:cd22056   213 MHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQY---QYMNAPYPPHYAH-QGAPQFH 280
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH