Conserved domains on  [gi|145332953|ref|NP_001078342|]

chorismate synthase [Arabidopsis thaliana]

List of domain hits

Name                   Accession  Description                               Interval  E-value
PHA03247 super family  cl33720    large tegument protein UL36; Provisional  574-947   1.41e-03


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953  574 GDSYHPPPPPVDQDTSVPSvKMTKQRKTSVDDQASQHLlSLLQRSSDPKSQDTQLLSATERRPPPPSMKTTTPPPSVKST 653
Cdd:PHA03247 2641 HPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQAS-SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSA 2718
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953  654 TAGEADPGKSLtlenlfGSAFMNELQSIGEPVSGRAMVSDAPGVPLRSERSIGELSQRNQIRPDGPPGGVLALPEDGNLL 733
Cdd:PHA03247 2719 TPLPPGPAAAR------QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953  734 AVGGHANPSKYMSFPGSHNQEPEVAFNISDKLAALNSGPRNERPTMGGqdglfLHQHPQQYVTNPSSHLNGSGPVFHPFD 813
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP-----PPPGPPPPSLPLGGSVAPGGDVRRRPP 2867
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953  814 SQHAHVKPQLDFMGPGSTMSQHHDPPPNHRF--PPNMIHRPPFHHTPTSGHPEFDRLPPHMMQKMHMQdnlqhhhlmQGF 891
Cdd:PHA03247 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFalPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP---------PPR 2938
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 145332953  892 PGSGPQPHHSPHVNNQMPGLIPELNPSQGFPFAHRQPNYGMPPPGSQVnrgEHPAS 947
Cdd:PHA03247 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSR---EAPAS 2991
DUF499 super family  cl47163  Protein of unknown function (DUF499); Family of uncharacterized hypothetical prokaryotic ...  211-440  1.41e-03

Protein of unknown function (DUF499); Family of uncharacterized hypothetical prokaryotic proteins.


The actual alignment was detected with superfamily member pfam04465:

Pssm-ID: 367954 [Multi-domain]  Cd Length: 1016  Bit Score: 42.52  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953   211 GSSDSTSEDRAEEERKRRASFELLrkeHQKAFQERQKSnpdlrkndfdftELLGESKDDKGRPSRSDEVNHAPTIPGSSN 290
Cdd:pfam04465  702 YSEIPFDKERGEVPTSIEEADIIL---PWNKALERMLE------------ELLKEEKDGVRKDKGILKLWYEVYIPNEKY 766
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953   291 T--SLPSQSNAPRPL-----------VPPGF----ASTILEKKQGEKPQTETS-----QYERS-PLNSKGINVVNGTSVN 347
Cdd:pfam04465  767 PlkDIVKFEDWEKVLrggiiekreeiLKGGFilkvKPRSVELNPGEKVEVEVAiepigDYENEiKLSSDEGELSVEPVEL 846
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953   348 NGGKPLGIK-------IGSSEMLIEGEDvrvSSTDANERAVNISSLLGISTDTVNKDKSFEK---LSSISTPTEIQGYPI 417
Cdd:pfam04465  847 KGKEPLKIKwrlkiprVGRYRIKIEAKS---NGGKLDERIISVVPKIEESVIIVEKIDEVEKgakLVSIKSINDLDSLKS 923
                          250       260
                   ....*....|....*....|....
gi 145332953   418 KSEKATMTLGKKK-SLEHSDGPSI 440
Cdd:pfam04465  924 IPEVAKVFPGKASgSLEVSEAGSW 947
 
Name      Accession  Description                               Interval  E-value
PHA03247  PHA03247   large tegument protein UL36; Provisional  574-947   1.41e-03


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953  574 GDSYHPPPPPVDQDTSVPSvKMTKQRKTSVDDQASQHLlSLLQRSSDPKSQDTQLLSATERRPPPPSMKTTTPPPSVKST 653
Cdd:PHA03247 2641 HPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQAS-SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSA 2718
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953  654 TAGEADPGKSLtlenlfGSAFMNELQSIGEPVSGRAMVSDAPGVPLRSERSIGELSQRNQIRPDGPPGGVLALPEDGNLL 733
Cdd:PHA03247 2719 TPLPPGPAAAR------QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953  734 AVGGHANPSKYMSFPGSHNQEPEVAFNISDKLAALNSGPRNERPTMGGqdglfLHQHPQQYVTNPSSHLNGSGPVFHPFD 813
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP-----PPPGPPPPSLPLGGSVAPGGDVRRRPP 2867
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953  814 SQHAHVKPQLDFMGPGSTMSQHHDPPPNHRF--PPNMIHRPPFHHTPTSGHPEFDRLPPHMMQKMHMQdnlqhhhlmQGF 891
Cdd:PHA03247 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFalPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP---------PPR 2938
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 145332953  892 PGSGPQPHHSPHVNNQMPGLIPELNPSQGFPFAHRQPNYGMPPPGSQVnrgEHPAS 947
Cdd:PHA03247 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSR---EAPAS 2991
DUF499  pfam04465  Protein of unknown function (DUF499); Family of uncharacterized hypothetical prokaryotic ...  211-440  1.41e-03

Protein of unknown function (DUF499); Family of uncharacterized hypothetical prokaryotic proteins.


Pssm-ID: 367954 [Multi-domain]  Cd Length: 1016  Bit Score: 42.52  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953   211 GSSDSTSEDRAEEERKRRASFELLrkeHQKAFQERQKSnpdlrkndfdftELLGESKDDKGRPSRSDEVNHAPTIPGSSN 290
Cdd:pfam04465  702 YSEIPFDKERGEVPTSIEEADIIL---PWNKALERMLE------------ELLKEEKDGVRKDKGILKLWYEVYIPNEKY 766
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953   291 T--SLPSQSNAPRPL-----------VPPGF----ASTILEKKQGEKPQTETS-----QYERS-PLNSKGINVVNGTSVN 347
Cdd:pfam04465  767 PlkDIVKFEDWEKVLrggiiekreeiLKGGFilkvKPRSVELNPGEKVEVEVAiepigDYENEiKLSSDEGELSVEPVEL 846
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145332953   348 NGGKPLGIK-------IGSSEMLIEGEDvrvSSTDANERAVNISSLLGISTDTVNKDKSFEK---LSSISTPTEIQGYPI 417
Cdd:pfam04465  847 KGKEPLKIKwrlkiprVGRYRIKIEAKS---NGGKLDERIISVVPKIEESVIIVEKIDEVEKgakLVSIKSINDLDSLKS 923
                          250       260
                   ....*....|....*....|....
gi 145332953   418 KSEKATMTLGKKK-SLEHSDGPSI 440
Cdd:pfam04465  924 IPEVAKVFPGKASgSLEVSEAGSW 947
KLF1_2_4_N-like  cd22056  N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...  826-903  4.86e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 40.41  E-value: 4.86e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 145332953 826 MGPGSTMSQHHDPPPNHRFPPnMIHRPPFHHTPTSGHPEFDRLPPHMMQKMHMQDNLQHHHLMQGFPGSGPQPHHSPH 903
Cdd:cd22056  198 AGGGGFMGQQKPKHQMHSVHP-QAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQYQYMNAPYPPHYAHQ 274
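
The KLF1_2_4_N-like description above quotes the consensus motif NNRCRCCYY using IUPAC nucleotide codes. As a rough illustration of how such a motif can be scanned for, the sketch below expands the three ambiguity codes defined in that description (N, R, Y) into a regular expression. The function names and the test sequence are hypothetical, only the forward strand is searched, and because the consensus is described as loose, a strict match like this is at best a first-pass filter and may reject real binding sites.

```python
import re

# Only the codes defined in the description above are handled here;
# the full IUPAC alphabet has more ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}


def iupac_to_regex(motif):
    """Translate an IUPAC motif string into a regex pattern string."""
    return "".join(IUPAC[base] for base in motif.upper())


def find_sites(sequence, motif="NNRCRCCYY"):
    """Return (start, matched_text) for every forward-strand match.

    A lookahead is used so that overlapping matches are all reported.
    """
    pattern = re.compile("(?=(" + iupac_to_regex(motif) + "))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence.upper())]


if __name__ == "__main__":
    # Made-up sequence, chosen only to exercise the function.
    print(find_sites("TTAAGCGCCTTAA"))
```

Scanning the reverse complement as well, or allowing mismatches, would be natural extensions but are left out to keep the sketch short.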
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
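
The E-value threshold above determines which hits are reported. The sketch below shows one way the per-hit summary lines from this report could be parsed and checked against that threshold. The helper names are hypothetical and are not part of any NCBI tool; the regular expression assumes the summary line layout seen on this page, and treating the threshold as inclusive is also an assumption.

```python
import re

# Pattern for summary lines such as:
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.41e-03"
HIT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<hit_type>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = HIT_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "hit_type": m["hit_type"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (inclusive by assumption)."""
    return hit["e_value"] <= threshold


# The three summary lines copied from the hit list in this report.
SUMMARY_LINES = [
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.41e-03",
    "Pssm-ID: 367954 [Multi-domain]  Cd Length: 1016  Bit Score: 42.52  E-value: 1.41e-03",
    "Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 40.41  E-value: 4.86e-03",
]

hits = [parse_hit(line) for line in SUMMARY_LINES]
kept = [h["pssm_id"] for h in hits if passes_threshold(h)]

if __name__ == "__main__":
    print(kept)
```

All three hits listed in this report have E-values below 0.01, so none would be dropped by this check.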
