NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|606520357|gb|EYV75156|]
View 

hypothetical protein BX32_16815, partial [Escherichia coli O121:H19 str. 2009EL1412]

Protein Classification

SASA family carbohydrate esterase( domain architecture ID 12091508)

SASA family carbohydrate esterase containing a DUF1737 domains, similar to 9-O-Acetyl N-acetylneuraminic acid esterase from Escherichia coli O157:H7

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF6645 pfam20350
Family of unknown function (DUF6645); This presumed domain is functionally uncharacterized. ...
489-607 1.72e-84

Family of unknown function (DUF6645); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and viruses, and is approximately 120 amino acids in length. The family is found in association with pfam03629, pfam08410.


:

Pssm-ID: 466500  Cd Length: 119  Bit Score: 259.55  E-value: 1.72e-84
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357  489 LQKGGQIRCRFKVSGALAANQYVMAFYWPVSSLPQGVALTGDGGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGTFG 568
Cdd:pfam20350   1 LQKGGQIRCRFKVSGALAANQYVMALYWPVSSLPQGVTLTGDAGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGSFG 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 606520357  569 AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP 607
Cdd:pfam20350  81 AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP 119
DUF1737 pfam08410
Domain of unknown function (DUF1737); This domain of unknown function is found at the ...
5-53 8.82e-16

Domain of unknown function (DUF1737); This domain of unknown function is found at the N-terminus of bacterial and viral hypothetical proteins.


:

Pssm-ID: 429981  Cd Length: 51  Bit Score: 71.61  E-value: 8.82e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 606520357    5 HYDVVRAASPSDLAERITQKLKEGWQPYGSALISTAGY--GAEFIQPVVSE 53
Cdd:pfam08410   1 LYRLLTGPDDSAFCHRVTQALNEGWQLYGSPSITFDAArgVMRCGQAVVKE 51
SASA super family cl04187
Carbohydrate esterase, sialic acid-specific acetylesterase; The catalytic triad of this ...
83-274 1.51e-10

Carbohydrate esterase, sialic acid-specific acetylesterase; The catalytic triad of this esterase enzyme comprises residues Ser127, His403 and Asp391 in UniProtKB:P70665.


The actual alignment was detected with superfamily member pfam03629:

Pssm-ID: 427409  Cd Length: 227  Bit Score: 61.45  E-value: 1.51e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357   83 LAGQSN-----GMSYGEGLPLPDTfdSPDPRIKQLARRSTVtpggaackyndiIPADHCLHdvqdmsrlnhpkADLSKGQ 157
Cdd:pfam03629   7 LAGQSNmagrgGVENWDGVVPPEC--QPPPRILRLNADLEW------------EEAREPLH------------ADIDAKK 60
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357  158 YGTVGQGLHIAKKLLPfIPANAGILLVPCCRGGSAFTtgadgtysdasgasenstRWGVDKPLYKDLIGRTKAALKknpK 237
Cdd:pfam03629  61 TCGVGPGMAFANALLR-APPGGVIGLVPCAVGGTSIE------------------EWARGGLLYQEMVRRAKAALK---G 118
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 606520357  238 NVLFAVVWMQGEFDfggTPVNHAAQ-----FGALVDKFRADL 274
Cdd:pfam03629 119 GEIKGILWYQGESD---TSDEEDAAaykekLEKLITDLRDDL 157
 
Name Accession Description Interval E-value
DUF6645 pfam20350
Family of unknown function (DUF6645); This presumed domain is functionally uncharacterized. ...
489-607 1.72e-84

Family of unknown function (DUF6645); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and viruses, and is approximately 120 amino acids in length. The family is found in association with pfam03629, pfam08410.


Pssm-ID: 466500  Cd Length: 119  Bit Score: 259.55  E-value: 1.72e-84
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357  489 LQKGGQIRCRFKVSGALAANQYVMAFYWPVSSLPQGVALTGDGGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGTFG 568
Cdd:pfam20350   1 LQKGGQIRCRFKVSGALAANQYVMALYWPVSSLPQGVTLTGDAGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGSFG 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 606520357  569 AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP 607
Cdd:pfam20350  81 AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP 119
DUF1737 pfam08410
Domain of unknown function (DUF1737); This domain of unknown function is found at the ...
5-53 8.82e-16

Domain of unknown function (DUF1737); This domain of unknown function is found at the N-terminus of bacterial and viral hypothetical proteins.


Pssm-ID: 429981  Cd Length: 51  Bit Score: 71.61  E-value: 8.82e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 606520357    5 HYDVVRAASPSDLAERITQKLKEGWQPYGSALISTAGY--GAEFIQPVVSE 53
Cdd:pfam08410   1 LYRLLTGPDDSAFCHRVTQALNEGWQLYGSPSITFDAArgVMRCGQAVVKE 51
SASA pfam03629
Carbohydrate esterase, sialic acid-specific acetylesterase; The catalytic triad of this ...
83-274 1.51e-10

Carbohydrate esterase, sialic acid-specific acetylesterase; The catalytic triad of this esterase enzyme comprises residues Ser127, His403 and Asp391 in UniProtKB:P70665.


Pssm-ID: 427409  Cd Length: 227  Bit Score: 61.45  E-value: 1.51e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357   83 LAGQSN-----GMSYGEGLPLPDTfdSPDPRIKQLARRSTVtpggaackyndiIPADHCLHdvqdmsrlnhpkADLSKGQ 157
Cdd:pfam03629   7 LAGQSNmagrgGVENWDGVVPPEC--QPPPRILRLNADLEW------------EEAREPLH------------ADIDAKK 60
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357  158 YGTVGQGLHIAKKLLPfIPANAGILLVPCCRGGSAFTtgadgtysdasgasenstRWGVDKPLYKDLIGRTKAALKknpK 237
Cdd:pfam03629  61 TCGVGPGMAFANALLR-APPGGVIGLVPCAVGGTSIE------------------EWARGGLLYQEMVRRAKAALK---G 118
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 606520357  238 NVLFAVVWMQGEFDfggTPVNHAAQ-----FGALVDKFRADL 274
Cdd:pfam03629 119 GEIKGILWYQGESD---TSDEEDAAaykekLEKLITDLRDDL 157
 
Name Accession Description Interval E-value
DUF6645 pfam20350
Family of unknown function (DUF6645); This presumed domain is functionally uncharacterized. ...
489-607 1.72e-84

Family of unknown function (DUF6645); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and viruses, and is approximately 120 amino acids in length. The family is found in association with pfam03629, pfam08410.


Pssm-ID: 466500  Cd Length: 119  Bit Score: 259.55  E-value: 1.72e-84
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357  489 LQKGGQIRCRFKVSGALAANQYVMAFYWPVSSLPQGVALTGDGGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGTFG 568
Cdd:pfam20350   1 LQKGGQIRCRFKVSGALAANQYVMALYWPVSSLPQGVTLTGDAGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGSFG 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 606520357  569 AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP 607
Cdd:pfam20350  81 AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP 119
DUF1737 pfam08410
Domain of unknown function (DUF1737); This domain of unknown function is found at the ...
5-53 8.82e-16

Domain of unknown function (DUF1737); This domain of unknown function is found at the N-terminus of bacterial and viral hypothetical proteins.


Pssm-ID: 429981  Cd Length: 51  Bit Score: 71.61  E-value: 8.82e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 606520357    5 HYDVVRAASPSDLAERITQKLKEGWQPYGSALISTAGY--GAEFIQPVVSE 53
Cdd:pfam08410   1 LYRLLTGPDDSAFCHRVTQALNEGWQLYGSPSITFDAArgVMRCGQAVVKE 51
SASA pfam03629
Carbohydrate esterase, sialic acid-specific acetylesterase; The catalytic triad of this ...
83-274 1.51e-10

Carbohydrate esterase, sialic acid-specific acetylesterase; The catalytic triad of this esterase enzyme comprises residues Ser127, His403 and Asp391 in UniProtKB:P70665.


Pssm-ID: 427409  Cd Length: 227  Bit Score: 61.45  E-value: 1.51e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357   83 LAGQSN-----GMSYGEGLPLPDTfdSPDPRIKQLARRSTVtpggaackyndiIPADHCLHdvqdmsrlnhpkADLSKGQ 157
Cdd:pfam03629   7 LAGQSNmagrgGVENWDGVVPPEC--QPPPRILRLNADLEW------------EEAREPLH------------ADIDAKK 60
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 606520357  158 YGTVGQGLHIAKKLLPfIPANAGILLVPCCRGGSAFTtgadgtysdasgasenstRWGVDKPLYKDLIGRTKAALKknpK 237
Cdd:pfam03629  61 TCGVGPGMAFANALLR-APPGGVIGLVPCAVGGTSIE------------------EWARGGLLYQEMVRRAKAALK---G 118
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 606520357  238 NVLFAVVWMQGEFDfggTPVNHAAQ-----FGALVDKFRADL 274
Cdd:pfam03629 119 GEIKGILWYQGESD---TSDEEDAAaykekLEKLITDLRDDL 157
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH