Conserved domains on [gi|1618341947|dbj|GDR84505|]
hypothetical protein BvCmsOUP006_00103 [Escherichia coli]

Protein Classification

SASA family carbohydrate esterase (domain architecture ID 12091508)

SASA family carbohydrate esterase containing a DUF1737 domain, similar to 9-O-acetyl N-acetylneuraminic acid esterase from Escherichia coli O157:H7

List of domain hits

Name Accession Description Interval E-value
DUF6645  pfam20350  Family of unknown function (DUF6645)  490-608  7.18e-84

Family of unknown function (DUF6645); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and viruses, and is approximately 120 amino acids in length. The family is found in association with pfam03629 and pfam08410.

Pssm-ID: 466500  Cd Length: 119  Bit Score: 258.78  E-value: 7.18e-84
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1618341947 490 LQKGGQIRCRFKASGALAANQYVMAFYWPVSSLPQGVVLTGDGGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGSFG 569
Cdd:pfam20350   1 LQKGGQIRCRFKVSGALAANQYVMALYWPVSSLPQGVTLTGDAGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGSFG 80
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 1618341947 570 AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP 608
Cdd:pfam20350  81 AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP 119
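The near-identity of the query to the pfam20350 model sequence can be checked by counting matching columns. The following is a minimal sketch, not part of the CD-Search output: the function name `alignment_identity` is hypothetical, and it assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned blocks, so both are skipped.

```python
def alignment_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, s in zip(query, subject):
        # Assumption: gaps ('-') and lowercase residues are not aligned columns.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the DUF6645 alignment above (query 490-608, model 1-119).
QUERY = (
    "LQKGGQIRCRFKASGALAANQYVMAFYWPVSSLPQGVVLTGDGGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGSFG"
    "AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP"
)
MODEL = (
    "LQKGGQIRCRFKVSGALAANQYVMALYWPVSSLPQGVTLTGDAGNNLLAAFYIQTDAKDLNVMYHNAKVATNNLKLGSFG"
    "AFDNEWHTLAFRFAGNNSLQVTPVIDGQDGTPFTLTQSP"
)

identical, aligned = alignment_identity(QUERY, MODEL)
print(f"{identical}/{aligned} identical columns ({100 * identical / aligned:.1f}%)")
```

The same function can be applied to the DUF1737 and SASA alignments, where the gap and lowercase handling actually matters.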
DUF1737  pfam08410  Domain of unknown function (DUF1737)  5-53  1.06e-15

Domain of unknown function (DUF1737); This domain of unknown function is found at the N-terminus of bacterial and viral hypothetical proteins.

Pssm-ID: 429981  Cd Length: 51  Bit Score: 71.22  E-value: 1.06e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1618341947   5 HYDVVRAASPSDLAERITQKLKEGWQPYGSALISTAGY--GAEFIQPVVSE 53
Cdd:pfam08410   1 LYRLLTGPDDSAFCHRVTQALNEGWQLYGSPSITFDAArgVMRCGQAVVKE 51
SASA super family  cl04187  Carbohydrate esterase, sialic acid-specific acetylesterase  84-275  2.83e-10

Carbohydrate esterase, sialic acid-specific acetylesterase; The catalytic triad of this esterase enzyme comprises residues Ser127, His403 and Asp391 in UniProtKB:P70665.

The actual alignment was detected with superfamily member pfam03629:

Pssm-ID: 427409  Cd Length: 227  Bit Score: 60.68  E-value: 2.83e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1618341947  84 LAGQSNSMSYG-----EGLPLPETydRPDPRIKQLARRSTVtpggaackyndiIPADHCLHdvqdmsrlnhpkADLSKGQ 158
Cdd:pfam03629   7 LAGQSNMAGRGgvenwDGVVPPEC--QPPPRILRLNADLEW------------EEAREPLH------------ADIDAKK 60
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1618341947 159 YGTVGQGLHIAKKLLPfIPANAGILLVPCCRGGSAFTtgadgtysdasgasenstRWGVDKPLYKDLIGRTKAALKKSpk 238
Cdd:pfam03629  61 TCGVGPGMAFANALLR-APPGGVIGLVPCAVGGTSIE------------------EWARGGLLYQEMVRRAKAALKGG-- 119
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 1618341947 239 nVLFAVVWMQGEFDFGGmpvNHAAQ-----FGALVDKFRADL 275
Cdd:pfam03629 120 -EIKGILWYQGESDTSD---EEDAAaykekLEKLITDLRDDL 157
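The hit list is ordered by E-value, not by position. A small sketch, assuming only the names and intervals reported above (the helper `architecture` is hypothetical), reorders the hits from N- to C-terminus and reports the length of each query interval:

```python
# (name, start, end) on the query, as listed in the hit table above.
INTERVALS = [
    ("DUF6645", 490, 608),
    ("DUF1737", 5, 53),
    ("SASA", 84, 275),
]


def architecture(hits):
    """Sort hits by start position and append the inclusive interval length."""
    ordered = sorted(hits, key=lambda hit: hit[1])
    return [(name, start, end, end - start + 1) for name, start, end in ordered]


for name, start, end, length in architecture(INTERVALS):
    print(f"{name:8s} {start:4d}-{end:<4d} {length:4d} residues")
```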
 
 
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
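The E-value threshold determines which hits are reported. As an illustration only, the sketch below applies the same cutoff to the three hits from this page. The `DomainHit` class and `passing_hits` function are hypothetical names, and treating the boundary as inclusive is an assumption rather than documented server behavior.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    name: str
    accession: str
    start: int
    end: int
    evalue: float


# Values copied from the list of domain hits on this page.
HITS = [
    DomainHit("DUF6645", "pfam20350", 490, 608, 7.18e-84),
    DomainHit("DUF1737", "pfam08410", 5, 53, 1.06e-15),
    DomainHit("SASA super family", "cl04187", 84, 275, 2.83e-10),
]


def passing_hits(hits, threshold=0.01):
    """Keep hits whose E-value is at or below the threshold (assumed inclusive)."""
    return [hit for hit in hits if hit.evalue <= threshold]


for hit in passing_hits(HITS):
    print(f"{hit.name} ({hit.accession}) {hit.start}-{hit.end} E={hit.evalue:.2e}")
```

With the preset threshold of 0.01 all three hits pass; a stricter cutoff such as `passing_hits(HITS, 1e-12)` would drop the SASA hit.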

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.