NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1717567556|gb|TWP43201|]
View 

hypothetical protein AYC64_014835 [Escherichia coli]

Protein Classification

phage tail protein( domain architecture ID 13428948)

phage tail protein is part of a multi-protein structure that mediates the attachment, digestion and penetration of the cell wall and genome ejection

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Phage_fiber_2 pfam03406
Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein ...
215-252 6.64e-12

Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein K. The repeats are about 40 residues long.


:

Pssm-ID: 427282 [Multi-domain]  Cd Length: 38  Bit Score: 60.41  E-value: 6.64e-12
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1717567556 215 STTQKGLVQLSSETNSDSETMAATPKAVKSVKDLADTK 252
Cdd:pfam03406   1 TLTQKGIVQLSSATNSTSETLAATPKAVKTAYDNANAA 38
Collar pfam07484
Phage Tail Collar Domain; This region is occasionally found in conjunction with pfam03335. ...
545-592 3.73e-11

Phage Tail Collar Domain; This region is occasionally found in conjunction with pfam03335. Most of the family appear to be phage tail proteins; however some appear to be involved in other processes. For instance Swiss:Q03314 from Rhizobium leguminosarum may be involved in plant-microbe interactions. A related protein Swiss:Q9L3N1 is involved in the pathogenicity of Microcystis aeruginosa. The finding of this family in a structural component of the phage tail fibre baseplate suggests that its function is structural rather than enzymatic. Structural studies show this region consists of a helix and a loop and three beta-strands. This alignment does not catch the third strand as it is separated from the rest of the structure by around 100 residues. This strand is conserved in homologs but the intervening sequence is not. Much of the function of Swiss:P10930 appears to reside in this intervening region. In the tertiary structure of the phage baseplate this domain forms part of the 'collar'. The domain may bind SO4, however the residues accredited with this vary between the PDB file and the Swiss-Prot entry. The long unconserved region maybe due to domain swapping in and out of a loop or reflective of rapid evolution.


:

Pssm-ID: 429485 [Multi-domain]  Cd Length: 57  Bit Score: 58.72  E-value: 3.73e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1717567556 545 GAPIPWPSDVVPTGYAIMQGQTFDKSAYPKLA---------VAYPSGVIPDMRGWTI 592
Cdd:pfam07484   1 GEIRLFAGNFAPAGWLLCDGQTLSISQYPALFallgttyggDGSTTFALPDLRGRFP 57
COG5301 super family cl34977
Phage-related tail fiber protein [Mobilome: prophages, transposons];
175-295 1.93e-09

Phage-related tail fiber protein [Mobilome: prophages, transposons];


The actual alignment was detected with superfamily member COG5301:

Pssm-ID: 444101 [Multi-domain]  Cd Length: 254  Bit Score: 58.91  E-value: 1.93e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1717567556 175 TIGIDISDsVTSTRSDVAASSLAVKKAYDLAkskytaqDASTTQKGLVQLSSETNSDSETMAATPKAVKSVKDLADTKAP 254
Cdd:COG5301   137 TLKIDPSV-VLATRQYVDDKLAKHEKSRNHP-------DATLTEKGFVQLSSATDSNSETLAATPKAVKTAYDLADTAFV 208
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 1717567556 255 IESPSLTGTPTAPTAAQGT--NNTQIATTAYVRAAISALVGSS 295
Cdd:COG5301   209 AASGNAAGTAATAAAAGGLtlNAAALSVSTYSATAGAVLGTGG 251
 
Name Accession Description Interval E-value
Phage_fiber_2 pfam03406
Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein ...
215-252 6.64e-12

Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein K. The repeats are about 40 residues long.


Pssm-ID: 427282 [Multi-domain]  Cd Length: 38  Bit Score: 60.41  E-value: 6.64e-12
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1717567556 215 STTQKGLVQLSSETNSDSETMAATPKAVKSVKDLADTK 252
Cdd:pfam03406   1 TLTQKGIVQLSSATNSTSETLAATPKAVKTAYDNANAA 38
Collar pfam07484
Phage Tail Collar Domain; This region is occasionally found in conjunction with pfam03335. ...
545-592 3.73e-11

Phage Tail Collar Domain; This region is occasionally found in conjunction with pfam03335. Most of the family appear to be phage tail proteins; however some appear to be involved in other processes. For instance Swiss:Q03314 from Rhizobium leguminosarum may be involved in plant-microbe interactions. A related protein Swiss:Q9L3N1 is involved in the pathogenicity of Microcystis aeruginosa. The finding of this family in a structural component of the phage tail fibre baseplate suggests that its function is structural rather than enzymatic. Structural studies show this region consists of a helix and a loop and three beta-strands. This alignment does not catch the third strand as it is separated from the rest of the structure by around 100 residues. This strand is conserved in homologs but the intervening sequence is not. Much of the function of Swiss:P10930 appears to reside in this intervening region. In the tertiary structure of the phage baseplate this domain forms part of the 'collar'. The domain may bind SO4, however the residues accredited with this vary between the PDB file and the Swiss-Prot entry. The long unconserved region maybe due to domain swapping in and out of a loop or reflective of rapid evolution.


Pssm-ID: 429485 [Multi-domain]  Cd Length: 57  Bit Score: 58.72  E-value: 3.73e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1717567556 545 GAPIPWPSDVVPTGYAIMQGQTFDKSAYPKLA---------VAYPSGVIPDMRGWTI 592
Cdd:pfam07484   1 GEIRLFAGNFAPAGWLLCDGQTLSISQYPALFallgttyggDGSTTFALPDLRGRFP 57
COG5301 COG5301
Phage-related tail fiber protein [Mobilome: prophages, transposons];
175-295 1.93e-09

Phage-related tail fiber protein [Mobilome: prophages, transposons];


Pssm-ID: 444101 [Multi-domain]  Cd Length: 254  Bit Score: 58.91  E-value: 1.93e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1717567556 175 TIGIDISDsVTSTRSDVAASSLAVKKAYDLAkskytaqDASTTQKGLVQLSSETNSDSETMAATPKAVKSVKDLADTKAP 254
Cdd:COG5301   137 TLKIDPSV-VLATRQYVDDKLAKHEKSRNHP-------DATLTEKGFVQLSSATDSNSETLAATPKAVKTAYDLADTAFV 208
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 1717567556 255 IESPSLTGTPTAPTAAQGT--NNTQIATTAYVRAAISALVGSS 295
Cdd:COG5301   209 AASGNAAGTAATAAAAGGLtlNAAALSVSTYSATAGAVLGTGG 251
Phage_fiber_2 pfam03406
Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein ...
178-208 1.08e-04

Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein K. The repeats are about 40 residues long.


Pssm-ID: 427282 [Multi-domain]  Cd Length: 38  Bit Score: 39.99  E-value: 1.08e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1717567556 178 IDISDSVTSTRSDVAASSLAVKKAYDLAKSK 208
Cdd:pfam03406   8 VQLSSATNSTSETLAATPKAVKTAYDNANAA 38
 
Name Accession Description Interval E-value
Phage_fiber_2 pfam03406
Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein ...
215-252 6.64e-12

Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein K. The repeats are about 40 residues long.


Pssm-ID: 427282 [Multi-domain]  Cd Length: 38  Bit Score: 60.41  E-value: 6.64e-12
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1717567556 215 STTQKGLVQLSSETNSDSETMAATPKAVKSVKDLADTK 252
Cdd:pfam03406   1 TLTQKGIVQLSSATNSTSETLAATPKAVKTAYDNANAA 38
Collar pfam07484
Phage Tail Collar Domain; This region is occasionally found in conjunction with pfam03335. ...
545-592 3.73e-11

Phage Tail Collar Domain; This region is occasionally found in conjunction with pfam03335. Most of the family appear to be phage tail proteins; however some appear to be involved in other processes. For instance Swiss:Q03314 from Rhizobium leguminosarum may be involved in plant-microbe interactions. A related protein Swiss:Q9L3N1 is involved in the pathogenicity of Microcystis aeruginosa. The finding of this family in a structural component of the phage tail fibre baseplate suggests that its function is structural rather than enzymatic. Structural studies show this region consists of a helix and a loop and three beta-strands. This alignment does not catch the third strand as it is separated from the rest of the structure by around 100 residues. This strand is conserved in homologs but the intervening sequence is not. Much of the function of Swiss:P10930 appears to reside in this intervening region. In the tertiary structure of the phage baseplate this domain forms part of the 'collar'. The domain may bind SO4, however the residues accredited with this vary between the PDB file and the Swiss-Prot entry. The long unconserved region maybe due to domain swapping in and out of a loop or reflective of rapid evolution.


Pssm-ID: 429485 [Multi-domain]  Cd Length: 57  Bit Score: 58.72  E-value: 3.73e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1717567556 545 GAPIPWPSDVVPTGYAIMQGQTFDKSAYPKLA---------VAYPSGVIPDMRGWTI 592
Cdd:pfam07484   1 GEIRLFAGNFAPAGWLLCDGQTLSISQYPALFallgttyggDGSTTFALPDLRGRFP 57
COG5301 COG5301
Phage-related tail fiber protein [Mobilome: prophages, transposons];
175-295 1.93e-09

Phage-related tail fiber protein [Mobilome: prophages, transposons];


Pssm-ID: 444101 [Multi-domain]  Cd Length: 254  Bit Score: 58.91  E-value: 1.93e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1717567556 175 TIGIDISDsVTSTRSDVAASSLAVKKAYDLAkskytaqDASTTQKGLVQLSSETNSDSETMAATPKAVKSVKDLADTKAP 254
Cdd:COG5301   137 TLKIDPSV-VLATRQYVDDKLAKHEKSRNHP-------DATLTEKGFVQLSSATDSNSETLAATPKAVKTAYDLADTAFV 208
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 1717567556 255 IESPSLTGTPTAPTAAQGT--NNTQIATTAYVRAAISALVGSS 295
Cdd:COG5301   209 AASGNAAGTAATAAAAGGLtlNAAALSVSTYSATAGAVLGTGG 251
Phage_fiber_2 pfam03406
Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein ...
178-208 1.08e-04

Phage tail fibre repeat; This repeat is found in the tail fibres of phage. For example protein K. The repeats are about 40 residues long.


Pssm-ID: 427282 [Multi-domain]  Cd Length: 38  Bit Score: 39.99  E-value: 1.08e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1717567556 178 IDISDSVTSTRSDVAASSLAVKKAYDLAKSK 208
Cdd:pfam03406   8 VQLSSATNSTSETLAATPKAVKTAYDNANAA 38
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH