Conserved domains on  [gi|1221484109|ref|WP_089637541|]

RNA-guided endonuclease TnpB family protein [Escherichia coli]

Protein Classification

RNA-guided endonuclease InsQ/TnpB family protein (domain architecture ID 11430747)

RNA-guided endonuclease InsQ/TnpB family protein, such as the RNA-guided endonuclease TnpB from IS200/IS605 family elements and IS607 family elements. This protein is homologous to some CRISPR-associated (Cas) proteins, such as the type V CRISPR-associated protein C2c8. TnpB proteins were described as accessory proteins in IS (insertion sequence) elements: each is one of only one or two proteins encoded in the element, yet it is not necessary for transposition. The programmable RNA-guided endonuclease TnpB proteins may provide a CRISPR-like, widespread form of phage defense by RNA-guided DNA degradation.


List of domain hits

Name  Accession  Description  Interval  E-value
InsQ  COG0675  Transposase [Mobilome: prophages, transposons]  1-392  6.08e-158

Pssm-ID: 440439 [Multi-domain]  Cd Length: 381  Bit Score: 449.73  E-value: 6.08e-158
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109   1 MKRlqAFKFQLRPGGQQERQMRLFAGACRFVFNRALALQNENYEAGNKYIPYTKMASWLVEWKKdtETEWLKDSPSQPLQ 80
Cdd:COG0675     1 MLR--TYKFRLYPTKEQEELLERTLGCCRFVYNYALAERRQAYKETGKSLSYYELQKLLTELKK--EYPWLKELPSQVLQ 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109  81 QSLKDLERAYKNFFQ-----KRAAFPRFKKRGQNDAFRYPQ-GVKLDqeNSRIFLPKLGWMRYRNSRQV--TGIVKNVTV 152
Cdd:COG0675    77 QALKRLDEAFKSFFKrkkkgKKAGFPRFKKKGRYRSFTYPQsGFKLK--DGRLKLPKIGWVKIRLHRPLpdDGKIKSVTI 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 153 SQ-SCGKWYISIQTESEVSTPVHPSASMVGLDAGVARLATLSDGTVFEPVNSFQKNQKKLARLQRQLSRKVRFSNNWQKQ 231
Cdd:COG0675   155 SRkAAGKWYVSFVVEVEDVPELPPTGKVVGIDLGLKNFATLSDGEKIDNPKFLKKAERKLAKLQRRLSRKKKGSKNRRKA 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 232 KRKIQQLHSRIANIRRDYLHKVTTIISKNHAMIVIEDLKVKHMSKSAagtisqpgrnvraksGLNRSILDQGWYEMRRQL 311
Cdd:COG0675   235 RKKLAKLHEKIANQRKDFLHKLARKLVKEADVIVVEDLNVKGMKKNK---------------KLNKSISDAGWGEFRRQL 299
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 312 EYKQLWRGGQVLAVPPAYTSQRCACCGHTAKENRLSQSKFRCQVCGYTANADVNGARNILAAGHAVLAC-GGMVQSGRPL 390
Cdd:COG0675   300 EYKAEKYGIKVVEVDPAYTSQTCSSCGHVVKKLRLSVRTFVCPKCGTVHDRDVNAAINILRRGLRQLGLaGHSGGTVRPL 379

                  ..
gi 1221484109 391 KQ 392
Cdd:COG0675   380 RD 381
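
The alignment rows can be summarized numerically. The following is a minimal Python sketch, not part of the CD-Search output, that counts identical residues over the aligned columns of one block. It assumes that lowercase letters mark unaligned residues and that `-` marks a gap, which is how the display above appears to use them. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query: str, model: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length alignment rows."""
    if len(query) != len(model):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query, model):
        # Assumption: only columns where both rows hold an uppercase residue
        # are aligned; lowercase residues and '-' gaps are skipped.
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


# First alignment block of the COG0675 hit, copied from the report above.
QUERY = "MKRlqAFKFQLRPGGQQERQMRLFAGACRFVFNRALALQNENYEAGNKYIPYTKMASWLVEWKKdtETEWLKDSPSQPLQ"
MODEL = "MLR--TYKFRLYPTKEQEELLERTLGCCRFVYNYALAERRQAYKETGKSLSYYELQKLLTELKK--EYPWLKELPSQVLQ"

identical, aligned = aligned_identity(QUERY, MODEL)
print(f"{identical}/{aligned} aligned columns identical")
```

Applying the same function to each block of a hit and summing the counts would give a rough identity figure for the whole interval.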
 
IS200_TnpB  NF038281  IS200/IS605 family element RNA-guided endonuclease TnpB  6-374  2.80e-114

Pssm-ID: 468448 [Multi-domain]  Cd Length: 359  Bit Score: 337.92  E-value: 2.80e-114
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109   6 AFKFQLRPGGQQERQMRLFAGACRFVFNRALALQNENYEAGNKYIPYTKMASWLVEWKKdtETEWLKDSPSQPLQQSLKD 85
Cdd:NF038281    3 AYKFRIYPNKEQEILINKTIGCSRFVYNHFLAKWNEAYEETGKGLSYNACSKQLTQLKK--EEEWLKEVDSIALQNSLKN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109  86 LERAYKNFFQKRAAFPRFK-KRGQNDAFR--YPQGVKLDQENsRIFLPKLGWMRYRNSRQVTGIVKNVTVSQ-SCGKWYI 161
Cdd:NF038281   81 LDDAFKRFFKKQNGFPRFKsKKNPVQSYTtkNTNGNIAIVGN-KIKLPKLGWVKFAKSREVEGRILSATVRRnPSGKYFV 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 162 SIQTESEVSTPVHPSASmVGLDAGVARLATLSDGTVFEPVNSFQKNQKKLARLQRQLSRKVRFSNNWQKQKRKIQQLHSR 241
Cdd:NF038281  160 SILVETEVQLLPKTNSA-VGIDLGLKDFAILSDGGKIENPKYLRKLEKKLAKLQRILSRRKKGSSNWQKQRIKVARLHEK 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 242 IANIRRDYLHKVTTIISKNHAMIVIEDLKVKHMSKsaagtisqpgrNVRaksgLNRSILDQGWYEMRRQLEYKQLWRGGQ 321
Cdd:NF038281  239 IANQRKDFLHKLSTRLIKENQVICIEDLQVKNMLK-----------NHK----LAKSISDVSWSEFRTMLEYKAKWYGRT 303
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1221484109 322 VLAVPPAY-TSQRCACCGHTAKENR-LSQSKFRCQVCGYTANADVNGARNILAAG 374
Cdd:NF038281  304 VVKVGKFFpSSQLCSCCGYKNKEVKnLALREWTCPSCGTHHDRDINASKNILNEG 358
guided_TnpB  NF040570  RNA-guided endonuclease TnpB family protein  7-373  1.07e-113

This family includes RNA-guided endonuclease TnpB from IS200/IS605 family elements (NF038281) and IS607 family elements (NF038280), but also many additional proteins. It exhibits homology to, or actually includes, some CRISPR-associated (Cas) proteins such as the type V CRISPR-associated protein C2c8. For a long time, TnpB proteins were described as accessory proteins in IS (insertion sequence) elements, present as one of just one or two proteins encoded in the element but not necessary for transposition. The programmable RNA-guided endonuclease TnpB proteins may provide a CRISPR-like, widespread form of phage defense by RNA-guided DNA degradation.

Pssm-ID: 468544 [Multi-domain]  Cd Length: 384  Bit Score: 337.21  E-value: 1.07e-113
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109   7 FKFQLRPGGQQERQMRLFAGACRFVFNRALALQNENYEAGNKYIP-YTKMASWLVEWKKDTETEWLKDSPSQPLQQSLKD 85
Cdd:NF040570    1 YKYRLYPTKEQKRELAELFGAARFLYNAALAERKEAYEKNGKFLSyKALLKKLLTELKKEKELEWLKELSSQALQQALKR 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109  86 LERAYKNFFQKR--AAFPRFKKRGQNDAFRYPQGVKLD--------QENSRIFLPKLGWMRYRNSRQV-------TGIVK 148
Cdd:NF040570   81 LAKAFKNFFKKLkkAGFPRFKSKKKKVPSYTPQSVNKRlrkkrnrkKKNGRLKLPKLGGVKLRLSRILpilldgkGGKIK 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 149 NVTVSQ-SCGKWYISIQTESEVSTPVHPSA--SMVGLDAGVARLATLSDG-TVFEPVNSFQKNQKKLARLQRQLSRK--- 221
Cdd:NF040570  161 SVTISKpKKGKYYVSISVEVEVPEPPPKEVtgKVVGIDLGLKNFATLSDGgEKIENPRFLRKKEKRLRRLQRKLSRKlqr 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 222 -VRFSNNWQKQKRKIQQLHSRIANIRRDYLHKVTTIISKNHAM--IVIEDLKVKHMSKSaagtisqpgrnvRAKSGLNRS 298
Cdd:NF040570  241 kGKGSSNRKKARKKVARLHRKIANQRKDFLHKLSKRLVKEADAnnVVVEDLEVKGMVKN------------KKKKKLAKS 308
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1221484109 299 ILDQGWYEMRRQLEYKQLWRGGQVLAVPPAYTSQRCACCGHTAKENR-LSQSKFRCQVCGYTANADVNGARNILAA 373
Cdd:NF040570  309 IHDWAFGQLRRMLEYKAEWYGIKVVKVDPAYTSSQCCSCGGHRKEKLlLSCREWTCPECGYTVHRDINAAINILRR 384
OrfB_IS605  pfam01385  Probable transposase  162-276  5.60e-33

This family includes IS891, IS1136 and IS1341. DUF1225, pfam06774, has now been merged into this family.

Pssm-ID: 396108 [Multi-domain]  Cd Length: 120  Bit Score: 120.10  E-value: 5.60e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 162 SIQTESEVSTPVHPSASMVGLDAGVARLATLSDGT----VFEPvNSFQKNQKKLARLQRQLSRKVRFSNNWQKQKRKIQQ 237
Cdd:pfam01385   1 SIPVEVEDPPPVAEPNKAAGIDLGINNLATVSTEDgdwfLFNP-RRLKSDYKYLAKRIARLQRKLKGSNNRKKASRKLAR 79
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 1221484109 238 LHSRIANIRRDYLHKVTTIISKNHAMIVIEDLKVKHMSK 276
Cdd:pfam01385  80 LHRKRSRRRKDFLHKLVRRLIEELDEVGVEDLNVGGMKD 118
tspaseT_teng_C  TIGR01766  transposase, IS605 OrfB family, central region  244-331  1.63e-09

This model represents a region of sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposases described by pfam01385, and other proteins. [Mobile and extrachromosomal element functions, Transposon functions]

Pssm-ID: 273793 [Multi-domain]  Cd Length: 82  Bit Score: 54.26  E-value: 1.63e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 244 NIRRDYLHKVTTIISK----NHAMIVIEDLKVKHmsksaagtisqpgRNVRAKS-GLNRSILDQGWYEMRRQLEYKQLWR 318
Cdd:TIGR01766   3 NKVEDFLHKIVKQIVEyakeNNGTIVLEDLKNIR-------------EMVDKKSkYLRRKLHQWSFRKLISKIKYKAEEY 69
                          90
                  ....*....|...
gi 1221484109 319 GGQVLAVPPAYTS 331
Cdd:TIGR01766  70 GIEVIEVNPAYTS 82
PHA02942  PHA02942  putative transposase; Provisional  75-379  5.97e-07

Pssm-ID: 165252 [Multi-domain]  Cd Length: 383  Bit Score: 51.17  E-value: 5.97e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109  75 PSQPLQQSLKDLERAYKNFFQ--KRAAFPRFKKRGQNDAFRYPQGVKLDQENSRIF----LPKLGWMR----YRNSRqvt 144
Cdd:PHA02942   72 PPKVSADCYRDALAIYKSWYNnpKKGRFPRVYKPTVWLTPKQSYTVDLDKMTVKIAsvgeLPILGYPRnlkeYANWD--- 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 145 giVKNVTVSQSCGKWYISIQTESEvSTPVHPSASmVGLDAGVARLATLSDGTVFEPVNSFQKNQKKLARLQRQLSRKvrF 224
Cdd:PHA02942  149 --MKEARLTIKDGKAFLKVTFEKE-EEKIKPKDS-VAVDINMNDIVVGKDDSHYVRIPTRLHDAHHFKSLAENLQKK--Y 222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 225 SNNWQKQKR---KIQQLHSRIANIRRDYLHKVTTIISK-----NHAMIVIEDLK--VKHMSKSAAgtisqpgrNVRAKSG 294
Cdd:PHA02942  223 PRRWKENKRilhRARSFHHKAKLIMEDFARKVGKWVVEiaedlGANVIKLEDLKnlIKDVNKLPA--------EFRDKLY 294
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1221484109 295 LNRSILDQGWyemrrqLEYKQLWRGGQVLAVPPAYTSQRCACCGHTAKEnrLSQSKFRCQVCGYTANADVNGARNILAAG 374
Cdd:PHA02942  295 LMQYHRIQYW------IEWQAKKHGMIVEFVNPSYSSVSCPKCGHKMVE--IAHRYFHCPSCGYENDRDVIAIMNLNGRG 366

                  ....*
gi 1221484109 375 HAVLA 379
Cdd:PHA02942  367 SLTLS 371
 
OrfB_Zn_ribbon  pfam07282  Putative transposase DNA-binding domain  304-371  1.67e-27

This putative domain is found at the C-terminus of a large number of transposase proteins. It contains four conserved cysteines, suggestive of a zinc-binding domain. Given the need for transposases to bind DNA, as well as the large number of DNA-binding zinc fingers, we hypothesize this domain is DNA-binding.

Pssm-ID: 284650 [Multi-domain]  Cd Length: 69  Bit Score: 103.45  E-value: 1.67e-27
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1221484109 304 WYEMRRQLEYKQLWRGGQVLAVPPAYTSQRCACCGHTAKEnRLSQSKFRCQVCGYTANADVNGARNIL 371
Cdd:pfam07282   1 FRKFIEQLEYKAKEYGIKVVEVDPAYTSKTCSVCGHKNKE-SLSGRTFVCPNCGFVADRDVNAAINIL 67
HTH_OrfB_IS605  pfam12323  Helix-turn-helix domain  3-48  5.92e-11

This is the N-terminal helix-turn-helix domain of Transposase_2 (pfam01385).

Pssm-ID: 432479 [Multi-domain]  Cd Length: 47  Bit Score: 57.20  E-value: 5.92e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1221484109   3 RLQAFKFQLRPGGQQERQMRLFAGACRFVFNRALALQNENYEAGNK 48
Cdd:pfam12323   2 VLKAYKYRLYPTPEQEELLARTFGCARFVYNKALAERKEAYKEGGK 47
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
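
To show how the E-value threshold relates to the hit list, here is a minimal Python sketch that is not part of the CD-Search output. It transcribes the reported hits, keeps those at or below the 0.01 threshold, and prints them sorted by E-value with the fraction of the query each interval covers. The `DomainHit` class and `significant_hits` function are hypothetical names, and the query length of 392 is an assumption taken from the last alignment coordinate.

```python
from dataclasses import dataclass

QUERY_LENGTH = 392        # assumption: last residue number in the alignments
E_VALUE_THRESHOLD = 0.01  # preset from the search parameters


@dataclass(frozen=True)
class DomainHit:
    name: str
    accession: str
    start: int
    end: int
    e_value: float

    @property
    def coverage(self) -> float:
        """Fraction of the query covered by this hit's interval."""
        return (self.end - self.start + 1) / QUERY_LENGTH


# Intervals and E-values transcribed from the list of domain hits.
HITS = [
    DomainHit("InsQ", "COG0675", 1, 392, 6.08e-158),
    DomainHit("IS200_TnpB", "NF038281", 6, 374, 2.80e-114),
    DomainHit("guided_TnpB", "NF040570", 7, 373, 1.07e-113),
    DomainHit("OrfB_IS605", "pfam01385", 162, 276, 5.60e-33),
    DomainHit("OrfB_Zn_ribbon", "pfam07282", 304, 371, 1.67e-27),
    DomainHit("HTH_OrfB_IS605", "pfam12323", 3, 48, 5.92e-11),
    DomainHit("tspaseT_teng_C", "TIGR01766", 244, 331, 1.63e-09),
    DomainHit("PHA02942", "PHA02942", 75, 379, 5.97e-07),
]


def significant_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Keep hits at or below the threshold, most significant first."""
    kept = [h for h in hits if h.e_value <= threshold]
    return sorted(kept, key=lambda h: h.e_value)


for hit in significant_hits(HITS):
    print(f"{hit.name:<15} {hit.accession:<10} {hit.start:>3}-{hit.end:<3} "
          f"{hit.e_value:.2e}  coverage={hit.coverage:.0%}")
```

Passing a stricter threshold, for example `significant_hits(HITS, 1e-20)`, narrows the list to the strongest hits.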
