Conserved domains on [gi|1373826342|dbj|BBD63659|]

hypothetical protein NIES2109_65340 (plasmid) [Nostoc sp. HK-01]

List of domain hits

Name                         Accession  Description             Interval  E-value
transpos_IS4_3 super family  cl41337    IS4 family transposase  52-466    6.07e-112

The actual alignment was detected with superfamily member NF033590:

Pssm-ID: 468099 [Multi-domain]  Cd Length: 403  Bit Score: 336.19  E-value: 6.07e-112
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342  52 YEFFSNPKTSFEKLTQPYFKQTAQEINGTPVVLAVGDTTFLDYKKILdKREEYGPIGNGGN---GLILHSCLALEPDFGQ 128
Cdd:NF033590    3 YRFIRNENVSAEDILEAHFQATVQRAKAHPLLLAIQDTTELNFTHHS-VREGLGHLGNQGKqsrGLLLHSTLLVAPETQQ 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 129 PLGLLWEKLWHREHKAPQpsnetqelkedrlKKERKAKRnkEFKEKESYRWVEAFSKIEKQfseleipVGGLSSKIIHVF 208
Cdd:NF033590   82 PLGLIEQQRWSRDIKTRG-------------KKRRRKRR--PYEEKESYKWLEASEAAAER-------LGSPMTQVISVC 139
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 209 DREGDIAEVFAQIsQTKNAGVVVRAAHNRCLEGENGHLWSYVNSTPVQFVKDVELVETKKRHARTATLEVRYCPVSISPP 288
Cdd:NF033590  140 DREADIYEYLEYK-TTNQQRFLVRAMQNRRLEEEDGKLYDYSSQLQSAGEYTVEIPQKGGRKARQARLEVRFAPVTLKPP 218
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 289 AR----LKNQGSFHVYAVYAREINCPENGEPVEWMLLTTELVDSEQSATQILRWYTYRWRVEEYHKILKSGCQAESYRL- 363
Cdd:NF033590  219 ANkrakAKELPSIPLNYVGCVEINPPEGEEPLEWHLLTSEPVTSLEQALEIIDWYELRWLIEDYHKVLKSGCKVEELRLq 298
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 364 AGESMSTMLGFLTVIAAQLLRMTYLYRNCPHLDA-SVVLTKEQMDVLIA-----SSPPKLKKDIEftvdWAIRAIARLGG 437
Cdd:NF033590  299 TKERLERMLVIYSFVAWRVLQLRELGRSDPELDCtSTLLSPKEWKLLYWlkkekKPPPEKPPSLK----WAYRWIAKLGG 374
                         410       420
                  ....*....|....*....|....*....
gi 1373826342 438 YLEHRKNSAIGIQVLWRGWLELETLCQGW 466
Cdd:NF033590  375 WLDRKRDGRPGWKTLWEGWFRLQDIAEGY 403
Tnp_DNA_bind super family    cl20643    Transposase DNA-binding  10-59    2.07e-07

Transposase DNA-binding; This domain occurs at the N-terminus of transposases, including E. coli tnpA. The tnpA gene encodes a transposase and an inhibitor protein; the inhibitor differs from the transposase only by the absence of the N-terminal 55 amino acids, which include most of this domain. The domain consists of alpha helices and turns and functions as a DNA-binding domain.


The actual alignment was detected with superfamily member pfam14706:

Pssm-ID: 434146  Cd Length: 57  Bit Score: 47.67  E-value: 2.07e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1373826342  10 GECNFGDKRLTQRAASIGELLSVKYGQPLSKIFKTASDLKRGYEFFSNPK 59
Cdd:pfam14706   8 GGADLGDKRLTKRLVKLAESLAEQPGASIPQACGDWAETKAAYRFLDNDR 57
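
As a rough check on the alignment above, the fraction of identical aligned positions can be computed directly from the two rows. The following is a minimal sketch and not part of the CD-Search output: the function name `percent_identity` is hypothetical, gap columns (`-`) are simply skipped, and the two strings are copied from the query row (residues 10-59) and the pfam14706 row (positions 8-57).

```python
def percent_identity(query, subject):
    """Return (identical, compared, percent) for two equal-length aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    compared = 0
    for q, s in zip(query, subject):
        # Assumption: columns containing a gap in either row are not counted.
        if q == "-" or s == "-":
            continue
        compared += 1
        if q.upper() == s.upper():
            identical += 1
    percent = 100.0 * identical / compared if compared else 0.0
    return identical, compared, percent


# Rows copied from the pfam14706 alignment block.
QUERY = "GECNFGDKRLTQRAASIGELLSVKYGQPLSKIFKTASDLKRGYEFFSNPK"
SUBJECT = "GGADLGDKRLTKRLVKLAESLAEQPGASIPQACGDWAETKAAYRFLDNDR"

if __name__ == "__main__":
    identical, compared, percent = percent_identity(QUERY, SUBJECT)
    print(f"{identical}/{compared} identical positions ({percent:.1f}%)")
```

Percent identity is only a coarse summary; the bit score and E-value reported by CD-Search come from a position-specific scoring matrix and are not reproduced by this count.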
 
Name            Accession  Description             Interval  E-value
transpos_IS4_3  NF033590   IS4 family transposase  52-466    6.07e-112

Pssm-ID: 468099 [Multi-domain]  Cd Length: 403  Bit Score: 336.19  E-value: 6.07e-112
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342  52 YEFFSNPKTSFEKLTQPYFKQTAQEINGTPVVLAVGDTTFLDYKKILdKREEYGPIGNGGN---GLILHSCLALEPDFGQ 128
Cdd:NF033590    3 YRFIRNENVSAEDILEAHFQATVQRAKAHPLLLAIQDTTELNFTHHS-VREGLGHLGNQGKqsrGLLLHSTLLVAPETQQ 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 129 PLGLLWEKLWHREHKAPQpsnetqelkedrlKKERKAKRnkEFKEKESYRWVEAFSKIEKQfseleipVGGLSSKIIHVF 208
Cdd:NF033590   82 PLGLIEQQRWSRDIKTRG-------------KKRRRKRR--PYEEKESYKWLEASEAAAER-------LGSPMTQVISVC 139
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 209 DREGDIAEVFAQIsQTKNAGVVVRAAHNRCLEGENGHLWSYVNSTPVQFVKDVELVETKKRHARTATLEVRYCPVSISPP 288
Cdd:NF033590  140 DREADIYEYLEYK-TTNQQRFLVRAMQNRRLEEEDGKLYDYSSQLQSAGEYTVEIPQKGGRKARQARLEVRFAPVTLKPP 218
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 289 AR----LKNQGSFHVYAVYAREINCPENGEPVEWMLLTTELVDSEQSATQILRWYTYRWRVEEYHKILKSGCQAESYRL- 363
Cdd:NF033590  219 ANkrakAKELPSIPLNYVGCVEINPPEGEEPLEWHLLTSEPVTSLEQALEIIDWYELRWLIEDYHKVLKSGCKVEELRLq 298
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 364 AGESMSTMLGFLTVIAAQLLRMTYLYRNCPHLDA-SVVLTKEQMDVLIA-----SSPPKLKKDIEftvdWAIRAIARLGG 437
Cdd:NF033590  299 TKERLERMLVIYSFVAWRVLQLRELGRSDPELDCtSTLLSPKEWKLLYWlkkekKPPPEKPPSLK----WAYRWIAKLGG 374
                         410       420
                  ....*....|....*....|....*....
gi 1373826342 438 YLEHRKNSAIGIQVLWRGWLELETLCQGW 466
Cdd:NF033590  375 WLDRKRDGRPGWKTLWEGWFRLQDIAEGY 403
Tnp_DNA_bind    pfam14706  Transposase DNA-binding  10-59    2.07e-07

Transposase DNA-binding; This domain occurs at the N-terminus of transposases, including E. coli tnpA. The tnpA gene encodes a transposase and an inhibitor protein; the inhibitor differs from the transposase only by the absence of the N-terminal 55 amino acids, which include most of this domain. The domain consists of alpha helices and turns and functions as a DNA-binding domain.


Pssm-ID: 434146  Cd Length: 57  Bit Score: 47.67  E-value: 2.07e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1373826342  10 GECNFGDKRLTQRAASIGELLSVKYGQPLSKIFKTASDLKRGYEFFSNPK 59
Cdd:pfam14706   8 GGADLGDKRLTKRLVKLAESLAEQPGASIPQACGDWAETKAAYRFLDNDR 57
Dimer_Tnp_Tn5   pfam02281  Transposase Tn5 dimerization domain  377-465  1.24e-04

Transposase Tn5 dimerization domain; Transposons are mobile DNA sequences capable of replication and insertion into the chromosome. Typically, a transposon encodes the transposase enzyme, which catalyzes insertion, between its terminal inverted repeats. Tn5 has a unique method of self-regulation in which a truncated version of the transposase acts as an inhibitor. The catalytic domain of the Tn5 transposase is found in pfam01609. This domain mediates dimerization in the known structure.


Pssm-ID: 426696  Cd Length: 99  Bit Score: 41.19  E-value: 1.24e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 377 VIAAQLLRMTYLYRNCPHLDASVVLTKEQMDVLIASSPPKLKKDIEfTVDWAIRAIARLGGYLEHRKNSAIGIQVLWRGW 456
Cdd:pfam02281   1 VIAWRINRLMRLGRTVPELDAELVFEPDEWRAAYILNKKPIPKKMP-GLNEVIRLIARRGGFLGRKGDGEPGARTLWLGL 79

                  ....*....
gi 1373826342 457 LELETLCQG 465
Cdd:pfam02281  80 QEIAVFVEG 88
transpos_IS4_1  NF033592   IS4 family transposase  264-387   3.00e-03

Pssm-ID: 468101 [Multi-domain]  Cd Length: 332  Bit Score: 39.56  E-value: 3.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373826342 264 VETKKRHARTATLEVRYCPVSISPPARLKNQGSfHVYAVYaREInCPENGEPVEWMLLTTELVDSEQSATQILRWYTYRW 343
Cdd:NF033592  212 YEVVEELGETDELQDVYVDTEESLQARKKKPQL-PEKKKL-RLV-SVRDEEGEKEYVLLTNLPDPRLPAEEIAELYRLRW 288
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1373826342 344 RVEEYHKILKSGCQAESYRLAGESMSTMLGFLTVIAAQLLRMTY 387
Cdd:NF033592  289 QIELLFKELKSHLQLDHLRSKSPEAVEQELWGALIAYLLLRLLM 332
 
BLAST search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
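
To illustrate how the E-value threshold relates to the hits listed on this page, the sketch below re-applies a cutoff to the four specific hits and reports the length of each query interval. It is an illustration only: the `DomainHit` class and `filter_hits` function are hypothetical names, the values are copied from the hit list, and treating the cutoff as inclusive (`<=`) is an assumption, since the page does not say how CD-Search handles a hit exactly at the threshold.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    name: str
    accession: str
    start: int   # first query residue in the hit interval
    end: int     # last query residue in the hit interval
    evalue: float

    @property
    def length(self):
        """Number of query residues spanned by the interval (inclusive)."""
        return self.end - self.start + 1


# Specific hits copied from the list of domain hits on this page.
HITS = [
    DomainHit("transpos_IS4_3", "NF033590", 52, 466, 6.07e-112),
    DomainHit("Tnp_DNA_bind", "pfam14706", 10, 59, 2.07e-07),
    DomainHit("Dimer_Tnp_Tn5", "pfam02281", 377, 465, 1.24e-04),
    DomainHit("transpos_IS4_1", "NF033592", 264, 387, 3.00e-03),
]


def filter_hits(hits, threshold=0.01):
    """Keep hits at or below the threshold, most significant first."""
    kept = [h for h in hits if h.evalue <= threshold]
    return sorted(kept, key=lambda h: h.evalue)


if __name__ == "__main__":
    for hit in filter_hits(HITS):
        print(f"{hit.accession:<10} {hit.start}-{hit.end} "
              f"({hit.length} residues)  E-value {hit.evalue:.2e}")
```

Passing a stricter value, for example `filter_hits(HITS, threshold=1e-5)`, keeps only the hits whose E-values fall at or below that cutoff.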
