Conserved domains on  [gi|1194622937|ref|WP_085955203|]

MULTISPECIES: IS3-like element ISKpn1 family transposase [Bacteria]

Protein Classification

transposase (domain architecture ID 15202490)

transposase binds to the end of a transposon and catalyzes the movement of the transposon to another part of the genome by a cut-and-paste mechanism or a replicative transposition mechanism; similar to Escherichia coli insertion element IS150 protein InsJ

Gene Ontology:  GO:0004803|GO:0003677
PubMed:  20885819
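
Each domain hit listed on this page carries a summary line of the form `Pssm-ID: 468052 [Multi-domain]  Cd Length: 369  Bit Score: 392.31  E-value: 1.12e-134`. The following is a minimal Python sketch for extracting those fields from the plain text, assuming the labels and their order stay exactly as they appear here. The names `SUMMARY_RE` and `parse_summary` are illustrative and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this page's text.
# Assumption: labels and field order are fixed; the bracketed hit type is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[0-9.]+)\s*"
    r"E-value:\s*(?P<e_value>[0-9.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


example = "Pssm-ID: 468052 [Multi-domain]  Cd Length: 369  Bit Score: 392.31  E-value: 1.12e-134"
hit = parse_summary(example)
print(hit)
```

Applied line by line to a saved copy of the page, this yields one record per hit. Lines that do not match return `None` and can be skipped.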


List of domain hits

Name Accession Description Interval E-value
transpos_IS3  NF033516  IS3 family transposase  64-447  1.12e-134

Pssm-ID: 468052 [Multi-domain]  Cd Length: 369  Bit Score: 392.31  E-value: 1.12e-134
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937  64 EFKLVVVRAVISDRLTMREAAARFNLSAEiLVRRWLDVYNDAGAEGLlnmqcgrpgqmtkPKNIPPLTDKELEKLspEEL 143
Cdd:NF033516    1 EFKLEAVREVLEGGKSVAEVARELGISPS-TLYRWRKKYRGGGEAAD-------------AGRLKELLTPEEEEN--RRL 64
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 144 RAELRYLRAENAYPKKVESLGSERKKWQkalIISELRHEHALRDLLRAAGMSRSTWYYNMNAL--KQGDRYAGLKENIRK 221
Cdd:NF033516   65 KRELAELRLENEILKKARKLLRPAVKYA---LIDALRGEYSVRRACRVLGVSRSTYYYWRKRPpsRRAPDDAELRARIRE 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 222 IYHYHKGRYGYRRITLALRKQGLRINHKTVQRLMAELSLRSVIRaKKYRAWKGRTGE---AAPNILSRNFGASKANEKWV 298
Cdd:NF033516  142 IFEESRGRYGYRRITALLRREGIRVNHKRVYRLMRELGLLARRR-RKRRPYTTDSGHvhpVAPNLLNRQFTATRPNQVWV 220
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 299 TDVTEFPVQGKKLYLSSVLDLFNREVIAYSLSERPVMEMVNTMLDGAFPKLRPGDAPLLHSDQGWHYRMRSYQERLKAHG 378
Cdd:NF033516  221 TDITYIRTAEGWLYLAVVLDLFSREIVGWSVSTSMSAELVLDALEMAIEWRGKPEGLILHSDNGSQYTSKAYREWLKEHG 300
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1194622937 379 MTQSMSRKGNCLDNAVMENFFGTLKSECFYLREFRSVSALRKAVEDYIHYYNNERISLKLKGLSPVEYR 447
Cdd:NF033516  301 ITQSMSRPGNCWDNAVAESFFGTLKRECLYRRRFRTLEEARQAIEEYIEFYNHERPHSSLGYLTPAEFE 369
InsE  COG2963  Transposase InsE and inactivated derivatives [Mobilome: prophages, transposons]  1-80  1.89e-05

Pssm-ID: 442203 [Multi-domain]  Cd Length: 93  Bit Score: 42.99  E-value: 1.89e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937   1 MAKP--KYSPETKLAVVNHYLSGKDGEQSTADLFGIERTSVRRWVRAWQFHGAEGLTAKNNHYSDEFKLVVVRAVIsDRL 78
Cdd:COG2963     1 MSKKrrRYSPEFKAEAVRLVLEGGASVAEVARELGISPSTLYRWVRQYREGGLGGFPGDGRTTPEQAEIRRLRKEL-RRL 79

                  ..
gi 1194622937  79 TM 80
Cdd:COG2963    80 EM 81
 
Tra5  COG2801  Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons]  143-454  9.60e-103

Pssm-ID: 442053 [Multi-domain]  Cd Length: 309  Bit Score: 308.62  E-value: 9.60e-103
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 143 LRAELRYLRAENAYPKKVESLGSERKKWQKALIISELRHEHALRDLLRAAGMSRSTWYYNMNALKQGDRYAGLKENIRKI 222
Cdd:COG2801     1 ELAEEEELRKEEELLRRLLLLLRLLLLRRRVLRRVSRRRRRLLRLLRRRRARSRRRRRLRRPRSYRADEDAELLERIKEI 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 223 YHYHKgRYGYRRITLALRKQGLRINHKTVQRLMAELSLRSVIRAK-KYRAWKGRTGEAAPNILsrnFGASKANEKWVTDV 301
Cdd:COG2801    81 FAESP-RYGYRRITAELRREGIAVNRKRVRRLMRELGLQARRRRKkKYTTYSGHGGPIAPNLL---FTATAPNQVWVTDI 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 302 TEFPVQGKKLYLSSVLDLFNREVIAYSLSERPVMEMVNTMLDGAFPKLRPGDAPLLHSDQGWHYRMRSYQERLKAHGMTQ 381
Cdd:COG2801   157 TYIPTAEGWLYLAAVIDLFSREIVGWSVSDSMDAELVVDALEMAIERRGPPKPLILHSDNGSQYTSKAYQELLKKLGITQ 236
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1194622937 382 SMSRKGNCLDNAVMENFFGTLKSECFYLREFRSVSALRKAVEDYIHYYNNERISLKLKGLSPVEYRTQALRAA 454
Cdd:COG2801   237 SMSRPGNPQDNAFIESFFGTLKYELLYRRRFESLEEAREAIEEYIEFYNHERPHSSLGYLTPAEYEKQLAAAA 309
PHA02517  PHA02517  putative transposase OrfB; Reviewed  193-445  1.98e-55

Pssm-ID: 222853 [Multi-domain]  Cd Length: 277  Bit Score: 185.45  E-value: 1.98e-55
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 193 GMSRSTWYYNMNALKQGD-------RYAGLKENIRKIYHYHKGRYGYRRITLALRKQGLRINHKTVQRLMAELSLRSVIR 265
Cdd:PHA02517    2 GIAPSTYYRCQQQRHHPDkrraraqHDDWLKSEILRVYDENHQVYGVRKVWRQLNREGIRVARCTVGRLMKELGLAGVLR 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 266 AKKYRAWKGRTGEAAPNILSRNFGASKANEKWVTDVTEFPVQGKKLYLSSVLDLFNREVIAYSLSERPVMEMVNTMLDGA 345
Cdd:PHA02517   82 GKKVRTTISRKAVAAPDRVNRQFVATRPNQLWVADFTYVSTWQGWVYVAFIIDVFARRIVGWRVSSSMDTDFVLDALEQA 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 346 -FPKLRPGDApLLHSDQGWHYRMRSYQERLKAHGMTQSMSRKGNCLDNAVMENFFGTLKSECFYLREFRSVSALRKAVED 424
Cdd:PHA02517  162 lWARGRPGGL-IHHSDKGSQYVSLAYTQRLKEAGIRASTGSRGDSYDNAPAESINGLYKAEVIHRVSWKNREEVELATLE 240
                         250       260
                  ....*....|....*....|.
gi 1194622937 425 YIHYYNNERISLKLKGLSPVE 445
Cdd:PHA02517  241 WVAWYNNRRLHERLGYTPPAE 261
rve  pfam00665  Integrase core domain  293-389  7.60e-26

Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc-binding domain (pfam02022). This domain is the central catalytic domain. The carboxyl-terminal domain is a non-specific DNA-binding domain (pfam00552). The catalytic domain acts as an endonuclease when two nucleotides are removed from the 3' ends of the blunt-ended viral DNA made by reverse transcription. This domain also catalyzes the DNA strand transfer reaction of the 3' ends of the viral DNA to the 5' ends of the integration site.

Pssm-ID: 459897 [Multi-domain]  Cd Length: 98  Bit Score: 100.85  E-value: 7.60e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 293 ANEKWVTDVTEFPV--QGKKLYLSSVLDLFNREVIAYSLSERPVMEMVNTMLDGAFpKLRPGDAPLLHSDQGWHYRMRSY 370
Cdd:pfam00665   1 PNQLWQGDFTYIRIpgGGGKLYLLVIVDDFSREILAWALSSEMDAELVLDALERAI-AFRGGVPLIIHSDNGSEYTSKAF 79
                          90
                  ....*....|....*....
gi 1194622937 371 QERLKAHGMTQSMSRKGNC 389
Cdd:pfam00665  80 REFLKDLGIKPSFSRPGNP 98
transpos_IS481  NF033577  IS481 family transposase  65-445  7.17e-15

Pssm-ID: 468094 [Multi-domain]  Cd Length: 283  Bit Score: 74.55  E-value: 7.17e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937  65 FKLVVVRAVISDRLTMREAAARFNLSAEIlVRRWLDVYNDAGAEGLLNMQcgrpgqmTKPKNIPPLTDKELEKlspeelr 144
Cdd:NF033577    1 GRLELVRLVLEDGWSVREAARRFGISRKT-VYKWLKRYRAGGEEGLIDRS-------RRPHRSPRRTSPETEA------- 65
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 145 aelrylraenaypkkveslgserkkwqkalIISELRHEHalrdllraagmsrstwyynmnalkqgdryaglkenirkiyh 224
Cdd:NF033577   66 ------------------------------RILALRREL----------------------------------------- 74
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 225 yhkgRYGYRRITLALRKQGLRINHKTVQRLMAELSL-RSVIRAKKYRAWKgrtgeaapnilsrNFGASKANEKWVTDVTE 303
Cdd:NF033577   75 ----RLGPRRIAYELERQGPGVSRSTVHRILRRHGLsRLRALDRKTGKVK-------------RYERAHPGELWHIDIKK 137
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 304 FP--VQGKKLYLSSVLDLFNREVIAYSL-SERPvmEMVNTMLDGAF-----PKLRpgdaplLHSDQGWHYR--MRSYQER 373
Cdd:NF033577  138 LGriPDVGRLYLHTAIDDHSRFAYAELYpDETA--ETAADFLRRAFaehgiPIRR------VLTDNGSEFRsrAHGFELA 209
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1194622937 374 LKAHGMTQSMSRKGNCLDNAVMENFFGTLKSECFYLREFRSVSALRKAVEDYIHYYNNERISLKLKGLSPVE 445
Cdd:NF033577  210 LAELGIEHRRTRPYHPQTNGKVERFHRTLKDEFAYARPYESLAELQAALDEWLHHYNHHRPHSALGGKTPAE 281
HTH_28  pfam13518  Helix-turn-helix domain  11-57  1.09e-04

This helix-turn-helix domain is often found in transposases and is likely to be DNA-binding.

Pssm-ID: 463908 [Multi-domain]  Cd Length: 52  Bit Score: 39.88  E-value: 1.09e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1194622937  11 KLAVVNHYLSGKDGEQsTADLFGIERTSVRRWVRAWQFHGAEGLTAK 57
Cdd:pfam13518   2 RLKIVLLALEGESIKE-AARLFGISRSTVYRWIRRYREGGLEGLLPR 47
transpos_IS630  NF033545  IS630 family transposase  66-196  3.02e-03

Pssm-ID: 468076 [Multi-domain]  Cd Length: 298  Bit Score: 39.55  E-value: 3.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937  66 KLVVVRAVISDRLTMREAAARFNLSAEIlVRRWLDVYNDAGAEGLLNMQcgRPGQmtkpkniPPltdkeleKLSPEELRA 145
Cdd:NF033545    1 RRARILLLAAEGLSITEIAERLGVSRST-VYRWLKRFNEGGLEGLLDKP--RPGR-------PR-------KLLSEQQAE 63
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1194622937 146 ELRYLRAENaypkkveslGSERKKWQKALIISELRHEH-------ALRDLLRAAGMSR 196
Cdd:NF033545   64 LLALLLEEP---------PEGAGHWTLRELAALLEEEFgveysrsTVRRLLKRLGLSP 112
transpos_ISNCY_2  NF033594  ISNCY family transposase  69-118  3.69e-03

The ISNCY insertion sequence family, as defined by ISFinder, encodes several apparently unrelated families of transposases. Members of this family resemble the transposases of ISNCY family elements such as IS1202, ISTde1, ISKpn21, and ISCARN1.

Pssm-ID: 468103 [Multi-domain]  Cd Length: 367  Bit Score: 39.39  E-value: 3.69e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1194622937  69 VVRAVISDRLTMREAAARFNLSaEILVRRWLDVYNDAGAEGLLNMQCGRP 118
Cdd:NF033594    2 VIQKVVDGRLTVKEAAELLGLS-ERQVRRLLKRYREEGAAGLVHGNRGRP 50
transpos_IS630  NF033545  IS630 family transposase  29-99  5.66e-03

Pssm-ID: 468076 [Multi-domain]  Cd Length: 298  Bit Score: 38.78  E-value: 5.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937  29 ADLFGIERTSVRRWVRAWQFHGAEGLTAKNNH-----YSDEFKLVVVRAVISDR------LTMREAAARFNLSAEIL--- 94
Cdd:NF033545   19 AERLGVSRSTVYRWLKRFNEGGLEGLLDKPRPgrprkLLSEQQAELLALLLEEPpegaghWTLRELAALLEEEFGVEysr 98

                  ....*..
gi 1194622937  95 --VRRWL 99
Cdd:NF033545   99 stVRRLL 105
 
rve_2  pfam13333  Integrase core domain  396-451  3.94e-18

Pssm-ID: 372570 [Multi-domain]  Cd Length: 52  Bit Score: 77.69  E-value: 3.94e-18
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1194622937 396 ENFFGTLKSECFYLREFRSVSALRKAVEDYIHYYNNERislkLKGLSPVEYRTQAL 451
Cdd:pfam13333   1 ESFFGSLKTEMVYGEHFKTLEELELAIFDYIEWYNNKR----LKGLSPVQYRNQSL 52
HTH_21  pfam13276  HTH-like domain  215-268  6.29e-16

This domain contains a predicted helix-turn-helix suggesting a DNA-binding function.

Pssm-ID: 463824 [Multi-domain]  Cd Length: 60  Bit Score: 71.83  E-value: 6.29e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1194622937 215 LKENIRKIYHYHKGRYGYRRITLALRKQG-LRINHKTVQRLMAELSLRSVIRAKK 268
Cdd:pfam13276   6 LLEAIREIFEESRGTYGYRRITAELRREGgIRVNRKRVARLMRELGLRARRRRKR 60
rve_3  pfam13683  Integrase core domain  377-443  2.45e-12

Pssm-ID: 433402 [Multi-domain]  Cd Length: 67  Bit Score: 61.85  E-value: 2.45e-12
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1194622937 377 HGMTQSMSRKGNCLDNAVMENFFGTLKSECFYLREFRSVSALRKAVEDYIHYYNNERISLKLKGLSP 443
Cdd:pfam13683   1 LGIEISYIAPGKPMQNGLVESFNGTLRDECLNEHLFSSLAEARALLAAWREDYNTERPHSSLGYRTP 67
InsE  COG2963  Transposase InsE and inactivated derivatives [Mobilome: prophages, transposons]  55-163  1.91e-07

Pssm-ID: 442203 [Multi-domain]  Cd Length: 93  Bit Score: 48.77  E-value: 1.91e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937  55 TAKNNHYSDEFKLVVVRAVISDRLTMREAAARFNLSAEiLVRRWLDVYNDAGAEGllnmqcgrpgqmtkPKNIPPLTDKE 134
Cdd:COG2963     2 SKKRRRYSPEFKAEAVRLVLEGGASVAEVARELGISPS-TLYRWVRQYREGGLGG--------------FPGDGRTTPEQ 66
                          90       100
                  ....*....|....*....|....*....
gi 1194622937 135 LEKlspEELRAELRYLRAENAYPKKVESL 163
Cdd:COG2963    67 AEI---RRLRKELRRLEMENDILKKAAAL 92
PRK14702  PRK14702  insertion element IS2 transposase InsD; Provisional  230-449  4.82e-05

Pssm-ID: 237792 [Multi-domain]  Cd Length: 262  Bit Score: 44.72  E-value: 4.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 230 YGYRRITLALRKQG-----LRINHKTVQRLMAELSLRSVIRAKKYRAWKGRTGEAApnilsrnfgASKANEKWVTDVTEF 304
Cdd:PRK14702   27 YGYRRVWALLRRQAeldgmPAINAKRVYRLMRQNALLLERKPAVPPSKRAHTGRVA---------VKESNQRWCSDGFEF 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 305 PV-QGKKLYLSSVLDLFNREVIAYSLSERPV-MEMVNTMLDGAFPKLRPGDAPLLH----SDQGWHYRMRSYQERLKAHG 378
Cdd:PRK14702   98 CCdNGERLRVTFALDCCDREALHWAVTTGGFnSETVQDVMLGAVERRFGNDLPSSPvewlTDNGSCYRANETRQFARMLG 177
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1194622937 379 MTQSMSRKGNCLDNAVMENFFGTLKSECFYLREFRSVSALRKAVEDYIHYYNNERISLKLKGLSPVEYRTQ 449
Cdd:PRK14702  178 LEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQ 248
PRK09409  PRK09409  IS2 transposase TnpB; Reviewed  230-449  8.77e-05

Pssm-ID: 181829 [Multi-domain]  Cd Length: 301  Bit Score: 44.32  E-value: 8.77e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 230 YGYRRITLALRKQG-----LRINHKTVQRLMAELSLRSVIRAKKYRAWKGRTGEAApnilsrnfgASKANEKWVTDVTEF 304
Cdd:PRK09409   66 YGYRRVWALLRRQAeldgmPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVA---------VKESNQRWCSDGFEF 136
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937 305 PV-QGKKLYLSSVLDLFNREVIAYSLS-----ERPVMEMVNTMLDGAFPKLRPGDAPLLHSDQGWHYRMRSYQERLKAHG 378
Cdd:PRK09409  137 CCdNGERLRVTFALDCCDREALHWAVTtggfnSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLG 216
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1194622937 379 MTQSMSRKGNCLDNAVMENFFGTLKSECFYLREFRSVSALRKAVEDYIHYYNNERISLKLKGLSPVEYRTQ 449
Cdd:PRK09409  217 LEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQ 287
HTH_28 pfam13518
Helix-turn-helix domain; This helix-turn-helix domain is often found in transposases and is likely to be DNA-binding.
65-112 2.32e-04

Pssm-ID: 463908 [Multi-domain]  Cd Length: 52  Bit Score: 38.73  E-value: 2.32e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1194622937  65 FKLVVVRAVISDRlTMREAAARFNLSAEiLVRRWLDVYNDAGAEGLLN 112
Cdd:pfam13518   1 ERLKIVLLALEGE-SIKEAARLFGISRS-TVYRWIRRYREGGLEGLLP 46
transpos_IS630 NF033545
IS630 family transposase;
66-196 3.02e-03

Pssm-ID: 468076 [Multi-domain]  Cd Length: 298  Bit Score: 39.55  E-value: 3.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937  66 KLVVVRAVISDRLTMREAAARFNLSAEIlVRRWLDVYNDAGAEGLLNMQcgRPGQmtkpkniPPltdkeleKLSPEELRA 145
Cdd:NF033545    1 RRARILLLAAEGLSITEIAERLGVSRST-VYRWLKRFNEGGLEGLLDKP--RPGR-------PR-------KLLSEQQAE 63
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1194622937 146 ELRYLRAENaypkkveslGSERKKWQKALIISELRHEH-------ALRDLLRAAGMSR 196
Cdd:NF033545   64 LLALLLEEP---------PEGAGHWTLRELAALLEEEFgveysrsTVRRLLKRLGLSP 112
transpos_ISNCY_2 NF033594
ISNCY family transposase; The ISNCY insertion sequence family, as defined by ISFinder, encodes several apparently unrelated families of transposases. Members of this family resemble the transposases of ISNCY family elements such as IS1202, ISTde1, ISKpn21, and ISCARN1.
69-118 3.69e-03

Pssm-ID: 468103 [Multi-domain]  Cd Length: 367  Bit Score: 39.39  E-value: 3.69e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1194622937  69 VVRAVISDRLTMREAAARFNLSaEILVRRWLDVYNDAGAEGLLNMQCGRP 118
Cdd:NF033594    2 VIQKVVDGRLTVKEAAELLGLS-ERQVRRLLKRYREEGAAGLVHGNRGRP 50
HTH_23 pfam13384
Homeodomain-like domain;
9-54 4.76e-03

Pssm-ID: 433164 [Multi-domain]  Cd Length: 50  Bit Score: 34.94  E-value: 4.76e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1194622937   9 ETKLAVVNHYLSGKDGEQsTADLFGIERTSVRRWVRAWQFHGAEGL 54
Cdd:pfam13384   5 RRRARALLLLAEGLSVKE-IAELLGVSRRTVYRWLKRYNEEGLEGL 49
transpos_IS630 NF033545
IS630 family transposase;
29-99 5.66e-03

Pssm-ID: 468076 [Multi-domain]  Cd Length: 298  Bit Score: 38.78  E-value: 5.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1194622937  29 ADLFGIERTSVRRWVRAWQFHGAEGLTAKNNH-----YSDEFKLVVVRAVISDR------LTMREAAARFNLSAEIL--- 94
Cdd:NF033545   19 AERLGVSRSTVYRWLKRFNEGGLEGLLDKPRPgrprkLLSEQQAELLALLLEEPpegaghWTLRELAALLEEEFGVEysr 98

                  ....*..
gi 1194622937  95 --VRRWL 99
Cdd:NF033545   99 stVRRLL 105
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
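
As a rough illustration of how the hit summary lines on this page relate to the E-value threshold, the Python sketch below parses a few of the `Pssm-ID` lines copied from the hit list and checks each against 0.01. The names `parse_summary` and `passes_threshold` are hypothetical, and treating the threshold as inclusive is an assumption; this is not a description of how CD-Search applies the cutoff internally.

```python
import re

# Summary lines copied verbatim from the hit list on this page.
SUMMARY_LINES = [
    "Pssm-ID: 181829 [Multi-domain]  Cd Length: 301  Bit Score: 44.32  E-value: 8.77e-05",
    "Pssm-ID: 463908 [Multi-domain]  Cd Length: 52  Bit Score: 39.88  E-value: 1.09e-04",
    "Pssm-ID: 468076 [Multi-domain]  Cd Length: 298  Bit Score: 38.78  E-value: 5.66e-03",
]

# Pattern for the four numeric fields of a summary line.
PATTERN = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


hits = [parse_summary(line) for line in SUMMARY_LINES]
for hit in hits:
    print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```

Every hit listed on this page would be expected to pass such a check, since the page only reports hits under the preset threshold.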

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.