NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720408942|ref|XP_030109448|]
View 

leucine zipper protein 1 isoform X2 [Mus musculus]

Protein Classification

coiled-coil domain-containing protein( domain architecture ID 1000037)

coiled-coil domain-containing protein contains a region with alpha-helical coiled-coil sequence signatures that is being annotated by a variety of protein family models, not necessarily indicating family membership

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PRK03918 super family cl35229
DNA double-strand break repair ATPase Rad50;
104-353 2.77e-12

DNA double-strand break repair ATPase Rad50;


The actual alignment was detected with superfamily member PRK03918:

Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 71.25  E-value: 2.77e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  104 RELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSEL 183
Cdd:PRK03918   172 KEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  184 EKLKSLTLSFVNERKYLNEKEKEN---EKIIKELTQ---------KLEQNKKMNRDHMRNASTFLERNDLRIEdGISSTL 251
Cdd:PRK03918   252 GSKRKLEEKIRELEERIEELKKEIeelEEKVKELKElkekaeeyiKLSEFYEEYLDELREIEKRLSRLEEEIN-GIEERI 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  252 SSKESKrKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHF--ESLEEELKKMRAKNNDLQDNYLT 329
Cdd:PRK03918   331 KELEEK-EERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLtpEKLEKELEELEKAKEEIEEEISK 409
                          250       260
                   ....*....|....*....|....
gi 1720408942  330 ELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:PRK03918   410 ITARIGELKKEIKELKKAIEELKK 433
DR0291 super family cl34310
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
14-185 2.69e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


The actual alignment was detected with superfamily member COG1579:

Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.84  E-value: 2.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   14 RHLRFKLQSLSRRLDELEE----ATKNLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEGKDEEIKRAEDLc 89
Cdd:COG1579     20 DRLEHRLKELPAELAELEDelaaLEARLEAAKTELEDLEKEIKRLE-------LEIEEVEARIKKYEEQLGNVRNNKEY- 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   90 htmkekleeeenltRELKSEIERLQKRMVDLEK----LEEALSRSKNECSQLclsLNEERNLTKKISSELEMLRVKVKEL 165
Cdd:COG1579     92 --------------EALQKEIESLKRRISDLEDeileLMERIEELEEELAEL---EAELAELEAELEEKKAELDEELAEL 154
                          170       180
                   ....*....|....*....|
gi 1720408942  166 ESSEDRLDKTEQSLVSELEK 185
Cdd:COG1579    155 EAELEELEAEREELAAKIPP 174
 
Name Accession Description Interval E-value
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
104-353 2.77e-12

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 71.25  E-value: 2.77e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  104 RELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSEL 183
Cdd:PRK03918   172 KEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  184 EKLKSLTLSFVNERKYLNEKEKEN---EKIIKELTQ---------KLEQNKKMNRDHMRNASTFLERNDLRIEdGISSTL 251
Cdd:PRK03918   252 GSKRKLEEKIRELEERIEELKKEIeelEEKVKELKElkekaeeyiKLSEFYEEYLDELREIEKRLSRLEEEIN-GIEERI 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  252 SSKESKrKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHF--ESLEEELKKMRAKNNDLQDNYLT 329
Cdd:PRK03918   331 KELEEK-EERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLtpEKLEKELEELEKAKEEIEEEISK 409
                          250       260
                   ....*....|....*....|....
gi 1720408942  330 ELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:PRK03918   410 ITARIGELKKEIKELKKAIEELKK 433
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
7-406 6.04e-11

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 66.88  E-value: 6.04e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942    7 YKDAASNRHLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEGKDEEIKRAE 86
Cdd:COG1196    218 LKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELE-------AELEELRLELEELELELEEAQAEE 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   87 dlchtmkekleeeenltRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEER-NLTKKISSELEMLRVKVKEL 165
Cdd:COG1196    291 -----------------YELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELeELEEELEELEEELEEAEEEL 353
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  166 ESSEDRLDKTEQSLVSELEKLKSLtlsfvnerkyLNEKEKENEKIIKELTQKLEQNKKMNRdhmrnastfLERNDLRIED 245
Cdd:COG1196    354 EEAEAELAEAEEALLEAEAELAEA----------EEELEELAEELLEALRAAAELAAQLEE---------LEEAEEALLE 414
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  246 GISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIkhfESLEEELKKMRAKNNDLQD 325
Cdd:COG1196    415 RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA---ALLEAALAELLEELAEAAA 491
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  326 NYLTELNRNRSLASQLEEIKlqvRKQKELGNGDIEGEDAFLLGRGRHERTKLKGHGSEASVSKHTSRELSPQHKRERLRN 405
Cdd:COG1196    492 RLLLLLEAEADYEGFLEGVK---AALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKA 568

                   .
gi 1720408942  406 R 406
Cdd:COG1196    569 A 569
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
103-362 1.59e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 65.47  E-value: 1.59e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  103 TRELKSEIERLQKRMVDLEKLEEAL----SRSKNECSQLCLSLNEERNLTKKISSELEML---RVKVKE-LESSEDRLDK 174
Cdd:TIGR02169  669 SRSEPAELQRLRERLEGLKRELSSLqselRRIENRLDELSQELSDASRKIGEIEKEIEQLeqeEEKLKErLEELEEDLSS 748
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  175 TEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKkmnRDHMRNASTFLERNDLRIEDgISSTLSSK 254
Cdd:TIGR02169  749 LEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI---QAELSKLEEEVSRIEARLRE-IEQKLNRL 824
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  255 ESKRKGSLDYLKQVENETRDKSENEKNRNQE----DNKVKDLNQEIEKLKTQIKHFES----LEEELKKMRAKNNDLQDN 326
Cdd:TIGR02169  825 TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEienlNGKKEELEEELEELEAALRDLESrlgdLKKERDELEAQLRELERK 904
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1720408942  327 Y---LTELNRNRSLASQLEEiKLQVRKQ--KELGNGDIEGE 362
Cdd:TIGR02169  905 IeelEAQIEKKRKRLSELKA-KLEALEEelSEIEDPKGEDE 944
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
61-353 5.39e-07

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 53.82  E-value: 5.39e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   61 TLAEIEVLRQRVLKIEGKDEEIKRAEDlchtmKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLS 140
Cdd:pfam02463  174 ALKKLIEETENLAELIIDLEELKLQEL-----KLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRD 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  141 LNEERNLTKKI---SSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQK 217
Cdd:pfam02463  249 EQEEIESSKQEiekEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEK 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  218 LEQNKKMNRDHMRNASTFLERNDLRIEDGISSTLSS--KESKRKGSLDYLKQVENETRDKSENEKN-----RNQEDNKVK 290
Cdd:pfam02463  329 ELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLqeKLEQLEEELLAKKKLESERLSSAAKLKEeelelKSEEEKEAQ 408
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720408942  291 DLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:pfam02463  409 LLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSE 471
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
14-185 2.69e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.84  E-value: 2.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   14 RHLRFKLQSLSRRLDELEE----ATKNLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEGKDEEIKRAEDLc 89
Cdd:COG1579     20 DRLEHRLKELPAELAELEDelaaLEARLEAAKTELEDLEKEIKRLE-------LEIEEVEARIKKYEEQLGNVRNNKEY- 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   90 htmkekleeeenltRELKSEIERLQKRMVDLEK----LEEALSRSKNECSQLclsLNEERNLTKKISSELEMLRVKVKEL 165
Cdd:COG1579     92 --------------EALQKEIESLKRRISDLEDeileLMERIEELEEELAEL---EAELAELEAELEEKKAELDEELAEL 154
                          170       180
                   ....*....|....*....|
gi 1720408942  166 ESSEDRLDKTEQSLVSELEK 185
Cdd:COG1579    155 EAELEELEAEREELAAKIPP 174
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
262-346 4.86e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 40.39  E-value: 4.86e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   262 LDYLKQVENETRDKSENEKNRNQEdnKVKDLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNrnrSLASQL 341
Cdd:smart00787  188 LRQLKQLEDELEDCDPTELDRAKE--KLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIA---EAEKKL 262

                    ....*
gi 1720408942   342 EEIKL 346
Cdd:smart00787  263 EQCRG 267
 
Name Accession Description Interval E-value
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
104-353 2.77e-12

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 71.25  E-value: 2.77e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  104 RELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSEL 183
Cdd:PRK03918   172 KEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  184 EKLKSLTLSFVNERKYLNEKEKEN---EKIIKELTQ---------KLEQNKKMNRDHMRNASTFLERNDLRIEdGISSTL 251
Cdd:PRK03918   252 GSKRKLEEKIRELEERIEELKKEIeelEEKVKELKElkekaeeyiKLSEFYEEYLDELREIEKRLSRLEEEIN-GIEERI 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  252 SSKESKrKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHF--ESLEEELKKMRAKNNDLQDNYLT 329
Cdd:PRK03918   331 KELEEK-EERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLtpEKLEKELEELEKAKEEIEEEISK 409
                          250       260
                   ....*....|....*....|....
gi 1720408942  330 ELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:PRK03918   410 ITARIGELKKEIKELKKAIEELKK 433
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
7-406 6.04e-11

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 66.88  E-value: 6.04e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942    7 YKDAASNRHLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEGKDEEIKRAE 86
Cdd:COG1196    218 LKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELE-------AELEELRLELEELELELEEAQAEE 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   87 dlchtmkekleeeenltRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEER-NLTKKISSELEMLRVKVKEL 165
Cdd:COG1196    291 -----------------YELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELeELEEELEELEEELEEAEEEL 353
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  166 ESSEDRLDKTEQSLVSELEKLKSLtlsfvnerkyLNEKEKENEKIIKELTQKLEQNKKMNRdhmrnastfLERNDLRIED 245
Cdd:COG1196    354 EEAEAELAEAEEALLEAEAELAEA----------EEELEELAEELLEALRAAAELAAQLEE---------LEEAEEALLE 414
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  246 GISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIkhfESLEEELKKMRAKNNDLQD 325
Cdd:COG1196    415 RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA---ALLEAALAELLEELAEAAA 491
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  326 NYLTELNRNRSLASQLEEIKlqvRKQKELGNGDIEGEDAFLLGRGRHERTKLKGHGSEASVSKHTSRELSPQHKRERLRN 405
Cdd:COG1196    492 RLLLLLEAEADYEGFLEGVK---AALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKA 568

                   .
gi 1720408942  406 R 406
Cdd:COG1196    569 A 569
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
28-345 1.39e-10

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 65.47  E-value: 1.39e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   28 DELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDLCHTMKEKLEEEENLTRELK 107
Cdd:PRK03918   193 ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELK 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  108 SEIERLQKRMVDLEKLEE------ALSRSKNEcsqlclSLNEERNLTK---KISSELEMLRVKVKELESSEDRLDKTEQS 178
Cdd:PRK03918   273 KEIEELEEKVKELKELKEkaeeyiKLSEFYEE------YLDELREIEKrlsRLEEEINGIEERIKELEEKEERLEELKKK 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  179 LvSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTFLERNDLRIEDGISStLSSKESKR 258
Cdd:PRK03918   347 L-KELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGE-LKKEIKEL 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  259 KGSLDYLKQVENET----RDKSENEKnrnqeDNKVKDLNQEIEKLKTQIKHFESLEEELKKmRAKNNDLQDNYLTELNRN 334
Cdd:PRK03918   425 KKAIEELKKAKGKCpvcgRELTEEHR-----KELLEEYTAELKRIEKELKEIEEKERKLRK-ELRELEKVLKKESELIKL 498
                          330
                   ....*....|.
gi 1720408942  335 RSLASQLEEIK 345
Cdd:PRK03918   499 KELAEQLKELE 509
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
103-362 1.59e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 65.47  E-value: 1.59e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  103 TRELKSEIERLQKRMVDLEKLEEAL----SRSKNECSQLCLSLNEERNLTKKISSELEML---RVKVKE-LESSEDRLDK 174
Cdd:TIGR02169  669 SRSEPAELQRLRERLEGLKRELSSLqselRRIENRLDELSQELSDASRKIGEIEKEIEQLeqeEEKLKErLEELEEDLSS 748
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  175 TEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKkmnRDHMRNASTFLERNDLRIEDgISSTLSSK 254
Cdd:TIGR02169  749 LEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI---QAELSKLEEEVSRIEARLRE-IEQKLNRL 824
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  255 ESKRKGSLDYLKQVENETRDKSENEKNRNQE----DNKVKDLNQEIEKLKTQIKHFES----LEEELKKMRAKNNDLQDN 326
Cdd:TIGR02169  825 TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEienlNGKKEELEEELEELEAALRDLESrlgdLKKERDELEAQLRELERK 904
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1720408942  327 Y---LTELNRNRSLASQLEEiKLQVRKQ--KELGNGDIEGE 362
Cdd:TIGR02169  905 IeelEAQIEKKRKRLSELKA-KLEALEEelSEIEDPKGEDE 944
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
14-314 2.05e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 65.09  E-value: 2.05e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   14 RHLRFKLQSLSRRLDELEEATKNLQ----RAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDLC 89
Cdd:TIGR02169  670 RSEPAELQRLRERLEGLKRELSSLQselrRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSL 749
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   90 htmkekleeeENLTRELKSEIERLQKRmvdLEKLEEALSRSKNECSQLCLSLNEERnlTKKISSELEMLRVKVKELESSE 169
Cdd:TIGR02169  750 ----------EQEIENVKSELKELEAR---IEELEEDLHKLEEALNDLEARLSHSR--IPEIQAELSKLEEEVSRIEARL 814
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  170 DRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEqnkKMNRDhmrnastfLERNDLRIEDgiss 249
Cdd:TIGR02169  815 REIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE---ELEEE--------LEELEAALRD---- 879
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720408942  250 tLSSKESKRKgsldylKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIkhfESLEEELK 314
Cdd:TIGR02169  880 -LESRLGDLK------KERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKL---EALEEELS 934
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
104-410 2.80e-10

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 64.69  E-value: 2.80e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  104 RELKSEIERLQKRM--VDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVS 181
Cdd:TIGR02168  216 KELKAELRELELALlvLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALAN 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  182 ELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQnkkmnrdhmrnastfLERNDLRIEDGISSTLSSKESkrkgs 261
Cdd:TIGR02168  296 EISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE---------------LAEELAELEEKLEELKEELES----- 355
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  262 ldyLKQVENETRDKSENEKNRNQE-DNKVKDLNQEIEKLKTQIkhfESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQ 340
Cdd:TIGR02168  356 ---LEAELEELEAELEELESRLEElEEQLETLRSKVAQLELQI---ASLNNEIERLEARLERLEDRRERLQQEIEELLKK 429
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  341 LEEIKLQvRKQKELGNGDIEGEDAfllgRGRHERTKLKGHGSEASVSKHTSRELSPQHKRERLRNREFAL 410
Cdd:TIGR02168  430 LEEAELK-ELQAELEELEEELEEL----QEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSL 494
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
30-350 4.20e-10

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 64.31  E-value: 4.20e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   30 LEEATKNLQRAED---------ELLDLQDKV------IQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDLCHTMKE 94
Cdd:TIGR02168  181 LERTRENLDRLEDilnelerqlKSLERQAEKaerykeLKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTA 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   95 KLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERnltKKISSELEMLRVKVKELESSEDRLDK 174
Cdd:TIGR02168  261 ELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERL---ANLERQLEELEAQLEELESKLDELAE 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  175 TEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKmnrdhmRNASTFLERNDLRIEdgISSTLSSK 254
Cdd:TIGR02168  338 ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRS------KVAQLELQIASLNNE--IERLEARL 409
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  255 ESKrKGSLDYLKQvENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNRN 334
Cdd:TIGR02168  410 ERL-EDRRERLQQ-EIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQL 487
                          330
                   ....*....|....*.
gi 1720408942  335 RSLASQLEEIKLQVRK 350
Cdd:TIGR02168  488 QARLDSLERLQENLEG 503
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
25-371 2.52e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 61.49  E-value: 2.52e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   25 RRLDELEEatkNLQRAED---EL---LD-LQDKVIQAEgsDSSTLAEIEVLRQ---RVLKIEGKDEEIKRAEDlchtmke 94
Cdd:COG1196    179 RKLEATEE---NLERLEDilgELerqLEpLERQAEKAE--RYRELKEELKELEaelLLLKLRELEAELEELEA------- 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   95 kleeeenLTRELKSEIERLQKRmvdLEKLEEALSRSKNECSQLCLSLNE----ERNLTKKISSELEMLRVKVKELESSED 170
Cdd:COG1196    247 -------ELEELEAELEELEAE---LAELEAELEELRLELEELELELEEaqaeEYELLAELARLEQDIARLEERRRELEE 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  171 RLDKTEQSLVSELEKLKSLTlsfvNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTfLERNDLRIEDGISST 250
Cdd:COG1196    317 RLEELEEELAELEEELEELE----EELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE-AEEELEELAEELLEA 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  251 LSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTE 330
Cdd:COG1196    392 LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEE 471
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 1720408942  331 LNRNRSLASQLEEIKLQVRKQKELGNGDIEGEDAFLLGRGR 371
Cdd:COG1196    472 AALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKA 512
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
16-345 2.99e-09

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 61.23  E-value: 2.99e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   16 LRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGsDSSTLAEIEVLRQRV--LKIEGKDEEIKRAEDLCHTMK 93
Cdd:PRK03918   319 LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEE-RHELYEEAKAKKEELerLKKRLTGLTPEKLEKELEELE 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   94 EKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSqLC---LSLNEERNLTKKISSELEMLRVKVKELESSED 170
Cdd:PRK03918   398 KAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCP-VCgreLTEEHRKELLEEYTAELKRIEKELKEIEEKER 476
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  171 RLDKTEQSLVSELEKLKSLTlsfvnerkylneKEKENEKIIKELTQKLeqnKKMNRDHMRNASTFLE--RNDLRIEDGIS 248
Cdd:PRK03918   477 KLRKELRELEKVLKKESELI------------KLKELAEQLKELEEKL---KKYNLEELEKKAEEYEklKEKLIKLKGEI 541
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  249 STLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNK-----VKDLNQEIE----------KLKTQIKHFESLEEEL 313
Cdd:PRK03918   542 KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEElgfesVEELEERLKelepfyneylELKDAEKELEREEKEL 621
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1720408942  314 KKMRAKNNDLQDNYLTELNRNRSLASQLEEIK 345
Cdd:PRK03918   622 KKLEEELDKAFEELAETEKRLEELRKELEELE 653
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
19-353 1.66e-08

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 58.49  E-value: 1.66e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   19 KLQSLSRRLDELEEATKNLQRaedELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKdeeIKRAEDLCHTMKEKLEE 98
Cdd:TIGR04523  343 QISQLKKELTNSESENSEKQR---ELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESK---IQNQEKLNQQKDEQIKK 416
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   99 EENLTRELKSEIERLQKrmvDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSedrLDKTEQS 178
Cdd:TIGR04523  417 LQQEKELLEKEIERLKE---TIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQN---LEQKQKE 490
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  179 LVSELEKLKSLTlsfvNERKYLNEKEKENEKIIKELTQKLEQnkkmnrdhmrnastfLERNDLRIEDGISST----LSSK 254
Cdd:TIGR04523  491 LKSKEKELKKLN----EEKKELEEKVKDLTKKISSLKEKIEK---------------LESEKKEKESKISDLedelNKDD 551
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  255 ESKRKGSLDYLKQVENETRDKSENE-----KNRNQEDNKVKDLNQEIEKLKTQI----KHFESLEEELKKMRAKNNDLQD 325
Cdd:TIGR04523  552 FELKKENLEKEIDEKNKEIEELKQTqkslkKKQEEKQELIDQKEKEKKDLIKEIeekeKKISSLEKELEKAKKENEKLSS 631
                          330       340
                   ....*....|....*....|....*...
gi 1720408942  326 NYLTELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:TIGR04523  632 IIKNIKSKKNKLKQEVKQIKETIKEIRN 659
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
7-315 3.58e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 57.76  E-value: 3.58e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942    7 YKDAASNRHLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAe 86
Cdd:TIGR02168  218 LKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANE- 296
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   87 dlchtmkekleeeenlTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQlclSLNEERNLTKKISSELEMLRVKVKELE 166
Cdd:TIGR02168  297 ----------------ISRLEQQKQILRERLANLERQLEELEAQLEELES---KLDELAEELAELEEKLEELKEELESLE 357
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  167 SSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTFLERNDLRIEDG 246
Cdd:TIGR02168  358 AELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKE 437
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408942  247 ISSTLsskESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKK 315
Cdd:TIGR02168  438 LQAEL---EELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG 503
PTZ00121 PTZ00121
MAEBL; Provisional
16-334 4.55e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.46  E-value: 4.55e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   16 LRFKLQSLSRRLDELEEATKNLQRAeDELLDLQDKVIQAEGSDSSTLAEIEVLRqrvlkiegKDEEIKRAEDLCHTMKEK 95
Cdd:PTZ00121  1488 AKKKAEEAKKKADEAKKAAEAKKKA-DEAKKAEEAKKADEAKKAEEAKKADEAK--------KAEEKKKADELKKAEELK 1558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   96 LEEEENLTRELKSEIERlqkRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSED----- 170
Cdd:PTZ00121  1559 KAEEKKKAEEAKKAEED---KNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEekkkv 1635
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  171 -RLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTFLERNDLRI----ED 245
Cdd:PTZ00121  1636 eQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKkeaeEK 1715
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  246 GISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFES--LEEELKKMRAKNNDL 323
Cdd:PTZ00121  1716 KKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEavIEEELDEEDEKRRME 1795
                          330
                   ....*....|.
gi 1720408942  324 QDNYLTELNRN 334
Cdd:PTZ00121  1796 VDKKIKDIFDN 1806
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
19-198 2.99e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 54.39  E-value: 2.99e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   19 KLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEgkdeEIKRAEDLCHTMKEKLEE 98
Cdd:COG4717     72 ELKELEEELKEAEEKEEEYAELQEELEELEEELEELE-------AELEELREELEKLE----KLLQLLPLYQELEALEAE 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   99 EENLT---RELKSEIERLQKRMVDLEKLEEALSRSKNECSQLC--LSLNEERNLtKKISSELEMLRVKVKELESSEDRLD 173
Cdd:COG4717    141 LAELPerlEELEERLEELRELEEELEELEAELAELQEELEELLeqLSLATEEEL-QDLAEELEELQQRLAELEEELEEAQ 219
                          170       180
                   ....*....|....*....|....*
gi 1720408942  174 KTEQSLVSELEKLKSLTLSFVNERK 198
Cdd:COG4717    220 EELEELEEELEQLENELEAAALEER 244
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
105-354 4.49e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 53.87  E-value: 4.49e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  105 ELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELE 184
Cdd:TIGR04523  156 KLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIE 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  185 KLKS----LTLSFVNERKYLNEKEKENEKIIKELTQK---LEQNKKMNRDhmrnastfLERNDLRIEDGISSTlssKESK 257
Cdd:TIGR04523  236 KKQQeineKTTEISNTQTQLNQLKDEQNKIKKQLSEKqkeLEQNNKKIKE--------LEKQLNQLKSEISDL---NNQK 304
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  258 RKGSLDYLK-QVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKK-MRAKNNDLQ------DNYLT 329
Cdd:TIGR04523  305 EQDWNKELKsELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQReLEEKQNEIEklkkenQSYKQ 384
                          250       260
                   ....*....|....*....|....*
gi 1720408942  330 ELNrnrSLASQLEEIKLQVRKQKEL 354
Cdd:TIGR04523  385 EIK---NLESQINDLESKIQNQEKL 406
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
61-353 5.39e-07

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 53.82  E-value: 5.39e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   61 TLAEIEVLRQRVLKIEGKDEEIKRAEDlchtmKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLS 140
Cdd:pfam02463  174 ALKKLIEETENLAELIIDLEELKLQEL-----KLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRD 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  141 LNEERNLTKKI---SSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQK 217
Cdd:pfam02463  249 EQEEIESSKQEiekEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEK 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  218 LEQNKKMNRDHMRNASTFLERNDLRIEDGISSTLSS--KESKRKGSLDYLKQVENETRDKSENEKN-----RNQEDNKVK 290
Cdd:pfam02463  329 ELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLqeKLEQLEEELLAKKKLESERLSSAAKLKEeelelKSEEEKEAQ 408
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720408942  291 DLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:pfam02463  409 LLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSE 471
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
3-325 5.58e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.91  E-value: 5.58e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942    3 ELTNYKDAAsnRHLRFKLQSLSRRLDELEE----ATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGK 78
Cdd:TIGR02168  706 ELEELEEEL--EQLRKELEELSRQISALRKdlarLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE 783
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   79 ----DEEIKRAEDLCHTMKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERnltKKISSE 154
Cdd:TIGR02168  784 ieelEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI---ESLAAE 860
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  155 LEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQnkkmnrdhmrnAST 234
Cdd:TIGR02168  861 IEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQ-----------LEL 929
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  235 FLERNDLRIeDGISSTLSSKESKrkgSLDYLKQVENETRDKSENEKNRnqednkVKDLNQEIEKLK----TQIKHFESLE 310
Cdd:TIGR02168  930 RLEGLEVRI-DNLQERLSEEYSL---TLEEAEALENKIEDDEEEARRR------LKRLENKIKELGpvnlAAIEEYEELK 999
                          330
                   ....*....|....*
gi 1720408942  311 EELKKMRAKNNDLQD 325
Cdd:TIGR02168 1000 ERYDFLTAQKEDLTE 1014
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
16-415 8.09e-07

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 53.51  E-value: 8.09e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   16 LRFKLQSLSRRLDELEEATKNLQRAEDELLDL---QDKVIQAEgsdsstLAEIEVLRQRVLKIEGKDEEIKRAEDLCHTM 92
Cdd:TIGR00606  700 LQSKLRLAPDKLKSTESELKKKEKRRDEMLGLapgRQSIIDLK------EKEIPELRNKLQKVNRDIQRLKNDIEEQETL 773
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   93 KEKLEEEENLTRELKSE---IERLQKRMVDLEKLEEALSrSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSE 169
Cdd:TIGR00606  774 LGTIMPEEESAKVCLTDvtiMERFQMELKDVERKIAQQA-AKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLI 852
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  170 DRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELtQKLEQNKKMNRDHMRNASTFLERNDLRIEDGISS 249
Cdd:TIGR00606  853 QDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEV-QSLIREIKDAKEQDSPLETFLEKDQQEKEELISS 931
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  250 TLSSK----------ESKRKGSLDYLKQVENETRDKSENEKnrNQEDNKVKDLNQEIEKLKtqiKHFESLEEELKKMRaK 319
Cdd:TIGR00606  932 KETSNkkaqdkvndiKEKVKNIHGYMKDIENKIQDGKDDYL--KQKETELNTVNAQLEECE---KHQEKINEDMRLMR-Q 1005
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  320 NNDLQDNYLTELNRNRSLASQLEEIKlQVRKQKELGNGDIeGEDAFLLGRGRH----ERTKLKGHGSEASVSKHTSRELS 395
Cdd:TIGR00606 1006 DIDTQKIQERWLQDNLTLRKRENELK-EVEEELKQHLKEM-GQMQVLQMKQEHqkleENIDLIKRNHVLALGRQKGYEKE 1083
                          410       420
                   ....*....|....*....|
gi 1720408942  396 PQHKRERLRNREFALSNEHY 415
Cdd:TIGR00606 1084 IKHFKKELREPQFRDAEEKY 1103
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
80-353 8.52e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.14  E-value: 8.52e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   80 EEIKRAEDLCHTMKEKLEeeenltrELKSEIERLQKRMVDLEKLEEALSRSKNECS-QLCLSLNEERNLTKKISSELEML 158
Cdd:TIGR02168  684 EKIEELEEKIAELEKALA-------ELRKELEELEEELEQLRKELEELSRQISALRkDLARLEAEVEQLEERIAQLSKEL 756
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  159 RVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNerkyLNEKEKENEKIIKELTQKLeQNKKMNRDHMRNASTFLER 238
Cdd:TIGR02168  757 TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ----LKEELKALREALDELRAEL-TLLNEEAANLRERLESLER 831
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  239 NDLRIEDGISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKtqiKHFESLEEELKKMRA 318
Cdd:TIGR02168  832 RIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLR---SELEELSEELRELES 908
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1720408942  319 KNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:TIGR02168  909 KRSELRRELEELREKLAQLELRLEGLEVRIDNLQE 943
PTZ00121 PTZ00121
MAEBL; Provisional
64-322 8.83e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 53.22  E-value: 8.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   64 EIEVLRQRVLKIEGKDEEIKRAEDlchTMKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNE---CSQLCLS 140
Cdd:PTZ00121  1634 KVEQLKKKEAEEKKKAEELKKAEE---ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEakkAEELKKK 1710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  141 LNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQ 220
Cdd:PTZ00121  1711 EAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDE 1790
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  221 NKKMNRDhmRNASTFLERNDLRIEDGISSTL---SSKE---SKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQ 294
Cdd:PTZ00121  1791 KRRMEVD--KKIKDIFDNFANIIEGGKEGNLvinDSKEmedSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEA 1868
                          250       260
                   ....*....|....*....|....*...
gi 1720408942  295 EIEKLKTQIKHFESLEEELKKMRAKNND 322
Cdd:PTZ00121  1869 DFNKEKDLKEDDEEEIEEADEIEKIDKD 1896
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
21-280 9.41e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 53.15  E-value: 9.41e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   21 QSLSRRLDELEEATKNLQR-AEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVlkiEGKDEEIKRAEDLCHtmkEKLEEE 99
Cdd:TIGR02169  265 KRLEEIEQLLEELNKKIKDlGEEEQLRVKEKIGELEAEIASLERSIAEKEREL---EDAEERLAKLEAEID---KLLAEI 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  100 ENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSL 179
Cdd:TIGR02169  339 EELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRL 418
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  180 VSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHmrNASTFLERNDLRIedgISSTLSSKESKRK 259
Cdd:TIGR02169  419 SEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKY--EQELYDLKEEYDR---VEKELSKLQRELA 493
                          250       260
                   ....*....|....*....|.
gi 1720408942  260 GSLDYLKQVENETRDKSENEK 280
Cdd:TIGR02169  494 EAEAQARASEERVRGGRAVEE 514
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
13-220 1.00e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 52.71  E-value: 1.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   13 NRHLRFKLQSLSRRLD----ELEEATKNLQRAEDELLDLQDK--VIQAEGSDSSTLAEIEVLRQRVLKIEgkdEEIKRAE 86
Cdd:COG3206    163 EQNLELRREEARKALEfleeQLPELRKELEEAEAALEEFRQKngLVDLSEEAKLLLQQLSELESQLAEAR---AELAEAE 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   87 DLCHTMKEKLEEEENLTRELKS--EIERLQKRMVDLE-KLEEALSRSKNECSQLcLSLNEER-NLTKKISSELEMLRVkv 162
Cdd:COG3206    240 ARLAALRAQLGSGPDALPELLQspVIQQLRAQLAELEaELAELSARYTPNHPDV-IALRAQIaALRAQLQQEAQRILA-- 316
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720408942  163 kELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKE---NEKIIKELTQKLEQ 220
Cdd:COG3206    317 -SLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREvevARELYESLLQRLEE 376
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
2-349 2.31e-06

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 51.96  E-value: 2.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942    2 AELTNYKDAASNRhlRFKLQSLSRRLDELEEATKN----LQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRV----- 72
Cdd:PRK02224   370 SELEEAREAVEDR--REEIEELEEEIEELRERFGDapvdLGNAEDFLEELREERDELREREAELEATLRTARERVeeaea 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   73 LKIEGK---------DEEIKRAEDLCHTMKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNE 143
Cdd:PRK02224   448 LLEAGKcpecgqpveGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAE 527
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  144 ERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKI------------- 210
Cdd:PRK02224   528 RRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIrtllaaiadaede 607
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  211 IKELTQKLEQNKKMNRDHMRNASTFLERNDLRIEDGISSTLSSKESKRKGSLDYLKQVENETRDKSEneknrnQEDnkvk 290
Cdd:PRK02224   608 IERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELRE------ERD---- 677
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408942  291 DLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYltelnrnrSLASQLEEIKLQVR 349
Cdd:PRK02224   678 DLQAEIGAVENELEELEELRERREALENRVEALEALY--------DEAEELESMYGDLR 728
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
19-323 2.34e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 51.90  E-value: 2.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   19 KLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDLCHTMKEKLEE 98
Cdd:pfam02463  723 LADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEE 802
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   99 EENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCL-SLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQ 177
Cdd:pfam02463  803 LRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKeEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQ 882
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  178 SLVSELEK-----------LKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMnRDHMRNASTFLERNDLRIEDG 246
Cdd:pfam02463  883 KLKDELESkeekekeekkeLEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEEL-LLEEADEKEKEENNKEEEEER 961
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720408942  247 ISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKhfeslEEELKKMRAKNNDL 323
Cdd:pfam02463  962 NKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRL-----KEFLELFVSINKGW 1033
PRK10929 PRK10929
putative mechanosensitive channel protein; Provisional
21-378 2.90e-06

putative mechanosensitive channel protein; Provisional


Pssm-ID: 236798 [Multi-domain]  Cd Length: 1109  Bit Score: 51.59  E-value: 2.90e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   21 QSLSRRLDELEEATKNLQRAED-----------------ELLDLQDKVIQAEGSDSStlAEIEvlrQRVLKIEGK----- 78
Cdd:PRK10929    48 EALQSALNWLEERKGSLERAKQyqqvidnfpklsaelrqQLNNERDEPRSVPPNMST--DALE---QEILQVSSQlleks 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   79 ---DEEIKRAEDLCHTMKEKLEEEENLTRELkSEIERlqkRMVDLEKLEEALSRSKNecsqlcLSLNEERNLTKKISSEL 155
Cdd:PRK10929   123 rqaQQEQDRAREISDSLSQLPQQQTEARRQL-NEIER---RLQTLGTPNTPLAQAQL------TALQAESAALKALVDEL 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  156 EM-----------LRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENekIIKELTQKLEQNkkm 224
Cdd:PRK10929   193 ELaqlsannrqelARLRSELAKKRSQQLDAYLQALRNQLNSQRQREAERALESTELLAEQSGD--LPKSIVAQFKIN--- 267
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  225 nrdhmRNASTFLERNDLRIEDgisstLSSKESKRKGSLDYLKQVENETRDKSE--------NEKNRNQ-----EDNKVKD 291
Cdd:PRK10929   268 -----RELSQALNQQAQRMDL-----IASQQRQAASQTLQVRQALNTLREQSQwlgvsnalGEALRAQvarlpEMPKPQQ 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  292 LNQEIEKLKTQIKHFESLEEELKKMRAKNNDlqDNYLTELNRNRSLASQLeeiklqvRKQKELGNGDIEGEDAFLLgrgr 371
Cdd:PRK10929   338 LDTEMAQLRVQRLRYEDLLNKQPQLRQIRQA--DGQPLTAEQNRILDAQL-------RTQRELLNSLLSGGDTLIL---- 404

                   ....*..
gi 1720408942  372 hERTKLK 378
Cdd:PRK10929   405 -ELTKLK 410
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
137-354 3.49e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 50.53  E-value: 3.49e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  137 LCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLK---SLTLSFVN----ERKYLNEKEKENEK 209
Cdd:COG4942     11 LALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALErriAALARRIRaleqELAALEAELAELEK 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  210 IIKELTQKLEQNKKMNRDHMRNASTFLERNDLRIedgISSTLSSKESKRkgSLDYLKQVENETRDKSENEKNRNQEDNKV 289
Cdd:COG4942     91 EIAELRAELEAQKEELAELLRALYRLGRQPPLAL---LLSPEDFLDAVR--RLQYLKYLAPARREQAEELRADLAELAAL 165
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720408942  290 K-DLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLqdnyLTELNRNrsLASQLEEIKLQVRKQKEL 354
Cdd:COG4942    166 RaELEAERAELEALLAELEEERAALEALKAERQKL----LARLEKE--LAELAAELAELQQEAEEL 225
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
19-223 4.32e-06

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 49.15  E-value: 4.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   19 KLQSLSRRLDELEEATKNLQRaedELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEgkdEEIKRAEdlchtmkeklee 98
Cdd:COG1579     11 DLQELDSELDRLEHRLKELPA---ELAELEDELAALE-------ARLEAAKTELEDLE---KEIKRLE------------ 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   99 eenltrelkSEIERLQKRmvdLEKLEEALSRSKNecsqlclslNEERnltKKISSELEMLRVKVKELESSEDRLDKTEQS 178
Cdd:COG1579     66 ---------LEIEEVEAR---IKKYEEQLGNVRN---------NKEY---EALQKEIESLKRRISDLEDEILELMERIEE 121
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1720408942  179 LVSELEKLKSLtlsFVNERKYLNEKEKENEKIIKELTQKLEQNKK 223
Cdd:COG1579    122 LEEELAELEAE---LAELEAELEEKKAELDEELAELEAELEELEA 163
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
19-214 9.47e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 49.68  E-value: 9.47e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   19 KLQSLSRRLDELEEATKNLQRAEDELLD-LQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDLCHTMKEKLE 97
Cdd:PRK03918   550 KLEELKKKLAELEKKLDELEEELAELLKeLEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELD 629
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   98 EEENLTRELKSEIERLQKRmvdLEKLEEALSRSKNEcsqlclslnEERNLTKKISSELEMLRVKVKELESSEDRLDKTeq 177
Cdd:PRK03918   630 KAFEELAETEKRLEELRKE---LEELEKKYSEEEYE---------ELREEYLELSRELAGLRAELEELEKRREEIKKT-- 695
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1720408942  178 slvseLEKLKsltlsfvNERKYLNEKEKENEKIIKEL 214
Cdd:PRK03918   696 -----LEKLK-------EELEEREKAKKELEKLEKAL 720
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
14-319 1.34e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 49.29  E-value: 1.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   14 RHLRFKLQSLSRRLDELEEATKNLQRAED-------ELLDLQDKVIQAEGSdsstlAEIEVLRQRVLKIEGKDEEIK-RA 85
Cdd:PRK03918   408 SKITARIGELKKEIKELKKAIEELKKAKGkcpvcgrELTEEHRKELLEEYT-----AELKRIEKELKEIEEKERKLRkEL 482
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   86 EDLCHTMKEKLEEEENLT-----RELKSE-----IERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKIS--- 152
Cdd:PRK03918   483 RELEKVLKKESELIKLKElaeqlKELEEKlkkynLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAele 562
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  153 ----------------------SELEMLRVKVKELESSEDR---LDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKEN 207
Cdd:PRK03918   563 kkldeleeelaellkeleelgfESVEELEERLKELEPFYNEyleLKDAEKELEREEKELKKLEEELDKAFEELAETEKRL 642
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  208 EKIIKELTqklEQNKKMNRDHMRNASTFLERndlriedgISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRnqedn 287
Cdd:PRK03918   643 EELRKELE---ELEKKYSEEEYEELREEYLE--------LSRELAGLRAELEELEKRREEIKKTLEKLKEELEER----- 706
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1720408942  288 kvKDLNQEIEKLKTQIKHFESLEEELKKMRAK 319
Cdd:PRK03918   707 --EKAKKELEKLEKALERVEELREKVKKYKAL 736
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
105-324 1.45e-05

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 47.98  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  105 ELKSEIERLQKRmvdLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELE 184
Cdd:COG1340     19 ELREEIEELKEK---RDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVKELKEERDELNEKLNELREELD 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  185 KLKSLTLSFVNERKYLNEKEKENEKI--------------------IKELTQKLEQNKKMNRDHMRNASTFLERNDLRIE 244
Cdd:COG1340     96 ELRKELAELNKAGGSIDKLRKEIERLewrqqtevlspeeekelvekIKELEKELEKAKKALEKNEKLKELRAELKELRKE 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  245 -DGIS---STLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQE-DNKVKDLNQEIEKLKTQIKhfeSLEEELKKMRAK 319
Cdd:COG1340    176 aEEIHkkiKELAEEAQELHEEMIELYKEADELRKEADELHKEIVEaQEKADELHEEIIELQKELR---ELRKELKKLRKK 252

                   ....*
gi 1720408942  320 NNDLQ 324
Cdd:COG1340    253 QRALK 257
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
19-353 1.55e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 49.27  E-value: 1.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   19 KLQSLSRRLDELEEATKNLQRAEDELLDLQDKV-IQAEGSDSSTLAEIEVLRQRVLKIE----GKDEEIKRAEDLCHTMK 93
Cdd:TIGR00606  320 ELVDCQRELEKLNKERRLLNQEKTELLVEQGRLqLQADRHQEHIRARDSLIQSLATRLEldgfERGPFSERQIKNFHTLV 399
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   94 EKLEEEENLTrelkseIERLQKRMVDLEKL-EEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRL 172
Cdd:TIGR00606  400 IERQEDEAKT------AAQLCADLQSKERLkQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRI 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  173 DKTEQSLVSELEKL-----KSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQ--NKKMNRDHMRNASTFLERNDLRIED 245
Cdd:TIGR00606  474 LELDQELRKAERELskaekNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQlnHHTTTRTQMEMLTKDKMDKDEQIRK 553
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  246 gISSTLSSKESKRKGSLDYLKQVENETRDKSeneKNRNQEDNKVKDLNQEIEKLKTQIKHFeslEEELKKMRAKNNDLQD 325
Cdd:TIGR00606  554 -IKSRHSDELTSLLGYFPNKKQLEDWLHSKS---KEINQTRDRLAKLNKELASLEQNKNHI---NNELESKEEQLSSYED 626
                          330       340
                   ....*....|....*....|....*...
gi 1720408942  326 NyLTELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:TIGR00606  627 K-LFDVCGSQDEESDLERLKEEIEKSSK 653
PRK01156 PRK01156
chromosome segregation protein; Provisional
106-359 2.09e-05

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 48.74  E-value: 2.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  106 LKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEK 185
Cdd:PRK01156   171 LKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDM 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  186 LKSLTLSFVNERKYLNEKEKENEKI--IKELTQKLEQNKKM-NRDHMRNasTFLERNDLRIEDGISSTLSSKESKRKGSL 262
Cdd:PRK01156   251 KNRYESEIKTAESDLSMELEKNNYYkeLEERHMKIINDPVYkNRNYIND--YFKYKNDIENKKQILSNIDAEINKYHAII 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  263 DYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLE 342
Cdd:PRK01156   329 KKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIK 408
                          250
                   ....*....|....*..
gi 1720408942  343 EIKLQVRKQKELGNGDI 359
Cdd:PRK01156   409 KELNEINVKLQDISSKV 425
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
109-342 2.19e-05

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 48.39  E-value: 2.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  109 EIERLqKRMVDLEKLEEALSRSkNECSQLCLSLNEERNLTKKISSELEmlRVKVKELESSEDRLDKTEQSLVSELEKLks 188
Cdd:PRK05771    32 HIEDL-KEELSNERLRKLRSLL-TKLSEALDKLRSYLPKLNPLREEKK--KVSVKSLEELIKDVEEELEKIEKEIKEL-- 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  189 ltlsfVNERKYLNEKEKENEKIIKELTQ--------KLEQNKKmnrdhmrNASTFLERNDLRIEDGISSTLSSK----ES 256
Cdd:PRK05771   106 -----EEEISELENEIKELEQEIERLEPwgnfdldlSLLLGFK-------YVSVFVGTVPEDKLEELKLESDVEnveyIS 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  257 KRKGSLDYLKQVENETRDKSENE--KN--RNQEDNKVKDLNQEIEKLKTQI----KHFESLEEELKKMRAKNNDL---QD 325
Cdd:PRK05771   174 TDKGYVYVVVVVLKELSDEVEEElkKLgfERLELEEEGTPSELIREIKEELeeieKERESLLEELKELAKKYLEEllaLY 253
                          250
                   ....*....|....*...
gi 1720408942  326 NYL-TELNRNRSLASQLE 342
Cdd:PRK05771   254 EYLeIELERAEALSKFLK 271
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1-299 2.58e-05

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 47.59  E-value: 2.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942    1 MAELTNYKDAASNRHLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEgkdE 80
Cdd:COG4372     18 LRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQAR-------SELEQLEEELEELN---E 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   81 EIKRAEDLCHTMKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKnecSQLCLSLNEERNLTKKISSELEMLRV 160
Cdd:COG4372     88 QLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQI---AELQSEIAEREEELKELEEQLESLQE 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  161 KVKELESSEDRLDKTEqsLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTFLERND 240
Cdd:COG4372    165 ELAALEQELQALSEAE--AEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDAL 242
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408942  241 LRIEDGISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKL 299
Cdd:COG4372    243 ELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALL 301
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
14-185 2.69e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.84  E-value: 2.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   14 RHLRFKLQSLSRRLDELEE----ATKNLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEGKDEEIKRAEDLc 89
Cdd:COG1579     20 DRLEHRLKELPAELAELEDelaaLEARLEAAKTELEDLEKEIKRLE-------LEIEEVEARIKKYEEQLGNVRNNKEY- 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   90 htmkekleeeenltRELKSEIERLQKRMVDLEK----LEEALSRSKNECSQLclsLNEERNLTKKISSELEMLRVKVKEL 165
Cdd:COG1579     92 --------------EALQKEIESLKRRISDLEDeileLMERIEELEEELAEL---EAELAELEAELEEKKAELDEELAEL 154
                          170       180
                   ....*....|....*....|
gi 1720408942  166 ESSEDRLDKTEQSLVSELEK 185
Cdd:COG1579    155 EAELEELEAEREELAAKIPP 174
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
18-262 3.97e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 47.07  E-value: 3.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   18 FKLQSLSRRLDELEEAtknLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEgkdEEIKRAEDlchtmkeKLE 97
Cdd:COG4942     13 LAAAAQADAAAEAEAE---LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALA---RRIRALEQ-------ELA 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   98 EEENLTRELKSEIERLQKRmvdLEKLEEALSR--------SKNECSQLCLS---LNEERNLTKKISSELEMLRVKVKELE 166
Cdd:COG4942     80 ALEAELAELEKEIAELRAE---LEAQKEELAEllralyrlGRQPPLALLLSpedFLDAVRRLQYLKYLAPARREQAEELR 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  167 SSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNAST---FLERNDLRI 243
Cdd:COG4942    157 ADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEEleaLIARLEAEA 236
                          250
                   ....*....|....*....
gi 1720408942  244 EDGISSTLSSKESKRKGSL 262
Cdd:COG4942    237 AAAAERTPAAGFAALKGKL 255
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
82-356 4.41e-05

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 47.74  E-value: 4.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   82 IKRAEDLCHTMKEKLEEEENLTRELKSEIE-RLQKRMvdLEKLEEALSRSKNecsqlclSLNEERNLTKKISSELEMLRV 160
Cdd:TIGR01612  502 MKDFKDIIDFMELYKPDEVPSKNIIGFDIDqNIKAKL--YKEIEAGLKESYE-------LAKNWKKLIHEIKKELEEENE 572
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  161 KVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKekeNEKIIKELT-QKLEQNKKMNRDHMRNASTFLERN 239
Cdd:TIGR01612  573 DSIHLEKEIKDLFDKYLEIDDEIIYINKLKLELKEKIKNISDK---NEYIKKAIDlKKIIENNNAYIDELAKISPYQVPE 649
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  240 DLRIEDGISSTLSSKESK-RKGSLDYLKqveNETRD-KSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFE--SLEEELKK 315
Cdd:TIGR01612  650 HLKNKDKIYSTIKSELSKiYEDDIDALY---NELSSiVKENAIDNTEDKAKLDDLKSKIDKEYDKIQNMEtaTVELHLSN 726
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1720408942  316 MRAKNNDLQDNyLTELNR------NRSLASQLEEIKlqvRKQKELGN 356
Cdd:TIGR01612  727 IENKKNELLDI-IVEIKKhihgeiNKDLNKILEDFK---NKEKELSN 769
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
68-354 4.50e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 47.81  E-value: 4.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   68 LRQRVLKIEGKDEEIKRAEDLchtmkekleeeenlTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNL 147
Cdd:pfam15921  446 MERQMAAIQGKNESLEKVSSL--------------TAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERA 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  148 TKKISSELEMLRVKVkELESSEdrldktEQSLVSELEKLKSLTlsfvNERKYLNEKEKENEKIIKELTQKLEQNKKMNRD 227
Cdd:pfam15921  512 IEATNAEITKLRSRV-DLKLQE------LQHLKNEGDHLRNVQ----TECEALKLQMAEKDKVIEILRQQIENMTQLVGQ 580
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  228 HMRNASTFL--------ERNDLRIEDGISSTLSSKESKRKGSLDylKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKL 299
Cdd:pfam15921  581 HGRTAGAMQvekaqlekEINDRRLELQEFKILKDKKDAKIRELE--ARVSDLELEKVKLVNAGSERLRAVKDIKQERDQL 658
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720408942  300 KTQIKHFESleeELKKMRAKNNDLQDNYlteLNRNRSLASQLEEIKLQVRK-QKEL 354
Cdd:pfam15921  659 LNEVKTSRN---ELNSLSEDYEVLKRNF---RNKSEEMETTTNKLKMQLKSaQSEL 708
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
62-349 4.97e-05

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 47.37  E-value: 4.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   62 LAEIEVLRQRVLKIEGKDEEIK---------RAEDLCHTMKEKLEEEENLT------RELKSEIERLQKRMVDLEKLEEA 126
Cdd:TIGR02169  176 LEELEEVEENIERLDLIIDEKRqqlerlrreREKAERYQALLKEKREYEGYellkekEALERQKEAIERQLASLEEELEK 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  127 LSRSKNECSQLCLSLNEERN-LTKKIS--SELEMLRVKVK--ELESSEDRLDKTEQSLVSELEKL-KSLTLSFVNERKYL 200
Cdd:TIGR02169  256 LTEEISELEKRLEEIEQLLEeLNKKIKdlGEEEQLRVKEKigELEAEIASLERSIAEKERELEDAeERLAKLEAEIDKLL 335
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  201 NEKEKENEKIikeLTQKLEQNKKMNRDHMRNASTFLERNDLRIEDGISSTLSSKESKRKGSLDYLKQVENE---TRDKSE 277
Cdd:TIGR02169  336 AEIEELEREI---EEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINElkrELDRLQ 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  278 NEKNRNQEdnKVKDLNQEIEKLKTQIKHFES-----------LEEELKKMRAKNNDLQDNYL---TELN----RNRSLAS 339
Cdd:TIGR02169  413 EELQRLSE--ELADLNAAIAGIEAKINELEEekedkaleikkQEWKLEQLAADLSKYEQELYdlkEEYDrvekELSKLQR 490
                          330
                   ....*....|
gi 1720408942  340 QLEEIKLQVR 349
Cdd:TIGR02169  491 ELAEAEAQAR 500
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
16-353 5.85e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 47.42  E-value: 5.85e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   16 LRFKLQSLSRRLDELEEATKNLQRAEDELLD-LQDKVIQAEGS---------DSSTlaEIEVLRQRVLKIEGKDEEIKR- 84
Cdd:pfam15921  115 LQTKLQEMQMERDAMADIRRRESQSQEDLRNqLQNTVHELEAAkclkedmleDSNT--QIEQLRKMMLSHEGVLQEIRSi 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   85 -------------AEDLCHTM--KEKLEEEENLTRELKSEIERLQKRMVDLEKLEEAL-SRSKNECSQLclsLNEERNLT 148
Cdd:pfam15921  193 lvdfeeasgkkiyEHDSMSTMhfRSLGSAISKILRELDTEISYLKGRIFPVEDQLEALkSESQNKIELL---LQQHQDRI 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  149 KKISSELEmlrVKVKELESSEDRLDKTEQSLVSELEKLKSLTLsfvNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDH 228
Cdd:pfam15921  270 EQLISEHE---VEITGLTEKASSARSQANSIQSQLEIIQEQAR---NQNSMYMRQLSDLESTVSQLRSELREAKRMYEDK 343
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  229 MRNastfLERNDLRIEDGISSTLSSKE--SKRKGSLDYLKQVENETRDKSENEKNRNQEDNK------------------ 288
Cdd:pfam15921  344 IEE----LEKQLVLANSELTEARTERDqfSQESGNLDDQLQKLLADLHKREKELSLEKEQNKrlwdrdtgnsitidhlrr 419
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720408942  289 -VKDLNQEIEKLKTQIKHFES-----LEEELKKMRAKNNDLQdnyltelnRNRSLASQLEEIKLQVRKQKE 353
Cdd:pfam15921  420 eLDDRNMEVQRLEALLKAMKSecqgqMERQMAAIQGKNESLE--------KVSSLTAQLESTKEMLRKVVE 482
PRK01156 PRK01156
chromosome segregation protein; Provisional
20-353 6.42e-05

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 47.20  E-value: 6.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   20 LQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDLCHTMKEKLEEE 99
Cdd:PRK01156   175 IDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRY 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  100 ENLTRELKSEIERLQKRMVDLEKLEEALSRSKNecSQLCLSLNEER---NLTKKISSELEMLRVKVKELESSEDRLDKTE 176
Cdd:PRK01156   255 ESEIKTAESDLSMELEKNNYYKELEERHMKIIN--DPVYKNRNYINdyfKYKNDIENKKQILSNIDAEINKYHAIIKKLS 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  177 --QSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQ---KLEQNKKMNRDHMRNASTFLERNDL------RIED 245
Cdd:PRK01156   333 vlQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESlkkKIEEYSKNIERMSAFISEILKIQEIdpdaikKELN 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  246 GISSTL---SSKESKRKGSLDYLKQVENETRDKSE-------------------NEKNRNQEDNKVKDLNQEIEKLKTQI 303
Cdd:PRK01156   413 EINVKLqdiSSKVSSLNQRIRALRENLDELSRNMEmlngqsvcpvcgttlgeekSNHIINHYNEKKSRLEEKIREIEIEV 492
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720408942  304 KHFESLEEELKKMRAKNNDLQDN-YLTELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:PRK01156   493 KDIDEKIVDLKKRKEYLESEEINkSINEYNKIESARADLEDIKIKINELKD 543
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
103-363 8.93e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 46.89  E-value: 8.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  103 TRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEER------NLTKKISSELEMLRV--KVKELESSEDRLDK 174
Cdd:pfam02463  165 SRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKkaleyyQLKEKLELEEEYLLYldYLKLNEERIDLLQE 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  175 TEQSLVSELEKLKSL------TLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTFLERNDLRIEDGIS 248
Cdd:pfam02463  245 LLRDEQEEIESSKQEiekeeeKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKK 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  249 STLSSKESKRKGSLDYLKQVENETRDKSENE-KNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKKmRAKNNDLQDNY 327
Cdd:pfam02463  325 KAEKELKKEKEEIEELEKELKELEIKREAEEeEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKL-KEEELELKSEE 403
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1720408942  328 LTELNRNRSLASQLEEIkLQVRKQKELGNGDIEGED 363
Cdd:pfam02463  404 EKEAQLLLELARQLEDL-LKEEKKEELEILEEEEES 438
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
15-238 1.01e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 46.45  E-value: 1.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   15 HLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDkviqaegsdssTLAEIEVLRQRVLKIEGKDEEIkraedlchtmke 94
Cdd:COG4913    614 ALEAELAELEEELAEAEERLEALEAELDALQERRE-----------ALQRLAEYSWDEIDVASAEREI------------ 670
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   95 kleeeenltRELKSEIERLQKRMVDLEKLEEALSRSKNECSQlclsLNEERNLTKKISSELEmlrvkvKELESSEDRLDK 174
Cdd:COG4913    671 ---------AELEAELERLDASSDDLAALEEQLEELEAELEE----LEEELDELKGEIGRLE------KELEQAEEELDE 731
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720408942  175 TeQSLVSELEKLKSLTLSFVNERKYLNEKEKENEkiiKELTQKLEQNKKMNRDHMRNASTFLER 238
Cdd:COG4913    732 L-QDRLEAAEDLARLELRALLEERFAAALGDAVE---RELRENLEERIDALRARLNRAEEELER 791
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
27-332 1.37e-04

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 46.36  E-value: 1.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   27 LDELEEATKNLQRAEDELLDLQDKVIQAEGSDSST--LAEIEVLRQRvlkIEGKDEEIKRAEDLCHTMkekleeeenltr 104
Cdd:PTZ00440   690 IKNLKKELQNLLSLKENIIKKQLNNIEQDISNSLNqyTIKYNDLKSS---IEEYKEEEEKLEVYKHQI------------ 754
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  105 elkseIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVK----ELESSEDRLDKTEQSLV 180
Cdd:PTZ00440   755 -----INRKNEFILHLYENDKDLPDGKNTYEEFLQYKDTILNKENKISNDINILKENKKnnqdLLNSYNILIQKLEAHTE 829
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  181 SELEKLKSLTLSFVNERKYLNEKEKENE-----KIIKELTQKLEQ-NKKMNRDHMRNASTFLERNDLRIedgISSTLSSK 254
Cdd:PTZ00440   830 KNDEELKQLLQKFPTEDENLNLKELEKEfnennQIVDNIIKDIENmNKNINIIKTLNIAINRSNSNKQL---VEHLLNNK 906
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  255 ESKRKGSLDYLKQVENET----RDKSENEKNRNQEDNKV-KDLNQE-IEKLKTQIkhfESLEEELKKMRAKNNDLQDNYL 328
Cdd:PTZ00440   907 IDLKNKLEQHMKIINTDNiiqkNEKLNLLNNLNKEKEKIeKQLSDTkINNLKMQI---EKTLEYYDKSKENINGNDGTHL 983

                   ....
gi 1720408942  329 TELN 332
Cdd:PTZ00440   984 EKLD 987
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
79-345 2.03e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.44  E-value: 2.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   79 DEEIKRAEDLCHTMKEKLEEEENLTRELKSEIERLQKRMVDLEKLEeALSRSKNECSQLCLSLNEERNLT--KKISSELE 156
Cdd:TIGR02169  169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQ-ALLKEKREYEGYELLKEKEALERqkEAIERQLA 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  157 MLRvkvKELESSEDRLDKTEQSLVSELEKLKSLT--LSFVNERKYLNEKEKenekiIKELTQKLEQNKKMNRDHMRNAST 234
Cdd:TIGR02169  248 SLE---EELEKLTEEISELEKRLEEIEQLLEELNkkIKDLGEEEQLRVKEK-----IGELEAEIASLERSIAEKERELED 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  235 fLERNDLRIEDGISSTLSSKESKRKGSLDYLKQVENETrdkSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELK 314
Cdd:TIGR02169  320 -AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT---EEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLE 395
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1720408942  315 KMRAKNNDLQDNYLTELNRNRSLASQLEEIK 345
Cdd:TIGR02169  396 KLKREINELKRELDRLQEELQRLSEELADLN 426
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
19-188 2.08e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.29  E-value: 2.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   19 KLQSLSRRLDELEEAtknLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEGKDEEikraedlchtmkeklee 98
Cdd:COG4913    289 RLELLEAELEELRAE---LARLEAELERLEARLDALR-------EELDELEAQIRGNGGDRLE----------------- 341
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   99 eenltrELKSEIERLQKRmvdLEKLEEALSRSKNECSQLCLSLNEE-----------RNLTKKISSELEMLRVKVKELES 167
Cdd:COG4913    342 ------QLEREIERLERE---LEERERRRARLEALLAALGLPLPASaeefaalraeaAALLEALEEELEALEEALAEAEA 412
                          170       180
                   ....*....|....*....|.
gi 1720408942  168 SEDRLDKTEQSLVSELEKLKS 188
Cdd:COG4913    413 ALRDLRRELRELEAEIASLER 433
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
16-366 2.83e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 45.10  E-value: 2.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   16 LRFKLQSLSRRLDELE-EATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGK----DEEIKRAEDLCH 90
Cdd:pfam05483  213 MHFKLKEDHEKIQHLEeEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKtklqDENLKELIEKKD 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   91 TMkekleeeenlTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLC-------LSLNEERNLTKKISSEL-------- 155
Cdd:pfam05483  293 HL----------TKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTeekeaqmEELNKAKAAHSFVVTEFeattcsle 362
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  156 EMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTlsfvnerKYLNEKEKENEKIIKELTQK---LEQNKKMNRdhmrna 232
Cdd:pfam05483  363 ELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMT-------KFKNNKEVELEELKKILAEDeklLDEKKQFEK------ 429
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  233 stflerndlriedgISSTLSSKESKRKGSLdylkqvenETRDKseneknrnqednKVKDLNQEIEKLKTQIKHF----ES 308
Cdd:pfam05483  430 --------------IAEELKGKEQELIFLL--------QAREK------------EIHDLEIQLTAIKTSEEHYlkevED 475
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720408942  309 LEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKELGNGDIEGEDAFL 366
Cdd:pfam05483  476 LKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERML 533
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
113-224 3.00e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 44.85  E-value: 3.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  113 LQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNltKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLS 192
Cdd:COG2433    382 LEELIEKELPEEEPEAEREKEHEERELTEEEEEI--RRLEEQVERLEAEVEELEAELEEKDERIERLERELSEARSEERR 459
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1720408942  193 FVNERKYLNEKEKEN----------EKIIKELTQKLEQNKKM 224
Cdd:COG2433    460 EIRKDREISRLDREIerlereleeeRERIEELKRKLERLKEL 501
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
121-353 3.23e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 44.63  E-value: 3.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  121 EKLEEALSRSKNECSQLCLSL-NEERNLTKKISsELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKY 199
Cdd:TIGR04523   36 KQLEKKLKTIKNELKNKEKELkNLDKNLNKDEE-KINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKN 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  200 LNEKEKENEKIIKELTQKLEQNKKmnrdhmrNASTFLerNDLRIEDGISSTLSSKES---KRKGSL-DYLKQVENETRDK 275
Cdd:TIGR04523  115 DKEQKNKLEVELNKLEKQKKENKK-------NIDKFL--TEIKKKEKELEKLNNKYNdlkKQKEELeNELNLLEKEKLNI 185
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720408942  276 SENEKNRNQEDNKvkdLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:TIGR04523  186 QKNIDKIKNKLLK---LELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKD 260
PRK01156 PRK01156
chromosome segregation protein; Provisional
29-360 3.66e-04

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 44.51  E-value: 3.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   29 ELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLR-QRVLKIEGKDEEIKRAEDLCHTMKEKLEEEENLTRELK 107
Cdd:PRK01156   410 ELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNgQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIE 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  108 SEIERLQKRMVDLEKLEEALSRSKNEcsqlclSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVS-ELEKL 186
Cdd:PRK01156   490 IEVKDIDEKIVDLKKRKEYLESEEIN------KSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSlKLEDL 563
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  187 KSLTLSFVNERKYLNEKEKENEKiikelTQKLEQNKKMNRdhmrnastfLERNDLRIEDGISSTLSSKESkrkgsldYLK 266
Cdd:PRK01156   564 DSKRTSWLNALAVISLIDIETNR-----SRSNEIKKQLND---------LESRLQEIEIGFPDDKSYIDK-------SIR 622
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  267 QVENETRDkSENEKNRNQEDNKVKD-LNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNY------LTELNRNRSLAS 339
Cdd:PRK01156   623 EIENEANN-LNNKYNEIQENKILIEkLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLkksrkaLDDAKANRARLE 701
                          330       340
                   ....*....|....*....|.
gi 1720408942  340 QLEEIKLQVRKQKELGNGDIE 360
Cdd:PRK01156   702 STIEILRTRINELSDRINDIN 722
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
30-215 5.60e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 43.69  E-value: 5.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   30 LEEAtknLQRAEDELLDLQDKVIQAEGSDSStlAEIEVLRQRVLKiegKDEEIKRAEDLchtmkekleeeenlTRELKSE 109
Cdd:COG2433    378 IEEA---LEELIEKELPEEEPEAEREKEHEE--RELTEEEEEIRR---LEEQVERLEAE--------------VEELEAE 435
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  110 IERLQKRmvdLEKLEEALSRSKnecsqlclslnEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSL 189
Cdd:COG2433    436 LEEKDER---IERLERELSEAR-----------SEERREIRKDREISRLDREIERLERELEEERERIEELKRKLERLKEL 501
                          170       180
                   ....*....|....*....|....*.
gi 1720408942  190 tlsfvneRKYLNEKEKENEKIIKELT 215
Cdd:COG2433    502 -------WKLEHSGELVPVKVVEKFT 520
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
35-220 5.93e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.99  E-value: 5.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   35 KNLQRAEDELLDLQDKviqaegSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDlchtmkeKLEEEENLTRELKSEIERLQ 114
Cdd:COG4717     49 ERLEKEADELFKPQGR------KPELNLKELKELEEELKEAEEKEEEYAELQE-------ELEELEEELEELEAELEELR 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  115 KRMVDLEKLEEALSRSKnECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTL--- 191
Cdd:COG4717    116 EELEKLEKLLQLLPLYQ-ELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEeel 194
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720408942  192 -SFVNERKYLNEKEKENEKIIKELTQKLEQ 220
Cdd:COG4717    195 qDLAEELEELQQRLAELEEELEEAQEELEE 224
DUF4686 pfam15742
Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins ...
103-353 6.61e-04

Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins in this family are typically between 498 and 775 amino acids in length. There is a conserved DLK sequence motif.


Pssm-ID: 464838 [Multi-domain]  Cd Length: 384  Bit Score: 43.13  E-value: 6.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  103 TRELKSEIER---LQKRMVDLEklEEALsRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELE-----SSEDRldK 174
Cdd:pfam15742   33 EKELRYERGKnldLKQHNSLLQ--EENI-KIKAELKQAQQKLLDSTKMCSSLTAEWKHCQQKIRELElevlkQAQSI--K 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  175 TEQSLVSELEKLKS-----------------------LTLSFVNERKYLNEKEKENEKIIKELTQKL--EQNKKMNRDHM 229
Cdd:pfam15742  108 SQNSLQEKLAQEKSrvadaeekilelqqklehahkvcLTDTCILEKKQLEERIKEASENEAKLKQQYqeEQQKRKLLDQN 187
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  230 RNASTFLERNDLRIEDGISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHF-ES 308
Cdd:pfam15742  188 VNELQQQVRSLQDKEAQLEMTNSQQQLRIQQQEAQLKQLENEKRKSDEHLKSNQELSEKLSSLQQEKEALQEELQQVlKQ 267
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720408942  309 LEEELKK-------MRAKNNDLQDNYLTELN----RNRSLASQLEEIKLQVRKQKE 353
Cdd:pfam15742  268 LDVHVRKynekhhhHKAKLRRAKDRLVHEVEqrdeRIKQLENEIGILQQQSEKEKA 323
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
21-361 6.89e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 43.95  E-value: 6.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   21 QSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRV---------LKIEGkdEEIKRAEDLCHT 91
Cdd:pfam15921  475 EMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVdlklqelqhLKNEG--DHLRNVQTECEA 552
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   92 MKEKLEEEENLTRELKSEIERLQK------RMVDLEKLEEAlsRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKEL 165
Cdd:pfam15921  553 LKLQMAEKDKVIEILRQQIENMTQlvgqhgRTAGAMQVEKA--QLEKEINDRRLELQEFKILKDKKDAKIRELEARVSDL 630
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  166 ESSEDRLDKTEQSLVSELEKLKSLTLSFVNE----RKYLNEKEKENEKIIKELTQKLEQNKKMN---RDHMRNASTFLE- 237
Cdd:pfam15921  631 ELEKVKLVNAGSERLRAVKDIKQERDQLLNEvktsRNELNSLSEDYEVLKRNFRNKSEEMETTTnklKMQLKSAQSELEq 710
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  238 -RNDLRIEDG-------ISSTLSSKESKRKGSLDYLKQ--------VENETRDKSENEKNRNQEDNKVKDLNQEIEKLKT 301
Cdd:pfam15921  711 tRNTLKSMEGsdghamkVAMGMQKQITAKRGQIDALQSkiqfleeaMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAG 790
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  302 QIKHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKELGNGDIEG 361
Cdd:pfam15921  791 ELEVLRSQERRLKEKVANMEVALDKASLQFAECQDIIQRQEQESVRLKLQHTLDVKELQG 850
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
15-347 7.91e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 43.56  E-value: 7.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   15 HLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKViqaeGSDSSTLAEIEVLRQRVLKIEGKDEEI--------KRAE 86
Cdd:pfam05483  378 QLKIITMELQKKSSELEEMTKFKNNKEVELEELKKIL----AEDEKLLDEKKQFEKIAEELKGKEQELifllqareKEIH 453
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   87 DL---CHTMKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSK----NECSQLCLSL-NEERNLTKKISSELEML 158
Cdd:pfam05483  454 DLeiqLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENkeltQEASDMTLELkKHQEDIINCKKQEERML 533
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  159 rvkvKELESsedrLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTFLER 238
Cdd:pfam05483  534 ----KQIEN----LEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIEN 605
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  239 NDLRIEDgisstlsskeskrkgsldyLKQVENETRDKSENE-KNRNQEDNKVKDLNQEIEKLKTQikhFESLEEELKKMR 317
Cdd:pfam05483  606 KNKNIEE-------------------LHQENKALKKKGSAEnKQLNAYEIKVNKLELELASAKQK---FEEIIDNYQKEI 663
                          330       340       350
                   ....*....|....*....|....*....|
gi 1720408942  318 AKNNDLQDNYLTELNRNRSLASqlEEIKLQ 347
Cdd:pfam05483  664 EDKKISEEKLLEEVEKAKAIAD--EAVKLQ 691
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
4-352 8.14e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.22  E-value: 8.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942    4 LTNYKDAASNRHLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGS-----DSSTLAEIEVLRQRVLKIEGK 78
Cdd:COG4717    125 LQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEEleellEQLSLATEEELQDLAEELEEL 204
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   79 DEEIKRAEDlchtmkeKLEEEENLTRELKSEIERLQKRMV---DLEKLEEALSRSKNECSQLCLSLNEERNL-------- 147
Cdd:COG4717    205 QQRLAELEE-------ELEEAQEELEELEEELEQLENELEaaaLEERLKEARLLLLIAAALLALLGLGGSLLsliltiag 277
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  148 ---------------TKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNE--------RKYLNEKE 204
Cdd:COG4717    278 vlflvlgllallfllLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLElldrieelQELLREAE 357
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  205 KENEKI-----IKELTQKLEQNKKMNRDHMRNAST-FLERNDLRIE-DGISSTLSSKESKRKGSLDYLKQVENETRDkSE 277
Cdd:COG4717    358 ELEEELqleelEQEIAALLAEAGVEDEEELRAALEqAEEYQELKEElEELEEQLEELLGELEELLEALDEEELEEEL-EE 436
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  278 NEKNRNQEDNKVKDLNQEIEKLKTQIKHFES------LEEELKKMRAKNNDLQDNYLtelnRNRSLASQLEEIKLQVRKQ 351
Cdd:COG4717    437 LEEELEELEEELEELREELAELEAELEQLEEdgelaeLLQELEELKAELRELAEEWA----ALKLALELLEEAREEYREE 512

                   .
gi 1720408942  352 K 352
Cdd:COG4717    513 R 513
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
73-374 8.59e-04

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 43.50  E-value: 8.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   73 LKIEGKDEEIKRAEDLCHTMKEKLEEEENlTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQL-CLSLNEERNLTKKI 151
Cdd:TIGR01612 1226 LFLEKIDEEKKKSEHMIKAMEAYIEDLDE-IKEKSPEIENEMGIEMDIKAEMETFNISHDDDKDHhIISKKHDENISDIR 1304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  152 SSELEMLRVKVKEleSSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKEN--EKII---KELTQKLEQNKKMNR 226
Cdd:TIGR01612 1305 EKSLKIIEDFSEE--SDINDIKKELQKNLLDAQKHNSDINLYLNEIANIYNILKLNkiKKIIdevKEYTKEIEENNKNIK 1382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  227 DHMRNASTFLE--RNDLRIED---GISSTLSSKE-----SKRKGSLDYL--KQVENETRDKSENEKNRN----------- 283
Cdd:TIGR01612 1383 DELDKSEKLIKkiKDDINLEEcksKIESTLDDKDideciKKIKELKNHIlsEESNIDTYFKNADENNENvlllfkniema 1462
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  284 ----------QEDNKVKDLNQEIEKLKTQIKHFESLEEEL---KKMRAKNNDLQDNYLTELNR--NRSLASQLEEIKLQV 348
Cdd:TIGR01612 1463 dnksqhilkiKKDNATNDHDFNINELKEHIDKSKGCKDEAdknAKAIEKNKELFEQYKKDVTEllNKYSALAIKNKFAKT 1542
                          330       340
                   ....*....|....*....|....*..
gi 1720408942  349 RKQKELGNGDI-EGEDAFLLGRGRHER 374
Cdd:TIGR01612 1543 KKDSEIIIKEIkDAHKKFILEAEKSEQ 1569
COG5022 COG5022
Myosin heavy chain [General function prediction only];
14-347 1.08e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 43.14  E-value: 1.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   14 RHLRFKLQSLSRRLDELEEATKNL---QRAEDELLDLQDKVIQAEGSDSSTLAEievLRQRVLKIEGKDEEIKRAEDLCH 90
Cdd:COG5022    758 RYLRRRYLQALKRIKKIQVIQHGFrlrRLVDYELKWRLFIKLQPLLSLLGSRKE---YRSYLACIIKLQKTIKREKKLRE 834
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   91 TMKEKLEEEENLTRELKSEIERLQKRMVDLEKlEEALSRSKNEcsqlclSLNEERNLT--KKISSELEMLRVKVKELESS 168
Cdd:COG5022    835 TEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKK-ETIYLQSAQR------VELAERQLQelKIDVKSISSLKLVNLELESE 907
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  169 EDRLDKTEQSlvSELEKLKSLTLSFVNERKYLNEKEKENEKIIKeltqkLEQNKKMNRDHMRNAS---TFLERNDLRIED 245
Cdd:COG5022    908 IIELKKSLSS--DLIENLEFKTELIARLKKLLNNIDLEEGPSIE-----YVKLPELNKLHEVESKlkeTSEEYEDLLKKS 980
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  246 GIS-STLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKKMRA---KNN 321
Cdd:COG5022    981 TILvREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLlllENN 1060
                          330       340
                   ....*....|....*....|....*.
gi 1720408942  322 DLQDNYLTELNRNRSLASQLEEIKLQ 347
Cdd:COG5022   1061 QLQARYKALKLRRENSLLDDKQLYQL 1086
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
81-413 1.12e-03

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 43.28  E-value: 1.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   81 EIKRAEDLCHTMKE---KLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEM 157
Cdd:PTZ00440   496 YQEKVDELLQIINSikeKNNIVNNNFKNIEDYYITIEGLKNEIEGLIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKY 575
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  158 LR---VKVKELESSEDRLDKTEQslvsELEKLKSLTLSfvNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMrnaST 234
Cdd:PTZ00440   576 IEenvDHIKDIISLNDEIDNIIQ----QIEELINEALF--NKEKFINEKNDLQEKVKYILNKFYKGDLQELLDEL---SH 646
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  235 FLERNdlriedgisstlsSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNkvkdlNQEIEKLKTQIKHFESLEEELK 314
Cdd:PTZ00440   647 FLDDH-------------KYLYHEAKSKEDLQTLLNTSKNEYEKLEFMKSDNI-----DNIIKNLKKELQNLLSLKENII 708
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  315 KMRAKN--NDLQDNYLTELNRNRSLASQLEEIKLQVRKQKELGNGDIEGEDAFLLGRGRHERTKLKGHGSEASVSKHtsr 392
Cdd:PTZ00440   709 KKQLNNieQDISNSLNQYTIKYNDLKSSIEEYKEEEEKLEVYKHQIINRKNEFILHLYENDKDLPDGKNTYEEFLQY--- 785
                          330       340
                   ....*....|....*....|.
gi 1720408942  393 elspqhkRERLRNREFALSNE 413
Cdd:PTZ00440   786 -------KDTILNKENKISND 799
PTZ00121 PTZ00121
MAEBL; Provisional
25-364 1.24e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.21  E-value: 1.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   25 RRLDELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAE-----------IEVLRQRVLKiegKDEEIKRAEDLCHTMK 93
Cdd:PTZ00121  1134 RKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEdakkaeaarkaEEVRKAEELR---KAEDARKAEAARKAEE 1210
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   94 EKLEEEENLTRELK--SEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRvKVKELESSEDR 171
Cdd:PTZ00121  1211 ERKAEEARKAEDAKkaEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEAR-KADELKKAEEK 1289
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  172 LDKTEQSLVSELEKLKSLtlsfvnerkylnEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTFLERNDLRIEDGISSTL 251
Cdd:PTZ00121  1290 KKADEAKKAEEKKKADEA------------KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  252 SSKESKRKGSLDYLKQveNETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKKM-RAKNNDLQDNYLTE 330
Cdd:PTZ00121  1358 EAEAAEEKAEAAEKKK--EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAdEAKKKAEEKKKADE 1435
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1720408942  331 LNRNRSLASQLEEIKLQVRKQKELGNGDIEGEDA 364
Cdd:PTZ00121  1436 AKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEA 1469
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
27-345 1.34e-03

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 42.53  E-value: 1.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   27 LDELEEATK--NLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLrqrvLKIEGKD-EEIKRAEDLchtmkekleeeenlT 103
Cdd:pfam06160   69 LFEAEELNDkyRFKKAKKALDEIEELLDDIEEDIKQILEELDEL----LESEEKNrEEVEELKDK--------------Y 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  104 RELKSEIerLQKRM---VDLEKLEEALSRSKNECSQLcLSLNEE------RNLTKKISSELEMLRVKVKELEsseDRLDK 174
Cdd:pfam06160  131 RELRKTL--LANRFsygPAIDELEKQLAEIEEEFSQF-EELTESgdyleaREVLEKLEEETDALEELMEDIP---PLYEE 204
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  175 TEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEkiIKELTQKLEQNKKM-NRDHMRNASTFLERNDLRIeDGISSTL-- 251
Cdd:pfam06160  205 LKTELPDQLEELKEGYREMEEEGYALEHLNVDKE--IQQLEEQLEENLALlENLELDEAEEALEEIEERI-DQLYDLLek 281
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  252 --SSK---ESKRKGSLDYLKQVENETRD-KSENE---KN---RNQEDNKVKDLNQEIEKLKTQikhFESLEEELKKMRAK 319
Cdd:pfam06160  282 evDAKkyvEKNLPEIEDYLEHAEEQNKElKEELErvqQSytlNENELERVRGLEKQLEELEKR---YDEIVERLEEKEVA 358
                          330       340
                   ....*....|....*....|....*.
gi 1720408942  320 NNDLQDNYLTELNRNRSLASQLEEIK 345
Cdd:pfam06160  359 YSELQEELEEILEQLEEIEEEQEEFK 384
PRK12704 PRK12704
phosphodiesterase; Provisional
108-220 1.42e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 42.46  E-value: 1.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  108 SEIERLQKRMVDLEKLE------EALSRSKNECSQLCLSLNEERNLTKKISSELEmLRVKVKE--LESSEDRLDKTEQSL 179
Cdd:PRK12704    34 KEAEEEAKRILEEAKKEaeaikkEALLEAKEEIHKLRNEFEKELRERRNELQKLE-KRLLQKEenLDRKLELLEKREEEL 112
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1720408942  180 VSELEKLKsltlsfvNERKYLNEKEKENEKIIKELTQKLEQ 220
Cdd:PRK12704   113 EKKEKELE-------QKQQELEKKEEELEELIEEQLQELER 146
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
28-349 1.54e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 42.73  E-value: 1.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   28 DELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDLchtmkekleeeENLTRELK 107
Cdd:TIGR00606  577 DWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDE-----------ESDLERLK 645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  108 SEIERLQKRMVDL-------EKLEEALSRSKNECSQLC----LSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTE 176
Cdd:TIGR00606  646 EEIEKSSKQRAMLagatavySQFITQLTDENQSCCPVCqrvfQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRR 725
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  177 QSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHM-RNASTFLERNDLRIEDGISSTLSSKE 255
Cdd:TIGR00606  726 DEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMpEEESAKVCLTDVTIMERFQMELKDVE 805
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  256 SKRKGSLDYLKQVENEtRDKSENEKNRNQEDNKVKDLNQEIEKLKtqiKHFESLEEELKKMRAKNNDLQDNYL---TELN 332
Cdd:TIGR00606  806 RKIAQQAAKLQGSDLD-RTVQQVNQEKQEKQHELDTVVSKIELNR---KLIQDQQEQIQHLKSKTNELKSEKLqigTNLQ 881
                          330
                   ....*....|....*..
gi 1720408942  333 RNRSLASQLEEIKLQVR 349
Cdd:TIGR00606  882 RRQQFEEQLVELSTEVQ 898
PTZ00121 PTZ00121
MAEBL; Provisional
25-315 1.68e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 42.82  E-value: 1.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   25 RRLDELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIE--GKDEEIKRAEDLCHTMKEKLEEEENL 102
Cdd:PTZ00121  1230 KKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADelKKAEEKKKADEAKKAEEKKKADEAKK 1309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  103 TRELKSEIERLQKRMVDLEKLEEALSRSKNEcsqlclslNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSE 182
Cdd:PTZ00121  1310 KAEEAKKADEAKKKAEEAKKKADAAKKKAEE--------AKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  183 LEKLKSLTLSFVNERKylnEKEKENEKIIKELTQKLEQNKKMnrDHMRNASTFLERND---LRIEDGISSTLSSKESKRK 259
Cdd:PTZ00121  1382 AAKKKAEEKKKADEAK---KKAEEDKKKADELKKAAAAKKKA--DEAKKKAEEKKKADeakKKAEEAKKADEAKKKAEEA 1456
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720408942  260 GSLDYLKQVENETRdKSENEKNRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKK 315
Cdd:PTZ00121  1457 KKAEEAKKKAEEAK-KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK 1511
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
16-353 2.10e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 42.34  E-value: 2.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   16 LRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTL-------AEIEVLRQRVLKIEGKDEEIKRAEDL 88
Cdd:TIGR00606  191 LRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSReivksyeNELDPLKNRLKEIEHNLSKIMKLDNE 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   89 CHTMKEKLEEEENLTRELKSEIERL----QKRMVDLEKLEEALSRSKNE----CSQLCLSLNEERNLTKKISSELEmlrV 160
Cdd:TIGR00606  271 IKALKSRKKQMEKDNSELELKMEKVfqgtDEQLNDLYHNHQRTVREKERelvdCQRELEKLNKERRLLNQEKTELL---V 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  161 KVKELESSEDRLDktEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMrnastflernd 240
Cdd:TIGR00606  348 EQGRLQLQADRHQ--EHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQLC----------- 414
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  241 lriedgisSTLSSKESKRKGSLDYLKQVENETRDKSENEKNR-NQEDNKVKDLNQEIEKLKTQIKHFESLEEELKKMRAK 319
Cdd:TIGR00606  415 --------ADLQSKERLKQEQADEIRDEKKGLGRTIELKKEIlEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERE 486
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1720408942  320 NNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKE 353
Cdd:TIGR00606  487 LSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKL 520
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
153-364 2.58e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 41.35  E-value: 2.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  153 SELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMR-- 230
Cdd:COG3883     16 PQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARal 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  231 -----------------NASTFLERNDLRiedgisSTLSSKESkrkgsldylKQVENETRDKSENEKNRNQEDNKVKDLN 293
Cdd:COG3883     96 yrsggsvsyldvllgseSFSDFLDRLSAL------SKIADADA---------DLLEELKADKAELEAKKAELEAKLAELE 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720408942  294 QEIEKLKTQIKhfeSLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKELGNGDIEGEDA 364
Cdd:COG3883    161 ALKAELEAAKA---ELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAA 228
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
10-192 2.62e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.29  E-value: 2.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   10 AASNRHLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRvlkIEGKDEEIKR----- 84
Cdd:COG4942     37 AELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE---LEAQKEELAEllral 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   85 --------------AEDLcHTMKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERN---- 146
Cdd:COG4942    114 yrlgrqpplalllsPEDF-LDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAalea 192
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720408942  147 -------LTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSLTLS 192
Cdd:COG4942    193 lkaerqkLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPA 245
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
19-354 2.64e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 41.96  E-value: 2.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   19 KLQSLSRRLDELEEATKNLQRAEDELLDlqdkviQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKRAEDLCHTMKEKLEE 98
Cdd:TIGR00606  406 EAKTAAQLCADLQSKERLKQEQADEIRD------EKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQE 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   99 EENLTREL-----KSEIERLQKRMVDLEKLEEALSRSKNECSQLclslNEERNLTKKISSELEML-RVKVKELESSEDRL 172
Cdd:TIGR00606  480 LRKAERELskaekNSLTETLKKEVKSLQNEKADLDRKLRKLDQE----MEQLNHHTTTRTQMEMLtKDKMDKDEQIRKIK 555
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  173 DKTEQSLVSELEKL---KSLTLSFVNERKYLNEKEKENEKIIKELtQKLEQNKkmnrDHMRNASTFLERNDLRIEDGISS 249
Cdd:TIGR00606  556 SRHSDELTSLLGYFpnkKQLEDWLHSKSKEINQTRDRLAKLNKEL-ASLEQNK----NHINNELESKEEQLSSYEDKLFD 630
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  250 TLSSKESKRK-------------------GSLDYLKQVENETRDKSENEKNRNQEDNKVK-DLNQEIEKLKTQIKHF--- 306
Cdd:TIGR00606  631 VCGSQDEESDlerlkeeieksskqramlaGATAVYSQFITQLTDENQSCCPVCQRVFQTEaELQEFISDLQSKLRLApdk 710
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720408942  307 -ESLEEELKK-------------MRAKNNDLQDNYLTEL-NRNRSLASQLEEIKLQVRKQKEL 354
Cdd:TIGR00606  711 lKSTESELKKkekrrdemlglapGRQSIIDLKEKEIPELrNKLQKVNRDIQRLKNDIEEQETL 773
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
28-423 3.34e-03

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 41.75  E-value: 3.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   28 DELEEATKNLQRAEDELLDLQDKVIQAEGSDSSTLAEIEVLRQRVLKIEGKDEEIKR----------------------- 84
Cdd:pfam12128  471 ERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELELqlfpqagtllhflrkeapdweqs 550
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   85 ------AEDLCHTMKEKLEEEENLTRELKSEIERLQKRMVDLEK---LEEALsrsKNECSQLCLSLNEERNLTKKISSEL 155
Cdd:pfam12128  551 igkvisPELLHRTDLDPEVWDGSVGGELNLYGVKLDLKRIDVPEwaaSEEEL---RERLDKAEEALQSAREKQAAAEEQL 627
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  156 EMLRVKVKELESSEDRldkTEQSLVSELEKLKSLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTF 235
Cdd:pfam12128  628 VQANGELEKASREETF---ARTALKNARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEAQLKQLDKKHQAWLEE 704
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  236 LERNDLRIEDGISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKvKDLN------QEIEKLKTQIKHFESL 309
Cdd:pfam12128  705 QKEQKREARTEKQAYWQVVEGALDAQLALLKAAIAARRSGAKAELKALETWYK-RDLAslgvdpDVIAKLKREIRTLERK 783
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  310 EEELKKMRAKNNDLQDNYL-TELNRNRSLASQLEEIKLQVRkqkelgngDIEGEdaflLGRGRHE-RTKLKGHGSEASVS 387
Cdd:pfam12128  784 IERIAVRRQEVLRYFDWYQeTWLQRRPRLATQLSNIERAIS--------ELQQQ----LARLIADtKLRRAKLEMERKAS 851
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1720408942  388 KHTSRELSPQHKRERLRNREFALSNEHYSLSSKQAS 423
Cdd:pfam12128  852 EKQQVRLSENLRGLRCEMSKLATLKEDANSEQAQGS 887
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
28-245 4.03e-03

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 40.01  E-value: 4.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   28 DELEEATKNLQRAEDELLDLQDKVIQAEgsdsstlAEIEVLRQRVLKIEgkdEEIKRAEDLCHTMKEKLEEEENLTrelk 107
Cdd:pfam00261    8 EELDEAEERLKEAMKKLEEAEKRAEKAE-------AEVAALNRRIQLLE---EELERTEERLAEALEKLEEAEKAA---- 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  108 SEIERLQKrmvdleKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVK----ELESSEDRLDKTEqSLVSEL 183
Cdd:pfam00261   74 DESERGRK------VLENRALKDEEKMEILEAQLKEAKEIAEEADRKYEEVARKLVvvegDLERAEERAELAE-SKIVEL 146
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  184 EK--------LKSLTLSfvnERKYlNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTfLERNDLRIED 245
Cdd:pfam00261  147 EEelkvvgnnLKSLEAS---EEKA-SEREDKYEEQIRFLTEKLKEAETRAEFAERSVQK-LEKEVDRLED 211
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
103-380 4.17e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 40.66  E-value: 4.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  103 TRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTKKISSELEMLRVKVKELESSEDRLDKTEQSLVSE 182
Cdd:COG4372     58 REELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQ 137
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  183 LEKLKSltlSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNASTFLERNDLRIEDGISSTLSSKESKRKGSL 262
Cdd:COG4372    138 IAELQS---EIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPR 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  263 DYLKQVENETRDKSENEKnRNQEDNKVKDLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNrnrslaSQLE 342
Cdd:COG4372    215 ELAEELLEAKDSLEAKLG-LALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAAL------ELEA 287
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1720408942  343 EIKLQVRKQKELGNGDIEGEDAFLLGRGRHERTKLKGH 380
Cdd:COG4372    288 LEEAALELKLLALLLNLAALSLIGALEDALLAALLELA 325
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
23-325 4.23e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 41.31  E-value: 4.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   23 LSRRLDELEEATKNLQRAEDELLDLQDKVIQAEG----------SDSSTLAEIEVLRQRvlkIEGKDEEIkraEDLCHTM 92
Cdd:pfam01576    7 MQAKEEELQKVKERQQKAESELKELEKKHQQLCEeknalqeqlqAETELCAEAEEMRAR---LAARKQEL---EEILHEL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   93 KEKLEEEENLTRELKSEIERLQKRMVDLE-KLEE------ALSRSKNECSQLCLSLNEERNLTKKISSELEmlrvkvKEL 165
Cdd:pfam01576   81 ESRLEEEEERSQQLQNEKKKMQQHIQDLEeQLDEeeaarqKLQLEKVTTEAKIKKLEEDILLLEDQNSKLS------KER 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  166 ESSEDRLDKTEQSLVSELEKLKSLT-LSFVNERKY--LNEKEKENEKiikeLTQKLEQNKKMNRDHMRNASTFLERNDLR 242
Cdd:pfam01576  155 KLLEERISEFTSNLAEEEEKAKSLSkLKNKHEAMIsdLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQ 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  243 IEDgISSTLSSKESKRKGSLDylkQVENETRDKSENEKNRNQEDNKVKDLNQEIEKLKTQ----IKHFESLEEELKKMRA 318
Cdd:pfam01576  231 IAE-LRAQLAKKEEELQAALA---RLEEETAQKNNALKKIRELEAQISELQEDLESERAArnkaEKQRRDLGEELEALKT 306

                   ....*..
gi 1720408942  319 KNNDLQD 325
Cdd:pfam01576  307 ELEDTLD 313
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
104-341 4.59e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 40.65  E-value: 4.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  104 RELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLtkkISSELEMLRVKVKELESSEDRLDKTEQSLVSEL 183
Cdd:pfam07888   76 RELESRVAELKEELRQSREKHEELEEKYKELSASSEELSEEKDA---LLAQRAAHEARIRELEEDIKTLTQRVLERETEL 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  184 EKLKSLTLSFVNERKylnEKEKENEKIIKELTQKLEQNKKMNRDHMRnastflERNDLRIEDGISSTLSSKESKRKGSLD 263
Cdd:pfam07888  153 ERMKERAKKAGAQRK---EEEAERKQLQAKLQQTEEELRSLSKEFQE------LRNSLAQRDTQVLQLQDTITTLTQKLT 223
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720408942  264 YLKQVEnetrdkSENEKNRnqednkvKDLNQEIEKLKTQIKHFESLEEELKKMRAknndLQDNYLTELNRNRSLASQL 341
Cdd:pfam07888  224 TAHRKE------AENEALL-------EELRSLQERLNASERKVEGLGEELSSMAA----QRDRTQAELHQARLQAAQL 284
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
262-346 4.86e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 40.39  E-value: 4.86e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   262 LDYLKQVENETRDKSENEKNRNQEdnKVKDLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNrnrSLASQL 341
Cdd:smart00787  188 LRQLKQLEDELEDCDPTELDRAKE--KLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIA---EAEKKL 262

                    ....*
gi 1720408942   342 EEIKL 346
Cdd:smart00787  263 EQCRG 267
PRK12704 PRK12704
phosphodiesterase; Provisional
197-353 5.34e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 40.53  E-value: 5.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  197 RKYLNEKEKENEKIIKELTQKLEQNKKmnrDHMRNASTflERNDLRIEdgisstlSSKESKRKgsLDYLKQVENETRDKS 276
Cdd:PRK12704    30 EAKIKEAEEEAKRILEEAKKEAEAIKK---EALLEAKE--EIHKLRNE-------FEKELRER--RNELQKLEKRLLQKE 95
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720408942  277 ENEKNRNQednkvkDLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASqlEEIKLQVRKQKE 353
Cdd:PRK12704    96 ENLDRKLE------LLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTA--EEAKEILLEKVE 164
PilN COG3166
Type IV pilus assembly protein PilN [Cell motility, Extracellular structures];
277-347 6.78e-03

Type IV pilus assembly protein PilN [Cell motility, Extracellular structures];


Pssm-ID: 442399 [Multi-domain]  Cd Length: 185  Bit Score: 38.79  E-value: 6.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  277 ENEKNRNQEdnkvkdLNQEIEKLKTQIKHFESLEEELKKMRAKNN---DLQDN------YLTELNRNRSLASQLEEIKLQ 347
Cdd:COG3166     48 AQQQARNAA------LQQEIAKLDKQIAEIKELKKQKAELLARLQvieQLQQSrppwvhLLDELARLLPEGVWLTSLSQQ 121
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
147-354 7.41e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 40.14  E-value: 7.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  147 LTKKISSELEML-RVKVKELESSEDRLDKTEQSLVSELEKLKSLTlSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMN 225
Cdd:COG4717     47 LLERLEKEADELfKPQGRKPELNLKELKELEEELKEAEEKEEEYA-ELQEELEELEEELEELEAELEELREELEKLEKLL 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  226 RdhmrNASTFLERNDLRIEdgisstLSSKESKrkgsLDYLKQVENETRDKSENEKNRNQEdnkVKDLNQEIEKLKTQI-- 303
Cdd:COG4717    126 Q----LLPLYQELEALEAE------LAELPER----LEELEERLEELRELEEELEELEAE---LAELQEELEELLEQLsl 188
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1720408942  304 ---KHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKEL 354
Cdd:COG4717    189 ateEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALE 242
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
15-354 7.64e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 40.34  E-value: 7.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   15 HLRFKLQSLSRRLDELEEATKNLQRAEDELLDLQDKVIQAEGSD-------------SSTLAEIEVLRQRVL-----KIE 76
Cdd:TIGR00618  390 TLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQelqqryaelcaaaITCTAQCEKLEKIHLqesaqSLK 469
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   77 GKDEEIKRAEDLCHTMKEKLEEEENLTRELKSEIERLQKRMVDLEKLEEALSRSKNECSQLCLSLNEERNLTK---KISS 153
Cdd:TIGR00618  470 EREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETseeDVYH 549
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  154 ELEMLRVKVKELESSEDRLDKTEQSLVSELEKLKSL---TLSFVNERKYLNEKEKENEKIIKELTQKL------EQNKKM 224
Cdd:TIGR00618  550 QLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDipnLQNITVRLQDLTEKLSEAEDMLACEQHALlrklqpEQDLQD 629
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  225 NRDHMRNAS-------TFLERNDL-----RIEDGISSTLSSKESKRKGSLDYLKQVENETRDKSENEKNRNQEDNKVKDL 292
Cdd:TIGR00618  630 VRLHLQQCSqelalklTALHALQLtltqeRVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLREL 709
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720408942  293 NQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQVRKQKEL 354
Cdd:TIGR00618  710 ETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEVT 771
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
37-380 7.82e-03

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 40.11  E-value: 7.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   37 LQRAEDELLDLQDKVIQAEGSDSSTLAEIEvlRQRVLKIEGKDEEIKRAEDLCHTMKEKLEEEENLTRELKSEIERLQKR 116
Cdd:pfam05557    4 LIESKARLSQLQNEKKQMELEHKRARIELE--KKASALKRQLDRESDRNQELQKRIRLLEKREAEAEEALREQAELNRLK 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  117 MvdleKLEEALSRSKNECSQLCLSLNE-ERNLTKKIS--------SELEmLRVKVKELESSEDRLDKTEQSLvSELEKLK 187
Cdd:pfam05557   82 K----KYLEALNKKLNEKESQLADAREvISCLKNELSelrrqiqrAELE-LQSTNSELEELQERLDLLKAKA-SEAEQLR 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  188 SLTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKKMNRDHMRNAStfLERNDLRIEDGISSTLSSKESKrkgslDYLKQ 267
Cdd:pfam05557  156 QNLEKQQSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSELARIPE--LEKELERLREHNKHLNENIENK-----LLLKE 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  268 VENETRDKSENEKNRNQEdnkVKDLNQEIEKLKTQIKHFESLEEELKKMRAKNNDLQDNYLTELNRNRSLASQLEEIKLQ 347
Cdd:pfam05557  229 EVEDLKRKLEREEKYREE---AATLELEKEKLEQELQSWVKLAQDTGLNLRSPEDLSRRIEQLQQREIVLKEENSSLTSS 305
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1720408942  348 VRkQKELGNGDIEGEDAFLLGRGRHERTKLKGH 380
Cdd:pfam05557  306 AR-QLEKARRELEQELAQYLKKIEDLNKKLKRH 337
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
11-223 8.14e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 40.44  E-value: 8.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   11 ASNRHLRFKLQSLSRRLDELEEATKNLQRAEDElLDLQDKVIQAEGSDS-----STLAEIEVLRQRVLKIEGKDEEIKRA 85
Cdd:TIGR02169  812 ARLREIEQKLNRLTLEKEYLEKEIQELQEQRID-LKEQIKSIEKEIENLngkkeELEEELEELEAALRDLESRLGDLKKE 890
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   86 EDlchTMKEKLEEEENLTRELKSEIERLQKRMVDL-EKLEEALSRSKnecsqlclSLNEERNLTKKISSELEMLRVKVKE 164
Cdd:TIGR02169  891 RD---ELEAQLRELERKIEELEAQIEKKRKRLSELkAKLEALEEELS--------EIEDPKGEDEEIPEEELSLEDVQAE 959
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408942  165 LESSEDRLDKTEQSLVSELEKLKSlTLSFVNERKYLNEKEKENEKIIKELTQKLEQNKK 223
Cdd:TIGR02169  960 LQRVEEEIRALEPVNMLAIQEYEE-VLKRLDELKEKRAKLEEERKAILERIEEYEKKKR 1017
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
35-347 9.02e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 40.00  E-value: 9.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   35 KNLQRAEDELlDLQDKVIQAEGSDSSTLAE----IEVLRQR---VLKIEGKDEEIKRAEDLC---------HTMKEKLEE 98
Cdd:COG3206     94 PVLERVVDKL-NLDEDPLGEEASREAAIERlrknLTVEPVKgsnVIEISYTSPDPELAAAVAnalaeayleQNLELRREE 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942   99 EENLTRELKSEIERLQKRmvdLEKLEEALSRSKNEcSQLcLSLNEERNLTKKISSELEMLRVKVK-ELESSEDRLDKTEQ 177
Cdd:COG3206    173 ARKALEFLEEQLPELRKE---LEEAEAALEEFRQK-NGL-VDLSEEAKLLLQQLSELESQLAEARaELAEAEARLAALRA 247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  178 SLVSELEKLKSLTlsfvnerkylnekekeNEKIIKELTQKLEQnkkMNRDHMRNASTFLERNDLRIEdgisstlssKESK 257
Cdd:COG3206    248 QLGSGPDALPELL----------------QSPVIQQLRAQLAE---LEAELAELSARYTPNHPDVIA---------LRAQ 299
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408942  258 RKGSLDYLKQVENETRDKSENEKNRNQEdnKVKDLNQEIEKLKTQIKHFESLEEELkkmraknNDLQDNYltELNRN--R 335
Cdd:COG3206    300 IAALRAQLQQEAQRILASLEAELEALQA--REASLQAQLAQLEARLAELPELEAEL-------RRLEREV--EVARElyE 368
                          330
                   ....*....|..
gi 1720408942  336 SLASQLEEIKLQ 347
Cdd:COG3206    369 SLLQRLEEARLA 380
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH