|
Name |
Accession |
Description |
Interval |
E-value |
| Ubl_RB1CC1 |
cd17060 |
ubiquitin-like (Ubl) domain found in retinoblastoma 1-inducible coiled-coil protein 1 (RB1CC1) ... |
2-76 |
3.41e-40 |
|
ubiquitin-like (Ubl) domain found in retinoblastoma 1-inducible coiled-coil protein 1 (RB1CC1) and similar proteins; RB1CC1, also termed FAK family kinase-interacting protein of 200 kDa (FIP200), is the mammalian counterpart of the yeast Atg17 gene and functions as a component of the ULK1/Atg13/RB1CC1/Atg101 complex essential for induction of autophagy. RB1CC1 is a key signaling node to regulate cellular proliferation and differentiation. As a DNA-binding transcription factor, RB1CC1 has been implicated in the regulation of retinoblastoma 1 (RB1) expression. RB1CC1 contains a conserved ubiquitin-like (Ubl) domain with a beta-grasp Ubl fold, as well as a nuclear localization signal (KPRK), a leucine zipper motif and a coiled-coil structure.
Pssm-ID: 340580 Cd Length: 75 Bit Score: 143.09 E-value: 3.41e-40
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374 2 KLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQVLVVNGGECMAADRRVCTYSAGTDTNPIFLFNK 76
Cdd:cd17060 1 MLYVFLVNTGTMLTFDMSLAVQSVSDLKEAIEEKTKIPVEKQVLLVSGGESLDPDKRVSSYSAGTDTNPIFLFNK 75
|
|
| ATG11 |
pfam10377 |
Autophagy-related protein 11; The function of this family is conflicting. In the fission yeast, ... |
1456-1582 |
1.12e-26 |
|
Autophagy-related protein 11; The function of this family is conflicting. In the fission yeast, Schizosaccharomyces pombe, this protein has been shown to interact with the telomere cap complex and it is also required for dimerization and kinase activation of Atg1. However, in budding yeast, Saccharomyces cerevisiae, this protein is called ATG11 and is shown to be involved in autophagy.
Pssm-ID: 463063 Cd Length: 130 Bit Score: 106.53 E-value: 1.12e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1456 LERTLQlKEEENKRlnqrlmsQSLSSVSSRHSEKIAIRDFQVGDLVLIILDERHDN-----YVLFTVSPTLYFLHSESLP 1530
Cdd:pfam10377 14 LAKKLT-KENKSKR-------EKLEKLQSEAHEKITLKNFKVGDLALFLPTRRHNDlstppWAAFNVGAPHYFLKADSLL 85
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 226693374 1531 ALdlkpgegasgaSRRPWVLGKVMEKEYCQAK---KAQNRFKVPLGTKFYRVKAV 1582
Cdd:pfam10377 86 AL-----------KSREWFVGRITSIEERVVDlsdKDSNPFKLSKGTVWYLVTAE 129
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
827-1317 |
6.85e-14 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 76.98 E-value: 6.85e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 827 KEQCDLANYLKCTAVEIRNIIEKVKCSLEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNK----ILKLKENLVSLE 902
Cdd:TIGR04523 166 KQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQISELKKQNNQlkdnIEKKQQEINEKT 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 903 EALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKqsreMALEDLKKLHDEKIESLRAEFQCLEQN 982
Cdd:TIGR04523 246 TEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLK----SEISDLNNQKEQDWNKELKSELKNQEK 321
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 983 HLKELEDTLhirhtQEFEKVMTDHNMSLEKLKKE------NQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKL 1056
Cdd:TIGR04523 322 KLEEIQNQI-----SQNNKIISQLNEQISQLKKEltnsesENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDL 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1057 EVELALKEAETDEIkilleESRTQQKEMLKSLLEQETENLRTEISKLNQKIHD-NNESYQVGLS--ELRALMTIEKDQCI 1133
Cdd:TIGR04523 397 ESKIQNQEKLNQQK-----DEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDlTNQDSVKELIikNLDNTRESLETQLK 471
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1134 SELISRHEEESN--ILKAELDNVTSLHRQAYEIEKKLKEQIVELqTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKD 1211
Cdd:TIGR04523 472 VLSRSINKIKQNleQKQKELKSKEKELKKLNEEKKELEEKVKDL-TKKISSLKEKIEKLESEKKEKESKISDLEDELNKD 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1212 KERLVKNH-EQDKEHLIQELnfEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEK 1290
Cdd:TIGR04523 551 DFELKKENlEKEIDEKNKEI--EELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEK 628
|
490 500 510
....*....|....*....|....*....|
gi 226693374 1291 LQEEKAKF---LEQLEEQEKRKNEEMQNVR 1317
Cdd:TIGR04523 629 LSSIIKNIkskKNKLKQEVKQIKETIKEIR 658
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
795-1373 |
1.51e-13 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 76.31 E-value: 1.51e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 795 VQLEKCRAAAQDSHSSIQTIkddlcHFRTF------VQKEQCDLANYLKCTAVEIRNIIEKVKCS----LEITLKEkHQQ 864
Cdd:pfam15921 194 VDFEEASGKKIYEHDSMSTM-----HFRSLgsaiskILRELDTEISYLKGRIFPVEDQLEALKSEsqnkIELLLQQ-HQD 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 865 ELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFtsIKHEKDAIVCVQQEKDQ----------KLL 934
Cdd:pfam15921 268 RIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMY--MRQLSDLESTVSQLRSElreakrmyedKIE 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 935 EMEKIMHTQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFqcleqnHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLK 1014
Cdd:pfam15921 346 ELEKQLVLANSELTEARTERDQFSQESGNL-DDQLQKLLADL------HKREKELSLEKEQNKRLWDRDTGNSITIDHLR 418
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1015 KE------NQQRIDQMLESHASTIQ-EKEQQLQELKLKVSDL---SDMRCKLEVELALKEAETDEI---KILLEESRTQQ 1081
Cdd:pfam15921 419 RElddrnmEVQRLEALLKAMKSECQgQMERQMAAIQGKNESLekvSSLTAQLESTKEMLRKVVEELtakKMTLESSERTV 498
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1082 KEMLKSLLEQE--TENLRTEISKLNQKI--------HDNNE-----SYQVGLSELRALMTiEKDQCISELISRHEEESNI 1146
Cdd:pfam15921 499 SDLTASLQEKEraIEATNAEITKLRSRVdlklqelqHLKNEgdhlrNVQTECEALKLQMA-EKDKVIEILRQQIENMTQL 577
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1147 LKAELDNVTSLHRQAYEIEKKLKEQIVELQtrlnsELSALEKQKDEKITQQEEKYEAL------IQNLEKDKERLVKNHE 1220
Cdd:pfam15921 578 VGQHGRTAGAMQVEKAQLEKEINDRRLELQ-----EFKILKDKKDAKIRELEARVSDLelekvkLVNAGSERLRAVKDIK 652
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1221 QDKEHLIQELNFEKNKaVQTALDEFKV----------ERELVEKELLEKVKHLENQIAKT----PAFESAREDSSSLVAE 1286
Cdd:pfam15921 653 QERDQLLNEVKTSRNE-LNSLSEDYEVlkrnfrnkseEMETTTNKLKMQLKSAQSELEQTrntlKSMEGSDGHAMKVAMG 731
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1287 LQEKLQ---------EEKAKFLEQ-----------LEEQEKRKNEEMQNVRTSLiAEQQTNFNTVLTREKMRKENIINDL 1346
Cdd:pfam15921 732 MQKQITakrgqidalQSKIQFLEEamtnankekhfLKEEKNKLSQELSTVATEK-NKMAGELEVLRSQERRLKEKVANME 810
|
650 660
....*....|....*....|....*..
gi 226693374 1347 SDKLKSTMQQQERdKDLIESLSEDRAR 1373
Cdd:pfam15921 811 VALDKASLQFAEC-QDIIQRQEQESVR 836
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
876-1370 |
2.55e-13 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 75.06 E-value: 2.55e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 876 KLDALVKDSEENVNKILKLKENL----VSLEEALQNKDNEFTSIKHE---KDAIVCVQQEKDQKLLEMEKimhtqhcEIK 948
Cdd:TIGR04523 149 KKEKELEKLNNKYNDLKKQKEELenelNLLEKEKLNIQKNIDKIKNKllkLELLLSNLKKKIQKNKSLES-------QIS 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 949 ELKQSREmALEDLKKLHDEKIESLRAEFQCLEQNhLKELEDtlhirhtqefekvmtdhnmSLEKLKKENQQRIDQmLESH 1028
Cdd:TIGR04523 222 ELKKQNN-QLKDNIEKKQQEINEKTTEISNTQTQ-LNQLKD-------------------EQNKIKKQLSEKQKE-LEQN 279
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1029 ASTIQEKEQQLQELKLKVSDLSdmrcklevelalKEAETDEIKILLEESRTQQKEmlKSLLEQETENLRTEISKLNQKIH 1108
Cdd:TIGR04523 280 NKKIKELEKQLNQLKSEISDLN------------NQKEQDWNKELKSELKNQEKK--LEEIQNQISQNNKIISQLNEQIS 345
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1109 D----------NNESYQVGLSE-LRALMTIEKD-QCISELISRHEEESNILKAELDNVTslhrqayEIEKKLKEQIVELQ 1176
Cdd:TIGR04523 346 QlkkeltnsesENSEKQRELEEkQNEIEKLKKEnQSYKQEIKNLESQINDLESKIQNQE-------KLNQQKDEQIKKLQ 418
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1177 TRLNSELSALEKQKDEKITQQEEkyealIQNLEKD---KERLVKNHEQDKEHL---IQELNFEKNKAVQTaLDEFKVERE 1250
Cdd:TIGR04523 419 QEKELLEKEIERLKETIIKNNSE-----IKDLTNQdsvKELIIKNLDNTRESLetqLKVLSRSINKIKQN-LEQKQKELK 492
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1251 LVEKELL---EKVKHLENQIAKTpafesarEDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIA----- 1322
Cdd:TIGR04523 493 SKEKELKklnEEKKELEEKVKDL-------TKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEkeide 565
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1323 -----EQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSED 1370
Cdd:TIGR04523 566 knkeiEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKE 618
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1032-1325 |
7.44e-13 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 73.82 E-value: 7.44e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1032 IQEKEQQLQELK------LKVSDLSDMRCKLEVELAL-------KEAETDEIKILLEESRTQQKEMLKSLLEQETENLRT 1098
Cdd:COG1196 195 LGELERQLEPLErqaekaERYRELKEELKELEAELLLlklreleAELEELEAELEELEAELEELEAELAELEAELEELRL 274
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1099 EISKLNQKIHDNNESYQVGLSELRALmtiEKDqciselISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTR 1178
Cdd:COG1196 275 ELEELELELEEAQAEEYELLAELARL---EQD------IARLEERRRELEERLEELEEELAELEEELEELEEELEELEEE 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1179 LNSELSALEkQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKN-KAVQTALDEFKVERELVEKELL 1257
Cdd:COG1196 346 LEEAEEELE-EAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQlEELEEAEEALLERLERLEEELE 424
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1258 EKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQ 1325
Cdd:COG1196 425 ELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAAR 492
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
847-1318 |
7.78e-13 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 73.60 E-value: 7.78e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 847 IEKVKCSLE--ITLKEKHQQELQ-------SLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKH 917
Cdd:pfam05483 298 LEDIKMSLQrsMSTQKALEEDLQiatkticQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNED 377
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 918 EKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQsremALEDLKKLHDEK--IESLRAEFQCLEQN------------H 983
Cdd:pfam05483 378 QLKIITMELQKKSSELEEMTKFKNNKEVELEELKK----ILAEDEKLLDEKkqFEKIAEELKGKEQElifllqarekeiH 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 984 LKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQ---RIDQMLESHASTIQEKEQQLQELKLKVSDLSDmrCKLEVEL 1060
Cdd:pfam05483 454 DLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIEltaHCDKLLLENKELTQEASDMTLELKKHQEDIIN--CKKQEER 531
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1061 ALKEAETdeikilLEESRTQQKEMLKSLLEqETENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQCiselisrh 1140
Cdd:pfam05483 532 MLKQIEN------LEEKEMNLRDELESVRE-EFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKC-------- 596
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1141 eeesNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEkITQQEEKYEALIQNLEKDkerlVKNHE 1220
Cdd:pfam05483 597 ----NNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELE-LASAKQKFEEIIDNYQKE----IEDKK 667
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1221 QDKEHLIQELnfekNKAVQTALDEFKVERELVEK---------ELLEKVKH-----LENQIAKTPAFESAREDSSSLVAE 1286
Cdd:pfam05483 668 ISEEKLLEEV----EKAKAIADEAVKLQKEIDKRcqhkiaemvALMEKHKHqydkiIEERDSELGLYKNKEQEQSSAKAA 743
|
490 500 510
....*....|....*....|....*....|....*....
gi 226693374 1287 LQEKLQEEKAKFLE-------QLEEQEKRKNEEMQNVRT 1318
Cdd:pfam05483 744 LEIELSNIKAELLSlkkqleiEKEEKEKLKMEAKENTAI 782
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1032-1318 |
1.43e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 73.17 E-value: 1.43e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1032 IQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAEtdeikilLEESRTQQKEMLKSLLEQETE--NLRTEISKLNQKI-- 1107
Cdd:TIGR02168 234 LEELREELEELQEELKEAEEELEELTAELQELEEK-------LEELRLEVSELEEEIEELQKElyALANEISRLEQQKqi 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1108 --------HDNNESYQVGLSELRAlmtieKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRL 1179
Cdd:TIGR02168 307 lrerlanlERQLEELEAQLEELES-----KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1180 NSELSALEKqkdekITQQEEKYEALIQNLEKDKERLVKNHEQdkehLIQELNFEKNKAVQTALDEFKVERELVEKELLEK 1259
Cdd:TIGR02168 382 ETLRSKVAQ-----LELQIASLNNEIERLEARLERLEDRRER----LQQEIEELLKKLEEAELKELQAELEELEEELEEL 452
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 226693374 1260 VKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKfLEQLEEQEKRKNEEMQNVRT 1318
Cdd:TIGR02168 453 QEELERLEEALEELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSEGVKA 510
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
880-1388 |
8.02e-12 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 70.48 E-value: 8.02e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 880 LVKDSEENVNKILKLKENLvslEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKI---MHTQHCEIKELKQSREM 956
Cdd:PRK03918 173 EIKRRIERLEKFIKRTENI---EELIKEKEKELEEVLREINEISSELPELREELEKLEKEvkeLEELKEEIEELEKELES 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 957 ALEDLKKLhDEKIESLRAEFQCLEQnHLKELEDTlhIRHTQEFEKVMTDHnMSLEKLK---KENQQRIDQMLESHASTIQ 1033
Cdd:PRK03918 250 LEGSKRKL-EEKIRELEERIEELKK-EIEELEEK--VKELKELKEKAEEY-IKLSEFYeeyLDELREIEKRLSRLEEEIN 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1034 EKEQQLQELKLKVSDLSDMRcKLEVELALKEAETDEIKILLEESRTQQKEM--LKS---------------LLEQETENL 1096
Cdd:PRK03918 325 GIEERIKELEEKEERLEELK-KKLKELEKRLEELEERHELYEEAKAKKEELerLKKrltgltpeklekeleELEKAKEEI 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1097 RTEISKLNQKIhdnNESYQVGLSELRALMTIEKDQ-----CISELISRHEEE-SNILKAELDNVTSLHRQAYEIEKKLKE 1170
Cdd:PRK03918 404 EEEISKITARI---GELKKEIKELKKAIEELKKAKgkcpvCGRELTEEHRKElLEEYTAELKRIEKELKEIEEKERKLRK 480
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1171 QIVELQTRLN--SELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQ----DKEHLIQELNFEKNKAVQTALDE 1244
Cdd:PRK03918 481 ELRELEKVLKkeSELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKliklKGEIKSLKKELEKLEELKKKLAE 560
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1245 FKVERELVEKELLEKVKHLENQiaktpAFESAREDSSSLvAELQ------------EKLQEEKAKFLEQLEEQEKRKNEE 1312
Cdd:PRK03918 561 LEKKLDELEEELAELLKELEEL-----GFESVEELEERL-KELEpfyneylelkdaEKELEREEKELKKLEEELDKAFEE 634
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1313 MQNVRTSL------IAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVS 1386
Cdd:PRK03918 635 LAETEKRLeelrkeLEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELE 714
|
..
gi 226693374 1387 KL 1388
Cdd:PRK03918 715 KL 716
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1010-1308 |
1.07e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 69.97 E-value: 1.07e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1010 LEKLKK---------ENQQRIDQM-LESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRT 1079
Cdd:COG1196 202 LEPLERqaekaeryrELKEELKELeAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELEL 281
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1080 QQKEMLKSL--LEQETENLRTEISKLNQKIHDNNESYQVGLSELRALmtIEKDQCISELISRHEEESNILKAELDNVTSL 1157
Cdd:COG1196 282 ELEEAQAEEyeLLAELARLEQDIARLEERRRELEERLEELEEELAEL--EEELEELEEELEELEEELEEAEEELEEAEAE 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1158 HRQAYEIEKKLKEQIVELQTRLNSELSAL--EKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELnfekn 1235
Cdd:COG1196 360 LAEAEEALLEAEAELAEAEEELEELAEELleALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELE----- 434
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1236 kavqTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKR 1308
Cdd:COG1196 435 ----EEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADY 503
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
855-1372 |
1.40e-11 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 69.71 E-value: 1.40e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 855 EITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIvcvqQEKDQKLL 934
Cdd:PRK03918 173 EIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEI----EELEKELE 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 935 EMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHIRhTQEFEKVMTDhnmsLEKLK 1014
Cdd:PRK03918 249 SLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDE-LREIEKRLSR----LEEEI 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1015 KENQQRIDQmLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEA-----------ETDEIKILLEESRTQQKE 1083
Cdd:PRK03918 324 NGIEERIKE-LEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEElerlkkrltglTPEKLEKELEELEKAKEE 402
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1084 mlkslLEQETENLRTEISKLNQKIHDNNEsyqvglselrALMTIEKDQ-----CISELISRHEEE-SNILKAELDNVTSL 1157
Cdd:PRK03918 403 -----IEEEISKITARIGELKKEIKELKK----------AIEELKKAKgkcpvCGRELTEEHRKElLEEYTAELKRIEKE 467
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1158 HRQAYEIEKKLKEQIVELQTRLN--SELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQ----DKEHLIQELN 1231
Cdd:PRK03918 468 LKEIEEKERKLRKELRELEKVLKkeSELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKliklKGEIKSLKKE 547
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1232 FEKNKAVQTALDEFKVERELVEKELLEKVKHLENQiaktpAFESAREDSSSLvAELQ------------EKLQEEKAKFL 1299
Cdd:PRK03918 548 LEKLEELKKKLAELEKKLDELEEELAELLKELEEL-----GFESVEELEERL-KELEpfyneylelkdaEKELEREEKEL 621
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1300 EQLEEQEKRKNEEMQNVRTSL------IAEQQTNFNTV----LTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSE 1369
Cdd:PRK03918 622 KKLEEELDKAFEELAETEKRLeelrkeLEELEKKYSEEeyeeLREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKE 701
|
...
gi 226693374 1370 DRA 1372
Cdd:PRK03918 702 ELE 704
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
859-1359 |
1.98e-11 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 69.31 E-value: 1.98e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 859 KEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLvSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEK 938
Cdd:TIGR00606 499 LKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQME-MLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQLED 577
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 939 IMHTQHCEIKELKQSREMALEDLKKLHDEKiESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKEN- 1017
Cdd:TIGR00606 578 WLHSKSKEINQTRDRLAKLNKELASLEQNK-NHINNELESKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRa 656
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1018 ---------QQRIDQMLESHAS----------TIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEesr 1078
Cdd:TIGR00606 657 mlagatavySQFITQLTDENQSccpvcqrvfqTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAP--- 733
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1079 TQQKEMlkSLLEQETENLRTEISKLNQKIHdnnesyqvglselRALMTIEKDQCISELISRHEEESNILKAELDNVTSLH 1158
Cdd:TIGR00606 734 GRQSII--DLKEKEIPELRNKLQKVNRDIQ-------------RLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQ 798
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1159 RQAYEIEKKLKEQIVELQTrlnSELSALEKQKDEKITQQEEKYEALIQNLEKDKeRLVKNHEQDKEHLIQELNFEKNKAV 1238
Cdd:TIGR00606 799 MELKDVERKIAQQAAKLQG---SDLDRTVQQVNQEKQEKQHELDTVVSKIELNR-KLIQDQQEQIQHLKSKTNELKSEKL 874
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1239 QtaLDEFKVERELVEKELLEKVKHLENQIAktpAFESAREDSSSLvAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRT 1318
Cdd:TIGR00606 875 Q--IGTNLQRRQQFEEQLVELSTEVQSLIR---EIKDAKEQDSPL-ETFLEKDQQEKEELISSKETSNKKAQDKVNDIKE 948
|
490 500 510 520
....*....|....*....|....*....|....*....|....*..
gi 226693374 1319 SL---IAEQQTNFNTV---LTREKMRKENIINDLSDKLKSTMQQQER 1359
Cdd:TIGR00606 949 KVkniHGYMKDIENKIqdgKDDYLKQKETELNTVNAQLEECEKHQEK 995
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1055-1369 |
2.01e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 69.32 E-value: 2.01e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1055 KLEVELALKEAETDEIKILLEESRTQQKEmlkslLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALmtiekdqciS 1134
Cdd:TIGR02168 681 ELEEKIEELEEKIAELEKALAELRKELEE-----LEEELEQLRKELEELSRQISALRKDLARLEAEVEQL---------E 746
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1135 ELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKdEKITQQEEKYEALIQNLEKDKER 1214
Cdd:TIGR02168 747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR-EALDELRAELTLLNEEAANLRER 825
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1215 LvKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEK-----ELLEKVKHLENQIAKT-PAFESAREDSSSLVAELQ 1288
Cdd:TIGR02168 826 L-ESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELeelieELESELEALLNERASLeEALALLRSELEELSEELR 904
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1289 EKlqEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLS 1368
Cdd:TIGR02168 905 EL--ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIK 982
|
.
gi 226693374 1369 E 1369
Cdd:TIGR02168 983 E 983
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
892-1311 |
2.57e-11 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 68.94 E-value: 2.57e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 892 LKLKENLVSLEEALQNKDNEFTSIKHEKDAIvcvQQEKDQKLLEMEKimhtQHCEIKELKQSREMALEDLKKLHdEKIES 971
Cdd:TIGR02169 670 RSEPAELQRLRERLEGLKRELSSLQSELRRI---ENRLDELSQELSD----ASRKIGEIEKEIEQLEQEEEKLK-ERLEE 741
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 972 LRAEFQCLEQnhlkELEDTlhIRHTQEFEKVMTDHNMSLEKLKKEnQQRIDQMLEShaSTIQEKEQQLQELKLKVSDLsd 1051
Cdd:TIGR02169 742 LEEDLSSLEQ----EIENV--KSELKELEARIEELEEDLHKLEEA-LNDLEARLSH--SRIPEIQAELSKLEEEVSRI-- 810
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1052 mrcklevelalkEAETDEIkilleESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMTI--EK 1129
Cdd:TIGR02169 811 ------------EARLREI-----EQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEEleEL 873
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1130 DQCISELISRHEEesniLKAELDNvtslhrqayeiekkLKEQIVELQTRLNsELSALEKQKDEKITQQEEKYEALIQNLE 1209
Cdd:TIGR02169 874 EAALRDLESRLGD----LKKERDE--------------LEAQLRELERKIE-ELEAQIEKKRKRLSELKAKLEALEEELS 934
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1210 KdkerLVKNHEQDKEHLIQELNFEKNKAvqtaldefKVERELVEKELLEKVKHLENQiaktpAFESAREDSSSLVaELQE 1289
Cdd:TIGR02169 935 E----IEDPKGEDEEIPEEELSLEDVQA--------ELQRVEEEIRALEPVNMLAIQ-----EYEEVLKRLDELK-EKRA 996
|
410 420
....*....|....*....|..
gi 226693374 1290 KLQEEKAKFLEQLEEQEKRKNE 1311
Cdd:TIGR02169 997 KLEEERKAILERIEEYEKKKRE 1018
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
842-1314 |
6.75e-11 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 67.40 E-value: 6.75e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 842 EIRNIIEKVKCSLEITLKEKHQQELQSLKIEYECKLdalvkdseenvNKILKLKENLVSLEEALQNKDNEFTSIKHEKda 921
Cdd:PRK03918 274 EIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDEL-----------REIEKRLSRLEEEINGIEERIKELEEKEERL-- 340
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 922 ivcvqQEKDQKLLEMEKimhtqhcEIKELKQSREmALEDLKKLHDEkIESLRAEFQCLEQNHLKELEDTLHIRHT--QEF 999
Cdd:PRK03918 341 -----EELKKKLKELEK-------RLEELEERHE-LYEEAKAKKEE-LERLKKRLTGLTPEKLEKELEELEKAKEeiEEE 406
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1000 EKVMTDHNMSLEKLKKENQQRIDQMLESHAS--------TIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIK 1071
Cdd:PRK03918 407 ISKITARIGELKKEIKELKKAIEELKKAKGKcpvcgrelTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELE 486
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1072 ILLEESRTQQKemLKSLLEQeTENLRTEISKLNqkihdnnesyqvglselralmtIEKDQCISELISRHEEESNILKAEL 1151
Cdd:PRK03918 487 KVLKKESELIK--LKELAEQ-LKELEEKLKKYN----------------------LEELEKKAEEYEKLKEKLIKLKGEI 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1152 DNVTSLHRQAYEIEKKLKEQIVELQTrLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLV--KNHEQDKEHLIQE 1229
Cdd:PRK03918 542 KSLKKELEKLEELKKKLAELEKKLDE-LEEELAELLKELEELGFESVEELEERLKELEPFYNEYLelKDAEKELEREEKE 620
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1230 LNFEKNKAVQTA--LDEFKVERELVEKEL--LEKVKHLENQIAKTPAFESAREDSSSLVAELQ--EKLQEEKAKFLEQLE 1303
Cdd:PRK03918 621 LKKLEEELDKAFeeLAETEKRLEELRKELeeLEKKYSEEEYEELREEYLELSRELAGLRAELEelEKRREEIKKTLEKLK 700
|
490
....*....|..
gi 226693374 1304 EQ-EKRKNEEMQ 1314
Cdd:PRK03918 701 EElEEREKAKKE 712
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
855-1372 |
1.06e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 66.88 E-value: 1.06e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 855 EITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLL 934
Cdd:COG1196 261 ELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELE 340
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 935 EMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQcLEQNHLKELEDTLHIRHTQEfekvmtdhnmSLEKLK 1014
Cdd:COG1196 341 ELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE-LAEELLEALRAAAELAAQLE----------ELEEAE 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1015 KENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKSLLEQETE 1094
Cdd:COG1196 410 EALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEA 489
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1095 NLRtEISKLNQKIHDNNESYQVGLSELRALMTiEKDQCISELISRHEEESNILKAELDNVtsLHRQAYEIEKKLKEQIVE 1174
Cdd:COG1196 490 AAR-LLLLLEAEADYEGFLEGVKAALLLAGLR-GLAGAVAVLIGVEAAYEAALEAALAAA--LQNIVVEDDEVAAAAIEY 565
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1175 LQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEK 1254
Cdd:COG1196 566 LKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGR 645
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1255 ELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTR 1334
Cdd:COG1196 646 LREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEA 725
|
490 500 510
....*....|....*....|....*....|....*...
gi 226693374 1335 EKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRA 1372
Cdd:COG1196 726 LEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLE 763
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
885-1215 |
1.80e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 66.23 E-value: 1.80e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 885 EENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKimhtqhcEIKELKQSREMALEDLKKL 964
Cdd:TIGR02168 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRK-------DLARLEAEVEQLEERIAQL 752
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 965 HDEkIESLRAEFQCLEQNhLKELEDTLHirhtqefekvmtdhnmSLEKLKKENQQRIDQMLESHA---STIQEKEQQLQE 1041
Cdd:TIGR02168 753 SKE-LTELEAEIEELEER-LEEAEEELA----------------EAEAEIEELEAQIEQLKEELKalrEALDELRAELTL 814
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1042 LKLKVSDLsdmRCKLEVELALKEAETDEIKILLEESRTQQKEMLKslLEQETENLRTEISKLNQKIHDNNESYQVgLSEL 1121
Cdd:TIGR02168 815 LNEEAANL---RERLESLERRIAATERRLEDLEEQIEELSEDIES--LAAEIEELEELIEELESELEALLNERAS-LEEA 888
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1122 RALMTIEKDQCISElISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALE---KQKDEKITQQE 1198
Cdd:TIGR02168 889 LALLRSELEELSEE-LRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLeeaEALENKIEDDE 967
|
330
....*....|....*..
gi 226693374 1199 EKYEALIQNLEKDKERL 1215
Cdd:TIGR02168 968 EEARRRLKRLENKIKEL 984
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1032-1325 |
2.45e-10 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 65.47 E-value: 2.45e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1032 IQEKEQQLQELKLKVSDLSDMRCKLEVELALKEaetdEIKILLEESR-TQQKEMLKSL--LEQETENLRTEISKLNQKIh 1108
Cdd:TIGR02169 179 LEEVEENIERLDLIIDEKRQQLERLRREREKAE----RYQALLKEKReYEGYELLKEKeaLERQKEAIERQLASLEEEL- 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1109 dnnESYQVGLSELRALMTiEKDQCISELISR----HEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELS 1184
Cdd:TIGR02169 254 ---EKLTEEISELEKRLE-EIEQLLEELNKKikdlGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEA 329
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1185 ALEKQK------DEKITQQEEKYEALIQNLEKDKERL----VKNHEQDKEHliQELnFEKNKAVQTALDEFKVERELVEK 1254
Cdd:TIGR02169 330 EIDKLLaeieelEREIEEERKRRDKLTEEYAELKEELedlrAELEEVDKEF--AET-RDELKDYREKLEKLKREINELKR 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1255 E----------LLEKVKHLENQIA----KTPAFESAREDSSSLVAELQEKLQEEKAKfLEQLEEQEKRKNEEMQNVRTSL 1320
Cdd:TIGR02169 407 EldrlqeelqrLSEELADLNAAIAgieaKINELEEEKEDKALEIKKQEWKLEQLAAD-LSKYEQELYDLKEEYDRVEKEL 485
|
....*
gi 226693374 1321 IAEQQ 1325
Cdd:TIGR02169 486 SKLQR 490
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
855-1369 |
2.79e-10 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 65.45 E-value: 2.79e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 855 EITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVsleealQNKDNEFTSIKHEKDAIVCVQQEKDQKLL 934
Cdd:TIGR00606 270 EIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTV------REKERELVDCQRELEKLNKERRLLNQEKT 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 935 EMEKimhtqhcEIKELKQSREMALEDLKKLHDEKIE-SLRAEFQCLEQNHLKELE-DTLH--IRHTQEFEKVMTDHNM-- 1008
Cdd:TIGR00606 344 ELLV-------EQGRLQLQADRHQEHIRARDSLIQSlATRLELDGFERGPFSERQiKNFHtlVIERQEDEAKTAAQLCad 416
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 --SLEKLKKENQQRIDQMLESHASTIQEK----EQQLQELKLKVSDLSD----MRCKLEVELALKEAETDeIKILLEESR 1078
Cdd:TIGR00606 417 lqSKERLKQEQADEIRDEKKGLGRTIELKkeilEKKQEELKFVIKELQQlegsSDRILELDQELRKAERE-LSKAEKNSL 495
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1079 TQQKEMLKSLLEQETENLRTEISKLNQKIHD-NNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSL 1157
Cdd:TIGR00606 496 TETLKKEVKSLQNEKADLDRKLRKLDQEMEQlNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQL 575
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1158 HRQAYEIEKKLKeQIVELQTRLNSELSALEKQKD------EKITQQEEKYEALI------QNLEKDKERLVKNHEQDKEH 1225
Cdd:TIGR00606 576 EDWLHSKSKEIN-QTRDRLAKLNKELASLEQNKNhinnelESKEEQLSSYEDKLfdvcgsQDEESDLERLKEEIEKSSKQ 654
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1226 L-------------IQELNFEKNKAVQTALDEFKVERELVE--KELLEKVKHLENQIAKTPAFESAREDSSSLV---AEL 1287
Cdd:TIGR00606 655 RamlagatavysqfITQLTDENQSCCPVCQRVFQTEAELQEfiSDLQSKLRLAPDKLKSTESELKKKEKRRDEMlglAPG 734
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1288 QEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSlIAEQQTNFNTVLTREKMrKENIINDLSDKLKSTMQQQERDKDLIESL 1367
Cdd:TIGR00606 735 RQSIIDLKEKEIPELRNKLQKVNRDIQRLKND-IEEQETLLGTIMPEEES-AKVCLTDVTIMERFQMELKDVERKIAQQA 812
|
..
gi 226693374 1368 SE 1369
Cdd:TIGR00606 813 AK 814
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1142-1373 |
3.59e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 64.96 E-value: 3.59e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1142 EESNILKAELdnvTSLHRQAYEIE-KKLKEQIVELQTRLNSELSALEkQKDEKITQQEEKYEALIQNLEKDKERLVkNHE 1220
Cdd:COG1196 220 EELKELEAEL---LLLKLRELEAElEELEAELEELEAELEELEAELA-ELEAELEELRLELEELELELEEAQAEEY-ELL 294
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1221 QDKEHLIQELNF--EKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKF 1298
Cdd:COG1196 295 AELARLEQDIARleERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAEL 374
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 226693374 1299 LEQLEEQEKRKNEEMQNVRTSL-IAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:COG1196 375 AEAEEELEELAEELLEALRAAAeLAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEE 450
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
929-1367 |
3.61e-10 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 64.66 E-value: 3.61e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 929 KDQKLLEMEKIMHTQHCEIKElKQSREMALEDLKKLHDEKIESLRAEFQCLEQNhLKELEDTLHIRHTQefekvMTDHNM 1008
Cdd:TIGR04523 31 QDTEEKQLEKKLKTIKNELKN-KEKELKNLDKNLNKDEEKINNSNNKIKILEQQ-IKDLNDKLKKNKDK-----INKLNS 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKENQQRIDQMLESHaSTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKSL 1088
Cdd:TIGR04523 104 DLSKINSEIKNDKEQKNKLE-VELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEK 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1089 LEQETE--NLRTEISKLNQKIHD------NNESYQVGLSELRALMTIEKDQcISELISRHEEESNILKAELDNVTSLHRQ 1160
Cdd:TIGR04523 183 LNIQKNidKIKNKLLKLELLLSNlkkkiqKNKSLESQISELKKQNNQLKDN-IEKKQQEINEKTTEISNTQTQLNQLKDE 261
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1161 AYEIEKKLKEQIVELQT-------------RLNSELSALEKQKD--------EKITQQEEKYEALIQNLEKDKERLVKNH 1219
Cdd:TIGR04523 262 QNKIKKQLSEKQKELEQnnkkikelekqlnQLKSEISDLNNQKEqdwnkelkSELKNQEKKLEEIQNQISQNNKIISQLN 341
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1220 EQ--DKEHLIQELNFEkNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDssslvaelQEKLQEEKAK 1297
Cdd:TIGR04523 342 EQisQLKKELTNSESE-NSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQN--------QEKLNQQKDE 412
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1298 FLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNtvLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESL 1367
Cdd:TIGR04523 413 QIKKLQQEKELLEKEIERLKETIIKNNSEIKD--LTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKI 480
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
829-1255 |
4.86e-10 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 64.70 E-value: 4.86e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 829 QCDLANYLKCTAVEIRNIIEKVkcsleitlkekhqqelqSLKIEYECKLDALVKDSEEnvnkilkLKENLVSLEEALQNK 908
Cdd:TIGR02169 141 QGDVTDFISMSPVERRKIIDEI-----------------AGVAEFDRKKEKALEELEE-------VEENIERLDLIIDEK 196
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 909 DNEFTSIKHEKDaivcvQQEKDQKLLEmEKimhtQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFQCLEQnHLKELE 988
Cdd:TIGR02169 197 RQQLERLRRERE-----KAERYQALLK-EK----REYEGYELLKEKEALERQKEAI-ERQLASLEEELEKLTE-EISELE 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 989 DTLHirhtqEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETD 1068
Cdd:TIGR02169 265 KRLE-----EIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIE 339
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1069 EIKillEESRTQQKEmlKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALmtiekDQCISELISRHEEesniLK 1148
Cdd:TIGR02169 340 ELE---REIEEERKR--RDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDY-----REKLEKLKREINE----LK 405
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1149 AELDNVTSLHRQAYEIEKKLKEQIVELQTRLNsELSALEKQKDEKITQQEEKYEALIQNLEKDKERLvknheQDKEHLIQ 1228
Cdd:TIGR02169 406 RELDRLQEELQRLSEELADLNAAIAGIEAKIN-ELEEEKEDKALEIKKQEWKLEQLAADLSKYEQEL-----YDLKEEYD 479
|
410 420
....*....|....*....|....*..
gi 226693374 1229 ELNfEKNKAVQTALDEFKVERELVEKE 1255
Cdd:TIGR02169 480 RVE-KELSKLQRELAEAEAQARASEER 505
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
842-1373 |
6.08e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 64.31 E-value: 6.08e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 842 EIRNIIEKVKCSLEITLKEKHQQELQSLKIEYEC-----KLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEftsik 916
Cdd:TIGR02168 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLeelesKLDELAEELAELEEKLEELKEELESLEAELEELEAE----- 366
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 917 hekdaivcvQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFQCLEQNhLKELEDTLHIRHT 996
Cdd:TIGR02168 367 ---------LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERL-EARLERLEDRRERLQQE-IEELLKKLEEAEL 435
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 997 QEFEKVMTDHNMSLEKLKKENQQRIDQmLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEE 1076
Cdd:TIGR02168 436 KELQAELEELEEELEELQEELERLEEA-LEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKN 514
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1077 SR--TQQKEMLKSLLEQEtENLRTEISK-----LNQKIHDNNESYQVGLSELR--------------------------A 1123
Cdd:TIGR02168 515 QSglSGILGVLSELISVD-EGYEAAIEAalggrLQAVVVENLNAAKKAIAFLKqnelgrvtflpldsikgteiqgndreI 593
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1124 LMTIEKDQCI-SELISRHEEESNILKAELDN---VTSLhRQAYEIEKKLKEQ--IVELQTRL------------NSELSA 1185
Cdd:TIGR02168 594 LKNIEGFLGVaKDLVKFDPKLRKALSYLLGGvlvVDDL-DNALELAKKLRPGyrIVTLDGDLvrpggvitggsaKTNSSI 672
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1186 LEKQKD-EKITQQEEKYEALIQNLEKDKERLVKNHEQDKE--HLIQELNFEKNK---AVQTALDEFKVERELVEKELLEK 1259
Cdd:TIGR02168 673 LERRREiEELEEKIEELEEKIAELEKALAELRKELEELEEelEQLRKELEELSRqisALRKDLARLEAEVEQLEERIAQL 752
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1260 VKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKF------LEQLEEQEKRKNEEMQNVRTSliAEQQTNFNTVLT 1333
Cdd:TIGR02168 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIeqlkeeLKALREALDELRAELTLLNEE--AANLRERLESLE 830
|
570 580 590 600
....*....|....*....|....*....|....*....|
gi 226693374 1334 REKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:TIGR02168 831 RRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
859-1390 |
7.90e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.92 E-value: 7.90e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 859 KEKHQQELQSLKIEYEckldALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEK 938
Cdd:TIGR02168 304 KQILRERLANLERQLE----ELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEE 379
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 939 IMHTQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFQCLEQNhLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQ 1018
Cdd:TIGR02168 380 QLETLRSKVAQLELQIASLNNEIERL-EARLERLEDRRERLQQE-IEELLKKLEEAELKELQAELEELEEELEELQEELE 457
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1019 QRIDQmLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESR--TQQKEMLKSLLEQEtENL 1096
Cdd:TIGR02168 458 RLEEA-LEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSglSGILGVLSELISVD-EGY 535
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1097 RTEISK-----LNQKIHDNNESYQVGLSELR--------------------------ALMTIEKDQCI-SELISRHEEES 1144
Cdd:TIGR02168 536 EAAIEAalggrLQAVVVENLNAAKKAIAFLKqnelgrvtflpldsikgteiqgndreILKNIEGFLGVaKDLVKFDPKLR 615
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1145 NILKAELDNV------------------------------------------TSLHRQAYEIE--------KKLKEQIVE 1174
Cdd:TIGR02168 616 KALSYLLGGVlvvddldnalelakklrpgyrivtldgdlvrpggvitggsakTNSSILERRREieeleekiEELEEKIAE 695
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1175 LQTRLN---SELSALEKQKDEKITQQEEKyEALIQNLEKDKERLVKNHEQdKEHLIQELNFEKNKAVQ--TALDEFKVER 1249
Cdd:TIGR02168 696 LEKALAelrKELEELEEELEQLRKELEEL-SRQISALRKDLARLEAEVEQ-LEERIAQLSKELTELEAeiEELEERLEEA 773
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1250 ELVEKELLEKVKHLENQIAKtpaFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEemqnvrtslIAEQQTNFn 1329
Cdd:TIGR02168 774 EEELAEAEAEIEELEAQIEQ---LKEELKALREALDELRAELTLLNEEAANLRERLESLERR---------IAATERRL- 840
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 226693374 1330 TVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVSKLRT 1390
Cdd:TIGR02168 841 EDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSE 901
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
877-1369 |
8.76e-10 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 63.54 E-value: 8.76e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 877 LDALVKDSEENvNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIvcvqqekdQKLLEMEKIMHTQhceIKELKQSREM 956
Cdd:PRK03918 137 IDAILESDESR-EKVVRQILGLDDYENAYKNLGEVIKEIKRRIERL--------EKFIKRTENIEEL---IKEKEKELEE 204
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 957 ALEDLKKLHDEkIESLRAEFQCLEQNhLKELEDTLHIRHTQEFEKvmtdhnMSLEKLKKENQQRIDQMLEShastIQEKE 1036
Cdd:PRK03918 205 VLREINEISSE-LPELREELEKLEKE-VKELEELKEEIEELEKEL------ESLEGSKRKLEEKIRELEER----IEELK 272
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1037 QQLQELKLKVSDLSDMRCKLEVELALKEaETDEIKilleeSRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNEsyqv 1116
Cdd:PRK03918 273 KEIEELEEKVKELKELKEKAEEYIKLSE-FYEEYL-----DELREIEKRLSRLEEEINGIEERIKELEEKEERLEE---- 342
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1117 glselralmtiekdqcISELISRHEEESNILKA---ELDNVTSLHRQAYEIEKKLKEQIVElqtRLNSELSALEKQKdEK 1193
Cdd:PRK03918 343 ----------------LKKKLKELEKRLEELEErheLYEEAKAKKEELERLKKRLTGLTPE---KLEKELEELEKAK-EE 402
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1194 ITQQEEKYEALIQNLEKDKERLVKN------------------HEQDKEHLIQELNFEKNKavqtaLDEFKVERELVEKE 1255
Cdd:PRK03918 403 IEEEISKITARIGELKKEIKELKKAieelkkakgkcpvcgrelTEEHRKELLEEYTAELKR-----IEKELKEIEEKERK 477
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1256 LLEKVKHLENQIAKTPAFESAREdssslVAELQEKLQEEKAKF-LEQLEeqekRKNEEMQNVRTSLIaeqqtnfntvltr 1334
Cdd:PRK03918 478 LRKELRELEKVLKKESELIKLKE-----LAEQLKELEEKLKKYnLEELE----KKAEEYEKLKEKLI------------- 535
|
490 500 510
....*....|....*....|....*....|....*
gi 226693374 1335 ekmrkeniinDLSDKLKSTMQQQERDKDLIESLSE 1369
Cdd:PRK03918 536 ----------KLKGEIKSLKKELEKLEELKKKLAE 560
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1114-1373 |
1.26e-09 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 63.42 E-value: 1.26e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1114 YQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALE--KQKD 1191
Cdd:COG1196 218 LKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYelLAEL 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1192 EKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQ-----ELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQ 1266
Cdd:COG1196 298 ARLEQDIARLEERRRELEERLEELEEELAELEEELEEleeelEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEA 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1267 IAKTPAFESAREDSSSLVAELQEKLQ------EEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQT--NFNTVLTREKMR 1338
Cdd:COG1196 378 EEELEELAEELLEALRAAAELAAQLEeleeaeEALLERLERLEEELEELEEALAELEEEEEEEEEAleEAAEEEAELEEE 457
|
250 260 270
....*....|....*....|....*....|....*
gi 226693374 1339 KENIINDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:COG1196 458 EEALLELLAELLEEAALLEAALAELLEELAEAAAR 492
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
946-1488 |
1.31e-09 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 63.03 E-value: 1.31e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 946 EIKELKQSREMALEDLKKLhDEKIESLRAEFQCLE------QNHLKELEDTLHIRHTQEFEKVMTDHNM-----SLEKLK 1014
Cdd:COG1196 233 KLRELEAELEELEAELEEL-EAELEELEAELAELEaeleelRLELEELELELEEAQAEEYELLAELARLeqdiaRLEERR 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1015 KENQQRIDQMLEShastIQEKEQQLQELKLKvsdlsdmrcklEVELALKEAETDEIKILLEESRTQQKEMLKSLLEQETE 1094
Cdd:COG1196 312 RELEERLEELEEE----LAELEEELEELEEE-----------LEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE 376
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1095 NLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQcisELISRHEEESNILKAELDNVtslhRQAYEIEKKLKEQIVE 1174
Cdd:COG1196 377 AEEELEELAEELLEALRAAAELAAQLEELEEAEEALL---ERLERLEEELEELEEALAEL----EEEEEEEEEALEEAAE 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1175 LQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLvknheqdkehliQELNFEKNKAVQTALDEFKVERELVEK 1254
Cdd:COG1196 450 EEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAA------------ARLLLLLEAEADYEGFLEGVKAALLLA 517
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1255 ELLEKVKHLENQIAKTPAFESAREDssSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTR 1334
Cdd:COG1196 518 GLRGLAGAVAVLIGVEAAYEAALEA--ALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARG 595
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1335 EKMRKENIINDLSDKLKStmqQQERDKDLIESLSEDRARLLEEKKQLEEEVSKLRTSSFLSSApVAAAPELYGACAPELP 1414
Cdd:COG1196 596 AIGAAVDLVASDLREADA---RYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEG-GSAGGSLTGGSRRELL 671
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 226693374 1415 GEPERSVMETADEGRLDSAMETSMMSVQENMLSEEKQRIMLLERTLQLKEEENKRLNQRLMSQSLSSVSSRHSE 1488
Cdd:COG1196 672 AALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEE 745
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
804-1390 |
2.44e-09 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 62.76 E-value: 2.44e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 804 AQDSHSSIQTIKDDLCH--FRTFVQKEQCDLANYLKCTAVEIRNIIEKVkcsleitlkEKHQQELQSLKIEYECKLDAL- 880
Cdd:TIGR01612 1079 AEINITNFNEIKEKLKHynFDDFGKEENIKYADEINKIKDDIKNLDQKI---------DHHIKALEEIKKKSENYIDEIk 1149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 881 --------VKD---SEENVNKILKLKENLVSleeALQNKDNEFTSIKHEKDAIVCVqqEKDQKLLEMEKIMHTQHCeiKE 949
Cdd:TIGR01612 1150 aqindledVADkaiSNDDPEEIEKKIENIVT---KIDKKKNIYDEIKKLLNEIAEI--EKDKTSLEEVKGINLSYG--KN 1222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 950 LKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKenQQRIDQMLESHA 1029
Cdd:TIGR01612 1223 LGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKD--HHIISKKHDENI 1300
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1030 STIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESrTQQKEMLKSlleQETENLRTEISKLNQKIHD 1109
Cdd:TIGR01612 1301 SDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLNEI-ANIYNILKL---NKIKKIIDEVKEYTKEIEE 1376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1110 NNESYQVGLSELRALMTIEKD-----QC------------ISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQI 1172
Cdd:TIGR01612 1377 NNKNIKDELDKSEKLIKKIKDdinleECkskiestlddkdIDECIKKIKELKNHILSEESNIDTYFKNADENNENVLLLF 1456
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1173 VELQTRLNSELSALEKQKDEKITQQEEKYEALIQNleKDKERLVKNhEQDKehliQELNFEKNKAVqtaldeFKVERELV 1252
Cdd:TIGR01612 1457 KNIEMADNKSQHILKIKKDNATNDHDFNINELKEH--IDKSKGCKD-EADK----NAKAIEKNKEL------FEQYKKDV 1523
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1253 eKELLEKVKHLE--NQIAKTpafesaREDSSSLVAELQEKlqeeKAKFLEQLEEQEKR----KNEEMQ------------ 1314
Cdd:TIGR01612 1524 -TELLNKYSALAikNKFAKT------KKDSEIIIKEIKDA----HKKFILEAEKSEQKikeiKKEKFRieddaakndksn 1592
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1315 ----NVRTSL---------IAEQQTNFNTVLTREKMRKENI----INDLSDKLKSTMQQQERDKDLIESLSEDRARLLEE 1377
Cdd:TIGR01612 1593 kaaiDIQLSLenfenkflkISDIKKKINDCLKETESIEKKIssfsIDSQDTELKENGDNLNSLQEFLESLKDQKKNIEDK 1672
|
650
....*....|...
gi 226693374 1378 KKQLEEEVSKLRT 1390
Cdd:TIGR01612 1673 KKELDELDSEIEK 1685
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1149-1472 |
2.60e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 62.38 E-value: 2.60e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1149 AELD-NVTSLHRQAYEIEK--KLKEQIVELQ--------TRLNSELSALEKQKDEKiTQQEEKYEALIQNLEKDKERLVK 1217
Cdd:TIGR02168 196 NELErQLKSLERQAEKAERykELKAELRELElallvlrlEELREELEELQEELKEA-EEELEELTAELQELEEKLEELRL 274
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1218 NHEQDKE--HLIQELNFEKNKAVQtaldefKVERELveKELLEKVKHLENQIAKTPA----FESAREDSSSLVAELQEKL 1291
Cdd:TIGR02168 275 EVSELEEeiEELQKELYALANEIS------RLEQQK--QILRERLANLERQLEELEAqleeLESKLDELAEELAELEEKL 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1292 QEEKAKFlEQLEEQEKRKNEEMQNvRTSLIAEQQTNFNTVltrekmRKEniINDLSDKLKSTMQQQERDKDLIESLSEDR 1371
Cdd:TIGR02168 347 EELKEEL-ESLEAELEELEAELEE-LESRLEELEEQLETL------RSK--VAQLELQIASLNNEIERLEARLERLEDRR 416
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1372 ARLLEEKKQLEEEVSKLRTssflssAPVAAAPELYGACAPELPGEPERsVMETADEGRLDSAMETSMMSVQENMLSEEKQ 1451
Cdd:TIGR02168 417 ERLQQEIEELLKKLEEAEL------KELQAELEELEEELEELQEELER-LEEALEELREELEEAEQALDAAERELAQLQA 489
|
330 340
....*....|....*....|.
gi 226693374 1452 RIMLLERTLQLKEEENKRLNQ 1472
Cdd:TIGR02168 490 RLDSLERLQENLEGFSEGVKA 510
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
859-1373 |
2.61e-09 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 62.11 E-value: 2.61e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 859 KEKHQQELQSLKiEYECKLDALVKDSEENVNKILKLKENLVSLEEA---LQNKDNEFTSIKHEKDAIVCVQQEKDQKLLE 935
Cdd:pfam01576 18 KERQQKAESELK-ELEKKHQQLCEEKNALQEQLQAETELCAEAEEMrarLAARKQELEEILHELESRLEEEEERSQQLQN 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 936 MEKIMHTQHCEIKELKQSREMALEDL---KKLHDEKIESLRAEFQCLE-QNHLKELEDTLHIRHTQEFEKVMTDHN---M 1008
Cdd:pfam01576 97 EKKKMQQHIQDLEEQLDEEEAARQKLqleKVTTEAKIKKLEEDILLLEdQNSKLSKERKLLEERISEFTSNLAEEEekaK 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKENQQRIDQM---LESHASTIQEKEQ-------QLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESR 1078
Cdd:pfam01576 177 SLSKLKNKHEAMISDLeerLKKEEKGRQELEKakrklegESTDLQEQIAELQAQIAELRAQLAKKEEELQAALARLEEET 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1079 TQQKEMLKSL---------LEQETENLRTEISKLNQKIHDNNEsyqvglsELRALMTIEKDQCIS-----ELISRHEEES 1144
Cdd:pfam01576 257 AQKNNALKKIreleaqiseLQEDLESERAARNKAEKQRRDLGE-------ELEALKTELEDTLDTtaaqqELRSKREQEV 329
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1145 NILKAELDNVTSLHR-QAYEIEKKLKEQIVELQTRLnsELSALEKQKDEKITQQEEKyealiQNLEKDKE-RLVKNHEQD 1222
Cdd:pfam01576 330 TELKKALEEETRSHEaQLQEMRQKHTQALEELTEQL--EQAKRNKANLEKAKQALES-----ENAELQAElRTLQQAKQD 402
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1223 KEHliqelnfeKNKAVQTALDEFKV---ERELVEKELLEKVKHLENQIAKTPAFESARED----SSSLVAELQEKLQEEK 1295
Cdd:pfam01576 403 SEH--------KRKKLEGQLQELQArlsESERQRAELAEKLSKLQSELESVSSLLNEAEGknikLSKDVSSLESQLQDTQ 474
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1296 akflEQLEEQEKRKNEEMQNVRTslIAEQQTNFNTVLTREKMRKENI---INDLSDKLKSTMQQQERDKDLIESLSEDRA 1372
Cdd:pfam01576 475 ----ELLQEETRQKLNLSTRLRQ--LEDERNSLQEQLEEEEEAKRNVerqLSTLQAQLSDMKKKLEEDAGTLEALEEGKK 548
|
.
gi 226693374 1373 R 1373
Cdd:pfam01576 549 R 549
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
892-1369 |
3.08e-09 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 61.66 E-value: 3.08e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 892 LKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKLHDekies 971
Cdd:pfam05483 215 FKLKEDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIE----- 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 972 lraefqclEQNHL-KELEDTlhirhtqefeKVMTDHNMSLEKLKKENQQ----RIDQMLESHASTIQEKEQQLQELKLKV 1046
Cdd:pfam05483 290 --------KKDHLtKELEDI----------KMSLQRSMSTQKALEEDLQiatkTICQLTEEKEAQMEELNKAKAAHSFVV 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1047 SDLSDMRCKLE----VELALKEAETDEIKILLEESRTQQKEM-----LKSLLEQETENLRTEISKlNQKIHDNNESYQVG 1117
Cdd:pfam05483 352 TEFEATTCSLEellrTEQQRLEKNEDQLKIITMELQKKSSELeemtkFKNNKEVELEELKKILAE-DEKLLDEKKQFEKI 430
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1118 LSELRAlmtieKDQCISELISRHEEESNILKAELDNVTSLH----RQAYEIEKKL-KEQIVELQTRLNSELSALEK---- 1188
Cdd:pfam05483 431 AEELKG-----KEQELIFLLQAREKEIHDLEIQLTAIKTSEehylKEVEDLKTELeKEKLKNIELTAHCDKLLLENkelt 505
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1189 QKDEKITQQEEKYEALIQNLEKDKERLVK---NHEQDKEHLIQELNF------EKNKAVQTALDEFKVERELVEKELLEK 1259
Cdd:pfam05483 506 QEASDMTLELKKHQEDIINCKKQEERMLKqieNLEEKEMNLRDELESvreefiQKGDEVKCKLDKSEENARSIEYEVLKK 585
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1260 VKHLENQIAKTPAFESAREDSSSLVAELQEK----------------LQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAE 1323
Cdd:pfam05483 586 EKQMKILENKCNNLKKQIENKNKNIEELHQEnkalkkkgsaenkqlnAYEIKVNKLELELASAKQKFEEIIDNYQKEIED 665
|
490 500 510 520
....*....|....*....|....*....|....*....|....*.
gi 226693374 1324 QQTNFNTVLtrEKMRKENIINDLSDKLkstmqQQERDKDLIESLSE 1369
Cdd:pfam05483 666 KKISEEKLL--EEVEKAKAIADEAVKL-----QKEIDKRCQHKIAE 704
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1147-1393 |
3.81e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 61.61 E-value: 3.81e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1147 LKAELDNVTSLHRQAYEIEKKLKEQIVELQ---TRLNSELSALEKQKDEkitQQEEKYE--ALIQNLEKDKERLV---KN 1218
Cdd:TIGR02168 237 LREELEELQEELKEAEEELEELTAELQELEeklEELRLEVSELEEEIEE---LQKELYAlaNEISRLEQQKQILRerlAN 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1219 HEQDKEHLIQELNFEKNKavqtaLDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKF 1298
Cdd:TIGR02168 314 LERQLEELEAQLEELESK-----LDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1299 LeQLEEQEKRKNEEMQNVRTSL--IAEQQTNFNTVLT-----REKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDR 1371
Cdd:TIGR02168 389 A-QLELQIASLNNEIERLEARLerLEDRRERLQQEIEellkkLEEAELKELQAELEELEEELEELQEELERLEEALEELR 467
|
250 260
....*....|....*....|..
gi 226693374 1372 ARLLEEKKQLEEEVSKLRTSSF 1393
Cdd:TIGR02168 468 EELEEAEQALDAAERELAQLQA 489
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1012-1291 |
5.86e-09 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 60.16 E-value: 5.86e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1012 KLKKENQQRIDQMLEShastIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAEtdeikilLEESRTQQKEmlkslLEQ 1091
Cdd:COG4942 20 DAAAEAEAELEQLQQE----IAELEKELAALKKEEKALLKQLAALERRIAALARR-------IRALEQELAA-----LEA 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1092 ETENLRTEISKLNQKIHDNNESYQvglSELRALMTiekdqciselISRHEEESNILKAElDNVTSLHRQAY--EIEKKLK 1169
Cdd:COG4942 84 ELAELEKEIAELRAELEAQKEELA---ELLRALYR----------LGRQPPLALLLSPE-DFLDAVRRLQYlkYLAPARR 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1170 EQIVELQTRLNsELSALEKQkdekITQQEEKYEALIQNLEKDKERLVKNhEQDKEHLIQELNfEKNKAVQTALDEFKVEr 1249
Cdd:COG4942 150 EQAEELRADLA-ELAALRAE----LEAERAELEALLAELEEERAALEAL-KAERQKLLARLE-KELAELAAELAELQQE- 221
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 226693374 1250 elvEKELLEKVKHLENQIAktpafESAREDSSSLVAELQEKL 1291
Cdd:COG4942 222 ---AEELEALIARLEAEAA-----AAAERTPAAGFAALKGKL 255
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
851-1370 |
1.30e-08 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 59.75 E-value: 1.30e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 851 KCSLEITLKEKHQQELQ--SLKIEYECKLDALVKDSEENVNK-------ILKLKENLVSLEEALQNKDNEFTSIKHEKDA 921
Cdd:pfam05557 8 KARLSQLQNEKKQMELEhkRARIELEKKASALKRQLDRESDRnqelqkrIRLLEKREAEAEEALREQAELNRLKKKYLEA 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 922 IVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMAledlkklhDEKIESLRAEFQCLEQNHlKELEDTLhirhtQEFEK 1001
Cdd:pfam05557 88 LNKKLNEKESQLADAREVISCLKNELSELRRQIQRA--------ELELQSTNSELEELQERL-DLLKAKA-----SEAEQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1002 VMTDHNMSLEKLKKENQ--QRIDQMLESHASTIQEKEQQLQELkLKVSDLSDMRCKLEVELAlKEAETDEIKILLEEsrt 1079
Cdd:pfam05557 154 LRQNLEKQQSSLAEAEQriKELEFEIQSQEQDSEIVKNSKSEL-ARIPELEKELERLREHNK-HLNENIENKLLLKE--- 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1080 qQKEMLKSLLEQEtENLRTEISKLNQKihdnNESYQVGLSELRAL-----MTIEKDQCISELISRHEEESNILKAELDNV 1154
Cdd:pfam05557 229 -EVEDLKRKLERE-EKYREEAATLELE----KEKLEQELQSWVKLaqdtgLNLRSPEDLSRRIEQLQQREIVLKEENSSL 302
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1155 TS----LHRQAYEIEKKLK---EQIVELQTRLNSELSALEKQKDEKITQQEEKyEALIQNLEK-DKERLVKNHEQDKEHL 1226
Cdd:pfam05557 303 TSsarqLEKARRELEQELAqylKKIEDLNKKLKRHKALVRRLQRRVLLLTKER-DGYRAILESyDKELTMSNYSPQLLER 381
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1227 IQELNFEKNKaVQTALDEFKVERELVEKELL---EKVKHLENQIAKTPAFESAREDSSSL--VAELQEKLQEEKAkfleQ 1301
Cdd:pfam05557 382 IEEAEDMTQK-MQAHNEEMEAQLSVAEEELGgykQQAQTLERELQALRQQESLADPSYSKeeVDSLRRKLETLEL----E 456
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 226693374 1302 LEEQEKRKNE-EMQNVRTSLiaeqQTNFNTVLTREKMRKENIINDLSDKLKSTMQQ----QERDKDLIESLSED 1370
Cdd:pfam05557 457 RQRLREQKNElEMELERRCL----QGDYDPKKTKVLHLSMNPAAEAYQQRKNQLEKlqaeIERLKRLLKKLEDD 526
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
842-1372 |
1.78e-08 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 59.60 E-value: 1.78e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 842 EIRNIIEKvKCSLEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDA 921
Cdd:pfam02463 398 ELKSEEEK-EAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKE 476
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 922 IVCVQQEkdQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHIRHTQEfek 1001
Cdd:pfam02463 477 TQLVKLQ--EQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIV--- 551
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1002 vmTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEI------KILLE 1075
Cdd:pfam02463 552 --EVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDdkrakvVEGIL 629
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1076 ESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVT 1155
Cdd:pfam02463 630 KDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREK 709
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1156 SLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQkdeKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKN 1235
Cdd:pfam02463 710 EELKKLKLEAEELLADRVQEAQDKINEELKLLKQ---KIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKL 786
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1236 KAVQTaLDEFKVERELVEKELLEKVKHLENQIaktpafesareDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQN 1315
Cdd:pfam02463 787 KVEEE-KEEKLKAQEEELRALEEELKEEAELL-----------EEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEE 854
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*..
gi 226693374 1316 VRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRA 1372
Cdd:pfam02463 855 ELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNL 911
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
926-1259 |
2.59e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 58.91 E-value: 2.59e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 926 QQEKDQKLLEMEKimhtqhcEIKELKqsREMALEDLKKLHdEKIESLRAEFQCLEQNHlKELEDTLHIRHTQEFEKVMTD 1005
Cdd:TIGR02168 208 QAEKAERYKELKA-------ELRELE--LALLVLRLEELR-EELEELQEELKEAEEEL-EELTAELQELEEKLEELRLEV 276
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1006 HNMSleklkkENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEML 1085
Cdd:TIGR02168 277 SELE------EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELK 350
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1086 KSL--LEQETENLRTEISKLNQKIHDNNESYQvGLSELRALMTIEKDQcISELISRHEEESNILKAELDNvtsLHRQAYE 1163
Cdd:TIGR02168 351 EELesLEAELEELEAELEELESRLEELEEQLE-TLRSKVAQLELQIAS-LNNEIERLEARLERLEDRRER---LQQEIEE 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1164 IEKKLKE-QIVELQTRLNselsalekQKDEKITQQEEKYEALIQNLEKDKERLVKNHE--QDKEHLIQELNFEKnKAVQT 1240
Cdd:TIGR02168 426 LLKKLEEaELKELQAELE--------ELEEELEELQEELERLEEALEELREELEEAEQalDAAERELAQLQARL-DSLER 496
|
330
....*....|....*....
gi 226693374 1241 ALDEFKVERELVEKELLEK 1259
Cdd:TIGR02168 497 LQENLEGFSEGVKALLKNQ 515
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
864-1272 |
5.13e-08 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 57.83 E-value: 5.13e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 864 QELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKdnEFTSIKHEKdaivcVQQEKDQKLLEMEKimhtq 943
Cdd:pfam17380 247 EDVTTMTPEYTVRYNGQTMTENEFLNQLLHIVQHQKAVSERQQQE--KFEKMEQER-----LRQEKEEKAREVER----- 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 944 HCEIKELKQSREMALEDLKKLHDE----------KIESLRAEFQCLEQNHLKELEDTLHIRHTQEfekvmtdhnmsLEKL 1013
Cdd:pfam17380 315 RRKLEEAEKARQAEMDRQAAIYAEqermamererELERIRQEERKRELERIRQEEIAMEISRMRE-----------LERL 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1014 KKENQQ---RIDQMLE-SHASTIQEKEQQLQELKLKVsdlsdmrcklevelalkeaETDEIKILLEESRTQQKEMLKSLL 1089
Cdd:pfam17380 384 QMERQQkneRVRQELEaARKVKILEEERQRKIQQQKV-------------------EMEQIRAEQEEARQREVRRLEEER 444
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1090 EQETENLRTEISKLNQKIhdnnesyqvglselralmtiekdqcisELISRHEEESNILKAELDNVTSLHRQAYEIEKKLK 1169
Cdd:pfam17380 445 AREMERVRLEEQERQQQV---------------------------ERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKIL 497
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1170 EQIVELQTRlnselSALEKQKDEKITQQEekYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVER 1249
Cdd:pfam17380 498 EKELEERKQ-----AMIEEERKRKLLEKE--MEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATEERSRLEA 570
|
410 420
....*....|....*....|...
gi 226693374 1250 ELVEKELLEKVKHLENQIAKTPA 1272
Cdd:pfam17380 571 MEREREMMRQIVESEKARAEYEA 593
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1009-1215 |
5.91e-08 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 56.70 E-value: 5.91e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKENQQRIDQM---LESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAEtdeikilLEESRTQQKEml 1085
Cdd:COG4942 24 EAEAELEQLQQEIAELekeLAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAE-------LAELEKEIAE-- 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1086 kslLEQETENLRTEISKLNQKI--HDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYE 1163
Cdd:COG4942 95 ---LRAELEAQKEELAELLRALyrLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEA 171
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 226693374 1164 IEKKLKEQIVELQTRLNsELSALEKQKDE---KITQQEEKYEALIQNLEKDKERL 1215
Cdd:COG4942 172 ERAELEALLAELEEERA-ALEALKAERQKllaRLEKELAELAAELAELQQEAEEL 225
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
901-1320 |
7.57e-08 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 57.08 E-value: 7.57e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 901 LEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKlhdeKIESLRAEFQCLE 980
Cdd:COG4717 47 LLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEA----ELEELREELEKLE 122
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 981 QnhLKELEDTLHIRhtQEFEKVMTDHNMSLEKLKKENQQRIDQM--LESHASTIQEKEQQLQELKLKVSdlsdmrckLEV 1058
Cdd:COG4717 123 K--LLQLLPLYQEL--EALEAELAELPERLEELEERLEELRELEeeLEELEAELAELQEELEELLEQLS--------LAT 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1059 ELALKEAETDEIKilLEESRTQQKEMLKSlLEQETENLRTEISKLNQKIHDNNESYQvgLSELRALMTI-----EKDQCI 1133
Cdd:COG4717 191 EEELQDLAEELEE--LQQRLAELEEELEE-AQEELEELEEELEQLENELEAAALEER--LKEARLLLLIaaallALLGLG 265
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1134 SELISRHEEESNILKAEL----DNVTSLHRQAYEIEKKLKE--QIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQN 1207
Cdd:COG4717 266 GSLLSLILTIAGVLFLVLgllaLLFLLLAREKASLGKEAEElqALPALEELEEEELEELLAALGLPPDLSPEELLELLDR 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1208 LEKDKERLVKNHEQDKEHLIQELNFEKN----KAVQTALDEF--KVERELVEKELLEKVKHLENQIAKTPAFESAREDSS 1281
Cdd:COG4717 346 IEELQELLREAEELEEELQLEELEQEIAallaEAGVEDEEELraALEQAEEYQELKEELEELEEQLEELLGELEELLEAL 425
|
410 420 430
....*....|....*....|....*....|....*....
gi 226693374 1282 SLvAELQEKLQEEKAKfLEQLEEQEKRKNEEMQNVRTSL 1320
Cdd:COG4717 426 DE-EELEEELEELEEE-LEELEEELEELREELAELEAEL 462
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
828-1366 |
7.67e-08 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 57.46 E-value: 7.67e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 828 EQCDLANYLKCTAVEIRNIIEKVKCSLEitlkEKHQQELQSLKIEYECKLDALVKDSEE--NVNKILKLKENLVSLEEAL 905
Cdd:PTZ00121 1388 EEKKKADEAKKKAEEDKKKADELKKAAA----AKKKADEAKKKAEEKKKADEAKKKAEEakKADEAKKKAEEAKKAEEAK 1463
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 906 QNKDNEFTSIKHEKDAivcvqqEKDQKLLEMEKIMHTQHCEIKELKQSRE--MALEDLKKLHDEKI--ESLRAEfqclEQ 981
Cdd:PTZ00121 1464 KKAEEAKKADEAKKKA------EEAKKADEAKKKAEEAKKKADEAKKAAEakKKADEAKKAEEAKKadEAKKAE----EA 1533
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 982 NHLKELEDTLHIRHTQEFEKvmTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLK-VSDLSDMRCKLEVEL 1060
Cdd:PTZ00121 1534 KKADEAKKAEEKKKADELKK--AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEeVMKLYEEEKKMKAEE 1611
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1061 ALKEaetDEIKILLEESRTQQKEmlKSLLEQETENLRTEISKLNQkIHDNNESYQVGLSELRAlmTIEKDQCISELISRH 1140
Cdd:PTZ00121 1612 AKKA---EEAKIKAEELKKAEEE--KKKVEQLKKKEAEEKKKAEE-LKKAEEENKIKAAEEAK--KAEEDKKKAEEAKKA 1683
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1141 EEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHE 1220
Cdd:PTZ00121 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK 1763
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1221 QDKEHLIQELNFEKNKAVQTALDEFKVEREL-VEKELLEKVKHLENQI----AKTPAFESAREDSSSLVAEL--QEKLQE 1293
Cdd:PTZ00121 1764 KEEEKKAEEIRKEKEAVIEEELDEEDEKRRMeVDKKIKDIFDNFANIIeggkEGNLVINDSKEMEDSAIKEVadSKNMQL 1843
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1294 EKAKFLEQLEEQEKRKNEEMQNvrtsliaeQQTNFNtvltREKMRKEniinDLSDKLKSTMQQQERDKDLIES 1366
Cdd:PTZ00121 1844 EEADAFEKHKFNKNNENGEDGN--------KEADFN----KEKDLKE----DDEEEIEEADEIEKIDKDDIER 1900
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
872-1389 |
1.07e-07 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 56.99 E-value: 1.07e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 872 EYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKL--LEMEKIMHTQ------ 943
Cdd:TIGR02168 236 ELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIsrLEQQKQILRErlanle 315
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 944 ------HCEIKELKQSREMALEDLKKLhDEKIESLRAEFQcLEQNHLKELEDTLhirhtQEFEKVMTDHNMSLEKLKKEN 1017
Cdd:TIGR02168 316 rqleelEAQLEELESKLDELAEELAEL-EEKLEELKEELE-SLEAELEELEAEL-----EELESRLEELEEQLETLRSKV 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1018 QQRIDQmLESHASTIQEKEQQLQELKLKVSDLSDMRckLEVELALKEAETDEIKILLEESRTQQKEMLKSL--LEQETEN 1095
Cdd:TIGR02168 389 AQLELQ-IASLNNEIERLEARLERLEDRRERLQQEI--EELLKKLEEAELKELQAELEELEEELEELQEELerLEEALEE 465
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1096 LRTEISKLNQKIHDNNESYQVGLSELRALMTIEK----------------------DQCISELISRHEEesniLKAELDN 1153
Cdd:TIGR02168 466 LREELEEAEQALDAAERELAQLQARLDSLERLQEnlegfsegvkallknqsglsgiLGVLSELISVDEG----YEAAIEA 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1154 VTSLHRQAYEIEKK---------LKEqiVELQTRLNSELSALEKQK----DEKITQQEEKYEALIQNLEKDKERLVKNHE 1220
Cdd:TIGR02168 542 ALGGRLQAVVVENLnaakkaiafLKQ--NELGRVTFLPLDSIKGTEiqgnDREILKNIEGFLGVAKDLVKFDPKLRKALS 619
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1221 ------------QDKEHLIQELNFEKN-----------------KAVQTALDEFKVERELveKELLEKVKHLENQIAKTp 1271
Cdd:TIGR02168 620 yllggvlvvddlDNALELAKKLRPGYRivtldgdlvrpggvitgGSAKTNSSILERRREI--EELEEKIEELEEKIAEL- 696
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1272 afESAREDssslVAELQEKLQEEKAKFLEQLEEQEKRKNE-EMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKL 1350
Cdd:TIGR02168 697 --EKALAE----LRKELEELEEELEQLRKELEELSRQISAlRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERL 770
|
570 580 590
....*....|....*....|....*....|....*....
gi 226693374 1351 KSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVSKLR 1389
Cdd:TIGR02168 771 EEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELR 809
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
997-1363 |
2.90e-07 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 55.51 E-value: 2.90e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 997 QEFEKVMTDHNMSLEKLkkenQQRIDQMLESHA-------STIQEKEQQLQELKLKVSDLSDMRcklevelaLKEAETDE 1069
Cdd:pfam15921 74 EHIERVLEEYSHQVKDL----QRRLNESNELHEkqkfylrQSVIDLQTKLQEMQMERDAMADIR--------RRESQSQE 141
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1070 IKILLEESRTQQKEMLKSLLEQETENLRTEISKLNQKIHdnneSYQVGLSELRALMTiekdqciselisRHEEESNILKA 1149
Cdd:pfam15921 142 DLRNQLQNTVHELEAAKCLKEDMLEDSNTQIEQLRKMML----SHEGVLQEIRSILV------------DFEEASGKKIY 205
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1150 ELDNVTSLHRQAY--EIEKKLKEQIVELqTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLI 1227
Cdd:pfam15921 206 EHDSMSTMHFRSLgsAISKILRELDTEI-SYLKGRIFPVEDQLEALKSESQNKIELLLQQHQDRIEQLISEHEVEITGLT 284
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1228 qelnfEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAktpafesareDSSSLVAELQEKLQEEKAKFLEQLEEQEK 1307
Cdd:pfam15921 285 -----EKASSARSQANSIQSQLEIIQEQARNQNSMYMRQLS----------DLESTVSQLRSELREAKRMYEDKIEELEK 349
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 226693374 1308 R---KNEEMQNVRTS--LIAEQQTNFNTVLtrekmrkENIINDLSDKLKSTMQQQERDKDL 1363
Cdd:pfam15921 350 QlvlANSELTEARTErdQFSQESGNLDDQL-------QKLLADLHKREKELSLEKEQNKRL 403
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
860-1371 |
3.69e-07 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 55.47 E-value: 3.69e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 860 EKHQQELQSLKIEYEcKLDALVKDSEENVNKILKLKENLVSLE-EALQNKDNEFTSIKHEKDAI-VCVQQEKDQKLLEME 937
Cdd:COG5022 878 ELAERQLQELKIDVK-SISSLKLVNLELESEIIELKKSLSSDLiENLEFKTELIARLKKLLNNIdLEEGPSIEYVKLPEL 956
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 938 KIMHTQHCEIKELKQSREMALEDLKKLHDEKI----ESLR-----AEFQCLE---QNHLKELEDTLHI--RHTQEFEKVM 1003
Cdd:COG5022 957 NKLHEVESKLKETSEEYEDLLKKSTILVREGNkansELKNfkkelAELSKQYgalQESTKQLKELPVEvaELQSASKIIS 1036
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1004 TDHNmSLEKLKKENQQRIDQMLEshastIQEKEQQLQELKLKVSDLSDMRCKLE----VELALKEAETDEIKILLEESRT 1079
Cdd:COG5022 1037 SEST-ELSILKPLQKLKGLLLLE-----NNQLQARYKALKLRRENSLLDDKQLYqlesTENLLKTINVKDLEVTNRNLVK 1110
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1080 QQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQC-------------ISELISRHE----- 1141
Cdd:COG5022 1111 PANVLQFIVAQMIKLNLLQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEAnlealpspppfaaLSEKRLYQSalyde 1190
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1142 ------EESNILKAELDNVTSLHRQAYEIEKKLKEQIVE--LQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKE 1213
Cdd:COG5022 1191 ksklssSEVNDLKNELIALFSKIFSGWPRGDKLKKLISEgwVPTEYSTSLKGFNNLNKKFDTPASMSNEKLLSLLNSIDN 1270
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1214 RLVKNHEQDKEH--------------LIQELNFEKNKAVQTALDEFKVEREL-----VEKELLEKVKHLENQIAKTPAFE 1274
Cdd:COG5022 1271 LLSSYKLEEEVLpatinsllqyinvgLFNALRTKASSLRWKSATEVNYNSEElddwcREFEISDVDEELEELIQAVKVLQ 1350
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1275 sAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKR-------KNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLS 1347
Cdd:COG5022 1351 -LLKDDLNKLDELLDACYSLNPAEIQNLKSRYDPadkennlPKEILKKIEALLIKQELQLSLEGKDETEVHLSEIFSEEK 1429
|
570 580
....*....|....*....|....*..
gi 226693374 1348 DKL---KSTMQQQERDKDLIESLSEDR 1371
Cdd:COG5022 1430 SLIsldRNSIYKEEVLSSLSALLTKEK 1456
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
840-1475 |
4.27e-07 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 55.15 E-value: 4.27e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 840 AVEIRNIIEKVKCSLEITLKEKHQQElQSLKIEYECKLDALVKDSEEnvnkiLKLKENLVSLEEAlqnKDNEFTSIKHEK 919
Cdd:PTZ00121 1196 AEDARKAEAARKAEEERKAEEARKAE-DAKKAEAVKKAEEAKKDAEE-----AKKAEEERNNEEI---RKFEEARMAHFA 1266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 920 DAIVCVQQEKDQKLLEMEKIMHTQHCEikELKQSREM-ALEDLKKLHDEKieslraefqcleqnhlkeledtlhiRHTQE 998
Cdd:PTZ00121 1267 RRQAAIKAEEARKADELKKAEEKKKAD--EAKKAEEKkKADEAKKKAEEA-------------------------KKADE 1319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 999 FEKVMTDHNMSLEKLKK--ENQQRIDQMLESHASTIQEKEQQLQElKLKVSDLSDMRCKLEVELALKEAE----TDEIKI 1072
Cdd:PTZ00121 1320 AKKKAEEAKKKADAAKKkaEEAKKAAEAAKAEAEAAADEAEAAEE-KAEAAEKKKEEAKKKADAAKKKAEekkkADEAKK 1398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1073 LLEESRTQQKEMLKSLLEQ----ETENLRTEISKLNQKIHDNNESYQVGLSELRAlmtiEKDQCISELISRHEEESNILK 1148
Cdd:PTZ00121 1399 KAEEDKKKADELKKAAAAKkkadEAKKKAEEKKKADEAKKKAEEAKKADEAKKKA----EEAKKAEEAKKKAEEAKKADE 1474
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1149 A-----ELDNVTSLHRQAYEIEKKLKE--QIVELQTRLNSELSALEKQKDEKITQQEEKYEA--LIQNLEKDKERLVKNH 1219
Cdd:PTZ00121 1475 AkkkaeEAKKADEAKKKAEEAKKKADEakKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKAdeAKKAEEKKKADELKKA 1554
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1220 EQDKEHLIQELNFEKNKAVQTALDEFKVEREL--VEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAK 1297
Cdd:PTZ00121 1555 EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAkkAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1298 F--LEQLEEQEKRKNEEMQNvrtsliAEQQtnfNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLL 1375
Cdd:PTZ00121 1635 VeqLKKKEAEEKKKAEELKK------AEEE---NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1376 EEKKQLEEEVSKlrtssflsSAPVAAAPELYGACAPELPGEPERSvMETADEGRLDSAMETSMMsvQENMLSEEKQRIML 1455
Cdd:PTZ00121 1706 ELKKKEAEEKKK--------AEELKKAEEENKIKAEEAKKEAEED-KKKAEEAKKDEEEKKKIA--HLKKEEEKKAEEIR 1774
|
650 660
....*....|....*....|
gi 226693374 1456 LERTLQLKEEENKRLNQRLM 1475
Cdd:PTZ00121 1775 KEKEAVIEEELDEEDEKRRM 1794
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
852-1216 |
4.49e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 54.68 E-value: 4.49e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 852 CSLEITlkEKHQQELQSlkiEYECKLDALVKDSEENVNKILKLKENLVSLEEALqnkdneftsikhekdaivcvqqEKDQ 931
Cdd:PRK03918 441 CGRELT--EEHRKELLE---EYTAELKRIEKELKEIEEKERKLRKELRELEKVL----------------------KKES 493
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 932 KLLEMEKIMHtqhcEIKELKQS-REMALEDLKKlHDEKIESLRAEFQCLEQNhLKELEDTLhiRHTQEFEKVMTDHNMSL 1010
Cdd:PRK03918 494 ELIKLKELAE----QLKELEEKlKKYNLEELEK-KAEEYEKLKEKLIKLKGE-IKSLKKEL--EKLEELKKKLAELEKKL 565
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1011 EKLKKENQQRIDQMLESHASTIQEKEQQLQELK--------LKVS-----DLSDMRCKLEVELALKEAETDEIKILLEES 1077
Cdd:PRK03918 566 DELEEELAELLKELEELGFESVEELEERLKELEpfyneyleLKDAekeleREEKELKKLEEELDKAFEELAETEKRLEEL 645
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1078 RTQQKEMLKSLLEQETENLRTEISKLNQKIhdnnESYQVGLSELRALM-----TIEKDQCISELISRHEEESNILKAELD 1152
Cdd:PRK03918 646 RKELEELEKKYSEEEYEELREEYLELSREL----AGLRAELEELEKRReeikkTLEKLKEELEEREKAKKELEKLEKALE 721
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 226693374 1153 NVTslhrqayEIEKKLKEQIVELQTRLNSELSALEKQKDEKITqqEEKYEALIQNLEKDKERLV 1216
Cdd:PRK03918 722 RVE-------ELREKVKKYKALLKERALSKVGEIASEIFEELT--EGKYSGVRVKAEENKVKLF 776
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
926-1308 |
5.11e-07 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 54.13 E-value: 5.11e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 926 QQEKDQKLLEMEKIMHTQHCEIKELKQSREmALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTlhiRHTQEFEKVMTD 1005
Cdd:pfam07888 40 LQERAELLQAQEAANRQREKEKERYKRDRE-QWERQRRELESRVAELKEELRQSREKHEELEEKY---KELSASSEELSE 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1006 HNMSLEKLKKENQQRIDQMLESHASTIQ---EKEQQLQELKLKVSDLSDMRCKLEVElalKEAETDEIKILLEESRTQQK 1082
Cdd:pfam07888 116 EKDALLAQRAAHEARIRELEEDIKTLTQrvlERETELERMKERAKKAGAQRKEEEAE---RKQLQAKLQQTEEELRSLSK 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1083 EM--LKSLLEQ---ETENLRTEISKLNQKI---HDNNESYQVGLSELRALmtiekdqciSELISRHEEESNILKAELDNV 1154
Cdd:pfam07888 193 EFqeLRNSLAQrdtQVLQLQDTITTLTQKLttaHRKEAENEALLEELRSL---------QERLNASERKVEGLGEELSSM 263
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1155 TSLHRQAYEIEKKLKEQIVELQTRLNSELSALekqKDEKITQQEEKyEALIQNLEKDKERlvknheqdkehlIQELNFEK 1234
Cdd:pfam07888 264 AAQRDRTQAELHQARLQAAQLTLQLADASLAL---REGRARWAQER-ETLQQSAEADKDR------------IEKLSAEL 327
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 226693374 1235 NKAvQTALDEFKVERELVEKEL-LEKVKhleNQIAKTPAFESAREDSSSLVAELQEK--LQEEKAKFLEQLEEQEKR 1308
Cdd:pfam07888 328 QRL-EERLQEERMEREKLEVELgREKDC---NRVQLSESRRELQELKASLRVAQKEKeqLQAEKQELLEYIRQLEQR 400
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
865-1302 |
5.99e-07 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 54.28 E-value: 5.99e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 865 ELQSLKIE-YECKLDALVKDSEEnvnkilkLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQ 943
Cdd:PRK02224 303 GLDDADAEaVEARREELEDRDEE-------LRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEA 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 944 HCEIKElkqsREMALEDLkklhDEKIESLRAEFQCLE------QNHLKELEDTLHIRHTQEFEKVMTDHNmsLEKLKKEN 1017
Cdd:PRK02224 376 REAVED----RREEIEEL----EEEIEELRERFGDAPvdlgnaEDFLEELREERDELREREAELEATLRT--ARERVEEA 445
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1018 QQRID--------QMLE--SHASTIQEKEQQLQELKLKVSDLSDMRCKLEVEL----ALKEAET-----DEIKILLEESR 1078
Cdd:PRK02224 446 EALLEagkcpecgQPVEgsPHVETIEEDRERVEELEAELEDLEEEVEEVEERLeraeDLVEAEDrierlEERREDLEELI 525
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1079 TQQKEMLKSLLEQeTENLRTEISKLNQKIHDNNESYQvglsELRalmtIEKDQCISElISRHEEESNILKAELDNVTSLH 1158
Cdd:PRK02224 526 AERRETIEEKRER-AEELRERAAELEAEAEEKREAAA----EAE----EEAEEAREE-VAELNSKLAELKERIESLERIR 595
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1159 RQAYEIEkKLKEQIVELQTRLnSELSALEKQ-------KDEKITQQEEKY-EALIQNLEKDKERLVKNHEQDKEHLiQEL 1230
Cdd:PRK02224 596 TLLAAIA-DAEDEIERLREKR-EALAELNDErrerlaeKRERKRELEAEFdEARIEEAREDKERAEEYLEQVEEKL-DEL 672
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1231 NfEKNKAVQTALDefKVERELVE-KELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQL 1302
Cdd:PRK02224 673 R-EERDDLQAEIG--AVENELEElEELRERREALENRVEALEALYDEAEELESMYGDLRAELRQRNVETLERM 742
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1030-1241 |
6.69e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 53.68 E-value: 6.69e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1030 STIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEmlkslLEQETENLRTEISKLNQKIHD 1109
Cdd:COG3883 16 PQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDK-----LQAEIAEAEAEIEERREELGE 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1110 NNESYQVGLSELRALMTIEKDQCISELISRheeesniLKAeLDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQ 1189
Cdd:COG3883 91 RARALYRSGGSVSYLDVLLGSESFSDFLDR-------LSA-LSKIADADADLLEELKADKAELEAKKAELEAKLAELEAL 162
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1190 KDEKITQQEE------KYEALIQNLEKDKERLvknhEQDKEHLIQELNFEKNKAVQTA 1241
Cdd:COG3883 163 KAELEAAKAEleaqqaEQEALLAQLSAEEAAA----EAQLAELEAELAAAEAAAAAAA 216
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
790-1475 |
8.28e-07 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 54.20 E-value: 8.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 790 RAALHVQLEKCRAaaQDSHSSIQTIKDDLCHFRTFVQKEQCDLANYL-KCTAVEIRNIIEKVKCSLEITLKEKHQQ---- 864
Cdd:TIGR00618 202 RSQLLTLCTPCMP--DTYHERKQVLEKELKHLREALQQTQQSHAYLTqKREAQEEQLKKQQLLKQLRARIEELRAQeavl 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 865 ELQSLKIEYECKLDALVKDSEenvnKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIV------CVQQEKDQKLLEMEK 938
Cdd:TIGR00618 280 EETQERINRARKAAPLAAHIK----AVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVkqqssiEEQRRLLQTLHSQEI 355
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 939 IMHTQHCEIKELKQSREMALEDLKKLH--DEKIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKlKKE 1016
Cdd:TIGR00618 356 HIRDAHEVATSIREISCQQHTLTQHIHtlQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKK-QQE 434
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1017 NQQRIDQMLESHAstiqekEQQLQELKLKVSDLSDMRCKLEvelALKEAETDEIKILLEESRTQQKEMLKSLLEQETEN- 1095
Cdd:TIGR00618 435 LQQRYAELCAAAI------TCTAQCEKLEKIHLQESAQSLK---EREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCp 505
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1096 LRTEISKLNQKIHDNNES------YQVGLSELRALMTIEKD---QCISEL--ISRHEEESNILKAELDNVTSLHRQAYEI 1164
Cdd:TIGR00618 506 LCGSCIHPNPARQDIDNPgpltrrMQRGEQTYAQLETSEEDvyhQLTSERkqRASLKEQMQEIQQSFSILTQCDNRSKED 585
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1165 EKKLKEQIVELQtRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKE--HLIQ---ELNFEKNKAVQ 1239
Cdd:TIGR00618 586 IPNLQNITVRLQ-DLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKltALHAlqlTLTQERVREHA 664
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1240 TALDEFKVERELVEKELLEKVKHLENQIakTPAFESaredssslVAELQEKLQEEkakflEQLEEQEKRKNEEMQNVRTS 1319
Cdd:TIGR00618 665 LSIRVLPKELLASRQLALQKMQSEKEQL--TYWKEM--------LAQCQTLLREL-----ETHIEEYDREFNEIENASSS 729
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1320 LIAEQQTNFNTVltrekmrkENIINDLSDKLKSTMQQQERD---KDLIESLSEDRARLLEEKKQLEEEVSKLRTSSFLSS 1396
Cdd:TIGR00618 730 LGSDLAAREDAL--------NQSLKELMHQARTVLKARTEAhfnNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLL 801
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1397 APVAAAPELYgacAPELPGEPERSVMETADEgrldsametsmmsvQENMLS--EEKQRIMLLERTLQLKEEENKRLNQRL 1474
Cdd:TIGR00618 802 KTLEAEIGQE---IPSDEDILNLQCETLVQE--------------EEQFLSrlEEKSATLGEITHQLLKYEECSKQLAQL 864
|
.
gi 226693374 1475 M 1475
Cdd:TIGR00618 865 T 865
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
880-1373 |
1.15e-06 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 53.69 E-value: 1.15e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 880 LVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHceiKELKQSREMALE 959
Cdd:pfam12128 232 AIAGIMKIRPEFTKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLD---DQWKEKRDELNG 308
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 960 DLKKLhDEKIESLRAEFQCLEQNHLKELEDTLHIRHtQEFEkvmtdhNMSLEKLKKENQQRIDQMLESHASTIQEKeqqL 1039
Cdd:pfam12128 309 ELSAA-DAAVAKDRSELEALEDQHGAFLDADIETAA-ADQE------QLPSWQSELENLEERLKALTGKHQDVTAK---Y 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1040 QELKLKVSDlsdmrcKLEVELALKEAETDEIKilleESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLS 1119
Cdd:pfam12128 378 NRRRSKIKE------QNNRDIAGIKDKLAKIR----EARDRQLAVAEDDLQALESELREQLEAGKLEFNEEEYRLKSRLG 447
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1120 ELRALM---TIEKDQCIS-----ELISRHEEESNILKAELDNVTSLHRQAyeieKKLKEQIVELQTRLNSELSALEKQKD 1191
Cdd:pfam12128 448 ELKLRLnqaTATPELLLQlenfdERIERAREEQEAANAEVERLQSELRQA----RKRRDQASEALRQASRRLEERQSALD 523
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1192 EKITQQEEKYEALIQNLEKD----KERLVK------------NHEQDKEHLIQELNF----------------------- 1232
Cdd:pfam12128 524 ELELQLFPQAGTLLHFLRKEapdwEQSIGKvispellhrtdlDPEVWDGSVGGELNLygvkldlkridvpewaaseeelr 603
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1233 EKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTP----AFESAREDSSSLVAELQEklqeEKAKFLEQLEEQEKR 1308
Cdd:pfam12128 604 ERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETfartALKNARLDLRRLFDEKQS----EKDKKNKALAERKDS 679
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374 1309 KNEEmqnvRTSLIAEQQTNFNTVLTREKMRKEniindlsDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:pfam12128 680 ANER----LNSLEAQLKQLDKKHQAWLEEQKE-------QKREARTEKQAYWQVVEGALDAQLAL 733
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
947-1373 |
1.23e-06 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 53.51 E-value: 1.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 947 IKELKQSREMALEDLKKLHDEKIE-SLRAEFQCLEQNhLKELEDTLHIRHTQ---------EFEKVMTDHNMSLEKLKkE 1016
Cdd:PRK02224 178 VERVLSDQRGSLDQLKAQIEEKEEkDLHERLNGLESE-LAELDEEIERYEEQreqaretrdEADEVLEEHEERREELE-T 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1017 NQQRIDQMLESHASTIQEKE---QQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKSLLEQ-- 1091
Cdd:PRK02224 256 LEAEIEDLRETIAETEREREelaEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECrv 335
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1092 -------ETENLRTEISKL---NQKIHDNNESYQVGLSELRALMTIEKDQcISEL---ISRHEEESNILKAELDNVTSLH 1158
Cdd:PRK02224 336 aaqahneEAESLREDADDLeerAEELREEAAELESELEEAREAVEDRREE-IEELeeeIEELRERFGDAPVDLGNAEDFL 414
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1159 RQAYEIEKKLKEQIVELQTRLNSELSALEKQKD-----------------------EKITQQEEKYEALIQNLEKDKERL 1215
Cdd:PRK02224 415 EELREERDELREREAELEATLRTARERVEEAEAlleagkcpecgqpvegsphvetiEEDRERVEELEAELEDLEEEVEEV 494
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1216 VKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEK 1295
Cdd:PRK02224 495 EERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEV 574
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1296 AKFLEQLEEQEKRKnEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLS-DKLKStmqQQERDKDL--------IES 1366
Cdd:PRK02224 575 AELNSKLAELKERI-ESLERIRTLLAAIADAEDEIERLREKREALAELNDERrERLAE---KRERKRELeaefdearIEE 650
|
....*..
gi 226693374 1367 LSEDRAR 1373
Cdd:PRK02224 651 AREDKER 657
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
967-1246 |
1.26e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 53.38 E-value: 1.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 967 EKIESLRAEFQCLEQnHLKELEDTLHIRHTQefEKVMTDHNMSLEKLKKENQQRIDqmLESHASTIQEKEQQLQELKLKV 1046
Cdd:COG4913 610 AKLAALEAELAELEE-ELAEAEERLEALEAE--LDALQERREALQRLAEYSWDEID--VASAEREIAELEAELERLDASS 684
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1047 SDLSdmrcKLEVELALKEAETDEIKILLEESRTQQKEmlkslLEQETENLRTEISKLNQKIHDNNESYQVGLSElralmt 1126
Cdd:COG4913 685 DDLA----ALEEQLEELEAELEELEEELDELKGEIGR-----LEKELEQAEEELDELQDRLEAAEDLARLELRA------ 749
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1127 iekdqcisELISRHEEesnilkAELDNVTSlhrqayEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQE-------- 1198
Cdd:COG4913 750 --------LLEERFAA------ALGDAVER------ELRENLEERIDALRARLNRAEEELERAMRAFNREWPaetadlda 809
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1199 -----EKYEALIQNLEKD---------KERLVKNHEQDKEHLIQELnfekNKAVQTALDEFK 1246
Cdd:COG4913 810 dleslPEYLALLDRLEEDglpeyeerfKELLNENSIEFVADLLSKL----RRAIREIKERID 867
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1074-1323 |
1.63e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 53.00 E-value: 1.63e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1074 LEESRTqqKEMLKSLLEQ--ETENLRTEISKLNQKIH------DNNESYQVGLSELRALmtiekDQCISEL-ISRHEEES 1144
Cdd:COG4913 218 LEEPDT--FEAADALVEHfdDLERAHEALEDAREQIEllepirELAERYAAARERLAEL-----EYLRAALrLWFAQRRL 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1145 NILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNselsALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQdke 1224
Cdd:COG4913 291 ELLEAELEELRAELARLEAELERLEARLDALREELD----ELEAQIRGNGGDRLEQLEREIERLERELEERERRRAR--- 363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1225 hliqelnfeknkaVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESA-REDSSSLVAELQEkLQEEKAKFLEQLE 1303
Cdd:COG4913 364 -------------LEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEAlEEALAEAEAALRD-LRRELRELEAEIA 429
|
250 260
....*....|....*....|...
gi 226693374 1304 EQEKRKN---EEMQNVRTSLIAE 1323
Cdd:COG4913 430 SLERRKSnipARLLALRDALAEA 452
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
893-1371 |
1.86e-06 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 52.90 E-value: 1.86e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 893 KLKENlVSLEEALQNKdNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDlkklHDEKIESL 972
Cdd:pfam10174 204 KEKEN-IHLREELHRR-NQLQPDPAKTKALQTVIEMKDTKISSLERNIRDLEDEVQMLKTNGLLHTED----REEEIKQM 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 973 R-----AEFQcleQNHLKELEDTLHIRHTQefekvMTDHNMSLEKLKKEN---QQRIDQMLESHAStiqeKEQQLQELKL 1044
Cdd:pfam10174 278 EvykshSKFM---KNKIDQLKQELSKKESE-----LLALQTKLETLTNQNsdcKQHIEVLKESLTA----KEQRAAILQT 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1045 KVSDLsdmRCKLEVELALKEAETDEIKILLEESRTQQKEM--LKSLL---EQETENLRTEISKLNQKIHDNNESyqvgLS 1119
Cdd:pfam10174 346 EVDAL---RLRLEEKESFLNKKTKQLQDLTEEKSTLAGEIrdLKDMLdvkERKINVLQKKIENLQEQLRDKDKQ----LA 418
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1120 ELR---------------ALMTIE-----KDQCISELISRHEEESnilKAELDNVTSLHRQAYEIEKKLKEQIVELQTRL 1179
Cdd:pfam10174 419 GLKervkslqtdssntdtALTTLEealseKERIIERLKEQRERED---RERLEELESLKKENKDLKEKVSALQPELTEKE 495
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1180 NSELSALEK---------QKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKE------------HLIQELNFEKNKAV 1238
Cdd:pfam10174 496 SSLIDLKEHasslassglKKDSKLKSLEIAVEQKKEECSKLENQLKKAHNAEEAvrtnpeindrirLLEQEVARYKEESG 575
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1239 QTALDefkVERELVEKELLEKVKHLENQiaKTPAFESAR----EDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQ 1314
Cdd:pfam10174 576 KAQAE---VERLLGILREVENEKNDKDK--KIAELESLTlrqmKEQNKKVANIKHGQQEMKKKGAQLLEEARRREDNLAD 650
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1315 NVRTSLIAEqqtnfnTVLTREKMRKEniINDLSDKLKSTMQQ-QERDKDLiESLSEDR 1371
Cdd:pfam10174 651 NSQQLQLEE------LMGALEKTRQE--LDATKARLSSTQQSlAEKDGHL-TNLRAER 699
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
866-1242 |
2.09e-06 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 52.81 E-value: 2.09e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 866 LQSLKIEYECKLD---ALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHT 942
Cdd:pfam15921 435 LKAMKSECQGQMErqmAAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEA 514
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 943 QHCEIKELKQSREMALEDLKKLHDEK--IESLRAEFQCLEQnHLKELEDTLHIRHtQEFEKVM--------TDHNMSLEK 1012
Cdd:pfam15921 515 TNAEITKLRSRVDLKLQELQHLKNEGdhLRNVQTECEALKL-QMAEKDKVIEILR-QQIENMTqlvgqhgrTAGAMQVEK 592
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1013 --LKKE-NQQRIDqmLESHASTIQEKEQQLQELKLKVSDLSDMRCKL-----EVELALKEAETDEIKILLE--------E 1076
Cdd:pfam15921 593 aqLEKEiNDRRLE--LQEFKILKDKKDAKIRELEARVSDLELEKVKLvnagsERLRAVKDIKQERDQLLNEvktsrnelN 670
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1077 SRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRAL-----------MTIEKD--------------- 1130
Cdd:pfam15921 671 SLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMegsdghamkvaMGMQKQitakrgqidalqski 750
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1131 QCISELIS-----RH--EEESNILKAELDNV-TSLHRQAYEI------EKKLKEQIVELQTRLnselsalekqkdEKITQ 1196
Cdd:pfam15921 751 QFLEEAMTnankeKHflKEEKNKLSQELSTVaTEKNKMAGELevlrsqERRLKEKVANMEVAL------------DKASL 818
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 226693374 1197 QEEKYEALIQNLEKDKERLVKNHEQDKEHLiQELNFEKNKAVQTAL 1242
Cdd:pfam15921 819 QFAECQDIIQRQEQESVRLKLQHTLDVKEL-QGPGYTSNSSMKPRL 863
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1065-1373 |
2.40e-06 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 52.67 E-value: 2.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1065 AETDEIKILLEESRtQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMtiEKDQCISELISRHEEES 1144
Cdd:pfam02463 163 AGSRLKRKKKEALK-KLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELE--EEYLLYLDYLKLNEERI 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1145 NILKAELdNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKE 1224
Cdd:pfam02463 240 DLLQELL-RDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKE 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1225 HLIQELNFEKN-KAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLE 1303
Cdd:pfam02463 319 SEKEKKKAEKElKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELE 398
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1304 EQEKRKNE--------EMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:pfam02463 399 LKSEEEKEaqlllelaRQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKE 476
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1010-1373 |
2.45e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 52.08 E-value: 2.45e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1010 LEKLKKENQQRIDQMLESHAST--IQEKEQQLQELKLKVSDLSDMRCKLEVELALKE--AETDEIKILLEESRTQQKEML 1085
Cdd:COG4717 73 LKELEEELKEAEEKEEEYAELQeeLEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLEELE 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1086 KSLleQETENLRTEISKLNQKIHDNNESyqvgLSELRALMTIEKDQCISELISRHEEesniLKAELdnvtslhRQAYEIE 1165
Cdd:COG4717 153 ERL--EELRELEEELEELEAELAELQEE----LEELLEQLSLATEEELQDLAEELEE----LQQRL-------AELEEEL 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1166 KKLKEQIVELQTRLNS-ELSALEKQKDEKITQQEEKYEAL--------------------------------IQNLEKDK 1212
Cdd:COG4717 216 EEAQEELEELEEELEQlENELEAAALEERLKEARLLLLIAaallallglggsllsliltiagvlflvlgllaLLFLLLAR 295
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1213 ERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDS--SSLVAELQEK 1290
Cdd:COG4717 296 EKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELqlEELEQEIAAL 375
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1291 LQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIiNDLSDKLKSTMQQQERDKDLIESLSED 1370
Cdd:COG4717 376 LAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDE-EELEEELEELEEELEELEEELEELREE 454
|
...
gi 226693374 1371 RAR 1373
Cdd:COG4717 455 LAE 457
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1085-1389 |
3.46e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 51.99 E-value: 3.46e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1085 LKSLLEQETENLRTEISK---LNQKIHDNNESYQVGLSELRALMTIEKDqcISELISRHEEESNILKAELDNVTSLHRQA 1161
Cdd:PRK03918 170 VIKEIKRRIERLEKFIKRtenIEELIKEKEKELEEVLREINEISSELPE--LREELEKLEKEVKELEELKEEIEELEKEL 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1162 YEIEK---KLKEQIVELQTRLN---SELSALEKQKDE--KITQQEEKYEALIQNLEKDKERLvKNHEQDKEHLIQELNfe 1233
Cdd:PRK03918 248 ESLEGskrKLEEKIRELEERIEelkKEIEELEEKVKElkELKEKAEEYIKLSEFYEEYLDEL-REIEKRLSRLEEEIN-- 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1234 knkAVQTALDEFKVERELVEkELLEKVKHLENQIAKTPAFESAREDSSSLVAELQ----EKLQEEKAKFLEQLEEQEKRK 1309
Cdd:PRK03918 325 ---GIEERIKELEEKEERLE-ELKKKLKELEKRLEELEERHELYEEAKAKKEELErlkkRLTGLTPEKLEKELEELEKAK 400
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1310 NEEMQNVRTsliaeqqtnfntvLTREKMRKENIINDLSD---KLKSTMQQ---------QERDKDLIESLSEDRARLLEE 1377
Cdd:PRK03918 401 EEIEEEISK-------------ITARIGELKKEIKELKKaieELKKAKGKcpvcgreltEEHRKELLEEYTAELKRIEKE 467
|
330
....*....|..
gi 226693374 1378 KKQLEEEVSKLR 1389
Cdd:PRK03918 468 LKEIEEKERKLR 479
|
|
| PLN03229 |
PLN03229 |
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional |
1026-1329 |
3.52e-06 |
|
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
Pssm-ID: 178768 [Multi-domain] Cd Length: 762 Bit Score: 51.78 E-value: 3.52e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1026 ESHASTIQEKEQQLQELKLKVSDLSDMRCKLeVELALKEAetdeIKILLEESRTQQKEMLKSL-LEQETENLRTEISKLN 1104
Cdd:PLN03229 425 EAVKTPVRELEGEVEKLKEQILKAKESSSKP-SELALNEM----IEKLKKEIDLEYTEAVIAMgLQERLENLREEFSKAN 499
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1105 -----------QKIHDNNESYQVGLSE----------LRALMTIEKDQCISELISRHE--------------------EE 1143
Cdd:PLN03229 500 sqdqlmhpvlmEKIEKLKDEFNKRLSRapnylslkykLDMLNEFSRAKALSEKKSKAEklkaeinkkfkevmdrpeikEK 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1144 SNILKAELDNvtSLHRQAYEIEKKLKEQIVELQTRLNSELSALEK---------QKDEKITQQEEKYEALIQNLEKDKER 1214
Cdd:PLN03229 580 MEALKAEVAS--SGASSGDELDDDLKEKVEKMKKEIELELAGVLKsmglevigvTKKNKDTAEQTPPPNLQEKIESLNEE 657
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1215 LVKNHEQ-----DKEHLIQELNFEKNKAVQTALDEFKverelvekellEKVKHLENQIAKTPAfesAREDSSSLvAELQE 1289
Cdd:PLN03229 658 INKKIERvirssDLKSKIELLKLEVAKASKTPDVTEK-----------EKIEALEQQIKQKIA---EALNSSEL-KEKFE 722
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 226693374 1290 KLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFN 1329
Cdd:PLN03229 723 ELEAELAAARETAAESNGSLKNDDDKEEDSKEDGSRVEVN 762
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
902-1472 |
3.92e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 51.71 E-value: 3.92e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 902 EEALQNKDNEFTSIKHekdaivcVQQEKDQKLLEMEKiMHTQHCEIKELKQSREMALEDLKKlhdeKIESLRAEFQCLEQ 981
Cdd:pfam01576 4 EEEMQAKEEELQKVKE-------RQQKAESELKELEK-KHQQLCEEKNALQEQLQAETELCA----EAEEMRARLAARKQ 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 982 nhlkELEDTLHIRHTQEFEKvmTDHNMSLEKLKKENQQRIdQMLESHastIQEKEQQLQELKLKvsdlsdmrcKLEVELA 1061
Cdd:pfam01576 72 ----ELEEILHELESRLEEE--EERSQQLQNEKKKMQQHI-QDLEEQ---LDEEEAARQKLQLE---------KVTTEAK 132
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1062 LKEAEtDEIKILLEESRTQQKEmlKSLLEQETENLRT------EISKLNQKIHDNNEsyqvglselrALMTIEKDQCISE 1135
Cdd:pfam01576 133 IKKLE-EDILLLEDQNSKLSKE--RKLLEERISEFTSnlaeeeEKAKSLSKLKNKHE----------AMISDLEERLKKE 199
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1136 LISRHEEESNILKAELDNVTslhrqayeiekkLKEQIVELQTRLnSELSAlekqkdeKITQQEEKYEALIQNLEKdkERL 1215
Cdd:pfam01576 200 EKGRQELEKAKRKLEGESTD------------LQEQIAELQAQI-AELRA-------QLAKKEEELQAALARLEE--ETA 257
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1216 VKNHEQDKEHLIQELNFEknkaVQTALDEFKVERELVEK------ELLEKVK-HLENQIAKTPAFESAREDSSSLVAELQ 1288
Cdd:pfam01576 258 QKNNALKKIRELEAQISE----LQEDLESERAARNKAEKqrrdlgEELEALKtELEDTLDTTAAQQELRSKREQEVTELK 333
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1289 EKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLiaEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQ----QQERDKDLI 1364
Cdd:pfam01576 334 KALEEETRSHEAQLQEMRQKHTQALEELTEQL--EQAKRNKANLEKAKQALESENAELQAELRTLQQakqdSEHKRKKLE 411
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1365 ESLSEDRARLLEEKKQLEEEVSKLrtssflssapVAAAPELYGACApeLPGEPERSVMETADEGrldSAMETSMMSVQEN 1444
Cdd:pfam01576 412 GQLQELQARLSESERQRAELAEKL----------SKLQSELESVSS--LLNEAEGKNIKLSKDV---SSLESQLQDTQEL 476
|
570 580
....*....|....*....|....*...
gi 226693374 1445 MLSEEKQRIMLLERTLQLKEEENKRLNQ 1472
Cdd:pfam01576 477 LQEETRQKLNLSTRLRQLEDERNSLQEQ 504
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
879-1107 |
5.16e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 50.53 E-value: 5.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 879 ALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMAL 958
Cdd:COG4942 17 AQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELR 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 959 EDLKKLHDEKIESLRAEFQCLEQNHLKELedtLHIRHTQEFEKVMtdhnMSLEKLKKENQQRIDQMleshASTIQEKEQQ 1038
Cdd:COG4942 97 AELEAQKEELAELLRALYRLGRQPPLALL---LSPEDFLDAVRRL----QYLKYLAPARREQAEEL----RADLAELAAL 165
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 226693374 1039 LQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKSL--LEQETENLRTEISKLNQKI 1107
Cdd:COG4942 166 RAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELaeLQQEAEELEALIARLEAEA 236
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1179-1489 |
5.37e-06 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 51.28 E-value: 5.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1179 LNSELSALEKQKDEKITQQEEKYEALIQnlekdkERLvknhEQDKEHLIQELNF-----EKNKAVQTALDEFKVERELVE 1253
Cdd:pfam17380 271 LNQLLHIVQHQKAVSERQQQEKFEKMEQ------ERL----RQEKEEKAREVERrrkleEAEKARQAEMDRQAAIYAEQE 340
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1254 KELLEKVKHLENQIAKTPAFESAREDSSSLVAELqeklqeEKAKFLEQLEEQEKRKNEEmqnVRTSLIAEQQTNFntvlt 1333
Cdd:pfam17380 341 RMAMERERELERIRQEERKRELERIRQEEIAMEI------SRMRELERLQMERQQKNER---VRQELEAARKVKI----- 406
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1334 REKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLL----EEKKQLEEEVSKLRTSSF----------LSSAPV 1399
Cdd:pfam17380 407 LEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMervrLEEQERQQQVERLRQQEEerkrkkleleKEKRDR 486
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1400 AAAPELYGACAPELPGEPERSVMEtadEGRLDSAMETSMMSVQENMLSEEKQRIMLLERTLQLKEEENKRLNQRLMSQSL 1479
Cdd:pfam17380 487 KRAEEQRRKILEKELEERKQAMIE---EERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATE 563
|
330
....*....|
gi 226693374 1480 SSVSSRHSEK 1489
Cdd:pfam17380 564 ERSRLEAMER 573
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1077-1373 |
1.10e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 50.12 E-value: 1.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1077 SRTQQKE----MLKSLLEQETENLRTEISKlNQKIHDNNESYQVGLSELRALMtiekdqciseliSRHEEESNILKAELD 1152
Cdd:pfam17380 285 SERQQQEkfekMEQERLRQEKEEKAREVER-RRKLEEAEKARQAEMDRQAAIY------------AEQERMAMERERELE 351
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1153 NVTSLHRQAyEIEKKLKEQI-VELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELN 1231
Cdd:pfam17380 352 RIRQEERKR-ELERIRQEEIaMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQE 430
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1232 FEKNKAVQTALDEFKVERELVEKELLEKVKHLEnQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNE 1311
Cdd:pfam17380 431 EARQREVRRLEEERAREMERVRLEEQERQQQVE-RLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELEERKQAMI 509
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1312 EMQNVRTSLIAEQQTNFNTVLTREKMRKEniinDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:pfam17380 510 EEERKRKLLEKEMEERQKAIYEEERRREA----EEERRKQQEMEERRRIQEQMRKATEERSR 567
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
854-1316 |
1.50e-05 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 49.94 E-value: 1.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 854 LEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIvcvqQEKDQKL 933
Cdd:COG1196 358 AELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEEL----EEALAEL 433
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 934 LEMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMS-LEK 1012
Cdd:COG1196 434 EEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGvKAA 513
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1013 LKKENQQRIDQMLESHASTIQEKEQQLQE------LKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLK 1086
Cdd:COG1196 514 LLLAGLRGLAGAVAVLIGVEAAYEAALEAalaaalQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALA 593
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1087 SLLEQETENLRTEisklnQKIHDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEK 1166
Cdd:COG1196 594 RGAIGAAVDLVAS-----DLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRR 668
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1167 KLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDK-------EHLIQELNFEKNKAVQ 1239
Cdd:COG1196 669 ELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEAleeqleaEREELLEELLEEEELL 748
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1240 TALDEFKVERELVEKELLEKVKHLENQIAKTPA--------FESAREDSSSLVAELQEkLQEEKakflEQLEEQEKRKNE 1311
Cdd:COG1196 749 EEEALEELPEPPDLEELERELERLEREIEALGPvnllaieeYEELEERYDFLSEQRED-LEEAR----ETLEEAIEEIDR 823
|
....*
gi 226693374 1312 EMQNV 1316
Cdd:COG1196 824 ETRER 828
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1076-1326 |
1.51e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 49.06 E-value: 1.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1076 ESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALmtiekdqciselisrhEEESNILKAELDNVT 1155
Cdd:COG3883 15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEAL----------------QAEIDKLQAEIAEAE 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1156 SLHRQAYEiekKLKEQIVELQT--RLNSELSALEKQKD--------EKITQQEEKYEALIQNLEKDKERLvknheqdkeh 1225
Cdd:COG3883 79 AEIEERRE---ELGERARALYRsgGSVSYLDVLLGSESfsdfldrlSALSKIADADADLLEELKADKAEL---------- 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1226 liqelnfeknKAVQTALDEFKVERELVEKELLEKVKHLENQIAKtpafesaREDSSSLVAELQEKLQEEKAKFLEQLEEQ 1305
Cdd:COG3883 146 ----------EAKKAELEAKLAELEALKAELEAAKAELEAQQAE-------QEALLAQLSAEEAAAEAQLAELEAELAAA 208
|
250 260
....*....|....*....|.
gi 226693374 1306 EKRKNEEMQNVRTSLIAEQQT 1326
Cdd:COG3883 209 EAAAAAAAAAAAAAAAAAAAA 229
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
983-1344 |
1.60e-05 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 50.05 E-value: 1.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 983 HLKELEDTLHIRHTQEFEKVMTDHNMSLE---KLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVE 1059
Cdd:TIGR01612 501 RMKDFKDIIDFMELYKPDEVPSKNIIGFDidqNIKAKLYKEIEAGLKESYELAKNWKKLIHEIKKELEEENEDSIHLEKE 580
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1060 ---LALKEAETDEIKILLEESRTQQKEMLKSLLEQeTENLRTEISKlnQKIHDNNESYQVGLSELRALMTIE----KDQC 1132
Cdd:TIGR01612 581 ikdLFDKYLEIDDEIIYINKLKLELKEKIKNISDK-NEYIKKAIDL--KKIIENNNAYIDELAKISPYQVPEhlknKDKI 657
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1133 ISELISrheEESNILKAELDnvtSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDK 1212
Cdd:TIGR01612 658 YSTIKS---ELSKIYEDDID---ALYNELSSIVKENAIDNTEDKAKLDDLKSKIDKEYDKIQNMETATVELHLSNIENKK 731
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1213 ERLVKNHEQDKEHLIQELNFEKNKAVQtaldEFKVErelvEKELLEKVKHLENQIAKTPAFESAredssslVAELQEKLQ 1292
Cdd:TIGR01612 732 NELLDIIVEIKKHIHGEINKDLNKILE----DFKNK----EKELSNKINDYAKEKDELNKYKSK-------ISEIKNHYN 796
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1293 EEKAkfLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIIN 1344
Cdd:TIGR01612 797 DQIN--IDNIKDEDAKQNYDKSKEYIKTISIKEDEIFKIINEMKFMKDDFLN 846
|
|
| CCDC73 |
pfam15818 |
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ... |
958-1343 |
2.24e-05 |
|
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.
Pssm-ID: 464893 [Multi-domain] Cd Length: 1048 Bit Score: 49.17 E-value: 2.24e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 958 LEDLKKLHDEKIESLR----AEFQ--------CLEQNHLKELEDTLHirhtQEFEKVMTDHNMSLEKLKKENQQRIDQML 1025
Cdd:pfam15818 2 LLDFKTSLLEALEELRmrreAETQyeeqigkiIVETQELKWQKETLQ----NQKETLAKQHKEAMAVFKKQLQMKMCALE 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1026 ESH-----ASTIQEKE-QQLQE-LK-LKVSD------LSDMRCKLEVELALKEaetDEIKILLEESR-----TQQKEMLK 1086
Cdd:pfam15818 78 EEKgkyqlATEIKEKEiEGLKEtLKaLQVSKyslqkkVSEMEQKLQLHLLAKE---DHHKQLNEIEKyyatiTGQFGLVK 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1087 SLLEQETENLRTEIsKLNQKIHDNNESYQVGLSELRAlmtiEKDQCISELISrheeesnilkaelDNVTSLHRQAYE-IE 1165
Cdd:pfam15818 155 ENHGKLEQNVQEAI-QLNKRLSALNKKQESEICSLKK----ELKKVTSDLIK-------------SKVTCQYKMGEEnIN 216
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1166 KKLKEQ-IVELQTRLNSELsALEKQKDEKITQ-QEEKYEALI--QNLEKdkerLVKNHEQDKEHLIQELNFEKNKAvQTA 1241
Cdd:pfam15818 217 LTIKEQkFQELQERLNMEL-ELNKKINEEITHiQEEKQDIIIsfQHMQQ----LLQQQTQANTEMEAELKALKENN-QTL 290
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1242 LDEFKVERELVeKELLEKVKHLENQ------IAKTPAFESARE-----DSSSLVAELQEKLQEEKAKFLEQLEEQEKRKN 1310
Cdd:pfam15818 291 ERDNELQREKV-KENEEKFLNLQNEhekalgTWKKHVEELNGEineikNELSSLKETHIKLQEHYNKLCNQKKFEEDKKF 369
|
410 420 430
....*....|....*....|....*....|...
gi 226693374 1311 EEMQNVrtsliaeqqTNFNTVLTREKMrkENII 1343
Cdd:pfam15818 370 QNVPEV---------NNENSEMSTEKS--ENLI 391
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
926-1370 |
2.44e-05 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 49.20 E-value: 2.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 926 QQEKDQKLLEMEKIMHTQHCEIKELKQSREMALE--DLKKLHDEKIESLRAEFQCLEQnhlkeledtlhirHTQEFEKVM 1003
Cdd:TIGR00618 151 QGEFAQFLKAKSKEKKELLMNLFPLDQYTQLALMefAKKKSLHGKAELLTLRSQLLTL-------------CTPCMPDTY 217
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1004 TDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKvSDLSDMRCKLEvELALKEAETDEikilLEESRTQQKE 1083
Cdd:TIGR00618 218 HERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQ-QLLKQLRARIE-ELRAQEAVLEE----TQERINRARK 291
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1084 MLKSLLEQEtenlrtEISKLNQKIHDNNESYQVGLSEL-RALMTIEKDQCISELISRHEEESNILKAELD------NVTS 1156
Cdd:TIGR00618 292 AAPLAAHIK------AVTQIEQQAQRIHTELQSKMRSRaKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIhirdahEVAT 365
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1157 LHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNK 1236
Cdd:TIGR00618 366 SIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAA 445
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1237 AVQTALDEFKVeRELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAkfleQLEEQEKRKNEEMQNv 1316
Cdd:TIGR00618 446 AITCTAQCEKL-EKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPC----PLCGSCIHPNPARQD- 519
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 226693374 1317 rtSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSED 1370
Cdd:TIGR00618 520 --IDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQS 571
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
858-1210 |
2.48e-05 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 49.25 E-value: 2.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 858 LKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEME 937
Cdd:TIGR04523 423 LLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLN 502
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 938 KIMHTQHCEIKELKQSREMALEDLKKLHDEKIEslraefqclEQNHLKELEDtlhirhtqEFEKVMTDHNMS-LEKLKKE 1016
Cdd:TIGR04523 503 EEKKELEEKVKDLTKKISSLKEKIEKLESEKKE---------KESKISDLED--------ELNKDDFELKKEnLEKEIDE 565
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1017 NQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTqqkemlkslLEQETENL 1096
Cdd:TIGR04523 566 KNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEK---------LSSIIKNI 636
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1097 RTEISKLNQKIhdnnESYQVGLSELRALMTiEKDQCISELISRHEEESNILKaELDNVTSLHRQAY---EIEKKLKEQIV 1173
Cdd:TIGR04523 637 KSKKNKLKQEV----KQIKETIKEIRNKWP-EIIKKIKESKTKIDDIIELMK-DWLKELSLHYKKYitrMIRIKDLPKLE 710
|
330 340 350
....*....|....*....|....*....|....*..
gi 226693374 1174 ELQTRLNSELSALEKQKDEkitqqeekYEALIQNLEK 1210
Cdd:TIGR04523 711 EKYKEIEKELKKLDEFSKE--------LENIIKNFNK 739
|
|
| DUF4686 |
pfam15742 |
Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins ... |
1025-1307 |
2.58e-05 |
|
Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins in this family are typically between 498 and 775 amino acids in length. There is a conserved DLK sequence motif.
Pssm-ID: 464838 [Multi-domain] Cd Length: 384 Bit Score: 48.52 E-value: 2.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1025 LESHASTIQE----KEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKilLEESRTQQKEMLKSLLEQETENLRTEI 1100
Cdd:pfam15742 46 LKQHNSLLQEenikIKAELKQAQQKLLDSTKMCSSLTAEWKHCQQKIRELE--LEVLKQAQSIKSQNSLQEKLAQEKSRV 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1101 SKLNQKIHDN----NESYQVGLSELRALMT-------IEKDQCISELISRHEEESNILKAELDNVTSLHRQA---YEIEK 1166
Cdd:pfam15742 124 ADAEEKILELqqklEHAHKVCLTDTCILEKkqleeriKEASENEAKLKQQYQEEQQKRKLLDQNVNELQQQVrslQDKEA 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1167 KLKEQIVELQTRLNSELSALEKQKDEKITQQEE-----KYEALIQNLEKDKERLVKNHEQDKEHLIQEL-------NFEK 1234
Cdd:pfam15742 204 QLEMTNSQQQLRIQQQEAQLKQLENEKRKSDEHlksnqELSEKLSSLQQEKEALQEELQQVLKQLDVHVrkynekhHHHK 283
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1235 NKaVQTALDEFKVERELVEkellEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEK 1307
Cdd:pfam15742 284 AK-LRRAKDRLVHEVEQRD----ERIKQLENEIGILQQQSEKEKAFQKQVTAQNEILLLEKRKLLEQLTEQEE 351
|
|
| CCDC73 |
pfam15818 |
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ... |
885-1311 |
3.75e-05 |
|
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.
Pssm-ID: 464893 [Multi-domain] Cd Length: 1048 Bit Score: 48.79 E-value: 3.75e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 885 EENVNKILKLKENLVSLEEALQNKDNefTSIKHEKDAIVCVQQEKDQKL--LEMEKIMHTQHCEIKElkQSREMALEDLK 962
Cdd:pfam15818 27 EEQIGKIIVETQELKWQKETLQNQKE--TLAKQHKEAMAVFKKQLQMKMcaLEEEKGKYQLATEIKE--KEIEGLKETLK 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 963 KLHDEKIeSLraefqcleQNHLKELEDTLHIRHTQEfekvmTDHNMSLEKLKKENQQRIDQ---MLESHASTiqekEQQL 1039
Cdd:pfam15818 103 ALQVSKY-SL--------QKKVSEMEQKLQLHLLAK-----EDHHKQLNEIEKYYATITGQfglVKENHGKL----EQNV 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1040 QELKLKVSDLSDMRCKLEVELA-----LKEAETDEIKilleESRTQQKEMlksllEQETENLRTEisklNQKIHDNNEsy 1114
Cdd:pfam15818 165 QEAIQLNKRLSALNKKQESEICslkkeLKKVTSDLIK----SKVTCQYKM-----GEENINLTIK----EQKFQELQE-- 229
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1115 qvglselRALMTIEKDQCISELISRHEEEsnilkaELDNVTSL-HRQayeiekKLKEQIVELQTRLNSELSALEKQ---- 1189
Cdd:pfam15818 230 -------RLNMELELNKKINEEITHIQEE------KQDIIISFqHMQ------QLLQQQTQANTEMEAELKALKENnqtl 290
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1190 ------KDEKITQQEEKYealiQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVER---ELVEKELLEKV 1260
Cdd:pfam15818 291 erdnelQREKVKENEEKF----LNLQNEHEKALGTWKKHVEELNGEINEIKNELSSLKETHIKLQEhynKLCNQKKFEED 366
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*.
gi 226693374 1261 KHLEN-QIAKTPAFESAREDSSSLVAE----LQEKLQEEKAKFLEQLEEQEKRKNE 1311
Cdd:pfam15818 367 KKFQNvPEVNNENSEMSTEKSENLIIQkynsEQEIREENTKSFCSDTEYRETEKKK 422
|
|
| PLN02939 |
PLN02939 |
transferase, transferring glycosyl groups |
854-1221 |
4.01e-05 |
|
transferase, transferring glycosyl groups
Pssm-ID: 215507 [Multi-domain] Cd Length: 977 Bit Score: 48.36 E-value: 4.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 854 LEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEftsikhekdaivcvQQEKDQKL 933
Cdd:PLN02939 65 LQSNTDENGQLENTSLRTVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDG--------------EQLSDFQL 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 934 LEMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKiESLRAEFQCLEQNhlkeLEDTLHIRHTQEFEKVMTDHNMS-LEK 1012
Cdd:PLN02939 131 EDLVGMIQNAEKNILLLNQARLQALEDLEKILTEK-EALQGKINILEMR----LSETDARIKLAAQEKIHVEILEEqLEK 205
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1013 LKKENQQRIdqmlESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELaLKEAETDEIKILLEESRTQQKEMLKSlLEQE 1092
Cdd:PLN02939 206 LRNELLIRG----ATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAEL-IEVAETEERVFKLEKERSLLDASLRE-LESK 279
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1093 TENLRTEISKlnqkihdnnesyqvgLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQI 1172
Cdd:PLN02939 280 FIVAQEDVSK---------------LSPLQYDCWWEKVENLQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKEAN 344
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1173 VelqTRLNSELSALEKQK-----------DEKITQQEEKYEALIQNLEKDKERLVKNHEQ 1221
Cdd:PLN02939 345 V---SKFSSYKVELLQQKlklleerlqasDHEIHSYIQLYQESIKEFQDTLSKLKEESKK 401
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
859-1469 |
4.13e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 48.60 E-value: 4.13e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 859 KEKHQQELQSLKIEYECKLDALVKDSEEnVNKILKLK--ENLVSLEEALQNKDNEFTSIKHEKDAivCVQQEKDQKLLEM 936
Cdd:PTZ00121 1099 KAEEAKKTETGKAEEARKAEEAKKKAED-ARKAEEARkaEDARKAEEARKAEDAKRVEIARKAED--ARKAEEARKAEDA 1175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 937 EKIMHTQHCEikELKQSREmaledLKKLHD-EKIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKK 1015
Cdd:PTZ00121 1176 KKAEAARKAE--EVRKAEE-----LRKAEDaRKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEE 1248
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1016 ENQQRIDQMLESHASTIQEKEQQLQ-ELKLKVSDLSDMRCKLEVELALKEAE---TDEIKILLEESRTQQKemlkslLEQ 1091
Cdd:PTZ00121 1249 RNNEEIRKFEEARMAHFARRQAAIKaEEARKADELKKAEEKKKADEAKKAEEkkkADEAKKKAEEAKKADE------AKK 1322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1092 ETENLRTEISKLNQKIHDNNESYQVGLSELRALM-TIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKE 1170
Cdd:PTZ00121 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAdEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE 1402
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1171 QivelqtrlNSELSALEKQKDEKITQQEEKYEAliQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAvqtalDEFKVERE 1250
Cdd:PTZ00121 1403 D--------KKKADELKKAAAAKKKADEAKKKA--EEKKKADEAKKKAEEAKKADEAKKKAEEAKKA-----EEAKKKAE 1467
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1251 lvEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKlQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNT 1330
Cdd:PTZ00121 1468 --EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA-AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEE 1544
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1331 VLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARllEEKKQLEEEVSKLRTSSFLSSAPVAAAPELYGACA 1410
Cdd:PTZ00121 1545 KKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK--KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA 1622
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*....
gi 226693374 1411 PELPGEPErsvmETADEGRLDSAMETSMMSVQENMLSEEKQRIMLLErtLQLKEEENKR 1469
Cdd:PTZ00121 1623 EELKKAEE----EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAE--EAKKAEEDKK 1675
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
1122-1261 |
4.45e-05 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 48.28 E-value: 4.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1122 RALMTIEKDQcISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKY 1201
Cdd:PRK00409 508 KKLIGEDKEK-LNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEA 586
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 226693374 1202 EALIQNLekdKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEK-ELLEKVK 1261
Cdd:PRK00409 587 DEIIKEL---RQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEElKVGDEVK 644
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
831-1110 |
5.11e-05 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 48.09 E-value: 5.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 831 DLANYLKCTAVEIRNIIEKVK-CSLEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKD 909
Cdd:TIGR04523 465 SLETQLKVLSRSINKIKQNLEqKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLE 544
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 910 NEFTSIKHE--KDAIVCVQQEKDQKLLEMekimhtqHCEIKELKQSREMALEDLKKLHDEKIEsLRAEfqcLEQNHLKEL 987
Cdd:TIGR04523 545 DELNKDDFElkKENLEKEIDEKNKEIEEL-------KQTQKSLKKKQEEKQELIDQKEKEKKD-LIKE---IEEKEKKIS 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 988 EDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEK---EQQLQELKLKVSDLSDMRCKLEVELAL-- 1062
Cdd:TIGR04523 614 SLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKWpeiIKKIKESKTKIDDIIELMKDWLKELSLhy 693
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1063 KEAETDEIKI----LLEESRTQQKEMLKSLLEQETEnLRTEISKLNQKIHDN 1110
Cdd:TIGR04523 694 KKYITRMIRIkdlpKLEEKYKEIEKELKKLDEFSKE-LENIIKNFNKKFDDA 744
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
957-1373 |
5.82e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 48.11 E-value: 5.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 957 ALEDLKKLHDEKIESLRAEFQCLEQnhlKELEDTLHirhtqEFEKVMTDHNMSLEKLKkENQQRIDQMLESHASTIQEKE 1036
Cdd:PRK02224 177 GVERVLSDQRGSLDQLKAQIEEKEE---KDLHERLN-----GLESELAELDEEIERYE-EQREQARETRDEADEVLEEHE 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1037 QQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKilleesrtQQKEMLKSLLEQETENL-RTEISKLNQKIhdnnesyq 1115
Cdd:PRK02224 248 ERREELETLEAEIEDLRETIAETEREREELAEEVR--------DLRERLEELEEERDDLLaEAGLDDADAEA-------- 311
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1116 vgLSELRALMTIEKDQCISEL------ISRHEEESNILKAELDNV----TSLHRQAYEIEKKLKEQIVELQTRlNSELSA 1185
Cdd:PRK02224 312 --VEARREELEDRDEELRDRLeecrvaAQAHNEEAESLREDADDLeeraEELREEAAELESELEEAREAVEDR-REEIEE 388
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1186 LEKQKDE------KITQQEEKYEALIQNLEKDKERLvKNHEQDKEHLIQELNFEKNKAvQTALDEFK---VERELVEKEL 1256
Cdd:PRK02224 389 LEEEIEElrerfgDAPVDLGNAEDFLEELREERDEL-REREAELEATLRTARERVEEA-EALLEAGKcpeCGQPVEGSPH 466
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1257 LEKVKHLENQIAKtpaFESAREDSSSLVAELQEKLQEekakfLEQLEEQEKRKNEEMQNVRTS--LIAEQQTNFN-TVLT 1333
Cdd:PRK02224 467 VETIEEDRERVEE---LEAELEDLEEEVEEVEERLER-----AEDLVEAEDRIERLEERREDLeeLIAERRETIEeKRER 538
|
410 420 430 440
....*....|....*....|....*....|....*....|
gi 226693374 1334 REKMRKEniindlSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:PRK02224 539 AEELRER------AAELEAEAEEKREAAAEAEEEAEEARE 572
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
854-1373 |
6.07e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 47.73 E-value: 6.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 854 LEITLKEKHQQELQSLKIEYECKL---DALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKD 930
Cdd:PRK02224 192 LKAQIEEKEEKDLHERLNGLESELaelDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETE 271
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 931 QKLLEMEKIMHTQHCEIKELKQSREMALED--LKKLHDEKIESLRAEFQcleqNHLKELEDTLhirhtQEFEKVMTDHNM 1008
Cdd:PRK02224 272 REREELAEEVRDLRERLEELEEERDDLLAEagLDDADAEAVEARREELE----DRDEELRDRL-----EECRVAAQAHNE 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKenqqRIDQmLESHASTIQEK----EQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESrtqqkEM 1084
Cdd:PRK02224 343 EAESLRE----DADD-LEERAEELREEaaelESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNA-----ED 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1085 LKSLLEQETENLRTEISKLNQKIhdnnESYQVGLSELRALMTIEK---------DQCISELISRHEEESNILKAELDnvt 1155
Cdd:PRK02224 413 FLEELREERDELREREAELEATL----RTARERVEEAEALLEAGKcpecgqpveGSPHVETIEEDRERVEELEAELE--- 485
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1156 SLHRQAYEIEKKLK--EQIVELQTRLNSelsALEKQKD--EKITQQEEKyealiqnLEKDKERLVKNHEQDKEHliQELN 1231
Cdd:PRK02224 486 DLEEEVEEVEERLEraEDLVEAEDRIER---LEERREDleELIAERRET-------IEEKRERAEELRERAAEL--EAEA 553
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1232 FEKNKAVQTALDEFKVERELVeKELLEKVKHLENQIaktpafESAR--EDSSSLVAELQEKLQE--EKAKFLEQLEEQEK 1307
Cdd:PRK02224 554 EEKREAAAEAEEEAEEAREEV-AELNSKLAELKERI------ESLEriRTLLAAIADAEDEIERlrEKREALAELNDERR 626
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 226693374 1308 RKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLkstmqqqerdkdliESLSEDRAR 1373
Cdd:PRK02224 627 ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKL--------------DELREERDD 678
|
|
| PTZ00440 |
PTZ00440 |
reticulocyte binding protein 2-like protein; Provisional |
946-1370 |
7.03e-05 |
|
reticulocyte binding protein 2-like protein; Provisional
Pssm-ID: 240419 [Multi-domain] Cd Length: 2722 Bit Score: 47.90 E-value: 7.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 946 EIKELKQSREMALEDLKKLHDEKIESLRAEFqcleqNHLKELEDTLH-IRHTQEFEKVMTDHNMSLEKLKKENQQRIDQM 1024
Cdd:PTZ00440 429 KIADYALYSNLEIIEIKKKYDEKINELKKSI-----NQLKTLISIMKsFYDLIISEKDSMDSKEKKESSDSNYQEKVDEL 503
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1025 L------ESHASTIQEKEQQLQELKLKVSDLsdmrcKLEVELALKEAETDEIKILLEESRTQQKEMLKSLLEQETENLRT 1098
Cdd:PTZ00440 504 LqiinsiKEKNNIVNNNFKNIEDYYITIEGL-----KNEIEGLIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYIEE 578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1099 EISKLNQKIHDNNESYQV--GLSELRALMTIEKDQCISELISRHEEESNI--------LKAELDNVTSL--HRQAYEIEK 1166
Cdd:PTZ00440 579 NVDHIKDIISLNDEIDNIiqQIEELINEALFNKEKFINEKNDLQEKVKYIlnkfykgdLQELLDELSHFldDHKYLYHEA 658
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1167 KLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVK-NHEQDKEHLIQELN--FEKNKAVQTALD 1243
Cdd:PTZ00440 659 KSKEDLQTLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKkQLNNIEQDISNSLNqyTIKYNDLKSSIE 738
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1244 EFKverelVEKELLEKVKHLENQIAKTpafesaredsssLVAELQEklQEEKakfleqLEEQEKRKNEEMQNVRTSLIAE 1323
Cdd:PTZ00440 739 EYK-----EEEEKLEVYKHQIINRKNE------------FILHLYE--NDKD------LPDGKNTYEEFLQYKDTILNKE 793
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1324 QQTNFNTVLTRE--KMRKENI--INDLSDKLKS-TMQQQERDKDLIESLSED 1370
Cdd:PTZ00440 794 NKISNDINILKEnkKNNQDLLnsYNILIQKLEAhTEKNDEELKQLLQKFPTE 845
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1166-1311 |
7.15e-05 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 46.07 E-value: 7.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1166 KKLKEQIVELQTRLNSELSALEKQKDEkITQQEEKYEALIQNLEKDKERL--VKNHEqdkehliqELnfeknKAVQTALD 1243
Cdd:COG1579 34 AELEDELAALEARLEAAKTELEDLEKE-IKRLELEIEEVEARIKKYEEQLgnVRNNK--------EY-----EALQKEIE 99
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1244 EFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNE 1311
Cdd:COG1579 100 SLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREE 167
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1164-1361 |
8.06e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.45 E-value: 8.06e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1164 IEKKLKEQIVEL------QTRLN-SELSALEKQKDEkITQQEEKYEALIQNLEKDKERLvKNHEQDKEHLIQELNF-EKN 1235
Cdd:COG4717 47 LLERLEKEADELfkpqgrKPELNlKELKELEEELKE-AEEKEEEYAELQEELEELEEEL-EELEAELEELREELEKlEKL 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1236 KAVQTALDEFK-VERELveKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQ 1314
Cdd:COG4717 125 LQLLPLYQELEaLEAEL--AELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELE 202
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 226693374 1315 NVRTSlIAEQQTNFNTvLTREKMRKENIINDLSDKLKSTMQQQERDK 1361
Cdd:COG4717 203 ELQQR-LAELEEELEE-AQEELEELEEELEQLENELEAAALEERLKE 247
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
927-1416 |
9.41e-05 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 47.35 E-value: 9.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 927 QEKDQKLLEMEKIMHTQH----CEIKELKQSREMALEDLKKL---HDEKIESLRAEFQCLEQNHLK--ELEDTlhIRHTQ 997
Cdd:TIGR00606 198 QGQKVQEHQMELKYLKQYkekaCEIRDQITSKEAQLESSREIvksYENELDPLKNRLKEIEHNLSKimKLDNE--IKALK 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 998 EFEKVMTDHNMSLEKLKKENQQRIDQML----ESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELAlkEAETDEIKIL 1073
Cdd:TIGR00606 276 SRKKQMEKDNSELELKMEKVFQGTDEQLndlyHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKT--ELLVEQGRLQ 353
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1074 LEESRTQQKEMLKSLLEQETEnLRTEISKLnqkihdnnesyqvglsELRALMTIEKDQCISELISRHEEESNILKAELDN 1153
Cdd:TIGR00606 354 LQADRHQEHIRARDSLIQSLA-TRLELDGF----------------ERGPFSERQIKNFHTLVIERQEDEAKTAAQLCAD 416
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1154 VTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEkyealIQNLEKDKERLVKNHEQDKEHLIQELNFE 1233
Cdd:TIGR00606 417 LQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKE-----LQQLEGSSDRILELDQELRKAERELSKAE 491
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1234 KNKAVQTALDEFKVERElvEKELLEKVKHLENQiaktpafESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEM 1313
Cdd:TIGR00606 492 KNSLTETLKKEVKSLQN--EKADLDRKLRKLDQ-------EMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDEL 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1314 QNVRTSLIAEQQ-------TNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVS 1386
Cdd:TIGR00606 563 TSLLGYFPNKKQledwlhsKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDEESDLE 642
|
490 500 510
....*....|....*....|....*....|...
gi 226693374 1387 KLRTSSFLSS---APVAAAPELYGACAPELPGE 1416
Cdd:TIGR00606 643 RLKEEIEKSSkqrAMLAGATAVYSQFITQLTDE 675
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1019-1179 |
9.83e-05 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 45.69 E-value: 9.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1019 QRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKS----LLEQETE 1094
Cdd:COG1579 20 DRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNkeyeALQKEIE 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1095 NLRTEISKLNQKIHDNN---ESYQVGLSELRALMTIEKDQcISELISRHEEESNILKAELDNVTSlhrQAYEIEKKLKEQ 1171
Cdd:COG1579 100 SLKRRISDLEDEILELMeriEELEEELAELEAELAELEAE-LEEKKAELDEELAELEAELEELEA---EREELAAKIPPE 175
|
....*...
gi 226693374 1172 IVELQTRL 1179
Cdd:COG1579 176 LLALYERI 183
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
845-1349 |
1.05e-04 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 47.35 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 845 NIIEKVKCSLEITLKEKHQQELQSLKIEYECKldalvKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIK------HE 918
Cdd:TIGR01612 533 NIKAKLYKEIEAGLKESYELAKNWKKLIHEIK-----KELEEENEDSIHLEKEIKDLFDKYLEIDDEIIYINklklelKE 607
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 919 KDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHiRHTQE 998
Cdd:TIGR01612 608 KIKNISDKNEYIKKAIDLKKIIENNNAYIDELAKISPYQVPEHLKNKDKIYSTIKSELSKIYEDDIDALYNELS-SIVKE 686
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 999 FEKVMTDHNMSLEKLKKENQQRIDQM-------LESHASTIQEKEQQLqelklkVSDLSDMRCKLEVELalkeaeTDEIK 1071
Cdd:TIGR01612 687 NAIDNTEDKAKLDDLKSKIDKEYDKIqnmetatVELHLSNIENKKNEL------LDIIVEIKKHIHGEI------NKDLN 754
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1072 ILLEESRTQQKEMLKSLLEQETENlrTEISKLNQKI-----HDNNESYQVGLSELRALMTIEKDQCISELISRHEEES-- 1144
Cdd:TIGR01612 755 KILEDFKNKEKELSNKINDYAKEK--DELNKYKSKIseiknHYNDQINIDNIKDEDAKQNYDKSKEYIKTISIKEDEIfk 832
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1145 --NILKAELDNVTSLHRQAYEIEKKLKEQI-------VELQTRLNSELSalekqkDEKITQQEEKYealiqnleKDKERL 1215
Cdd:TIGR01612 833 iiNEMKFMKDDFLNKVDKFINFENNCKEKIdseheqfAELTNKIKAEIS------DDKLNDYEKKF--------NDSKSL 898
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1216 VKNHEQDKEHLIQELNFEKN--------KAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSsslvaeL 1287
Cdd:TIGR01612 899 INEINKSIEEEYQNINTLKKvdeyikicENTKESIEKFHNKQNILKEILNKNIDTIKESNLIEKSYKDKFDNT------L 972
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1288 QEKLQEEKAKFLE-QLEEQEKRKNEEMQNVrTSLIAEQQTNFNTVLTREKMRKENIINDLSDK 1349
Cdd:TIGR01612 973 IDKINELDKAFKDaSLNDYEAKNNELIKYF-NDLKANLGKNKENMLYHQFDEKEKATNDIEQK 1034
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
876-1160 |
1.17e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 46.82 E-value: 1.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 876 KLDALVKDSEENVNKILKLKENLVSLE-EALQNKDNEFTSIKHE-KDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQS 953
Cdd:PRK01156 487 EIEIEVKDIDEKIVDLKKRKEYLESEEiNKSINEYNKIESARADlEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSK 566
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 954 REMALEDLKKLHDEKIESLRAEFQCLEQnHLKELEDTLHirhtqEFEKVMTDHNMSLEKLKKENQQRIDqMLESHASTIQ 1033
Cdd:PRK01156 567 RTSWLNALAVISLIDIETNRSRSNEIKK-QLNDLESRLQ-----EIEIGFPDDKSYIDKSIREIENEAN-NLNNKYNEIQ 639
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1034 EKEQQLQELKLKVSDLSDMRCKL-EVELALKE--AETDEIKILLEESRTQQKEMLKSLLEQET--ENLRTEISKLNQKIH 1108
Cdd:PRK01156 640 ENKILIEKLRGKIDNYKKQIAEIdSIIPDLKEitSRINDIEDNLKKSRKALDDAKANRARLEStiEILRTRINELSDRIN 719
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1109 DNNESYQVGLSELRALMTIEKdqcISELISRHEEESNILKAELDNVTSLHRQ 1160
Cdd:PRK01156 720 DINETLESMKKIKKAIGDLKR---LREAFDKSGVPAMIRKSASQAMTSLTRK 768
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1069-1474 |
1.43e-04 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 46.57 E-value: 1.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1069 EIKILLEESRTQQKEMLKSLLEQET-ENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNIL 1147
Cdd:PRK02224 139 EVNKLINATPSDRQDMIDDLLQLGKlEEYRERASDARLGVERVLSDQRGSLDQLKAQIEEKEEKDLHERLNGLESELAEL 218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1148 KAELDNVTSLHRQAYEIEKKLKEQIVELQTRLnSELSALEKQKD---EKITQQEEKYEALIQNLEKDKERLvKNHEQDKE 1224
Cdd:PRK02224 219 DEEIERYEEQREQARETRDEADEVLEEHEERR-EELETLEAEIEdlrETIAETEREREELAEEVRDLRERL-EELEEERD 296
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1225 HLIQELNFEK--NKAVQTALDEFKVERELVEKELLEK---VKHLENQIaktpafESAREDSSSLVAELQEKlqEEKAKFL 1299
Cdd:PRK02224 297 DLLAEAGLDDadAEAVEARREELEDRDEELRDRLEECrvaAQAHNEEA------ESLREDADDLEERAEEL--REEAAEL 368
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1300 EqleeqekrknEEMQNVRTSlIAEQQTnfntvlTREKMRKEniINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKK 1379
Cdd:PRK02224 369 E----------SELEEAREA-VEDRRE------EIEELEEE--IEELRERFGDAPVDLGNAEDFLEELREERDELREREA 429
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1380 QLEEEVSKLRTSsflssapVAAAPELYGA-----CAPELPGEPERSVMETADEGRLDSAMETSMMSVQENMLSEEKQRIM 1454
Cdd:PRK02224 430 ELEATLRTARER-------VEEAEALLEAgkcpeCGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAE 502
|
410 420
....*....|....*....|...
gi 226693374 1455 LL---ERTLQLKEEENKRLNQRL 1474
Cdd:PRK02224 503 DLveaEDRIERLEERREDLEELI 525
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
837-1239 |
1.50e-04 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 46.49 E-value: 1.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 837 KCTAVEIRNIIEKVKCSLEITLKEKHQQEL-QSLKIEYECKLDALVKDSEENVNKILKLKE-NLVSLEEALQNKDNEFTS 914
Cdd:COG5185 143 LDEIADIEASYGEVETGIIKDIFGKLTQELnQNLKKLEIFGLTLGLLKGISELKKAEPSGTvNSIKESETGNLGSESTLL 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 915 IKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKE-LKQSREMALEDL-------KKLhDEKIESLRAEFQCLEQNhLKE 986
Cdd:COG5185 223 EKAKEIINIEEALKGFQDPESELEDLAQTSDKLEKlVEQNTDLRLEKLgenaessKRL-NENANNLIKQFENTKEK-IAE 300
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 987 LEDTLHIR-HTQEFEKVM--TDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDmrcklEVELALK 1063
Cdd:COG5185 301 YTKSIDIKkATESLEEQLaaAEAEQELEESKRETETGIQNLTAEIEQGQESLTENLEAIKEEIENIVG-----EVELSKS 375
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1064 EAETDEIKILLEESRT-----------QQKEMLKSL------LEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMT 1126
Cdd:COG5185 376 SEELDSFKDTIESTKEsldeipqnqrgYAQEILATLedtlkaADRQIEELQRQIEQATSSNEEVSKLLNELISELNKVMR 455
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1127 IEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKI-------TQQEE 1199
Cdd:COG5185 456 EADEESQSRLEEAYDEINRSVRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQLEGVRSKLDQVAeslkdfmRARGY 535
|
410 420 430 440
....*....|....*....|....*....|....*....|
gi 226693374 1200 KYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQ 1239
Cdd:COG5185 536 AHILALENLIPASELIQASNAKTDGQAANLRTAVIDELTQ 575
|
|
| CCDC22 |
pfam05667 |
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ... |
1009-1255 |
1.63e-04 |
|
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.
Pssm-ID: 461708 [Multi-domain] Cd Length: 600 Bit Score: 46.17 E-value: 1.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKENQQRIDQMleshastiqekEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQ--KEMLK 1086
Cdd:pfam05667 325 TEEELQQQREEELEEL-----------QEQLEDLESSIQELEKEIKKLESSIKQVEEELEELKEQNEELEKQYkvKKKTL 393
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1087 SLLEQETENlrteISKLNQKIHDNNESyqvgLSELRALMtiEKDQciSELISRHEEesniLKAELDNvtslhrQAYEIEK 1166
Cdd:pfam05667 394 DLLPDAEEN----IAKLQALVDASAQR----LVELAGQW--EKHR--VPLIEEYRA----LKEAKSN------KEDESQR 451
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1167 KLkEQIVELQTRLNsELSALEKQKDEKITQQEEKYEAL----------------IQNLEKDKERLVKNhEQDKEHLIQEL 1230
Cdd:pfam05667 452 KL-EEIKELREKIK-EVAEEAKQKEELYKQLVAEYERLpkdvsrsaytrrileiVKNIKKQKEEITKI-LSDTKSLQKEI 528
|
250 260
....*....|....*....|....*
gi 226693374 1231 NFEKNKAVQTaldeFKVERELVEKE 1255
Cdd:pfam05667 529 NSLTGKLDRT----FTVTDELVFKD 549
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
993-1217 |
1.70e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 46.16 E-value: 1.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 993 IRHTQEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSdmRCKLEVELALKEAETDEIKI 1072
Cdd:PHA02562 190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLV--MDIEDPSAALNKLNTAAAKI 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1073 lleESRTQQKEMLKSLLEQE------TENLRTEISKLnQKIHDNNESYQVGLSELralmtiekDQCISELISRHEEESNI 1146
Cdd:PHA02562 268 ---KSKIEQFQKVIKMYEKGgvcptcTQQISEGPDRI-TKIKDKLKELQHSLEKL--------DTAIDELEEIMDEFNEQ 335
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1147 LKA--ELDNVTSLHRQAYEIE----KKLKEQIVELQ---TRLNSELSALEKQKDEKITQQ----EEKYE-ALIQNLEKD- 1211
Cdd:PHA02562 336 SKKllELKNKISTNKQSLITLvdkaKKVKAAIEELQaefVDNAEELAKLQDELDKIVKTKselvKEKYHrGIVTDLLKDs 415
|
....*...
gi 226693374 1212 --KERLVK 1217
Cdd:PHA02562 416 giKASIIK 423
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
806-1369 |
1.70e-04 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 46.58 E-value: 1.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 806 DSHSSIQTIKDDLCHFRTFVQKEQCDLANyLKCTAVEIRNIIEKVKCSLEITLKEK-------HQQELQSLKIEYECKLD 878
Cdd:TIGR01612 1650 DNLNSLQEFLESLKDQKKNIEDKKKELDE-LDSEIEKIEIDVDQHKKNYEIGIIEKikeiaiaNKEEIESIKELIEPTIE 1728
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 879 ALVKDSEENVNKILKLKENLVSLEEALQNKDNEFtsIKHEKDAIVCVqqekdqKLLEMEKIMHTQHCEIKELKQSREMAL 958
Cdd:TIGR01612 1729 NLISSFNTNDLEGIDPNEKLEEYNTEIGDIYEEF--IELYNIIAGCL------ETVSKEPITYDEIKNTRINAQNEFLKI 1800
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 959 EDLKKLHDEKIESLRA-EFQCLeQNHLKELEDTLHIRHTQEFEKV----------------MTDHNMSLEKLKKENQQRI 1021
Cdd:TIGR01612 1801 IEIEKKSKSYLDDIEAkEFDRI-INHFKKKLDHVNDKFTKEYSKInegfddisksienvknSTDENLLFDILNKTKDAYA 1879
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1022 DQMLESHASTIQEKEQQLQEL-KLKVS-----------DLSDmRCKLEVELALKEAETDEIKILLE-----ESRTQQKEM 1084
Cdd:TIGR01612 1880 GIIGKKYYSYKDEAEKIFINIsKLANSiniqiqnnsgiDLFD-NINIAILSSLDSEKEDTLKFIPSpekepEIYTKIRDS 1958
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1085 LKSLL------------EQETENLRTEISKLNQKIHDNNESYQVgLSELRalmtIEKDQCISE--LISRHEEESNILKAE 1150
Cdd:TIGR01612 1959 YDTLLdifkksqdlhkkEQDTLNIIFENQQLYEKIQASNELKDT-LSDLK----YKKEKILNDvkLLLHKFDELNKLSCD 2033
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1151 LDNVTSLHRQAYEIEKKLKEQIVELQTR---LNSELSALEKQKDEKItQQEEKYEaliqNLEKDKERLVKNHEQDKEHLI 1227
Cdd:TIGR01612 2034 SQNYDTILELSKQDKIKEKIDNYEKEKEkfgIDFDVKAMEEKFDNDI-KDIEKFE----NNYKHSEKDNHDFSEEKDNII 2108
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1228 QElnfekNKAVQTALDEFKVERELVEKELLEKvKHLENQIAKTPAfESAREDSSSLVAELQEKLQEEkAKFLEQLEEQEK 1307
Cdd:TIGR01612 2109 QS-----KKKLKELTEAFNTEIKIIEDKIIEK-NDLIDKLIEMRK-ECLLFSYATLVETLKSKVINH-SEFITSAAKFSK 2180
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374 1308 RKNEEMQNVRTSL---IAEQQTNFNTVLTREKMrkENIINDLSDKLKSTMQQQERDKDLIESLSE 1369
Cdd:TIGR01612 2181 DFFEFIEDISDSLnddIDALQIKYNLNQTKKHM--ISILADATKDHNNLIEKEKEATKIINNLTE 2243
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
862-1364 |
1.78e-04 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 46.28 E-value: 1.78e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 862 HQQELQSLKIEYECKLDALVKDSEenvnkilKLKENLVSLEEALQNKDNEFTSIKHEKDaIVCVQQEKDQKLLEM----- 936
Cdd:pfam07111 149 HQEQLSSLTQAHEEALSSLTSKAE-------GLEKSLNSLETKRAGEAKQLAEAQKEAE-LLRKQLSKTQEELEAqvtlv 220
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 937 --------EKIMHTQHCEIKELKqsREMALEDLKKLHDEK------IESLRAEFQCLEQN-HLKELEDTLHIRHTQEFEK 1001
Cdd:pfam07111 221 eslrkyvgEQVPPEVHSQTWELE--RQELLDTMQHLQEDRadlqatVELLQVRVQSLTHMlALQEEELTRKIQPSDSLEP 298
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1002 VMTDHNMSL-----EKLKKENQQRIDQMLEsHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETD----EIKI 1072
Cdd:pfam07111 299 EFPKKCRSLlnrwrEKVFALMVQLKAQDLE-HRDSVKQLRGQVAELQEQVTSQSQEQAILQRALQDKAAEVEvermSAKG 377
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1073 L------LEESRTQQKEMLKSLLEQ----------ETENLRTEISKLNQKIH-----DNNESYQVglselRALMTIEKDQ 1131
Cdd:pfam07111 378 LqmelsrAQEARRRQQQQTASAEEQlkfvvnamssTQIWLETTMTRVEQAVAripslSNRLSYAV-----RKVHTIKGLM 452
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1132 CISELISRHEEESNILKAELDNVTSlhrqayEIEKKLkEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKD 1211
Cdd:pfam07111 453 ARKVALAQLRQESCPPPPPAPPVDA------DLSLEL-EQLREERNRLDAELQLSAHLIQQEVGRAREQGEAERQQLSEV 525
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1212 KERLVKNHEQDKEHLI---QELNFEKNKAVQTALDEFKVERELVEKELLEKvkhlenqiaktpafesaredssslvAELQ 1288
Cdd:pfam07111 526 AQQLEQELQRAQESLAsvgQQLEVARQGQQESTEEAASLRQELTQQQEIYG-------------------------QALQ 580
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1289 EKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTvlTREKMR-------KENIINDLSDKLKSTMQQQERDK 1361
Cdd:pfam07111 581 EKVAEVETRLREQLSDTKRRLNEARREQAKAVVSLRQIQHRA--TQEKERnqelrrlQDEARKEEGQRLARRVQELERDK 658
|
...
gi 226693374 1362 DLI 1364
Cdd:pfam07111 659 NLM 661
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1127-1468 |
2.63e-04 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.83 E-value: 2.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1127 IEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQtRLNSELSALEKQKdEKITQQEEKYEALIQ 1206
Cdd:PRK03918 185 IKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELE-ELKEEIEELEKEL-ESLEGSKRKLEEKIR 262
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1207 NLEKDKERLVKnHEQDKEHLIQELNFEKNKAVQ-TALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVA 1285
Cdd:PRK03918 263 ELEERIEELKK-EIEELEEKVKELKELKEKAEEyIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1286 ELQEKLQEEKAKfLEQLEEQE------KRKNEEMQNVRtsliaeqqtnfntvltreKMRKENIINDLSDKLKSTMQQQER 1359
Cdd:PRK03918 342 ELKKKLKELEKR-LEELEERHelyeeaKAKKEELERLK------------------KRLTGLTPEKLEKELEELEKAKEE 402
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1360 DKDLIESLSEDRARLLEEKKQLEEEVSKLRTSSflSSAPVaaapelygaCAPELPGEPERSVME--TADEGRLDSAMETS 1437
Cdd:PRK03918 403 IEEEISKITARIGELKKEIKELKKAIEELKKAK--GKCPV---------CGRELTEEHRKELLEeyTAELKRIEKELKEI 471
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 226693374 1438 MMSVQ---------ENMLSEEKQRIMLLERTLQLKEEENK 1468
Cdd:PRK03918 472 EEKERklrkelrelEKVLKKESELIKLKELAEQLKELEEK 511
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1032-1308 |
3.13e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 45.68 E-value: 3.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1032 IQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTqqkemlksllEQETENLRTEISKLNQKIhdnn 1111
Cdd:COG4913 612 LAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWD----------EIDVASAEREIAELEAEL---- 677
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1112 ESYQVGLSELRALmtiekdqciselisrhEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKD 1191
Cdd:COG4913 678 ERLDASSDDLAAL----------------EEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAED 741
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1192 EKITQQEEKYEALIQNLEKDK--ERLVKNHEQDKEHLIQELNfEKNKAVQTALDEFKVERELVEKELlekvkhlenqiak 1269
Cdd:COG4913 742 LARLELRALLEERFAAALGDAveRELRENLEERIDALRARLN-RAEEELERAMRAFNREWPAETADL------------- 807
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 226693374 1270 TPAFESAREdssslVAELQEKLQEE-----KAKFLEQLEEQEKR 1308
Cdd:COG4913 808 DADLESLPE-----YLALLDRLEEDglpeyEERFKELLNENSIE 846
|
|
| GOLGA2L5 |
pfam15070 |
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein ... |
1010-1306 |
3.14e-04 |
|
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein family remains unknown. This family of proteins is thought to be found in the Golgi apparatus of eukaryotes.
Pssm-ID: 464485 [Multi-domain] Cd Length: 521 Bit Score: 45.44 E-value: 3.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1010 LEKLKKEN---QQRIDQMLESHASTIQEKEQ---QLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKE 1083
Cdd:pfam15070 17 AENLKEEGavwQQKMQQLSEQVRTLREEKERsvsQVQELETSLAELKNQAAVPPAEEEQPPAGPSEEEQRLQEEAEQLQK 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1084 MLKSLLEQETENLR--TEISKLNQKihdnnesYQVGLSEL---------------RALMTIEKDQC-ISELISRHEEESN 1145
Cdd:pfam15070 97 ELEALAGQLQAQVQdnEQLSRLNQE-------QEQRLLELeraaerwgeqaedrkQILEDMQSDRAtISRALSQNRELKE 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1146 ILkAELDN-----------VTSLHRQAYEIEKKLKEQIVELQTRL----------NSELSALEKQKDEKIT--QQeekYE 1202
Cdd:pfam15070 170 QL-AELQNgfvkltnenmeLTSALQSEQHVKKELAKKLGQLQEELgelketlelkSQEAQSLQEQRDQYLAhlQQ---YV 245
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1203 ALIQNLEKDKERLVKNHEQdKEHLIQELNFEKnkaVQTaldefKVERELVEKELLEKVKHLenqiaktpafESAREDSSS 1282
Cdd:pfam15070 246 AAYQQLASEKEELHKQYLL-QTQLMDRLQHEE---VQG-----KVAAEMARQELQETQERL----------EALTQQNQQ 306
|
330 340
....*....|....*....|....
gi 226693374 1283 LVAELQEKLQEEKAKFLEQLEEQE 1306
Cdd:pfam15070 307 LQAQLSLLANPGEGDGLESEEEEE 330
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
827-1106 |
4.18e-04 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 44.86 E-value: 4.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 827 KEQCDLANYLKCTAVEIRNIIEKVKCSLEITLKEKHQQELQSLKIEYECKldalvkdSEENVNKILKLKENLVSLEEALQ 906
Cdd:pfam02029 80 QEALERQKEFDPTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRYK-------EEETEIREKEYQENKWSTEVRQA 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 907 NKDNEftsiKHEKDAIVCVQQEKDQKLLEMEKIMHTQhceiKELKQSREM-ALEDLKKLHDE-KIESLRAEFQCLEQNHL 984
Cdd:pfam02029 153 EEEGE----EEEDKSEEAEEVPTENFAKEEVKDEKIK----KEKKVKYESkVFLDQKRGHPEvKSQNGEEEVTKLKVTTK 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 985 KELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKvsdLSDMRCKLEVELALKE 1064
Cdd:pfam02029 225 RRQGGLSQSQEREEEAEVFLEAEQKLEELRRRRQEKESEEFEKLRQKQQEAELELEELKKK---REERRKLLEEEEQRRK 301
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 226693374 1065 AETDEIKILLEESRTQQKEmlkslleqETENLRTEISKLNQK 1106
Cdd:pfam02029 302 QEEAERKLREEEEKRRMKE--------EIERRRAEAAEKRQK 335
|
|
| ATG17_like |
pfam04108 |
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ... |
312-537 |
4.98e-04 |
|
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.
Pssm-ID: 427715 [Multi-domain] Cd Length: 360 Bit Score: 44.30 E-value: 4.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 312 NVQDRPNDVESLVRKCFDSMSRLDPKIIQPFMLECHQTIAKLDNQNMKAikgLEDRLYALDQMIASCSRLVNEQKELAQG 391
Cdd:pfam04108 127 ELQAAQESLDSDLKRFDDDLRDLQKELESLSSPSESISLIPTLLKELES---LEEEMASLLESLTNHYDQCVTAVKLTEG 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 392 FLANqMRAENLKDASVLPDLCLSHANQLMIMLQNHRKLLDIKQKCTTAKQELANNLHV------RLKWCCFVMLHADQDG 465
Cdd:pfam04108 204 GRAE-MLEVLENDARELDDVVPELQDRLDEMENNYERLQKLLEQKNSLIDELLSALQLiaeiqsRLPEYLAALKEFEERW 282
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 226693374 466 EKLQALLRLVIELLERVRivEALSTVPQMYCLAVVEVVRRKMFIKHYREWAGALVKDGKQLYEAEKSKRESF 537
Cdd:pfam04108 283 EEEKETIEDYLSELEDLR--EFYEGFPSAYGSLLLEVERRREWAEKMKKILRKLAEELDRLQEEERKRREKF 352
|
|
| EzrA |
pfam06160 |
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ... |
934-1356 |
5.20e-04 |
|
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.
Pssm-ID: 428797 [Multi-domain] Cd Length: 542 Bit Score: 44.46 E-value: 5.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 934 LEMEKImhtqhcEIKELKQSREmaLEDLKKLH-----DEKIESLRAEFQCLEQNHLKELEDTLHirhtqefekvmtDHNM 1008
Cdd:pfam06160 15 LEERKN------ELMNLPVQEE--LSKVKKLNltgetQEKFEEWRKKWDDIVTKSLPDIEELLF------------EAEE 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLK----KENQQRIDQMLESHASTIQEKEQQLQELklkvsdlsdmrckLEVElalkEAETDEIKILLEESRTQQKEM 1084
Cdd:pfam06160 75 LNDKYRfkkaKKALDEIEELLDDIEEDIKQILEELDEL-------------LESE----EKNREEVEELKDKYRELRKTL 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1085 LK---------SLLEQETENLRTEISKLNQKIH--DNNESYQVgLSELRAlMTIEKDQCISELISRHEEESNILKAELDN 1153
Cdd:pfam06160 138 LAnrfsygpaiDELEKQLAEIEEEFSQFEELTEsgDYLEAREV-LEKLEE-ETDALEELMEDIPPLYEELKTELPDQLEE 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1154 VTSLHRQA----YEIE-KKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNL------EKDKERLVKNHEQD 1222
Cdd:pfam06160 216 LKEGYREMeeegYALEhLNVDKEIQQLEEQLEENLALLENLELDEAEEALEEIEERIDQLydllekEVDAKKYVEKNLPE 295
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1223 KEHLIQELNfEKNKAVQTALDEFKVERELVEKELlEKVKHLENQIA---------------KTPAFESAREDSSSLVAEL 1287
Cdd:pfam06160 296 IEDYLEHAE-EQNKELKEELERVQQSYTLNENEL-ERVRGLEKQLEelekrydeiverleeKEVAYSELQEELEEILEQL 373
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1288 Q--EKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAeqqtnfntvlTREKMRKENI----------INDLSDKLKSTMQ 1355
Cdd:pfam06160 374 EeiEEEQEEFKESLQSLRKDELEAREKLDEFKLELRE----------IKRLVEKSNLpglpesyldyFFDVSDEIEDLAD 443
|
.
gi 226693374 1356 Q 1356
Cdd:pfam06160 444 E 444
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
874-1356 |
5.35e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 45.04 E-value: 5.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 874 ECKLDALVKDSEenvNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQS 953
Cdd:TIGR00606 690 EAELQEFISDLQ---SKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKND 766
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 954 REMALEDLKKL--HDEKIESLRAEFQCLEQNHLkELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQR------IDQML 1025
Cdd:TIGR00606 767 IEEQETLLGTImpEEESAKVCLTDVTIMERFQM-ELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKqheldtVVSKI 845
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1026 ESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAetdeikilleesrtqqkemlkslLEQETENLRTEISKLNQ 1105
Cdd:TIGR00606 846 ELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQ-----------------------FEEQLVELSTEVQSLIR 902
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1106 KIHDNNEsyqvglSELRALMTIEKDQC-ISELISRHEEESNILKAELDnvtslhrqayEIEKKLKEQIVELQTRLNSels 1184
Cdd:TIGR00606 903 EIKDAKE------QDSPLETFLEKDQQeKEELISSKETSNKKAQDKVN----------DIKEKVKNIHGYMKDIENK--- 963
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1185 aLEKQKDEKITQQEEKYEALIQNLEKDKERLvKNHEQDKEHLIQELNFEKNKA--VQTALDEFKVERELveKELLEKVKH 1262
Cdd:TIGR00606 964 -IQDGKDDYLKQKETELNTVNAQLEECEKHQ-EKINEDMRLMRQDIDTQKIQErwLQDNLTLRKRENEL--KEVEEELKQ 1039
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1263 LENQIAKTPAFESAREDSsslvaELQEKLQ---EEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRK 1339
Cdd:TIGR00606 1040 HLKEMGQMQVLQMKQEHQ-----KLEENIDlikRNHVLALGRQKGYEKEIKHFKKELREPQFRDAEEKYREMMIVMRTTE 1114
|
490
....*....|....*..
gi 226693374 1340 EnIINDLsDKLKSTMQQ 1356
Cdd:TIGR00606 1115 L-VNKDL-DIYYKTLDQ 1129
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
1055-1318 |
6.25e-04 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 44.53 E-value: 6.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1055 KLEVeLALKEAETDEIKILLEESRTQQKEMLKSLLEQETENLRTEISKLNQKIhDNNESYQVGLSELRALMTIEKDQCIS 1134
Cdd:PRK05771 8 KVLI-VTLKSYKDEVLEALHELGVVHIEDLKEELSNERLRKLRSLLTKLSEAL-DKLRSYLPKLNPLREEKKKVSVKSLE 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1135 ELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQ--TRLNSELSAL-------------EKQKDEKITQQEE 1199
Cdd:PRK05771 86 ELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEpwGNFDLDLSLLlgfkyvsvfvgtvPEDKLEELKLESD 165
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1200 KYEALIQNLEKDKERLV----KNHEQDKEHLIQELNFEKN-----KAVQTALDEFKVERELVEKELlekvKHLENQIakt 1270
Cdd:PRK05771 166 VENVEYISTDKGYVYVVvvvlKELSDEVEEELKKLGFERLeleeeGTPSELIREIKEELEEIEKER----ESLLEEL--- 238
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 226693374 1271 pafesaredssslvAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRT 1318
Cdd:PRK05771 239 --------------KELAKKYLEELLALYEYLEIELERAEALSKFLKT 272
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1186-1351 |
6.66e-04 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 44.38 E-value: 6.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1186 LEKQKDEKITQQEEKYEALIQNLEKDKERLvknheqdkehliqelnfeKNKAVQTALDEFKVERELVEKELLEKVKHLEN 1265
Cdd:PRK12704 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAI------------------KKEALLEAKEEIHKLRNEFEKELRERRNELQK 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1266 QIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKnEEMQNvrtsLIAEQQTNFNTV--LTREKMRKEnII 1343
Cdd:PRK12704 87 LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKE-EELEE----LIEEQLQELERIsgLTAEEAKEI-LL 160
|
....*...
gi 226693374 1344 NDLSDKLK 1351
Cdd:PRK12704 161 EKVEEEAR 168
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
848-1266 |
7.87e-04 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 44.19 E-value: 7.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 848 EKVKCSLEITLKEKHQQELQSLK---IEYECKLDALVkDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVC 924
Cdd:TIGR00618 485 ETRKKAVVLARLLELQEEPCPLCgscIHPNPARQDID-NPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKE 563
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 925 VQQEKDQKLLEMEKIMHTQHCEIKELKQsremALEDLKKLHDEKIEsLRAEFQCLEQNHLKELEDTLH----IRHTQEFE 1000
Cdd:TIGR00618 564 QMQEIQQSFSILTQCDNRSKEDIPNLQN----ITVRLQDLTEKLSE-AEDMLACEQHALLRKLQPEQDlqdvRLHLQQCS 638
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1001 K----VMTDHNMSLEKLKKENQQ----RIDQMLESHASTIQEKEQQLQELKLKVSD--------LSDMRCKLEVELAL-K 1063
Cdd:TIGR00618 639 QelalKLTALHALQLTLTQERVRehalSIRVLPKELLASRQLALQKMQSEKEQLTYwkemlaqcQTLLRELETHIEEYdR 718
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1064 EAETDEIKILLEESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNEsyqvglseLRALMTIEKDQCISELIS----- 1138
Cdd:TIGR00618 719 EFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNN--------EEVTAALQTGAELSHLAAeiqff 790
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1139 -RHEEESNILKAELDNVTSLHRQAYEIEKKLkeQIVELQTRLNSELSALEK--QKDEKITQQEEKYEALIQNLEKDKERL 1215
Cdd:TIGR00618 791 nRLREEDTHLLKTLEAEIGQEIPSDEDILNL--QCETLVQEEEQFLSRLEEksATLGEITHQLLKYEECSKQLAQLTQEQ 868
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|.
gi 226693374 1216 VKNHEQDKehliqELNFEKNKAVQTALDEFkveRELVEKELLEKVKHLENQ 1266
Cdd:TIGR00618 869 AKIIQLSD-----KLNGINQIKIQFDGDAL---IKFLHEITLYANVRLANQ 911
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
1034-1373 |
8.90e-04 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 44.04 E-value: 8.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1034 EKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLE---------------ESRTQQ---KEMLKSLLEQETEN 1095
Cdd:pfam10174 124 EHERQAKELFLLRKTLEEMELRIETQKQTLGARDESIKKLLEmlqskglpkksgeedWERTRRiaeAEMQLGHLEVLLDQ 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1096 LRTEISKLNQKIHDNNESYQvGLSELRALMTI--EKDQCISEL---ISRHEEESNILKAELDnvtsLHRQAYEIEKKlke 1170
Cdd:pfam10174 204 KEKENIHLREELHRRNQLQP-DPAKTKALQTVieMKDTKISSLernIRDLEDEVQMLKTNGL----LHTEDREEEIK--- 275
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1171 qivelQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEH---LIQELNFEKNKA--VQTALDEF 1245
Cdd:pfam10174 276 -----QMEVYKSHSKFMKNKIDQLKQELSKKESELLALQTKLETLTNQNSDCKQHievLKESLTAKEQRAaiLQTEVDAL 350
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1246 KVERELVEKELLEKVKHLENqiaktpafesAREDSSSLVAELQE-----KLQEEKA----KFLEQLEEQEKRKNEEMQNV 1316
Cdd:pfam10174 351 RLRLEEKESFLNKKTKQLQD----------LTEEKSTLAGEIRDlkdmlDVKERKInvlqKKIENLQEQLRDKDKQLAGL 420
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1317 RTSLIAEQQTNFNTvltrekmrkeniindlsDKLKSTMQQQERDKD-LIESLSEDRAR 1373
Cdd:pfam10174 421 KERVKSLQTDSSNT-----------------DTALTTLEEALSEKErIIERLKEQRER 461
|
|
| PLN02939 |
PLN02939 |
transferase, transferring glycosyl groups |
1134-1359 |
1.35e-03 |
|
transferase, transferring glycosyl groups
Pssm-ID: 215507 [Multi-domain] Cd Length: 977 Bit Score: 43.35 E-value: 1.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1134 SELISRHEEESNILKAELDNVTSLHRQAyeiekklkeqivelqTRLNSELSALEKQKDEKITQQEEKyealIQNLEKDKE 1213
Cdd:PLN02939 63 SKLQSNTDENGQLENTSLRTVMELPQKS---------------TSSDDDHNRASMQRDEAIAAIDNE----QQTNSKDGE 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1214 RLVKNHEQDKEHLIQelNFEKN-----KAVQTALDEfkVERELVEKELLE-KVKHLENQIAKTPAfesaredssslvael 1287
Cdd:PLN02939 124 QLSDFQLEDLVGMIQ--NAEKNilllnQARLQALED--LEKILTEKEALQgKINILEMRLSETDA--------------- 184
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 226693374 1288 QEKLQEEKAKFLEQLEEQ-EKRKNEEMQNVRTSLIAEQQTNFN-TVLTREKMRKENIINDLSDKLKSTMQQQER 1359
Cdd:PLN02939 185 RIKLAAQEKIHVEILEEQlEKLRNELLIRGATEGLCVHSLSKElDVLKEENMLLKDDIQFLKAELIEVAETEER 258
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1158-1311 |
1.89e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 42.84 E-value: 1.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1158 HRQAYEIEKKLKEQIVELQT----------RLNSELSALEKqKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHL- 1226
Cdd:PRK12704 67 HKLRNEFEKELRERRNELQKlekrllqkeeNLDRKLELLEK-REEELEKKEKELEQKQQELEKKEEELEELIEEQLQELe 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1227 -IQELNFEKNKavqtaldefkverelveKELLEKVKhlenqiaktpafESAREDSSSLVAELqeklqEEKAKfleqlEEQ 1305
Cdd:PRK12704 146 rISGLTAEEAK-----------------EILLEKVE------------EEARHEAAVLIKEI-----EEEAK-----EEA 186
|
....*.
gi 226693374 1306 EKRKNE 1311
Cdd:PRK12704 187 DKKAKE 192
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1147-1359 |
2.08e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.44 E-value: 2.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1147 LKAELDNVTSLHRQAYEIEKKLKEQIVELQT---RLNSELSALEKQKDEkITQQEEKYEALIQNLEKDKERLVKNHEQDK 1223
Cdd:COG4942 25 AEAELEQLQQEIAELEKELAALKKEEKALLKqlaALERRIAALARRIRA-LEQELAALEAELAELEKEIAELRAELEAQK 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1224 EHLIQELNF----EKNKAVQTALDEFKVERELVEKELLEKV-KHLENQIAktpAFESAREDSSSLVAELQEKlQEEKAKF 1298
Cdd:COG4942 104 EELAELLRAlyrlGRQPPLALLLSPEDFLDAVRRLQYLKYLaPARREQAE---ELRADLAELAALRAELEAE-RAELEAL 179
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1299 LEQLEEQEKRKNEEMQNvRTSLIAEQQTNFNTVLTR--EKMRKENIINDLSDKLKSTMQQQER 1359
Cdd:COG4942 180 LAELEEERAALEALKAE-RQKLLARLEKELAELAAElaELQQEAEELEALIARLEAEAAAAAE 241
|
|
| Ubl_TBK1_like |
cd12219 |
ubiquitin-like (Ubl) domain found in non-canonical Inhibitor of kappa B kinases IKKepsilon and ... |
3-76 |
2.25e-03 |
|
ubiquitin-like (Ubl) domain found in non-canonical Inhibitor of kappa B kinases IKKepsilon and TBK1, and similar proteins; IKKepsilon and TBK1 (TRAF family member-associated NF-kappaB activator-binding kinase 1) are non-canonical members of IKK family. They have been characterized as activators of nuclear factor-kappaB (NF-kappaB), but they are not essential for NF-kappaB activation. They play critical roles in antiviral response via phosphorylation and activation of transcription factors IRF3, IRF7, STAT1 and STAT3. They are also involved in the survival, tumorigenesis and development of various cancers. Both IKKepsilon and TBK1 contain an N-terminal protein kinase domain followed a ubiquitin-like (Ubl) domain. The Ubl domain acts as a protein-protein interaction domain, and has been implicated in regulating kinase activity, which modulates interactions in the interferon pathway.
Pssm-ID: 340518 Cd Length: 77 Bit Score: 38.37 E-value: 2.25e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 3 LYVFLVNTGTTLTF---DTEltvqTVADLKHAIQSKYKIAIQHQVLVVNGGE-CMAADRRVCTYSAGTDTNPIFLFNK 76
Cdd:cd12219 4 IHVFSVSTCELLKIyldPTE----TLAEFQELIAEQTEIPAKNQLLLFEGQLlEEEVTLPVSDYPKTTEENPIILFNK 77
|
|
| Mitofilin |
pfam09731 |
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ... |
859-1040 |
2.43e-03 |
|
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.
Pssm-ID: 430783 [Multi-domain] Cd Length: 618 Bit Score: 42.44 E-value: 2.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 859 KEKHQQELQSLKIEYECKLDALVKDSEENVNK-ILKLKENLVSLEEALQN-KDNEFtsiKHEKDAIVcvQQEKDQKLLEM 936
Cdd:pfam09731 256 RIVFQQELVSIFPDIIPVLKEDNLLSNDDLNSlIAHAHREIDQLSKKLAElKKREE---KHIERALE--KQKEELDKLAE 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 937 EKIMHTQHCEIKELKQSR---EMALEDLKKLHDEKIES-LRAEFQCLEQ---NHLKELEDTLHIRHTQEF-EKVMTDHNM 1008
Cdd:pfam09731 331 ELSARLEEVRAADEAQLRlefEREREEIRESYEEKLRTeLERQAEAHEEhlkDVLVEQEIELQREFLQDIkEKVEEERAG 410
|
170 180 190
....*....|....*....|....*....|....*.
gi 226693374 1009 SLEKLKK--ENQQRIDQMLESHASTIQE--KEQQLQ 1040
Cdd:pfam09731 411 RLLKLNEllANLKGLEKATSSHSEVEDEnrKAQQLW 446
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
773-1087 |
2.70e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 42.36 E-value: 2.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 773 SRRMQDTSTRGNEGFGDRAALHVQLEKCRAAAQDSHSSIQTIKDDLCHFRTFVQKEQCDLANYLKCTAVEIRNIIEKVKC 852
Cdd:TIGR02169 229 LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLER 308
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 853 SLEitLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQK 932
Cdd:TIGR02169 309 SIA--EKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 933 LLEM-EKIMHTQHcEIKELKQSREMALEDLKKLHdEKIESLRAEFQCLEQNHLkeledtlhirhtqEFEKVMTDHNMSLE 1011
Cdd:TIGR02169 387 LKDYrEKLEKLKR-EINELKRELDRLQEELQRLS-EELADLNAAIAGIEAKIN-------------ELEEEKEDKALEIK 451
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 226693374 1012 KlKKENQQRIDQMLESHastiqekEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQkEMLKS 1087
Cdd:TIGR02169 452 K-QEWKLEQLAADLSKY-------EQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVE-EVLKA 518
|
|
| AAA_13 |
pfam13166 |
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ... |
1015-1188 |
2.79e-03 |
|
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins. This family includes the PrrC protein that is thought to be the active component of the anticodon nuclease.
Pssm-ID: 463796 [Multi-domain] Cd Length: 712 Bit Score: 42.36 E-value: 2.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1015 KENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLS---DMRCKLEVELALKEAETDEIKILLEESRtqqKEMLKSL--- 1088
Cdd:pfam13166 282 TEFQNRLQKLIEKVESAISSLLAQLPAVSDLASLLSafeLDVEDIESEAEVLNSQLDGLRRALEAKR---KDPFKSIeld 358
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1089 -LEQETENLRTEISKLNQKIHDNNEsyqvGLSELRALMTIEKDQ----CISELISRHEEESNILKAELDNVTSLHRQAYE 1163
Cdd:pfam13166 359 sVDAKIESINDLVASINELIAKHNE----ITDNFEEEKNKAKKKlrlhLVEEFKSEIDEYKDKYAGLEKAINSLEKEIKN 434
|
170 180
....*....|....*....|....*...
gi 226693374 1164 IE---KKLKEQIVELQTRLNSELSALEK 1188
Cdd:pfam13166 435 LEaeiKKLREEIKELEAQLRDHKPGADE 462
|
|
| FAM184 |
pfam15665 |
Family with sequence similarity 184, A and B; The function of FAM184 is not known. |
857-1046 |
2.91e-03 |
|
Family with sequence similarity 184, A and B; The function of FAM184 is not known.
Pssm-ID: 464788 [Multi-domain] Cd Length: 211 Bit Score: 40.80 E-value: 2.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 857 TLKEKHQQELQSLKIEYECKLDAL-VKDSEEnvnkiLKLKENLVSLEEALQnkdnEFTSIKHEkdAIVCVQQEKDQkLLE 935
Cdd:pfam15665 18 ALKEAHEEEIQQILAETREKILQYkSKIGEE-----LDLKRRIQTLEESLE----QHERMKRQ--ALTEFEQYKRR-VEE 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 936 MEKIMHTQHCEiKELKQSREMalEDLKKLHDEKIESLRAEFQCLEQNHLKELEDtLHIRHTQEFEKVMTDHNMSLEKLKK 1015
Cdd:pfam15665 86 RELKAEAEHRQ-RVVELSREV--EEAKRAFEEKLESFEQLQAQFEQEKRKALEE-LRAKHRQEIQELLTTQRAQSASSLA 161
|
170 180 190
....*....|....*....|....*....|.
gi 226693374 1016 EnQQRIDQMLESHASTIQEKEQQLQELKLKV 1046
Cdd:pfam15665 162 E-QEKLEELHKAELESLRKEVEDLRKEKKKL 191
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
947-1209 |
3.61e-03 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 41.84 E-value: 3.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 947 IKELKQSREMALEDLK-KLHDEKIESLRAEFQcleqnHLKELEDTLhiRHTQEFEKVMTDHNMSLEKLKKEN-QQRIDQM 1024
Cdd:PRK05771 22 LEALHELGVVHIEDLKeELSNERLRKLRSLLT-----KLSEALDKL--RSYLPKLNPLREEKKKVSVKSLEElIKDVEEE 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1025 LESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEaetdEIKILLEESRTQQKemLKSLLEQETENLRTEISKLN 1104
Cdd:PRK05771 95 LEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDL----DLSLLLGFKYVSVF--VGTVPEDKLEELKLESDVEN 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1105 QKIHDNNESYQ----VGLSELRALmtiekdqcISELISRHEEESNILKAEldnvtslhRQAYEIEKKLKEQIVELQT--- 1177
Cdd:PRK05771 169 VEYISTDKGYVyvvvVVLKELSDE--------VEEELKKLGFERLELEEE--------GTPSELIREIKEELEEIEKere 232
|
250 260 270
....*....|....*....|....*....|..
gi 226693374 1178 RLNSELSALEKQKDEKITQQEEKYEALIQNLE 1209
Cdd:PRK05771 233 SLLEELKELAKKYLEELLALYEYLEIELERAE 264
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1174-1389 |
3.65e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 41.67 E-value: 3.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1174 ELQTRLNSELSALEKQKDE---KITQQEEKYEALIQNLEKDKERLVKNHEQdkehlIQELNfEKNKAVQTALDEFKVERE 1250
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAElekELAALKKEEKALLKQLAALERRIAALARR-----IRALE-QELAALEAELAELEKEIA 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1251 LVEKELLEKVKHLENQIAKtpAFESAREDSSSLVAELQEKLQEEK-AKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTnfn 1329
Cdd:COG4942 94 ELRAELEAQKEELAELLRA--LYRLGRQPPLALLLSPEDFLDAVRrLQYLKYLAPARREQAEELRADLAELAALRAE--- 168
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1330 tvLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEekkqleeEVSKLR 1389
Cdd:COG4942 169 --LEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAA-------ELAELQ 219
|
|
| MAP65_ASE1 |
pfam03999 |
Microtubule associated protein (MAP65/ASE1 family); |
858-1151 |
3.88e-03 |
|
Microtubule associated protein (MAP65/ASE1 family);
Pssm-ID: 427641 [Multi-domain] Cd Length: 477 Bit Score: 41.53 E-value: 3.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 858 LKEKHQQELQSLKIEYEcKLDALVKDSEENVNKILKLKENLVSLEEALQNKDnefTSIKHekdaivcVQQEKDQKLLEME 937
Cdd:pfam03999 46 KEQRILQSIADLRAEAA-ILCLYMRNRLLHEERDPFEPKKGMSLLQKEKKLD---TQLEH-------LRKEKAPRLAEIK 114
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 938 KIMHTQHCEIKELKQSREMALED--------------LKKLHDEKiESLRAEFQCLEQNHLKELEDtLHIRHTQEFEKVM 1003
Cdd:pfam03999 115 ELLEQLQQLCEELGEEPLPLLIDplpsleelesfrkhLENLRNEK-ERRLEEVNELKKQIKLLMEE-LDLVPGTDFEEDL 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1004 TDHNMSLEKLKKENQQRIDQMLESHAStiqEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKE 1083
Cdd:pfam03999 193 LCESEDNFCLSRENIDKLRKLIKQLEE---QKAEREEKIDDLREKILELWNRLQVPQEEQESFVRENNSLSQDTIDALRE 269
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1084 MLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAEL 1151
Cdd:pfam03999 270 ELQRLEELKKKNIKKLIEDLRVEIEELWDKLFYSTEQRKRFIPFFEELYTEDLLELHELELKRLKEEY 337
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
860-1346 |
4.76e-03 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 41.81 E-value: 4.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 860 EKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIkhekdaivcvqQEKDQKLLEMEKi 939
Cdd:PRK01156 269 LEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKY-----------HAIIKKLSVLQK- 336
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 940 MHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAefqcLEQNHLKeledtlhIRHTQEFEKVMTDHNMSLEKLKKENQQ 1019
Cdd:PRK01156 337 DYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKS----IESLKKK-------IEEYSKNIERMSAFISEILKIQEIDPD 405
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1020 RIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAE-----------TDEIKILLEESRTQqkemlKSL 1088
Cdd:PRK01156 406 AIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQsvcpvcgttlgEEKSNHIINHYNEK-----KSR 480
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1089 LEQETENLRTEISKLNQKIHD--------------NNESYQVGLSELRALMTIEKDQcISELISRHEEESNILkaelDNV 1154
Cdd:PRK01156 481 LEEKIREIEIEVKDIDEKIVDlkkrkeyleseeinKSINEYNKIESARADLEDIKIK-INELKDKHDKYEEIK----NRY 555
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1155 TSLHRQayEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQqeekyealIQNLEKDKERLVKNHEQDKEH-------LI 1227
Cdd:PRK01156 556 KSLKLE--DLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQ--------LNDLESRLQEIEIGFPDDKSYidksireIE 625
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1228 QELNFEKNKavQTALDEFKVERElvekELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQ-E 1306
Cdd:PRK01156 626 NEANNLNNK--YNEIQENKILIE----KLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKANRaR 699
|
490 500 510 520
....*....|....*....|....*....|....*....|.
gi 226693374 1307 KRKNEEMQNVRTSLIAEQQTNFNTVLtrEKMRK-ENIINDL 1346
Cdd:PRK01156 700 LESTIEILRTRINELSDRINDINETL--ESMKKiKKAIGDL 738
|
|
| PRK04778 |
PRK04778 |
septation ring formation regulator EzrA; Provisional |
934-1320 |
4.78e-03 |
|
septation ring formation regulator EzrA; Provisional
Pssm-ID: 179877 [Multi-domain] Cd Length: 569 Bit Score: 41.36 E-value: 4.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 934 LEMEKImhtqhcEIKELKQSREmaLEDLKKLH-----DEKIESLRAEFQCLEQNHLKELEDTLhirhtqeFEKVMTDHNM 1008
Cdd:PRK04778 34 LEERKQ------ELENLPVNDE--LEKVKKLNltgqsEEKFEEWRQKWDEIVTNSLPDIEEQL-------FEAEELNDKF 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKENQQrIDQMLESHASTIQEKEQQLQELKlkvsdlsdmrcklEVElalkEAETDEIKILLEESRTQQKEMLK-- 1086
Cdd:PRK04778 99 RFRKAKHEINE-IESLLDLIEEDIEQILEELQELL-------------ESE----EKNREEVEQLKDLYRELRKSLLAnr 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1087 -------SLLEQETENLRTEISKLNQKIHDNN--ESYQVgLSELRALMT-IEKDqcISELISRHEEESNILKAELDNVTS 1156
Cdd:PRK04778 161 fsfgpalDELEKQLENLEEEFSQFVELTESGDyvEAREI-LDQLEEELAaLEQI--MEEIPELLKELQTELPDQLQELKA 237
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1157 LHRQAYE-----IEKKLKEQIVELQTRLNSELSALEKQK----DEKITQQEEKYEALIQNLEKD---KERLVKNHEQDKE 1224
Cdd:PRK04778 238 GYRELVEegyhlDHLDIEKEIQDLKEQIDENLALLEELDldeaEEKNEEIQERIDQLYDILEREvkaRKYVEKNSDTLPD 317
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1225 HLiQELNfEKNKAVQTALDEFKVERELVEKELlEKVKHLENQIAK--------TPAFESAREDSSSLVAELQ-------- 1288
Cdd:PRK04778 318 FL-EHAK-EQNKELKEEIDRVKQSYTLNESEL-ESVRQLEKQLESlekqydeiTERIAEQEIAYSELQEELEeilkqlee 394
|
410 420 430
....*....|....*....|....*....|...
gi 226693374 1289 -EKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSL 1320
Cdd:PRK04778 395 iEKEQEKLSEMLQGLRKDELEAREKLERYRNKL 427
|
|
| mS26_Tt |
cd23695 |
Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is ... |
1065-1371 |
5.30e-03 |
|
Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is a component of small subunit (SSU) in Tetrahymena thermophila mitochondrial ribosome (mitoribosome). The structure of the mitoribosome reveals an assembly of 94-ribosomal proteins and four-rRNAs with an additional protein mass of ~700 kDa on the small subunit; the large mitoribosomal subunit (LSU) lacks 5S rRNA.
Pssm-ID: 467909 [Multi-domain] Cd Length: 496 Bit Score: 41.35 E-value: 5.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1065 AETDEIKILLEESRTQQKEMLKSLLEQETENLRTEI--------SK-------LNQKIHDN---------NESYQVGLSE 1120
Cdd:cd23695 65 ATQNHIKLLEKKSVKKEENERKYLLEQDVKAMNKKIildvmneeSKnwinlqnMNEKINPNlilpdtildETSYYLKLQE 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1121 LRALMTIEKDQCISELISrhEEESNILKAEL------DNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKI 1194
Cdd:cd23695 145 LAFLFEQGDHEEMDKLLD--ENEEIEYKNSLlmpiyqDLKSLIKHLKYTELFKLLKEYQDAKAIIIEDFRESSEEGAEKL 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1195 TQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELN-FEK-----------NKAVQTALDEFKVERELVEKELLEKVKH 1262
Cdd:cd23695 223 EKLEKAFATLLKNYKEELEEPEKQLEFMQKRLLDLYNlLRLwgqyitivkmpDSVVRDIMNKTQARPEVAKLNSKQELED 302
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1263 LENQIAKTPAFE--------SAREDSSSLVAELQEKLQEEKAKFLEQLEEQE---------KRKNEEMQNVRTSLIAEQQ 1325
Cdd:cd23695 303 AKNRKRDTEENEfdddyesaDEGETSDEEDEIEEENFQLQKEKKKEEELNAEfniaknslyKFSPQNDKNVVDDRDFYSG 382
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 226693374 1326 TNFNTVLTREKMRKENIINDLSDKlksTMQQQERDKDLIESLSEDR 1371
Cdd:cd23695 383 VDLENVFPRALLNNLNDFTGLDFQ---NVKEILNNEEKLKIIQGED 425
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
859-1185 |
5.64e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 41.03 E-value: 5.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 859 KEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEK 938
Cdd:pfam07888 57 REKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSASSEELSEEKDALLAQRAAHEARIRELEE 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 939 IMHT-------QHCEIKELKQSREMALEDLKKLHDEKiESLRAEFQCLEQ------NHLKELEDTLHIRHTQEFekVMTD 1005
Cdd:pfam07888 137 DIKTltqrvleRETELERMKERAKKAGAQRKEEEAER-KQLQAKLQQTEEelrslsKEFQELRNSLAQRDTQVL--QLQD 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1006 HNMSLEKLKKENQQRIDQMLESHA--STIQEK----EQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRT 1079
Cdd:pfam07888 214 TITTLTQKLTTAHRKEAENEALLEelRSLQERlnasERKVEGLGEELSSMAAQRDRTQAELHQARLQAAQLTLQLADASL 293
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1080 QQKEMlKSLLEQETENLRT-------EISKLNQKIHDNNESYQVGLSE---LRALMTIEKDqCISELISRHEEESNILKA 1149
Cdd:pfam07888 294 ALREG-RARWAQERETLQQsaeadkdRIEKLSAELQRLEERLQEERMErekLEVELGREKD-CNRVQLSESRRELQELKA 371
|
330 340 350
....*....|....*....|....*....|....*.
gi 226693374 1150 ELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSA 1185
Cdd:pfam07888 372 SLRVAQKEKEQLQAEKQELLEYIRQLEQRLETVADA 407
|
|
| CCDC73 |
pfam15818 |
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ... |
826-1045 |
5.93e-03 |
|
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.
Pssm-ID: 464893 [Multi-domain] Cd Length: 1048 Bit Score: 41.47 E-value: 5.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 826 QKEQCDLANYLKCTAVEIrnIIEKVKC-------SLEITLKEKHQQELQS-LKIEYEckldaLVKDSEENVNKILKLKEN 897
Cdd:pfam15818 182 ESEICSLKKELKKVTSDL--IKSKVTCqykmgeeNINLTIKEQKFQELQErLNMELE-----LNKKINEEITHIQEEKQD 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 898 LVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKL-----LEMEKIMHTQHcEIKELKQSREMALEDLKKlhdeKIESL 972
Cdd:pfam15818 255 IIISFQHMQQLLQQQTQANTEMEAELKALKENNQTLerdneLQREKVKENEE-KFLNLQNEHEKALGTWKK----HVEEL 329
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374 973 RAEFQCL--EQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLK 1045
Cdd:pfam15818 330 NGEINEIknELSSLKETHIKLQEHYNKLCNQKKFEEDKKFQNVPEVNNENSEMSTEKSENLIIQKYNSEQEIREE 404
|
|
| Apolipoprotein |
pfam01442 |
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ... |
1152-1308 |
6.54e-03 |
|
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.
Pssm-ID: 460211 [Multi-domain] Cd Length: 175 Bit Score: 39.17 E-value: 6.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1152 DNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELN 1231
Cdd:pfam01442 4 DSLDELSTYAEELQEQLGPVAQELVDRLEKETEALRERLQKDLEEVRAKLEPYLEELQAKLGQNVEELRQRLEPYTEELR 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 226693374 1232 FEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAktPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKR 1308
Cdd:pfam01442 84 KRLNADAEELQEKLAPYGEELRERLEQNVDALRARLA--PYAEELRQKLAERLEELKESLAPYAEEVQAQLSQRLQE 158
|
|
| SHE3 |
pfam17078 |
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ... |
1138-1351 |
6.58e-03 |
|
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.
Pssm-ID: 293683 [Multi-domain] Cd Length: 228 Bit Score: 40.11 E-value: 6.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1138 SRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNsELSALEKQKD---EKITQQEEKYEALIQNLEKDKER 1214
Cdd:pfam17078 41 SKFLENLASLKHENDNLSSMLNRKERRLKDLEDQLSELKNSYE-ELTESNKQLKkrlENSSASETTLEAELERLQIQYDA 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1215 LVKNHEQDKEHLIQELNfeknkAVQTALDEFKVERElvekellekvKHLENQIAKTpafesaredsSSLVAELQEKLQEE 1294
Cdd:pfam17078 120 LVDSQNEYKDHYQQEIN-----TLQESLEDLKLENE----------KQLENYQQRI----------SSNDKDIDTKLDSY 174
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 226693374 1295 KAKFlEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREkmrKENIINDLSDKLK 1351
Cdd:pfam17078 175 NNKF-KNLDNIYVNKNNKLLTKLDSLAQLLDLPSWLNLYPE---SRNKILEYAEKME 227
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
1038-1467 |
6.92e-03 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 40.88 E-value: 6.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1038 QLQELKLKVSDLSDmrcklevELALKEAETDEIKILLEESRTQQKEMLKSLLEQETEnLRTEISKLNQKIHDNNESYQVG 1117
Cdd:pfam05557 3 ELIESKARLSQLQN-------EKKQMELEHKRARIELEKKASALKRQLDRESDRNQE-LQKRIRLLEKREAEAEEALREQ 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1118 LSELRALMTIEkdqcisELISRHEEESNILKAEL-DNVTSLHRQAYEIEKKLKEQIVELQTrLNSELSALEKQKDEKITQ 1196
Cdd:pfam05557 75 AELNRLKKKYL------EALNKKLNEKESQLADArEVISCLKNELSELRRQIQRAELELQS-TNSELEELQERLDLLKAK 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1197 QEEkYEALIQNLEKDKERLvknheQDKEHLIQELNFEKNKAVQTALDEFKVERELV-----EKE---LLEKVKHLENQI- 1267
Cdd:pfam05557 148 ASE-AEQLRQNLEKQQSSL-----AEAEQRIKELEFEIQSQEQDSEIVKNSKSELAripelEKElerLREHNKHLNENIe 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1268 -------------AKTPAFESAREDSSSLVAELQEKLQEEKA-KFLEQLEEQEKRKNEEmqnVRTSLIAEQQTNF----- 1328
Cdd:pfam05557 222 nklllkeevedlkRKLEREEKYREEAATLELEKEKLEQELQSwVKLAQDTGLNLRSPED---LSRRIEQLQQREIvlkee 298
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1329 NTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVSKLRtsSFLSSAPVAAAPELYGa 1408
Cdd:pfam05557 299 NSSLTSSARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLTKERDGYR--AILESYDKELTMSNYS- 375
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 226693374 1409 caPELPGEPERSVMETADEGRLDSAMETSMmSVQENMLSEEKQRIMLLERTLQLKEEEN 1467
Cdd:pfam05557 376 --PQLLERIEEAEDMTQKMQAHNEEMEAQL-SVAEEELGGYKQQAQTLERELQALRQQE 431
|
|
| Metal_resist |
pfam13801 |
Heavy-metal resistance; This is a metal-binding protein which is involved in resistance to ... |
1224-1302 |
7.35e-03 |
|
Heavy-metal resistance; This is a metal-binding protein which is involved in resistance to heavy-metal ions. The protein forms a four-helix hooked hairpin, consisting of two long alpha helices each flanked by a shorter alpha helix. It binds a metal ion in a type-2 like centre. It contains two copies of an LTXXQ motif.
Pssm-ID: 433488 [Multi-domain] Cd Length: 119 Bit Score: 38.04 E-value: 7.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1224 EHLIQELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTP----AFESAREDSSSLVAELQEKLQEEKAKFL 1299
Cdd:pfam13801 33 LRAALGLPAEQRERLRAALRDHARELRALRRELRAARRELAALLAAPPfdpaAIEAALAEARQARAALQAQIEEALLEFA 112
|
...
gi 226693374 1300 EQL 1302
Cdd:pfam13801 113 ATL 115
|
|
| PKK |
pfam12474 |
Polo kinase kinase; This domain family is found in eukaryotes, and is approximately 140 amino ... |
1196-1324 |
7.65e-03 |
|
Polo kinase kinase; This domain family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam00069. Polo-like kinase 1 (Plx1) is essential during mitosis for the activation of Cdc25C, for spindle assembly, and for cyclin B degradation. This family is Polo kinase kinase (PKK) which phosphorylates Polo kinase and Polo-like kinase to activate them. PKK is a serine/threonine kinase.
Pssm-ID: 463600 [Multi-domain] Cd Length: 139 Bit Score: 38.31 E-value: 7.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1196 QQEEKYEALIQNL----EKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTP 1271
Cdd:pfam12474 7 QQKDRFEQERQQLkkryEKELEQLERQQKQQIEKLEQRQTQELRRLPKRIRAEQKKRLKMFRESLKQEKKELKQEVEKLP 86
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1272 AFEsAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQ 1324
Cdd:pfam12474 87 KFQ-RKEAKRQRKEELELEQKHEELEFLQAQSEALERELQQLQNEKRKELAEH 138
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
1005-1268 |
8.44e-03 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 40.68 E-value: 8.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1005 DHNMSLEKLKkENQQRIDQMLeSHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAEtDEIKILLEEsRTQQKEM 1084
Cdd:PRK05771 40 LSNERLRKLR-SLLTKLSEAL-DKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIE-KEIKELEEE-ISELENE 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1085 LKSLLEQETE-----NLRTEISKLNqkihdNNESYQVGLSELRA----LMTIEKDQCISELISRHEEESNILkaeldnVT 1155
Cdd:PRK05771 116 IKELEQEIERlepwgNFDLDLSLLL-----GFKYVSVFVGTVPEdkleELKLESDVENVEYISTDKGYVYVV------VV 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1156 SLHRQAYEIEKKLKE-QIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEK 1234
Cdd:PRK05771 185 VLKELSDEVEEELKKlGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAE 264
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 226693374 1235 nkavqtALDEFKVEREL------VEKELLEKVKHLENQIA 1268
Cdd:PRK05771 265 ------ALSKFLKTDKTfaiegwVPEDRVKKLKELIDKAT 298
|
|
| OmpH |
smart00935 |
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ... |
1178-1269 |
8.58e-03 |
|
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.
Pssm-ID: 214922 [Multi-domain] Cd Length: 140 Bit Score: 38.33 E-value: 8.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1178 RLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELL 1257
Cdd:smart00935 18 AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQKIL 97
|
90
....*....|..
gi 226693374 1258 EKVKHLENQIAK 1269
Cdd:smart00935 98 DKINKAIKEVAK 109
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
754-1223 |
8.62e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 41.19 E-value: 8.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 754 QSPEMMVESLYSSVINA---------IDSRRMQDTSTRGNEGFGDRAalhvqlEKCRAAAQDSH--SSIQTIKDDLCHFR 822
Cdd:TIGR01612 1258 KSPEIENEMGIEMDIKAemetfnishDDDKDHHIISKKHDENISDIR------EKSLKIIEDFSeeSDINDIKKELQKNL 1331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 823 TFVQKEQCDLANYL-KCTAV-------EIRNIIEKVK------------------CSLEITLKEKHQQELQSLKIEYECK 876
Cdd:TIGR01612 1332 LDAQKHNSDINLYLnEIANIynilklnKIKKIIDEVKeytkeieennknikdeldKSEKLIKKIKDDINLEECKSKIEST 1411
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 877 LDAlvKDSEENVNKILKLKENLVSLEealQNKDNEFTSIKHEKDAI------VCVQQEKDQKLLEMEKIMHTQHCE--IK 948
Cdd:TIGR01612 1412 LDD--KDIDECIKKIKELKNHILSEE---SNIDTYFKNADENNENVlllfknIEMADNKSQHILKIKKDNATNDHDfnIN 1486
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 949 ELKQSREMAledlKKLHDE------KIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQRID 1022
Cdd:TIGR01612 1487 ELKEHIDKS----KGCKDEadknakAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKFIL 1562
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1023 QMLESHASTIQEKEQQ-----------------------LQELKLKVSDLSDMRCKleVELALKEAETDEIKILLEESRT 1079
Cdd:TIGR01612 1563 EAEKSEQKIKEIKKEKfrieddaakndksnkaaidiqlsLENFENKFLKISDIKKK--INDCLKETESIEKKISSFSIDS 1640
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1080 QQKEM-------------LKSLLEQ---------ETENLRTEISKLNQKIHDNNESYQVGLSEL---RALMTIEKDQCIS 1134
Cdd:TIGR01612 1641 QDTELkengdnlnslqefLESLKDQkkniedkkkELDELDSEIEKIEIDVDQHKKNYEIGIIEKikeIAIANKEEIESIK 1720
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1135 ELISRHEEE--SNILKAELDNV-TSLHRQAYEIE-KKLKEQIVELQTRLNSELSALEKQ-------KDEKITQQEE---- 1199
Cdd:TIGR01612 1721 ELIEPTIENliSSFNTNDLEGIdPNEKLEEYNTEiGDIYEEFIELYNIIAGCLETVSKEpitydeiKNTRINAQNEflki 1800
|
570 580
....*....|....*....|....*....
gi 226693374 1200 -----KYEALIQNLEKDKERLVKNHEQDK 1223
Cdd:TIGR01612 1801 ieiekKSKSYLDDIEAKEFDRIINHFKKK 1829
|
|
| PRK00106 |
PRK00106 |
ribonuclease Y; |
1193-1357 |
8.64e-03 |
|
ribonuclease Y;
Pssm-ID: 178867 [Multi-domain] Cd Length: 535 Bit Score: 40.62 E-value: 8.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1193 KITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVE----RELVEKEL---LEKVKHLEN 1265
Cdd:PRK00106 25 KMKSAKEAAELTLLNAEQEAVNLRGKAERDAEHIKKTAKRESKALKKELLLEAKEEarkyREEIEQEFkseRQELKQIES 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1266 QIAKTpAFESARED----SSSLVAELQEKLQEEKAKFL----EQLEEQEKRKNEEMQNVRTSLIAEQQ----TNFNTVLT 1333
Cdd:PRK00106 105 RLTER-ATSLDRKDenlsSKEKTLESKEQSLTDKSKHIdereEQVEKLEEQKKAELERVAALSQAEAReiilAETENKLT 183
|
170 180
....*....|....*....|....*...
gi 226693374 1334 REKMRK----ENIINDLSDKLKSTMQQQ 1357
Cdd:PRK00106 184 HEIATRireaEREVKDRSDKMAKDLLAQ 211
|
|
|