NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|226693374|ref|NP_033956|]
View 

RB1-inducible coiled-coil protein 1 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Ubl_RB1CC1 cd17060
ubiquitin-like (Ubl) domain found in retinoblastoma 1-inducible coiled-coil protein 1 (RB1CC1) ...
2-76 3.41e-40

ubiquitin-like (Ubl) domain found in retinoblastoma 1-inducible coiled-coil protein 1 (RB1CC1) and similar proteins; RB1CC1, also termed FAK family kinase-interacting protein of 200 kDa (FIP200), is the mammalian counterpart of the yeast Atg17 gene and functions as a component of the ULK1/Atg13/RB1CC1/Atg101 complex essential for induction of autophagy. RB1CC1 is a key signaling node to regulate cellular proliferation and differentiation. As a DNA-binding transcription factor, RB1CC1 has been implicated in the regulation of retinoblastoma 1 (RB1) expression. RB1CC1 contains a conserved ubiquitin-like (Ubl) domain with a beta-grasp Ubl fold, as well as a nuclear localization signal (KPRK), a leucine zipper motif and a coiled-coil structure.


:

Pssm-ID: 340580  Cd Length: 75  Bit Score: 143.09  E-value: 3.41e-40
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374    2 KLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQVLVVNGGECMAADRRVCTYSAGTDTNPIFLFNK 76
Cdd:cd17060     1 MLYVFLVNTGTMLTFDMSLAVQSVSDLKEAIEEKTKIPVEKQVLLVSGGESLDPDKRVSSYSAGTDTNPIFLFNK 75
ATG11 pfam10377
Autophagy-related protein 11; The function of this family is conflicting. In the fission yeast, ...
1456-1582 1.12e-26

Autophagy-related protein 11; The function of this family is conflicting. In the fission yeast, Schizosaccharomyces pombe, this protein has been shown to interact with the telomere cap complex and it is also required for dimerization and kinase activation of Atg1. However, in budding yeast, Saccharomyces cerevisiae, this protein is called ATG11 and is shown to be involved in autophagy.


:

Pssm-ID: 463063  Cd Length: 130  Bit Score: 106.53  E-value: 1.12e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1456 LERTLQlKEEENKRlnqrlmsQSLSSVSSRHSEKIAIRDFQVGDLVLIILDERHDN-----YVLFTVSPTLYFLHSESLP 1530
Cdd:pfam10377   14 LAKKLT-KENKSKR-------EKLEKLQSEAHEKITLKNFKVGDLALFLPTRRHNDlstppWAAFNVGAPHYFLKADSLL 85
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 226693374  1531 ALdlkpgegasgaSRRPWVLGKVMEKEYCQAK---KAQNRFKVPLGTKFYRVKAV 1582
Cdd:pfam10377   86 AL-----------KSREWFVGRITSIEERVVDlsdKDSNPFKLSKGTVWYLVTAE 129
Mplasa_alph_rch super family cl37461
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
827-1317 6.85e-14

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


The actual alignment was detected with superfamily member TIGR04523:

Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 76.98  E-value: 6.85e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   827 KEQCDLANYLKCTAVEIRNIIEKVKCSLEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNK----ILKLKENLVSLE 902
Cdd:TIGR04523  166 KQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQISELKKQNNQlkdnIEKKQQEINEKT 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   903 EALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKqsreMALEDLKKLHDEKIESLRAEFQCLEQN 982
Cdd:TIGR04523  246 TEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLK----SEISDLNNQKEQDWNKELKSELKNQEK 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   983 HLKELEDTLhirhtQEFEKVMTDHNMSLEKLKKE------NQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKL 1056
Cdd:TIGR04523  322 KLEEIQNQI-----SQNNKIISQLNEQISQLKKEltnsesENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDL 396
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1057 EVELALKEAETDEIkilleESRTQQKEMLKSLLEQETENLRTEISKLNQKIHD-NNESYQVGLS--ELRALMTIEKDQCI 1133
Cdd:TIGR04523  397 ESKIQNQEKLNQQK-----DEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDlTNQDSVKELIikNLDNTRESLETQLK 471
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1134 SELISRHEEESN--ILKAELDNVTSLHRQAYEIEKKLKEQIVELqTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKD 1211
Cdd:TIGR04523  472 VLSRSINKIKQNleQKQKELKSKEKELKKLNEEKKELEEKVKDL-TKKISSLKEKIEKLESEKKEKESKISDLEDELNKD 550
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1212 KERLVKNH-EQDKEHLIQELnfEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEK 1290
Cdd:TIGR04523  551 DFELKKENlEKEIDEKNKEI--EELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEK 628
                          490       500       510
                   ....*....|....*....|....*....|
gi 226693374  1291 LQEEKAKF---LEQLEEQEKRKNEEMQNVR 1317
Cdd:TIGR04523  629 LSSIIKNIkskKNKLKQEVKQIKETIKEIR 658
ATG17_like super family cl27196
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
312-537 4.98e-04

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


The actual alignment was detected with superfamily member pfam04108:

Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 44.30  E-value: 4.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   312 NVQDRPNDVESLVRKCFDSMSRLDPKIIQPFMLECHQTIAKLDNQNMKAikgLEDRLYALDQMIASCSRLVNEQKELAQG 391
Cdd:pfam04108  127 ELQAAQESLDSDLKRFDDDLRDLQKELESLSSPSESISLIPTLLKELES---LEEEMASLLESLTNHYDQCVTAVKLTEG 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   392 FLANqMRAENLKDASVLPDLCLSHANQLMIMLQNHRKLLDIKQKCTTAKQELANNLHV------RLKWCCFVMLHADQDG 465
Cdd:pfam04108  204 GRAE-MLEVLENDARELDDVVPELQDRLDEMENNYERLQKLLEQKNSLIDELLSALQLiaeiqsRLPEYLAALKEFEERW 282
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 226693374   466 EKLQALLRLVIELLERVRivEALSTVPQMYCLAVVEVVRRKMFIKHYREWAGALVKDGKQLYEAEKSKRESF 537
Cdd:pfam04108  283 EEEKETIEDYLSELEDLR--EFYEGFPSAYGSLLLEVERRREWAEKMKKILRKLAEELDRLQEEERKRREKF 352
 
Name Accession Description Interval E-value
Ubl_RB1CC1 cd17060
ubiquitin-like (Ubl) domain found in retinoblastoma 1-inducible coiled-coil protein 1 (RB1CC1) ...
2-76 3.41e-40

ubiquitin-like (Ubl) domain found in retinoblastoma 1-inducible coiled-coil protein 1 (RB1CC1) and similar proteins; RB1CC1, also termed FAK family kinase-interacting protein of 200 kDa (FIP200), is the mammalian counterpart of the yeast Atg17 gene and functions as a component of the ULK1/Atg13/RB1CC1/Atg101 complex essential for induction of autophagy. RB1CC1 is a key signaling node to regulate cellular proliferation and differentiation. As a DNA-binding transcription factor, RB1CC1 has been implicated in the regulation of retinoblastoma 1 (RB1) expression. RB1CC1 contains a conserved ubiquitin-like (Ubl) domain with a beta-grasp Ubl fold, as well as a nuclear localization signal (KPRK), a leucine zipper motif and a coiled-coil structure.


Pssm-ID: 340580  Cd Length: 75  Bit Score: 143.09  E-value: 3.41e-40
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374    2 KLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQVLVVNGGECMAADRRVCTYSAGTDTNPIFLFNK 76
Cdd:cd17060     1 MLYVFLVNTGTMLTFDMSLAVQSVSDLKEAIEEKTKIPVEKQVLLVSGGESLDPDKRVSSYSAGTDTNPIFLFNK 75
ATG11 pfam10377
Autophagy-related protein 11; The function of this family is conflicting. In the fission yeast, ...
1456-1582 1.12e-26

Autophagy-related protein 11; The function of this family is conflicting. In the fission yeast, Schizosaccharomyces pombe, this protein has been shown to interact with the telomere cap complex and it is also required for dimerization and kinase activation of Atg1. However, in budding yeast, Saccharomyces cerevisiae, this protein is called ATG11 and is shown to be involved in autophagy.


Pssm-ID: 463063  Cd Length: 130  Bit Score: 106.53  E-value: 1.12e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1456 LERTLQlKEEENKRlnqrlmsQSLSSVSSRHSEKIAIRDFQVGDLVLIILDERHDN-----YVLFTVSPTLYFLHSESLP 1530
Cdd:pfam10377   14 LAKKLT-KENKSKR-------EKLEKLQSEAHEKITLKNFKVGDLALFLPTRRHNDlstppWAAFNVGAPHYFLKADSLL 85
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 226693374  1531 ALdlkpgegasgaSRRPWVLGKVMEKEYCQAK---KAQNRFKVPLGTKFYRVKAV 1582
Cdd:pfam10377   86 AL-----------KSREWFVGRITSIEERVVDlsdKDSNPFKLSKGTVWYLVTAE 129
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
827-1317 6.85e-14

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 76.98  E-value: 6.85e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   827 KEQCDLANYLKCTAVEIRNIIEKVKCSLEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNK----ILKLKENLVSLE 902
Cdd:TIGR04523  166 KQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQISELKKQNNQlkdnIEKKQQEINEKT 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   903 EALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKqsreMALEDLKKLHDEKIESLRAEFQCLEQN 982
Cdd:TIGR04523  246 TEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLK----SEISDLNNQKEQDWNKELKSELKNQEK 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   983 HLKELEDTLhirhtQEFEKVMTDHNMSLEKLKKE------NQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKL 1056
Cdd:TIGR04523  322 KLEEIQNQI-----SQNNKIISQLNEQISQLKKEltnsesENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDL 396
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1057 EVELALKEAETDEIkilleESRTQQKEMLKSLLEQETENLRTEISKLNQKIHD-NNESYQVGLS--ELRALMTIEKDQCI 1133
Cdd:TIGR04523  397 ESKIQNQEKLNQQK-----DEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDlTNQDSVKELIikNLDNTRESLETQLK 471
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1134 SELISRHEEESN--ILKAELDNVTSLHRQAYEIEKKLKEQIVELqTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKD 1211
Cdd:TIGR04523  472 VLSRSINKIKQNleQKQKELKSKEKELKKLNEEKKELEEKVKDL-TKKISSLKEKIEKLESEKKEKESKISDLEDELNKD 550
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1212 KERLVKNH-EQDKEHLIQELnfEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEK 1290
Cdd:TIGR04523  551 DFELKKENlEKEIDEKNKEI--EELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEK 628
                          490       500       510
                   ....*....|....*....|....*....|
gi 226693374  1291 LQEEKAKF---LEQLEEQEKRKNEEMQNVR 1317
Cdd:TIGR04523  629 LSSIIKNIkskKNKLKQEVKQIKETIKEIR 658
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
795-1373 1.51e-13

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 76.31  E-value: 1.51e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   795 VQLEKCRAAAQDSHSSIQTIkddlcHFRTF------VQKEQCDLANYLKCTAVEIRNIIEKVKCS----LEITLKEkHQQ 864
Cdd:pfam15921  194 VDFEEASGKKIYEHDSMSTM-----HFRSLgsaiskILRELDTEISYLKGRIFPVEDQLEALKSEsqnkIELLLQQ-HQD 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   865 ELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFtsIKHEKDAIVCVQQEKDQ----------KLL 934
Cdd:pfam15921  268 RIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMY--MRQLSDLESTVSQLRSElreakrmyedKIE 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   935 EMEKIMHTQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFqcleqnHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLK 1014
Cdd:pfam15921  346 ELEKQLVLANSELTEARTERDQFSQESGNL-DDQLQKLLADL------HKREKELSLEKEQNKRLWDRDTGNSITIDHLR 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1015 KE------NQQRIDQMLESHASTIQ-EKEQQLQELKLKVSDL---SDMRCKLEVELALKEAETDEI---KILLEESRTQQ 1081
Cdd:pfam15921  419 RElddrnmEVQRLEALLKAMKSECQgQMERQMAAIQGKNESLekvSSLTAQLESTKEMLRKVVEELtakKMTLESSERTV 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1082 KEMLKSLLEQE--TENLRTEISKLNQKI--------HDNNE-----SYQVGLSELRALMTiEKDQCISELISRHEEESNI 1146
Cdd:pfam15921  499 SDLTASLQEKEraIEATNAEITKLRSRVdlklqelqHLKNEgdhlrNVQTECEALKLQMA-EKDKVIEILRQQIENMTQL 577
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1147 LKAELDNVTSLHRQAYEIEKKLKEQIVELQtrlnsELSALEKQKDEKITQQEEKYEAL------IQNLEKDKERLVKNHE 1220
Cdd:pfam15921  578 VGQHGRTAGAMQVEKAQLEKEINDRRLELQ-----EFKILKDKKDAKIRELEARVSDLelekvkLVNAGSERLRAVKDIK 652
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1221 QDKEHLIQELNFEKNKaVQTALDEFKV----------ERELVEKELLEKVKHLENQIAKT----PAFESAREDSSSLVAE 1286
Cdd:pfam15921  653 QERDQLLNEVKTSRNE-LNSLSEDYEVlkrnfrnkseEMETTTNKLKMQLKSAQSELEQTrntlKSMEGSDGHAMKVAMG 731
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1287 LQEKLQ---------EEKAKFLEQ-----------LEEQEKRKNEEMQNVRTSLiAEQQTNFNTVLTREKMRKENIINDL 1346
Cdd:pfam15921  732 MQKQITakrgqidalQSKIQFLEEamtnankekhfLKEEKNKLSQELSTVATEK-NKMAGELEVLRSQERRLKEKVANME 810
                          650       660
                   ....*....|....*....|....*..
gi 226693374  1347 SDKLKSTMQQQERdKDLIESLSEDRAR 1373
Cdd:pfam15921  811 VALDKASLQFAEC-QDIIQRQEQESVR 836
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1032-1325 7.44e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 73.82  E-value: 7.44e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1032 IQEKEQQLQELK------LKVSDLSDMRCKLEVELAL-------KEAETDEIKILLEESRTQQKEMLKSLLEQETENLRT 1098
Cdd:COG1196   195 LGELERQLEPLErqaekaERYRELKEELKELEAELLLlklreleAELEELEAELEELEAELEELEAELAELEAELEELRL 274
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1099 EISKLNQKIHDNNESYQVGLSELRALmtiEKDqciselISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTR 1178
Cdd:COG1196   275 ELEELELELEEAQAEEYELLAELARL---EQD------IARLEERRRELEERLEELEEELAELEEELEELEEELEELEEE 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1179 LNSELSALEkQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKN-KAVQTALDEFKVERELVEKELL 1257
Cdd:COG1196   346 LEEAEEELE-EAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQlEELEEAEEALLERLERLEEELE 424
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1258 EKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQ 1325
Cdd:COG1196   425 ELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAAR 492
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
880-1388 8.02e-12

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 70.48  E-value: 8.02e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  880 LVKDSEENVNKILKLKENLvslEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKI---MHTQHCEIKELKQSREM 956
Cdd:PRK03918  173 EIKRRIERLEKFIKRTENI---EELIKEKEKELEEVLREINEISSELPELREELEKLEKEvkeLEELKEEIEELEKELES 249
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  957 ALEDLKKLhDEKIESLRAEFQCLEQnHLKELEDTlhIRHTQEFEKVMTDHnMSLEKLK---KENQQRIDQMLESHASTIQ 1033
Cdd:PRK03918  250 LEGSKRKL-EEKIRELEERIEELKK-EIEELEEK--VKELKELKEKAEEY-IKLSEFYeeyLDELREIEKRLSRLEEEIN 324
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1034 EKEQQLQELKLKVSDLSDMRcKLEVELALKEAETDEIKILLEESRTQQKEM--LKS---------------LLEQETENL 1096
Cdd:PRK03918  325 GIEERIKELEEKEERLEELK-KKLKELEKRLEELEERHELYEEAKAKKEELerLKKrltgltpeklekeleELEKAKEEI 403
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1097 RTEISKLNQKIhdnNESYQVGLSELRALMTIEKDQ-----CISELISRHEEE-SNILKAELDNVTSLHRQAYEIEKKLKE 1170
Cdd:PRK03918  404 EEEISKITARI---GELKKEIKELKKAIEELKKAKgkcpvCGRELTEEHRKElLEEYTAELKRIEKELKEIEEKERKLRK 480
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1171 QIVELQTRLN--SELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQ----DKEHLIQELNFEKNKAVQTALDE 1244
Cdd:PRK03918  481 ELRELEKVLKkeSELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKliklKGEIKSLKKELEKLEELKKKLAE 560
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1245 FKVERELVEKELLEKVKHLENQiaktpAFESAREDSSSLvAELQ------------EKLQEEKAKFLEQLEEQEKRKNEE 1312
Cdd:PRK03918  561 LEKKLDELEEELAELLKELEEL-----GFESVEELEERL-KELEpfyneylelkdaEKELEREEKELKKLEEELDKAFEE 634
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1313 MQNVRTSL------IAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVS 1386
Cdd:PRK03918  635 LAETEKRLeelrkeLEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELE 714

                  ..
gi 226693374 1387 KL 1388
Cdd:PRK03918  715 KL 716
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1179-1489 5.37e-06

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 51.28  E-value: 5.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1179 LNSELSALEKQKDEKITQQEEKYEALIQnlekdkERLvknhEQDKEHLIQELNF-----EKNKAVQTALDEFKVERELVE 1253
Cdd:pfam17380  271 LNQLLHIVQHQKAVSERQQQEKFEKMEQ------ERL----RQEKEEKAREVERrrkleEAEKARQAEMDRQAAIYAEQE 340
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1254 KELLEKVKHLENQIAKTPAFESAREDSSSLVAELqeklqeEKAKFLEQLEEQEKRKNEEmqnVRTSLIAEQQTNFntvlt 1333
Cdd:pfam17380  341 RMAMERERELERIRQEERKRELERIRQEEIAMEI------SRMRELERLQMERQQKNER---VRQELEAARKVKI----- 406
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1334 REKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLL----EEKKQLEEEVSKLRTSSF----------LSSAPV 1399
Cdd:pfam17380  407 LEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMervrLEEQERQQQVERLRQQEEerkrkkleleKEKRDR 486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1400 AAAPELYGACAPELPGEPERSVMEtadEGRLDSAMETSMMSVQENMLSEEKQRIMLLERTLQLKEEENKRLNQRLMSQSL 1479
Cdd:pfam17380  487 KRAEEQRRKILEKELEERKQAMIE---EERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATE 563
                          330
                   ....*....|
gi 226693374  1480 SSVSSRHSEK 1489
Cdd:pfam17380  564 ERSRLEAMER 573
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
312-537 4.98e-04

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 44.30  E-value: 4.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   312 NVQDRPNDVESLVRKCFDSMSRLDPKIIQPFMLECHQTIAKLDNQNMKAikgLEDRLYALDQMIASCSRLVNEQKELAQG 391
Cdd:pfam04108  127 ELQAAQESLDSDLKRFDDDLRDLQKELESLSSPSESISLIPTLLKELES---LEEEMASLLESLTNHYDQCVTAVKLTEG 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   392 FLANqMRAENLKDASVLPDLCLSHANQLMIMLQNHRKLLDIKQKCTTAKQELANNLHV------RLKWCCFVMLHADQDG 465
Cdd:pfam04108  204 GRAE-MLEVLENDARELDDVVPELQDRLDEMENNYERLQKLLEQKNSLIDELLSALQLiaeiqsRLPEYLAALKEFEERW 282
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 226693374   466 EKLQALLRLVIELLERVRivEALSTVPQMYCLAVVEVVRRKMFIKHYREWAGALVKDGKQLYEAEKSKRESF 537
Cdd:pfam04108  283 EEEKETIEDYLSELEDLR--EFYEGFPSAYGSLLLEVERRREWAEKMKKILRKLAEELDRLQEEERKRREKF 352
mS26_Tt cd23695
Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is ...
1065-1371 5.30e-03

Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is a component of small subunit (SSU) in Tetrahymena thermophila mitochondrial ribosome (mitoribosome). The structure of the mitoribosome reveals an assembly of 94-ribosomal proteins and four-rRNAs with an additional protein mass of ~700 kDa on the small subunit; the large mitoribosomal subunit (LSU) lacks 5S rRNA.


Pssm-ID: 467909 [Multi-domain]  Cd Length: 496  Bit Score: 41.35  E-value: 5.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1065 AETDEIKILLEESRTQQKEMLKSLLEQETENLRTEI--------SK-------LNQKIHDN---------NESYQVGLSE 1120
Cdd:cd23695    65 ATQNHIKLLEKKSVKKEENERKYLLEQDVKAMNKKIildvmneeSKnwinlqnMNEKINPNlilpdtildETSYYLKLQE 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1121 LRALMTIEKDQCISELISrhEEESNILKAEL------DNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKI 1194
Cdd:cd23695   145 LAFLFEQGDHEEMDKLLD--ENEEIEYKNSLlmpiyqDLKSLIKHLKYTELFKLLKEYQDAKAIIIEDFRESSEEGAEKL 222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1195 TQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELN-FEK-----------NKAVQTALDEFKVERELVEKELLEKVKH 1262
Cdd:cd23695   223 EKLEKAFATLLKNYKEELEEPEKQLEFMQKRLLDLYNlLRLwgqyitivkmpDSVVRDIMNKTQARPEVAKLNSKQELED 302
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1263 LENQIAKTPAFE--------SAREDSSSLVAELQEKLQEEKAKFLEQLEEQE---------KRKNEEMQNVRTSLIAEQQ 1325
Cdd:cd23695   303 AKNRKRDTEENEfdddyesaDEGETSDEEDEIEEENFQLQKEKKKEEELNAEfniaknslyKFSPQNDKNVVDDRDFYSG 382
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*.
gi 226693374 1326 TNFNTVLTREKMRKENIINDLSDKlksTMQQQERDKDLIESLSEDR 1371
Cdd:cd23695   383 VDLENVFPRALLNNLNDFTGLDFQ---NVKEILNNEEKLKIIQGED 425
OmpH smart00935
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1178-1269 8.58e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 214922 [Multi-domain]  Cd Length: 140  Bit Score: 38.33  E-value: 8.58e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   1178 RLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELL 1257
Cdd:smart00935   18 AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQKIL 97
                            90
                    ....*....|..
gi 226693374   1258 EKVKHLENQIAK 1269
Cdd:smart00935   98 DKINKAIKEVAK 109
 
Name Accession Description Interval E-value
Ubl_RB1CC1 cd17060
ubiquitin-like (Ubl) domain found in retinoblastoma 1-inducible coiled-coil protein 1 (RB1CC1) ...
2-76 3.41e-40

ubiquitin-like (Ubl) domain found in retinoblastoma 1-inducible coiled-coil protein 1 (RB1CC1) and similar proteins; RB1CC1, also termed FAK family kinase-interacting protein of 200 kDa (FIP200), is the mammalian counterpart of the yeast Atg17 gene and functions as a component of the ULK1/Atg13/RB1CC1/Atg101 complex essential for induction of autophagy. RB1CC1 is a key signaling node to regulate cellular proliferation and differentiation. As a DNA-binding transcription factor, RB1CC1 has been implicated in the regulation of retinoblastoma 1 (RB1) expression. RB1CC1 contains a conserved ubiquitin-like (Ubl) domain with a beta-grasp Ubl fold, as well as a nuclear localization signal (KPRK), a leucine zipper motif and a coiled-coil structure.


Pssm-ID: 340580  Cd Length: 75  Bit Score: 143.09  E-value: 3.41e-40
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374    2 KLYVFLVNTGTTLTFDTELTVQTVADLKHAIQSKYKIAIQHQVLVVNGGECMAADRRVCTYSAGTDTNPIFLFNK 76
Cdd:cd17060     1 MLYVFLVNTGTMLTFDMSLAVQSVSDLKEAIEEKTKIPVEKQVLLVSGGESLDPDKRVSSYSAGTDTNPIFLFNK 75
ATG11 pfam10377
Autophagy-related protein 11; The function of this family is conflicting. In the fission yeast, ...
1456-1582 1.12e-26

Autophagy-related protein 11; The function of this family is conflicting. In the fission yeast, Schizosaccharomyces pombe, this protein has been shown to interact with the telomere cap complex and it is also required for dimerization and kinase activation of Atg1. However, in budding yeast, Saccharomyces cerevisiae, this protein is called ATG11 and is shown to be involved in autophagy.


Pssm-ID: 463063  Cd Length: 130  Bit Score: 106.53  E-value: 1.12e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1456 LERTLQlKEEENKRlnqrlmsQSLSSVSSRHSEKIAIRDFQVGDLVLIILDERHDN-----YVLFTVSPTLYFLHSESLP 1530
Cdd:pfam10377   14 LAKKLT-KENKSKR-------EKLEKLQSEAHEKITLKNFKVGDLALFLPTRRHNDlstppWAAFNVGAPHYFLKADSLL 85
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 226693374  1531 ALdlkpgegasgaSRRPWVLGKVMEKEYCQAK---KAQNRFKVPLGTKFYRVKAV 1582
Cdd:pfam10377   86 AL-----------KSREWFVGRITSIEERVVDlsdKDSNPFKLSKGTVWYLVTAE 129
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
827-1317 6.85e-14

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 76.98  E-value: 6.85e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   827 KEQCDLANYLKCTAVEIRNIIEKVKCSLEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNK----ILKLKENLVSLE 902
Cdd:TIGR04523  166 KQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQISELKKQNNQlkdnIEKKQQEINEKT 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   903 EALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKqsreMALEDLKKLHDEKIESLRAEFQCLEQN 982
Cdd:TIGR04523  246 TEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLK----SEISDLNNQKEQDWNKELKSELKNQEK 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   983 HLKELEDTLhirhtQEFEKVMTDHNMSLEKLKKE------NQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKL 1056
Cdd:TIGR04523  322 KLEEIQNQI-----SQNNKIISQLNEQISQLKKEltnsesENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDL 396
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1057 EVELALKEAETDEIkilleESRTQQKEMLKSLLEQETENLRTEISKLNQKIHD-NNESYQVGLS--ELRALMTIEKDQCI 1133
Cdd:TIGR04523  397 ESKIQNQEKLNQQK-----DEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDlTNQDSVKELIikNLDNTRESLETQLK 471
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1134 SELISRHEEESN--ILKAELDNVTSLHRQAYEIEKKLKEQIVELqTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKD 1211
Cdd:TIGR04523  472 VLSRSINKIKQNleQKQKELKSKEKELKKLNEEKKELEEKVKDL-TKKISSLKEKIEKLESEKKEKESKISDLEDELNKD 550
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1212 KERLVKNH-EQDKEHLIQELnfEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEK 1290
Cdd:TIGR04523  551 DFELKKENlEKEIDEKNKEI--EELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEK 628
                          490       500       510
                   ....*....|....*....|....*....|
gi 226693374  1291 LQEEKAKF---LEQLEEQEKRKNEEMQNVR 1317
Cdd:TIGR04523  629 LSSIIKNIkskKNKLKQEVKQIKETIKEIR 658
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
795-1373 1.51e-13

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 76.31  E-value: 1.51e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   795 VQLEKCRAAAQDSHSSIQTIkddlcHFRTF------VQKEQCDLANYLKCTAVEIRNIIEKVKCS----LEITLKEkHQQ 864
Cdd:pfam15921  194 VDFEEASGKKIYEHDSMSTM-----HFRSLgsaiskILRELDTEISYLKGRIFPVEDQLEALKSEsqnkIELLLQQ-HQD 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   865 ELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFtsIKHEKDAIVCVQQEKDQ----------KLL 934
Cdd:pfam15921  268 RIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMY--MRQLSDLESTVSQLRSElreakrmyedKIE 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   935 EMEKIMHTQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFqcleqnHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLK 1014
Cdd:pfam15921  346 ELEKQLVLANSELTEARTERDQFSQESGNL-DDQLQKLLADL------HKREKELSLEKEQNKRLWDRDTGNSITIDHLR 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1015 KE------NQQRIDQMLESHASTIQ-EKEQQLQELKLKVSDL---SDMRCKLEVELALKEAETDEI---KILLEESRTQQ 1081
Cdd:pfam15921  419 RElddrnmEVQRLEALLKAMKSECQgQMERQMAAIQGKNESLekvSSLTAQLESTKEMLRKVVEELtakKMTLESSERTV 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1082 KEMLKSLLEQE--TENLRTEISKLNQKI--------HDNNE-----SYQVGLSELRALMTiEKDQCISELISRHEEESNI 1146
Cdd:pfam15921  499 SDLTASLQEKEraIEATNAEITKLRSRVdlklqelqHLKNEgdhlrNVQTECEALKLQMA-EKDKVIEILRQQIENMTQL 577
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1147 LKAELDNVTSLHRQAYEIEKKLKEQIVELQtrlnsELSALEKQKDEKITQQEEKYEAL------IQNLEKDKERLVKNHE 1220
Cdd:pfam15921  578 VGQHGRTAGAMQVEKAQLEKEINDRRLELQ-----EFKILKDKKDAKIRELEARVSDLelekvkLVNAGSERLRAVKDIK 652
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1221 QDKEHLIQELNFEKNKaVQTALDEFKV----------ERELVEKELLEKVKHLENQIAKT----PAFESAREDSSSLVAE 1286
Cdd:pfam15921  653 QERDQLLNEVKTSRNE-LNSLSEDYEVlkrnfrnkseEMETTTNKLKMQLKSAQSELEQTrntlKSMEGSDGHAMKVAMG 731
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1287 LQEKLQ---------EEKAKFLEQ-----------LEEQEKRKNEEMQNVRTSLiAEQQTNFNTVLTREKMRKENIINDL 1346
Cdd:pfam15921  732 MQKQITakrgqidalQSKIQFLEEamtnankekhfLKEEKNKLSQELSTVATEK-NKMAGELEVLRSQERRLKEKVANME 810
                          650       660
                   ....*....|....*....|....*..
gi 226693374  1347 SDKLKSTMQQQERdKDLIESLSEDRAR 1373
Cdd:pfam15921  811 VALDKASLQFAEC-QDIIQRQEQESVR 836
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
876-1370 2.55e-13

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 75.06  E-value: 2.55e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   876 KLDALVKDSEENVNKILKLKENL----VSLEEALQNKDNEFTSIKHE---KDAIVCVQQEKDQKLLEMEKimhtqhcEIK 948
Cdd:TIGR04523  149 KKEKELEKLNNKYNDLKKQKEELenelNLLEKEKLNIQKNIDKIKNKllkLELLLSNLKKKIQKNKSLES-------QIS 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   949 ELKQSREmALEDLKKLHDEKIESLRAEFQCLEQNhLKELEDtlhirhtqefekvmtdhnmSLEKLKKENQQRIDQmLESH 1028
Cdd:TIGR04523  222 ELKKQNN-QLKDNIEKKQQEINEKTTEISNTQTQ-LNQLKD-------------------EQNKIKKQLSEKQKE-LEQN 279
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1029 ASTIQEKEQQLQELKLKVSDLSdmrcklevelalKEAETDEIKILLEESRTQQKEmlKSLLEQETENLRTEISKLNQKIH 1108
Cdd:TIGR04523  280 NKKIKELEKQLNQLKSEISDLN------------NQKEQDWNKELKSELKNQEKK--LEEIQNQISQNNKIISQLNEQIS 345
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1109 D----------NNESYQVGLSE-LRALMTIEKD-QCISELISRHEEESNILKAELDNVTslhrqayEIEKKLKEQIVELQ 1176
Cdd:TIGR04523  346 QlkkeltnsesENSEKQRELEEkQNEIEKLKKEnQSYKQEIKNLESQINDLESKIQNQE-------KLNQQKDEQIKKLQ 418
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1177 TRLNSELSALEKQKDEKITQQEEkyealIQNLEKD---KERLVKNHEQDKEHL---IQELNFEKNKAVQTaLDEFKVERE 1250
Cdd:TIGR04523  419 QEKELLEKEIERLKETIIKNNSE-----IKDLTNQdsvKELIIKNLDNTRESLetqLKVLSRSINKIKQN-LEQKQKELK 492
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1251 LVEKELL---EKVKHLENQIAKTpafesarEDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIA----- 1322
Cdd:TIGR04523  493 SKEKELKklnEEKKELEEKVKDL-------TKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEkeide 565
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|...
gi 226693374  1323 -----EQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSED 1370
Cdd:TIGR04523  566 knkeiEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKE 618
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1032-1325 7.44e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 73.82  E-value: 7.44e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1032 IQEKEQQLQELK------LKVSDLSDMRCKLEVELAL-------KEAETDEIKILLEESRTQQKEMLKSLLEQETENLRT 1098
Cdd:COG1196   195 LGELERQLEPLErqaekaERYRELKEELKELEAELLLlklreleAELEELEAELEELEAELEELEAELAELEAELEELRL 274
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1099 EISKLNQKIHDNNESYQVGLSELRALmtiEKDqciselISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTR 1178
Cdd:COG1196   275 ELEELELELEEAQAEEYELLAELARL---EQD------IARLEERRRELEERLEELEEELAELEEELEELEEELEELEEE 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1179 LNSELSALEkQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKN-KAVQTALDEFKVERELVEKELL 1257
Cdd:COG1196   346 LEEAEEELE-EAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQlEELEEAEEALLERLERLEEELE 424
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1258 EKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQ 1325
Cdd:COG1196   425 ELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAAR 492
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
847-1318 7.78e-13

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 73.60  E-value: 7.78e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   847 IEKVKCSLE--ITLKEKHQQELQ-------SLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKH 917
Cdd:pfam05483  298 LEDIKMSLQrsMSTQKALEEDLQiatkticQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNED 377
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   918 EKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQsremALEDLKKLHDEK--IESLRAEFQCLEQN------------H 983
Cdd:pfam05483  378 QLKIITMELQKKSSELEEMTKFKNNKEVELEELKK----ILAEDEKLLDEKkqFEKIAEELKGKEQElifllqarekeiH 453
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   984 LKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQ---RIDQMLESHASTIQEKEQQLQELKLKVSDLSDmrCKLEVEL 1060
Cdd:pfam05483  454 DLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIEltaHCDKLLLENKELTQEASDMTLELKKHQEDIIN--CKKQEER 531
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1061 ALKEAETdeikilLEESRTQQKEMLKSLLEqETENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQCiselisrh 1140
Cdd:pfam05483  532 MLKQIEN------LEEKEMNLRDELESVRE-EFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKC-------- 596
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1141 eeesNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEkITQQEEKYEALIQNLEKDkerlVKNHE 1220
Cdd:pfam05483  597 ----NNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELE-LASAKQKFEEIIDNYQKE----IEDKK 667
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1221 QDKEHLIQELnfekNKAVQTALDEFKVERELVEK---------ELLEKVKH-----LENQIAKTPAFESAREDSSSLVAE 1286
Cdd:pfam05483  668 ISEEKLLEEV----EKAKAIADEAVKLQKEIDKRcqhkiaemvALMEKHKHqydkiIEERDSELGLYKNKEQEQSSAKAA 743
                          490       500       510
                   ....*....|....*....|....*....|....*....
gi 226693374  1287 LQEKLQEEKAKFLE-------QLEEQEKRKNEEMQNVRT 1318
Cdd:pfam05483  744 LEIELSNIKAELLSlkkqleiEKEEKEKLKMEAKENTAI 782
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1032-1318 1.43e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 73.17  E-value: 1.43e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1032 IQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAEtdeikilLEESRTQQKEMLKSLLEQETE--NLRTEISKLNQKI-- 1107
Cdd:TIGR02168  234 LEELREELEELQEELKEAEEELEELTAELQELEEK-------LEELRLEVSELEEEIEELQKElyALANEISRLEQQKqi 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1108 --------HDNNESYQVGLSELRAlmtieKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRL 1179
Cdd:TIGR02168  307 lrerlanlERQLEELEAQLEELES-----KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1180 NSELSALEKqkdekITQQEEKYEALIQNLEKDKERLVKNHEQdkehLIQELNFEKNKAVQTALDEFKVERELVEKELLEK 1259
Cdd:TIGR02168  382 ETLRSKVAQ-----LELQIASLNNEIERLEARLERLEDRRER----LQQEIEELLKKLEEAELKELQAELEELEEELEEL 452
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 226693374  1260 VKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKfLEQLEEQEKRKNEEMQNVRT 1318
Cdd:TIGR02168  453 QEELERLEEALEELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSEGVKA 510
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
880-1388 8.02e-12

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 70.48  E-value: 8.02e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  880 LVKDSEENVNKILKLKENLvslEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKI---MHTQHCEIKELKQSREM 956
Cdd:PRK03918  173 EIKRRIERLEKFIKRTENI---EELIKEKEKELEEVLREINEISSELPELREELEKLEKEvkeLEELKEEIEELEKELES 249
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  957 ALEDLKKLhDEKIESLRAEFQCLEQnHLKELEDTlhIRHTQEFEKVMTDHnMSLEKLK---KENQQRIDQMLESHASTIQ 1033
Cdd:PRK03918  250 LEGSKRKL-EEKIRELEERIEELKK-EIEELEEK--VKELKELKEKAEEY-IKLSEFYeeyLDELREIEKRLSRLEEEIN 324
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1034 EKEQQLQELKLKVSDLSDMRcKLEVELALKEAETDEIKILLEESRTQQKEM--LKS---------------LLEQETENL 1096
Cdd:PRK03918  325 GIEERIKELEEKEERLEELK-KKLKELEKRLEELEERHELYEEAKAKKEELerLKKrltgltpeklekeleELEKAKEEI 403
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1097 RTEISKLNQKIhdnNESYQVGLSELRALMTIEKDQ-----CISELISRHEEE-SNILKAELDNVTSLHRQAYEIEKKLKE 1170
Cdd:PRK03918  404 EEEISKITARI---GELKKEIKELKKAIEELKKAKgkcpvCGRELTEEHRKElLEEYTAELKRIEKELKEIEEKERKLRK 480
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1171 QIVELQTRLN--SELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQ----DKEHLIQELNFEKNKAVQTALDE 1244
Cdd:PRK03918  481 ELRELEKVLKkeSELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKliklKGEIKSLKKELEKLEELKKKLAE 560
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1245 FKVERELVEKELLEKVKHLENQiaktpAFESAREDSSSLvAELQ------------EKLQEEKAKFLEQLEEQEKRKNEE 1312
Cdd:PRK03918  561 LEKKLDELEEELAELLKELEEL-----GFESVEELEERL-KELEpfyneylelkdaEKELEREEKELKKLEEELDKAFEE 634
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1313 MQNVRTSL------IAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVS 1386
Cdd:PRK03918  635 LAETEKRLeelrkeLEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELE 714

                  ..
gi 226693374 1387 KL 1388
Cdd:PRK03918  715 KL 716
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1010-1308 1.07e-11

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 69.97  E-value: 1.07e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1010 LEKLKK---------ENQQRIDQM-LESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRT 1079
Cdd:COG1196   202 LEPLERqaekaeryrELKEELKELeAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELEL 281
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1080 QQKEMLKSL--LEQETENLRTEISKLNQKIHDNNESYQVGLSELRALmtIEKDQCISELISRHEEESNILKAELDNVTSL 1157
Cdd:COG1196   282 ELEEAQAEEyeLLAELARLEQDIARLEERRRELEERLEELEEELAEL--EEELEELEEELEELEEELEEAEEELEEAEAE 359
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1158 HRQAYEIEKKLKEQIVELQTRLNSELSAL--EKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELnfekn 1235
Cdd:COG1196   360 LAEAEEALLEAEAELAEAEEELEELAEELleALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELE----- 434
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1236 kavqTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKR 1308
Cdd:COG1196   435 ----EEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADY 503
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
855-1372 1.40e-11

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 69.71  E-value: 1.40e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  855 EITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIvcvqQEKDQKLL 934
Cdd:PRK03918  173 EIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEI----EELEKELE 248
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  935 EMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHIRhTQEFEKVMTDhnmsLEKLK 1014
Cdd:PRK03918  249 SLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDE-LREIEKRLSR----LEEEI 323
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1015 KENQQRIDQmLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEA-----------ETDEIKILLEESRTQQKE 1083
Cdd:PRK03918  324 NGIEERIKE-LEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEElerlkkrltglTPEKLEKELEELEKAKEE 402
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1084 mlkslLEQETENLRTEISKLNQKIHDNNEsyqvglselrALMTIEKDQ-----CISELISRHEEE-SNILKAELDNVTSL 1157
Cdd:PRK03918  403 -----IEEEISKITARIGELKKEIKELKK----------AIEELKKAKgkcpvCGRELTEEHRKElLEEYTAELKRIEKE 467
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1158 HRQAYEIEKKLKEQIVELQTRLN--SELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQ----DKEHLIQELN 1231
Cdd:PRK03918  468 LKEIEEKERKLRKELRELEKVLKkeSELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKliklKGEIKSLKKE 547
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1232 FEKNKAVQTALDEFKVERELVEKELLEKVKHLENQiaktpAFESAREDSSSLvAELQ------------EKLQEEKAKFL 1299
Cdd:PRK03918  548 LEKLEELKKKLAELEKKLDELEEELAELLKELEEL-----GFESVEELEERL-KELEpfyneylelkdaEKELEREEKEL 621
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1300 EQLEEQEKRKNEEMQNVRTSL------IAEQQTNFNTV----LTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSE 1369
Cdd:PRK03918  622 KKLEEELDKAFEELAETEKRLeelrkeLEELEKKYSEEeyeeLREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKE 701

                  ...
gi 226693374 1370 DRA 1372
Cdd:PRK03918  702 ELE 704
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
859-1359 1.98e-11

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 69.31  E-value: 1.98e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   859 KEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLvSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEK 938
Cdd:TIGR00606  499 LKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQME-MLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQLED 577
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   939 IMHTQHCEIKELKQSREMALEDLKKLHDEKiESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKEN- 1017
Cdd:TIGR00606  578 WLHSKSKEINQTRDRLAKLNKELASLEQNK-NHINNELESKEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSSKQRa 656
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1018 ---------QQRIDQMLESHAS----------TIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEesr 1078
Cdd:TIGR00606  657 mlagatavySQFITQLTDENQSccpvcqrvfqTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAP--- 733
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1079 TQQKEMlkSLLEQETENLRTEISKLNQKIHdnnesyqvglselRALMTIEKDQCISELISRHEEESNILKAELDNVTSLH 1158
Cdd:TIGR00606  734 GRQSII--DLKEKEIPELRNKLQKVNRDIQ-------------RLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQ 798
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1159 RQAYEIEKKLKEQIVELQTrlnSELSALEKQKDEKITQQEEKYEALIQNLEKDKeRLVKNHEQDKEHLIQELNFEKNKAV 1238
Cdd:TIGR00606  799 MELKDVERKIAQQAAKLQG---SDLDRTVQQVNQEKQEKQHELDTVVSKIELNR-KLIQDQQEQIQHLKSKTNELKSEKL 874
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1239 QtaLDEFKVERELVEKELLEKVKHLENQIAktpAFESAREDSSSLvAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRT 1318
Cdd:TIGR00606  875 Q--IGTNLQRRQQFEEQLVELSTEVQSLIR---EIKDAKEQDSPL-ETFLEKDQQEKEELISSKETSNKKAQDKVNDIKE 948
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*..
gi 226693374  1319 SL---IAEQQTNFNTV---LTREKMRKENIINDLSDKLKSTMQQQER 1359
Cdd:TIGR00606  949 KVkniHGYMKDIENKIqdgKDDYLKQKETELNTVNAQLEECEKHQEK 995
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1055-1369 2.01e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 69.32  E-value: 2.01e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1055 KLEVELALKEAETDEIKILLEESRTQQKEmlkslLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALmtiekdqciS 1134
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAELRKELEE-----LEEELEQLRKELEELSRQISALRKDLARLEAEVEQL---------E 746
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1135 ELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKdEKITQQEEKYEALIQNLEKDKER 1214
Cdd:TIGR02168  747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR-EALDELRAELTLLNEEAANLRER 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1215 LvKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEK-----ELLEKVKHLENQIAKT-PAFESAREDSSSLVAELQ 1288
Cdd:TIGR02168  826 L-ESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELeelieELESELEALLNERASLeEALALLRSELEELSEELR 904
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1289 EKlqEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLS 1368
Cdd:TIGR02168  905 EL--ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIK 982

                   .
gi 226693374  1369 E 1369
Cdd:TIGR02168  983 E 983
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
892-1311 2.57e-11

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 68.94  E-value: 2.57e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   892 LKLKENLVSLEEALQNKDNEFTSIKHEKDAIvcvQQEKDQKLLEMEKimhtQHCEIKELKQSREMALEDLKKLHdEKIES 971
Cdd:TIGR02169  670 RSEPAELQRLRERLEGLKRELSSLQSELRRI---ENRLDELSQELSD----ASRKIGEIEKEIEQLEQEEEKLK-ERLEE 741
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   972 LRAEFQCLEQnhlkELEDTlhIRHTQEFEKVMTDHNMSLEKLKKEnQQRIDQMLEShaSTIQEKEQQLQELKLKVSDLsd 1051
Cdd:TIGR02169  742 LEEDLSSLEQ----EIENV--KSELKELEARIEELEEDLHKLEEA-LNDLEARLSH--SRIPEIQAELSKLEEEVSRI-- 810
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1052 mrcklevelalkEAETDEIkilleESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMTI--EK 1129
Cdd:TIGR02169  811 ------------EARLREI-----EQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEEleEL 873
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1130 DQCISELISRHEEesniLKAELDNvtslhrqayeiekkLKEQIVELQTRLNsELSALEKQKDEKITQQEEKYEALIQNLE 1209
Cdd:TIGR02169  874 EAALRDLESRLGD----LKKERDE--------------LEAQLRELERKIE-ELEAQIEKKRKRLSELKAKLEALEEELS 934
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1210 KdkerLVKNHEQDKEHLIQELNFEKNKAvqtaldefKVERELVEKELLEKVKHLENQiaktpAFESAREDSSSLVaELQE 1289
Cdd:TIGR02169  935 E----IEDPKGEDEEIPEEELSLEDVQA--------ELQRVEEEIRALEPVNMLAIQ-----EYEEVLKRLDELK-EKRA 996
                          410       420
                   ....*....|....*....|..
gi 226693374  1290 KLQEEKAKFLEQLEEQEKRKNE 1311
Cdd:TIGR02169  997 KLEEERKAILERIEEYEKKKRE 1018
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
842-1314 6.75e-11

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 67.40  E-value: 6.75e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  842 EIRNIIEKVKCSLEITLKEKHQQELQSLKIEYECKLdalvkdseenvNKILKLKENLVSLEEALQNKDNEFTSIKHEKda 921
Cdd:PRK03918  274 EIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDEL-----------REIEKRLSRLEEEINGIEERIKELEEKEERL-- 340
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  922 ivcvqQEKDQKLLEMEKimhtqhcEIKELKQSREmALEDLKKLHDEkIESLRAEFQCLEQNHLKELEDTLHIRHT--QEF 999
Cdd:PRK03918  341 -----EELKKKLKELEK-------RLEELEERHE-LYEEAKAKKEE-LERLKKRLTGLTPEKLEKELEELEKAKEeiEEE 406
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1000 EKVMTDHNMSLEKLKKENQQRIDQMLESHAS--------TIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIK 1071
Cdd:PRK03918  407 ISKITARIGELKKEIKELKKAIEELKKAKGKcpvcgrelTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELE 486
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1072 ILLEESRTQQKemLKSLLEQeTENLRTEISKLNqkihdnnesyqvglselralmtIEKDQCISELISRHEEESNILKAEL 1151
Cdd:PRK03918  487 KVLKKESELIK--LKELAEQ-LKELEEKLKKYN----------------------LEELEKKAEEYEKLKEKLIKLKGEI 541
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1152 DNVTSLHRQAYEIEKKLKEQIVELQTrLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLV--KNHEQDKEHLIQE 1229
Cdd:PRK03918  542 KSLKKELEKLEELKKKLAELEKKLDE-LEEELAELLKELEELGFESVEELEERLKELEPFYNEYLelKDAEKELEREEKE 620
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1230 LNFEKNKAVQTA--LDEFKVERELVEKEL--LEKVKHLENQIAKTPAFESAREDSSSLVAELQ--EKLQEEKAKFLEQLE 1303
Cdd:PRK03918  621 LKKLEEELDKAFeeLAETEKRLEELRKELeeLEKKYSEEEYEELREEYLELSRELAGLRAELEelEKRREEIKKTLEKLK 700
                         490
                  ....*....|..
gi 226693374 1304 EQ-EKRKNEEMQ 1314
Cdd:PRK03918  701 EElEEREKAKKE 712
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
855-1372 1.06e-10

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 66.88  E-value: 1.06e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  855 EITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLL 934
Cdd:COG1196   261 ELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELE 340
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  935 EMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQcLEQNHLKELEDTLHIRHTQEfekvmtdhnmSLEKLK 1014
Cdd:COG1196   341 ELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE-LAEELLEALRAAAELAAQLE----------ELEEAE 409
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1015 KENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKSLLEQETE 1094
Cdd:COG1196   410 EALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEA 489
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1095 NLRtEISKLNQKIHDNNESYQVGLSELRALMTiEKDQCISELISRHEEESNILKAELDNVtsLHRQAYEIEKKLKEQIVE 1174
Cdd:COG1196   490 AAR-LLLLLEAEADYEGFLEGVKAALLLAGLR-GLAGAVAVLIGVEAAYEAALEAALAAA--LQNIVVEDDEVAAAAIEY 565
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1175 LQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEK 1254
Cdd:COG1196   566 LKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGR 645
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1255 ELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTR 1334
Cdd:COG1196   646 LREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEA 725
                         490       500       510
                  ....*....|....*....|....*....|....*...
gi 226693374 1335 EKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRA 1372
Cdd:COG1196   726 LEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLE 763
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
885-1215 1.80e-10

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 66.23  E-value: 1.80e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   885 EENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKimhtqhcEIKELKQSREMALEDLKKL 964
Cdd:TIGR02168  680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRK-------DLARLEAEVEQLEERIAQL 752
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   965 HDEkIESLRAEFQCLEQNhLKELEDTLHirhtqefekvmtdhnmSLEKLKKENQQRIDQMLESHA---STIQEKEQQLQE 1041
Cdd:TIGR02168  753 SKE-LTELEAEIEELEER-LEEAEEELA----------------EAEAEIEELEAQIEQLKEELKalrEALDELRAELTL 814
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1042 LKLKVSDLsdmRCKLEVELALKEAETDEIKILLEESRTQQKEMLKslLEQETENLRTEISKLNQKIHDNNESYQVgLSEL 1121
Cdd:TIGR02168  815 LNEEAANL---RERLESLERRIAATERRLEDLEEQIEELSEDIES--LAAEIEELEELIEELESELEALLNERAS-LEEA 888
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1122 RALMTIEKDQCISElISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALE---KQKDEKITQQE 1198
Cdd:TIGR02168  889 LALLRSELEELSEE-LRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLeeaEALENKIEDDE 967
                          330
                   ....*....|....*..
gi 226693374  1199 EKYEALIQNLEKDKERL 1215
Cdd:TIGR02168  968 EEARRRLKRLENKIKEL 984
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1032-1325 2.45e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 65.47  E-value: 2.45e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1032 IQEKEQQLQELKLKVSDLSDMRCKLEVELALKEaetdEIKILLEESR-TQQKEMLKSL--LEQETENLRTEISKLNQKIh 1108
Cdd:TIGR02169  179 LEEVEENIERLDLIIDEKRQQLERLRREREKAE----RYQALLKEKReYEGYELLKEKeaLERQKEAIERQLASLEEEL- 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1109 dnnESYQVGLSELRALMTiEKDQCISELISR----HEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELS 1184
Cdd:TIGR02169  254 ---EKLTEEISELEKRLE-EIEQLLEELNKKikdlGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEA 329
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1185 ALEKQK------DEKITQQEEKYEALIQNLEKDKERL----VKNHEQDKEHliQELnFEKNKAVQTALDEFKVERELVEK 1254
Cdd:TIGR02169  330 EIDKLLaeieelEREIEEERKRRDKLTEEYAELKEELedlrAELEEVDKEF--AET-RDELKDYREKLEKLKREINELKR 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1255 E----------LLEKVKHLENQIA----KTPAFESAREDSSSLVAELQEKLQEEKAKfLEQLEEQEKRKNEEMQNVRTSL 1320
Cdd:TIGR02169  407 EldrlqeelqrLSEELADLNAAIAgieaKINELEEEKEDKALEIKKQEWKLEQLAAD-LSKYEQELYDLKEEYDRVEKEL 485

                   ....*
gi 226693374  1321 IAEQQ 1325
Cdd:TIGR02169  486 SKLQR 490
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
855-1369 2.79e-10

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 65.45  E-value: 2.79e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   855 EITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVsleealQNKDNEFTSIKHEKDAIVCVQQEKDQKLL 934
Cdd:TIGR00606  270 EIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTV------REKERELVDCQRELEKLNKERRLLNQEKT 343
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   935 EMEKimhtqhcEIKELKQSREMALEDLKKLHDEKIE-SLRAEFQCLEQNHLKELE-DTLH--IRHTQEFEKVMTDHNM-- 1008
Cdd:TIGR00606  344 ELLV-------EQGRLQLQADRHQEHIRARDSLIQSlATRLELDGFERGPFSERQiKNFHtlVIERQEDEAKTAAQLCad 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1009 --SLEKLKKENQQRIDQMLESHASTIQEK----EQQLQELKLKVSDLSD----MRCKLEVELALKEAETDeIKILLEESR 1078
Cdd:TIGR00606  417 lqSKERLKQEQADEIRDEKKGLGRTIELKkeilEKKQEELKFVIKELQQlegsSDRILELDQELRKAERE-LSKAEKNSL 495
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1079 TQQKEMLKSLLEQETENLRTEISKLNQKIHD-NNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSL 1157
Cdd:TIGR00606  496 TETLKKEVKSLQNEKADLDRKLRKLDQEMEQlNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNKKQL 575
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1158 HRQAYEIEKKLKeQIVELQTRLNSELSALEKQKD------EKITQQEEKYEALI------QNLEKDKERLVKNHEQDKEH 1225
Cdd:TIGR00606  576 EDWLHSKSKEIN-QTRDRLAKLNKELASLEQNKNhinnelESKEEQLSSYEDKLfdvcgsQDEESDLERLKEEIEKSSKQ 654
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1226 L-------------IQELNFEKNKAVQTALDEFKVERELVE--KELLEKVKHLENQIAKTPAFESAREDSSSLV---AEL 1287
Cdd:TIGR00606  655 RamlagatavysqfITQLTDENQSCCPVCQRVFQTEAELQEfiSDLQSKLRLAPDKLKSTESELKKKEKRRDEMlglAPG 734
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1288 QEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSlIAEQQTNFNTVLTREKMrKENIINDLSDKLKSTMQQQERDKDLIESL 1367
Cdd:TIGR00606  735 RQSIIDLKEKEIPELRNKLQKVNRDIQRLKND-IEEQETLLGTIMPEEES-AKVCLTDVTIMERFQMELKDVERKIAQQA 812

                   ..
gi 226693374  1368 SE 1369
Cdd:TIGR00606  813 AK 814
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1142-1373 3.59e-10

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 64.96  E-value: 3.59e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1142 EESNILKAELdnvTSLHRQAYEIE-KKLKEQIVELQTRLNSELSALEkQKDEKITQQEEKYEALIQNLEKDKERLVkNHE 1220
Cdd:COG1196   220 EELKELEAEL---LLLKLRELEAElEELEAELEELEAELEELEAELA-ELEAELEELRLELEELELELEEAQAEEY-ELL 294
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1221 QDKEHLIQELNF--EKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKF 1298
Cdd:COG1196   295 AELARLEQDIARleERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAEL 374
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 226693374 1299 LEQLEEQEKRKNEEMQNVRTSL-IAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:COG1196   375 AEAEEELEELAEELLEALRAAAeLAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEE 450
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
929-1367 3.61e-10

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 64.66  E-value: 3.61e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   929 KDQKLLEMEKIMHTQHCEIKElKQSREMALEDLKKLHDEKIESLRAEFQCLEQNhLKELEDTLHIRHTQefekvMTDHNM 1008
Cdd:TIGR04523   31 QDTEEKQLEKKLKTIKNELKN-KEKELKNLDKNLNKDEEKINNSNNKIKILEQQ-IKDLNDKLKKNKDK-----INKLNS 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1009 SLEKLKKENQQRIDQMLESHaSTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKSL 1088
Cdd:TIGR04523  104 DLSKINSEIKNDKEQKNKLE-VELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEK 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1089 LEQETE--NLRTEISKLNQKIHD------NNESYQVGLSELRALMTIEKDQcISELISRHEEESNILKAELDNVTSLHRQ 1160
Cdd:TIGR04523  183 LNIQKNidKIKNKLLKLELLLSNlkkkiqKNKSLESQISELKKQNNQLKDN-IEKKQQEINEKTTEISNTQTQLNQLKDE 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1161 AYEIEKKLKEQIVELQT-------------RLNSELSALEKQKD--------EKITQQEEKYEALIQNLEKDKERLVKNH 1219
Cdd:TIGR04523  262 QNKIKKQLSEKQKELEQnnkkikelekqlnQLKSEISDLNNQKEqdwnkelkSELKNQEKKLEEIQNQISQNNKIISQLN 341
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1220 EQ--DKEHLIQELNFEkNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDssslvaelQEKLQEEKAK 1297
Cdd:TIGR04523  342 EQisQLKKELTNSESE-NSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQN--------QEKLNQQKDE 412
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1298 FLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNtvLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESL 1367
Cdd:TIGR04523  413 QIKKLQQEKELLEKEIERLKETIIKNNSEIKD--LTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKI 480
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
829-1255 4.86e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 64.70  E-value: 4.86e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   829 QCDLANYLKCTAVEIRNIIEKVkcsleitlkekhqqelqSLKIEYECKLDALVKDSEEnvnkilkLKENLVSLEEALQNK 908
Cdd:TIGR02169  141 QGDVTDFISMSPVERRKIIDEI-----------------AGVAEFDRKKEKALEELEE-------VEENIERLDLIIDEK 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   909 DNEFTSIKHEKDaivcvQQEKDQKLLEmEKimhtQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFQCLEQnHLKELE 988
Cdd:TIGR02169  197 RQQLERLRRERE-----KAERYQALLK-EK----REYEGYELLKEKEALERQKEAI-ERQLASLEEELEKLTE-EISELE 264
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   989 DTLHirhtqEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETD 1068
Cdd:TIGR02169  265 KRLE-----EIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIE 339
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1069 EIKillEESRTQQKEmlKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALmtiekDQCISELISRHEEesniLK 1148
Cdd:TIGR02169  340 ELE---REIEEERKR--RDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDY-----REKLEKLKREINE----LK 405
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1149 AELDNVTSLHRQAYEIEKKLKEQIVELQTRLNsELSALEKQKDEKITQQEEKYEALIQNLEKDKERLvknheQDKEHLIQ 1228
Cdd:TIGR02169  406 RELDRLQEELQRLSEELADLNAAIAGIEAKIN-ELEEEKEDKALEIKKQEWKLEQLAADLSKYEQEL-----YDLKEEYD 479
                          410       420
                   ....*....|....*....|....*..
gi 226693374  1229 ELNfEKNKAVQTALDEFKVERELVEKE 1255
Cdd:TIGR02169  480 RVE-KELSKLQRELAEAEAQARASEER 505
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
842-1373 6.08e-10

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 64.31  E-value: 6.08e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   842 EIRNIIEKVKCSLEITLKEKHQQELQSLKIEYEC-----KLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEftsik 916
Cdd:TIGR02168  292 ALANEISRLEQQKQILRERLANLERQLEELEAQLeelesKLDELAEELAELEEKLEELKEELESLEAELEELEAE----- 366
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   917 hekdaivcvQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFQCLEQNhLKELEDTLHIRHT 996
Cdd:TIGR02168  367 ---------LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERL-EARLERLEDRRERLQQE-IEELLKKLEEAEL 435
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   997 QEFEKVMTDHNMSLEKLKKENQQRIDQmLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEE 1076
Cdd:TIGR02168  436 KELQAELEELEEELEELQEELERLEEA-LEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKN 514
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1077 SR--TQQKEMLKSLLEQEtENLRTEISK-----LNQKIHDNNESYQVGLSELR--------------------------A 1123
Cdd:TIGR02168  515 QSglSGILGVLSELISVD-EGYEAAIEAalggrLQAVVVENLNAAKKAIAFLKqnelgrvtflpldsikgteiqgndreI 593
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1124 LMTIEKDQCI-SELISRHEEESNILKAELDN---VTSLhRQAYEIEKKLKEQ--IVELQTRL------------NSELSA 1185
Cdd:TIGR02168  594 LKNIEGFLGVaKDLVKFDPKLRKALSYLLGGvlvVDDL-DNALELAKKLRPGyrIVTLDGDLvrpggvitggsaKTNSSI 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1186 LEKQKD-EKITQQEEKYEALIQNLEKDKERLVKNHEQDKE--HLIQELNFEKNK---AVQTALDEFKVERELVEKELLEK 1259
Cdd:TIGR02168  673 LERRREiEELEEKIEELEEKIAELEKALAELRKELEELEEelEQLRKELEELSRqisALRKDLARLEAEVEQLEERIAQL 752
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1260 VKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKF------LEQLEEQEKRKNEEMQNVRTSliAEQQTNFNTVLT 1333
Cdd:TIGR02168  753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIeqlkeeLKALREALDELRAELTLLNEE--AANLRERLESLE 830
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|
gi 226693374  1334 REKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:TIGR02168  831 RRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEE 870
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
859-1390 7.90e-10

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 63.92  E-value: 7.90e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   859 KEKHQQELQSLKIEYEckldALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEK 938
Cdd:TIGR02168  304 KQILRERLANLERQLE----ELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEE 379
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   939 IMHTQHCEIKELKQSREMALEDLKKLhDEKIESLRAEFQCLEQNhLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQ 1018
Cdd:TIGR02168  380 QLETLRSKVAQLELQIASLNNEIERL-EARLERLEDRRERLQQE-IEELLKKLEEAELKELQAELEELEEELEELQEELE 457
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1019 QRIDQmLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESR--TQQKEMLKSLLEQEtENL 1096
Cdd:TIGR02168  458 RLEEA-LEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSglSGILGVLSELISVD-EGY 535
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1097 RTEISK-----LNQKIHDNNESYQVGLSELR--------------------------ALMTIEKDQCI-SELISRHEEES 1144
Cdd:TIGR02168  536 EAAIEAalggrLQAVVVENLNAAKKAIAFLKqnelgrvtflpldsikgteiqgndreILKNIEGFLGVaKDLVKFDPKLR 615
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1145 NILKAELDNV------------------------------------------TSLHRQAYEIE--------KKLKEQIVE 1174
Cdd:TIGR02168  616 KALSYLLGGVlvvddldnalelakklrpgyrivtldgdlvrpggvitggsakTNSSILERRREieeleekiEELEEKIAE 695
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1175 LQTRLN---SELSALEKQKDEKITQQEEKyEALIQNLEKDKERLVKNHEQdKEHLIQELNFEKNKAVQ--TALDEFKVER 1249
Cdd:TIGR02168  696 LEKALAelrKELEELEEELEQLRKELEEL-SRQISALRKDLARLEAEVEQ-LEERIAQLSKELTELEAeiEELEERLEEA 773
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1250 ELVEKELLEKVKHLENQIAKtpaFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEemqnvrtslIAEQQTNFn 1329
Cdd:TIGR02168  774 EEELAEAEAEIEELEAQIEQ---LKEELKALREALDELRAELTLLNEEAANLRERLESLERR---------IAATERRL- 840
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 226693374  1330 TVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVSKLRT 1390
Cdd:TIGR02168  841 EDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSE 901
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
877-1369 8.76e-10

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 63.54  E-value: 8.76e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  877 LDALVKDSEENvNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIvcvqqekdQKLLEMEKIMHTQhceIKELKQSREM 956
Cdd:PRK03918  137 IDAILESDESR-EKVVRQILGLDDYENAYKNLGEVIKEIKRRIERL--------EKFIKRTENIEEL---IKEKEKELEE 204
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  957 ALEDLKKLHDEkIESLRAEFQCLEQNhLKELEDTLHIRHTQEFEKvmtdhnMSLEKLKKENQQRIDQMLEShastIQEKE 1036
Cdd:PRK03918  205 VLREINEISSE-LPELREELEKLEKE-VKELEELKEEIEELEKEL------ESLEGSKRKLEEKIRELEER----IEELK 272
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1037 QQLQELKLKVSDLSDMRCKLEVELALKEaETDEIKilleeSRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNEsyqv 1116
Cdd:PRK03918  273 KEIEELEEKVKELKELKEKAEEYIKLSE-FYEEYL-----DELREIEKRLSRLEEEINGIEERIKELEEKEERLEE---- 342
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1117 glselralmtiekdqcISELISRHEEESNILKA---ELDNVTSLHRQAYEIEKKLKEQIVElqtRLNSELSALEKQKdEK 1193
Cdd:PRK03918  343 ----------------LKKKLKELEKRLEELEErheLYEEAKAKKEELERLKKRLTGLTPE---KLEKELEELEKAK-EE 402
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1194 ITQQEEKYEALIQNLEKDKERLVKN------------------HEQDKEHLIQELNFEKNKavqtaLDEFKVERELVEKE 1255
Cdd:PRK03918  403 IEEEISKITARIGELKKEIKELKKAieelkkakgkcpvcgrelTEEHRKELLEEYTAELKR-----IEKELKEIEEKERK 477
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1256 LLEKVKHLENQIAKTPAFESAREdssslVAELQEKLQEEKAKF-LEQLEeqekRKNEEMQNVRTSLIaeqqtnfntvltr 1334
Cdd:PRK03918  478 LRKELRELEKVLKKESELIKLKE-----LAEQLKELEEKLKKYnLEELE----KKAEEYEKLKEKLI------------- 535
                         490       500       510
                  ....*....|....*....|....*....|....*
gi 226693374 1335 ekmrkeniinDLSDKLKSTMQQQERDKDLIESLSE 1369
Cdd:PRK03918  536 ----------KLKGEIKSLKKELEKLEELKKKLAE 560
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1114-1373 1.26e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 63.42  E-value: 1.26e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1114 YQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALE--KQKD 1191
Cdd:COG1196   218 LKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYelLAEL 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1192 EKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQ-----ELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQ 1266
Cdd:COG1196   298 ARLEQDIARLEERRRELEERLEELEEELAELEEELEEleeelEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEA 377
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1267 IAKTPAFESAREDSSSLVAELQEKLQ------EEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQT--NFNTVLTREKMR 1338
Cdd:COG1196   378 EEELEELAEELLEALRAAAELAAQLEeleeaeEALLERLERLEEELEELEEALAELEEEEEEEEEAleEAAEEEAELEEE 457
                         250       260       270
                  ....*....|....*....|....*....|....*
gi 226693374 1339 KENIINDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:COG1196   458 EEALLELLAELLEEAALLEAALAELLEELAEAAAR 492
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
946-1488 1.31e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 63.03  E-value: 1.31e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  946 EIKELKQSREMALEDLKKLhDEKIESLRAEFQCLE------QNHLKELEDTLHIRHTQEFEKVMTDHNM-----SLEKLK 1014
Cdd:COG1196   233 KLRELEAELEELEAELEEL-EAELEELEAELAELEaeleelRLELEELELELEEAQAEEYELLAELARLeqdiaRLEERR 311
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1015 KENQQRIDQMLEShastIQEKEQQLQELKLKvsdlsdmrcklEVELALKEAETDEIKILLEESRTQQKEMLKSLLEQETE 1094
Cdd:COG1196   312 RELEERLEELEEE----LAELEEELEELEEE-----------LEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE 376
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1095 NLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQcisELISRHEEESNILKAELDNVtslhRQAYEIEKKLKEQIVE 1174
Cdd:COG1196   377 AEEELEELAEELLEALRAAAELAAQLEELEEAEEALL---ERLERLEEELEELEEALAEL----EEEEEEEEEALEEAAE 449
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1175 LQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLvknheqdkehliQELNFEKNKAVQTALDEFKVERELVEK 1254
Cdd:COG1196   450 EEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAA------------ARLLLLLEAEADYEGFLEGVKAALLLA 517
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1255 ELLEKVKHLENQIAKTPAFESAREDssSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTR 1334
Cdd:COG1196   518 GLRGLAGAVAVLIGVEAAYEAALEA--ALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARG 595
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1335 EKMRKENIINDLSDKLKStmqQQERDKDLIESLSEDRARLLEEKKQLEEEVSKLRTSSFLSSApVAAAPELYGACAPELP 1414
Cdd:COG1196   596 AIGAAVDLVASDLREADA---RYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEG-GSAGGSLTGGSRRELL 671
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 226693374 1415 GEPERSVMETADEGRLDSAMETSMMSVQENMLSEEKQRIMLLERTLQLKEEENKRLNQRLMSQSLSSVSSRHSE 1488
Cdd:COG1196   672 AALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEE 745
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
804-1390 2.44e-09

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 62.76  E-value: 2.44e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   804 AQDSHSSIQTIKDDLCH--FRTFVQKEQCDLANYLKCTAVEIRNIIEKVkcsleitlkEKHQQELQSLKIEYECKLDAL- 880
Cdd:TIGR01612 1079 AEINITNFNEIKEKLKHynFDDFGKEENIKYADEINKIKDDIKNLDQKI---------DHHIKALEEIKKKSENYIDEIk 1149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   881 --------VKD---SEENVNKILKLKENLVSleeALQNKDNEFTSIKHEKDAIVCVqqEKDQKLLEMEKIMHTQHCeiKE 949
Cdd:TIGR01612 1150 aqindledVADkaiSNDDPEEIEKKIENIVT---KIDKKKNIYDEIKKLLNEIAEI--EKDKTSLEEVKGINLSYG--KN 1222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   950 LKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKenQQRIDQMLESHA 1029
Cdd:TIGR01612 1223 LGKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDIKAEMETFNISHDDDKD--HHIISKKHDENI 1300
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1030 STIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESrTQQKEMLKSlleQETENLRTEISKLNQKIHD 1109
Cdd:TIGR01612 1301 SDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLNEI-ANIYNILKL---NKIKKIIDEVKEYTKEIEE 1376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1110 NNESYQVGLSELRALMTIEKD-----QC------------ISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQI 1172
Cdd:TIGR01612 1377 NNKNIKDELDKSEKLIKKIKDdinleECkskiestlddkdIDECIKKIKELKNHILSEESNIDTYFKNADENNENVLLLF 1456
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1173 VELQTRLNSELSALEKQKDEKITQQEEKYEALIQNleKDKERLVKNhEQDKehliQELNFEKNKAVqtaldeFKVERELV 1252
Cdd:TIGR01612 1457 KNIEMADNKSQHILKIKKDNATNDHDFNINELKEH--IDKSKGCKD-EADK----NAKAIEKNKEL------FEQYKKDV 1523
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1253 eKELLEKVKHLE--NQIAKTpafesaREDSSSLVAELQEKlqeeKAKFLEQLEEQEKR----KNEEMQ------------ 1314
Cdd:TIGR01612 1524 -TELLNKYSALAikNKFAKT------KKDSEIIIKEIKDA----HKKFILEAEKSEQKikeiKKEKFRieddaakndksn 1592
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1315 ----NVRTSL---------IAEQQTNFNTVLTREKMRKENI----INDLSDKLKSTMQQQERDKDLIESLSEDRARLLEE 1377
Cdd:TIGR01612 1593 kaaiDIQLSLenfenkflkISDIKKKINDCLKETESIEKKIssfsIDSQDTELKENGDNLNSLQEFLESLKDQKKNIEDK 1672
                          650
                   ....*....|...
gi 226693374  1378 KKQLEEEVSKLRT 1390
Cdd:TIGR01612 1673 KKELDELDSEIEK 1685
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1149-1472 2.60e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 62.38  E-value: 2.60e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1149 AELD-NVTSLHRQAYEIEK--KLKEQIVELQ--------TRLNSELSALEKQKDEKiTQQEEKYEALIQNLEKDKERLVK 1217
Cdd:TIGR02168  196 NELErQLKSLERQAEKAERykELKAELRELElallvlrlEELREELEELQEELKEA-EEELEELTAELQELEEKLEELRL 274
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1218 NHEQDKE--HLIQELNFEKNKAVQtaldefKVERELveKELLEKVKHLENQIAKTPA----FESAREDSSSLVAELQEKL 1291
Cdd:TIGR02168  275 EVSELEEeiEELQKELYALANEIS------RLEQQK--QILRERLANLERQLEELEAqleeLESKLDELAEELAELEEKL 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1292 QEEKAKFlEQLEEQEKRKNEEMQNvRTSLIAEQQTNFNTVltrekmRKEniINDLSDKLKSTMQQQERDKDLIESLSEDR 1371
Cdd:TIGR02168  347 EELKEEL-ESLEAELEELEAELEE-LESRLEELEEQLETL------RSK--VAQLELQIASLNNEIERLEARLERLEDRR 416
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1372 ARLLEEKKQLEEEVSKLRTssflssAPVAAAPELYGACAPELPGEPERsVMETADEGRLDSAMETSMMSVQENMLSEEKQ 1451
Cdd:TIGR02168  417 ERLQQEIEELLKKLEEAEL------KELQAELEELEEELEELQEELER-LEEALEELREELEEAEQALDAAERELAQLQA 489
                          330       340
                   ....*....|....*....|.
gi 226693374  1452 RIMLLERTLQLKEEENKRLNQ 1472
Cdd:TIGR02168  490 RLDSLERLQENLEGFSEGVKA 510
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
859-1373 2.61e-09

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 62.11  E-value: 2.61e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   859 KEKHQQELQSLKiEYECKLDALVKDSEENVNKILKLKENLVSLEEA---LQNKDNEFTSIKHEKDAIVCVQQEKDQKLLE 935
Cdd:pfam01576   18 KERQQKAESELK-ELEKKHQQLCEEKNALQEQLQAETELCAEAEEMrarLAARKQELEEILHELESRLEEEEERSQQLQN 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   936 MEKIMHTQHCEIKELKQSREMALEDL---KKLHDEKIESLRAEFQCLE-QNHLKELEDTLHIRHTQEFEKVMTDHN---M 1008
Cdd:pfam01576   97 EKKKMQQHIQDLEEQLDEEEAARQKLqleKVTTEAKIKKLEEDILLLEdQNSKLSKERKLLEERISEFTSNLAEEEekaK 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1009 SLEKLKKENQQRIDQM---LESHASTIQEKEQ-------QLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESR 1078
Cdd:pfam01576  177 SLSKLKNKHEAMISDLeerLKKEEKGRQELEKakrklegESTDLQEQIAELQAQIAELRAQLAKKEEELQAALARLEEET 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1079 TQQKEMLKSL---------LEQETENLRTEISKLNQKIHDNNEsyqvglsELRALMTIEKDQCIS-----ELISRHEEES 1144
Cdd:pfam01576  257 AQKNNALKKIreleaqiseLQEDLESERAARNKAEKQRRDLGE-------ELEALKTELEDTLDTtaaqqELRSKREQEV 329
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1145 NILKAELDNVTSLHR-QAYEIEKKLKEQIVELQTRLnsELSALEKQKDEKITQQEEKyealiQNLEKDKE-RLVKNHEQD 1222
Cdd:pfam01576  330 TELKKALEEETRSHEaQLQEMRQKHTQALEELTEQL--EQAKRNKANLEKAKQALES-----ENAELQAElRTLQQAKQD 402
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1223 KEHliqelnfeKNKAVQTALDEFKV---ERELVEKELLEKVKHLENQIAKTPAFESARED----SSSLVAELQEKLQEEK 1295
Cdd:pfam01576  403 SEH--------KRKKLEGQLQELQArlsESERQRAELAEKLSKLQSELESVSSLLNEAEGknikLSKDVSSLESQLQDTQ 474
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1296 akflEQLEEQEKRKNEEMQNVRTslIAEQQTNFNTVLTREKMRKENI---INDLSDKLKSTMQQQERDKDLIESLSEDRA 1372
Cdd:pfam01576  475 ----ELLQEETRQKLNLSTRLRQ--LEDERNSLQEQLEEEEEAKRNVerqLSTLQAQLSDMKKKLEEDAGTLEALEEGKK 548

                   .
gi 226693374  1373 R 1373
Cdd:pfam01576  549 R 549
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
892-1369 3.08e-09

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 61.66  E-value: 3.08e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   892 LKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKLHDekies 971
Cdd:pfam05483  215 FKLKEDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIE----- 289
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   972 lraefqclEQNHL-KELEDTlhirhtqefeKVMTDHNMSLEKLKKENQQ----RIDQMLESHASTIQEKEQQLQELKLKV 1046
Cdd:pfam05483  290 --------KKDHLtKELEDI----------KMSLQRSMSTQKALEEDLQiatkTICQLTEEKEAQMEELNKAKAAHSFVV 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1047 SDLSDMRCKLE----VELALKEAETDEIKILLEESRTQQKEM-----LKSLLEQETENLRTEISKlNQKIHDNNESYQVG 1117
Cdd:pfam05483  352 TEFEATTCSLEellrTEQQRLEKNEDQLKIITMELQKKSSELeemtkFKNNKEVELEELKKILAE-DEKLLDEKKQFEKI 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1118 LSELRAlmtieKDQCISELISRHEEESNILKAELDNVTSLH----RQAYEIEKKL-KEQIVELQTRLNSELSALEK---- 1188
Cdd:pfam05483  431 AEELKG-----KEQELIFLLQAREKEIHDLEIQLTAIKTSEehylKEVEDLKTELeKEKLKNIELTAHCDKLLLENkelt 505
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1189 QKDEKITQQEEKYEALIQNLEKDKERLVK---NHEQDKEHLIQELNF------EKNKAVQTALDEFKVERELVEKELLEK 1259
Cdd:pfam05483  506 QEASDMTLELKKHQEDIINCKKQEERMLKqieNLEEKEMNLRDELESvreefiQKGDEVKCKLDKSEENARSIEYEVLKK 585
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1260 VKHLENQIAKTPAFESAREDSSSLVAELQEK----------------LQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAE 1323
Cdd:pfam05483  586 EKQMKILENKCNNLKKQIENKNKNIEELHQEnkalkkkgsaenkqlnAYEIKVNKLELELASAKQKFEEIIDNYQKEIED 665
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 226693374  1324 QQTNFNTVLtrEKMRKENIINDLSDKLkstmqQQERDKDLIESLSE 1369
Cdd:pfam05483  666 KKISEEKLL--EEVEKAKAIADEAVKL-----QKEIDKRCQHKIAE 704
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1147-1393 3.81e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 61.61  E-value: 3.81e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1147 LKAELDNVTSLHRQAYEIEKKLKEQIVELQ---TRLNSELSALEKQKDEkitQQEEKYE--ALIQNLEKDKERLV---KN 1218
Cdd:TIGR02168  237 LREELEELQEELKEAEEELEELTAELQELEeklEELRLEVSELEEEIEE---LQKELYAlaNEISRLEQQKQILRerlAN 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1219 HEQDKEHLIQELNFEKNKavqtaLDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKF 1298
Cdd:TIGR02168  314 LERQLEELEAQLEELESK-----LDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1299 LeQLEEQEKRKNEEMQNVRTSL--IAEQQTNFNTVLT-----REKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDR 1371
Cdd:TIGR02168  389 A-QLELQIASLNNEIERLEARLerLEDRRERLQQEIEellkkLEEAELKELQAELEELEEELEELQEELERLEEALEELR 467
                          250       260
                   ....*....|....*....|..
gi 226693374  1372 ARLLEEKKQLEEEVSKLRTSSF 1393
Cdd:TIGR02168  468 EELEEAEQALDAAERELAQLQA 489
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1012-1291 5.86e-09

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 60.16  E-value: 5.86e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1012 KLKKENQQRIDQMLEShastIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAEtdeikilLEESRTQQKEmlkslLEQ 1091
Cdd:COG4942    20 DAAAEAEAELEQLQQE----IAELEKELAALKKEEKALLKQLAALERRIAALARR-------IRALEQELAA-----LEA 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1092 ETENLRTEISKLNQKIHDNNESYQvglSELRALMTiekdqciselISRHEEESNILKAElDNVTSLHRQAY--EIEKKLK 1169
Cdd:COG4942    84 ELAELEKEIAELRAELEAQKEELA---ELLRALYR----------LGRQPPLALLLSPE-DFLDAVRRLQYlkYLAPARR 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1170 EQIVELQTRLNsELSALEKQkdekITQQEEKYEALIQNLEKDKERLVKNhEQDKEHLIQELNfEKNKAVQTALDEFKVEr 1249
Cdd:COG4942   150 EQAEELRADLA-ELAALRAE----LEAERAELEALLAELEEERAALEAL-KAERQKLLARLE-KELAELAAELAELQQE- 221
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 226693374 1250 elvEKELLEKVKHLENQIAktpafESAREDSSSLVAELQEKL 1291
Cdd:COG4942   222 ---AEELEALIARLEAEAA-----AAAERTPAAGFAALKGKL 255
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
851-1370 1.30e-08

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 59.75  E-value: 1.30e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   851 KCSLEITLKEKHQQELQ--SLKIEYECKLDALVKDSEENVNK-------ILKLKENLVSLEEALQNKDNEFTSIKHEKDA 921
Cdd:pfam05557    8 KARLSQLQNEKKQMELEhkRARIELEKKASALKRQLDRESDRnqelqkrIRLLEKREAEAEEALREQAELNRLKKKYLEA 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   922 IVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMAledlkklhDEKIESLRAEFQCLEQNHlKELEDTLhirhtQEFEK 1001
Cdd:pfam05557   88 LNKKLNEKESQLADAREVISCLKNELSELRRQIQRA--------ELELQSTNSELEELQERL-DLLKAKA-----SEAEQ 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1002 VMTDHNMSLEKLKKENQ--QRIDQMLESHASTIQEKEQQLQELkLKVSDLSDMRCKLEVELAlKEAETDEIKILLEEsrt 1079
Cdd:pfam05557  154 LRQNLEKQQSSLAEAEQriKELEFEIQSQEQDSEIVKNSKSEL-ARIPELEKELERLREHNK-HLNENIENKLLLKE--- 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1080 qQKEMLKSLLEQEtENLRTEISKLNQKihdnNESYQVGLSELRAL-----MTIEKDQCISELISRHEEESNILKAELDNV 1154
Cdd:pfam05557  229 -EVEDLKRKLERE-EKYREEAATLELE----KEKLEQELQSWVKLaqdtgLNLRSPEDLSRRIEQLQQREIVLKEENSSL 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1155 TS----LHRQAYEIEKKLK---EQIVELQTRLNSELSALEKQKDEKITQQEEKyEALIQNLEK-DKERLVKNHEQDKEHL 1226
Cdd:pfam05557  303 TSsarqLEKARRELEQELAqylKKIEDLNKKLKRHKALVRRLQRRVLLLTKER-DGYRAILESyDKELTMSNYSPQLLER 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1227 IQELNFEKNKaVQTALDEFKVERELVEKELL---EKVKHLENQIAKTPAFESAREDSSSL--VAELQEKLQEEKAkfleQ 1301
Cdd:pfam05557  382 IEEAEDMTQK-MQAHNEEMEAQLSVAEEELGgykQQAQTLERELQALRQQESLADPSYSKeeVDSLRRKLETLEL----E 456
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 226693374  1302 LEEQEKRKNE-EMQNVRTSLiaeqQTNFNTVLTREKMRKENIINDLSDKLKSTMQQ----QERDKDLIESLSED 1370
Cdd:pfam05557  457 RQRLREQKNElEMELERRCL----QGDYDPKKTKVLHLSMNPAAEAYQQRKNQLEKlqaeIERLKRLLKKLEDD 526
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
842-1372 1.78e-08

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 59.60  E-value: 1.78e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   842 EIRNIIEKvKCSLEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDA 921
Cdd:pfam02463  398 ELKSEEEK-EAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKE 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   922 IVCVQQEkdQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHIRHTQEfek 1001
Cdd:pfam02463  477 TQLVKLQ--EQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIV--- 551
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1002 vmTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEI------KILLE 1075
Cdd:pfam02463  552 --EVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDdkrakvVEGIL 629
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1076 ESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVT 1155
Cdd:pfam02463  630 KDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREK 709
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1156 SLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQkdeKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKN 1235
Cdd:pfam02463  710 EELKKLKLEAEELLADRVQEAQDKINEELKLLKQ---KIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKL 786
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1236 KAVQTaLDEFKVERELVEKELLEKVKHLENQIaktpafesareDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQN 1315
Cdd:pfam02463  787 KVEEE-KEEKLKAQEEELRALEEELKEEAELL-----------EEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEE 854
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 226693374  1316 VRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRA 1372
Cdd:pfam02463  855 ELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNL 911
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
926-1259 2.59e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 58.91  E-value: 2.59e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   926 QQEKDQKLLEMEKimhtqhcEIKELKqsREMALEDLKKLHdEKIESLRAEFQCLEQNHlKELEDTLHIRHTQEFEKVMTD 1005
Cdd:TIGR02168  208 QAEKAERYKELKA-------ELRELE--LALLVLRLEELR-EELEELQEELKEAEEEL-EELTAELQELEEKLEELRLEV 276
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1006 HNMSleklkkENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEML 1085
Cdd:TIGR02168  277 SELE------EEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELK 350
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1086 KSL--LEQETENLRTEISKLNQKIHDNNESYQvGLSELRALMTIEKDQcISELISRHEEESNILKAELDNvtsLHRQAYE 1163
Cdd:TIGR02168  351 EELesLEAELEELEAELEELESRLEELEEQLE-TLRSKVAQLELQIAS-LNNEIERLEARLERLEDRRER---LQQEIEE 425
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1164 IEKKLKE-QIVELQTRLNselsalekQKDEKITQQEEKYEALIQNLEKDKERLVKNHE--QDKEHLIQELNFEKnKAVQT 1240
Cdd:TIGR02168  426 LLKKLEEaELKELQAELE--------ELEEELEELQEELERLEEALEELREELEEAEQalDAAERELAQLQARL-DSLER 496
                          330
                   ....*....|....*....
gi 226693374  1241 ALDEFKVERELVEKELLEK 1259
Cdd:TIGR02168  497 LQENLEGFSEGVKALLKNQ 515
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
864-1272 5.13e-08

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 57.83  E-value: 5.13e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   864 QELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKdnEFTSIKHEKdaivcVQQEKDQKLLEMEKimhtq 943
Cdd:pfam17380  247 EDVTTMTPEYTVRYNGQTMTENEFLNQLLHIVQHQKAVSERQQQE--KFEKMEQER-----LRQEKEEKAREVER----- 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   944 HCEIKELKQSREMALEDLKKLHDE----------KIESLRAEFQCLEQNHLKELEDTLHIRHTQEfekvmtdhnmsLEKL 1013
Cdd:pfam17380  315 RRKLEEAEKARQAEMDRQAAIYAEqermamererELERIRQEERKRELERIRQEEIAMEISRMRE-----------LERL 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1014 KKENQQ---RIDQMLE-SHASTIQEKEQQLQELKLKVsdlsdmrcklevelalkeaETDEIKILLEESRTQQKEMLKSLL 1089
Cdd:pfam17380  384 QMERQQkneRVRQELEaARKVKILEEERQRKIQQQKV-------------------EMEQIRAEQEEARQREVRRLEEER 444
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1090 EQETENLRTEISKLNQKIhdnnesyqvglselralmtiekdqcisELISRHEEESNILKAELDNVTSLHRQAYEIEKKLK 1169
Cdd:pfam17380  445 AREMERVRLEEQERQQQV---------------------------ERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKIL 497
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1170 EQIVELQTRlnselSALEKQKDEKITQQEekYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVER 1249
Cdd:pfam17380  498 EKELEERKQ-----AMIEEERKRKLLEKE--MEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATEERSRLEA 570
                          410       420
                   ....*....|....*....|...
gi 226693374  1250 ELVEKELLEKVKHLENQIAKTPA 1272
Cdd:pfam17380  571 MEREREMMRQIVESEKARAEYEA 593
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1009-1215 5.91e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 56.70  E-value: 5.91e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKENQQRIDQM---LESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAEtdeikilLEESRTQQKEml 1085
Cdd:COG4942    24 EAEAELEQLQQEIAELekeLAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAE-------LAELEKEIAE-- 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1086 kslLEQETENLRTEISKLNQKI--HDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYE 1163
Cdd:COG4942    95 ---LRAELEAQKEELAELLRALyrLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEA 171
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 226693374 1164 IEKKLKEQIVELQTRLNsELSALEKQKDE---KITQQEEKYEALIQNLEKDKERL 1215
Cdd:COG4942   172 ERAELEALLAELEEERA-ALEALKAERQKllaRLEKELAELAAELAELQQEAEEL 225
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
901-1320 7.57e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 57.08  E-value: 7.57e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  901 LEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKlhdeKIESLRAEFQCLE 980
Cdd:COG4717    47 LLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEA----ELEELREELEKLE 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  981 QnhLKELEDTLHIRhtQEFEKVMTDHNMSLEKLKKENQQRIDQM--LESHASTIQEKEQQLQELKLKVSdlsdmrckLEV 1058
Cdd:COG4717   123 K--LLQLLPLYQEL--EALEAELAELPERLEELEERLEELRELEeeLEELEAELAELQEELEELLEQLS--------LAT 190
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1059 ELALKEAETDEIKilLEESRTQQKEMLKSlLEQETENLRTEISKLNQKIHDNNESYQvgLSELRALMTI-----EKDQCI 1133
Cdd:COG4717   191 EEELQDLAEELEE--LQQRLAELEEELEE-AQEELEELEEELEQLENELEAAALEER--LKEARLLLLIaaallALLGLG 265
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1134 SELISRHEEESNILKAEL----DNVTSLHRQAYEIEKKLKE--QIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQN 1207
Cdd:COG4717   266 GSLLSLILTIAGVLFLVLgllaLLFLLLAREKASLGKEAEElqALPALEELEEEELEELLAALGLPPDLSPEELLELLDR 345
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1208 LEKDKERLVKNHEQDKEHLIQELNFEKN----KAVQTALDEF--KVERELVEKELLEKVKHLENQIAKTPAFESAREDSS 1281
Cdd:COG4717   346 IEELQELLREAEELEEELQLEELEQEIAallaEAGVEDEEELraALEQAEEYQELKEELEELEEQLEELLGELEELLEAL 425
                         410       420       430
                  ....*....|....*....|....*....|....*....
gi 226693374 1282 SLvAELQEKLQEEKAKfLEQLEEQEKRKNEEMQNVRTSL 1320
Cdd:COG4717   426 DE-EELEEELEELEEE-LEELEEELEELREELAELEAEL 462
PTZ00121 PTZ00121
MAEBL; Provisional
828-1366 7.67e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.46  E-value: 7.67e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  828 EQCDLANYLKCTAVEIRNIIEKVKCSLEitlkEKHQQELQSLKIEYECKLDALVKDSEE--NVNKILKLKENLVSLEEAL 905
Cdd:PTZ00121 1388 EEKKKADEAKKKAEEDKKKADELKKAAA----AKKKADEAKKKAEEKKKADEAKKKAEEakKADEAKKKAEEAKKAEEAK 1463
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  906 QNKDNEFTSIKHEKDAivcvqqEKDQKLLEMEKIMHTQHCEIKELKQSRE--MALEDLKKLHDEKI--ESLRAEfqclEQ 981
Cdd:PTZ00121 1464 KKAEEAKKADEAKKKA------EEAKKADEAKKKAEEAKKKADEAKKAAEakKKADEAKKAEEAKKadEAKKAE----EA 1533
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  982 NHLKELEDTLHIRHTQEFEKvmTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLK-VSDLSDMRCKLEVEL 1060
Cdd:PTZ00121 1534 KKADEAKKAEEKKKADELKK--AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEeVMKLYEEEKKMKAEE 1611
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1061 ALKEaetDEIKILLEESRTQQKEmlKSLLEQETENLRTEISKLNQkIHDNNESYQVGLSELRAlmTIEKDQCISELISRH 1140
Cdd:PTZ00121 1612 AKKA---EEAKIKAEELKKAEEE--KKKVEQLKKKEAEEKKKAEE-LKKAEEENKIKAAEEAK--KAEEDKKKAEEAKKA 1683
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1141 EEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHE 1220
Cdd:PTZ00121 1684 EEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK 1763
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1221 QDKEHLIQELNFEKNKAVQTALDEFKVEREL-VEKELLEKVKHLENQI----AKTPAFESAREDSSSLVAEL--QEKLQE 1293
Cdd:PTZ00121 1764 KEEEKKAEEIRKEKEAVIEEELDEEDEKRRMeVDKKIKDIFDNFANIIeggkEGNLVINDSKEMEDSAIKEVadSKNMQL 1843
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1294 EKAKFLEQLEEQEKRKNEEMQNvrtsliaeQQTNFNtvltREKMRKEniinDLSDKLKSTMQQQERDKDLIES 1366
Cdd:PTZ00121 1844 EEADAFEKHKFNKNNENGEDGN--------KEADFN----KEKDLKE----DDEEEIEEADEIEKIDKDDIER 1900
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
872-1389 1.07e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 56.99  E-value: 1.07e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   872 EYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKL--LEMEKIMHTQ------ 943
Cdd:TIGR02168  236 ELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIsrLEQQKQILRErlanle 315
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   944 ------HCEIKELKQSREMALEDLKKLhDEKIESLRAEFQcLEQNHLKELEDTLhirhtQEFEKVMTDHNMSLEKLKKEN 1017
Cdd:TIGR02168  316 rqleelEAQLEELESKLDELAEELAEL-EEKLEELKEELE-SLEAELEELEAEL-----EELESRLEELEEQLETLRSKV 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1018 QQRIDQmLESHASTIQEKEQQLQELKLKVSDLSDMRckLEVELALKEAETDEIKILLEESRTQQKEMLKSL--LEQETEN 1095
Cdd:TIGR02168  389 AQLELQ-IASLNNEIERLEARLERLEDRRERLQQEI--EELLKKLEEAELKELQAELEELEEELEELQEELerLEEALEE 465
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1096 LRTEISKLNQKIHDNNESYQVGLSELRALMTIEK----------------------DQCISELISRHEEesniLKAELDN 1153
Cdd:TIGR02168  466 LREELEEAEQALDAAERELAQLQARLDSLERLQEnlegfsegvkallknqsglsgiLGVLSELISVDEG----YEAAIEA 541
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1154 VTSLHRQAYEIEKK---------LKEqiVELQTRLNSELSALEKQK----DEKITQQEEKYEALIQNLEKDKERLVKNHE 1220
Cdd:TIGR02168  542 ALGGRLQAVVVENLnaakkaiafLKQ--NELGRVTFLPLDSIKGTEiqgnDREILKNIEGFLGVAKDLVKFDPKLRKALS 619
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1221 ------------QDKEHLIQELNFEKN-----------------KAVQTALDEFKVERELveKELLEKVKHLENQIAKTp 1271
Cdd:TIGR02168  620 yllggvlvvddlDNALELAKKLRPGYRivtldgdlvrpggvitgGSAKTNSSILERRREI--EELEEKIEELEEKIAEL- 696
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1272 afESAREDssslVAELQEKLQEEKAKFLEQLEEQEKRKNE-EMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKL 1350
Cdd:TIGR02168  697 --EKALAE----LRKELEELEEELEQLRKELEELSRQISAlRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERL 770
                          570       580       590
                   ....*....|....*....|....*....|....*....
gi 226693374  1351 KSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVSKLR 1389
Cdd:TIGR02168  771 EEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELR 809
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
997-1363 2.90e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 55.51  E-value: 2.90e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   997 QEFEKVMTDHNMSLEKLkkenQQRIDQMLESHA-------STIQEKEQQLQELKLKVSDLSDMRcklevelaLKEAETDE 1069
Cdd:pfam15921   74 EHIERVLEEYSHQVKDL----QRRLNESNELHEkqkfylrQSVIDLQTKLQEMQMERDAMADIR--------RRESQSQE 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1070 IKILLEESRTQQKEMLKSLLEQETENLRTEISKLNQKIHdnneSYQVGLSELRALMTiekdqciselisRHEEESNILKA 1149
Cdd:pfam15921  142 DLRNQLQNTVHELEAAKCLKEDMLEDSNTQIEQLRKMML----SHEGVLQEIRSILV------------DFEEASGKKIY 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1150 ELDNVTSLHRQAY--EIEKKLKEQIVELqTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLI 1227
Cdd:pfam15921  206 EHDSMSTMHFRSLgsAISKILRELDTEI-SYLKGRIFPVEDQLEALKSESQNKIELLLQQHQDRIEQLISEHEVEITGLT 284
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1228 qelnfEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAktpafesareDSSSLVAELQEKLQEEKAKFLEQLEEQEK 1307
Cdd:pfam15921  285 -----EKASSARSQANSIQSQLEIIQEQARNQNSMYMRQLS----------DLESTVSQLRSELREAKRMYEDKIEELEK 349
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 226693374  1308 R---KNEEMQNVRTS--LIAEQQTNFNTVLtrekmrkENIINDLSDKLKSTMQQQERDKDL 1363
Cdd:pfam15921  350 QlvlANSELTEARTErdQFSQESGNLDDQL-------QKLLADLHKREKELSLEKEQNKRL 403
COG5022 COG5022
Myosin heavy chain [General function prediction only];
860-1371 3.69e-07

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 55.47  E-value: 3.69e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  860 EKHQQELQSLKIEYEcKLDALVKDSEENVNKILKLKENLVSLE-EALQNKDNEFTSIKHEKDAI-VCVQQEKDQKLLEME 937
Cdd:COG5022   878 ELAERQLQELKIDVK-SISSLKLVNLELESEIIELKKSLSSDLiENLEFKTELIARLKKLLNNIdLEEGPSIEYVKLPEL 956
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  938 KIMHTQHCEIKELKQSREMALEDLKKLHDEKI----ESLR-----AEFQCLE---QNHLKELEDTLHI--RHTQEFEKVM 1003
Cdd:COG5022   957 NKLHEVESKLKETSEEYEDLLKKSTILVREGNkansELKNfkkelAELSKQYgalQESTKQLKELPVEvaELQSASKIIS 1036
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1004 TDHNmSLEKLKKENQQRIDQMLEshastIQEKEQQLQELKLKVSDLSDMRCKLE----VELALKEAETDEIKILLEESRT 1079
Cdd:COG5022  1037 SEST-ELSILKPLQKLKGLLLLE-----NNQLQARYKALKLRRENSLLDDKQLYqlesTENLLKTINVKDLEVTNRNLVK 1110
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1080 QQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQC-------------ISELISRHE----- 1141
Cdd:COG5022  1111 PANVLQFIVAQMIKLNLLQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEAnlealpspppfaaLSEKRLYQSalyde 1190
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1142 ------EESNILKAELDNVTSLHRQAYEIEKKLKEQIVE--LQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKE 1213
Cdd:COG5022  1191 ksklssSEVNDLKNELIALFSKIFSGWPRGDKLKKLISEgwVPTEYSTSLKGFNNLNKKFDTPASMSNEKLLSLLNSIDN 1270
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1214 RLVKNHEQDKEH--------------LIQELNFEKNKAVQTALDEFKVEREL-----VEKELLEKVKHLENQIAKTPAFE 1274
Cdd:COG5022  1271 LLSSYKLEEEVLpatinsllqyinvgLFNALRTKASSLRWKSATEVNYNSEElddwcREFEISDVDEELEELIQAVKVLQ 1350
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1275 sAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKR-------KNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLS 1347
Cdd:COG5022  1351 -LLKDDLNKLDELLDACYSLNPAEIQNLKSRYDPadkennlPKEILKKIEALLIKQELQLSLEGKDETEVHLSEIFSEEK 1429
                         570       580
                  ....*....|....*....|....*..
gi 226693374 1348 DKL---KSTMQQQERDKDLIESLSEDR 1371
Cdd:COG5022  1430 SLIsldRNSIYKEEVLSSLSALLTKEK 1456
PTZ00121 PTZ00121
MAEBL; Provisional
840-1475 4.27e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 55.15  E-value: 4.27e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  840 AVEIRNIIEKVKCSLEITLKEKHQQElQSLKIEYECKLDALVKDSEEnvnkiLKLKENLVSLEEAlqnKDNEFTSIKHEK 919
Cdd:PTZ00121 1196 AEDARKAEAARKAEEERKAEEARKAE-DAKKAEAVKKAEEAKKDAEE-----AKKAEEERNNEEI---RKFEEARMAHFA 1266
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  920 DAIVCVQQEKDQKLLEMEKIMHTQHCEikELKQSREM-ALEDLKKLHDEKieslraefqcleqnhlkeledtlhiRHTQE 998
Cdd:PTZ00121 1267 RRQAAIKAEEARKADELKKAEEKKKAD--EAKKAEEKkKADEAKKKAEEA-------------------------KKADE 1319
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  999 FEKVMTDHNMSLEKLKK--ENQQRIDQMLESHASTIQEKEQQLQElKLKVSDLSDMRCKLEVELALKEAE----TDEIKI 1072
Cdd:PTZ00121 1320 AKKKAEEAKKKADAAKKkaEEAKKAAEAAKAEAEAAADEAEAAEE-KAEAAEKKKEEAKKKADAAKKKAEekkkADEAKK 1398
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1073 LLEESRTQQKEMLKSLLEQ----ETENLRTEISKLNQKIHDNNESYQVGLSELRAlmtiEKDQCISELISRHEEESNILK 1148
Cdd:PTZ00121 1399 KAEEDKKKADELKKAAAAKkkadEAKKKAEEKKKADEAKKKAEEAKKADEAKKKA----EEAKKAEEAKKKAEEAKKADE 1474
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1149 A-----ELDNVTSLHRQAYEIEKKLKE--QIVELQTRLNSELSALEKQKDEKITQQEEKYEA--LIQNLEKDKERLVKNH 1219
Cdd:PTZ00121 1475 AkkkaeEAKKADEAKKKAEEAKKKADEakKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKAdeAKKAEEKKKADELKKA 1554
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1220 EQDKEHLIQELNFEKNKAVQTALDEFKVEREL--VEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAK 1297
Cdd:PTZ00121 1555 EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAkkAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1298 F--LEQLEEQEKRKNEEMQNvrtsliAEQQtnfNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLL 1375
Cdd:PTZ00121 1635 VeqLKKKEAEEKKKAEELKK------AEEE---NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1376 EEKKQLEEEVSKlrtssflsSAPVAAAPELYGACAPELPGEPERSvMETADEGRLDSAMETSMMsvQENMLSEEKQRIML 1455
Cdd:PTZ00121 1706 ELKKKEAEEKKK--------AEELKKAEEENKIKAEEAKKEAEED-KKKAEEAKKDEEEKKKIA--HLKKEEEKKAEEIR 1774
                         650       660
                  ....*....|....*....|
gi 226693374 1456 LERTLQLKEEENKRLNQRLM 1475
Cdd:PTZ00121 1775 KEKEAVIEEELDEEDEKRRM 1794
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
852-1216 4.49e-07

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 54.68  E-value: 4.49e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  852 CSLEITlkEKHQQELQSlkiEYECKLDALVKDSEENVNKILKLKENLVSLEEALqnkdneftsikhekdaivcvqqEKDQ 931
Cdd:PRK03918  441 CGRELT--EEHRKELLE---EYTAELKRIEKELKEIEEKERKLRKELRELEKVL----------------------KKES 493
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  932 KLLEMEKIMHtqhcEIKELKQS-REMALEDLKKlHDEKIESLRAEFQCLEQNhLKELEDTLhiRHTQEFEKVMTDHNMSL 1010
Cdd:PRK03918  494 ELIKLKELAE----QLKELEEKlKKYNLEELEK-KAEEYEKLKEKLIKLKGE-IKSLKKEL--EKLEELKKKLAELEKKL 565
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1011 EKLKKENQQRIDQMLESHASTIQEKEQQLQELK--------LKVS-----DLSDMRCKLEVELALKEAETDEIKILLEES 1077
Cdd:PRK03918  566 DELEEELAELLKELEELGFESVEELEERLKELEpfyneyleLKDAekeleREEKELKKLEEELDKAFEELAETEKRLEEL 645
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1078 RTQQKEMLKSLLEQETENLRTEISKLNQKIhdnnESYQVGLSELRALM-----TIEKDQCISELISRHEEESNILKAELD 1152
Cdd:PRK03918  646 RKELEELEKKYSEEEYEELREEYLELSREL----AGLRAELEELEKRReeikkTLEKLKEELEEREKAKKELEKLEKALE 721
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 226693374 1153 NVTslhrqayEIEKKLKEQIVELQTRLNSELSALEKQKDEKITqqEEKYEALIQNLEKDKERLV 1216
Cdd:PRK03918  722 RVE-------ELREKVKKYKALLKERALSKVGEIASEIFEELT--EGKYSGVRVKAEENKVKLF 776
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
926-1308 5.11e-07

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 54.13  E-value: 5.11e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   926 QQEKDQKLLEMEKIMHTQHCEIKELKQSREmALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTlhiRHTQEFEKVMTD 1005
Cdd:pfam07888   40 LQERAELLQAQEAANRQREKEKERYKRDRE-QWERQRRELESRVAELKEELRQSREKHEELEEKY---KELSASSEELSE 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1006 HNMSLEKLKKENQQRIDQMLESHASTIQ---EKEQQLQELKLKVSDLSDMRCKLEVElalKEAETDEIKILLEESRTQQK 1082
Cdd:pfam07888  116 EKDALLAQRAAHEARIRELEEDIKTLTQrvlERETELERMKERAKKAGAQRKEEEAE---RKQLQAKLQQTEEELRSLSK 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1083 EM--LKSLLEQ---ETENLRTEISKLNQKI---HDNNESYQVGLSELRALmtiekdqciSELISRHEEESNILKAELDNV 1154
Cdd:pfam07888  193 EFqeLRNSLAQrdtQVLQLQDTITTLTQKLttaHRKEAENEALLEELRSL---------QERLNASERKVEGLGEELSSM 263
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1155 TSLHRQAYEIEKKLKEQIVELQTRLNSELSALekqKDEKITQQEEKyEALIQNLEKDKERlvknheqdkehlIQELNFEK 1234
Cdd:pfam07888  264 AAQRDRTQAELHQARLQAAQLTLQLADASLAL---REGRARWAQER-ETLQQSAEADKDR------------IEKLSAEL 327
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 226693374  1235 NKAvQTALDEFKVERELVEKEL-LEKVKhleNQIAKTPAFESAREDSSSLVAELQEK--LQEEKAKFLEQLEEQEKR 1308
Cdd:pfam07888  328 QRL-EERLQEERMEREKLEVELgREKDC---NRVQLSESRRELQELKASLRVAQKEKeqLQAEKQELLEYIRQLEQR 400
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
865-1302 5.99e-07

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 54.28  E-value: 5.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  865 ELQSLKIE-YECKLDALVKDSEEnvnkilkLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQ 943
Cdd:PRK02224  303 GLDDADAEaVEARREELEDRDEE-------LRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEA 375
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  944 HCEIKElkqsREMALEDLkklhDEKIESLRAEFQCLE------QNHLKELEDTLHIRHTQEFEKVMTDHNmsLEKLKKEN 1017
Cdd:PRK02224  376 REAVED----RREEIEEL----EEEIEELRERFGDAPvdlgnaEDFLEELREERDELREREAELEATLRT--ARERVEEA 445
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1018 QQRID--------QMLE--SHASTIQEKEQQLQELKLKVSDLSDMRCKLEVEL----ALKEAET-----DEIKILLEESR 1078
Cdd:PRK02224  446 EALLEagkcpecgQPVEgsPHVETIEEDRERVEELEAELEDLEEEVEEVEERLeraeDLVEAEDrierlEERREDLEELI 525
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1079 TQQKEMLKSLLEQeTENLRTEISKLNQKIHDNNESYQvglsELRalmtIEKDQCISElISRHEEESNILKAELDNVTSLH 1158
Cdd:PRK02224  526 AERRETIEEKRER-AEELRERAAELEAEAEEKREAAA----EAE----EEAEEAREE-VAELNSKLAELKERIESLERIR 595
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1159 RQAYEIEkKLKEQIVELQTRLnSELSALEKQ-------KDEKITQQEEKY-EALIQNLEKDKERLVKNHEQDKEHLiQEL 1230
Cdd:PRK02224  596 TLLAAIA-DAEDEIERLREKR-EALAELNDErrerlaeKRERKRELEAEFdEARIEEAREDKERAEEYLEQVEEKL-DEL 672
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1231 NfEKNKAVQTALDefKVERELVE-KELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQL 1302
Cdd:PRK02224  673 R-EERDDLQAEIG--AVENELEElEELRERREALENRVEALEALYDEAEELESMYGDLRAELRQRNVETLERM 742
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1030-1241 6.69e-07

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 53.68  E-value: 6.69e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1030 STIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEmlkslLEQETENLRTEISKLNQKIHD 1109
Cdd:COG3883    16 PQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDK-----LQAEIAEAEAEIEERREELGE 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1110 NNESYQVGLSELRALMTIEKDQCISELISRheeesniLKAeLDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQ 1189
Cdd:COG3883    91 RARALYRSGGSVSYLDVLLGSESFSDFLDR-------LSA-LSKIADADADLLEELKADKAELEAKKAELEAKLAELEAL 162
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1190 KDEKITQQEE------KYEALIQNLEKDKERLvknhEQDKEHLIQELNFEKNKAVQTA 1241
Cdd:COG3883   163 KAELEAAKAEleaqqaEQEALLAQLSAEEAAA----EAQLAELEAELAAAEAAAAAAA 216
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
790-1475 8.28e-07

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 54.20  E-value: 8.28e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   790 RAALHVQLEKCRAaaQDSHSSIQTIKDDLCHFRTFVQKEQCDLANYL-KCTAVEIRNIIEKVKCSLEITLKEKHQQ---- 864
Cdd:TIGR00618  202 RSQLLTLCTPCMP--DTYHERKQVLEKELKHLREALQQTQQSHAYLTqKREAQEEQLKKQQLLKQLRARIEELRAQeavl 279
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   865 ELQSLKIEYECKLDALVKDSEenvnKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIV------CVQQEKDQKLLEMEK 938
Cdd:TIGR00618  280 EETQERINRARKAAPLAAHIK----AVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVkqqssiEEQRRLLQTLHSQEI 355
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   939 IMHTQHCEIKELKQSREMALEDLKKLH--DEKIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKlKKE 1016
Cdd:TIGR00618  356 HIRDAHEVATSIREISCQQHTLTQHIHtlQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKK-QQE 434
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1017 NQQRIDQMLESHAstiqekEQQLQELKLKVSDLSDMRCKLEvelALKEAETDEIKILLEESRTQQKEMLKSLLEQETEN- 1095
Cdd:TIGR00618  435 LQQRYAELCAAAI------TCTAQCEKLEKIHLQESAQSLK---EREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCp 505
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1096 LRTEISKLNQKIHDNNES------YQVGLSELRALMTIEKD---QCISEL--ISRHEEESNILKAELDNVTSLHRQAYEI 1164
Cdd:TIGR00618  506 LCGSCIHPNPARQDIDNPgpltrrMQRGEQTYAQLETSEEDvyhQLTSERkqRASLKEQMQEIQQSFSILTQCDNRSKED 585
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1165 EKKLKEQIVELQtRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKE--HLIQ---ELNFEKNKAVQ 1239
Cdd:TIGR00618  586 IPNLQNITVRLQ-DLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKltALHAlqlTLTQERVREHA 664
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1240 TALDEFKVERELVEKELLEKVKHLENQIakTPAFESaredssslVAELQEKLQEEkakflEQLEEQEKRKNEEMQNVRTS 1319
Cdd:TIGR00618  665 LSIRVLPKELLASRQLALQKMQSEKEQL--TYWKEM--------LAQCQTLLREL-----ETHIEEYDREFNEIENASSS 729
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1320 LIAEQQTNFNTVltrekmrkENIINDLSDKLKSTMQQQERD---KDLIESLSEDRARLLEEKKQLEEEVSKLRTSSFLSS 1396
Cdd:TIGR00618  730 LGSDLAAREDAL--------NQSLKELMHQARTVLKARTEAhfnNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLL 801
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1397 APVAAAPELYgacAPELPGEPERSVMETADEgrldsametsmmsvQENMLS--EEKQRIMLLERTLQLKEEENKRLNQRL 1474
Cdd:TIGR00618  802 KTLEAEIGQE---IPSDEDILNLQCETLVQE--------------EEQFLSrlEEKSATLGEITHQLLKYEECSKQLAQL 864

                   .
gi 226693374  1475 M 1475
Cdd:TIGR00618  865 T 865
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
880-1373 1.15e-06

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 53.69  E-value: 1.15e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   880 LVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHceiKELKQSREMALE 959
Cdd:pfam12128  232 AIAGIMKIRPEFTKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLD---DQWKEKRDELNG 308
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   960 DLKKLhDEKIESLRAEFQCLEQNHLKELEDTLHIRHtQEFEkvmtdhNMSLEKLKKENQQRIDQMLESHASTIQEKeqqL 1039
Cdd:pfam12128  309 ELSAA-DAAVAKDRSELEALEDQHGAFLDADIETAA-ADQE------QLPSWQSELENLEERLKALTGKHQDVTAK---Y 377
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1040 QELKLKVSDlsdmrcKLEVELALKEAETDEIKilleESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLS 1119
Cdd:pfam12128  378 NRRRSKIKE------QNNRDIAGIKDKLAKIR----EARDRQLAVAEDDLQALESELREQLEAGKLEFNEEEYRLKSRLG 447
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1120 ELRALM---TIEKDQCIS-----ELISRHEEESNILKAELDNVTSLHRQAyeieKKLKEQIVELQTRLNSELSALEKQKD 1191
Cdd:pfam12128  448 ELKLRLnqaTATPELLLQlenfdERIERAREEQEAANAEVERLQSELRQA----RKRRDQASEALRQASRRLEERQSALD 523
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1192 EKITQQEEKYEALIQNLEKD----KERLVK------------NHEQDKEHLIQELNF----------------------- 1232
Cdd:pfam12128  524 ELELQLFPQAGTLLHFLRKEapdwEQSIGKvispellhrtdlDPEVWDGSVGGELNLygvkldlkridvpewaaseeelr 603
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1233 EKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTP----AFESAREDSSSLVAELQEklqeEKAKFLEQLEEQEKR 1308
Cdd:pfam12128  604 ERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETfartALKNARLDLRRLFDEKQS----EKDKKNKALAERKDS 679
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374  1309 KNEEmqnvRTSLIAEQQTNFNTVLTREKMRKEniindlsDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:pfam12128  680 ANER----LNSLEAQLKQLDKKHQAWLEEQKE-------QKREARTEKQAYWQVVEGALDAQLAL 733
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
947-1373 1.23e-06

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 53.51  E-value: 1.23e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  947 IKELKQSREMALEDLKKLHDEKIE-SLRAEFQCLEQNhLKELEDTLHIRHTQ---------EFEKVMTDHNMSLEKLKkE 1016
Cdd:PRK02224  178 VERVLSDQRGSLDQLKAQIEEKEEkDLHERLNGLESE-LAELDEEIERYEEQreqaretrdEADEVLEEHEERREELE-T 255
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1017 NQQRIDQMLESHASTIQEKE---QQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKSLLEQ-- 1091
Cdd:PRK02224  256 LEAEIEDLRETIAETEREREelaEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECrv 335
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1092 -------ETENLRTEISKL---NQKIHDNNESYQVGLSELRALMTIEKDQcISEL---ISRHEEESNILKAELDNVTSLH 1158
Cdd:PRK02224  336 aaqahneEAESLREDADDLeerAEELREEAAELESELEEAREAVEDRREE-IEELeeeIEELRERFGDAPVDLGNAEDFL 414
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1159 RQAYEIEKKLKEQIVELQTRLNSELSALEKQKD-----------------------EKITQQEEKYEALIQNLEKDKERL 1215
Cdd:PRK02224  415 EELREERDELREREAELEATLRTARERVEEAEAlleagkcpecgqpvegsphvetiEEDRERVEELEAELEDLEEEVEEV 494
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1216 VKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEK 1295
Cdd:PRK02224  495 EERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEV 574
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1296 AKFLEQLEEQEKRKnEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLS-DKLKStmqQQERDKDL--------IES 1366
Cdd:PRK02224  575 AELNSKLAELKERI-ESLERIRTLLAAIADAEDEIERLREKREALAELNDERrERLAE---KRERKRELeaefdearIEE 650

                  ....*..
gi 226693374 1367 LSEDRAR 1373
Cdd:PRK02224  651 AREDKER 657
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
967-1246 1.26e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 53.38  E-value: 1.26e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  967 EKIESLRAEFQCLEQnHLKELEDTLHIRHTQefEKVMTDHNMSLEKLKKENQQRIDqmLESHASTIQEKEQQLQELKLKV 1046
Cdd:COG4913   610 AKLAALEAELAELEE-ELAEAEERLEALEAE--LDALQERREALQRLAEYSWDEID--VASAEREIAELEAELERLDASS 684
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1047 SDLSdmrcKLEVELALKEAETDEIKILLEESRTQQKEmlkslLEQETENLRTEISKLNQKIHDNNESYQVGLSElralmt 1126
Cdd:COG4913   685 DDLA----ALEEQLEELEAELEELEEELDELKGEIGR-----LEKELEQAEEELDELQDRLEAAEDLARLELRA------ 749
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1127 iekdqcisELISRHEEesnilkAELDNVTSlhrqayEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQE-------- 1198
Cdd:COG4913   750 --------LLEERFAA------ALGDAVER------ELRENLEERIDALRARLNRAEEELERAMRAFNREWPaetadlda 809
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1199 -----EKYEALIQNLEKD---------KERLVKNHEQDKEHLIQELnfekNKAVQTALDEFK 1246
Cdd:COG4913   810 dleslPEYLALLDRLEEDglpeyeerfKELLNENSIEFVADLLSKL----RRAIREIKERID 867
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1074-1323 1.63e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 53.00  E-value: 1.63e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1074 LEESRTqqKEMLKSLLEQ--ETENLRTEISKLNQKIH------DNNESYQVGLSELRALmtiekDQCISEL-ISRHEEES 1144
Cdd:COG4913   218 LEEPDT--FEAADALVEHfdDLERAHEALEDAREQIEllepirELAERYAAARERLAEL-----EYLRAALrLWFAQRRL 290
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1145 NILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNselsALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQdke 1224
Cdd:COG4913   291 ELLEAELEELRAELARLEAELERLEARLDALREELD----ELEAQIRGNGGDRLEQLEREIERLERELEERERRRAR--- 363
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1225 hliqelnfeknkaVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESA-REDSSSLVAELQEkLQEEKAKFLEQLE 1303
Cdd:COG4913   364 -------------LEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEAlEEALAEAEAALRD-LRRELRELEAEIA 429
                         250       260
                  ....*....|....*....|...
gi 226693374 1304 EQEKRKN---EEMQNVRTSLIAE 1323
Cdd:COG4913   430 SLERRKSnipARLLALRDALAEA 452
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
893-1371 1.86e-06

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 52.90  E-value: 1.86e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   893 KLKENlVSLEEALQNKdNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDlkklHDEKIESL 972
Cdd:pfam10174  204 KEKEN-IHLREELHRR-NQLQPDPAKTKALQTVIEMKDTKISSLERNIRDLEDEVQMLKTNGLLHTED----REEEIKQM 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   973 R-----AEFQcleQNHLKELEDTLHIRHTQefekvMTDHNMSLEKLKKEN---QQRIDQMLESHAStiqeKEQQLQELKL 1044
Cdd:pfam10174  278 EvykshSKFM---KNKIDQLKQELSKKESE-----LLALQTKLETLTNQNsdcKQHIEVLKESLTA----KEQRAAILQT 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1045 KVSDLsdmRCKLEVELALKEAETDEIKILLEESRTQQKEM--LKSLL---EQETENLRTEISKLNQKIHDNNESyqvgLS 1119
Cdd:pfam10174  346 EVDAL---RLRLEEKESFLNKKTKQLQDLTEEKSTLAGEIrdLKDMLdvkERKINVLQKKIENLQEQLRDKDKQ----LA 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1120 ELR---------------ALMTIE-----KDQCISELISRHEEESnilKAELDNVTSLHRQAYEIEKKLKEQIVELQTRL 1179
Cdd:pfam10174  419 GLKervkslqtdssntdtALTTLEealseKERIIERLKEQRERED---RERLEELESLKKENKDLKEKVSALQPELTEKE 495
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1180 NSELSALEK---------QKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKE------------HLIQELNFEKNKAV 1238
Cdd:pfam10174  496 SSLIDLKEHasslassglKKDSKLKSLEIAVEQKKEECSKLENQLKKAHNAEEAvrtnpeindrirLLEQEVARYKEESG 575
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1239 QTALDefkVERELVEKELLEKVKHLENQiaKTPAFESAR----EDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQ 1314
Cdd:pfam10174  576 KAQAE---VERLLGILREVENEKNDKDK--KIAELESLTlrqmKEQNKKVANIKHGQQEMKKKGAQLLEEARRREDNLAD 650
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374  1315 NVRTSLIAEqqtnfnTVLTREKMRKEniINDLSDKLKSTMQQ-QERDKDLiESLSEDR 1371
Cdd:pfam10174  651 NSQQLQLEE------LMGALEKTRQE--LDATKARLSSTQQSlAEKDGHL-TNLRAER 699
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
866-1242 2.09e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 52.81  E-value: 2.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   866 LQSLKIEYECKLD---ALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHT 942
Cdd:pfam15921  435 LKAMKSECQGQMErqmAAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEA 514
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   943 QHCEIKELKQSREMALEDLKKLHDEK--IESLRAEFQCLEQnHLKELEDTLHIRHtQEFEKVM--------TDHNMSLEK 1012
Cdd:pfam15921  515 TNAEITKLRSRVDLKLQELQHLKNEGdhLRNVQTECEALKL-QMAEKDKVIEILR-QQIENMTqlvgqhgrTAGAMQVEK 592
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1013 --LKKE-NQQRIDqmLESHASTIQEKEQQLQELKLKVSDLSDMRCKL-----EVELALKEAETDEIKILLE--------E 1076
Cdd:pfam15921  593 aqLEKEiNDRRLE--LQEFKILKDKKDAKIRELEARVSDLELEKVKLvnagsERLRAVKDIKQERDQLLNEvktsrnelN 670
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1077 SRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRAL-----------MTIEKD--------------- 1130
Cdd:pfam15921  671 SLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMegsdghamkvaMGMQKQitakrgqidalqski 750
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1131 QCISELIS-----RH--EEESNILKAELDNV-TSLHRQAYEI------EKKLKEQIVELQTRLnselsalekqkdEKITQ 1196
Cdd:pfam15921  751 QFLEEAMTnankeKHflKEEKNKLSQELSTVaTEKNKMAGELevlrsqERRLKEKVANMEVAL------------DKASL 818
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 226693374  1197 QEEKYEALIQNLEKDKERLVKNHEQDKEHLiQELNFEKNKAVQTAL 1242
Cdd:pfam15921  819 QFAECQDIIQRQEQESVRLKLQHTLDVKEL-QGPGYTSNSSMKPRL 863
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1065-1373 2.40e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 52.67  E-value: 2.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1065 AETDEIKILLEESRtQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMtiEKDQCISELISRHEEES 1144
Cdd:pfam02463  163 AGSRLKRKKKEALK-KLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELE--EEYLLYLDYLKLNEERI 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1145 NILKAELdNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKE 1224
Cdd:pfam02463  240 DLLQELL-RDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKE 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1225 HLIQELNFEKN-KAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLE 1303
Cdd:pfam02463  319 SEKEKKKAEKElKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELE 398
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374  1304 EQEKRKNE--------EMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:pfam02463  399 LKSEEEKEaqlllelaRQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKE 476
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1010-1373 2.45e-06

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 52.08  E-value: 2.45e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1010 LEKLKKENQQRIDQMLESHAST--IQEKEQQLQELKLKVSDLSDMRCKLEVELALKE--AETDEIKILLEESRTQQKEML 1085
Cdd:COG4717    73 LKELEEELKEAEEKEEEYAELQeeLEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLEELE 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1086 KSLleQETENLRTEISKLNQKIHDNNESyqvgLSELRALMTIEKDQCISELISRHEEesniLKAELdnvtslhRQAYEIE 1165
Cdd:COG4717   153 ERL--EELRELEEELEELEAELAELQEE----LEELLEQLSLATEEELQDLAEELEE----LQQRL-------AELEEEL 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1166 KKLKEQIVELQTRLNS-ELSALEKQKDEKITQQEEKYEAL--------------------------------IQNLEKDK 1212
Cdd:COG4717   216 EEAQEELEELEEELEQlENELEAAALEERLKEARLLLLIAaallallglggsllsliltiagvlflvlgllaLLFLLLAR 295
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1213 ERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDS--SSLVAELQEK 1290
Cdd:COG4717   296 EKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELqlEELEQEIAAL 375
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1291 LQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIiNDLSDKLKSTMQQQERDKDLIESLSED 1370
Cdd:COG4717   376 LAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDE-EELEEELEELEEELEELEEELEELREE 454

                  ...
gi 226693374 1371 RAR 1373
Cdd:COG4717   455 LAE 457
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1085-1389 3.46e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 51.99  E-value: 3.46e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1085 LKSLLEQETENLRTEISK---LNQKIHDNNESYQVGLSELRALMTIEKDqcISELISRHEEESNILKAELDNVTSLHRQA 1161
Cdd:PRK03918  170 VIKEIKRRIERLEKFIKRtenIEELIKEKEKELEEVLREINEISSELPE--LREELEKLEKEVKELEELKEEIEELEKEL 247
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1162 YEIEK---KLKEQIVELQTRLN---SELSALEKQKDE--KITQQEEKYEALIQNLEKDKERLvKNHEQDKEHLIQELNfe 1233
Cdd:PRK03918  248 ESLEGskrKLEEKIRELEERIEelkKEIEELEEKVKElkELKEKAEEYIKLSEFYEEYLDEL-REIEKRLSRLEEEIN-- 324
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1234 knkAVQTALDEFKVERELVEkELLEKVKHLENQIAKTPAFESAREDSSSLVAELQ----EKLQEEKAKFLEQLEEQEKRK 1309
Cdd:PRK03918  325 ---GIEERIKELEEKEERLE-ELKKKLKELEKRLEELEERHELYEEAKAKKEELErlkkRLTGLTPEKLEKELEELEKAK 400
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1310 NEEMQNVRTsliaeqqtnfntvLTREKMRKENIINDLSD---KLKSTMQQ---------QERDKDLIESLSEDRARLLEE 1377
Cdd:PRK03918  401 EEIEEEISK-------------ITARIGELKKEIKELKKaieELKKAKGKcpvcgreltEEHRKELLEEYTAELKRIEKE 467
                         330
                  ....*....|..
gi 226693374 1378 KKQLEEEVSKLR 1389
Cdd:PRK03918  468 LKEIEEKERKLR 479
PLN03229 PLN03229
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
1026-1329 3.52e-06

acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional


Pssm-ID: 178768 [Multi-domain]  Cd Length: 762  Bit Score: 51.78  E-value: 3.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1026 ESHASTIQEKEQQLQELKLKVSDLSDMRCKLeVELALKEAetdeIKILLEESRTQQKEMLKSL-LEQETENLRTEISKLN 1104
Cdd:PLN03229  425 EAVKTPVRELEGEVEKLKEQILKAKESSSKP-SELALNEM----IEKLKKEIDLEYTEAVIAMgLQERLENLREEFSKAN 499
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1105 -----------QKIHDNNESYQVGLSE----------LRALMTIEKDQCISELISRHE--------------------EE 1143
Cdd:PLN03229  500 sqdqlmhpvlmEKIEKLKDEFNKRLSRapnylslkykLDMLNEFSRAKALSEKKSKAEklkaeinkkfkevmdrpeikEK 579
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1144 SNILKAELDNvtSLHRQAYEIEKKLKEQIVELQTRLNSELSALEK---------QKDEKITQQEEKYEALIQNLEKDKER 1214
Cdd:PLN03229  580 MEALKAEVAS--SGASSGDELDDDLKEKVEKMKKEIELELAGVLKsmglevigvTKKNKDTAEQTPPPNLQEKIESLNEE 657
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1215 LVKNHEQ-----DKEHLIQELNFEKNKAVQTALDEFKverelvekellEKVKHLENQIAKTPAfesAREDSSSLvAELQE 1289
Cdd:PLN03229  658 INKKIERvirssDLKSKIELLKLEVAKASKTPDVTEK-----------EKIEALEQQIKQKIA---EALNSSEL-KEKFE 722
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|
gi 226693374 1290 KLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFN 1329
Cdd:PLN03229  723 ELEAELAAARETAAESNGSLKNDDDKEEDSKEDGSRVEVN 762
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
902-1472 3.92e-06

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 51.71  E-value: 3.92e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   902 EEALQNKDNEFTSIKHekdaivcVQQEKDQKLLEMEKiMHTQHCEIKELKQSREMALEDLKKlhdeKIESLRAEFQCLEQ 981
Cdd:pfam01576    4 EEEMQAKEEELQKVKE-------RQQKAESELKELEK-KHQQLCEEKNALQEQLQAETELCA----EAEEMRARLAARKQ 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   982 nhlkELEDTLHIRHTQEFEKvmTDHNMSLEKLKKENQQRIdQMLESHastIQEKEQQLQELKLKvsdlsdmrcKLEVELA 1061
Cdd:pfam01576   72 ----ELEEILHELESRLEEE--EERSQQLQNEKKKMQQHI-QDLEEQ---LDEEEAARQKLQLE---------KVTTEAK 132
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1062 LKEAEtDEIKILLEESRTQQKEmlKSLLEQETENLRT------EISKLNQKIHDNNEsyqvglselrALMTIEKDQCISE 1135
Cdd:pfam01576  133 IKKLE-EDILLLEDQNSKLSKE--RKLLEERISEFTSnlaeeeEKAKSLSKLKNKHE----------AMISDLEERLKKE 199
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1136 LISRHEEESNILKAELDNVTslhrqayeiekkLKEQIVELQTRLnSELSAlekqkdeKITQQEEKYEALIQNLEKdkERL 1215
Cdd:pfam01576  200 EKGRQELEKAKRKLEGESTD------------LQEQIAELQAQI-AELRA-------QLAKKEEELQAALARLEE--ETA 257
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1216 VKNHEQDKEHLIQELNFEknkaVQTALDEFKVERELVEK------ELLEKVK-HLENQIAKTPAFESAREDSSSLVAELQ 1288
Cdd:pfam01576  258 QKNNALKKIRELEAQISE----LQEDLESERAARNKAEKqrrdlgEELEALKtELEDTLDTTAAQQELRSKREQEVTELK 333
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1289 EKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLiaEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQ----QQERDKDLI 1364
Cdd:pfam01576  334 KALEEETRSHEAQLQEMRQKHTQALEELTEQL--EQAKRNKANLEKAKQALESENAELQAELRTLQQakqdSEHKRKKLE 411
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1365 ESLSEDRARLLEEKKQLEEEVSKLrtssflssapVAAAPELYGACApeLPGEPERSVMETADEGrldSAMETSMMSVQEN 1444
Cdd:pfam01576  412 GQLQELQARLSESERQRAELAEKL----------SKLQSELESVSS--LLNEAEGKNIKLSKDV---SSLESQLQDTQEL 476
                          570       580
                   ....*....|....*....|....*...
gi 226693374  1445 MLSEEKQRIMLLERTLQLKEEENKRLNQ 1472
Cdd:pfam01576  477 LQEETRQKLNLSTRLRQLEDERNSLQEQ 504
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
879-1107 5.16e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 50.53  E-value: 5.16e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  879 ALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMAL 958
Cdd:COG4942    17 AQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELR 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  959 EDLKKLHDEKIESLRAEFQCLEQNHLKELedtLHIRHTQEFEKVMtdhnMSLEKLKKENQQRIDQMleshASTIQEKEQQ 1038
Cdd:COG4942    97 AELEAQKEELAELLRALYRLGRQPPLALL---LSPEDFLDAVRRL----QYLKYLAPARREQAEEL----RADLAELAAL 165
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 226693374 1039 LQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKSL--LEQETENLRTEISKLNQKI 1107
Cdd:COG4942   166 RAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELaeLQQEAEELEALIARLEAEA 236
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1179-1489 5.37e-06

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 51.28  E-value: 5.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1179 LNSELSALEKQKDEKITQQEEKYEALIQnlekdkERLvknhEQDKEHLIQELNF-----EKNKAVQTALDEFKVERELVE 1253
Cdd:pfam17380  271 LNQLLHIVQHQKAVSERQQQEKFEKMEQ------ERL----RQEKEEKAREVERrrkleEAEKARQAEMDRQAAIYAEQE 340
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1254 KELLEKVKHLENQIAKTPAFESAREDSSSLVAELqeklqeEKAKFLEQLEEQEKRKNEEmqnVRTSLIAEQQTNFntvlt 1333
Cdd:pfam17380  341 RMAMERERELERIRQEERKRELERIRQEEIAMEI------SRMRELERLQMERQQKNER---VRQELEAARKVKI----- 406
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1334 REKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLL----EEKKQLEEEVSKLRTSSF----------LSSAPV 1399
Cdd:pfam17380  407 LEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMervrLEEQERQQQVERLRQQEEerkrkkleleKEKRDR 486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1400 AAAPELYGACAPELPGEPERSVMEtadEGRLDSAMETSMMSVQENMLSEEKQRIMLLERTLQLKEEENKRLNQRLMSQSL 1479
Cdd:pfam17380  487 KRAEEQRRKILEKELEERKQAMIE---EERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATE 563
                          330
                   ....*....|
gi 226693374  1480 SSVSSRHSEK 1489
Cdd:pfam17380  564 ERSRLEAMER 573
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1077-1373 1.10e-05

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 50.12  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1077 SRTQQKE----MLKSLLEQETENLRTEISKlNQKIHDNNESYQVGLSELRALMtiekdqciseliSRHEEESNILKAELD 1152
Cdd:pfam17380  285 SERQQQEkfekMEQERLRQEKEEKAREVER-RRKLEEAEKARQAEMDRQAAIY------------AEQERMAMERERELE 351
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1153 NVTSLHRQAyEIEKKLKEQI-VELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELN 1231
Cdd:pfam17380  352 RIRQEERKR-ELERIRQEEIaMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQE 430
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1232 FEKNKAVQTALDEFKVERELVEKELLEKVKHLEnQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNE 1311
Cdd:pfam17380  431 EARQREVRRLEEERAREMERVRLEEQERQQQVE-RLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELEERKQAMI 509
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 226693374  1312 EMQNVRTSLIAEQQTNFNTVLTREKMRKEniinDLSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:pfam17380  510 EEERKRKLLEKEMEERQKAIYEEERRREA----EEERRKQQEMEERRRIQEQMRKATEERSR 567
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
854-1316 1.50e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 49.94  E-value: 1.50e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  854 LEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIvcvqQEKDQKL 933
Cdd:COG1196   358 AELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEEL----EEALAEL 433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  934 LEMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMS-LEK 1012
Cdd:COG1196   434 EEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGvKAA 513
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1013 LKKENQQRIDQMLESHASTIQEKEQQLQE------LKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLK 1086
Cdd:COG1196   514 LLLAGLRGLAGAVAVLIGVEAAYEAALEAalaaalQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALA 593
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1087 SLLEQETENLRTEisklnQKIHDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEK 1166
Cdd:COG1196   594 RGAIGAAVDLVAS-----DLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRR 668
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1167 KLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDK-------EHLIQELNFEKNKAVQ 1239
Cdd:COG1196   669 ELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEAleeqleaEREELLEELLEEEELL 748
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1240 TALDEFKVERELVEKELLEKVKHLENQIAKTPA--------FESAREDSSSLVAELQEkLQEEKakflEQLEEQEKRKNE 1311
Cdd:COG1196   749 EEEALEELPEPPDLEELERELERLEREIEALGPvnllaieeYEELEERYDFLSEQRED-LEEAR----ETLEEAIEEIDR 823

                  ....*
gi 226693374 1312 EMQNV 1316
Cdd:COG1196   824 ETRER 828
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1076-1326 1.51e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 49.06  E-value: 1.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1076 ESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALmtiekdqciselisrhEEESNILKAELDNVT 1155
Cdd:COG3883    15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEAL----------------QAEIDKLQAEIAEAE 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1156 SLHRQAYEiekKLKEQIVELQT--RLNSELSALEKQKD--------EKITQQEEKYEALIQNLEKDKERLvknheqdkeh 1225
Cdd:COG3883    79 AEIEERRE---ELGERARALYRsgGSVSYLDVLLGSESfsdfldrlSALSKIADADADLLEELKADKAEL---------- 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1226 liqelnfeknKAVQTALDEFKVERELVEKELLEKVKHLENQIAKtpafesaREDSSSLVAELQEKLQEEKAKFLEQLEEQ 1305
Cdd:COG3883   146 ----------EAKKAELEAKLAELEALKAELEAAKAELEAQQAE-------QEALLAQLSAEEAAAEAQLAELEAELAAA 208
                         250       260
                  ....*....|....*....|.
gi 226693374 1306 EKRKNEEMQNVRTSLIAEQQT 1326
Cdd:COG3883   209 EAAAAAAAAAAAAAAAAAAAA 229
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
983-1344 1.60e-05

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 50.05  E-value: 1.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   983 HLKELEDTLHIRHTQEFEKVMTDHNMSLE---KLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVE 1059
Cdd:TIGR01612  501 RMKDFKDIIDFMELYKPDEVPSKNIIGFDidqNIKAKLYKEIEAGLKESYELAKNWKKLIHEIKKELEEENEDSIHLEKE 580
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1060 ---LALKEAETDEIKILLEESRTQQKEMLKSLLEQeTENLRTEISKlnQKIHDNNESYQVGLSELRALMTIE----KDQC 1132
Cdd:TIGR01612  581 ikdLFDKYLEIDDEIIYINKLKLELKEKIKNISDK-NEYIKKAIDL--KKIIENNNAYIDELAKISPYQVPEhlknKDKI 657
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1133 ISELISrheEESNILKAELDnvtSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDK 1212
Cdd:TIGR01612  658 YSTIKS---ELSKIYEDDID---ALYNELSSIVKENAIDNTEDKAKLDDLKSKIDKEYDKIQNMETATVELHLSNIENKK 731
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1213 ERLVKNHEQDKEHLIQELNFEKNKAVQtaldEFKVErelvEKELLEKVKHLENQIAKTPAFESAredssslVAELQEKLQ 1292
Cdd:TIGR01612  732 NELLDIIVEIKKHIHGEINKDLNKILE----DFKNK----EKELSNKINDYAKEKDELNKYKSK-------ISEIKNHYN 796
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 226693374  1293 EEKAkfLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIIN 1344
Cdd:TIGR01612  797 DQIN--IDNIKDEDAKQNYDKSKEYIKTISIKEDEIFKIINEMKFMKDDFLN 846
CCDC73 pfam15818
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ...
958-1343 2.24e-05

Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.


Pssm-ID: 464893 [Multi-domain]  Cd Length: 1048  Bit Score: 49.17  E-value: 2.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   958 LEDLKKLHDEKIESLR----AEFQ--------CLEQNHLKELEDTLHirhtQEFEKVMTDHNMSLEKLKKENQQRIDQML 1025
Cdd:pfam15818    2 LLDFKTSLLEALEELRmrreAETQyeeqigkiIVETQELKWQKETLQ----NQKETLAKQHKEAMAVFKKQLQMKMCALE 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1026 ESH-----ASTIQEKE-QQLQE-LK-LKVSD------LSDMRCKLEVELALKEaetDEIKILLEESR-----TQQKEMLK 1086
Cdd:pfam15818   78 EEKgkyqlATEIKEKEiEGLKEtLKaLQVSKyslqkkVSEMEQKLQLHLLAKE---DHHKQLNEIEKyyatiTGQFGLVK 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1087 SLLEQETENLRTEIsKLNQKIHDNNESYQVGLSELRAlmtiEKDQCISELISrheeesnilkaelDNVTSLHRQAYE-IE 1165
Cdd:pfam15818  155 ENHGKLEQNVQEAI-QLNKRLSALNKKQESEICSLKK----ELKKVTSDLIK-------------SKVTCQYKMGEEnIN 216
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1166 KKLKEQ-IVELQTRLNSELsALEKQKDEKITQ-QEEKYEALI--QNLEKdkerLVKNHEQDKEHLIQELNFEKNKAvQTA 1241
Cdd:pfam15818  217 LTIKEQkFQELQERLNMEL-ELNKKINEEITHiQEEKQDIIIsfQHMQQ----LLQQQTQANTEMEAELKALKENN-QTL 290
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1242 LDEFKVERELVeKELLEKVKHLENQ------IAKTPAFESARE-----DSSSLVAELQEKLQEEKAKFLEQLEEQEKRKN 1310
Cdd:pfam15818  291 ERDNELQREKV-KENEEKFLNLQNEhekalgTWKKHVEELNGEineikNELSSLKETHIKLQEHYNKLCNQKKFEEDKKF 369
                          410       420       430
                   ....*....|....*....|....*....|...
gi 226693374  1311 EEMQNVrtsliaeqqTNFNTVLTREKMrkENII 1343
Cdd:pfam15818  370 QNVPEV---------NNENSEMSTEKS--ENLI 391
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
926-1370 2.44e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 49.20  E-value: 2.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   926 QQEKDQKLLEMEKIMHTQHCEIKELKQSREMALE--DLKKLHDEKIESLRAEFQCLEQnhlkeledtlhirHTQEFEKVM 1003
Cdd:TIGR00618  151 QGEFAQFLKAKSKEKKELLMNLFPLDQYTQLALMefAKKKSLHGKAELLTLRSQLLTL-------------CTPCMPDTY 217
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1004 TDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKvSDLSDMRCKLEvELALKEAETDEikilLEESRTQQKE 1083
Cdd:TIGR00618  218 HERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQ-QLLKQLRARIE-ELRAQEAVLEE----TQERINRARK 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1084 MLKSLLEQEtenlrtEISKLNQKIHDNNESYQVGLSEL-RALMTIEKDQCISELISRHEEESNILKAELD------NVTS 1156
Cdd:TIGR00618  292 AAPLAAHIK------AVTQIEQQAQRIHTELQSKMRSRaKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIhirdahEVAT 365
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1157 LHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNK 1236
Cdd:TIGR00618  366 SIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAA 445
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1237 AVQTALDEFKVeRELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAkfleQLEEQEKRKNEEMQNv 1316
Cdd:TIGR00618  446 AITCTAQCEKL-EKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPC----PLCGSCIHPNPARQD- 519
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 226693374  1317 rtSLIAEQQTNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSED 1370
Cdd:TIGR00618  520 --IDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQS 571
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
858-1210 2.48e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 49.25  E-value: 2.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   858 LKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEME 937
Cdd:TIGR04523  423 LLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLN 502
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   938 KIMHTQHCEIKELKQSREMALEDLKKLHDEKIEslraefqclEQNHLKELEDtlhirhtqEFEKVMTDHNMS-LEKLKKE 1016
Cdd:TIGR04523  503 EEKKELEEKVKDLTKKISSLKEKIEKLESEKKE---------KESKISDLED--------ELNKDDFELKKEnLEKEIDE 565
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1017 NQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTqqkemlkslLEQETENL 1096
Cdd:TIGR04523  566 KNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEK---------LSSIIKNI 636
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1097 RTEISKLNQKIhdnnESYQVGLSELRALMTiEKDQCISELISRHEEESNILKaELDNVTSLHRQAY---EIEKKLKEQIV 1173
Cdd:TIGR04523  637 KSKKNKLKQEV----KQIKETIKEIRNKWP-EIIKKIKESKTKIDDIIELMK-DWLKELSLHYKKYitrMIRIKDLPKLE 710
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 226693374  1174 ELQTRLNSELSALEKQKDEkitqqeekYEALIQNLEK 1210
Cdd:TIGR04523  711 EKYKEIEKELKKLDEFSKE--------LENIIKNFNK 739
DUF4686 pfam15742
Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins ...
1025-1307 2.58e-05

Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins in this family are typically between 498 and 775 amino acids in length. There is a conserved DLK sequence motif.


Pssm-ID: 464838 [Multi-domain]  Cd Length: 384  Bit Score: 48.52  E-value: 2.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1025 LESHASTIQE----KEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKilLEESRTQQKEMLKSLLEQETENLRTEI 1100
Cdd:pfam15742   46 LKQHNSLLQEenikIKAELKQAQQKLLDSTKMCSSLTAEWKHCQQKIRELE--LEVLKQAQSIKSQNSLQEKLAQEKSRV 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1101 SKLNQKIHDN----NESYQVGLSELRALMT-------IEKDQCISELISRHEEESNILKAELDNVTSLHRQA---YEIEK 1166
Cdd:pfam15742  124 ADAEEKILELqqklEHAHKVCLTDTCILEKkqleeriKEASENEAKLKQQYQEEQQKRKLLDQNVNELQQQVrslQDKEA 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1167 KLKEQIVELQTRLNSELSALEKQKDEKITQQEE-----KYEALIQNLEKDKERLVKNHEQDKEHLIQEL-------NFEK 1234
Cdd:pfam15742  204 QLEMTNSQQQLRIQQQEAQLKQLENEKRKSDEHlksnqELSEKLSSLQQEKEALQEELQQVLKQLDVHVrkynekhHHHK 283
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374  1235 NKaVQTALDEFKVERELVEkellEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEK 1307
Cdd:pfam15742  284 AK-LRRAKDRLVHEVEQRD----ERIKQLENEIGILQQQSEKEKAFQKQVTAQNEILLLEKRKLLEQLTEQEE 351
CCDC73 pfam15818
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ...
885-1311 3.75e-05

Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.


Pssm-ID: 464893 [Multi-domain]  Cd Length: 1048  Bit Score: 48.79  E-value: 3.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   885 EENVNKILKLKENLVSLEEALQNKDNefTSIKHEKDAIVCVQQEKDQKL--LEMEKIMHTQHCEIKElkQSREMALEDLK 962
Cdd:pfam15818   27 EEQIGKIIVETQELKWQKETLQNQKE--TLAKQHKEAMAVFKKQLQMKMcaLEEEKGKYQLATEIKE--KEIEGLKETLK 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   963 KLHDEKIeSLraefqcleQNHLKELEDTLHIRHTQEfekvmTDHNMSLEKLKKENQQRIDQ---MLESHASTiqekEQQL 1039
Cdd:pfam15818  103 ALQVSKY-SL--------QKKVSEMEQKLQLHLLAK-----EDHHKQLNEIEKYYATITGQfglVKENHGKL----EQNV 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1040 QELKLKVSDLSDMRCKLEVELA-----LKEAETDEIKilleESRTQQKEMlksllEQETENLRTEisklNQKIHDNNEsy 1114
Cdd:pfam15818  165 QEAIQLNKRLSALNKKQESEICslkkeLKKVTSDLIK----SKVTCQYKM-----GEENINLTIK----EQKFQELQE-- 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1115 qvglselRALMTIEKDQCISELISRHEEEsnilkaELDNVTSL-HRQayeiekKLKEQIVELQTRLNSELSALEKQ---- 1189
Cdd:pfam15818  230 -------RLNMELELNKKINEEITHIQEE------KQDIIISFqHMQ------QLLQQQTQANTEMEAELKALKENnqtl 290
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1190 ------KDEKITQQEEKYealiQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVER---ELVEKELLEKV 1260
Cdd:pfam15818  291 erdnelQREKVKENEEKF----LNLQNEHEKALGTWKKHVEELNGEINEIKNELSSLKETHIKLQEhynKLCNQKKFEED 366
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 226693374  1261 KHLEN-QIAKTPAFESAREDSSSLVAE----LQEKLQEEKAKFLEQLEEQEKRKNE 1311
Cdd:pfam15818  367 KKFQNvPEVNNENSEMSTEKSENLIIQkynsEQEIREENTKSFCSDTEYRETEKKK 422
PLN02939 PLN02939
transferase, transferring glycosyl groups
854-1221 4.01e-05

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 48.36  E-value: 4.01e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  854 LEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEftsikhekdaivcvQQEKDQKL 933
Cdd:PLN02939   65 LQSNTDENGQLENTSLRTVMELPQKSTSSDDDHNRASMQRDEAIAAIDNEQQTNSKDG--------------EQLSDFQL 130
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  934 LEMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKiESLRAEFQCLEQNhlkeLEDTLHIRHTQEFEKVMTDHNMS-LEK 1012
Cdd:PLN02939  131 EDLVGMIQNAEKNILLLNQARLQALEDLEKILTEK-EALQGKINILEMR----LSETDARIKLAAQEKIHVEILEEqLEK 205
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1013 LKKENQQRIdqmlESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELaLKEAETDEIKILLEESRTQQKEMLKSlLEQE 1092
Cdd:PLN02939  206 LRNELLIRG----ATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAEL-IEVAETEERVFKLEKERSLLDASLRE-LESK 279
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1093 TENLRTEISKlnqkihdnnesyqvgLSELRALMTIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQI 1172
Cdd:PLN02939  280 FIVAQEDVSK---------------LSPLQYDCWWEKVENLQDLLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKEAN 344
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1173 VelqTRLNSELSALEKQK-----------DEKITQQEEKYEALIQNLEKDKERLVKNHEQ 1221
Cdd:PLN02939  345 V---SKFSSYKVELLQQKlklleerlqasDHEIHSYIQLYQESIKEFQDTLSKLKEESKK 401
PTZ00121 PTZ00121
MAEBL; Provisional
859-1469 4.13e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.60  E-value: 4.13e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  859 KEKHQQELQSLKIEYECKLDALVKDSEEnVNKILKLK--ENLVSLEEALQNKDNEFTSIKHEKDAivCVQQEKDQKLLEM 936
Cdd:PTZ00121 1099 KAEEAKKTETGKAEEARKAEEAKKKAED-ARKAEEARkaEDARKAEEARKAEDAKRVEIARKAED--ARKAEEARKAEDA 1175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  937 EKIMHTQHCEikELKQSREmaledLKKLHD-EKIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKK 1015
Cdd:PTZ00121 1176 KKAEAARKAE--EVRKAEE-----LRKAEDaRKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEE 1248
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1016 ENQQRIDQMLESHASTIQEKEQQLQ-ELKLKVSDLSDMRCKLEVELALKEAE---TDEIKILLEESRTQQKemlkslLEQ 1091
Cdd:PTZ00121 1249 RNNEEIRKFEEARMAHFARRQAAIKaEEARKADELKKAEEKKKADEAKKAEEkkkADEAKKKAEEAKKADE------AKK 1322
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1092 ETENLRTEISKLNQKIHDNNESYQVGLSELRALM-TIEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKE 1170
Cdd:PTZ00121 1323 KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAdEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE 1402
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1171 QivelqtrlNSELSALEKQKDEKITQQEEKYEAliQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAvqtalDEFKVERE 1250
Cdd:PTZ00121 1403 D--------KKKADELKKAAAAKKKADEAKKKA--EEKKKADEAKKKAEEAKKADEAKKKAEEAKKA-----EEAKKKAE 1467
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1251 lvEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKlQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNT 1330
Cdd:PTZ00121 1468 --EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA-AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEE 1544
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1331 VLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARllEEKKQLEEEVSKLRTSSFLSSAPVAAAPELYGACA 1410
Cdd:PTZ00121 1545 KKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK--KAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA 1622
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 226693374 1411 PELPGEPErsvmETADEGRLDSAMETSMMSVQENMLSEEKQRIMLLErtLQLKEEENKR 1469
Cdd:PTZ00121 1623 EELKKAEE----EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAE--EAKKAEEDKK 1675
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
1122-1261 4.45e-05

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 48.28  E-value: 4.45e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1122 RALMTIEKDQcISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKY 1201
Cdd:PRK00409  508 KKLIGEDKEK-LNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEA 586
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 226693374 1202 EALIQNLekdKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEK-ELLEKVK 1261
Cdd:PRK00409  587 DEIIKEL---RQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEElKVGDEVK 644
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
831-1110 5.11e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 48.09  E-value: 5.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   831 DLANYLKCTAVEIRNIIEKVK-CSLEITLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKD 909
Cdd:TIGR04523  465 SLETQLKVLSRSINKIKQNLEqKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLE 544
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   910 NEFTSIKHE--KDAIVCVQQEKDQKLLEMekimhtqHCEIKELKQSREMALEDLKKLHDEKIEsLRAEfqcLEQNHLKEL 987
Cdd:TIGR04523  545 DELNKDDFElkKENLEKEIDEKNKEIEEL-------KQTQKSLKKKQEEKQELIDQKEKEKKD-LIKE---IEEKEKKIS 613
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   988 EDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEK---EQQLQELKLKVSDLSDMRCKLEVELAL-- 1062
Cdd:TIGR04523  614 SLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKWpeiIKKIKESKTKIDDIIELMKDWLKELSLhy 693
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 226693374  1063 KEAETDEIKI----LLEESRTQQKEMLKSLLEQETEnLRTEISKLNQKIHDN 1110
Cdd:TIGR04523  694 KKYITRMIRIkdlpKLEEKYKEIEKELKKLDEFSKE-LENIIKNFNKKFDDA 744
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
957-1373 5.82e-05

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 48.11  E-value: 5.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  957 ALEDLKKLHDEKIESLRAEFQCLEQnhlKELEDTLHirhtqEFEKVMTDHNMSLEKLKkENQQRIDQMLESHASTIQEKE 1036
Cdd:PRK02224  177 GVERVLSDQRGSLDQLKAQIEEKEE---KDLHERLN-----GLESELAELDEEIERYE-EQREQARETRDEADEVLEEHE 247
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1037 QQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKilleesrtQQKEMLKSLLEQETENL-RTEISKLNQKIhdnnesyq 1115
Cdd:PRK02224  248 ERREELETLEAEIEDLRETIAETEREREELAEEVR--------DLRERLEELEEERDDLLaEAGLDDADAEA-------- 311
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1116 vgLSELRALMTIEKDQCISEL------ISRHEEESNILKAELDNV----TSLHRQAYEIEKKLKEQIVELQTRlNSELSA 1185
Cdd:PRK02224  312 --VEARREELEDRDEELRDRLeecrvaAQAHNEEAESLREDADDLeeraEELREEAAELESELEEAREAVEDR-REEIEE 388
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1186 LEKQKDE------KITQQEEKYEALIQNLEKDKERLvKNHEQDKEHLIQELNFEKNKAvQTALDEFK---VERELVEKEL 1256
Cdd:PRK02224  389 LEEEIEElrerfgDAPVDLGNAEDFLEELREERDEL-REREAELEATLRTARERVEEA-EALLEAGKcpeCGQPVEGSPH 466
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1257 LEKVKHLENQIAKtpaFESAREDSSSLVAELQEKLQEekakfLEQLEEQEKRKNEEMQNVRTS--LIAEQQTNFN-TVLT 1333
Cdd:PRK02224  467 VETIEEDRERVEE---LEAELEDLEEEVEEVEERLER-----AEDLVEAEDRIERLEERREDLeeLIAERRETIEeKRER 538
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|
gi 226693374 1334 REKMRKEniindlSDKLKSTMQQQERDKDLIESLSEDRAR 1373
Cdd:PRK02224  539 AEELRER------AAELEAEAEEKREAAAEAEEEAEEARE 572
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
854-1373 6.07e-05

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 47.73  E-value: 6.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  854 LEITLKEKHQQELQSLKIEYECKL---DALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKD 930
Cdd:PRK02224  192 LKAQIEEKEEKDLHERLNGLESELaelDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRETIAETE 271
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  931 QKLLEMEKIMHTQHCEIKELKQSREMALED--LKKLHDEKIESLRAEFQcleqNHLKELEDTLhirhtQEFEKVMTDHNM 1008
Cdd:PRK02224  272 REREELAEEVRDLRERLEELEEERDDLLAEagLDDADAEAVEARREELE----DRDEELRDRL-----EECRVAAQAHNE 342
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKenqqRIDQmLESHASTIQEK----EQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESrtqqkEM 1084
Cdd:PRK02224  343 EAESLRE----DADD-LEERAEELREEaaelESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNA-----ED 412
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1085 LKSLLEQETENLRTEISKLNQKIhdnnESYQVGLSELRALMTIEK---------DQCISELISRHEEESNILKAELDnvt 1155
Cdd:PRK02224  413 FLEELREERDELREREAELEATL----RTARERVEEAEALLEAGKcpecgqpveGSPHVETIEEDRERVEELEAELE--- 485
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1156 SLHRQAYEIEKKLK--EQIVELQTRLNSelsALEKQKD--EKITQQEEKyealiqnLEKDKERLVKNHEQDKEHliQELN 1231
Cdd:PRK02224  486 DLEEEVEEVEERLEraEDLVEAEDRIER---LEERREDleELIAERRET-------IEEKRERAEELRERAAEL--EAEA 553
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1232 FEKNKAVQTALDEFKVERELVeKELLEKVKHLENQIaktpafESAR--EDSSSLVAELQEKLQE--EKAKFLEQLEEQEK 1307
Cdd:PRK02224  554 EEKREAAAEAEEEAEEAREEV-AELNSKLAELKERI------ESLEriRTLLAAIADAEDEIERlrEKREALAELNDERR 626
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 226693374 1308 RKNEEMQNVRTSLIAEQQTNFNTVLTREKMRKENIINDLSDKLkstmqqqerdkdliESLSEDRAR 1373
Cdd:PRK02224  627 ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKL--------------DELREERDD 678
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
946-1370 7.03e-05

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 47.90  E-value: 7.03e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  946 EIKELKQSREMALEDLKKLHDEKIESLRAEFqcleqNHLKELEDTLH-IRHTQEFEKVMTDHNMSLEKLKKENQQRIDQM 1024
Cdd:PTZ00440  429 KIADYALYSNLEIIEIKKKYDEKINELKKSI-----NQLKTLISIMKsFYDLIISEKDSMDSKEKKESSDSNYQEKVDEL 503
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1025 L------ESHASTIQEKEQQLQELKLKVSDLsdmrcKLEVELALKEAETDEIKILLEESRTQQKEMLKSLLEQETENLRT 1098
Cdd:PTZ00440  504 LqiinsiKEKNNIVNNNFKNIEDYYITIEGL-----KNEIEGLIELIKYYLQSIETLIKDEKLKRSMKNDIKNKIKYIEE 578
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1099 EISKLNQKIHDNNESYQV--GLSELRALMTIEKDQCISELISRHEEESNI--------LKAELDNVTSL--HRQAYEIEK 1166
Cdd:PTZ00440  579 NVDHIKDIISLNDEIDNIiqQIEELINEALFNKEKFINEKNDLQEKVKYIlnkfykgdLQELLDELSHFldDHKYLYHEA 658
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1167 KLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVK-NHEQDKEHLIQELN--FEKNKAVQTALD 1243
Cdd:PTZ00440  659 KSKEDLQTLLNTSKNEYEKLEFMKSDNIDNIIKNLKKELQNLLSLKENIIKkQLNNIEQDISNSLNqyTIKYNDLKSSIE 738
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1244 EFKverelVEKELLEKVKHLENQIAKTpafesaredsssLVAELQEklQEEKakfleqLEEQEKRKNEEMQNVRTSLIAE 1323
Cdd:PTZ00440  739 EYK-----EEEEKLEVYKHQIINRKNE------------FILHLYE--NDKD------LPDGKNTYEEFLQYKDTILNKE 793
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1324 QQTNFNTVLTRE--KMRKENI--INDLSDKLKS-TMQQQERDKDLIESLSED 1370
Cdd:PTZ00440  794 NKISNDINILKEnkKNNQDLLnsYNILIQKLEAhTEKNDEELKQLLQKFPTE 845
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1166-1311 7.15e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.07  E-value: 7.15e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1166 KKLKEQIVELQTRLNSELSALEKQKDEkITQQEEKYEALIQNLEKDKERL--VKNHEqdkehliqELnfeknKAVQTALD 1243
Cdd:COG1579    34 AELEDELAALEARLEAAKTELEDLEKE-IKRLELEIEEVEARIKKYEEQLgnVRNNK--------EY-----EALQKEIE 99
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374 1244 EFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNE 1311
Cdd:COG1579   100 SLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREE 167
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1164-1361 8.06e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 47.45  E-value: 8.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1164 IEKKLKEQIVEL------QTRLN-SELSALEKQKDEkITQQEEKYEALIQNLEKDKERLvKNHEQDKEHLIQELNF-EKN 1235
Cdd:COG4717    47 LLERLEKEADELfkpqgrKPELNlKELKELEEELKE-AEEKEEEYAELQEELEELEEEL-EELEAELEELREELEKlEKL 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1236 KAVQTALDEFK-VERELveKELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQ 1314
Cdd:COG4717   125 LQLLPLYQELEaLEAEL--AELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELE 202
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 226693374 1315 NVRTSlIAEQQTNFNTvLTREKMRKENIINDLSDKLKSTMQQQERDK 1361
Cdd:COG4717   203 ELQQR-LAELEEELEE-AQEELEELEEELEQLENELEAAALEERLKE 247
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
927-1416 9.41e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 47.35  E-value: 9.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   927 QEKDQKLLEMEKIMHTQH----CEIKELKQSREMALEDLKKL---HDEKIESLRAEFQCLEQNHLK--ELEDTlhIRHTQ 997
Cdd:TIGR00606  198 QGQKVQEHQMELKYLKQYkekaCEIRDQITSKEAQLESSREIvksYENELDPLKNRLKEIEHNLSKimKLDNE--IKALK 275
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   998 EFEKVMTDHNMSLEKLKKENQQRIDQML----ESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELAlkEAETDEIKIL 1073
Cdd:TIGR00606  276 SRKKQMEKDNSELELKMEKVFQGTDEQLndlyHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKT--ELLVEQGRLQ 353
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1074 LEESRTQQKEMLKSLLEQETEnLRTEISKLnqkihdnnesyqvglsELRALMTIEKDQCISELISRHEEESNILKAELDN 1153
Cdd:TIGR00606  354 LQADRHQEHIRARDSLIQSLA-TRLELDGF----------------ERGPFSERQIKNFHTLVIERQEDEAKTAAQLCAD 416
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1154 VTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEkyealIQNLEKDKERLVKNHEQDKEHLIQELNFE 1233
Cdd:TIGR00606  417 LQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKE-----LQQLEGSSDRILELDQELRKAERELSKAE 491
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1234 KNKAVQTALDEFKVERElvEKELLEKVKHLENQiaktpafESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEM 1313
Cdd:TIGR00606  492 KNSLTETLKKEVKSLQN--EKADLDRKLRKLDQ-------EMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDEL 562
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1314 QNVRTSLIAEQQ-------TNFNTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVS 1386
Cdd:TIGR00606  563 TSLLGYFPNKKQledwlhsKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDEESDLE 642
                          490       500       510
                   ....*....|....*....|....*....|...
gi 226693374  1387 KLRTSSFLSS---APVAAAPELYGACAPELPGE 1416
Cdd:TIGR00606  643 RLKEEIEKSSkqrAMLAGATAVYSQFITQLTDE 675
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1019-1179 9.83e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 45.69  E-value: 9.83e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1019 QRIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKEMLKS----LLEQETE 1094
Cdd:COG1579    20 DRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNkeyeALQKEIE 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1095 NLRTEISKLNQKIHDNN---ESYQVGLSELRALMTIEKDQcISELISRHEEESNILKAELDNVTSlhrQAYEIEKKLKEQ 1171
Cdd:COG1579   100 SLKRRISDLEDEILELMeriEELEEELAELEAELAELEAE-LEEKKAELDEELAELEAELEELEA---EREELAAKIPPE 175

                  ....*...
gi 226693374 1172 IVELQTRL 1179
Cdd:COG1579   176 LLALYERI 183
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
845-1349 1.05e-04

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 47.35  E-value: 1.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   845 NIIEKVKCSLEITLKEKHQQELQSLKIEYECKldalvKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIK------HE 918
Cdd:TIGR01612  533 NIKAKLYKEIEAGLKESYELAKNWKKLIHEIK-----KELEEENEDSIHLEKEIKDLFDKYLEIDDEIIYINklklelKE 607
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   919 KDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAEFQCLEQNHLKELEDTLHiRHTQE 998
Cdd:TIGR01612  608 KIKNISDKNEYIKKAIDLKKIIENNNAYIDELAKISPYQVPEHLKNKDKIYSTIKSELSKIYEDDIDALYNELS-SIVKE 686
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   999 FEKVMTDHNMSLEKLKKENQQRIDQM-------LESHASTIQEKEQQLqelklkVSDLSDMRCKLEVELalkeaeTDEIK 1071
Cdd:TIGR01612  687 NAIDNTEDKAKLDDLKSKIDKEYDKIqnmetatVELHLSNIENKKNEL------LDIIVEIKKHIHGEI------NKDLN 754
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1072 ILLEESRTQQKEMLKSLLEQETENlrTEISKLNQKI-----HDNNESYQVGLSELRALMTIEKDQCISELISRHEEES-- 1144
Cdd:TIGR01612  755 KILEDFKNKEKELSNKINDYAKEK--DELNKYKSKIseiknHYNDQINIDNIKDEDAKQNYDKSKEYIKTISIKEDEIfk 832
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1145 --NILKAELDNVTSLHRQAYEIEKKLKEQI-------VELQTRLNSELSalekqkDEKITQQEEKYealiqnleKDKERL 1215
Cdd:TIGR01612  833 iiNEMKFMKDDFLNKVDKFINFENNCKEKIdseheqfAELTNKIKAEIS------DDKLNDYEKKF--------NDSKSL 898
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1216 VKNHEQDKEHLIQELNFEKN--------KAVQTALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSsslvaeL 1287
Cdd:TIGR01612  899 INEINKSIEEEYQNINTLKKvdeyikicENTKESIEKFHNKQNILKEILNKNIDTIKESNLIEKSYKDKFDNT------L 972
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374  1288 QEKLQEEKAKFLE-QLEEQEKRKNEEMQNVrTSLIAEQQTNFNTVLTREKMRKENIINDLSDK 1349
Cdd:TIGR01612  973 IDKINELDKAFKDaSLNDYEAKNNELIKYF-NDLKANLGKNKENMLYHQFDEKEKATNDIEQK 1034
PRK01156 PRK01156
chromosome segregation protein; Provisional
876-1160 1.17e-04

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 46.82  E-value: 1.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  876 KLDALVKDSEENVNKILKLKENLVSLE-EALQNKDNEFTSIKHE-KDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQS 953
Cdd:PRK01156  487 EIEIEVKDIDEKIVDLKKRKEYLESEEiNKSINEYNKIESARADlEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSK 566
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  954 REMALEDLKKLHDEKIESLRAEFQCLEQnHLKELEDTLHirhtqEFEKVMTDHNMSLEKLKKENQQRIDqMLESHASTIQ 1033
Cdd:PRK01156  567 RTSWLNALAVISLIDIETNRSRSNEIKK-QLNDLESRLQ-----EIEIGFPDDKSYIDKSIREIENEAN-NLNNKYNEIQ 639
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1034 EKEQQLQELKLKVSDLSDMRCKL-EVELALKE--AETDEIKILLEESRTQQKEMLKSLLEQET--ENLRTEISKLNQKIH 1108
Cdd:PRK01156  640 ENKILIEKLRGKIDNYKKQIAEIdSIIPDLKEitSRINDIEDNLKKSRKALDDAKANRARLEStiEILRTRINELSDRIN 719
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 226693374 1109 DNNESYQVGLSELRALMTIEKdqcISELISRHEEESNILKAELDNVTSLHRQ 1160
Cdd:PRK01156  720 DINETLESMKKIKKAIGDLKR---LREAFDKSGVPAMIRKSASQAMTSLTRK 768
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1069-1474 1.43e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 46.57  E-value: 1.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1069 EIKILLEESRTQQKEMLKSLLEQET-ENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNIL 1147
Cdd:PRK02224  139 EVNKLINATPSDRQDMIDDLLQLGKlEEYRERASDARLGVERVLSDQRGSLDQLKAQIEEKEEKDLHERLNGLESELAEL 218
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1148 KAELDNVTSLHRQAYEIEKKLKEQIVELQTRLnSELSALEKQKD---EKITQQEEKYEALIQNLEKDKERLvKNHEQDKE 1224
Cdd:PRK02224  219 DEEIERYEEQREQARETRDEADEVLEEHEERR-EELETLEAEIEdlrETIAETEREREELAEEVRDLRERL-EELEEERD 296
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1225 HLIQELNFEK--NKAVQTALDEFKVERELVEKELLEK---VKHLENQIaktpafESAREDSSSLVAELQEKlqEEKAKFL 1299
Cdd:PRK02224  297 DLLAEAGLDDadAEAVEARREELEDRDEELRDRLEECrvaAQAHNEEA------ESLREDADDLEERAEEL--REEAAEL 368
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1300 EqleeqekrknEEMQNVRTSlIAEQQTnfntvlTREKMRKEniINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKK 1379
Cdd:PRK02224  369 E----------SELEEAREA-VEDRRE------EIEELEEE--IEELRERFGDAPVDLGNAEDFLEELREERDELREREA 429
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1380 QLEEEVSKLRTSsflssapVAAAPELYGA-----CAPELPGEPERSVMETADEGRLDSAMETSMMSVQENMLSEEKQRIM 1454
Cdd:PRK02224  430 ELEATLRTARER-------VEEAEALLEAgkcpeCGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAE 502
                         410       420
                  ....*....|....*....|...
gi 226693374 1455 LL---ERTLQLKEEENKRLNQRL 1474
Cdd:PRK02224  503 DLveaEDRIERLEERREDLEELI 525
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
837-1239 1.50e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 46.49  E-value: 1.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  837 KCTAVEIRNIIEKVKCSLEITLKEKHQQEL-QSLKIEYECKLDALVKDSEENVNKILKLKE-NLVSLEEALQNKDNEFTS 914
Cdd:COG5185   143 LDEIADIEASYGEVETGIIKDIFGKLTQELnQNLKKLEIFGLTLGLLKGISELKKAEPSGTvNSIKESETGNLGSESTLL 222
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  915 IKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKE-LKQSREMALEDL-------KKLhDEKIESLRAEFQCLEQNhLKE 986
Cdd:COG5185   223 EKAKEIINIEEALKGFQDPESELEDLAQTSDKLEKlVEQNTDLRLEKLgenaessKRL-NENANNLIKQFENTKEK-IAE 300
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  987 LEDTLHIR-HTQEFEKVM--TDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSDmrcklEVELALK 1063
Cdd:COG5185   301 YTKSIDIKkATESLEEQLaaAEAEQELEESKRETETGIQNLTAEIEQGQESLTENLEAIKEEIENIVG-----EVELSKS 375
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1064 EAETDEIKILLEESRT-----------QQKEMLKSL------LEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMT 1126
Cdd:COG5185   376 SEELDSFKDTIESTKEsldeipqnqrgYAQEILATLedtlkaADRQIEELQRQIEQATSSNEEVSKLLNELISELNKVMR 455
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1127 IEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKI-------TQQEE 1199
Cdd:COG5185   456 EADEESQSRLEEAYDEINRSVRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQLEGVRSKLDQVAeslkdfmRARGY 535
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|
gi 226693374 1200 KYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQ 1239
Cdd:COG5185   536 AHILALENLIPASELIQASNAKTDGQAANLRTAVIDELTQ 575
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
1009-1255 1.63e-04

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 46.17  E-value: 1.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1009 SLEKLKKENQQRIDQMleshastiqekEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQ--KEMLK 1086
Cdd:pfam05667  325 TEEELQQQREEELEEL-----------QEQLEDLESSIQELEKEIKKLESSIKQVEEELEELKEQNEELEKQYkvKKKTL 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1087 SLLEQETENlrteISKLNQKIHDNNESyqvgLSELRALMtiEKDQciSELISRHEEesniLKAELDNvtslhrQAYEIEK 1166
Cdd:pfam05667  394 DLLPDAEEN----IAKLQALVDASAQR----LVELAGQW--EKHR--VPLIEEYRA----LKEAKSN------KEDESQR 451
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1167 KLkEQIVELQTRLNsELSALEKQKDEKITQQEEKYEAL----------------IQNLEKDKERLVKNhEQDKEHLIQEL 1230
Cdd:pfam05667  452 KL-EEIKELREKIK-EVAEEAKQKEELYKQLVAEYERLpkdvsrsaytrrileiVKNIKKQKEEITKI-LSDTKSLQKEI 528
                          250       260
                   ....*....|....*....|....*
gi 226693374  1231 NFEKNKAVQTaldeFKVERELVEKE 1255
Cdd:pfam05667  529 NSLTGKLDRT----FTVTDELVFKD 549
46 PHA02562
endonuclease subunit; Provisional
993-1217 1.70e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 46.16  E-value: 1.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  993 IRHTQEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLSdmRCKLEVELALKEAETDEIKI 1072
Cdd:PHA02562  190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLV--MDIEDPSAALNKLNTAAAKI 267
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1073 lleESRTQQKEMLKSLLEQE------TENLRTEISKLnQKIHDNNESYQVGLSELralmtiekDQCISELISRHEEESNI 1146
Cdd:PHA02562  268 ---KSKIEQFQKVIKMYEKGgvcptcTQQISEGPDRI-TKIKDKLKELQHSLEKL--------DTAIDELEEIMDEFNEQ 335
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1147 LKA--ELDNVTSLHRQAYEIE----KKLKEQIVELQ---TRLNSELSALEKQKDEKITQQ----EEKYE-ALIQNLEKD- 1211
Cdd:PHA02562  336 SKKllELKNKISTNKQSLITLvdkaKKVKAAIEELQaefVDNAEELAKLQDELDKIVKTKselvKEKYHrGIVTDLLKDs 415

                  ....*...
gi 226693374 1212 --KERLVK 1217
Cdd:PHA02562  416 giKASIIK 423
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
806-1369 1.70e-04

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 46.58  E-value: 1.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   806 DSHSSIQTIKDDLCHFRTFVQKEQCDLANyLKCTAVEIRNIIEKVKCSLEITLKEK-------HQQELQSLKIEYECKLD 878
Cdd:TIGR01612 1650 DNLNSLQEFLESLKDQKKNIEDKKKELDE-LDSEIEKIEIDVDQHKKNYEIGIIEKikeiaiaNKEEIESIKELIEPTIE 1728
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   879 ALVKDSEENVNKILKLKENLVSLEEALQNKDNEFtsIKHEKDAIVCVqqekdqKLLEMEKIMHTQHCEIKELKQSREMAL 958
Cdd:TIGR01612 1729 NLISSFNTNDLEGIDPNEKLEEYNTEIGDIYEEF--IELYNIIAGCL------ETVSKEPITYDEIKNTRINAQNEFLKI 1800
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   959 EDLKKLHDEKIESLRA-EFQCLeQNHLKELEDTLHIRHTQEFEKV----------------MTDHNMSLEKLKKENQQRI 1021
Cdd:TIGR01612 1801 IEIEKKSKSYLDDIEAkEFDRI-INHFKKKLDHVNDKFTKEYSKInegfddisksienvknSTDENLLFDILNKTKDAYA 1879
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1022 DQMLESHASTIQEKEQQLQEL-KLKVS-----------DLSDmRCKLEVELALKEAETDEIKILLE-----ESRTQQKEM 1084
Cdd:TIGR01612 1880 GIIGKKYYSYKDEAEKIFINIsKLANSiniqiqnnsgiDLFD-NINIAILSSLDSEKEDTLKFIPSpekepEIYTKIRDS 1958
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1085 LKSLL------------EQETENLRTEISKLNQKIHDNNESYQVgLSELRalmtIEKDQCISE--LISRHEEESNILKAE 1150
Cdd:TIGR01612 1959 YDTLLdifkksqdlhkkEQDTLNIIFENQQLYEKIQASNELKDT-LSDLK----YKKEKILNDvkLLLHKFDELNKLSCD 2033
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1151 LDNVTSLHRQAYEIEKKLKEQIVELQTR---LNSELSALEKQKDEKItQQEEKYEaliqNLEKDKERLVKNHEQDKEHLI 1227
Cdd:TIGR01612 2034 SQNYDTILELSKQDKIKEKIDNYEKEKEkfgIDFDVKAMEEKFDNDI-KDIEKFE----NNYKHSEKDNHDFSEEKDNII 2108
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1228 QElnfekNKAVQTALDEFKVERELVEKELLEKvKHLENQIAKTPAfESAREDSSSLVAELQEKLQEEkAKFLEQLEEQEK 1307
Cdd:TIGR01612 2109 QS-----KKKLKELTEAFNTEIKIIEDKIIEK-NDLIDKLIEMRK-ECLLFSYATLVETLKSKVINH-SEFITSAAKFSK 2180
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374  1308 RKNEEMQNVRTSL---IAEQQTNFNTVLTREKMrkENIINDLSDKLKSTMQQQERDKDLIESLSE 1369
Cdd:TIGR01612 2181 DFFEFIEDISDSLnddIDALQIKYNLNQTKKHM--ISILADATKDHNNLIEKEKEATKIINNLTE 2243
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
862-1364 1.78e-04

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 46.28  E-value: 1.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   862 HQQELQSLKIEYECKLDALVKDSEenvnkilKLKENLVSLEEALQNKDNEFTSIKHEKDaIVCVQQEKDQKLLEM----- 936
Cdd:pfam07111  149 HQEQLSSLTQAHEEALSSLTSKAE-------GLEKSLNSLETKRAGEAKQLAEAQKEAE-LLRKQLSKTQEELEAqvtlv 220
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   937 --------EKIMHTQHCEIKELKqsREMALEDLKKLHDEK------IESLRAEFQCLEQN-HLKELEDTLHIRHTQEFEK 1001
Cdd:pfam07111  221 eslrkyvgEQVPPEVHSQTWELE--RQELLDTMQHLQEDRadlqatVELLQVRVQSLTHMlALQEEELTRKIQPSDSLEP 298
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1002 VMTDHNMSL-----EKLKKENQQRIDQMLEsHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETD----EIKI 1072
Cdd:pfam07111  299 EFPKKCRSLlnrwrEKVFALMVQLKAQDLE-HRDSVKQLRGQVAELQEQVTSQSQEQAILQRALQDKAAEVEvermSAKG 377
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1073 L------LEESRTQQKEMLKSLLEQ----------ETENLRTEISKLNQKIH-----DNNESYQVglselRALMTIEKDQ 1131
Cdd:pfam07111  378 LqmelsrAQEARRRQQQQTASAEEQlkfvvnamssTQIWLETTMTRVEQAVAripslSNRLSYAV-----RKVHTIKGLM 452
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1132 CISELISRHEEESNILKAELDNVTSlhrqayEIEKKLkEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKD 1211
Cdd:pfam07111  453 ARKVALAQLRQESCPPPPPAPPVDA------DLSLEL-EQLREERNRLDAELQLSAHLIQQEVGRAREQGEAERQQLSEV 525
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1212 KERLVKNHEQDKEHLI---QELNFEKNKAVQTALDEFKVERELVEKELLEKvkhlenqiaktpafesaredssslvAELQ 1288
Cdd:pfam07111  526 AQQLEQELQRAQESLAsvgQQLEVARQGQQESTEEAASLRQELTQQQEIYG-------------------------QALQ 580
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1289 EKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTvlTREKMR-------KENIINDLSDKLKSTMQQQERDK 1361
Cdd:pfam07111  581 EKVAEVETRLREQLSDTKRRLNEARREQAKAVVSLRQIQHRA--TQEKERnqelrrlQDEARKEEGQRLARRVQELERDK 658

                   ...
gi 226693374  1362 DLI 1364
Cdd:pfam07111  659 NLM 661
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1127-1468 2.63e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 45.83  E-value: 2.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1127 IEKDQCISELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQtRLNSELSALEKQKdEKITQQEEKYEALIQ 1206
Cdd:PRK03918  185 IKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELE-ELKEEIEELEKEL-ESLEGSKRKLEEKIR 262
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1207 NLEKDKERLVKnHEQDKEHLIQELNFEKNKAVQ-TALDEFKVERELVEKELLEKVKHLENQIAKTPAFESAREDSSSLVA 1285
Cdd:PRK03918  263 ELEERIEELKK-EIEELEEKVKELKELKEKAEEyIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE 341
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1286 ELQEKLQEEKAKfLEQLEEQE------KRKNEEMQNVRtsliaeqqtnfntvltreKMRKENIINDLSDKLKSTMQQQER 1359
Cdd:PRK03918  342 ELKKKLKELEKR-LEELEERHelyeeaKAKKEELERLK------------------KRLTGLTPEKLEKELEELEKAKEE 402
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1360 DKDLIESLSEDRARLLEEKKQLEEEVSKLRTSSflSSAPVaaapelygaCAPELPGEPERSVME--TADEGRLDSAMETS 1437
Cdd:PRK03918  403 IEEEISKITARIGELKKEIKELKKAIEELKKAK--GKCPV---------CGRELTEEHRKELLEeyTAELKRIEKELKEI 471
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|
gi 226693374 1438 MMSVQ---------ENMLSEEKQRIMLLERTLQLKEEENK 1468
Cdd:PRK03918  472 EEKERklrkelrelEKVLKKESELIKLKELAEQLKELEEK 511
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1032-1308 3.13e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.68  E-value: 3.13e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1032 IQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTqqkemlksllEQETENLRTEISKLNQKIhdnn 1111
Cdd:COG4913   612 LAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWD----------EIDVASAEREIAELEAEL---- 677
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1112 ESYQVGLSELRALmtiekdqciselisrhEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKD 1191
Cdd:COG4913   678 ERLDASSDDLAAL----------------EEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAED 741
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1192 EKITQQEEKYEALIQNLEKDK--ERLVKNHEQDKEHLIQELNfEKNKAVQTALDEFKVERELVEKELlekvkhlenqiak 1269
Cdd:COG4913   742 LARLELRALLEERFAAALGDAveRELRENLEERIDALRARLN-RAEEELERAMRAFNREWPAETADL------------- 807
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....
gi 226693374 1270 TPAFESAREdssslVAELQEKLQEE-----KAKFLEQLEEQEKR 1308
Cdd:COG4913   808 DADLESLPE-----YLALLDRLEEDglpeyEERFKELLNENSIE 846
GOLGA2L5 pfam15070
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein ...
1010-1306 3.14e-04

Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein family remains unknown. This family of proteins is thought to be found in the Golgi apparatus of eukaryotes.


Pssm-ID: 464485 [Multi-domain]  Cd Length: 521  Bit Score: 45.44  E-value: 3.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1010 LEKLKKEN---QQRIDQMLESHASTIQEKEQ---QLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKE 1083
Cdd:pfam15070   17 AENLKEEGavwQQKMQQLSEQVRTLREEKERsvsQVQELETSLAELKNQAAVPPAEEEQPPAGPSEEEQRLQEEAEQLQK 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1084 MLKSLLEQETENLR--TEISKLNQKihdnnesYQVGLSEL---------------RALMTIEKDQC-ISELISRHEEESN 1145
Cdd:pfam15070   97 ELEALAGQLQAQVQdnEQLSRLNQE-------QEQRLLELeraaerwgeqaedrkQILEDMQSDRAtISRALSQNRELKE 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1146 ILkAELDN-----------VTSLHRQAYEIEKKLKEQIVELQTRL----------NSELSALEKQKDEKIT--QQeekYE 1202
Cdd:pfam15070  170 QL-AELQNgfvkltnenmeLTSALQSEQHVKKELAKKLGQLQEELgelketlelkSQEAQSLQEQRDQYLAhlQQ---YV 245
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1203 ALIQNLEKDKERLVKNHEQdKEHLIQELNFEKnkaVQTaldefKVERELVEKELLEKVKHLenqiaktpafESAREDSSS 1282
Cdd:pfam15070  246 AAYQQLASEKEELHKQYLL-QTQLMDRLQHEE---VQG-----KVAAEMARQELQETQERL----------EALTQQNQQ 306
                          330       340
                   ....*....|....*....|....
gi 226693374  1283 LVAELQEKLQEEKAKFLEQLEEQE 1306
Cdd:pfam15070  307 LQAQLSLLANPGEGDGLESEEEEE 330
Caldesmon pfam02029
Caldesmon;
827-1106 4.18e-04

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 44.86  E-value: 4.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   827 KEQCDLANYLKCTAVEIRNIIEKVKCSLEITLKEKHQQELQSLKIEYECKldalvkdSEENVNKILKLKENLVSLEEALQ 906
Cdd:pfam02029   80 QEALERQKEFDPTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRYK-------EEETEIREKEYQENKWSTEVRQA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   907 NKDNEftsiKHEKDAIVCVQQEKDQKLLEMEKIMHTQhceiKELKQSREM-ALEDLKKLHDE-KIESLRAEFQCLEQNHL 984
Cdd:pfam02029  153 EEEGE----EEEDKSEEAEEVPTENFAKEEVKDEKIK----KEKKVKYESkVFLDQKRGHPEvKSQNGEEEVTKLKVTTK 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   985 KELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLKvsdLSDMRCKLEVELALKE 1064
Cdd:pfam02029  225 RRQGGLSQSQEREEEAEVFLEAEQKLEELRRRRQEKESEEFEKLRQKQQEAELELEELKKK---REERRKLLEEEEQRRK 301
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 226693374  1065 AETDEIKILLEESRTQQKEmlkslleqETENLRTEISKLNQK 1106
Cdd:pfam02029  302 QEEAERKLREEEEKRRMKE--------EIERRRAEAAEKRQK 335
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
312-537 4.98e-04

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 44.30  E-value: 4.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   312 NVQDRPNDVESLVRKCFDSMSRLDPKIIQPFMLECHQTIAKLDNQNMKAikgLEDRLYALDQMIASCSRLVNEQKELAQG 391
Cdd:pfam04108  127 ELQAAQESLDSDLKRFDDDLRDLQKELESLSSPSESISLIPTLLKELES---LEEEMASLLESLTNHYDQCVTAVKLTEG 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   392 FLANqMRAENLKDASVLPDLCLSHANQLMIMLQNHRKLLDIKQKCTTAKQELANNLHV------RLKWCCFVMLHADQDG 465
Cdd:pfam04108  204 GRAE-MLEVLENDARELDDVVPELQDRLDEMENNYERLQKLLEQKNSLIDELLSALQLiaeiqsRLPEYLAALKEFEERW 282
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 226693374   466 EKLQALLRLVIELLERVRivEALSTVPQMYCLAVVEVVRRKMFIKHYREWAGALVKDGKQLYEAEKSKRESF 537
Cdd:pfam04108  283 EEEKETIEDYLSELEDLR--EFYEGFPSAYGSLLLEVERRREWAEKMKKILRKLAEELDRLQEEERKRREKF 352
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
934-1356 5.20e-04

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 44.46  E-value: 5.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   934 LEMEKImhtqhcEIKELKQSREmaLEDLKKLH-----DEKIESLRAEFQCLEQNHLKELEDTLHirhtqefekvmtDHNM 1008
Cdd:pfam06160   15 LEERKN------ELMNLPVQEE--LSKVKKLNltgetQEKFEEWRKKWDDIVTKSLPDIEELLF------------EAEE 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1009 SLEKLK----KENQQRIDQMLESHASTIQEKEQQLQELklkvsdlsdmrckLEVElalkEAETDEIKILLEESRTQQKEM 1084
Cdd:pfam06160   75 LNDKYRfkkaKKALDEIEELLDDIEEDIKQILEELDEL-------------LESE----EKNREEVEELKDKYRELRKTL 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1085 LK---------SLLEQETENLRTEISKLNQKIH--DNNESYQVgLSELRAlMTIEKDQCISELISRHEEESNILKAELDN 1153
Cdd:pfam06160  138 LAnrfsygpaiDELEKQLAEIEEEFSQFEELTEsgDYLEAREV-LEKLEE-ETDALEELMEDIPPLYEELKTELPDQLEE 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1154 VTSLHRQA----YEIE-KKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNL------EKDKERLVKNHEQD 1222
Cdd:pfam06160  216 LKEGYREMeeegYALEhLNVDKEIQQLEEQLEENLALLENLELDEAEEALEEIEERIDQLydllekEVDAKKYVEKNLPE 295
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1223 KEHLIQELNfEKNKAVQTALDEFKVERELVEKELlEKVKHLENQIA---------------KTPAFESAREDSSSLVAEL 1287
Cdd:pfam06160  296 IEDYLEHAE-EQNKELKEELERVQQSYTLNENEL-ERVRGLEKQLEelekrydeiverleeKEVAYSELQEELEEILEQL 373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1288 Q--EKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAeqqtnfntvlTREKMRKENI----------INDLSDKLKSTMQ 1355
Cdd:pfam06160  374 EeiEEEQEEFKESLQSLRKDELEAREKLDEFKLELRE----------IKRLVEKSNLpglpesyldyFFDVSDEIEDLAD 443

                   .
gi 226693374  1356 Q 1356
Cdd:pfam06160  444 E 444
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
874-1356 5.35e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 45.04  E-value: 5.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   874 ECKLDALVKDSEenvNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEKIMHTQHCEIKELKQS 953
Cdd:TIGR00606  690 EAELQEFISDLQ---SKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKND 766
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   954 REMALEDLKKL--HDEKIESLRAEFQCLEQNHLkELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQR------IDQML 1025
Cdd:TIGR00606  767 IEEQETLLGTImpEEESAKVCLTDVTIMERFQM-ELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKqheldtVVSKI 845
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1026 ESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAetdeikilleesrtqqkemlkslLEQETENLRTEISKLNQ 1105
Cdd:TIGR00606  846 ELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQ-----------------------FEEQLVELSTEVQSLIR 902
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1106 KIHDNNEsyqvglSELRALMTIEKDQC-ISELISRHEEESNILKAELDnvtslhrqayEIEKKLKEQIVELQTRLNSels 1184
Cdd:TIGR00606  903 EIKDAKE------QDSPLETFLEKDQQeKEELISSKETSNKKAQDKVN----------DIKEKVKNIHGYMKDIENK--- 963
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1185 aLEKQKDEKITQQEEKYEALIQNLEKDKERLvKNHEQDKEHLIQELNFEKNKA--VQTALDEFKVERELveKELLEKVKH 1262
Cdd:TIGR00606  964 -IQDGKDDYLKQKETELNTVNAQLEECEKHQ-EKINEDMRLMRQDIDTQKIQErwLQDNLTLRKRENEL--KEVEEELKQ 1039
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1263 LENQIAKTPAFESAREDSsslvaELQEKLQ---EEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREKMRK 1339
Cdd:TIGR00606 1040 HLKEMGQMQVLQMKQEHQ-----KLEENIDlikRNHVLALGRQKGYEKEIKHFKKELREPQFRDAEEKYREMMIVMRTTE 1114
                          490
                   ....*....|....*..
gi 226693374  1340 EnIINDLsDKLKSTMQQ 1356
Cdd:TIGR00606 1115 L-VNKDL-DIYYKTLDQ 1129
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
1055-1318 6.25e-04

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 44.53  E-value: 6.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1055 KLEVeLALKEAETDEIKILLEESRTQQKEMLKSLLEQETENLRTEISKLNQKIhDNNESYQVGLSELRALMTIEKDQCIS 1134
Cdd:PRK05771    8 KVLI-VTLKSYKDEVLEALHELGVVHIEDLKEELSNERLRKLRSLLTKLSEAL-DKLRSYLPKLNPLREEKKKVSVKSLE 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1135 ELISRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQ--TRLNSELSAL-------------EKQKDEKITQQEE 1199
Cdd:PRK05771   86 ELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEpwGNFDLDLSLLlgfkyvsvfvgtvPEDKLEELKLESD 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1200 KYEALIQNLEKDKERLV----KNHEQDKEHLIQELNFEKN-----KAVQTALDEFKVERELVEKELlekvKHLENQIakt 1270
Cdd:PRK05771  166 VENVEYISTDKGYVYVVvvvlKELSDEVEEELKKLGFERLeleeeGTPSELIREIKEELEEIEKER----ESLLEEL--- 238
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 226693374 1271 pafesaredssslvAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRT 1318
Cdd:PRK05771  239 --------------KELAKKYLEELLALYEYLEIELERAEALSKFLKT 272
PRK12704 PRK12704
phosphodiesterase; Provisional
1186-1351 6.66e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 44.38  E-value: 6.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1186 LEKQKDEKITQQEEKYEALIQNLEKDKERLvknheqdkehliqelnfeKNKAVQTALDEFKVERELVEKELLEKVKHLEN 1265
Cdd:PRK12704   25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAI------------------KKEALLEAKEEIHKLRNEFEKELRERRNELQK 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1266 QIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKnEEMQNvrtsLIAEQQTNFNTV--LTREKMRKEnII 1343
Cdd:PRK12704   87 LEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKE-EELEE----LIEEQLQELERIsgLTAEEAKEI-LL 160

                  ....*...
gi 226693374 1344 NDLSDKLK 1351
Cdd:PRK12704  161 EKVEEEAR 168
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
848-1266 7.87e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 44.19  E-value: 7.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   848 EKVKCSLEITLKEKHQQELQSLK---IEYECKLDALVkDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVC 924
Cdd:TIGR00618  485 ETRKKAVVLARLLELQEEPCPLCgscIHPNPARQDID-NPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKE 563
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   925 VQQEKDQKLLEMEKIMHTQHCEIKELKQsremALEDLKKLHDEKIEsLRAEFQCLEQNHLKELEDTLH----IRHTQEFE 1000
Cdd:TIGR00618  564 QMQEIQQSFSILTQCDNRSKEDIPNLQN----ITVRLQDLTEKLSE-AEDMLACEQHALLRKLQPEQDlqdvRLHLQQCS 638
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1001 K----VMTDHNMSLEKLKKENQQ----RIDQMLESHASTIQEKEQQLQELKLKVSD--------LSDMRCKLEVELAL-K 1063
Cdd:TIGR00618  639 QelalKLTALHALQLTLTQERVRehalSIRVLPKELLASRQLALQKMQSEKEQLTYwkemlaqcQTLLRELETHIEEYdR 718
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1064 EAETDEIKILLEESRTQQKEMLKSLLEQETENLRTEISKLNQKIHDNNEsyqvglseLRALMTIEKDQCISELIS----- 1138
Cdd:TIGR00618  719 EFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNN--------EEVTAALQTGAELSHLAAeiqff 790
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1139 -RHEEESNILKAELDNVTSLHRQAYEIEKKLkeQIVELQTRLNSELSALEK--QKDEKITQQEEKYEALIQNLEKDKERL 1215
Cdd:TIGR00618  791 nRLREEDTHLLKTLEAEIGQEIPSDEDILNL--QCETLVQEEEQFLSRLEEksATLGEITHQLLKYEECSKQLAQLTQEQ 868
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|.
gi 226693374  1216 VKNHEQDKehliqELNFEKNKAVQTALDEFkveRELVEKELLEKVKHLENQ 1266
Cdd:TIGR00618  869 AKIIQLSD-----KLNGINQIKIQFDGDAL---IKFLHEITLYANVRLANQ 911
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1034-1373 8.90e-04

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 44.04  E-value: 8.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1034 EKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLE---------------ESRTQQ---KEMLKSLLEQETEN 1095
Cdd:pfam10174  124 EHERQAKELFLLRKTLEEMELRIETQKQTLGARDESIKKLLEmlqskglpkksgeedWERTRRiaeAEMQLGHLEVLLDQ 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1096 LRTEISKLNQKIHDNNESYQvGLSELRALMTI--EKDQCISEL---ISRHEEESNILKAELDnvtsLHRQAYEIEKKlke 1170
Cdd:pfam10174  204 KEKENIHLREELHRRNQLQP-DPAKTKALQTVieMKDTKISSLernIRDLEDEVQMLKTNGL----LHTEDREEEIK--- 275
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1171 qivelQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEH---LIQELNFEKNKA--VQTALDEF 1245
Cdd:pfam10174  276 -----QMEVYKSHSKFMKNKIDQLKQELSKKESELLALQTKLETLTNQNSDCKQHievLKESLTAKEQRAaiLQTEVDAL 350
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1246 KVERELVEKELLEKVKHLENqiaktpafesAREDSSSLVAELQE-----KLQEEKA----KFLEQLEEQEKRKNEEMQNV 1316
Cdd:pfam10174  351 RLRLEEKESFLNKKTKQLQD----------LTEEKSTLAGEIRDlkdmlDVKERKInvlqKKIENLQEQLRDKDKQLAGL 420
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374  1317 RTSLIAEQQTNFNTvltrekmrkeniindlsDKLKSTMQQQERDKD-LIESLSEDRAR 1373
Cdd:pfam10174  421 KERVKSLQTDSSNT-----------------DTALTTLEEALSEKErIIERLKEQRER 461
PLN02939 PLN02939
transferase, transferring glycosyl groups
1134-1359 1.35e-03

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 43.35  E-value: 1.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1134 SELISRHEEESNILKAELDNVTSLHRQAyeiekklkeqivelqTRLNSELSALEKQKDEKITQQEEKyealIQNLEKDKE 1213
Cdd:PLN02939   63 SKLQSNTDENGQLENTSLRTVMELPQKS---------------TSSDDDHNRASMQRDEAIAAIDNE----QQTNSKDGE 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1214 RLVKNHEQDKEHLIQelNFEKN-----KAVQTALDEfkVERELVEKELLE-KVKHLENQIAKTPAfesaredssslvael 1287
Cdd:PLN02939  124 QLSDFQLEDLVGMIQ--NAEKNilllnQARLQALED--LEKILTEKEALQgKINILEMRLSETDA--------------- 184
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 226693374 1288 QEKLQEEKAKFLEQLEEQ-EKRKNEEMQNVRTSLIAEQQTNFN-TVLTREKMRKENIINDLSDKLKSTMQQQER 1359
Cdd:PLN02939  185 RIKLAAQEKIHVEILEEQlEKLRNELLIRGATEGLCVHSLSKElDVLKEENMLLKDDIQFLKAELIEVAETEER 258
PRK12704 PRK12704
phosphodiesterase; Provisional
1158-1311 1.89e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 42.84  E-value: 1.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1158 HRQAYEIEKKLKEQIVELQT----------RLNSELSALEKqKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHL- 1226
Cdd:PRK12704   67 HKLRNEFEKELRERRNELQKlekrllqkeeNLDRKLELLEK-REEELEKKEKELEQKQQELEKKEEELEELIEEQLQELe 145
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1227 -IQELNFEKNKavqtaldefkverelveKELLEKVKhlenqiaktpafESAREDSSSLVAELqeklqEEKAKfleqlEEQ 1305
Cdd:PRK12704  146 rISGLTAEEAK-----------------EILLEKVE------------EEARHEAAVLIKEI-----EEEAK-----EEA 186

                  ....*.
gi 226693374 1306 EKRKNE 1311
Cdd:PRK12704  187 DKKAKE 192
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1147-1359 2.08e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.44  E-value: 2.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1147 LKAELDNVTSLHRQAYEIEKKLKEQIVELQT---RLNSELSALEKQKDEkITQQEEKYEALIQNLEKDKERLVKNHEQDK 1223
Cdd:COG4942    25 AEAELEQLQQEIAELEKELAALKKEEKALLKqlaALERRIAALARRIRA-LEQELAALEAELAELEKEIAELRAELEAQK 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1224 EHLIQELNF----EKNKAVQTALDEFKVERELVEKELLEKV-KHLENQIAktpAFESAREDSSSLVAELQEKlQEEKAKF 1298
Cdd:COG4942   104 EELAELLRAlyrlGRQPPLALLLSPEDFLDAVRRLQYLKYLaPARREQAE---ELRADLAELAALRAELEAE-RAELEAL 179
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 226693374 1299 LEQLEEQEKRKNEEMQNvRTSLIAEQQTNFNTVLTR--EKMRKENIINDLSDKLKSTMQQQER 1359
Cdd:COG4942   180 LAELEEERAALEALKAE-RQKLLARLEKELAELAAElaELQQEAEELEALIARLEAEAAAAAE 241
Ubl_TBK1_like cd12219
ubiquitin-like (Ubl) domain found in non-canonical Inhibitor of kappa B kinases IKKepsilon and ...
3-76 2.25e-03

ubiquitin-like (Ubl) domain found in non-canonical Inhibitor of kappa B kinases IKKepsilon and TBK1, and similar proteins; IKKepsilon and TBK1 (TRAF family member-associated NF-kappaB activator-binding kinase 1) are non-canonical members of IKK family. They have been characterized as activators of nuclear factor-kappaB (NF-kappaB), but they are not essential for NF-kappaB activation. They play critical roles in antiviral response via phosphorylation and activation of transcription factors IRF3, IRF7, STAT1 and STAT3. They are also involved in the survival, tumorigenesis and development of various cancers. Both IKKepsilon and TBK1 contain an N-terminal protein kinase domain followed a ubiquitin-like (Ubl) domain. The Ubl domain acts as a protein-protein interaction domain, and has been implicated in regulating kinase activity, which modulates interactions in the interferon pathway.


Pssm-ID: 340518  Cd Length: 77  Bit Score: 38.37  E-value: 2.25e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374    3 LYVFLVNTGTTLTF---DTEltvqTVADLKHAIQSKYKIAIQHQVLVVNGGE-CMAADRRVCTYSAGTDTNPIFLFNK 76
Cdd:cd12219     4 IHVFSVSTCELLKIyldPTE----TLAEFQELIAEQTEIPAKNQLLLFEGQLlEEEVTLPVSDYPKTTEENPIILFNK 77
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
859-1040 2.43e-03

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 42.44  E-value: 2.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   859 KEKHQQELQSLKIEYECKLDALVKDSEENVNK-ILKLKENLVSLEEALQN-KDNEFtsiKHEKDAIVcvQQEKDQKLLEM 936
Cdd:pfam09731  256 RIVFQQELVSIFPDIIPVLKEDNLLSNDDLNSlIAHAHREIDQLSKKLAElKKREE---KHIERALE--KQKEELDKLAE 330
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   937 EKIMHTQHCEIKELKQSR---EMALEDLKKLHDEKIES-LRAEFQCLEQ---NHLKELEDTLHIRHTQEF-EKVMTDHNM 1008
Cdd:pfam09731  331 ELSARLEEVRAADEAQLRlefEREREEIRESYEEKLRTeLERQAEAHEEhlkDVLVEQEIELQREFLQDIkEKVEEERAG 410
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 226693374  1009 SLEKLKK--ENQQRIDQMLESHASTIQE--KEQQLQ 1040
Cdd:pfam09731  411 RLLKLNEllANLKGLEKATSSHSEVEDEnrKAQQLW 446
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
773-1087 2.70e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 42.36  E-value: 2.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   773 SRRMQDTSTRGNEGFGDRAALHVQLEKCRAAAQDSHSSIQTIKDDLCHFRTFVQKEQCDLANYLKCTAVEIRNIIEKVKC 852
Cdd:TIGR02169  229 LKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLER 308
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   853 SLEitLKEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQK 932
Cdd:TIGR02169  309 SIA--EKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   933 LLEM-EKIMHTQHcEIKELKQSREMALEDLKKLHdEKIESLRAEFQCLEQNHLkeledtlhirhtqEFEKVMTDHNMSLE 1011
Cdd:TIGR02169  387 LKDYrEKLEKLKR-EINELKRELDRLQEELQRLS-EELADLNAAIAGIEAKIN-------------ELEEEKEDKALEIK 451
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 226693374  1012 KlKKENQQRIDQMLESHastiqekEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQkEMLKS 1087
Cdd:TIGR02169  452 K-QEWKLEQLAADLSKY-------EQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVE-EVLKA 518
AAA_13 pfam13166
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ...
1015-1188 2.79e-03

AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins. This family includes the PrrC protein that is thought to be the active component of the anticodon nuclease.


Pssm-ID: 463796 [Multi-domain]  Cd Length: 712  Bit Score: 42.36  E-value: 2.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1015 KENQQRIDQMLESHASTIQEKEQQLQELKLKVSDLS---DMRCKLEVELALKEAETDEIKILLEESRtqqKEMLKSL--- 1088
Cdd:pfam13166  282 TEFQNRLQKLIEKVESAISSLLAQLPAVSDLASLLSafeLDVEDIESEAEVLNSQLDGLRRALEAKR---KDPFKSIeld 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1089 -LEQETENLRTEISKLNQKIHDNNEsyqvGLSELRALMTIEKDQ----CISELISRHEEESNILKAELDNVTSLHRQAYE 1163
Cdd:pfam13166  359 sVDAKIESINDLVASINELIAKHNE----ITDNFEEEKNKAKKKlrlhLVEEFKSEIDEYKDKYAGLEKAINSLEKEIKN 434
                          170       180
                   ....*....|....*....|....*...
gi 226693374  1164 IE---KKLKEQIVELQTRLNSELSALEK 1188
Cdd:pfam13166  435 LEaeiKKLREEIKELEAQLRDHKPGADE 462
FAM184 pfam15665
Family with sequence similarity 184, A and B; The function of FAM184 is not known.
857-1046 2.91e-03

Family with sequence similarity 184, A and B; The function of FAM184 is not known.


Pssm-ID: 464788 [Multi-domain]  Cd Length: 211  Bit Score: 40.80  E-value: 2.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   857 TLKEKHQQELQSLKIEYECKLDAL-VKDSEEnvnkiLKLKENLVSLEEALQnkdnEFTSIKHEkdAIVCVQQEKDQkLLE 935
Cdd:pfam15665   18 ALKEAHEEEIQQILAETREKILQYkSKIGEE-----LDLKRRIQTLEESLE----QHERMKRQ--ALTEFEQYKRR-VEE 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   936 MEKIMHTQHCEiKELKQSREMalEDLKKLHDEKIESLRAEFQCLEQNHLKELEDtLHIRHTQEFEKVMTDHNMSLEKLKK 1015
Cdd:pfam15665   86 RELKAEAEHRQ-RVVELSREV--EEAKRAFEEKLESFEQLQAQFEQEKRKALEE-LRAKHRQEIQELLTTQRAQSASSLA 161
                          170       180       190
                   ....*....|....*....|....*....|.
gi 226693374  1016 EnQQRIDQMLESHASTIQEKEQQLQELKLKV 1046
Cdd:pfam15665  162 E-QEKLEELHKAELESLRKEVEDLRKEKKKL 191
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
947-1209 3.61e-03

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 41.84  E-value: 3.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  947 IKELKQSREMALEDLK-KLHDEKIESLRAEFQcleqnHLKELEDTLhiRHTQEFEKVMTDHNMSLEKLKKEN-QQRIDQM 1024
Cdd:PRK05771   22 LEALHELGVVHIEDLKeELSNERLRKLRSLLT-----KLSEALDKL--RSYLPKLNPLREEKKKVSVKSLEElIKDVEEE 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1025 LESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEaetdEIKILLEESRTQQKemLKSLLEQETENLRTEISKLN 1104
Cdd:PRK05771   95 LEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDL----DLSLLLGFKYVSVF--VGTVPEDKLEELKLESDVEN 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1105 QKIHDNNESYQ----VGLSELRALmtiekdqcISELISRHEEESNILKAEldnvtslhRQAYEIEKKLKEQIVELQT--- 1177
Cdd:PRK05771  169 VEYISTDKGYVyvvvVVLKELSDE--------VEEELKKLGFERLELEEE--------GTPSELIREIKEELEEIEKere 232
                         250       260       270
                  ....*....|....*....|....*....|..
gi 226693374 1178 RLNSELSALEKQKDEKITQQEEKYEALIQNLE 1209
Cdd:PRK05771  233 SLLEELKELAKKYLEELLALYEYLEIELERAE 264
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1174-1389 3.65e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 41.67  E-value: 3.65e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1174 ELQTRLNSELSALEKQKDE---KITQQEEKYEALIQNLEKDKERLVKNHEQdkehlIQELNfEKNKAVQTALDEFKVERE 1250
Cdd:COG4942    20 DAAAEAEAELEQLQQEIAElekELAALKKEEKALLKQLAALERRIAALARR-----IRALE-QELAALEAELAELEKEIA 93
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1251 LVEKELLEKVKHLENQIAKtpAFESAREDSSSLVAELQEKLQEEK-AKFLEQLEEQEKRKNEEMQNVRTSLIAEQQTnfn 1329
Cdd:COG4942    94 ELRAELEAQKEELAELLRA--LYRLGRQPPLALLLSPEDFLDAVRrLQYLKYLAPARREQAEELRADLAELAALRAE--- 168
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1330 tvLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEekkqleeEVSKLR 1389
Cdd:COG4942   169 --LEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAA-------ELAELQ 219
MAP65_ASE1 pfam03999
Microtubule associated protein (MAP65/ASE1 family);
858-1151 3.88e-03

Microtubule associated protein (MAP65/ASE1 family);


Pssm-ID: 427641 [Multi-domain]  Cd Length: 477  Bit Score: 41.53  E-value: 3.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   858 LKEKHQQELQSLKIEYEcKLDALVKDSEENVNKILKLKENLVSLEEALQNKDnefTSIKHekdaivcVQQEKDQKLLEME 937
Cdd:pfam03999   46 KEQRILQSIADLRAEAA-ILCLYMRNRLLHEERDPFEPKKGMSLLQKEKKLD---TQLEH-------LRKEKAPRLAEIK 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   938 KIMHTQHCEIKELKQSREMALED--------------LKKLHDEKiESLRAEFQCLEQNHLKELEDtLHIRHTQEFEKVM 1003
Cdd:pfam03999  115 ELLEQLQQLCEELGEEPLPLLIDplpsleelesfrkhLENLRNEK-ERRLEEVNELKKQIKLLMEE-LDLVPGTDFEEDL 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1004 TDHNMSLEKLKKENQQRIDQMLESHAStiqEKEQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRTQQKE 1083
Cdd:pfam03999  193 LCESEDNFCLSRENIDKLRKLIKQLEE---QKAEREEKIDDLREKILELWNRLQVPQEEQESFVRENNSLSQDTIDALRE 269
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 226693374  1084 MLKSLLEQETENLRTEISKLNQKIHDNNESYQVGLSELRALMTIEKDQCISELISRHEEESNILKAEL 1151
Cdd:pfam03999  270 ELQRLEELKKKNIKKLIEDLRVEIEELWDKLFYSTEQRKRFIPFFEELYTEDLLELHELELKRLKEEY 337
PRK01156 PRK01156
chromosome segregation protein; Provisional
860-1346 4.76e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 41.81  E-value: 4.76e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  860 EKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIkhekdaivcvqQEKDQKLLEMEKi 939
Cdd:PRK01156  269 LEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKY-----------HAIIKKLSVLQK- 336
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  940 MHTQHCEIKELKQSREMALEDLKKLHDEKIESLRAefqcLEQNHLKeledtlhIRHTQEFEKVMTDHNMSLEKLKKENQQ 1019
Cdd:PRK01156  337 DYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKS----IESLKKK-------IEEYSKNIERMSAFISEILKIQEIDPD 405
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1020 RIDQMLESHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAE-----------TDEIKILLEESRTQqkemlKSL 1088
Cdd:PRK01156  406 AIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQsvcpvcgttlgEEKSNHIINHYNEK-----KSR 480
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1089 LEQETENLRTEISKLNQKIHD--------------NNESYQVGLSELRALMTIEKDQcISELISRHEEESNILkaelDNV 1154
Cdd:PRK01156  481 LEEKIREIEIEVKDIDEKIVDlkkrkeyleseeinKSINEYNKIESARADLEDIKIK-INELKDKHDKYEEIK----NRY 555
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1155 TSLHRQayEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQqeekyealIQNLEKDKERLVKNHEQDKEH-------LI 1227
Cdd:PRK01156  556 KSLKLE--DLDSKRTSWLNALAVISLIDIETNRSRSNEIKKQ--------LNDLESRLQEIEIGFPDDKSYidksireIE 625
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1228 QELNFEKNKavQTALDEFKVERElvekELLEKVKHLENQIAKTPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQ-E 1306
Cdd:PRK01156  626 NEANNLNNK--YNEIQENKILIE----KLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKANRaR 699
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|.
gi 226693374 1307 KRKNEEMQNVRTSLIAEQQTNFNTVLtrEKMRK-ENIINDL 1346
Cdd:PRK01156  700 LESTIEILRTRINELSDRINDINETL--ESMKKiKKAIGDL 738
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
934-1320 4.78e-03

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 41.36  E-value: 4.78e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  934 LEMEKImhtqhcEIKELKQSREmaLEDLKKLH-----DEKIESLRAEFQCLEQNHLKELEDTLhirhtqeFEKVMTDHNM 1008
Cdd:PRK04778   34 LEERKQ------ELENLPVNDE--LEKVKKLNltgqsEEKFEEWRQKWDEIVTNSLPDIEEQL-------FEAEELNDKF 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1009 SLEKLKKENQQrIDQMLESHASTIQEKEQQLQELKlkvsdlsdmrcklEVElalkEAETDEIKILLEESRTQQKEMLK-- 1086
Cdd:PRK04778   99 RFRKAKHEINE-IESLLDLIEEDIEQILEELQELL-------------ESE----EKNREEVEQLKDLYRELRKSLLAnr 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1087 -------SLLEQETENLRTEISKLNQKIHDNN--ESYQVgLSELRALMT-IEKDqcISELISRHEEESNILKAELDNVTS 1156
Cdd:PRK04778  161 fsfgpalDELEKQLENLEEEFSQFVELTESGDyvEAREI-LDQLEEELAaLEQI--MEEIPELLKELQTELPDQLQELKA 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1157 LHRQAYE-----IEKKLKEQIVELQTRLNSELSALEKQK----DEKITQQEEKYEALIQNLEKD---KERLVKNHEQDKE 1224
Cdd:PRK04778  238 GYRELVEegyhlDHLDIEKEIQDLKEQIDENLALLEELDldeaEEKNEEIQERIDQLYDILEREvkaRKYVEKNSDTLPD 317
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1225 HLiQELNfEKNKAVQTALDEFKVERELVEKELlEKVKHLENQIAK--------TPAFESAREDSSSLVAELQ-------- 1288
Cdd:PRK04778  318 FL-EHAK-EQNKELKEEIDRVKQSYTLNESEL-ESVRQLEKQLESlekqydeiTERIAEQEIAYSELQEELEeilkqlee 394
                         410       420       430
                  ....*....|....*....|....*....|...
gi 226693374 1289 -EKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSL 1320
Cdd:PRK04778  395 iEKEQEKLSEMLQGLRKDELEAREKLERYRNKL 427
mS26_Tt cd23695
Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is ...
1065-1371 5.30e-03

Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is a component of small subunit (SSU) in Tetrahymena thermophila mitochondrial ribosome (mitoribosome). The structure of the mitoribosome reveals an assembly of 94-ribosomal proteins and four-rRNAs with an additional protein mass of ~700 kDa on the small subunit; the large mitoribosomal subunit (LSU) lacks 5S rRNA.


Pssm-ID: 467909 [Multi-domain]  Cd Length: 496  Bit Score: 41.35  E-value: 5.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1065 AETDEIKILLEESRTQQKEMLKSLLEQETENLRTEI--------SK-------LNQKIHDN---------NESYQVGLSE 1120
Cdd:cd23695    65 ATQNHIKLLEKKSVKKEENERKYLLEQDVKAMNKKIildvmneeSKnwinlqnMNEKINPNlilpdtildETSYYLKLQE 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1121 LRALMTIEKDQCISELISrhEEESNILKAEL------DNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKI 1194
Cdd:cd23695   145 LAFLFEQGDHEEMDKLLD--ENEEIEYKNSLlmpiyqDLKSLIKHLKYTELFKLLKEYQDAKAIIIEDFRESSEEGAEKL 222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1195 TQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELN-FEK-----------NKAVQTALDEFKVERELVEKELLEKVKH 1262
Cdd:cd23695   223 EKLEKAFATLLKNYKEELEEPEKQLEFMQKRLLDLYNlLRLwgqyitivkmpDSVVRDIMNKTQARPEVAKLNSKQELED 302
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1263 LENQIAKTPAFE--------SAREDSSSLVAELQEKLQEEKAKFLEQLEEQE---------KRKNEEMQNVRTSLIAEQQ 1325
Cdd:cd23695   303 AKNRKRDTEENEfdddyesaDEGETSDEEDEIEEENFQLQKEKKKEEELNAEfniaknslyKFSPQNDKNVVDDRDFYSG 382
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*.
gi 226693374 1326 TNFNTVLTREKMRKENIINDLSDKlksTMQQQERDKDLIESLSEDR 1371
Cdd:cd23695   383 VDLENVFPRALLNNLNDFTGLDFQ---NVKEILNNEEKLKIIQGED 425
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
859-1185 5.64e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 41.03  E-value: 5.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   859 KEKHQQELQSLKIEYECKLDALVKDSEENVNKILKLKENLVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKLLEMEK 938
Cdd:pfam07888   57 REKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSASSEELSEEKDALLAQRAAHEARIRELEE 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   939 IMHT-------QHCEIKELKQSREMALEDLKKLHDEKiESLRAEFQCLEQ------NHLKELEDTLHIRHTQEFekVMTD 1005
Cdd:pfam07888  137 DIKTltqrvleRETELERMKERAKKAGAQRKEEEAER-KQLQAKLQQTEEelrslsKEFQELRNSLAQRDTQVL--QLQD 213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1006 HNMSLEKLKKENQQRIDQMLESHA--STIQEK----EQQLQELKLKVSDLSDMRCKLEVELALKEAETDEIKILLEESRT 1079
Cdd:pfam07888  214 TITTLTQKLTTAHRKEAENEALLEelRSLQERlnasERKVEGLGEELSSMAAQRDRTQAELHQARLQAAQLTLQLADASL 293
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1080 QQKEMlKSLLEQETENLRT-------EISKLNQKIHDNNESYQVGLSE---LRALMTIEKDqCISELISRHEEESNILKA 1149
Cdd:pfam07888  294 ALREG-RARWAQERETLQQsaeadkdRIEKLSAELQRLEERLQEERMErekLEVELGREKD-CNRVQLSESRRELQELKA 371
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 226693374  1150 ELDNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSA 1185
Cdd:pfam07888  372 SLRVAQKEKEQLQAEKQELLEYIRQLEQRLETVADA 407
CCDC73 pfam15818
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ...
826-1045 5.93e-03

Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.


Pssm-ID: 464893 [Multi-domain]  Cd Length: 1048  Bit Score: 41.47  E-value: 5.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   826 QKEQCDLANYLKCTAVEIrnIIEKVKC-------SLEITLKEKHQQELQS-LKIEYEckldaLVKDSEENVNKILKLKEN 897
Cdd:pfam15818  182 ESEICSLKKELKKVTSDL--IKSKVTCqykmgeeNINLTIKEQKFQELQErLNMELE-----LNKKINEEITHIQEEKQD 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   898 LVSLEEALQNKDNEFTSIKHEKDAIVCVQQEKDQKL-----LEMEKIMHTQHcEIKELKQSREMALEDLKKlhdeKIESL 972
Cdd:pfam15818  255 IIISFQHMQQLLQQQTQANTEMEAELKALKENNQTLerdneLQREKVKENEE-KFLNLQNEHEKALGTWKK----HVEEL 329
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 226693374   973 RAEFQCL--EQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQRIDQMLESHASTIQEKEQQLQELKLK 1045
Cdd:pfam15818  330 NGEINEIknELSSLKETHIKLQEHYNKLCNQKKFEEDKKFQNVPEVNNENSEMSTEKSENLIIQKYNSEQEIREE 404
Apolipoprotein pfam01442
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ...
1152-1308 6.54e-03

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.


Pssm-ID: 460211 [Multi-domain]  Cd Length: 175  Bit Score: 39.17  E-value: 6.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1152 DNVTSLHRQAYEIEKKLKEQIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELN 1231
Cdd:pfam01442    4 DSLDELSTYAEELQEQLGPVAQELVDRLEKETEALRERLQKDLEEVRAKLEPYLEELQAKLGQNVEELRQRLEPYTEELR 83
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 226693374  1232 FEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAktPAFESAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKR 1308
Cdd:pfam01442   84 KRLNADAEELQEKLAPYGEELRERLEQNVDALRARLA--PYAEELRQKLAERLEELKESLAPYAEEVQAQLSQRLQE 158
SHE3 pfam17078
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ...
1138-1351 6.58e-03

SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.


Pssm-ID: 293683 [Multi-domain]  Cd Length: 228  Bit Score: 40.11  E-value: 6.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1138 SRHEEESNILKAELDNVTSLHRQAYEIEKKLKEQIVELQTRLNsELSALEKQKD---EKITQQEEKYEALIQNLEKDKER 1214
Cdd:pfam17078   41 SKFLENLASLKHENDNLSSMLNRKERRLKDLEDQLSELKNSYE-ELTESNKQLKkrlENSSASETTLEAELERLQIQYDA 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1215 LVKNHEQDKEHLIQELNfeknkAVQTALDEFKVERElvekellekvKHLENQIAKTpafesaredsSSLVAELQEKLQEE 1294
Cdd:pfam17078  120 LVDSQNEYKDHYQQEIN-----TLQESLEDLKLENE----------KQLENYQQRI----------SSNDKDIDTKLDSY 174
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 226693374  1295 KAKFlEQLEEQEKRKNEEMQNVRTSLIAEQQTNFNTVLTREkmrKENIINDLSDKLK 1351
Cdd:pfam17078  175 NNKF-KNLDNIYVNKNNKLLTKLDSLAQLLDLPSWLNLYPE---SRNKILEYAEKME 227
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1038-1467 6.92e-03

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 40.88  E-value: 6.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1038 QLQELKLKVSDLSDmrcklevELALKEAETDEIKILLEESRTQQKEMLKSLLEQETEnLRTEISKLNQKIHDNNESYQVG 1117
Cdd:pfam05557    3 ELIESKARLSQLQN-------EKKQMELEHKRARIELEKKASALKRQLDRESDRNQE-LQKRIRLLEKREAEAEEALREQ 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1118 LSELRALMTIEkdqcisELISRHEEESNILKAEL-DNVTSLHRQAYEIEKKLKEQIVELQTrLNSELSALEKQKDEKITQ 1196
Cdd:pfam05557   75 AELNRLKKKYL------EALNKKLNEKESQLADArEVISCLKNELSELRRQIQRAELELQS-TNSELEELQERLDLLKAK 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1197 QEEkYEALIQNLEKDKERLvknheQDKEHLIQELNFEKNKAVQTALDEFKVERELV-----EKE---LLEKVKHLENQI- 1267
Cdd:pfam05557  148 ASE-AEQLRQNLEKQQSSL-----AEAEQRIKELEFEIQSQEQDSEIVKNSKSELAripelEKElerLREHNKHLNENIe 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1268 -------------AKTPAFESAREDSSSLVAELQEKLQEEKA-KFLEQLEEQEKRKNEEmqnVRTSLIAEQQTNF----- 1328
Cdd:pfam05557  222 nklllkeevedlkRKLEREEKYREEAATLELEKEKLEQELQSwVKLAQDTGLNLRSPED---LSRRIEQLQQREIvlkee 298
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1329 NTVLTREKMRKENIINDLSDKLKSTMQQQERDKDLIESLSEDRARLLEEKKQLEEEVSKLRtsSFLSSAPVAAAPELYGa 1408
Cdd:pfam05557  299 NSSLTSSARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLTKERDGYR--AILESYDKELTMSNYS- 375
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 226693374  1409 caPELPGEPERSVMETADEGRLDSAMETSMmSVQENMLSEEKQRIMLLERTLQLKEEEN 1467
Cdd:pfam05557  376 --PQLLERIEEAEDMTQKMQAHNEEMEAQL-SVAEEELGGYKQQAQTLERELQALRQQE 431
Metal_resist pfam13801
Heavy-metal resistance; This is a metal-binding protein which is involved in resistance to ...
1224-1302 7.35e-03

Heavy-metal resistance; This is a metal-binding protein which is involved in resistance to heavy-metal ions. The protein forms a four-helix hooked hairpin, consisting of two long alpha helices each flanked by a shorter alpha helix. It binds a metal ion in a type-2 like centre. It contains two copies of an LTXXQ motif.


Pssm-ID: 433488 [Multi-domain]  Cd Length: 119  Bit Score: 38.04  E-value: 7.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1224 EHLIQELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTP----AFESAREDSSSLVAELQEKLQEEKAKFL 1299
Cdd:pfam13801   33 LRAALGLPAEQRERLRAALRDHARELRALRRELRAARRELAALLAAPPfdpaAIEAALAEARQARAALQAQIEEALLEFA 112

                   ...
gi 226693374  1300 EQL 1302
Cdd:pfam13801  113 ATL 115
PKK pfam12474
Polo kinase kinase; This domain family is found in eukaryotes, and is approximately 140 amino ...
1196-1324 7.65e-03

Polo kinase kinase; This domain family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam00069. Polo-like kinase 1 (Plx1) is essential during mitosis for the activation of Cdc25C, for spindle assembly, and for cyclin B degradation. This family is Polo kinase kinase (PKK) which phosphorylates Polo kinase and Polo-like kinase to activate them. PKK is a serine/threonine kinase.


Pssm-ID: 463600 [Multi-domain]  Cd Length: 139  Bit Score: 38.31  E-value: 7.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1196 QQEEKYEALIQNL----EKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELLEKVKHLENQIAKTP 1271
Cdd:pfam12474    7 QQKDRFEQERQQLkkryEKELEQLERQQKQQIEKLEQRQTQELRRLPKRIRAEQKKRLKMFRESLKQEKKELKQEVEKLP 86
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 226693374  1272 AFEsAREDSSSLVAELQEKLQEEKAKFLEQLEEQEKRKNEEMQNVRTSLIAEQ 1324
Cdd:pfam12474   87 KFQ-RKEAKRQRKEELELEQKHEELEFLQAQSEALERELQQLQNEKRKELAEH 138
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
1005-1268 8.44e-03

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 40.68  E-value: 8.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1005 DHNMSLEKLKkENQQRIDQMLeSHASTIQEKEQQLQELKLKVSDLSDMRCKLEVELALKEAEtDEIKILLEEsRTQQKEM 1084
Cdd:PRK05771   40 LSNERLRKLR-SLLTKLSEAL-DKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIE-KEIKELEEE-ISELENE 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1085 LKSLLEQETE-----NLRTEISKLNqkihdNNESYQVGLSELRA----LMTIEKDQCISELISRHEEESNILkaeldnVT 1155
Cdd:PRK05771  116 IKELEQEIERlepwgNFDLDLSLLL-----GFKYVSVFVGTVPEdkleELKLESDVENVEYISTDKGYVYVV------VV 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1156 SLHRQAYEIEKKLKE-QIVELQTRLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEK 1234
Cdd:PRK05771  185 VLKELSDEVEEELKKlGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAE 264
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 226693374 1235 nkavqtALDEFKVEREL------VEKELLEKVKHLENQIA 1268
Cdd:PRK05771  265 ------ALSKFLKTDKTfaiegwVPEDRVKKLKELIDKAT 298
OmpH smart00935
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1178-1269 8.58e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 214922 [Multi-domain]  Cd Length: 140  Bit Score: 38.33  E-value: 8.58e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   1178 RLNSELSALEKQKDEKITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVERELVEKELL 1257
Cdd:smart00935   18 AAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQKIL 97
                            90
                    ....*....|..
gi 226693374   1258 EKVKHLENQIAK 1269
Cdd:smart00935   98 DKINKAIKEVAK 109
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
754-1223 8.62e-03

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 41.19  E-value: 8.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   754 QSPEMMVESLYSSVINA---------IDSRRMQDTSTRGNEGFGDRAalhvqlEKCRAAAQDSH--SSIQTIKDDLCHFR 822
Cdd:TIGR01612 1258 KSPEIENEMGIEMDIKAemetfnishDDDKDHHIISKKHDENISDIR------EKSLKIIEDFSeeSDINDIKKELQKNL 1331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   823 TFVQKEQCDLANYL-KCTAV-------EIRNIIEKVK------------------CSLEITLKEKHQQELQSLKIEYECK 876
Cdd:TIGR01612 1332 LDAQKHNSDINLYLnEIANIynilklnKIKKIIDEVKeytkeieennknikdeldKSEKLIKKIKDDINLEECKSKIEST 1411
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   877 LDAlvKDSEENVNKILKLKENLVSLEealQNKDNEFTSIKHEKDAI------VCVQQEKDQKLLEMEKIMHTQHCE--IK 948
Cdd:TIGR01612 1412 LDD--KDIDECIKKIKELKNHILSEE---SNIDTYFKNADENNENVlllfknIEMADNKSQHILKIKKDNATNDHDfnIN 1486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374   949 ELKQSREMAledlKKLHDE------KIESLRAEFQCLEQNHLKELEDTLHIRHTQEFEKVMTDHNMSLEKLKKENQQRID 1022
Cdd:TIGR01612 1487 ELKEHIDKS----KGCKDEadknakAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKFIL 1562
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1023 QMLESHASTIQEKEQQ-----------------------LQELKLKVSDLSDMRCKleVELALKEAETDEIKILLEESRT 1079
Cdd:TIGR01612 1563 EAEKSEQKIKEIKKEKfrieddaakndksnkaaidiqlsLENFENKFLKISDIKKK--INDCLKETESIEKKISSFSIDS 1640
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1080 QQKEM-------------LKSLLEQ---------ETENLRTEISKLNQKIHDNNESYQVGLSEL---RALMTIEKDQCIS 1134
Cdd:TIGR01612 1641 QDTELkengdnlnslqefLESLKDQkkniedkkkELDELDSEIEKIEIDVDQHKKNYEIGIIEKikeIAIANKEEIESIK 1720
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374  1135 ELISRHEEE--SNILKAELDNV-TSLHRQAYEIE-KKLKEQIVELQTRLNSELSALEKQ-------KDEKITQQEE---- 1199
Cdd:TIGR01612 1721 ELIEPTIENliSSFNTNDLEGIdPNEKLEEYNTEiGDIYEEFIELYNIIAGCLETVSKEpitydeiKNTRINAQNEflki 1800
                          570       580
                   ....*....|....*....|....*....
gi 226693374  1200 -----KYEALIQNLEKDKERLVKNHEQDK 1223
Cdd:TIGR01612 1801 ieiekKSKSYLDDIEAKEFDRIINHFKKK 1829
PRK00106 PRK00106
ribonuclease Y;
1193-1357 8.64e-03

ribonuclease Y;


Pssm-ID: 178867 [Multi-domain]  Cd Length: 535  Bit Score: 40.62  E-value: 8.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1193 KITQQEEKYEALIQNLEKDKERLVKNHEQDKEHLIQELNFEKNKAVQTALDEFKVE----RELVEKEL---LEKVKHLEN 1265
Cdd:PRK00106   25 KMKSAKEAAELTLLNAEQEAVNLRGKAERDAEHIKKTAKRESKALKKELLLEAKEEarkyREEIEQEFkseRQELKQIES 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 226693374 1266 QIAKTpAFESARED----SSSLVAELQEKLQEEKAKFL----EQLEEQEKRKNEEMQNVRTSLIAEQQ----TNFNTVLT 1333
Cdd:PRK00106  105 RLTER-ATSLDRKDenlsSKEKTLESKEQSLTDKSKHIdereEQVEKLEEQKKAELERVAALSQAEAReiilAETENKLT 183
                         170       180
                  ....*....|....*....|....*...
gi 226693374 1334 REKMRK----ENIINDLSDKLKSTMQQQ 1357
Cdd:PRK00106  184 HEIATRireaEREVKDRSDKMAKDLLAQ 211
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH