Conserved domains on  [gi|1387212236|ref|XP_024831920|]

transport and Golgi organization protein 1 homolog isoform X1 [Bos taurus]

List of domain hits

Name Accession Description Interval E-value
SH3_MIA3 cd11893
Src Homology 3 domain of Melanoma Inhibitory Activity 3 protein; MIA3, also called TANGO or ...
37-109 9.40e-45

Src Homology 3 domain of Melanoma Inhibitory Activity 3 protein; MIA3, also called TANGO or TANGO1, acts as a tumor suppressor of malignant melanoma. It is downregulated or lost in melanoma cell lines. Unlike other MIA family members, MIA3 is widely expressed except in hematopoietic cells. MIA3 is an ER resident transmembrane protein that is required for the loading of collagen VII into transport vesicles. SNPs in the MIA3 gene have been associated with coronary artery disease and myocardial infarction. MIA3 contains an N-terminal SH3-like domain, similar to MIA. It is a member of the recently identified family that also includes MIA, MIAL, and MIA2. MIA is a single domain protein that adopts an SH3 domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. Unlike classical SH3 domains, MIA does not bind proline-rich ligands.


Pssm-ID: 212826  Cd Length: 73  Bit Score: 156.16  E-value: 9.40e-45
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1387212236   37 LCADEECSMLMYRGEALEDFTGPDCRFVNFKKGDTVYVYYKLAGGSPEVWAGSVGHTFGYFPKDLIQVVHEYT 109
Cdd:cd11893      1 RCADEECSMLLCRGKAVKDFTGPDCRFLSFKKGETIYVYYKLSGRRTDLWAGSVGFDFGYFPKDLLDVNHLYT 73
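
As a rough illustration of how the two aligned rows above can be compared, the following Python sketch counts identical residues between the query row and the cd11893 row. The two strings are copied from the alignment. The function name `identity` is hypothetical and not part of any NCBI tool. Skipping gap characters and lowercase residues is an assumption about the display convention, in which lowercase letters appear to mark unaligned positions.

```python
# Aligned rows copied from the SH3_MIA3 (cd11893) hit above.
QUERY = "LCADEECSMLMYRGEALEDFTGPDCRFVNFKKGDTVYVYYKLAGGSPEVWAGSVGHTFGYFPKDLIQVVHEYT"
CDD = "RCADEECSMLLCRGKAVKDFTGPDCRFLSFKKGETIYVYYKLSGRRTDLWAGSVGFDFGYFPKDLLDVNHLYT"


def identity(row_a, row_b):
    """Return (identical, aligned) column counts for two equal-length rows.

    Assumption: a column counts as aligned only when both rows show an
    uppercase residue; gaps ('-') and lowercase letters are skipped.
    """
    if len(row_a) != len(row_b):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for a, b in zip(row_a, row_b):
        if not (a.isupper() and b.isupper()):
            continue  # gap or unaligned (lowercase) position
        aligned += 1
        if a == b:
            identical += 1
    return identical, aligned


identical, aligned = identity(QUERY, CDD)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

The same function can be applied to each pair of rows in the longer, multi-block alignments, summing the two counts across blocks before dividing.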
Mplasa_alph_rch super family cl37461
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1209-1588 5.77e-15

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


The actual alignment was detected with superfamily member TIGR04523:

Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 80.83  E-value: 5.77e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1209 TEQQISEKLKNIMKENAELVQK------LSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIkslEETNEILGDTAKS 1282
Cdd:TIGR04523  178 LEKEKLNIQKNIDKIKNKLLKLelllsnLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEI---NEKTTEISNTQTQ 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1283 LRAMLESerEQNAKNQdlISENKKSIEKLKDVISVNASEFSEVQIALNEakLSEEKVKSECHRVQEENARLKKKKEQLQQ 1362
Cdd:TIGR04523  255 LNQLKDE--QNKIKKQ--LSEKQKELEQNNKKIKELEKQLNQLKSEISD--LNNQKEQDWNKELKSELKNQEKKLEEIQN 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1363 EIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRldcESESEDQNKGGSESdelangevggdRSE 1442
Cdd:TIGR04523  329 QISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKK---ENQSYKQEIKNLES-----------QIN 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1443 KVKNQIKQMMDVSRT-QTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILN 1521
Cdd:TIGR04523  395 DLESKIQNQEKLNQQkDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLS 474
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1387212236 1522 ELYQQKEMALQKKlsqeEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTErSFKNQI 1588
Cdd:TIGR04523  475 RSINKIKQNLEQK----QKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLE-SEKKEK 536
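
Each hit carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". A minimal sketch of how such a line could be parsed from this text dump is shown below. The regular expression and the name `parse_stat_line` are illustrative assumptions based only on the lines visible on this page, not an NCBI-provided parser.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption,
# not an official format specification). The bracketed flag is optional.
STAT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return the fields of a hit summary line as a dict, or None if no match."""
    match = STAT_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "flag": match.group("flag"),  # e.g. "Multi-domain", or None
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


line = "Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 80.83  E-value: 5.77e-15"
print(parse_stat_line(line))
```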
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1642-1929 3.66e-12

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 72.28  E-value: 3.66e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1642 PMPGRPNTQNPPRRGPLSQNGSFGPSPVSGGECSPPLTADPPARPLSATLNRREMPRSEFGSVDGPL--PRPRWASEASG 1719
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrPRRRAARPTVG 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1720 KPSAS----DPESGAAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIPSLAEHPVSVsmAAKGPPPFPGTPLMSSP 1795
Cdd:PHA03247  2694 SLTSLadppPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP--ARPARPPTTAGPPAPAP 2771
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1796 VGGPLLPPIRYGPPPQLCGPFGPRPLPPPFGPGMRPPLGLREYAPGVPPGKRDLPLDPREFLPPGHAPFRPLGSLGPRE- 1874
Cdd:PHA03247  2772 PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLp 2851
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1387212236 1875 ---YFFPG---TRLPPpnhgPQDYPPSSAARDLPPSGSRDEPPPASQGASQDCSPALKQSP 1929
Cdd:PHA03247  2852 lggSVAPGgdvRRRPP----SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
PTZ00341 super family cl31759
Ring-infected erythrocyte surface antigen; Provisional
198-462 6.69e-04

Ring-infected erythrocyte surface antigen; Provisional


The actual alignment was detected with superfamily member PTZ00341:

Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 44.78  E-value: 6.69e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  198 AELRERSEAQKSHPQV----------NSQTGHAQGErtsfesfgEMLQDKLKVPDSENNKTSNSSQVSHEQEKIDAYKLL 267
Cdd:PTZ00341   324 AEMKKRAEKPKKKKSKrrgwlccgggDIETVEPQQE--------EPVQDVGEHQINEYGDILPSLKASINNSAINYYDAV 395
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  268 KTEMTLDlktkfGSTADALVSDDEttrLVTSLEDD-FVEDLDpeyytvGKEEEENKEDFDELPLltftDGEDTKSPGHSG 346
Cdd:PTZ00341   396 KDGKYLD-----DDSSDALYTDED---LLFDLEKQkYMDMLD------GSEDESVEDNEEEHSG----DANEEELSVDEH 457
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  347 IEKHPTEKEQNSNKEHKVEETQPPGIKKGDKEIPKHREDTVFSDVMEgEENTDTDLESSDSKEEDDPLVMDSRLGKPRPE 426
Cdd:PTZ00341   458 VEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVNEHVEEPTVADIVE-QETVDEHVEEPAVDENEEQQTADEHVEEPTIA 536
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1387212236  427 DHTDPEKAADHLVNVEVPKADSDDDPEVGAGLHMKD 462
Cdd:PTZ00341   537 EEHVEEEISTAEEHIEEPASDVQQDSEAAPTIEIPD 572
PTZ00449 super family cl33186
104 kDa microneme/rhoptry antigen; Provisional
735-912 9.80e-03

104 kDa microneme/rhoptry antigen; Provisional


The actual alignment was detected with superfamily member PTZ00449:

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 9.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  735 KQSKERSPEIQDKRLDVDLQNPEKPVSGAIKTDPETEKNKEETRHVSENERKNETAGKavdSLGRDAGGPVVEKEGSSPV 814
Cdd:PTZ00449   490 KKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGG---KPGETKEGEVGKKPGPAKE 566
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  815 HQKVQRPSEGSDVPGKKQNQTPELGEASQK-KDPDYLKEDNHEGHPKTSGLMEKPGVEPSKEDDEHAEKFVDPgSRGSAS 893
Cdd:PTZ00449   567 HKPSKIPTLSKKPEFPKDPKHPKDPEEPKKpKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPP-QRPSSP 645
                          170
                   ....*....|....*....
gi 1387212236  894 EDPDDDPFPWAPHAPVQPE 912
Cdd:PTZ00449   646 ERPEGPKIIKSPKPPKSPK 664
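
The Interval column gives 1-based, inclusive residue ranges on the query protein. The sketch below, with the five intervals copied from the hits listed above, shows one way to compute the length of each hit and the number of query residues covered by at least one hit. The names `HITS` and `covered_residues` are hypothetical, and the merging step is a generic interval union rather than anything CD-Search itself reports.

```python
# (name, start, end) copied from the Interval column of the five hits above.
HITS = [
    ("SH3_MIA3", 37, 109),
    ("Mplasa_alph_rch", 1209, 1588),
    ("PHA03247", 1642, 1929),
    ("PTZ00341", 198, 462),
    ("PTZ00449", 735, 912),
]


def covered_residues(intervals):
    """Count residues covered by a set of 1-based inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous range: extend it instead of double counting.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return sum(end - start + 1 for start, end in merged)


for name, start, end in HITS:
    print(f"{name}: {end - start + 1} residues")
print("covered:", covered_residues([(s, e) for _, s, e in HITS]))
```

Because the union is taken before counting, the same function also works for lists in which several hits overlap the same region.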
 
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1215-1533 4.59e-13

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 75.01  E-value: 4.59e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1215 EKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKK---------------QNMILSDEAIKFKDKIKSLEETNEILGDT 1279
Cdd:pfam02463  173 EALKKLIEETENLAELIIDLEELKLQELKLKEQAKKaleyyqlkekleleeEYLLYLDYLKLNEERIDLLQELLRDEQEE 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1280 AKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQ 1359
Cdd:pfam02463  253 IESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKK 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1360 LQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKddninaltncITQLNRLDCESESEDQNKGGSESD-----ELANG 1434
Cdd:pfam02463  333 EKEEIEELEKELKELEIKREAEEEEEEELEKLQEKL----------EQLEEELLAKKKLESERLSSAAKLkeeelELKSE 402
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1435 EVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLR 1514
Cdd:pfam02463  403 EEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKL 482
                          330
                   ....*....|....*....
gi 1387212236 1515 QKVEILNELYQQKEMALQK 1533
Cdd:pfam02463  483 QEQLELLLSRQKLEERSQK 501
PTZ00121 PTZ00121
MAEBL; Provisional
1200-1594 1.25e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.09  E-value: 1.25e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1200 AVKSRVYQVTEQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKqnmilSDEAIKFKDKIKSLEETNEILGDT 1279
Cdd:PTZ00121  1382 AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK-----ADEAKKKAEEAKKADEAKKKAEEA 1456
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1280 AKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEE--NARLKKKK 1357
Cdd:PTZ00121  1457 KKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEakKAEEAKKA 1536
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1358 EQLQQEIKDWSKSHAELSEQIRSFEKSQKdLEVALTHKDDNINALTNciTQLNRLDCESESEDQNKGGSESDELANGEVG 1437
Cdd:PTZ00121  1537 DEAKKAEEKKKADELKKAEELKKAEEKKK-AEEAKKAEEDKNMALRK--AEEAKKAEEARIEEVMKLYEEEKKMKAEEAK 1613
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1438 GDRSEKVK-NQIKQMMDVSRTQTAISVVEEDLKLLQCKLR-ASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDEcktlRQ 1515
Cdd:PTZ00121  1614 KAEEAKIKaEELKKAEEEKKKVEQLKKKEAEEKKKAEELKkAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE----KK 1689
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1387212236 1516 KVEILNELYQQKEMALQkkLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTERSfKNQIATHEKK 1594
Cdd:PTZ00121  1690 AAEALKKEAEEAKKAEE--LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE-KKKIAHLKKE 1765
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1210-1636 4.43e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 58.24  E-value: 4.43e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1210 EQQISEkLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILsDEAIKFKDKIKSLEETNEILGDTAKSLRAMLES 1289
Cdd:COG4717     77 EEELKE-AEEKEEEYAELQEELEELEEELEELEAELEELREELEKL-EKLLQLLPLYQELEALEAELAELPERLEELEER 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1290 EREqnaknqdlISENKKSIEKLKDvisvnasEFSEVQIALNEA-KLSEEKVKSECHRVQEENARLKKKKEQLQQEIKDWS 1368
Cdd:COG4717    155 LEE--------LRELEEELEELEA-------ELAELQEELEELlEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQ 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1369 KSHAELSEQIRSFEKSQKDLE------------------VALTHKDDNINALTNCIT-------QLNRLDCESESEDQNK 1423
Cdd:COG4717    220 EELEELEEELEQLENELEAAAleerlkearlllliaaalLALLGLGGSLLSLILTIAgvlflvlGLLALLFLLLAREKAS 299
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1424 GGSESDELANGEVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAK 1503
Cdd:COG4717    300 LGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEA 379
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1504 TVLEDEckTLRQKVEILNElYQQKEMALQKKLSQEEYERQEREQRLSAADEKAVLAA-EEVKTYKRRIEEMEDELQKTER 1582
Cdd:COG4717    380 GVEDEE--ELRAALEQAEE-YQELKEELEELEEQLEELLGELEELLEALDEEELEEElEELEEELEELEEELEELREELA 456
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1387212236 1583 SFKNQIATHEKkahdnwlkaraaeraiaeeKREAANLRHKLLELTQKMAMMQEE 1636
Cdd:COG4717    457 ELEAELEQLEE-------------------DGELAELLQELEELKAELRELAEE 491
SH3_2 pfam07653
Variant SH3 domain; SH3 (Src homology 3) domains are often indicative of a protein involved in ...
49-105 1.65e-07

Variant SH3 domain; SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. They were first described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.


Pssm-ID: 429575 [Multi-domain]  Cd Length: 54  Bit Score: 49.52  E-value: 1.65e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1387212236   49 RGEALEDFTGPDCRFVNFKKGDTVYVYYKLAGGSpevWAGSVGHTFGYFPKDLIQVV 105
Cdd:pfam07653    1 YGRVIFDYVGTDKNGLTLKKGDVVKVLGKDNDGW---WEGETGGRVGLVPSTAVEEI 54
ATP-synt_Fo_b cd06503
F-type ATP synthase, membrane subunit b; Membrane subunit b is a component of the Fo complex ...
1187-1316 1.64e-04

F-type ATP synthase, membrane subunit b; Membrane subunit b is a component of the Fo complex of FoF1-ATP synthase. The F-type ATP synthases (FoF1-ATPase) consist of two structural domains: the F1 (assembly factor one) complex containing the soluble catalytic core, and the Fo (oligomycin sensitive factor) complex containing the membrane proton channel, linked together by a central stalk and a peripheral stalk. F1 is composed of alpha (or A), beta (B), gamma (C), delta (D) and epsilon (E) subunits with a stoichiometry of 3:3:1:1:1, while Fo consists of the three subunits a, b, and c (1:2:10-14). An oligomeric ring of 10-14 c subunits (c-ring) makes up the Fo rotor. The flux of protons through the ATPase channel (Fo) drives the rotation of the c-ring, which in turn is coupled to the rotation of the F1 complex gamma subunit rotor due to the permanent binding between the gamma and epsilon subunits of F1 and the c-ring of Fo. The F-ATP synthases are primarily found in the inner membranes of eukaryotic mitochondria, in the thylakoid membranes of chloroplasts or in the plasma membranes of bacteria. The F-ATP synthases are the primary producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). Alternatively, under conditions of low driving force, ATP synthases function as ATPases, thus generating a transmembrane proton or Na(+) gradient at the expense of energy derived from ATP hydrolysis. This group also includes an F-ATP synthase found in the archaeon Candidatus Methanoperedens.


Pssm-ID: 349951 [Multi-domain]  Cd Length: 132  Bit Score: 43.20  E-value: 1.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1187 IVSFAV-------FFWRTVLAV-KSRvyqvtEQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQEtkkqnmILsDE 1258
Cdd:cd06503      6 IINFLIllfilkkFLWKPILKAlDER-----EEKIAESLEEAEKAKEEAEELLAEYEEKLAEARAEAQE------II-EE 73
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1387212236 1259 AIKFKDKIKsleetNEILGDtakslrAMLESEREQNAKNQDLISENKKSIEKLKDVIS 1316
Cdd:cd06503     74 ARKEAEKIK-----EEILAE------AKEEAERILEQAKAEIEQEKEKALAELRKEVA 120
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
1212-1383 4.23e-04

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 44.62  E-value: 4.23e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  1212 QISEKLKNIMKENAELVqklssyeqkiKESKKHVqeTKKQNmILSDEAIKFKDKIKSLEETneilgdtaksLRAMLESER 1291
Cdd:smart00787  140 KLLEGLKEGLDENLEGL----------KEDYKLL--MKELE-LLNSIKPKLRDRKDALEEE----------LRQLKQLED 196
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  1292 EQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQlqqeIKDWSKSH 1371
Cdd:smart00787  197 ELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQ----CRGFTFKE 272
                           170
                    ....*....|...
gi 1387212236  1372 AE-LSEQIRSFEK 1383
Cdd:smart00787  273 IEkLKEQLKLLQS 285
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1642-1929 6.83e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 6.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1642 PMPGRPNTQNPPRRGPLSQNGSFGPSPVSGGECSP---PLTADPPARPLSATLNRREMPRSEFGSVDGPLPRPRWASEAS 1718
Cdd:pfam03154  243 PSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPmphSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQ 322
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1719 GK----PSASDPESGAAPTVNSSSRSSSPSKVMDEGKQT-VPQEPeGPSVPSIPSLAEHPVSVSMAAKGPPPFPGTPLMS 1793
Cdd:pfam03154  323 QRihtpPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTpIPQLP-NPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSS 401
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1794 SPVGGPllpPIRYGPPPQLcgpfgPRPLPPPFGPGMRPPlGLREyAPGVPPGKRDLPLDPREFLPPGHAPFrplgslgPR 1873
Cdd:pfam03154  402 LSTHHP---PSAHPPPLQL-----MPQSQQLPPPPAQPP-VLTQ-SQSLPPPAASHPPTSGLHQVPSQSPF-------PQ 464
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1387212236 1874 EYFFPGTrlpPPNHGPQDYPPSSAardlPPSGSRDEPPPASQGASQDCSPALKQSP 1929
Cdd:pfam03154  465 HPFVPGG---PPPITPPSGPPTST----SSAMPGIQPPSSASVSSSGPVPAAVSCP 513
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
735-912 9.80e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 9.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  735 KQSKERSPEIQDKRLDVDLQNPEKPVSGAIKTDPETEKNKEETRHVSENERKNETAGKavdSLGRDAGGPVVEKEGSSPV 814
Cdd:PTZ00449   490 KKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGG---KPGETKEGEVGKKPGPAKE 566
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  815 HQKVQRPSEGSDVPGKKQNQTPELGEASQK-KDPDYLKEDNHEGHPKTSGLMEKPGVEPSKEDDEHAEKFVDPgSRGSAS 893
Cdd:PTZ00449   567 HKPSKIPTLSKKPEFPKDPKHPKDPEEPKKpKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPP-QRPSSP 645
                          170
                   ....*....|....*....
gi 1387212236  894 EDPDDDPFPWAPHAPVQPE 912
Cdd:PTZ00449   646 ERPEGPKIIKSPKPPKSPK 664
SH3_MIA3 cd11893
Src Homology 3 domain of Melanoma Inhibitory Activity 3 protein; MIA3, also called TANGO or ...
37-109 9.40e-45

Src Homology 3 domain of Melanoma Inhibitory Activity 3 protein; MIA3, also called TANGO or TANGO1, acts as a tumor suppressor of malignant melanoma. It is downregulated or lost in melanoma cell lines. Unlike other MIA family members, MIA3 is widely expressed except in hematopoietic cells. MIA3 is an ER resident transmembrane protein that is required for the loading of collagen VII into transport vesicles. SNPs in the MIA3 gene have been associated with coronary arterial disease and myocardial infarction. MIA3 contains an N-terminal SH3-like domain, similar to MIA. It is a member of the recently identified family that also includes MIA, MIAL, and MIA2. MIA is a single domain protein that adopts a SH3 domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. Unlike classical SH3 domains, MIA does not bind proline-rich ligands.


Pssm-ID: 212826  Cd Length: 73  Bit Score: 156.16  E-value: 9.40e-45
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1387212236   37 LCADEECSMLMYRGEALEDFTGPDCRFVNFKKGDTVYVYYKLAGGSPEVWAGSVGHTFGYFPKDLIQVVHEYT 109
Cdd:cd11893      1 RCADEECSMLLCRGKAVKDFTGPDCRFLSFKKGETIYVYYKLSGRRTDLWAGSVGFDFGYFPKDLLDVNHLYT 73
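
Each hit in this listing carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally tagged "[Multi-domain]". The following is a minimal sketch of how such a line could be read into a structured record. The names `HitSummary` and `parse_hit_summary` are hypothetical helpers introduced for illustration only; they are not part of any NCBI tool, and the pattern assumes the field order seen in this listing.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order used in this listing; other layouts will not match.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*([\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if it does not match."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match.group(1)),
        multi_domain=match.group(2) is not None,
        cd_length=int(match.group(3)),
        bit_score=float(match.group(4)),
        e_value=float(match.group(5)),
    )


hit = parse_hit_summary(
    "Pssm-ID: 212826  Cd Length: 73  Bit Score: 156.16  E-value: 9.40e-45"
)
print(hit.pssm_id, hit.cd_length, hit.bit_score, hit.e_value)
```

Sorting the parsed records by `e_value` would reproduce the ordering used within each group of hits above, from the most to the least significant match.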
SH3_MIA_like cd11760
Src Homology 3 domain of Melanoma Inhibitory Activity protein and similar proteins; MIA is a ...
37-109 5.11e-37

Src Homology 3 domain of Melanoma Inhibitory Activity protein and similar proteins; MIA is a single domain protein that adopts a SH3 domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. MIA is secreted from malignant melanoma cells and it plays an important role in melanoma development and invasion. MIA is expressed by chondrocytes in normal tissues and may be important in the cartilage cell phenotype. Unlike classical SH3 domains, MIA does not bind proline-rich ligands. MIA is a member of the recently identified family that also includes MIA-like (MIAL), MIA2, and MIA3 (also called TANGO); the biological functions of this family are not yet fully understood.


Pssm-ID: 212694  Cd Length: 76  Bit Score: 134.15  E-value: 5.11e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1387212236   37 LCADEECSMLMYRGEALEDFTGPDCRFVNFKKGDTVYVYYKLAGGSPEVWAGSVGH---TFGYFPKDLIQVVHEYT 109
Cdd:cd11760      1 LCADAECSNPISRARALEDYHGPDCRFLNFKKGDTIYVYSKLAGERQDLWAGSVGGdagLFGYFPKNLVQELKVYE 76
SH3_MIA2 cd11892
Src Homology 3 domain of Melanoma Inhibitory Activity 2 protein; MIA2 is expressed ...
38-109 1.06e-26

Src Homology 3 domain of Melanoma Inhibitory Activity 2 protein; MIA2 is expressed specifically in hepatocytes and its expression is controlled by hepatocyte nuclear factor 1 binding sites in the MIA2 promoter. It inhibits the growth and invasion of hepatocellular carcinomas (HCC) and may act as a tumor suppressor. A mutation in MIA2 in mice resulted in reduced cholesterol and triglycerides. Since MIA2 localizes to ER exit sites, it may function as an ER-to-Golgi trafficking protein that regulates lipid metabolism. MIA2 contains an N-terminal SH3-like domain, similar to MIA. It is a member of the recently identified family that also includes MIA, MIAL, and MIA3 (also called TANGO). MIA is a single domain protein that adopts a SH3 domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. Unlike classical SH3 domains, MIA does not bind proline-rich ligands.


Pssm-ID: 212825  Cd Length: 73  Bit Score: 104.92  E-value: 1.06e-26
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1387212236   38 CADEECSMLMYRGEALEDFTGPDCRFVNFKKGDTVYVYYKLAGGSPEVWAGSVGHTFGYFPKDLIQVVHEYT 109
Cdd:cd11892      2 CGDPECERLMSRVQAIRDYRGPDCRYLSFKKGDEIIVYYKLSGKREDLWAGSTGKEFGYFPKDAVKVEEVYI 73
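
The paired "gi" and "Cdd" rows in each block are a pairwise alignment of the query against the domain model's representative sequence, with "-" marking gaps. As a rough illustration of how the similarity of two such rows could be quantified, the sketch below computes percent identity over gap-free columns. The function name `alignment_identity` is hypothetical, and treating lowercase residues the same as uppercase ones is a simplifying assumption of this sketch, not a statement about how CD-Search scores them.

```python
def alignment_identity(query: str, subject: str) -> float:
    """Percent identity over columns where neither row has a gap.

    Both strings must be equal-length aligned rows (residues only,
    without the leading label or the start/end coordinates).
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # skip gap columns
        aligned += 1
        # Assumption: compare case-insensitively.
        if q.upper() == s.upper():
            identical += 1
    if aligned == 0:
        return 0.0
    return 100.0 * identical / aligned


# First six columns of the cd11892 alignment shown above.
print(alignment_identity("CADEEC", "CGDPEC"))
```

Percent identity is only a coarse summary; the bit score and E-value reported for each hit come from position-specific scoring and are the values to rely on when ranking hits.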
MIA cd11890
Melanoma Inhibitory Activity protein; MIA is a single domain protein that adopts a Src ...
36-124 1.92e-20

Melanoma Inhibitory Activity protein; MIA is a single domain protein that adopts a Src Homology 3 (SH3) domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. MIA is secreted from malignant melanoma cells and it plays an important role in melanoma development and invasion. MIA is expressed by chondrocytes in normal tissues and may be important in the cartilage cell phenotype. Unlike classical SH3 domains, MIA does not bind proline-rich ligands. It binds peptide ligands with sequence similarity to type III human fibronectin repeats.


Pssm-ID: 212823  Cd Length: 98  Bit Score: 88.01  E-value: 1.92e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236   36 KLCADEECSMLMYRGEALEDFTGPDCRFVNFKKGDTVYVYYKLAGGSPEVWAGSVGHTF--------GYFPKDLIQVVHE 107
Cdd:cd11890      2 KLCADQECSHPISIAVALQDYMAPDCRFIPIQRGQVVYVFSKLKGRGRLFWGGSVQGDYygeqaarlGYFPSSIVQEDQY 81
                           90
                   ....*....|....*..
gi 1387212236  108 YTQEELQVPTDETDFVC 124
Cdd:cd11890     82 LKPGKVEVKTDKWDFYC 98
MIAL cd11891
Melanoma Inhibitory Activity-Like protein; MIAL is specifically expressed in the cochlea and ...
37-108 8.84e-19

Melanoma Inhibitory Activity-Like protein; MIAL is specifically expressed in the cochlea and the vestibule of the inner ear and may contribute to inner ear dysfunction in humans. MIAL is a member of the recently identified family that also includes MIA, MIA2, and MIA3 (also called TANGO); MIA is the most studied member of the family. MIA is a single domain protein that adopts a Src Homology 3 (SH3) domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. MIA is secreted from malignant melanoma cells and it plays an important role in melanoma development and invasion. MIA is expressed by chondrocytes in normal tissues and may be important in the cartilage cell phenotype. Unlike classical SH3 domains, MIA does not bind proline-rich ligands.


Pssm-ID: 212824  Cd Length: 83  Bit Score: 82.60  E-value: 8.84e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236   37 LCADEECSMLMYRGEALEDFTGPDCRFVNFKKGDTVYVYYKLA--GGSPEVWAGSVGH--------TFGYFPKDLIQVVH 106
Cdd:cd11891      1 LCADEECVYAISLARAEDDYNAPDCRFINIKKGQLIYVYSKLVkeNGAGEFWSGSVYSeryvdqmgIVGYFPSNLVKEQT 80

                   ..
gi 1387212236  107 EY 108
Cdd:cd11891     81 VY 82
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1209-1588 5.77e-15

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 80.83  E-value: 5.77e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1209 TEQQISEKLKNIMKENAELVQK------LSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIkslEETNEILGDTAKS 1282
Cdd:TIGR04523  178 LEKEKLNIQKNIDKIKNKLLKLelllsnLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEI---NEKTTEISNTQTQ 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1283 LRAMLESerEQNAKNQdlISENKKSIEKLKDVISVNASEFSEVQIALNEakLSEEKVKSECHRVQEENARLKKKKEQLQQ 1362
Cdd:TIGR04523  255 LNQLKDE--QNKIKKQ--LSEKQKELEQNNKKIKELEKQLNQLKSEISD--LNNQKEQDWNKELKSELKNQEKKLEEIQN 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1363 EIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRldcESESEDQNKGGSESdelangevggdRSE 1442
Cdd:TIGR04523  329 QISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKK---ENQSYKQEIKNLES-----------QIN 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1443 KVKNQIKQMMDVSRT-QTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILN 1521
Cdd:TIGR04523  395 DLESKIQNQEKLNQQkDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLS 474
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1387212236 1522 ELYQQKEMALQKKlsqeEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTErSFKNQI 1588
Cdd:TIGR04523  475 RSINKIKQNLEQK----QKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLE-SEKKEK 536
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1209-1593 9.63e-15

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 80.06  E-value: 9.63e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1209 TEQQISEKLKNIMKENAELVQKLSSYEQKIK----ESKKHVQETKKQNMILSDEAIKFKDKIKSLEetNEILGDTAKSLR 1284
Cdd:TIGR04523  236 KKQQEINEKTTEISNTQTQLNQLKDEQNKIKkqlsEKQKELEQNNKKIKELEKQLNQLKSEISDLN--NQKEQDWNKELK 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1285 AMLESEREQ--NAKNQdlISENKKSIEKLKDVISV-------NASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKK 1355
Cdd:TIGR04523  314 SELKNQEKKleEIQNQ--ISQNNKIISQLNEQISQlkkeltnSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLES 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1356 KKEQLQQEIKDWSKSHAELSEQIRSF-------EKSQKDLEVALTHKDDNINALTNCITQLNRL--DCESESEDQNKggs 1426
Cdd:TIGR04523  392 QINDLESKIQNQEKLNQQKDEQIKKLqqekellEKEIERLKETIIKNNSEIKDLTNQDSVKELIikNLDNTRESLET--- 468
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1427 esdelaNGEVGGDRSEKVKNQIKQmmdvsrTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVL 1506
Cdd:TIGR04523  469 ------QLKVLSRSINKIKQNLEQ------KQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEK 536
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1507 EDECKTLRQKVEILN-----ELYQQKEMALQKKLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTE 1581
Cdd:TIGR04523  537 ESKISDLEDELNKDDfelkkENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLE 616
                          410
                   ....*....|..
gi 1387212236 1582 RSFKNQIATHEK 1593
Cdd:TIGR04523  617 KELEKAKKENEK 628
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1215-1533 4.59e-13

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 75.01  E-value: 4.59e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1215 EKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKK---------------QNMILSDEAIKFKDKIKSLEETNEILGDT 1279
Cdd:pfam02463  173 EALKKLIEETENLAELIIDLEELKLQELKLKEQAKKaleyyqlkekleleeEYLLYLDYLKLNEERIDLLQELLRDEQEE 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1280 AKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQ 1359
Cdd:pfam02463  253 IESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKK 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1360 LQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKddninaltncITQLNRLDCESESEDQNKGGSESD-----ELANG 1434
Cdd:pfam02463  333 EKEEIEELEKELKELEIKREAEEEEEEELEKLQEKL----------EQLEEELLAKKKLESERLSSAAKLkeeelELKSE 402
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1435 EVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLR 1514
Cdd:pfam02463  403 EEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKL 482
                          330
                   ....*....|....*....
gi 1387212236 1515 QKVEILNELYQQKEMALQK 1533
Cdd:pfam02463  483 QEQLELLLSRQKLEERSQK 501
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1198-1518 7.45e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 74.32  E-value: 7.45e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1198 VLAVKSRVYQVTEQ--QISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEI 1275
Cdd:TIGR02168  672 ILERRREIEELEEKieELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQ 751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1276 LGDTAKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKK 1355
Cdd:TIGR02168  752 LSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLER 831
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1356 KKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQNKggseSDELANGE 1435
Cdd:TIGR02168  832 RIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEEL----SEELRELE 907
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1436 vgGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKLRASMSTkcNLEDQIKKLEEDRSSLQSAktvlEDECKTLRQ 1515
Cdd:TIGR02168  908 --SKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSL--TLEEAEALENKIEDDEEEA----RRRLKRLEN 979

                   ...
gi 1387212236 1516 KVE 1518
Cdd:TIGR02168  980 KIK 982
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1203-1597 1.24e-12

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 73.22  E-value: 1.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1203 SRVYQVTEQQISEKLKNIMKENAELVQKlssyEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEILGDTAKS 1282
Cdd:pfam05483   77 SRLYSKLYKEAEKIKKWKVSIEAELKQK----ENKLQENRKIIEAQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1283 LRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECH-RVQEENARLKKKKEQLQ 1361
Cdd:pfam05483  153 TRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLNNNIEKMILAFEELRVQAENARLEMHfKLKEDHEKIQHLEEEYK 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1362 QEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTncitQLNRLDCESESEDQNKGGSESDELangevggdrs 1441
Cdd:pfam05483  233 KEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLE----EKTKLQDENLKELIEKKDHLTKEL---------- 298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1442 EKVKnqikqmMDVSRTQTAISVVEEDLKLlqcklraSMSTKCNL----EDQIKKLEEDRSS-------LQSAKTVLEDEC 1510
Cdd:pfam05483  299 EDIK------MSLQRSMSTQKALEEDLQI-------ATKTICQLteekEAQMEELNKAKAAhsfvvteFEATTCSLEELL 365
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1511 KTLRQKVEILNELYQQKEMALQKKLSQEEYERQEREQRLSAADE-KAVLAAEEVKTY-KRRIEEMEDELQKTERSFKNQI 1588
Cdd:pfam05483  366 RTEQQRLEKNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEElKKILAEDEKLLDeKKQFEKIAEELKGKEQELIFLL 445

                   ....*....
gi 1387212236 1589 ATHEKKAHD 1597
Cdd:pfam05483  446 QAREKEIHD 454
PHA03247 PHA03247
large tegument protein UL36; Provisional
1642-1929 3.66e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 72.28  E-value: 3.66e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1642 PMPGRPNTQNPPRRGPLSQNGSFGPSPVSGGECSPPLTADPPARPLSATLNRREMPRSEFGSVDGPL--PRPRWASEASG 1719
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrPRRRAARPTVG 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1720 KPSAS----DPESGAAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIPSLAEHPVSVsmAAKGPPPFPGTPLMSSP 1795
Cdd:PHA03247  2694 SLTSLadppPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP--ARPARPPTTAGPPAPAP 2771
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1796 VGGPLLPPIRYGPPPQLCGPFGPRPLPPPFGPGMRPPLGLREYAPGVPPGKRDLPLDPREFLPPGHAPFRPLGSLGPRE- 1874
Cdd:PHA03247  2772 PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLp 2851
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1387212236 1875 ---YFFPG---TRLPPpnhgPQDYPPSSAARDLPPSGSRDEPPPASQGASQDCSPALKQSP 1929
Cdd:PHA03247  2852 lggSVAPGgdvRRRPP----SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1209-1522 2.46e-11

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 68.89  E-value: 2.46e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1209 TEQQISE---KLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEILGDTAKSLRA 1285
Cdd:TIGR04523  333 NNKIISQlneQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDE 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1286 MLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQLQQEIK 1365
Cdd:TIGR04523  413 QIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELK 492
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1366 DWSKSHAELSEQIRSFEKSQKDLEvalthkdDNINALTNCITQLNRLDCESESEDQNKggseSDELANGEVGGDRS--EK 1443
Cdd:TIGR04523  493 SKEKELKKLNEEKKELEEKVKDLT-------KKISSLKEKIEKLESEKKEKESKISDL----EDELNKDDFELKKEnlEK 561
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1444 VKNQIKQMMD-VSRTQTAISVVEEDLKLLQCKLRAS-----------MSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECK 1511
Cdd:TIGR04523  562 EIDEKNKEIEeLKQTQKSLKKKQEEKQELIDQKEKEkkdlikeieekEKKISSLEKELEKAKKENEKLSSIIKNIKSKKN 641
                          330
                   ....*....|.
gi 1387212236 1512 TLRQKVEILNE 1522
Cdd:TIGR04523  642 KLKQEVKQIKE 652
PHA03247 PHA03247
large tegument protein UL36; Provisional
1634-1914 9.61e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.66  E-value: 9.61e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1634 QEEPVIVKPMPGRPNTQNPPrrGPLSQNGSF-------GPSPVSGGECSPPLTADPPARPLSATLNRREMPRSEFGSVDG 1706
Cdd:PHA03247  2704 PPPTPEPAPHALVSATPLPP--GPAAARQASpalpaapAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1707 PLPRPRWASEASGK---PSASDPE------SGAAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIP---SLAEH-P 1773
Cdd:PHA03247  2782 RLTRPAVASLSESReslPSPWDPAdppaavLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlggSVAPGgD 2861
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1774 VSVSMAAKGPPPFPGTPLMsspvggpllPPIRYGPPPQLCGPFGPRPLPPPFGPGMRPP-------------LGLREYAP 1840
Cdd:PHA03247  2862 VRRRPPSRSPAAKPAAPAR---------PPVRRLARPAVSRSTESFALPPDQPERPPQPqappppqpqpqppPPPQPQPP 2932
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1841 GVPPGKRDLPLDPR-----EFLPPGHAPFRPLGSLGPREYFFPGTRLPPPNHG-PQDYPPSSAARDLPPSG--------- 1905
Cdd:PHA03247  2933 PPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSrEAPASSTPPLTGHSLSRvsswassla 3012
                          330
                   ....*....|.
gi 1387212236 1906 --SRDEPPPAS 1914
Cdd:PHA03247  3013 lhEETDPPPVS 3023
PHA03247 PHA03247
large tegument protein UL36; Provisional
1642-1929 1.01e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.66  E-value: 1.01e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1642 PMPGRPNTQNPPRRGPlSQNGSFGPSPVSGGECSPPLTADPPArPLSATLNRREMPRSEFGSVDGPL--PRPRWASEASG 1719
Cdd:PHA03247  2616 PLPPDTHAPDPPPPSP-SPAANEPDPHPPPTVPPPERPRDDPA-PGRVSRPRRARRLGRAAQASSPPqrPRRRAARPTVG 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1720 KPSAS----DPESGAAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIPSLAEHPVSVsmAAKGPPPFPGTPLMSSP 1795
Cdd:PHA03247  2694 SLTSLadppPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGP--ARPARPPTTAGPPAPAP 2771
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1796 VGGPLLPPIRYGPPPQLCGPFGPRPLPPPFGPGMRPPLGLREYAPGVPPGKRDLPLDP---------------------- 1853
Cdd:PHA03247  2772 PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPpptsaqptapppppgppppslp 2851
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1854 -----------REFLPPGHAPFRPLGSLGPREYFFPGTRLPPPNHgPQDYPPSSAARDLPPSG-----SRDEPPPASQGA 1917
Cdd:PHA03247  2852 lggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE-SFALPPDQPERPPQPQAppppqPQPQPPPPPQPQ 2930
                          330
                   ....*....|..
gi 1387212236 1918 SQDCSPALKQSP 1929
Cdd:PHA03247  2931 PPPPPPPRPQPP 2942
PTZ00121 PTZ00121
MAEBL; Provisional
1200-1594 1.25e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.09  E-value: 1.25e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1200 AVKSRVYQVTEQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKqnmilSDEAIKFKDKIKSLEETNEILGDT 1279
Cdd:PTZ00121  1382 AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK-----ADEAKKKAEEAKKADEAKKKAEEA 1456
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1280 AKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEE--NARLKKKK 1357
Cdd:PTZ00121  1457 KKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEakKAEEAKKA 1536
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1358 EQLQQEIKDWSKSHAELSEQIRSFEKSQKdLEVALTHKDDNINALTNciTQLNRLDCESESEDQNKGGSESDELANGEVG 1437
Cdd:PTZ00121  1537 DEAKKAEEKKKADELKKAEELKKAEEKKK-AEEAKKAEEDKNMALRK--AEEAKKAEEARIEEVMKLYEEEKKMKAEEAK 1613
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1438 GDRSEKVK-NQIKQMMDVSRTQTAISVVEEDLKLLQCKLR-ASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDEcktlRQ 1515
Cdd:PTZ00121  1614 KAEEAKIKaEELKKAEEEKKKVEQLKKKEAEEKKKAEELKkAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDE----KK 1689
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1387212236 1516 KVEILNELYQQKEMALQkkLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTERSfKNQIATHEKK 1594
Cdd:PTZ00121  1690 AAEALKKEAEEAKKAEE--LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE-KKKIAHLKKE 1765
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1210-1533 1.41e-09

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha-helical structures) and low complexity, rich in D, E, N, Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 63.12  E-value: 1.41e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1210 EQQISEKLKNIMKE--NAELVQK-----LSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETneilgdtAKS 1282
Cdd:TIGR04523   35 EKQLEKKLKTIKNElkNKEKELKnldknLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSD-------LSK 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1283 LRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQLQQ 1362
Cdd:TIGR04523  108 INSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQK 187
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1363 EIKD-----------------WSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCI----TQLNRL--------- 1412
Cdd:TIGR04523  188 NIDKiknkllklelllsnlkkKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEIsntqTQLNQLkdeqnkikk 267
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1413 ---DCESESEDQNKGGSE-SDELANGEVggdRSEKVKNQIKQMMDvSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQ 1488
Cdd:TIGR04523  268 qlsEKQKELEQNNKKIKElEKQLNQLKS---EISDLNNQKEQDWN-KELKSELKNQEKKLEEIQNQISQNNKIISQLNEQ 343
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1387212236 1489 IKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEMALQK 1533
Cdd:TIGR04523  344 ISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKN 388
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1238-1581 1.53e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 63.54  E-value: 1.53e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1238 IKESKKHVQETKKQnmilSDEAIKFKDKIKSLEETN-EILGDTAKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVIS 1316
Cdd:TIGR02168  195 LNELERQLKSLERQ----AEKAERYKELKAELRELElALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLE 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1317 VNASEFSEVQIALNEAKlseekvkSECHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKD 1396
Cdd:TIGR02168  271 ELRLEVSELEEEIEELQ-------KELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELE 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1397 DNINALTNCItqlnrldcESESEDQNKGGSESDELANgevggdRSEKVKNQIKQMmdvsrtQTAISVVEEDLKLLQCKLR 1476
Cdd:TIGR02168  344 EKLEELKEEL--------ESLEAELEELEAELEELES------RLEELEEQLETL------RSKVAQLELQIASLNNEIE 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1477 ASMSTKCNLEDQIKKLEEDRSSLQSAKTvlEDECKTLRQKVEILNELYQQkemaLQKKLSQEEYERQEREQRLSAADEKA 1556
Cdd:TIGR02168  404 RLEARLERLEDRRERLQQEIEELLKKLE--EAELKELQAELEELEEELEE----LQEELERLEEALEELREELEEAEQAL 477
                          330       340
                   ....*....|....*....|....*
gi 1387212236 1557 VLAAEEVKTYKRRIEEMEDELQKTE 1581
Cdd:TIGR02168  478 DAAERELAQLQARLDSLERLQENLE 502
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1210-1581 2.15e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 62.78  E-value: 2.15e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1210 EQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKqnmilsdeaikfkdKIKSLEETNEILGDTAKSLRAMLES 1289
Cdd:TIGR02169  676 LQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR--------------KIGEIEKEIEQLEQEEEKLKERLEE 741
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1290 EREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALN--EAKLSEEKVksechrvqeenarlkkkkEQLQQEIKDW 1367
Cdd:TIGR02169  742 LEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNdlEARLSHSRI------------------PEIQAELSKL 803
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1368 SKSHAELSEQIRSFEKSqkdlevalthkddninaltncitqLNRLDCESESEDqnkggsesDELANGEVGGDRSEKVKNQ 1447
Cdd:TIGR02169  804 EEEVSRIEARLREIEQK------------------------LNRLTLEKEYLE--------KEIQELQEQRIDLKEQIKS 851
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1448 IKQMMDVSRTQtaISVVEEDLKLLQCKLRasmstkcNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQK 1527
Cdd:TIGR02169  852 IEKEIENLNGK--KEELEEELEELEAALR-------DLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSEL 922
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1387212236 1528 EMALQKKLSQEEYERQEREQRLSAADEKAVLaaeevKTYKRRIEEMEDELQKTE 1581
Cdd:TIGR02169  923 KAKLEALEEELSEIEDPKGEDEEIPEEELSL-----EDVQAELQRVEEEIRALE 971
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1213-1636 7.99e-09

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 60.85  E-value: 7.99e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1213 ISEKLKNIMKENAELVQKLSSYEQKIKESKKhVQETKKQNMILSDEAIKFKDKIKSLEETNEILGDTAKSLRAMLEsERE 1292
Cdd:PRK03918   257 LEEKIRELEERIEELKKEIEELEEKVKELKE-LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIK-ELE 334
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1293 QNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEA-KLSEEKVKSECHRVQEENARLKKKKEQLQQEIKdwsksh 1371
Cdd:PRK03918   335 EKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELeRLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS------ 408
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1372 aELSEQIRSFEKSQKDLEvalthkdDNINALTN----CITQLNRLDCESESEDQNKGGSESDELANG-EVGGDRSEKVKN 1446
Cdd:PRK03918   409 -KITARIGELKKEIKELK-------KAIEELKKakgkCPVCGRELTEEHRKELLEEYTAELKRIEKElKEIEEKERKLRK 480
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1447 QIKQMMDVSRTQTAISVVEEDLKLLQcKLRASMStKCNLEDqIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQ 1526
Cdd:PRK03918   481 ELRELEKVLKKESELIKLKELAEQLK-ELEEKLK-KYNLEE-LEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKK 557
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1527 KEmALQKKLSqeeyerqereqrlSAADEKAVLAAEEVKTYKRRIEEMEDELQKTErsfknqiathekKAHDNWLKARAAE 1606
Cdd:PRK03918   558 LA-ELEKKLD-------------ELEEELAELLKELEELGFESVEELEERLKELE------------PFYNEYLELKDAE 611
                          410       420       430
                   ....*....|....*....|....*....|
gi 1387212236 1607 RAIAEEKREAANLRhklLELTQKMAMMQEE 1636
Cdd:PRK03918   612 KELEREEKELKKLE---EELDKAFEELAET 638
PHA03247 PHA03247
large tegument protein UL36; Provisional
1642-1924 1.07e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.72  E-value: 1.07e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1642 PMPGRPNTQNPPrrgPLSQNGSFGPSPVSGGECSPPLTADPPARPLSATlnrremprsefGSVDGPLPRPRWASEASGKP 1721
Cdd:PHA03247  2702 PPPPPTPEPAPH---ALVSATPLPPGPAAARQASPALPAAPAPPAVPAG-----------PATPGGPARPARPPTTAGPP 2767
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1722 SASDPESGAAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIPSLAEHPVSVSMAAKGPPPFPGTPLMSSPVGGPLL 1801
Cdd:PHA03247  2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1802 PPIRYGPPPQLCGPFGPRPLPPPFGPG----MRPPLGlREYAPGVPPGKRDLPL--DPREFLPPGHAPFRPLGslgprey 1875
Cdd:PHA03247  2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKpaapARPPVR-RLARPAVSRSTESFALppDQPERPPQPQAPPPPQP------- 2919
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1387212236 1876 ffPGTRLPPPNHGPQDYPPSSAARDLPPsgsrdEPPPASQGASQDCSPA 1924
Cdd:PHA03247  2920 --QPQPPPPPQPQPPPPPPPRPQPPLAP-----TTDPAGAGEPSGAVPQ 2961
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1209-1649 2.07e-08

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 59.35  E-value: 2.07e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1209 TEQQISEKLKNIMKE-NAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAiKFKDKIKSLEETNEILGDTAKSLRAML 1287
Cdd:pfam05483  367 TEQQRLEKNEDQLKIiTMELQKKSSELEEMTKFKNNKEVELEELKKILAEDE-KLLDEKKQFEKIAEELKGKEQELIFLL 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1288 ESeREQNAKNQDLISENKKSIEKLKdvisvnASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQL------- 1360
Cdd:pfam05483  446 QA-REKEIHDLEIQLTAIKTSEEHY------LKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMtlelkkh 518
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1361 QQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNInaltncITQLNRLDCESESEDQNKGGSESDELANGEVGGDR 1440
Cdd:pfam05483  519 QEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEF------IQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKIL 592
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1441 SEKVKNQIKQMMDVSRTqtaISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDrssLQSAKTVLEDECKTLRQKVEIl 1520
Cdd:pfam05483  593 ENKCNNLKKQIENKNKN---IEELHQENKALKKKGSAENKQLNAYEIKVNKLELE---LASAKQKFEEIIDNYQKEIED- 665
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1521 nelyqqkemalqKKLSQEEYERQEREQRLSAaDEKAVLAAEEVKTYKRRIEEMEDELQKTERSFKNQIATHEKKAHDNWL 1600
Cdd:pfam05483  666 ------------KKISEEKLLEEVEKAKAIA-DEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEERDSELGLYKN 732
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 1387212236 1601 KARAAERAIAEEKREAANLRHKLLELTQKMAMMQEEPVIVKpMPGRPNT 1649
Cdd:pfam05483  733 KEQEQSSAKAALEIELSNIKAELLSLKKQLEIEKEEKEKLK-MEAKENT 780
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1210-1636 4.43e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 58.24  E-value: 4.43e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1210 EQQISEkLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILsDEAIKFKDKIKSLEETNEILGDTAKSLRAMLES 1289
Cdd:COG4717     77 EEELKE-AEEKEEEYAELQEELEELEEELEELEAELEELREELEKL-EKLLQLLPLYQELEALEAELAELPERLEELEER 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1290 EREqnaknqdlISENKKSIEKLKDvisvnasEFSEVQIALNEA-KLSEEKVKSECHRVQEENARLKKKKEQLQQEIKDWS 1368
Cdd:COG4717    155 LEE--------LRELEEELEELEA-------ELAELQEELEELlEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQ 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1369 KSHAELSEQIRSFEKSQKDLE------------------VALTHKDDNINALTNCIT-------QLNRLDCESESEDQNK 1423
Cdd:COG4717    220 EELEELEEELEQLENELEAAAleerlkearlllliaaalLALLGLGGSLLSLILTIAgvlflvlGLLALLFLLLAREKAS 299
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1424 GGSESDELANGEVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAK 1503
Cdd:COG4717    300 LGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEA 379
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1504 TVLEDEckTLRQKVEILNElYQQKEMALQKKLSQEEYERQEREQRLSAADEKAVLAA-EEVKTYKRRIEEMEDELQKTER 1582
Cdd:COG4717    380 GVEDEE--ELRAALEQAEE-YQELKEELEELEEQLEELLGELEELLEALDEEELEEElEELEEELEELEEELEELREELA 456
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1387212236 1583 SFKNQIATHEKkahdnwlkaraaeraiaeeKREAANLRHKLLELTQKMAMMQEE 1636
Cdd:COG4717    457 ELEAELEQLEE-------------------DGELAELLQELEELKAELRELAEE 491
PTZ00121 PTZ00121
MAEBL; Provisional
1214-1620 6.36e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 58.23  E-value: 6.36e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1214 SEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMIL--SDEAIKFKDKIKSLEETNEILGDTAKSLRAMLESER 1291
Cdd:PTZ00121  1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKkkADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1455
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1292 EQNAKNQDLISENKKSIEKLKDvisvNASEFSEVQIAlnEAKLSEEKVKSECHRVQEENarlKKKKEQLQQEIKDWSKSH 1371
Cdd:PTZ00121  1456 AKKAEEAKKKAEEAKKADEAKK----KAEEAKKADEA--KKKAEEAKKKADEAKKAAEA---KKKADEAKKAEEAKKADE 1526
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1372 AELSEQIRSFEKSQKDLEValtHKDDninaltncitQLNRLDCESESEDQNKGGSESDELANGEVGGDRSEKVKnQIKQm 1451
Cdd:PTZ00121  1527 AKKAEEAKKADEAKKAEEK---KKAD----------ELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK-KAEE- 1591
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1452 mdvSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEdQIKKLEEDRSSLQSAKTVLEDECKTLRQkVEILNELYQQKEMAL 1531
Cdd:PTZ00121  1592 ---ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAE-ELKKAEEEKKKVEQLKKKEAEEKKKAEE-LKKAEEENKIKAAEE 1666
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1532 QKKlsqeEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEdELQKTERSFKNQIATHEKKAHDNWLKARAAERAIAE 1611
Cdd:PTZ00121  1667 AKK----AEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE-ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741

                   ....*....
gi 1387212236 1612 EKREAANLR 1620
Cdd:PTZ00121  1742 DKKKAEEAK 1750
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1196-1582 6.64e-08

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 57.74  E-value: 6.64e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1196 RTVLAVKSRVYQVTEQQISEKlknimkENAELVQKLSSYEQKIKESK---KHVQETKKQ--------NMILS-------- 1256
Cdd:PRK02224   179 ERVLSDQRGSLDQLKAQIEEK------EEKDLHERLNGLESELAELDeeiERYEEQREQaretrdeaDEVLEeheerree 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1257 ----DEAI-KFKDKIKSLEETNEILGDTAKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNE 1331
Cdd:PRK02224   253 letlEAEIeDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEE 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1332 AKLSEEKVKSECHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALthkDDNINALTNCITQLNR 1411
Cdd:PRK02224   333 CRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEI---EELRERFGDAPVDLGN 409
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1412 LDCESESEDQNKGGSESDElANGEVGGDRSEKVKNQIKQMMDVSRTQTA---------ISVVEEDLKLLQcKLRASMSTk 1482
Cdd:PRK02224   410 AEDFLEELREERDELRERE-AELEATLRTARERVEEAEALLEAGKCPECgqpvegsphVETIEEDRERVE-ELEAELED- 486
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1483 cnLEDQIKKLEEDRSSLQSAKTvLEDECKTLRQKVEILNELYQQKEMALQKKLSQEEYERQEREQRLSAADEK------- 1555
Cdd:PRK02224   487 --LEEEVEEVEERLERAEDLVE-AEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKreaaaea 563
                          410       420       430
                   ....*....|....*....|....*....|
gi 1387212236 1556 ---AVLAAEEVKTYKRRIEEMEDELQKTER 1582
Cdd:PRK02224   564 eeeAEEAREEVAELNSKLAELKERIESLER 593
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1213-1522 9.38e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.39  E-value: 9.38e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1213 ISEKLKN-----IMKENAELVQKLSS----YEQKIKESKKHVQETKKQNMI-----LSDEAIKFKDKI----KSLEETNE 1274
Cdd:TIGR02169  193 IDEKRQQlerlrREREKAERYQALLKekreYEGYELLKEKEALERQKEAIErqlasLEEELEKLTEEIseleKRLEEIEQ 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1275 ILGDTAKSLRAMleSEREQNAKNQDlISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLK 1354
Cdd:TIGR02169  273 LLEELNKKIKDL--GEEEQLRVKEK-IGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER 349
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1355 KKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQNKggseSDELANG 1434
Cdd:TIGR02169  350 KRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRL----SEELADL 425
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1435 EVGGDRSEKVKNQIKQMMD-----VSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDE 1509
Cdd:TIGR02169  426 NAAIAGIEAKINELEEEKEdkaleIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEER 505
                          330
                   ....*....|...
gi 1387212236 1510 CKTLRQKVEILNE 1522
Cdd:TIGR02169  506 VRGGRAVEEVLKA 518
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1265-1578 1.22e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 56.87  E-value: 1.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1265 KIKSLEETNEILGDTAKSLRAMLESEREQNAKNQDLISENKKSIEKLKDvisvnasEFSEVQIALNEAKLSEEKVKSECH 1344
Cdd:COG1196    233 KLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELEL-------ELEEAQAEEYELLAELARLEQDIA 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1345 RVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDdninaltncitqlnrldcesESEDQNKG 1424
Cdd:COG1196    306 RLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAE--------------------AELAEAEE 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1425 GSESDELANGEVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKllqcklrasmstkcNLEDQIKKLEEDRSSLQSAKT 1504
Cdd:COG1196    366 ALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEE--------------ALLERLERLEEELEELEEALA 431
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1387212236 1505 VLEDECKTLRQKVEILNELYQQKEmALQKKLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQ 1578
Cdd:COG1196    432 ELEEEEEEEEEALEEAAEEEAELE-EEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYE 504
SH3_2 pfam07653
Variant SH3 domain; SH3 (Src homology 3) domains are often indicative of a protein involved in ...
49-105 1.65e-07

Variant SH3 domain; SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. They were first described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.


Pssm-ID: 429575 [Multi-domain]  Cd Length: 54  Bit Score: 49.52  E-value: 1.65e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1387212236   49 RGEALEDFTGPDCRFVNFKKGDTVYVYYKLAGGSpevWAGSVGHTFGYFPKDLIQVV 105
Cdd:pfam07653    1 YGRVIFDYVGTDKNGLTLKKGDVVKVLGKDNDGW---WEGETGGRVGLVPSTAVEEI 54
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1214-1534 2.08e-07

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 56.27  E-value: 2.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1214 SEKLKNI---------MKENAELVQKLSSYEQKIKESKKHVQETKKQNMilsdeaiKFKDKIKSLEETNEILGDTAKSLR 1284
Cdd:pfam05483  482 KEKLKNIeltahcdklLLENKELTQEASDMTLELKKHQEDIINCKKQEE-------RMLKQIENLEEKEMNLRDELESVR 554
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1285 AMLESEREQNAKNQDLISENKKSIEklkdvisvnasefSEVQIALNEAKLSEEKvkseCHRVQEENARLKKKKEQLQQEI 1364
Cdd:pfam05483  555 EEFIQKGDEVKCKLDKSEENARSIE-------------YEVLKKEKQMKILENK----CNNLKKQIENKNKNIEELHQEN 617
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1365 KDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCIT---QLNRLDCESESEDQNKGGSESDELANGEVGGDRs 1441
Cdd:pfam05483  618 KALKKKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQkeiEDKKISEEKLLEEVEKAKAIADEAVKLQKEIDK- 696
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1442 eKVKNQIKQM---MDVSRTQTAISVVEED--LKLLQCKLRASMSTKCNLEDQIKKLeedRSSLQSAKTVLEDEcktlRQK 1516
Cdd:pfam05483  697 -RCQHKIAEMvalMEKHKHQYDKIIEERDseLGLYKNKEQEQSSAKAALEIELSNI---KAELLSLKKQLEIE----KEE 768
                          330
                   ....*....|....*...
gi 1387212236 1517 VEILNELYQQKEMALQKK 1534
Cdd:pfam05483  769 KEKLKMEAKENTAILKDK 786
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1211-1411 2.18e-07

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 55.16  E-value: 2.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1211 QQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEET-NEILGDTAKSLRAMLES 1289
Cdd:COG4942     37 AELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAElEAQKEELAELLRALYRL 116
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1290 EREQNAK---NQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQLQQEIKd 1366
Cdd:COG4942    117 GRQPPLAlllSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKA- 195
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1387212236 1367 wskshaELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNR 1411
Cdd:COG4942    196 ------ERQKLLARLEKELAELAAELAELQQEAEELEALIARLEA 234
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1361-1636 2.91e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 55.83  E-value: 2.91e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1361 QQEIKDWSKSHAELSEQIRSFEKSQKDLEVALThkddninALTNCITQLNRLdcesesedqnkGGSESDELANGEVGGDR 1440
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKELE-------ELEEELEQLRKE-----------LEELSRQISALRKDLAR 737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1441 SEKVKNQIKQMMDvsRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEIL 1520
Cdd:TIGR02168  738 LEAEVEQLEERIA--QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLL 815
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1521 NELYQQKEMA---LQKKLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQkterSFKNQIATHEKKAHD 1597
Cdd:TIGR02168  816 NEEAANLRERlesLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELE----ALLNERASLEEALAL 891
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1387212236 1598 NWLKARAAERAIAEEKREAANLRHKLLELTQKMAMMQEE 1636
Cdd:TIGR02168  892 LRSELEELSEELRELESKRSELRRELEELREKLAQLELR 930
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1214-1595 4.01e-07

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 55.44  E-value: 4.01e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1214 SEKLKNIMKENAELVQKLSSYE------------QKIKESKKHV--QETK-----------KQNMILSDEAIKFKD---- 1264
Cdd:TIGR01612 1388 SEKLIKKIKDDINLEECKSKIEstlddkdideciKKIKELKNHIlsEESNidtyfknadenNENVLLLFKNIEMADnksq 1467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1265 ---KIKSLEET-------NEILGDTAKSLRAMLESEreqnaKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKl 1334
Cdd:TIGR01612 1468 hilKIKKDNATndhdfniNELKEHIDKSKGCKDEAD-----KNAKAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAK- 1541
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1335 seekVKSECHRVQEENARLKKK----KEQLQQEIKDWSKSHAELSEQIRSFEKSQK---DLEVALTHKDDNINALTNCIT 1407
Cdd:TIGR01612 1542 ----TKKDSEIIIKEIKDAHKKfileAEKSEQKIKEIKKEKFRIEDDAAKNDKSNKaaiDIQLSLENFENKFLKISDIKK 1617
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1408 QLNrlDCESESEdqnkggsesdelangevggdrseKVKNQIKQMmDVSRTQTAISVVEEDLKLLQCKLRASMSTKCNLED 1487
Cdd:TIGR01612 1618 KIN--DCLKETE-----------------------SIEKKISSF-SIDSQDTELKENGDNLNSLQEFLESLKDQKKNIED 1671
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1488 QIKKLEEDRSSLQSAKTVLEDECKTLRQK-VEILNELYQQKEMALQKKLSQEEYERQEREQRLSAADEKAVLAAEEVKTY 1566
Cdd:TIGR01612 1672 KKKELDELDSEIEKIEIDVDQHKKNYEIGiIEKIKEIAIANKEEIESIKELIEPTIENLISSFNTNDLEGIDPNEKLEEY 1751
                          410       420
                   ....*....|....*....|....*....
gi 1387212236 1567 KRRIEEMEDELQKTERSFKNQIATHEKKA 1595
Cdd:TIGR01612 1752 NTEIGDIYEEFIELYNIIAGCLETVSKEP 1780
COG5022 COG5022
Myosin heavy chain [General function prediction only];
1203-1576 6.10e-07

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 55.08  E-value: 6.10e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1203 SRVYQVTEQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSleetneilgdtakS 1282
Cdd:COG5022    852 GRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSS-------------D 918
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1283 LRAMLESEREQNAKNQDLISENKKSIEKLKDVISVnasefsEVQIALNEaklseekVKSECHRVQEENARLKKKKEQLQ- 1361
Cdd:COG5022    919 LIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKL------PELNKLHE-------VESKLKETSEEYEDLLKKSTILVr 985
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1362 ------QEIKDWSKSHAELSEQIRSFEKSQKDLEVaLTHKDDNINALTNCITQlnrldcESESEDQNKGGSES---DELA 1432
Cdd:COG5022    986 egnkanSELKNFKKELAELSKQYGALQESTKQLKE-LPVEVAELQSASKIISS------ESTELSILKPLQKLkglLLLE 1058
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1433 NGEVGG---------DRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQC--------KLRASMStKCNLEDQIKKleed 1495
Cdd:COG5022   1059 NNQLQArykalklrrENSLLDDKQLYQLESTENLLKTINVKDLEVTNRNLvkpanvlqFIVAQMI-KLNLLQEISK---- 1133
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1496 rsSLQSAKTVLEDECKTLRQKVEILNELYQQKemALQKKLSQEEYERQEREQRLSAA--DEKAVLAAEEVKTYKRRIEEM 1573
Cdd:COG5022   1134 --FLSQLVNTLEPVFQKLSVLQLELDGLFWEA--NLEALPSPPPFAALSEKRLYQSAlyDEKSKLSSSEVNDLKNELIAL 1209

                   ...
gi 1387212236 1574 EDE 1576
Cdd:COG5022   1210 FSK 1212
FPP pfam05911
Filament-like plant protein, long coiled-coil; FPP is a family of long coiled-coil plant ...
1201-1528 8.10e-07

Filament-like plant protein, long coiled-coil; FPP is a family of long coiled-coil plant proteins that are filament-like. It interacts with the nuclear envelope-associated protein, MAF1, the WPP family pfam13943.


Pssm-ID: 461778 [Multi-domain]  Cd Length: 859  Bit Score: 54.30  E-value: 8.10e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1201 VKSRVYQVTEQQISEKLKNIMKENAE-LVQKLSSYE--QKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEILG 1277
Cdd:pfam05911  441 VPVSSKDISLGKSLSWLQSRISVILEsHVTQKSIGKilEDIRCALQDINDSLPEADSCLSSGHPSTDASCDYITCKENSS 520
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1278 DTAKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVI---SVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLK 1354
Cdd:pfam05911  521 VVEKEGSVSGDDKSSEETSKQSIQQDLSKAISKIIDFVeglSKEALDDQDTSSDSSELSEVLQQFSATCNDVLSGKADLE 600
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1355 KKKEQLQqEIKDWSKSH-------AELSEQIRSFEKSQKD--LEVALTHKDDNINALTNCITQLNRLDCESESEDQNKGG 1425
Cdd:pfam05911  601 DFVLELS-HILDWISNHcfslldvSSMEDEIKKHDCIDKVtlSENKVAQVDNGCSEIDNLSSDPEIPSDGPLVSGSNDLK 679
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1426 SESDELANGEVGGDRSEKVKNQIKQMMDVSRTQTAIS---VVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSA 1502
Cdd:pfam05911  680 TEENKRLKEEFEQLKSEKENLEVELASCTENLESTKSqlqESEQLIAELRSELASLKESNSLAETQLKCMAESYEDLETR 759
                          330       340
                   ....*....|....*....|....*..
gi 1387212236 1503 KTVLEDECKTLRQKVEIL-NELYQQKE 1528
Cdd:pfam05911  760 LTELEAELNELRQKFEALeVELEEEKN 786
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1211-1601 8.78e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 54.35  E-value: 8.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1211 QQISEKLKNIMKENAELVQKLSSYEQKIKESKKHV-------QETKKQNMILSDEAIKFKDKIKSLeetneiLGDTAKSL 1283
Cdd:pfam15921  317 RQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLvlanselTEARTERDQFSQESGNLDDQLQKL------LADLHKRE 390
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1284 RAmLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALneaklseEKVKSECH-RVQEENARLKKKKE---- 1358
Cdd:pfam15921  391 KE-LSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALL-------KAMKSECQgQMERQMAAIQGKNEslek 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1359 ------QLQQEIKDWSKSHAELSEQIRSFEKSQK---DLEVALTHKDDNINALTNCITQL-NRLDCE--------SESED 1420
Cdd:pfam15921  463 vssltaQLESTKEMLRKVVEELTAKKMTLESSERtvsDLTASLQEKERAIEATNAEITKLrSRVDLKlqelqhlkNEGDH 542
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1421 QNKGGSESDELANGEVGGDRS-EKVKNQIKQMMDV----SRTQTAISV----VEEDLKLLQCKLRASMSTKCNLEDQIKK 1491
Cdd:pfam15921  543 LRNVQTECEALKLQMAEKDKViEILRQQIENMTQLvgqhGRTAGAMQVekaqLEKEINDRRLELQEFKILKDKKDAKIRE 622
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1492 LEEDRSSLQSAKTVLEDECKtlrQKVEILNELYQQKEMALQkklsqeeyerqereqrlsaadekavlaaeEVKTYKRRIE 1571
Cdd:pfam15921  623 LEARVSDLELEKVKLVNAGS---ERLRAVKDIKQERDQLLN-----------------------------EVKTSRNELN 670
                          410       420       430
                   ....*....|....*....|....*....|
gi 1387212236 1572 EMEDELQKTERSFKNQiaTHEKKAHDNWLK 1601
Cdd:pfam15921  671 SLSEDYEVLKRNFRNK--SEEMETTTNKLK 698
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1230-1582 1.30e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 53.92  E-value: 1.30e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1230 KLSSYEQKIKESKKHVQETKKQNMILSDE---AIKFKDKIKSLEETNeilgdtAKSLRAMLESEREQNAKNQDLISENKK 1306
Cdd:TIGR02169  178 ELEEVEENIERLDLIIDEKRQQLERLRRErekAERYQALLKEKREYE------GYELLKEKEALERQKEAIERQLASLEE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1307 SIEKLKDVISVNASEFSEVQIALNEA-----KLSEEK---VKSECHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQI 1378
Cdd:TIGR02169  252 ELEKLTEEISELEKRLEEIEQLLEELnkkikDLGEEEqlrVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEI 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1379 RSFEKSQKDLEVALTHKDDNINALTNCI----TQLNRLDCESESEDQnKGGSESDELANGEVggdRSEKVKNQIKQMmdv 1454
Cdd:TIGR02169  332 DKLLAEIEELEREIEEERKRRDKLTEEYaelkEELEDLRAELEEVDK-EFAETRDELKDYRE---KLEKLKREINEL--- 404
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1455 srtQTAISVVEEDLKLLQCKLRasmstkcNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILnelyqqkemalqkk 1534
Cdd:TIGR02169  405 ---KRELDRLQEELQRLSEELA-------DLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQL-------------- 460
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1535 lsqeeyerqereqrlsaadeKAVLAAEEVKTYKRR--IEEMEDELQKTER 1582
Cdd:TIGR02169  461 --------------------AADLSKYEQELYDLKeeYDRVEKELSKLQR 490
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1222-1412 1.52e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 52.46  E-value: 1.52e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1222 KENAELVQKLSSYEQKIKESKKHVQETKKQnmilsdeaikFKDKIKSLEETNEILGDTAKSLRAMlesEREQNAKNQDL- 1300
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKE----------EKALLKQLAALERRIAALARRIRAL---EQELAALEAELa 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1301 -----ISENKKSIEKLKDVIS------------------VNASEFSEVQI------ALNEAKLSE-EKVKSECHRVQEEN 1350
Cdd:COG4942     87 elekeIAELRAELEAQKEELAellralyrlgrqpplallLSPEDFLDAVRrlqylkYLAPARREQaEELRADLAELAALR 166
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1387212236 1351 ARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRL 1412
Cdd:COG4942    167 AELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEAL 228
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1266-1586 1.55e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 53.53  E-value: 1.55e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1266 IKSLEETNEILGDTAKSLRAM---LESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEA---KLSEEKV 1339
Cdd:PRK03918   157 LDDYENAYKNLGEVIKEIKRRierLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLekeVKELEEL 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1340 KSECHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEvalthkddNINALTNCITQLNRLDCESESE 1419
Cdd:PRK03918   237 KEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK--------ELKEKAEEYIKLSEFYEEYLDE 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1420 DQN--KGGSESDELANG--EVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQcKLRASMSTKCNLEDQIK----- 1490
Cdd:PRK03918   309 LREieKRLSRLEEEINGieERIKELEEKEERLEELKKKLKELEKRLEELEERHELYE-EAKAKKEELERLKKRLTgltpe 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1491 KLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEMALQkKLSQEEYERQEREQRLSAADEKAVLA--AEEVKTYKR 1568
Cdd:PRK03918   388 KLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIE-ELKKAKGKCPVCGRELTEEHRKELLEeyTAELKRIEK 466
                          330
                   ....*....|....*...
gi 1387212236 1569 RIEEMEDELQKTERSFKN 1586
Cdd:PRK03918   467 ELKEIEEKERKLRKELRE 484
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1211-1580 3.15e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 52.67  E-value: 3.15e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1211 QQISEKLKNIMKENAELVQKLSSYEQKIKESKKhvqetkKQNMILSDEAIKFKDKIKSLEETNeilGDTAKSLRAMLESE 1290
Cdd:pfam02463  665 KASLSELTKELLEIQELQEKAESELAKEEILRR------QLEIKKKEQREKEELKKLKLEAEE---LLADRVQEAQDKIN 735
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1291 REQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQialNEAKLSEEKVKSECHRVQEENARLKKKKEQLQQEIKDWSKS 1370
Cdd:pfam02463  736 EELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSL---KEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKE 812
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1371 HAELSEQirsFEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQnkggsesdELANGEVGGDRSEKVKNQIKQ 1450
Cdd:pfam02463  813 EAELLEE---EQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLE--------EEITKEELLQELLLKEEELEE 881
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1451 MMDVSRTQTAISVVEEDLKLLQCKLRASmstkCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQ-KEM 1529
Cdd:pfam02463  882 QKLKDELESKEEKEKEEKKELEEESQKL----NLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENnKEE 957
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1387212236 1530 ALQKKLSQEEYERQEREQRLSAADE----------------------KAVLAAEEVKTYKRRIEEMEDELQKT 1580
Cdd:pfam02463  958 EEERNKRLLLAKEELGKVNLMAIEEfeekeerynkdelekerleeekKKLIRAIIEETCQRLKEFLELFVSIN 1030
PTZ00121 PTZ00121
MAEBL; Provisional
1210-1636 3.34e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 52.45  E-value: 3.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1210 EQQISEKLKnimKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEILGDTAKSLRAMLES 1289
Cdd:PTZ00121  1313 EAKKADEAK---KKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1290 EREQNAKNQDlISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEE---NARLKKKKEQLQQEIKD 1366
Cdd:PTZ00121  1390 KKKADEAKKK-AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEakkKAEEAKKAEEAKKKAEE 1468
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1367 WSKSH--AELSEQIRSFEKSQKDLEVALTHKDdninaltncitQLNRLDCESESEDQNKGGSE---SDELANGEVGGDRS 1441
Cdd:PTZ00121  1469 AKKADeaKKKAEEAKKADEAKKKAEEAKKKAD-----------EAKKAAEAKKKADEAKKAEEakkADEAKKAEEAKKAD 1537
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1442 EKVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKlRASMSTKCNLE--DQIKKLEEDRsslqsAKTVLEDECKTLRQKVEI 1519
Cdd:PTZ00121  1538 EAKKAEEKKKADELKKAEELKKAEEKKKAEEAK-KAEEDKNMALRkaEEAKKAEEAR-----IEEVMKLYEEEKKMKAEE 1611
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1520 LNELYQQKEMALQKKlsqEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTERSFKNQiATHEKKAHDNW 1599
Cdd:PTZ00121  1612 AKKAEEAKIKAEELK---KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKK-AEEAKKAEEDE 1687
                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1387212236 1600 LKARAAERAIAEEKREAANLRHKLLELTQKMAMMQEE 1636
Cdd:PTZ00121  1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKA 1724
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1202-1533 3.52e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 52.37  E-value: 3.52e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1202 KSRVYQVTEQQISEKLKN-------IMKENAELVQKLSSYEQKIKESKKHVQETKKQNMI-------LSDE-----AIKF 1262
Cdd:PRK03918   378 KKRLTGLTPEKLEKELEElekakeeIEEEISKITARIGELKKEIKELKKAIEELKKAKGKcpvcgreLTEEhrkelLEEY 457
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1263 KDKIKSLEETNEILGDTAKSLRAMLEsEREQNAKNQDLISENKKSIEKLKDV-----------ISVNASEFSEVQIALNE 1331
Cdd:PRK03918   458 TAELKRIEKELKEIEEKERKLRKELR-ELEKVLKKESELIKLKELAEQLKELeeklkkynleeLEKKAEEYEKLKEKLIK 536
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1332 AKLSEEKVKSECHRVQEenarLKKKKEQLQQEIKDWSKSHAELSEQIRSFE-KSQKDLEVALTHKD---DNINALTNCIT 1407
Cdd:PRK03918   537 LKGEIKSLKKELEKLEE----LKKKLAELEKKLDELEEELAELLKELEELGfESVEELEERLKELEpfyNEYLELKDAEK 612
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1408 QLNRLDCESESEdQNKGGSESDELANGEvggDRSEKVKNQIKQ----------------MMDVSRtqtAISVVEEDLKLL 1471
Cdd:PRK03918   613 ELEREEKELKKL-EEELDKAFEELAETE---KRLEELRKELEElekkyseeeyeelreeYLELSR---ELAGLRAELEEL 685
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1387212236 1472 QCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDeCKTLRQKVEILNELyqQKEMALQK 1533
Cdd:PRK03918   686 EKRREEIKKTLEKLKEELEEREKAKKELEKLEKALER-VEELREKVKKYKAL--LKERALSK 744
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
1234-1534 3.55e-06

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coil domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 52.00  E-value: 3.55e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1234 YEQKIKESKKHVQETKKQN---MILSDEAIKFKDKIKSLEETNeilgDTAKSLRAMLESERE--------------QNAK 1296
Cdd:pfam05622   85 YRIKCEELEKEVLELQHRNeelTSLAEEAQALKDEMDILRESS----DKVKKLEATVETYKKkledlgdlrrqvklLEER 160
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1297 NQDLISENKKSIEKLKDVISVNAS-EFSEVQIALNEAKLSEEKVKS-----ECHRVQEENARLKKKKEQLQQEiKDWSKs 1370
Cdd:pfam05622  161 NAEYMQRTLQLEEELKKANALRGQlETYKRQVQELHGKLSEESKKAdklefEYKKLEEKLEALQKEKERLIIE-RDTLR- 238
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1371 haELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQ---------LNRLDCESESEDQNKGGSESDELANGEVGGDRS 1441
Cdd:pfam05622  239 --ETNEELRCAQLQQAELSQADALLSPSSDPGDNLAAEimpaeirekLIRLQHENKMLRLGQEGSYRERLTELQQLLEDA 316
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1442 EKVKNQIKQMMDVSRTQtaISVVEEDLKLLQCKLRA-------SMSTKCNLEDQIKKLEEDRSSLQSAKTVLED-ECKTL 1513
Cdd:pfam05622  317 NRRKNELETQNRLANQR--ILELQQQVEELQKALQEqgskaedSSLLKQKLEEHLEKLHEAQSELQKKKEQIEElEPKQD 394
                          330       340
                   ....*....|....*....|.
gi 1387212236 1514 RQKVEILNELyqqkEMALQKK 1534
Cdd:pfam05622  395 SNLAQKIDEL----QEALRKK 411
46 PHA02562
endonuclease subunit; Provisional
1292-1529 5.80e-06

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 51.17  E-value: 5.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1292 EQNAKNQDLISENK---KSIEKLKDVIsvnasefsEVQIALNEAKLSEEKVKSEchrvqEENARLKKKKEQLQQEIKDWS 1368
Cdd:PHA02562   167 EMDKLNKDKIRELNqqiQTLDMKIDHI--------QQQIKTYNKNIEEQRKKNG-----ENIARKQNKYDELVEEAKTIK 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1369 KSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRL-----------DCESE-SEDQNKGGSESDELANGEV 1436
Cdd:PHA02562   234 AEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVikmyekggvcpTCTQQiSEGPDRITKIKDKLKELQH 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1437 GGDRSEKVKNQIKQMMDVSRTQTaisvveEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQK 1516
Cdd:PHA02562   314 SLEKLDTAIDELEEIMDEFNEQS------KKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDE 387
                          250
                   ....*....|...
gi 1387212236 1517 VEILNELYQQKEM 1529
Cdd:PHA02562   388 LDKIVKTKSELVK 400
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1210-1401 6.33e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 50.60  E-value: 6.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1210 EQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEaikfkdkiksLEETNEILGDTAkslRAMLES 1289
Cdd:COG3883     32 LEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAE----------IEERREELGERA---RALYRS 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1290 EREQ-------NAKN-QDLISenkkSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQLQ 1361
Cdd:COG3883     99 GGSVsyldvllGSESfSDFLD----RLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELE 174
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1387212236 1362 QEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINA 1401
Cdd:COG3883    175 AQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAA 214
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1302-1582 7.43e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 51.22  E-value: 7.43e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1302 SENKKSIEKLKDVisvnasefsEVQIALNEAKLSEekVKSECHRVQEENAR------LKKKKEQLQQ-----EIKDWSKS 1370
Cdd:TIGR02169  170 RKKEKALEELEEV---------EENIERLDLIIDE--KRQQLERLRREREKaeryqaLLKEKREYEGyellkEKEALERQ 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1371 HAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRldcesesedqnkggsESDELangevGGDRSEKVKNQIKQ 1450
Cdd:TIGR02169  239 KEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNK---------------KIKDL-----GEEEQLRVKEKIGE 298
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1451 M-MDVSRTQTAISVVEEDLKllqcklrasmstkcNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEM 1529
Cdd:TIGR02169  299 LeAEIASLERSIAEKERELE--------------DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKE 364
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1387212236 1530 ALQKKLSQeeyerqereqrLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTER 1582
Cdd:TIGR02169  365 ELEDLRAE-----------LEEVDKEFAETRDELKDYREKLEKLKREINELKR 406
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1641-1929 8.94e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.94  E-value: 8.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1641 KPMPGRPNTQNPPRRGPLSQNGSFGPSPVSGGEcSPPLTADPPARPLSATLNRREMPRSEFG--------SVDGPLPRPR 1712
Cdd:PHA03307    83 ESRSTPTWSLSTLAPASPAREGSPTPPGPSSPD-PPPPTPPPASPPPSPAPDLSEMLRPVGSpgpppaasPPAAGASPAA 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1713 WASEASGKPSASDPesgaAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIPSLAEHPVSVSMAAKG---PPPFPGT 1789
Cdd:PHA03307   162 VASDAASSRQAALP----LSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSaadDAGASSS 237
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1790 PLMSSPVGGPLLPPIRYGPPPQLCGPFGPRPLPPPFGPGMRPPLGLREYAPGVPPGKRDLPLDPREFLPPGHAPFRPLGS 1869
Cdd:PHA03307   238 DSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSS 317
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1870 LGP----------REYFFPGTRLPPPnhGPqdyPPSSAARDLPPSGSRDEPPPASQGASQDCSPALKQSP 1929
Cdd:PHA03307   318 SSSsressssstsSSSESSRGAAVSP--GP---SPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASA 382
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1212-1582 9.65e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 50.83  E-value: 9.65e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1212 QISEKLKNIMKENAELVQKLSSYEqKIKESKKHVQETKKQNMILSDEaiKFKDKIKSLEETNEILGDTAKSLRAMLESER 1291
Cdd:PRK03918   342 ELKKKLKELEKRLEELEERHELYE-EAKAKKEELERLKKRLTGLTPE--KLEKELEELEKAKEEIEEEISKITARIGELK 418
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1292 EQnaknqdlISENKKSIEKL---KDVISVNASEFSEVQIA--LNEAKLSEEKVKSECHRVQEENARLKKKKEQLQQEIKD 1366
Cdd:PRK03918   419 KE-------IKELKKAIEELkkaKGKCPVCGRELTEEHRKelLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKK 491
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1367 WSK--SHAELSEQIRSFEKSQKDLEVALTHKDD---------------NINALTNCITQLNRLDCESEsEDQNKGGSESD 1429
Cdd:PRK03918   492 ESEliKLKELAEQLKELEEKLKKYNLEELEKKAeeyeklkekliklkgEIKSLKKELEKLEELKKKLA-ELEKKLDELEE 570
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1430 ELAN-----GEVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKLRasmSTKCNLEDQIKKLEEDRSSLQSAKT 1504
Cdd:PRK03918   571 ELAEllkelEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELK---KLEEELDKAFEELAETEKRLEELRK 647
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1505 VLED-ECKTLRQKVEILNELYQQKEMALQKKLSQEEYERQEREQRLSAADE-KAVLaaEEVKTYKRRIEEMEDELQKTER 1582
Cdd:PRK03918   648 ELEElEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKlKEEL--EEREKAKKELEKLEKALERVEE 725
PHA03247 PHA03247
large tegument protein UL36; Provisional
1685-1926 1.16e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1685 RPLSATLNRREM----------PRSEFGSVDGPLPRPrwASEASGKPSASDPESGAAPTVnsssrssspskvmdegkqtV 1754
Cdd:PHA03247  2462 APFSLSLLLGELfpgapvyrrpAEARFPFAAGAAPDP--GGGGPPDPDAPPAPSRLAPAI-------------------L 2520
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1755 PQEPEGPSVPS-----IPSLAEhpvSVSMAAKGPPPfPGTPLMSSPVGGPLLPPIRYGPPPqlcgpfGPRPLPPPFGPGM 1829
Cdd:PHA03247  2521 PDEPVGEPVHPrmltwIRGLEE---LASDDAGDPPP-PLPPAAPPAAPDRSVPPPRPAPRP------SEPAVTSRARRPD 2590
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1830 RPPLGLREYAPGVPPGKRDLPLDPREFLPPGHAPFRPLGSLGPR--EYFFPGTRLPPPNHGPQDYPPSSAARDLPPSGSR 1907
Cdd:PHA03247  2591 APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAanEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL 2670
                          250
                   ....*....|....*....
gi 1387212236 1908 DEPPPASQGASQDCSPALK 1926
Cdd:PHA03247  2671 GRAAQASSPPQRPRRRAAR 2689
PHA03378 PHA03378
EBNA-3B; Provisional
1630-1918 1.32e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 50.45  E-value: 1.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1630 MAMMQEEPVIVKPMPGRPNTQNPPRRGPlsqngsfgpspvsgGECSPPLTADPPARPLSATLNRREMPRSEFGSVDGPLP 1709
Cdd:PHA03378   684 MLPIQWAPGTMQPPPRAPTPMRPPAAPP--------------GRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAA 749
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1710 RPRWASEASGKPSASDPESGAAptvnsssrssspskvmdeGKQTVPQEPEGPSVP-SIPSLAEHPVSvsmaakgPPPFPG 1788
Cdd:PHA03378   750 APGRARPPAAAPGRARPPAAAP------------------GAPTPQPPPQAPPAPqQRPRGAPTPQP-------PPQAGP 804
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1789 TPLMSSPVGgpllPPIRYGPPPQLCGPFGPRPLPPPFGPGMRPPLGLREYAPGVPP----GKRDLPLDPREFLPPGHAPF 1864
Cdd:PHA03378   805 TSMQLMPRA----APGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQAAAGPTPspgsGTSDKIVQAPVFYPPVLQPI 880
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1387212236 1865 RPLGSLG-------------PREYffPGTRLPPPNHGPQDYPPSSAARdlppSGSRDEPPPASQGAS 1918
Cdd:PHA03378   881 QVMRQLGsvraaaastvtqaPTEY--TGERRGVGPMHPTDIPPSKRAK----TDAYVESQPPHGGQS 941
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1204-1636 1.53e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 50.06  E-value: 1.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1204 RVYQVTEQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEILGDTAKSL 1283
Cdd:TIGR02168  305 QILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETL 384
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1284 RAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSE------------EKVKSECHRVQEENA 1351
Cdd:TIGR02168  385 RSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKElqaeleeleeelEELQEELERLEEALE 464
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1352 RLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLE------VALTHKDDNINALTNCITQLNRLDCESE-------- 1417
Cdd:TIGR02168  465 ELREELEEAEQALDAAERELAQLQARLDSLERLQENLEgfsegvKALLKNQSGLSGILGVLSELISVDEGYEaaieaalg 544
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1418 -------SEDQNKGGSESDELANGEVG-----------GDRSEKVKNQIKQMMDVSRtQTAISVVEEDLKL--------- 1470
Cdd:TIGR02168  545 grlqavvVENLNAAKKAIAFLKQNELGrvtflpldsikGTEIQGNDREILKNIEGFL-GVAKDLVKFDPKLrkalsyllg 623
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1471 -------------LQCKLRASM------------------------STKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTL 1513
Cdd:TIGR02168  624 gvlvvddldnaleLAKKLRPGYrivtldgdlvrpggvitggsaktnSSILERRREIEELEEKIEELEEKIAELEKALAEL 703
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1514 RQKVEILN-------------------------------ELYQQKEMALQKKLSQEEYERQEREQRLSAADEKAVLAAEE 1562
Cdd:TIGR02168  704 RKELEELEeeleqlrkeleelsrqisalrkdlarleaevEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE 783
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1387212236 1563 VKTYKRRIEEMEDELQKTER---SFKNQIATHEKKAHDNWLKARAAERAIAEEKREAANLRHKLLELTQKMAMMQEE 1636
Cdd:TIGR02168  784 IEELEAQIEQLKEELKALREaldELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAE 860
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1200-1533 1.60e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 50.12  E-value: 1.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1200 AVKSRVYQVTEQQISE-KLKNimkENAELVQKLSSYEQKIKES-KKHVQETKKQNMIL-------SDEAIKFKDKIKSLE 1270
Cdd:pfam15921  437 AMKSECQGQMERQMAAiQGKN---ESLEKVSSLTAQLESTKEMlRKVVEELTAKKMTLessertvSDLTASLQEKERAIE 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1271 ETN-EIlgdTAKSLRAMLESEREQNAKNQDlisENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHR---- 1345
Cdd:pfam15921  514 ATNaEI---TKLRSRVDLKLQELQHLKNEG---DHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRtaga 587
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1346 VQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLE---VALTH---------------KDDNINALTNCIT 1407
Cdd:pfam15921  588 MQVEKAQLEKEINDRRLELQEFKILKDKKDAKIRELEARVSDLElekVKLVNagserlravkdikqeRDQLLNEVKTSRN 667
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1408 QLNRLDCESESEDQN-KGGSESDELANGEV------GGDRSEKVKNQIKQM-----------MDVSRTQTA----ISVVE 1465
Cdd:pfam15921  668 ELNSLSEDYEVLKRNfRNKSEEMETTTNKLkmqlksAQSELEQTRNTLKSMegsdghamkvaMGMQKQITAkrgqIDALQ 747
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1387212236 1466 EDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEMALQK 1533
Cdd:pfam15921  748 SKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERRLKEKVANMEVALDK 815
PHA03247 PHA03247
large tegument protein UL36; Provisional
1642-1929 1.82e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1642 PMPGRPNTQNPPRRGPLSQNGSFGPSPVSGGECSPPLTADPPARPLSATLNRREMPRSEFGSVDGPLPRPRWASEASGKP 1721
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSP 2633
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1722 SASDPESG---AAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPS---IPSLAEHPVSVSMAAKGPPPFPGTPlmssp 1795
Cdd:PHA03247  2634 AANEPDPHpppTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrpRRRAARPTVGSLTSLADPPPPPPTP----- 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1796 vgGPLLPPIRYGPPPQLCGPFGPRPLPPPFGPGMRPPLGLREYAPGVP--PGKRDLPLDP--------REFLPPGHAPFR 1865
Cdd:PHA03247  2709 --EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParPARPPTTAGPpapappaaPAAGPPRRLTRP 2786
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1387212236 1866 PLGSLGPREYFFPGTRLPPPNHGPQDYPPSSAARDLPPSGSrdEPPPASqgaSQDCSPALKQSP 1929
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP--LPPPTS---AQPTAPPPPPGP 2845
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1272-1600 2.00e-05

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 49.13  E-value: 2.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1272 TNEILGDTAKSLRAMLESEREQNAKNQDL---ISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQE 1348
Cdd:COG4372     22 TGILIAALSEQLRKALFELDKLQEELEQLreeLEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQE 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1349 ENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQNKGGSES 1428
Cdd:COG4372    102 ELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEA 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1429 DELANGEVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQ------CKLRASMSTKCNLEdQIKKLEEDRSSLQSA 1502
Cdd:COG4372    182 EQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDsleaklGLALSALLDALELE-EDKEELLEEVILKEI 260
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1503 KTVLEDECKTLRQKVEILNELYQQKEMALQKKLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTER 1582
Cdd:COG4372    261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELA 340
                          330
                   ....*....|....*...
gi 1387212236 1583 SFKNQIATHEKKAHDNWL 1600
Cdd:COG4372    341 DLLQLLLVGLLDNDVLEL 358
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1348-1636 2.31e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 49.67  E-value: 2.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1348 EENARLKKKKEQLQQEIKdwsksHAELSEQIRsfeksQKDLEVALTH---KDDNINALTNCITQLNRlDCESESEDQNKG 1424
Cdd:TIGR02168  197 ELERQLKSLERQAEKAER-----YKELKAELR-----ELELALLVLRleeLREELEELQEELKEAEE-ELEELTAELQEL 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1425 GSESDEL--ANGEVggdrsEKVKNQIKQmmDVSRTQTAISVVEEDLKLLQCKLRasmstkcNLEDQIKKLEEDRSSLQSA 1502
Cdd:TIGR02168  266 EEKLEELrlEVSEL-----EEEIEELQK--ELYALANEISRLEQQKQILRERLA-------NLERQLEELEAQLEELESK 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1503 KTVLEDECKTLRQKVEILNELYQqkemALQKKLSQEEYERQEREQRLSAADE-------KAVLAAEEVKTYKRRIEEMED 1575
Cdd:TIGR02168  332 LDELAEELAELEEKLEELKEELE----SLEAELEELEAELEELESRLEELEEqletlrsKVAQLELQIASLNNEIERLEA 407
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1387212236 1576 ELQKTERS---FKNQIATHEKKAHDNWLKarAAERAIAEEKREAANLRHKLLELTQKMAMMQEE 1636
Cdd:TIGR02168  408 RLERLEDRrerLQQEIEELLKKLEEAELK--ELQAELEELEEELEELQEELERLEEALEELREE 469
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1254-1507 2.32e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 49.38  E-value: 2.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1254 ILSDEAIKFKDKIKSLEETNEIlGDT--AKSLRAMLESEREQNAknQDLISENKKSIEklkdvisVNASEFSEVQIALNE 1331
Cdd:COG4717     13 KFRDRTIEFSPGLNVIYGPNEA-GKStlLAFIRAMLLERLEKEA--DELFKPQGRKPE-------LNLKELKELEEELKE 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1332 AKLSEEkvksECHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQ--KDLEVALTHKDDNINALTNCITQL 1409
Cdd:COG4717     83 AEEKEE----EYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQelEALEAELAELPERLEELEERLEEL 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1410 NRL--DCESESEDQNKGGSESDELANgevggDRSEKVKNQIKQMM-DVSRTQTAISVVEEDLKLLQCKLRAsmstkcnLE 1486
Cdd:COG4717    159 RELeeELEELEAELAELQEELEELLE-----QLSLATEEELQDLAeELEELQQRLAELEEELEEAQEELEE-------LE 226
                          250       260
                   ....*....|....*....|.
gi 1387212236 1487 DQIKKLEEDRSSLQSAKTVLE 1507
Cdd:COG4717    227 EELEQLENELEAAALEERLKE 247
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1347-1582 2.88e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.61  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1347 QEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRldceseseDQNKGGS 1426
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEA--------ELAELEK 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1427 ESDELANgevggdRSEKVKNQIKQMMDVSRTQTAISVVE-----EDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQS 1501
Cdd:COG4942     91 EIAELRA------ELEAQKEELAELLRALYRLGRQPPLAlllspEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAA 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1502 AKTVLEDECKTLRQkveILNELYQQKEmALQKKLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTE 1581
Cdd:COG4942    165 LRAELEAERAELEA---LLAELEEERA-ALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAA 240

                   .
gi 1387212236 1582 R 1582
Cdd:COG4942    241 E 241
PTZ00121 PTZ00121
MAEBL; Provisional
1222-1631 3.89e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.98  E-value: 3.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1222 KENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEilgdtAKSLRAMLESEREQNAKNQdli 1301
Cdd:PTZ00121  1315 KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE-----AAEKKKEEAKKKADAAKKK--- 1386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1302 SENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEE-KVKSECHRVQEE---NARLKKKKEQLQQEIKDWSKSH--AELS 1375
Cdd:PTZ00121  1387 AEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEaKKKAEEKKKADEakkKAEEAKKADEAKKKAEEAKKAEeaKKKA 1466
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1376 EQIRSFEKSQKDLEVALTHKDDNINA--LTNCITQLNRLDCESESEDQNKGGSE---SDELANGEVGGDRSEKVKNQIKQ 1450
Cdd:PTZ00121  1467 EEAKKADEAKKKAEEAKKADEAKKKAeeAKKKADEAKKAAEAKKKADEAKKAEEakkADEAKKAEEAKKADEAKKAEEKK 1546
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1451 MMDVSRTQTAISVVEEDLKLLQCKlRASMSTKCNLE--DQIKKLEEDRSSLQSAKTVLEDECKT--LRQKVEILNELYQQ 1526
Cdd:PTZ00121  1547 KADELKKAEELKKAEEKKKAEEAK-KAEEDKNMALRkaEEAKKAEEARIEEVMKLYEEEKKMKAeeAKKAEEAKIKAEEL 1625
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1527 KEMALQKK----LSQEEYERQEREQRLSAADEKAVLAAEEVKTY----KRRIEEM---EDELQKTERSFKNQiaTHEKKA 1595
Cdd:PTZ00121  1626 KKAEEEKKkveqLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKaeedKKKAEEAkkaEEDEKKAAEALKKE--AEEAKK 1703
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1387212236 1596 HDNWLKARAAERAIAEEKREAANLRHKLLELTQKMA 1631
Cdd:PTZ00121  1704 AEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEA 1739
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1201-1390 7.17e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.75  E-value: 7.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1201 VKSRVYQVTEQQISekLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIK-FKDKIKSLEET-NEILgd 1278
Cdd:PRK03918   530 LKEKLIKLKGEIKS--LKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEeLEERLKELEPFyNEYL-- 605
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1279 TAKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLseEKVKSECHRVQEENARLKKKKE 1358
Cdd:PRK03918   606 ELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEY--EELREEYLELSRELAGLRAELE 683
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1387212236 1359 QLQQEIKDWSKSHAELSEQIRSFEKSQKDLEV 1390
Cdd:PRK03918   684 ELEKRREEIKKTLEKLKEELEEREKAKKELEK 715
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1652-1924 8.07e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.86  E-value: 8.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1652 PPRRGPLSQNGSFGPSPVSGgecSPPLTADPPARPLSATLNRREMPRSEFGSVDGPLPRPRwaseasgkPSASDPESGAA 1731
Cdd:PHA03307    19 EFFPRPPATPGDAADDLLSG---SQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPG--------TEAPANESRST 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1732 PTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIPslaehpvsvsmAAKGPPPFPGTPLMSSPVGGPLLPPIRYGPPPQ 1811
Cdd:PHA03307    88 PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP-----------ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1812 LCGPFGPRPLPPPFGPGMRPPLGLREYAPGVPPGKRDLPLDPREFLPPG-HAPFRPL-------GSLGPREYFF------ 1877
Cdd:PHA03307   157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRpPRRSSPIsasasspAPAPGRSAADdagass 236
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1387212236 1878 --------------PGTRLPPPNHGPQDYPPSSAARDLPPSGSRDEPPPASQGASQDCSPA 1924
Cdd:PHA03307   237 sdssssessgcgwgPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPS 297
PHA03247 PHA03247
large tegument protein UL36; Provisional
1644-1912 8.77e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 8.77e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1644 PGRPNTQNPPR-RGPLSQNGsfGPSPVSGGECSPPlTADPPARPLSATLNRREM-----PR--------SEFGSVDGPLP 1709
Cdd:PHA03247  2475 PGAPVYRRPAEaRFPFAAGA--APDPGGGGPPDPD-APPAPSRLAPAILPDEPVgepvhPRmltwirglEELASDDAGDP 2551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1710 RPRWASEA-SGKPSASDPESGAAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIPSLAEHPVSVSMAAKGPPPFPG 1788
Cdd:PHA03247  2552 PPPLPPAApPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP 2631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1789 TPLMSSPVGGPLLPpiryGPPPqlcgpfgprplpppfgpgmrpplglreyapgvpPGKRDLPLDPREFLPP-GHAPFRPL 1867
Cdd:PHA03247  2632 SPAANEPDPHPPPT----VPPP---------------------------------ERPRDDPAPGRVSRPRrARRLGRAA 2674
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1387212236 1868 GSLGPreyffpgTRLPPPNHGPQDYPPSSAARDLPPSGSRDEPPP 1912
Cdd:PHA03247  2675 QASSP-------PQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP 2712
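Each hit on this page carries a summary line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, optionally tagged `[Multi-domain]`. The following is a minimal sketch of how such a line could be read into typed fields, assuming the layout seen in the records here; the function name `parse_summary` and the dictionary keys are hypothetical and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this text dump.
# The "[Multi-domain]" tag is optional (assumption based on the records shown).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    example = ("Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  "
               "Bit Score: 48.01  E-value: 8.77e-05")
    print(parse_summary(example))
```

Returning `None` for non-matching lines lets a caller scan the whole dump line by line and keep only the summary records.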
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1225-1582 9.78e-05

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryotic cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tails from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 47.48  E-value: 9.78e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1225 AELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNE--------------ILGDTAKSLRAMLESE 1290
Cdd:pfam01576   64 ARLAARKQELEEILHELESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDeeeaarqklqlekvTTEAKIKKLEEDILLL 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1291 REQNAKnqdLISENKKSIEKLKDVISvNASEFSEVQIALNEAKLSEEKVKS--ECHRVQEENAR--LKKKKEQLQQEIKD 1366
Cdd:pfam01576  144 EDQNSK---LSKERKLLEERISEFTS-NLAEEEEKAKSLSKLKNKHEAMISdlEERLKKEEKGRqeLEKAKRKLEGESTD 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1367 WSKSHAELSEQIR----SFEKSQKDLEVALTHKDDNINALTNCITQLNRL---------DCESESEDQNKG-------GS 1426
Cdd:pfam01576  220 LQEQIAELQAQIAelraQLAKKEEELQAALARLEEETAQKNNALKKIRELeaqiselqeDLESERAARNKAekqrrdlGE 299
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1427 ESDEL------------ANGEVGGDRSEKVKNQIKQMMDVSRT---------QTAISVVEEDLKLLQCKLRASMS---TK 1482
Cdd:pfam01576  300 ELEALkteledtldttaAQQELRSKREQEVTELKKALEEETRSheaqlqemrQKHTQALEELTEQLEQAKRNKANlekAK 379
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1483 CNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEMA---LQKKLSQEEYERQEREQRLSAADEKAVLA 1559
Cdd:pfam01576  380 QALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQraeLAEKLSKLQSELESVSSLLNEAEGKNIKL 459
                          410       420
                   ....*....|....*....|...
gi 1387212236 1560 AEEVKTYKRRIEEMEDELQKTER 1582
Cdd:pfam01576  460 SKDVSSLESQLQDTQELLQEETR 482
46 PHA02562
endonuclease subunit; Provisional
1210-1376 1.09e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 46.93  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1210 EQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFkdkIKSLEETNEILGDTaKSLRAMLES 1289
Cdd:PHA02562   194 QQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNL---VMDIEDPSAALNKL-NTAAAKIKS 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1290 EREQNAK--------------NQDLISENKKsIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKK 1355
Cdd:PHA02562   270 KIEQFQKvikmyekggvcptcTQQISEGPDR-ITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKIST 348
                          170       180
                   ....*....|....*....|.
gi 1387212236 1356 KKEQLQQEIKDWSKSHAELSE 1376
Cdd:PHA02562   349 NKQSLITLVDKAKKVKAAIEE 369
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1207-1582 1.22e-04

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S. cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 47.04  E-value: 1.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1207 QVTEQQISEKLKnimkENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKD---KIKSLEETNEILGDTAKSL 1283
Cdd:pfam05557  121 QRAELELQSTNS----ELEELQERLDLLKAKASEAEQLRQNLEKQQSSLAEAEQRIKElefEIQSQEQDSEIVKNSKSEL 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1284 RAMLESEREQnaknQDLISENKKSIEKLKDVisvnasEFSEVQIALNEAKLSEEKvksechRVQEENARLKKKKEQLQQE 1363
Cdd:pfam05557  197 ARIPELEKEL----ERLREHNKHLNENIENK------LLLKEEVEDLKRKLEREE------KYREEAATLELEKEKLEQE 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1364 IKDWSKSHAELSEQIRSFEksqkdlevalthkddninALTNCITQLNRLDCESESEdqnKGGSESDELANGEVGGDRSEK 1443
Cdd:pfam05557  261 LQSWVKLAQDTGLNLRSPE------------------DLSRRIEQLQQREIVLKEE---NSSLTSSARQLEKARRELEQE 319
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1444 VKNQIKQMMDVSRTQTAISVVEEDL---KLLQCKLRASMstKCNLEDQIKKLEEDRSSLQSAKTVLEDEckTLRQKVEIL 1520
Cdd:pfam05557  320 LAQYLKKIEDLNKKLKRHKALVRRLqrrVLLLTKERDGY--RAILESYDKELTMSNYSPQLLERIEEAE--DMTQKMQAH 395
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1387212236 1521 NELYQQKEMALQKKLSQEEYERQEREQRLSAADEKAVLA-----AEEVKTYKRRIEEMEDELQKTER 1582
Cdd:pfam05557  396 NEEMEAQLSVAEEELGGYKQQAQTLERELQALRQQESLAdpsysKEEVDSLRRKLETLELERQRLRE 462
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1278-1553 1.33e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 46.30  E-value: 1.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1278 DTAKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVIsvnasEFSEVQIALNEAKLSEekVKSECHRVQEENARLKKKK 1357
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQL-----AALERRIAALARRIRA--LEQELAALEAELAELEKEI 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1358 EQLQQEIKdwsKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRLdcesesedqnkggsesdelangevg 1437
Cdd:COG4942     93 AELRAELE---AQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYL------------------------- 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1438 gdrsekVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKLRASMStkcNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKV 1517
Cdd:COG4942    145 ------APARREQAEELRADLAELAALRAELEAERAELEALLA---ELEEERAALEALKAERQKLLARLEKELAELAAEL 215
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1387212236 1518 EILnelyQQKEMALQKKLSQEEYERQEREQRLSAAD 1553
Cdd:COG4942    216 AEL----QQEAEELEALIARLEAEAAAAAERTPAAG 247
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1188-1588 1.52e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 46.87  E-value: 1.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1188 VSFAVFFWRTVLAVKSRVYQVTEQQISEKLKNIMKENaELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIK 1267
Cdd:COG5185    148 DIEASYGEVETGIIKDIFGKLTQELNQNLKKLEIFGL-TLGLLKGISELKKAEPSGTVNSIKESETGNLGSESTLLEKAK 226
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1268 SLEETNEILgdtakSLRAMLESEREQNAKNQDLISENKKSIEKLKDvisvnasefsevqialneAKLSEEKVKSEchRVQ 1347
Cdd:COG5185    227 EIINIEEAL-----KGFQDPESELEDLAQTSDKLEKLVEQNTDLRL------------------EKLGENAESSK--RLN 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1348 EENARLKKKKEQLQQEI------KDWSKSHAELSEQIRSFEKSQKdLEVALTHKDDNINALTNCITQLNrldcESESEDQ 1421
Cdd:COG5185    282 ENANNLIKQFENTKEKIaeytksIDIKKATESLEEQLAAAEAEQE-LEESKRETETGIQNLTAEIEQGQ----ESLTENL 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1422 NKGGSESDELAnGEVGGDRSEKVKNQIKQMMDVSRT--------------------QTAISVVEEDLKLLQCKLRASMSt 1481
Cdd:COG5185    357 EAIKEEIENIV-GEVELSKSSEELDSFKDTIESTKEsldeipqnqrgyaqeilatlEDTLKAADRQIEELQRQIEQATS- 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1482 kcNLEDQIKKLEEDRSSLQSAKTVLEDECK------------TLRQKVEILNELYQQKEMALQKKLSQEEYERQEREQRL 1549
Cdd:COG5185    435 --SNEEVSKLLNELISELNKVMREADEESQsrleeaydeinrSVRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQL 512
                          410       420       430
                   ....*....|....*....|....*....|....*....
gi 1387212236 1550 SAADEKAVLAAEEVKTYKRRIEEMEDELQKTERSFKNQI 1588
Cdd:COG5185    513 EGVRSKLDQVAESLKDFMRARGYAHILALENLIPASELI 551
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1195-1526 1.56e-04

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in Plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface, is extremely large (although apparently lacking in repeat structure), and is important for the invasion of RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 46.97  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1195 WRTVLAVKSRVYQVTEQQISEKLKNIMKENA----ELVQKLSSYEQKIKESKKHVQ--ETKKQNMILSDEAIKfKDKIKS 1268
Cdd:TIGR01612  658 YSTIKSELSKIYEDDIDALYNELSSIVKENAidntEDKAKLDDLKSKIDKEYDKIQnmETATVELHLSNIENK-KNELLD 736
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1269 --LEETNEILGDTAKSLRAMLESEREQNAKnqdlISENKKSIEKLKDVISVNASEFSEVQIALNEaklseekvKSECHRV 1346
Cdd:TIGR01612  737 iiVEIKKHIHGEINKDLNKILEDFKNKEKE----LSNKINDYAKEKDELNKYKSKISEIKNHYND--------QINIDNI 804
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1347 QEENArlKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLevaLTHKDDNINALTNCitqlnrldceseSEDQNKGGS 1426
Cdd:TIGR01612  805 KDEDA--KQNYDKSKEYIKTISIKEDEIFKIINEMKFMKDDF---LNKVDKFINFENNC------------KEKIDSEHE 867
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1427 ESDELANgevggdrseKVKNQIKqmmdvsrtqtaisvvEEDLKLLQCKLRASMSTkcnLEDQIKKLEEDRSSLQSAKTVL 1506
Cdd:TIGR01612  868 QFAELTN---------KIKAEIS---------------DDKLNDYEKKFNDSKSL---INEINKSIEEEYQNINTLKKVD 920
                          330       340
                   ....*....|....*....|....*....
gi 1387212236 1507 E---------DECKTLRQKVEILNELYQQ 1526
Cdd:TIGR01612  921 EyikicentkESIEKFHNKQNILKEILNK 949
ATP-synt_Fo_b cd06503
F-type ATP synthase, membrane subunit b; Membrane subunit b is a component of the Fo complex ...
1187-1316 1.64e-04

F-type ATP synthase, membrane subunit b; Membrane subunit b is a component of the Fo complex of FoF1-ATP synthase. The F-type ATP synthases (FoF1-ATPase) consist of two structural domains: the F1 (assembly factor one) complex containing the soluble catalytic core, and the Fo (oligomycin sensitive factor) complex containing the membrane proton channel, linked together by a central stalk and a peripheral stalk. F1 is composed of alpha (or A), beta (B), gamma (C), delta (D) and epsilon (E) subunits with a stoichiometry of 3:3:1:1:1, while Fo consists of the three subunits a, b, and c (1:2:10-14). An oligomeric ring of 10-14 c subunits (c-ring) makes up the Fo rotor. The flux of protons through the ATPase channel (Fo) drives the rotation of the c-ring, which in turn is coupled to the rotation of the F1 complex gamma subunit rotor due to the permanent binding between the gamma and epsilon subunits of F1 and the c-ring of Fo. The F-ATP synthases are primarily found in the inner membranes of eukaryotic mitochondria, in the thylakoid membranes of chloroplasts or in the plasma membranes of bacteria. The F-ATP synthases are the primary producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). Alternatively, under conditions of low driving force, ATP synthases function as ATPases, thus generating a transmembrane proton or Na(+) gradient at the expense of energy derived from ATP hydrolysis. This group also includes the F-ATP synthase that has been found in the archaeon Candidatus Methanoperedens.


Pssm-ID: 349951 [Multi-domain]  Cd Length: 132  Bit Score: 43.20  E-value: 1.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1187 IVSFAV-------FFWRTVLAV-KSRvyqvtEQQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQEtkkqnmILsDE 1258
Cdd:cd06503      6 IINFLIllfilkkFLWKPILKAlDER-----EEKIAESLEEAEKAKEEAEELLAEYEEKLAEARAEAQE------II-EE 73
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1387212236 1259 AIKFKDKIKsleetNEILGDtakslrAMLESEREQNAKNQDLISENKKSIEKLKDVIS 1316
Cdd:cd06503     74 ARKEAEKIK-----EEILAE------AKEEAERILEQAKAEIEQEKEKALAELRKEVA 120
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1235-1596 1.97e-04

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 46.35  E-value: 1.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1235 EQKIKESKKHVQETKKQNMILSDEA--------IKFKD-KIKSLEETNEILGDTAKSLR--AMLESEREQnaknqdlisE 1303
Cdd:pfam10174  202 DQKEKENIHLREELHRRNQLQPDPAktkalqtvIEMKDtKISSLERNIRDLEDEVQMLKtnGLLHTEDRE---------E 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1304 NKKSIEKLKdvisvNASEFSEVQIalneaklseEKVKSECHRVQEENARLKKKKEQLQQEIKDwSKSHAELseqirsfek 1383
Cdd:pfam10174  273 EIKQMEVYK-----SHSKFMKNKI---------DQLKQELSKKESELLALQTKLETLTNQNSD-CKQHIEV--------- 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1384 sqkdLEVALTHKDDNINALTNCITQLnRLDCESESEDQNKGGSESDELANgEVGGDRSEKvkNQIKQMMDVSRTQtaISV 1463
Cdd:pfam10174  329 ----LKESLTAKEQRAAILQTEVDAL-RLRLEEKESFLNKKTKQLQDLTE-EKSTLAGEI--RDLKDMLDVKERK--INV 398
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1464 VEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDeckTLRQKVEILNELYQQKEMA------------- 1530
Cdd:pfam10174  399 LQKKIENLQEQLRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEE---ALSEKERIIERLKEQREREdrerleeleslkk 475
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1531 ----LQKKLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTERSFkNQIATHEKKAH 1596
Cdd:pfam10174  476 enkdLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEEC-SKLENQLKKAH 544
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1212-1588 2.15e-04

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 46.75  E-value: 2.15e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1212 QISEKLKNIMKENAELvqkLSSYEQKIKESKkhvQETKKQNMILSDEAIKFKDKI--KSLEETNEI-----LGDTAKSLR 1284
Cdd:PTZ00440   400 YFISKYTNIISLSEHT---LKAAEDVLKENS---QKIADYALYSNLEIIEIKKKYdeKINELKKSInqlktLISIMKSFY 473
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1285 AMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKlseeKVKSECHRVQEenarLKKKKEQLQQEI 1364
Cdd:PTZ00440   474 DLIISEKDSMDSKEKKESSDSNYQEKVDELLQIINSIKEKNNIVNNNFK----NIEDYYITIEG----LKNEIEGLIELI 545
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1365 KDWSKSHAELSEQirsfEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQNKGGSESDELANGEVGGDRSEKV 1444
Cdd:PTZ00440   546 KYYLQSIETLIKD----EKLKRSMKNDIKNKIKYIEENVDHIKDIISLNDEIDNIIQQIEELINEALFNKEKFINEKNDL 621
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1445 KNQIKQMMDvsrtqtaiSVVEEDLKLLQcklrASMSTkcNLEDQiKKLEEDRSSLQSAKTVL---EDECKTLRQKV---- 1517
Cdd:PTZ00440   622 QEKVKYILN--------KFYKGDLQELL----DELSH--FLDDH-KYLYHEAKSKEDLQTLLntsKNEYEKLEFMKsdni 686
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1387212236 1518 -EILNELYQQKEMALQKKLSQEEYERQEREQRLSAADEKAVlaaEEVKTYKRRIEEMEDELQKTErSFKNQI 1588
Cdd:PTZ00440   687 dNIIKNLKKELQNLLSLKENIIKKQLNNIEQDISNSLNQYT---IKYNDLKSSIEEYKEEEEKLE-VYKHQI 754
TPR_MLP1_2 pfam07926
TPR/MLP1/MLP2-like protein; The sequences featured in this family are similar to a region of ...
1262-1377 2.32e-04

TPR/MLP1/MLP2-like protein; The sequences featured in this family are similar to a region of human TPR protein and to yeast myosin-like proteins 1 (MLP1) and 2 (MLP2). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores. TPR is thought to be a component of nuclear pore complex-attached intra-nuclear filaments, and is implicated in nuclear protein import. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerization of kinase domains or by targeting these kinases to the nuclear pore complex. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity.


Pssm-ID: 462316 [Multi-domain]  Cd Length: 129  Bit Score: 43.01  E-value: 2.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1262 FKDKIKSLEETNEILGDTAKSLRAMLESEREQNAKNQD-------LISENKKSIEKLKdvisvnaSEFSEVQIALNEAKL 1334
Cdd:pfam07926    6 LQSEIKRLKEEAADAEAQLQKLQEDLEKQAEIAREAQQnyerelvLHAEDIKALQALR-------EELNELKAEIAELKA 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1387212236 1335 SEEKVKSEchrVQEENARLKKKKEQLQQEIKDWSKSHAELSEQ 1377
Cdd:pfam07926   79 EAESAKAE---LEESEESWEEQKKELEKELSELEKRIEDLNEQ 118
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1199-1380 2.88e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.53  E-value: 2.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1199 LAVKSRVYQVTEQQISE---KLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQN---MILSDEAikFKDKIKSLEet 1272
Cdd:COG4942     64 IAALARRIRALEQELAAleaELAELEKEIAELRAELEAQKEELAELLRALYRLGRQPplaLLLSPED--FLDAVRRLQ-- 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1273 neILGDTAKSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENAR 1352
Cdd:COG4942    140 --YLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAE 217
                          170       180
                   ....*....|....*....|....*...
gi 1387212236 1353 LKKKKEQLQQEIKDWSKSHAELSEQIRS 1380
Cdd:COG4942    218 LQQEAEELEALIARLEAEAAAAAERTPA 245
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1207-1586 3.27e-04

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in Plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface, is extremely large (although apparently lacking in repeat structure), and is important for the invasion of RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 46.20  E-value: 3.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1207 QVTEQQISEKLKNIMKENAELV---------QKLSSYEQKiKESKKHVQETKKQNMILSdEAIKFKDKIKSLE------- 1270
Cdd:TIGR01612  443 NIFKDDFDEFNKPIPKSKLKALekrffeifeEEWGSYDIK-KDIDENSKQDNTVKLILM-RMKDFKDIIDFMElykpdev 520
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1271 -ETNEILGDTAKSLRAMLESEREQNAK----------------NQDLISENKKSIeKLKDVISVNASEFSEV---QIALN 1330
Cdd:TIGR01612  521 pSKNIIGFDIDQNIKAKLYKEIEAGLKesyelaknwkkliheiKKELEEENEDSI-HLEKEIKDLFDKYLEIddeIIYIN 599
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1331 EAKLS-EEKVK--SECHRVQEENARLKKKKEQLQQEIKDWSK-SHAELSEQIRSFEKSQKDLEVALTH-KDDNINALTN- 1404
Cdd:TIGR01612  600 KLKLElKEKIKniSDKNEYIKKAIDLKKIIENNNAYIDELAKiSPYQVPEHLKNKDKIYSTIKSELSKiYEDDIDALYNe 679
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1405 --CITQLNRLDcesESEDQNKggseSDELANgevggdRSEKVKNQIkQMMDVSRTQTAISVVEEDL-KLLQCKLRASMST 1481
Cdd:TIGR01612  680 lsSIVKENAID---NTEDKAK----LDDLKS------KIDKEYDKI-QNMETATVELHLSNIENKKnELLDIIVEIKKHI 745
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1482 KCNLEDQIKKLEEDRSSLQsaktvledecKTLRQKVeilNELYQQKEM--ALQKKLSQEEYERQEREQRLSAADEKAVLA 1559
Cdd:TIGR01612  746 HGEINKDLNKILEDFKNKE----------KELSNKI---NDYAKEKDElnKYKSKISEIKNHYNDQINIDNIKDEDAKQN 812
                          410       420
                   ....*....|....*....|....*..
gi 1387212236 1560 AEEVKTYKRRIEEMEDELQKTERSFKN 1586
Cdd:TIGR01612  813 YDKSKEYIKTISIKEDEIFKIINEMKF 839
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1211-1435 3.32e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 45.21  E-value: 3.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1211 QQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKI----KSLEETNEILGDTAkslRAM 1286
Cdd:COG3883     19 QAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIaeaeAEIEERREELGERA---RAL 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1287 LESEREQNAKNQDLISENkksiekLKDVIS-VNASEfsevQIALNEAKLSEEkvksechrVQEENARLKKKKEQLQQEIK 1365
Cdd:COG3883     96 YRSGGSVSYLDVLLGSES------FSDFLDrLSALS----KIADADADLLEE--------LKADKAELEAKKAELEAKLA 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1366 DWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQNKGGSESDELANGE 1435
Cdd:COG3883    158 ELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAA 227
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1453-1587 3.67e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.68  E-value: 3.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1453 DVSRTQTAISVVEEDLKllqcKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEMALQ 1532
Cdd:COG4913    662 DVASAEREIAELEAELE----RLDASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLE 737
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1387212236 1533 KKLSQEEYERQER-EQRLSAADEKAVLA------AEEVKTYKRRIEEMEDELQKTERSFKNQ 1587
Cdd:COG4913    738 AAEDLARLELRALlEERFAAALGDAVERelrenlEERIDALRARLNRAEEELERAMRAFNRE 799
PHA03378 PHA03378
EBNA-3B; Provisional
1756-1929 4.18e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.44  E-value: 4.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1756 QEPEGPSVPSIPSLAEHPVSVSMAAKgppPFPGTPLMSSPVggPLLPPIRYGP--PPQLCGPFGPRPLPPPFGPGMRP-P 1832
Cdd:PHA03378   604 QTPEPPTTQSHIPETSAPRQWPMPLR---PIPMRPLRMQPI--TFNVLVFPTPhqPPQVEITPYKPTWTQIGHIPYQPsP 678
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1833 LG-----LREYAPGV--PPGKRDLPLDPREfLPPGHAPfRPLGSLGP-REYFFPGTRLPPPNHGPQDYPPSSAA--RDLP 1902
Cdd:PHA03378   679 TGantmlPIQWAPGTmqPPPRAPTPMRPPA-APPGRAQ-RPAAATGRaRPPAAAPGRARPPAAAPGRARPPAAApgRARP 756
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1387212236 1903 PSGS--RDEPPPASQGASQ-----DCSPALKQSP 1929
Cdd:PHA03378   757 PAAApgRARPPAAAPGAPTpqpppQAPPAPQQRP 790
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
1212-1383 4.23e-04

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 44.62  E-value: 4.23e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  1212 QISEKLKNIMKENAELVqklssyeqkiKESKKHVqeTKKQNmILSDEAIKFKDKIKSLEETneilgdtaksLRAMLESER 1291
Cdd:smart00787  140 KLLEGLKEGLDENLEGL----------KEDYKLL--MKELE-LLNSIKPKLRDRKDALEEE----------LRQLKQLED 196
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  1292 EQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQlqqeIKDWSKSH 1371
Cdd:smart00787  197 ELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQ----CRGFTFKE 272
                           170
                    ....*....|...
gi 1387212236  1372 AE-LSEQIRSFEK 1383
Cdd:smart00787  273 IEkLKEQLKLLQS 285
PRK12704 PRK12704
phosphodiesterase; Provisional
1263-1403 5.03e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 44.77  E-value: 5.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1263 KDKIKSLEETneilgdtAKSLRAmlESEREQNAKNQDLISENKKSIEKLKdvisvnaSEF-SEVQIALNEAKLSEEKVKS 1341
Cdd:PRK12704    30 EAKIKEAEEE-------AKRILE--EAKKEAEAIKKEALLEAKEEIHKLR-------NEFeKELRERRNELQKLEKRLLQ 93
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1387212236 1342 ECHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALthkdDNINALT 1403
Cdd:PRK12704    94 KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL----ERISGLT 151
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1634-1929 5.11e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 5.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1634 QEEPVIVKPMPGRPNTqnPPRRGPLSQNGSFGPSPVSGGECSPPLTADPPARPLSATLNRREMPRSEFG-SVDGPLPRPR 1712
Cdd:PHA03307   190 PAEPPPSTPPAAASPR--PPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPlPRPAPITLPT 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1713 WASEAS-GKPSASDPESGAAPTVNSssrssspskvmDEGKQTVPQEPEGPSVPSIPSLAEH-----------PVSVSMAA 1780
Cdd:PHA03307   268 RIWEASgWNGPSSRPGPASSSSSPR-----------ERSPSPSPSSPGSGPAPSSPRASSSssssresssssTSSSSESS 336
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1781 KGPPPFPGTPLMSSPVGGPLLPPIRYGPPPQLCGPfgprplpppfgpgmRPPLGLREYAPGVPPGKRDLPLDPREFLpPG 1860
Cdd:PHA03307   337 RGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRP--------------SRAPSSPAASAGRPTRRRARAAVAGRAR-RR 401
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1387212236 1861 HAPFRplgslgpreyfFPGTRLPPPNHGPQDYPPSSAAR--DLPPSGS---RDEPPPASQ---GASQDCSPALKQSP 1929
Cdd:PHA03307   402 DATGR-----------FPAGRPRPSPLDAGAASGAFYARypLLTPSGEpwpGSPPPPPGRvryGGLGDSRPGLWDAP 467
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1211-1378 5.14e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.76  E-value: 5.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1211 QQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETneiLGD--TAKSLRAM-- 1286
Cdd:COG1579     20 DRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQ---LGNvrNNKEYEALqk 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1287 -LESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKlseekvksechrvqeenARLKKKKEQLQQEIK 1365
Cdd:COG1579     97 eIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKK-----------------AELDEELAELEAELE 159
                          170
                   ....*....|...
gi 1387212236 1366 DWSKSHAELSEQI 1378
Cdd:COG1579    160 ELEAEREELAAKI 172
PRK01156 PRK01156
chromosome segregation protein; Provisional
1202-1534 5.29e-04

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 45.28  E-value: 5.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1202 KSRVYQVTEQQISE------KLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIK-SLEETNE 1274
Cdd:PRK01156   344 KKSRYDDLNNQILElegyemDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINvKLQDISS 423
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1275 ILGDTAKSLRAMLESEREQNAKNQDLISENKKSI-------EKLKDVISVNASEFSEVQIALNEaklseekVKSECHRVQ 1347
Cdd:PRK01156   424 KVSSLNQRIRALRENLDELSRNMEMLNGQSVCPVcgttlgeEKSNHIINHYNEKKSRLEEKIRE-------IEIEVKDID 496
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1348 EENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQNKGGSE 1427
Cdd:PRK01156   497 EKIVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAV 576
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1428 SDeLANGEVGGDRSEKVKNQIKQMMD------------VSRTQTAISVVEEDLKLLQCKL-------------------- 1475
Cdd:PRK01156   577 IS-LIDIETNRSRSNEIKKQLNDLESrlqeieigfpddKSYIDKSIREIENEANNLNNKYneiqenkilieklrgkidny 655
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1387212236 1476 -------------RASMSTKCN-LEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEMALQKK 1534
Cdd:PRK01156   656 kkqiaeidsiipdLKEITSRINdIEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLESM 728
Taxilin pfam09728
Myosin-like coiled-coil protein; Taxilin contains an extraordinarily long coiled-coil domain ...
1222-1533 5.34e-04

Myosin-like coiled-coil protein; Taxilin contains an extraordinarily long coiled-coil domain in its C-terminal half and is ubiquitously expressed. It is a novel binding partner of several syntaxin family members and is possibly involved in Ca2+-dependent exocytosis in neuroendocrine cells. Gamma-taxilin, described as leucine zipper protein Factor Inhibiting ATF4-mediated Transcription (FIAT), localizes to the nucleus in osteoblasts and dimerizes with ATF4 to form inactive dimers, thus inhibiting ATF4-mediated transcription.


Pssm-ID: 462861 [Multi-domain]  Cd Length: 302  Bit Score: 44.17  E-value: 5.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1222 KENAELVQ---KLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEILG---DTAKSLRAMLES-EREQN 1294
Cdd:pfam09728    1 KAARELMQllnKLDSPEEKLAALCKKYAELLEEMKRLQKDLKKLKKKQDQLQKEKDQLQselSKAILAKSKLEKlCRELQ 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1295 AKNQDLISENKKSIEKLKDvisvNASEFSE-VQIALNEAKLSEEKVKSECHRVQEENARLKKK-KEQLQQeikdwskshA 1372
Cdd:pfam09728   81 KQNKKLKEESKKLAKEEEE----KRKELSEkFQSTLKDIQDKMEEKSEKNNKLREENEELREKlKSLIEQ---------Y 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1373 ELSEQIrsFEK--SQKDLEVALthkddninaltnCITQLNRLDCESESEDQNKGGSESDELAngevggDRSEKVKNQIKQ 1450
Cdd:pfam09728  148 ELRELH--FEKllKTKELEVQL------------AEAKLQQATEEEEKKAQEKEVAKARELK------AQVQTLSETEKE 207
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1451 MmdvsRTQtaISVVEEDLKLLQCKLRAS-------------MSTKCnledqiKKLEEDRSSLQS--------------AK 1503
Cdd:pfam09728  208 L----REQ--LNLYVEKFEEFQDTLNKSnevfttfkkemekMSKKI------KKLEKENLTWKRkweksnkallemaeER 275
                          330       340       350
                   ....*....|....*....|....*....|
gi 1387212236 1504 TVLEDECKTLRQKVEILNELYQqkemALQK 1533
Cdd:pfam09728  276 QKLKEELEKLQKKLEKLENLCR----ALQA 301
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
198-462 6.69e-04

Ring-infected erythrocyte surface antigen; Provisional


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 44.78  E-value: 6.69e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  198 AELRERSEAQKSHPQV----------NSQTGHAQGErtsfesfgEMLQDKLKVPDSENNKTSNSSQVSHEQEKIDAYKLL 267
Cdd:PTZ00341   324 AEMKKRAEKPKKKKSKrrgwlccgggDIETVEPQQE--------EPVQDVGEHQINEYGDILPSLKASINNSAINYYDAV 395
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  268 KTEMTLDlktkfGSTADALVSDDEttrLVTSLEDD-FVEDLDpeyytvGKEEEENKEDFDELPLltftDGEDTKSPGHSG 346
Cdd:PTZ00341   396 KDGKYLD-----DDSSDALYTDED---LLFDLEKQkYMDMLD------GSEDESVEDNEEEHSG----DANEEELSVDEH 457
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  347 IEKHPTEKEQNSNKEHKVEETQPPGIKKGDKEIPKHREDTVFSDVMEgEENTDTDLESSDSKEEDDPLVMDSRLGKPRPE 426
Cdd:PTZ00341   458 VEEHNADDSGEQQSDDESGEHQSVNEIVEEQSVNEHVEEPTVADIVE-QETVDEHVEEPAVDENEEQQTADEHVEEPTIA 536
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1387212236  427 DHTDPEKAADHLVNVEVPKADSDDDPEVGAGLHMKD 462
Cdd:PTZ00341   537 EEHVEEEISTAEEHIEEPASDVQQDSEAAPTIEIPD 572
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1642-1929 6.83e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 6.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1642 PMPGRPNTQNPPRRGPLSQNGSFGPSPVSGGECSP---PLTADPPARPLSATLNRREMPRSEFGSVDGPLPRPRWASEAS 1718
Cdd:pfam03154  243 PSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPmphSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQ 322
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1719 GK----PSASDPESGAAPTVNSSSRSSSPSKVMDEGKQT-VPQEPeGPSVPSIPSLAEHPVSVSMAAKGPPPFPGTPLMS 1793
Cdd:pfam03154  323 QRihtpPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTpIPQLP-NPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSS 401
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1794 SPVGGPllpPIRYGPPPQLcgpfgPRPLPPPFGPGMRPPlGLREyAPGVPPGKRDLPLDPREFLPPGHAPFrplgslgPR 1873
Cdd:pfam03154  402 LSTHHP---PSAHPPPLQL-----MPQSQQLPPPPAQPP-VLTQ-SQSLPPPAASHPPTSGLHQVPSQSPF-------PQ 464
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1387212236 1874 EYFFPGTrlpPPNHGPQDYPPSSAardlPPSGSRDEPPPASQGASQDCSPALKQSP 1929
Cdd:pfam03154  465 HPFVPGG---PPPITPPSGPPTST----SSAMPGIQPPSSASVSSSGPVPAAVSCP 513
PTZ00121 PTZ00121
MAEBL; Provisional
1215-1629 7.08e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 44.75  E-value: 7.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1215 EKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEIlgDTAKSLRAMLESEREQN 1294
Cdd:PTZ00121  1070 EGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKA--EEARKAEDARKAEEARK 1147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1295 AKNQDLISENKKSIEKLKDVISVNASEFSEVQialnEAKLSEEKVKSECHRVQEENARLK--KKKEQLQQEIKDWSKSHA 1372
Cdd:PTZ00121  1148 AEDAKRVEIARKAEDARKAEEARKAEDAKKAE----AARKAEEVRKAEELRKAEDARKAEaaRKAEEERKAEEARKAEDA 1223
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1373 ELSEQIRSFEKSQKDLEVALTHKDDNINALTNC-----ITQLNRLDCESESEDQNKggseSDELANGEvggdrSEKVKNQ 1447
Cdd:PTZ00121  1224 KKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKfeearMAHFARRQAAIKAEEARK----ADELKKAE-----EKKKADE 1294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1448 IKQMMDVSRTQTAISVVEEDLKLLQCKLRAsmstkcnlEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQK 1527
Cdd:PTZ00121  1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKA--------EEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1528 EMALQKKlsqeeyerQEREQRLSAADEKA--VLAAEEVKTYKRRIEEMEDELQKTERSfknqiathEKKAHDnwLKARAA 1605
Cdd:PTZ00121  1367 EAAEKKK--------EEAKKKADAAKKKAeeKKKADEAKKKAEEDKKKADELKKAAAA--------KKKADE--AKKKAE 1428
                          410       420
                   ....*....|....*....|....
gi 1387212236 1606 ERAIAEEKREAANLRHKLLELTQK 1629
Cdd:PTZ00121  1429 EKKKADEAKKKAEEAKKADEAKKK 1452
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1275-1522 8.25e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.52  E-value: 8.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1275 ILGDTAKSLRAMLESEREQNAKN----QDLISENKKSIEKLKDVISV--NASEFSEVQIALNEAKLSEEKVKSECHRVQE 1348
Cdd:COG4913    603 VLGFDNRAKLAALEAELAELEEElaeaEERLEALEAELDALQERREAlqRLAEYSWDEIDVASAEREIAELEAELERLDA 682
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1349 EN---ARLKKKKEQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQNKGG 1425
Cdd:COG4913    683 SSddlAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDA 762
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1426 SESDELAN--GEVGGDRSEKVKNQ---IKQM--------MDVSRTQTAISVVEEDLKLLQcKLRASmstkcNL---EDQI 1489
Cdd:COG4913    763 VERELRENleERIDALRARLNRAEeelERAMrafnrewpAETADLDADLESLPEYLALLD-RLEED-----GLpeyEERF 836
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1387212236 1490 KKL-----EEDRSSLQSAktvLEDECKTLRQKVEILNE 1522
Cdd:COG4913    837 KELlnensIEFVADLLSK---LRRAIREIKERIDPLND 871
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1210-1533 8.66e-04

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 44.44  E-value: 8.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1210 EQQISEKLkNIMKENAELVQ-KLSSYEQKIKESKKHVQE-TKKQNMILS-----DEAIKFKDKIKSLEETNEILGDTAKS 1282
Cdd:PTZ00440   793 ENKISNDI-NILKENKKNNQdLLNSYNILIQKLEAHTEKnDEELKQLLQkfpteDENLNLKELEKEFNENNQIVDNIIKD 871
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1283 LRAMLES----EREQNAKNQDliSENKKSIEKLK----DVISVNASEFSEVQ----IALNEAKLSEEKVKSECHRVQEE- 1349
Cdd:PTZ00440   872 IENMNKNiniiKTLNIAINRS--NSNKQLVEHLLnnkiDLKNKLEQHMKIINtdniIQKNEKLNLLNNLNKEKEKIEKQl 949
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1350 -NARLKKKKEQLQQEIKDWSKSHAELS-------EQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRldcesesEDQ 1421
Cdd:PTZ00440   950 sDTKINNLKMQIEKTLEYYDKSKENINgndgthlEKLDKEKDEWEHFKSEIDKLNVNYNILNKKIDDLIK-------KQH 1022
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1422 NKGGSESDELANgEVGGDRSEKVKNQIKQM-------------MDVSRTQTAISvvEEDLKLLQCKlrasmstkcnLEDQ 1488
Cdd:PTZ00440  1023 DDIIELIDKLIK-EKGKEIEEKVDQYISLLekmktklssfhfnIDIKKYKNPKI--KEEIKLLEEK----------VEAL 1089
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1387212236 1489 IKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEMALQK 1533
Cdd:PTZ00440  1090 LKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEHYNKKKKSLEK 1134
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1320-1562 1.19e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 43.28  E-value: 1.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1320 SEFSEVQIALNEAKLSEEKVKSECHRVQEENARLKKKKEQLQQEIKdwskshaELSEQIRSFEKSQKDLEVALTHKDDNI 1399
Cdd:COG3883     16 PQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELE-------ALQAEIDKLQAEIAEAEAEIEERREEL 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1400 NAltncitQLNRLdcesesedQNKGGSES--DELANGEVGGDRSEKVkNQIKQMMDvsRTQTAISVVEEDLKLLQCKLRA 1477
Cdd:COG3883     89 GE------RARAL--------YRSGGSVSylDVLLGSESFSDFLDRL-SALSKIAD--ADADLLEELKADKAELEAKKAE 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1478 SMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEILNELYQQKEMALQKKLSQEEYERQEREQRLSAADEKAV 1557
Cdd:COG3883    152 LEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAA 231

                   ....*
gi 1387212236 1558 LAAEE 1562
Cdd:COG3883    232 AAAAA 236
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1222-1521 1.35e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 43.95  E-value: 1.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1222 KENAELVqkLSSYEQKIKESKKHVQET----KKQNMILSDEAIKFKDKIKSLEETNEILGDTAKSlramlESEREQNAKN 1297
Cdd:pfam15921   73 KEHIERV--LEEYSHQVKDLQRRLNESnelhEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRR-----ESQSQEDLRN 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1298 Q----------------DLISENKKSIEKLKDVISVNASEFSEVQIAL---NEA---KLSEEKVKSECHrVQEENARLKK 1355
Cdd:pfam15921  146 QlqntvheleaakclkeDMLEDSNTQIEQLRKMMLSHEGVLQEIRSILvdfEEAsgkKIYEHDSMSTMH-FRSLGSAISK 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1356 KKEQLQQEIKDWSKSHAELSEQIRSFE-KSQKDLEVALTHKDDNINALTncitqlnrldceSESEDQNKGGSESDELANG 1434
Cdd:pfam15921  225 ILRELDTEISYLKGRIFPVEDQLEALKsESQNKIELLLQQHQDRIEQLI------------SEHEVEITGLTEKASSARS 292
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1435 EvggdrSEKVKNQIKQMMDVSRTQTAISVveedlkllqcklrasmstkCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLR 1514
Cdd:pfam15921  293 Q-----ANSIQSQLEIIQEQARNQNSMYM-------------------RQLSDLESTVSQLRSELREAKRMYEDKIEELE 348

                   ....*..
gi 1387212236 1515 QKVEILN 1521
Cdd:pfam15921  349 KQLVLAN 355
DivIC pfam04977
Septum formation initiator; DivIC from B. subtilis is necessary for both vegetative and ...
1338-1380 1.37e-03

Septum formation initiator; DivIC from B. subtilis is necessary for both vegetative and sporulation septum formation. These proteins are mainly composed of an amino-terminal coiled-coil.


Pssm-ID: 428231 [Multi-domain]  Cd Length: 69  Bit Score: 39.12  E-value: 1.37e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1387212236 1338 KVKSECHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRS 1380
Cdd:pfam04977   10 QLKQEIAQLQAEIAKLKQENEELEAEIKDLKSDPDYIEERARS 52
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
1710-1914 1.73e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, and in developing mouse brain at embryonic day 14. In postnatal and adult mouse brain, SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 42.88  E-value: 1.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1710 RPRWASEASGKPSASDPESGAAPTVNSSSRSSSPSKVM----DEGKQTVPQEPEGPSVPSIP---SLAEHPVSVSMAAKG 1782
Cdd:pfam15279   79 RRKSASPASTRSESVSPGPSSSASPSSSPTSSNSSKPLisvaSSSKLLAPKPHEPPSLPPPPlppKKGRRHRPGLHPPLG 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1783 PPPfpGTPLMSSPVGGPLLPPIRYGPPPqlcgpfgprplpppfgpgmrpPLGLREYAPGVPPGkrdlpldpreFLPPGHA 1862
Cdd:pfam15279  159 RPP--GSPPMSMTPRGLLGKPQQHPPPS---------------------PLPAFMEPSSMPPP----------FLRPPPS 205
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1387212236 1863 PFRPLGSLGPReyffPGTRLPPPNHGPQDYPPSSAARDLPPSGSRDEPPPAS 1914
Cdd:pfam15279  206 IPQPNSPLSNP----MLPGIGPPPKPPRNLGPPSNPMHRPPFSPHHPPPPPT 253
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1226-1588 1.79e-03

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 43.49  E-value: 1.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1226 ELVQKLSSYEQKIKESKKHVQETKKQnmilsdeAIKFKDKIKSLEETNEILGDTAKSLRAMLESEREQnaknqdlISENK 1305
Cdd:PRK02224   318 ELEDRDEELRDRLEECRVAAQAHNEE-------AESLREDADDLEERAEELREEAAELESELEEAREA-------VEDRR 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1306 KSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENARLK----------KKKEQLQ---------QEIKD 1366
Cdd:PRK02224   384 EEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEatlrtarervEEAEALLeagkcpecgQPVEG 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1367 wsKSHA----ELSEQIRSFEKSQKDLEVALTHKDDNINALTNCI-------TQLNRLDCESESEDQNKGGSESDELA--- 1432
Cdd:PRK02224   464 --SPHVetieEDRERVEELEAELEDLEEEVEEVEERLERAEDLVeaedrieRLEERREDLEELIAERRETIEEKRERaee 541
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1433 ----NGEVGGDRSEKVKNQIKQMMDVSRTQTAISVVEEDLKLLQC------KLRASMSTKCNLEDQIKKLEEDRSSLQSA 1502
Cdd:PRK02224   542 lrerAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKEriesleRIRTLLAAIADAEDEIERLREKREALAEL 621
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1503 KTVLEDECKTLRQKVEILNELYQQK--EMALQKKlsqeeyerqereqrlsaadEKAVLAAEEVKTYKRRIEEMEDELQKT 1580
Cdd:PRK02224   622 NDERRERLAEKRERKRELEAEFDEAriEEAREDK-------------------ERAEEYLEQVEEKLDELREERDDLQAE 682

                   ....*...
gi 1387212236 1581 ERSFKNQI 1588
Cdd:PRK02224   683 IGAVENEL 690
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1278-1643 1.95e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.42  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1278 DTAKSLRAMLESEREQNAKNQDLISENKKSiekLKDVISVNASEFSEVQIALNEAKLSEEKVKSEcHRVQEENARLKKKK 1357
Cdd:TIGR00618  187 AKKKSLHGKAELLTLRSQLLTLCTPCMPDT---YHERKQVLEKELKHLREALQQTQQSHAYLTQK-REAQEEQLKKQQLL 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1358 EQLQQEIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKddninaltnCITQLNRLDCESESEDQNKGGSESDELAngevg 1437
Cdd:TIGR00618  263 KQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIK---------AVTQIEQQAQRIHTELQSKMRSRAKLLM----- 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1438 gdrseKVKNQIKQMMDVSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQiKKLEEDRSSLQSAKTVLEDECKTLRQKV 1517
Cdd:TIGR00618  329 -----KRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQ-HTLTQHIHTLQQQKTTLTQKLQSLCKEL 402
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1518 EILNELYQQ------KEMALQKKLSQEEYERQEREQRLSAADEKAVLAAEEVKTYKRRIEEMEDELQKTERSFKNQIATH 1591
Cdd:TIGR00618  403 DILQREQATidtrtsAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIH 482
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1387212236 1592 EKKAHdnwlKARAAERAIAEEKREAANLRHKLLELTQKMAMMQEEPVIVKPM 1643
Cdd:TIGR00618  483 LQETR----KKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRM 530
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1215-1516 2.08e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.83  E-value: 2.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1215 EKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIK-FKDKIKSLEETNEILGDTAKSLRAMLESEREQ 1293
Cdd:COG4717    149 EELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQdLAEELEELQQRLAELEEELEEAQEELEELEEE 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1294 --NAKNQDLISENKKSIEKLKDVI---------------------------------------------SVNASEFSEVQ 1326
Cdd:COG4717    229 leQLENELEAAALEERLKEARLLLliaaallallglggsllsliltiagvlflvlgllallflllarekASLGKEAEELQ 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1327 IALNEAKLSEEKVKSECHRVQ-------EENARLKKKKEQLQQEIKDWSKSHAELseQIRSFEKSQKDLevaLTHKD-DN 1398
Cdd:COG4717    309 ALPALEELEEEELEELLAALGlppdlspEELLELLDRIEELQELLREAEELEEEL--QLEELEQEIAAL---LAEAGvED 383
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1399 INALTNCITQLNRL--------DCESESEDQNKGGSESDELANGEVGGDRSEKVKNQIKQ--------MMDVSRTQTAIS 1462
Cdd:COG4717    384 EEELRAALEQAEEYqelkeeleELEEQLEELLGELEELLEALDEEELEEELEELEEELEEleeeleelREELAELEAELE 463
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1387212236 1463 VVEEDLKLLQCKLRASMstkcnLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQK 1516
Cdd:COG4717    464 QLEEDGELAELLQELEE-----LKAELRELAEEWAALKLALELLEEAREEYREE 512
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1663-1916 2.63e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.56  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1663 SFGPSPvSGGECSPPLTAdppARPLSATLNRREMPRSefgsVDGPLPRPRWASEASGKPSASDPESGAAPTVNSSSRSSS 1742
Cdd:PRK12323   362 AFRPGQ-SGGGAGPATAA---AAPVAQPAPAAAAPAA----AAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAL 433
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1743 PSKVMDEGKQTVPQEPEGPSVPSIPSLAEHPvsvSMAAKGPPPFPGTPLMSSPVGGPLLPPIRYGPPPQlcgPFGPRPLP 1822
Cdd:PRK12323   434 AAARQASARGPGGAPAPAPAPAAAPAAAARP---AAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPW---EELPPEFA 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1823 PPFGPGMRPPLGLREYAPGVPPGKRDlPLDPREFLPPGHAPfrplgslgpreyffpgTRLPPPNHGPqdyPPSSAARdlP 1902
Cdd:PRK12323   508 SPAPAQPDAAPAGWVAESIPDPATAD-PDDAFETLAPAPAA----------------APAPRAAAAT---EPVVAPR--P 565
                          250
                   ....*....|....
gi 1387212236 1903 PSGSRDEPPPASQG 1916
Cdd:PRK12323   566 PRASASGLPDMFDG 579
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1211-1445 2.86e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.20  E-value: 2.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1211 QQISEKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSLEETNEILGDTAKSLRAmlese 1290
Cdd:COG4372     62 EQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEA----- 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1291 reQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSE-EKVKSECHRVQEENARLKKKKEQLQQEIKDWSK 1369
Cdd:COG4372    137 --QIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEaEQALDELLKEANRNAEKEEELAEAEKLIESLPR 214
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1387212236 1370 SHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITQLNRLDCESESEDQNKGGSESDELANGEVGGDRSEKVK 1445
Cdd:COG4372    215 ELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEE 290
PTZ00419 PTZ00419
valyl-tRNA synthetase-like protein; Provisional
1226-1412 3.15e-03

valyl-tRNA synthetase-like protein; Provisional


Pssm-ID: 240411 [Multi-domain]  Cd Length: 995  Bit Score: 42.69  E-value: 3.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1226 ELVQKLSSYEQKIkES---KKHVQETKKQNMILSDEAIK-FKDKIKSLEETNEILGDTAK-SLRAMLESereQNAKNQDL 1300
Cdd:PTZ00419   798 ELYQRLPNYLRKS-ESisiAKYPQPNPGWNNEALDEEMKiIMSIVKSIRSLIATLGIPNKtKPDCYVTA---KDAELIEL 873
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1301 ISENKKSIEKLKDVISVNASEFSEVQIALNE--------AKLSEEKVKSECHRVQEENARLKKKKEQLQQEIKDWSKS-- 1370
Cdd:PTZ00419   874 IESAENLISTLAKIGSVSVIPPIEEEAEVPKgcgfdvvdNKVIIYLNLDEFIDLKKELAKLEKKLAKLQKSLESYLKKis 953
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1387212236 1371 ----HAELSEQIRSFEKSQKDlevALTHkddNINALTNCITQLNRL 1412
Cdd:PTZ00419   954 ipnyEDKVPEDVRKLNDEKID---ELNE---EIKQLEQAIEELKSL 993
MISS pfam15822
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ...
1758-1929 3.34e-03

MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII and specifically regulates MII spindle integrity during the CSF arrest.


Pssm-ID: 318115 [Multi-domain]  Cd Length: 238  Bit Score: 41.13  E-value: 3.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1758 PEGPS----VPSIPSLAEHPVSvsmaakgPPPFPGTPLMSSPVGGPLLPPIRYGPPPqlcgpfgprpLPPPFGPGMRPPL 1833
Cdd:pfam15822   59 PFGPAptgmYPSIPLTGPSPGP-------PAPFPPSGPSCPPPGGPYPAPTVPGPGP----------IGPYPTPNMPFPE 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1834 GLREYAPgvppgkrdlPLDPREFLPPGHAPFRPLGSLGPR---EYFFPGTRLPPPNHGPQDYPPSS--AARDLP----PS 1904
Cdd:pfam15822  122 LPRPYGA---------PTDPAAAAPSGPWGSMSSGPWAPGmggQYPAPNMPYPSPGPYPAVPPPQSpgAAPPVPwgtvPP 192
                          170       180
                   ....*....|....*....|....*
gi 1387212236 1905 GSRDEPPPASQGASQDCSPALKQSP 1929
Cdd:pfam15822  193 GPWGPPAPYPDPTGSYPMPGLYPTP 217
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1635-1812 3.45e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.17  E-value: 3.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1635 EEPVIVKPMPGRPNTQNPPRRGPLSQNGSFGPSPVS----------GGECSPPLTADPPARPLSATLNRREMPRSEFGSV 1704
Cdd:PRK12323   397 PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaarqasarGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAA 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1705 DGPLPRPRWASEASGKPSASDPESGAAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIPSLAEHPVSVSMAAKGPP 1784
Cdd:PRK12323   477 AAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAA 556
                          170       180
                   ....*....|....*....|....*...
gi 1387212236 1785 PFPGTPLMSSPVGGPLLPPIRYGPPPQL 1812
Cdd:PRK12323   557 TEPVVAPRPPRASASGLPDMFDGDWPAL 584
ClyA-like cd21116
family of the cytolysin A (ClyA) family alpha pore-forming toxins (alpha-PFT) including ...
1205-1394 3.56e-03

family of the cytolysin A (ClyA) family alpha pore-forming toxins (alpha-PFT) including Bacillus cereus HblB, Aeromonas hydrophila AhlB, Bacillus thuringiensis Cry6Aa and similar proteins; This family belongs to the ClyA family of alpha-PFT bacterial toxins. PFTs form the major group of virulence factors in many pathogenic bacteria and in general are critical components of the molecular offensive and defensive machinery of cells in all kingdoms of life. Bacterial PFTs facilitate the takeover of host resources by puncturing holes in the membrane. PFTs can be classified as alpha-PFTs and beta-PFTs depending on the secondary structures of their membrane component. Alpha-PFTs use a ring of amphipathic helices while beta-PFTs use a beta-barrel to construct the pore. Members of this family include the toxins: Bacillus cereus hemolysin binding component B (HblB or HBL-B) of the diarrheal enterotoxin hemolysin BL, Aeromonas hydrophila hemolytic (Ahl) component B (AhlB) of the tripartite AhlABC toxin, Vibrio cholerae cytotoxin motility associated killing factor A (MakA) cytotoxin, Xenorhabdus nematophila alpha-xenorhabdolysin (XaxA), Bacillus thuringiensis crystal 6Aa (Cry6Aa) parasporal crystal (Cry) toxin, and Bacillus cereus non-hemolytic enterotoxin (Nhe) component A (NheA) of the non-hemolytic enterotoxin Nhe, which, despite its name, is hemolytic, among others. In solution, ClyA proteins have an elongated, almost entirely alpha-helical structure, except for a short hydrophobic beta-hairpin known as the beta-tongue. Pore formation by ClyA requires circular oligomerization of the toxin by a sequential mechanism. This, in turn, concentrates the amphipathic helices in the center of the ring-like structure, forming a helical barrel that inserts into the membrane by a wedge-like mechanism. Compared with ClyA, NheA is almost entirely alpha-helical with an enlarged "head" domain, and an enlarged beta-tongue; it has been proposed that NheA could even form beta-barrel pores. 
Alpha-PFTs with similar structures are increasingly being found in eukaryotes, in particular as components of the immune systems of animals. This family may be distantly related to Escherichia coli alpha-PFT hemolysin E (HlyE, also known as ClyA or SheA).


Pssm-ID: 439149 [Multi-domain]  Cd Length: 224  Bit Score: 40.86  E-value: 3.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1205 VYQVTEQQiseklkNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIK-FKDKIKSLEET------NEILG 1277
Cdd:cd21116     15 VTAILNQP------NINLIPLDLLPSLNTHQALARAHALEWLNEIKPKLLSLPNDIIgYNNTFQSYYPDlieladNLIKG 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1278 DTA--KSLRAMLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEKVKSECHRVQEENAR--- 1352
Cdd:cd21116     89 DQGakQQLLQGLEALQSQVTKKQTSVTSFINELTTFKNDLDDDSRNLQTDATKAQAQVAVLNALKNQLNSLAEQIDAaid 168
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1387212236 1353 ----LKKKKEQLQQEIKDWSKShAELSEQIRSFEKSQKDLEVALTH 1394
Cdd:cd21116    169 alekLSNDWQTLDSDIKELITD-LEDAESSIDAAFLQADLKAAKAD 213
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1708-1927 3.96e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 3.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1708 LPRPRWASEASGKPS---------ASDPE-----------SGAAPTVNSSSRSSSPSKVMDEGKQTVPQEPEGPSVPSIP 1767
Cdd:pfam03154  107 ISRPNSPSEGEGESSdgrsvndegSSDPKdidqdnrstspSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSP 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1768 slaehPVSVSMAAKGPPPFPGTPLMSSPVGGPLLPPIRYGPPPQLCGPFGPRPLPPPFGPGMRPPLGLREYAPGVPPGKR 1847
Cdd:pfam03154  187 -----PPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQV 261
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1848 DLPLDPReflPPGHAPFRPLG---SLGPREYFFPGTRLP---PPNHGPQDYPPSSAARDLPPSGSRDEPPPaSQGASQDC 1921
Cdd:pfam03154  262 SPQPLPQ---PSLHGQMPPMPhslQTGPSHMQHPVPPQPfplTPQSSQSQVPPGPSPAAPGQSQQRIHTPP-SQSQLQSQ 337

                   ....*.
gi 1387212236 1922 SPALKQ 1927
Cdd:pfam03154  338 QPPREQ 343
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
1211-1310 4.27e-03

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 42.12  E-value: 4.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1211 QQISEKLKNIMKENAE----LVQKLSS----YEQKIKESKKHVQETKKQNMILSDEAIKFKDKIKSL-----EETNEILg 1277
Cdd:PRK00409   501 ENIIEEAKKLIGEDKEklneLIASLEElereLEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLleeaeKEAQQAI- 579
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1387212236 1278 DTAKS-----LRAMLESEREQNA--KNQDLIsENKKSIEK 1310
Cdd:PRK00409   580 KEAKKeadeiIKELRQLQKGGYAsvKAHELI-EARKRLNK 618
Drf_FH1 pfam06346
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs) ...
1750-1918 4.49e-03

Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.


Pssm-ID: 461881 [Multi-domain]  Cd Length: 157  Bit Score: 39.85  E-value: 4.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1750 GKQTVPQEPEGPSVPSIPSLAEHPVSVSmaakgPPPFPGTPLMSSPV---GGPLLPPirygPPPQLCGPFGPRPLPPPFG 1826
Cdd:pfam06346    8 GDSSTIPLPPGACIPTPPPLPGGGGPPP-----PPPLPGSAAIPPPPplpGGTSIPP----PPPLPGAASIPPPPPLPGS 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1827 PGMRPPLGLrEYAPGVPPGKRDLPLDPREFLPPghaPFRPLGslgpreyffPGTRLPPPNHGPQDYPPSSAARDLPPsgs 1906
Cdd:pfam06346   79 TGIPPPPPL-PGGAGIPPPPPPLPGGAGVPPPP---PPLPGG---------PGIPPPPPFPGGPGIPPPPPGMGMPP--- 142
                          170
                   ....*....|..
gi 1387212236 1907 rdePPPASQGAS 1918
Cdd:pfam06346  143 ---PPPFGFGVP 151
SH3 cd00174
Src Homology 3 domain superfamily; Src Homology 3 (SH3) domains are protein interaction ...
52-98 4.68e-03

Src Homology 3 domain superfamily; Src Homology 3 (SH3) domains are protein interaction domains that bind proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs. Thus, they are referred to as proline-recognition domains (PRDs). SH3 domains are less selective and show more diverse specificity compared to other PRDs. They have been shown to bind peptide sequences that lack the PxxP motif; examples include the PxxDY motif of Eps8 and the RKxxYxxY sequence in SKAP55. SH3 domain containing proteins play versatile and diverse roles in the cell, including the regulation of enzymes, changing the subcellular localization of signaling pathway components, and mediating the formation of multiprotein complex assemblies, among others. Many members of this superfamily are adaptor proteins that associate with a number of protein partners, facilitating complex formation and signal transduction.


Pssm-ID: 212690 [Multi-domain]  Cd Length: 51  Bit Score: 37.06  E-value: 4.68e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1387212236   52 ALEDFTGPDCRFVNFKKGDTVYVYYKLAGGspevWA-GSV-GHTFGYFP 98
Cdd:cd00174      4 ALYDYEAQDDDELSFKKGDIITVLEKDDDG----WWeGELnGGREGLFP 48
PRK12705 PRK12705
hypothetical protein; Provisional
1179-1381 5.14e-03

hypothetical protein; Provisional


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 41.62  E-value: 5.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1179 VLITASLGIVSFAVFFWRtvlavKSRVYQVTEQQISEKLKNIMKENAELVQKLSSYEQKIKESKKhvQETKKQNMILSDE 1258
Cdd:PRK12705    10 LLLLIGLLLGVLVVLLKK-----RQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQR--QEARREREELQRE 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1259 AIKFKDKIKSLEETNEILGDTAKSLramLESEREQNAKNQDLISENKKSIEKLKDVISVNASEFSEVQIALNEAKLSEEK 1338
Cdd:PRK12705    83 EERLVQKEEQLDARAEKLDNLENQL---EEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEK 159
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1387212236 1339 VKSecHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSF 1381
Cdd:PRK12705   160 AQR--VKKIEEEADLEAERKAQNILAQAMQRIASETASDLSVS 200
KASH_CCD pfam14662
Coiled-coil region of CCDC155 or KASH; This coiled-coil region is found in the central part of ...
1252-1382 5.24e-03

Coiled-coil region of CCDC155 or KASH; This coiled-coil region is found in the central part of KASH or Klarsicht/ANC-1/Syne/homology proteins. These are meiosis-specific proteins that localize at telomeres and interact with SUN1, thus being implicated in meiotic chromosome dynamics and homolog pairing.


Pssm-ID: 405365 [Multi-domain]  Cd Length: 191  Bit Score: 40.16  E-value: 5.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1252 NMILSDEAIKFKDKIKSLEETNEILGDTAKSLRAMLESEREQNAKNqdliSENKKSIEKLKdvISVNASEFSEVQIALNE 1331
Cdd:pfam14662   17 NQKLLQENSKLKATVETREETNAKLLEENLNLRKQAKSQQQAVQKE----KLLEEELEDLK--LIVNSLEEARRSLLAQN 90
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1387212236 1332 AKLSEEKVK--SECHRVQEENARLKKKKEQLQQEIKDWSKSHAELSEQIRSFE 1382
Cdd:pfam14662   91 KQLEKENQSllQEIESLQEENKKNQAERDKLQKKKKELLKSKACLKEQLHSCE 143
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1656-1913 6.85e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 6.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1656 GPLSQNGSFGPSPVsggeCSPPLTADPPARPLSATLNRREMPRSEFGSVDGPLPRPRWASEAsgKPSASDPESGAAPTVN 1735
Cdd:PHA03307   661 GLSAVPGLAFPRPA----CPPRALEACPARLESWLRELRDLRDAVYLARLRGDLPVAGGREE--RVAAVRAVSLVARTVA 734
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1736 SSsrssspskvmdegkqtVPQEPEGPS-----------VPSIPSLAEHPVSVSMAAKGPPpfpgTPLMSSPVGGPLLPPI 1804
Cdd:PHA03307   735 PL----------------VRYSPRRARarasawditdaLFSNPSLVPAKLAEALALLEPA----EPQRGAGSSPPVRAEA 794
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1805 RYGPPPQLCGPFGPRPLPPPFGPGMRPP-----LGLREYAPGVPPGK--RDLPLDPREFLPPGHAPFRPLGSLGPREYFF 1877
Cdd:PHA03307   795 AFRRPGRLRRSGPAADAASRTASKRKSRshtpdGGSESSGPARPPGAaaRPPPARSSESSKSKPAAAGGRARGKNGRRRP 874
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1387212236 1878 PGtrLPPPNHGPQDYPPSSAARDLPPSGSRDEPPPA 1913
Cdd:PHA03307   875 RP--PEPRARPGAAAPPKAAAAAPPAGAPAPRPRPA 908
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1206-1399 6.90e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 41.59  E-value: 6.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1206 YQVTEQQISEKLKNIMKENAELVQKLSSYEQKIKESK-KHVQETKKQNMIlsdeaikfKDKIKSLEETNEILGDTAKSLR 1284
Cdd:TIGR02169  348 ERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRdELKDYREKLEKL--------KREINELKRELDRLQEELQRLS 419
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1285 AMLEserEQNAKNQDLISENKKSIEKLKDVI-SVNASEFSEVQIALNEAKLSEE---------KVKSECHRVQEENARLK 1354
Cdd:TIGR02169  420 EELA---DLNAAIAGIEAKINELEEEKEDKAlEIKKQEWKLEQLAADLSKYEQElydlkeeydRVEKELSKLQRELAEAE 496
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1387212236 1355 KKKEQLQQEIKDWSKS-----------HAELSEQIRSFEKSQKDLEVALTHKDDNI 1399
Cdd:TIGR02169  497 AQARASEERVRGGRAVeevlkasiqgvHGTVAQLGSVGERYATAIEVAAGNRLNNV 552
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1215-1601 7.78e-03

rad50; All proteins in this family for which functions are known are involved in recombination, recombinational repair, and/or non-homologous end joining. They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 41.19  E-value: 7.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1215 EKLKNIMKENAELVQKLSSYEQKIKESKKHVQETKKQNMILSDEAIKF----KDKIKSLEETNEILGDTAKSLRAMLESE 1290
Cdd:TIGR00606  248 DPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVfqgtDEQLNDLYHNHQRTVREKERELVDCQRE 327
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1291 REQNAKNQDLISENKKSIEKLKDVISVNASEFSEvQIALNEAKLSEEKVKSECH--------RVQEENArLKKKKEQLQQ 1362
Cdd:TIGR00606  328 LEKLNKERRLLNQEKTELLVEQGRLQLQADRHQE-HIRARDSLIQSLATRLELDgfergpfsERQIKNF-HTLVIERQED 405
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1363 EIKDWSKSHAELSEQIRSFEKSQKDLEVALTHKDDNInaltncitqlnRLDCESESEDQNKGGSESDELANGEVGGDRSE 1442
Cdd:TIGR00606  406 EAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTI-----------ELKKEILEKKQEELKFVIKELQQLEGSSDRIL 474
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1443 KVKNQIKqmmdvsRTQTAISVVEEDlKLLQCKLRASMS---TKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVEI 1519
Cdd:TIGR00606  475 ELDQELR------KAERELSKAEKN-SLTETLKKEVKSlqnEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDK 547
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1520 LNELYQQKEMALQKKLSqeEYERQEREQRLSAADEKavlAAEEVKTYKRRIEEMEDELQKTERSfKNQIATHEKKAHDNW 1599
Cdd:TIGR00606  548 DEQIRKIKSRHSDELTS--LLGYFPNKKQLEDWLHS---KSKEINQTRDRLAKLNKELASLEQN-KNHINNELESKEEQL 621

                   ..
gi 1387212236 1600 LK 1601
Cdd:TIGR00606  622 SS 623
BAR_SNX cd07596
The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexins; BAR domains are dimerization, lipid ...
1226-1394 8.45e-03

The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexins; BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.


Pssm-ID: 153280 [Multi-domain]  Cd Length: 218  Bit Score: 39.65  E-value: 8.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1226 ELVQKLSSYEQKIKE-SK------KHVQETKKQNMILSDEAIKF----KDKIKSLEETNEILGDTAKSLRAMLESE-REQ 1293
Cdd:cd07596      8 EAKDYILKLEEQLKKlSKqaqrlvKRRRELGSALGEFGKALIKLakceEEVGGELGEALSKLGKAAEELSSLSEAQaNQE 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1294 NAKNQDLISENKKSIEKLKDVISVNASEFSEVQIA---LNEAKLSEEKVK----SECHRVQEENARLKKKKEQLQQEIKD 1366
Cdd:cd07596     88 LVKLLEPLKEYLRYCQAVKETLDDRADALLTLQSLkkdLASKKAQLEKLKaapgIKPAKVEELEEELEEAESALEEARKR 167
                          170       180
                   ....*....|....*....|....*....
gi 1387212236 1367 WSKSHAELSEQIRSFEKS-QKDLEVALTH 1394
Cdd:cd07596    168 YEEISERLKEELKRFHEErARDLKAALKE 196
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1209-1536 9.57e-03

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 41.36  E-value: 9.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1209 TEQQISEKLKNIMKENAELVQKLSS-YEQKIKESKKHVQETKKqnmiLSDEAIKfkdkikslEETNEILGDTAKSLRAML 1287
Cdd:PTZ00440   660 SKEDLQTLLNTSKNEYEKLEFMKSDnIDNIIKNLKKELQNLLS----LKENIIK--------KQLNNIEQDISNSLNQYT 727
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1288 ESEREQNaKNQDLISENKKSIEKLKDVISVNASEFSeVQIALNEAKLSEEKVKSEchRVQEENARLKKKKEQLQQEIKDW 1367
Cdd:PTZ00440   728 IKYNDLK-SSIEEYKEEEEKLEVYKHQIINRKNEFI-LHLYENDKDLPDGKNTYE--EFLQYKDTILNKENKISNDINIL 803
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1368 SKSHAELSEQIRSFEKSQKDLEVALTHKDDNINALTNCITqlNRLDCESESEDQNKGGSESDELANgevggdRSEKVKNQ 1447
Cdd:PTZ00440   804 KENKKNNQDLLNSYNILIQKLEAHTEKNDEELKQLLQKFP--TEDENLNLKELEKEFNENNQIVDN------IIKDIENM 875
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236 1448 IKQMMDVSRTQTAISVVEEDLKLLQCKLRASMSTKCNLEDQIKKLEEDRSSLQSAKTVLEDECKTLRQKVE--ILNELYQ 1525
Cdd:PTZ00440   876 NKNINIIKTLNIAINRSNSNKQLVEHLLNNKIDLKNKLEQHMKIINTDNIIQKNEKLNLLNNLNKEKEKIEkqLSDTKIN 955
                          330
                   ....*....|.
gi 1387212236 1526 QKEMALQKKLS 1536
Cdd:PTZ00440   956 NLKMQIEKTLE 966
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
735-912 9.80e-03

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 9.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  735 KQSKERSPEIQDKRLDVDLQNPEKPVSGAIKTDPETEKNKEETRHVSENERKNETAGKavdSLGRDAGGPVVEKEGSSPV 814
Cdd:PTZ00449   490 KKSKKKLAPIEEEDSDKHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGG---KPGETKEGEVGKKPGPAKE 566
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1387212236  815 HQKVQRPSEGSDVPGKKQNQTPELGEASQK-KDPDYLKEDNHEGHPKTSGLMEKPGVEPSKEDDEHAEKFVDPgSRGSAS 893
Cdd:PTZ00449   567 HKPSKIPTLSKKPEFPKDPKHPKDPEEPKKpKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPP-QRPSSP 645
                          170
                   ....*....|....*....
gi 1387212236  894 EDPDDDPFPWAPHAPVQPE 912
Cdd:PTZ00449   646 ERPEGPKIIKSPKPPKSPK 664
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
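
As a rough illustration of how the E-value threshold relates to the hits listed above, the following sketch parses the three "Pssm-ID" summary lines from this page and checks each against 0.01. The regular expression, the `parse_summary` helper, and the choice of an inclusive comparison (`<=`) are assumptions made for this example; they are not part of CD-Search or of any NCBI tool.

```python
import re

# Hypothetical pattern for the per-hit summary line shown on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


# Summary lines copied verbatim from the hits above.
lines = [
    "Pssm-ID: 153280 [Multi-domain]  Cd Length: 218  Bit Score: 39.65  E-value: 8.45e-03",
    "Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 41.36  E-value: 9.57e-03",
    "Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 9.80e-03",
]

# Threshold from the preset options; the inclusive comparison is an assumption.
E_VALUE_THRESHOLD = 0.01

hits = [parse_summary(line) for line in lines]
passing = [h for h in hits if h is not None and h["e_value"] <= E_VALUE_THRESHOLD]

for h in passing:
    print(h["pssm_id"], h["bit_score"], h["e_value"])
```

All three hits fall just under the threshold, which is consistent with their position at the end of the hit list.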

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.