NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|578837046|ref|XP_006724243|]
View 

calcineurin-binding protein cabin-1 isoform X10 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MEF2_binding pfam09047
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ...
2132-2166 1.29e-15

MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.


:

Pssm-ID: 370261 [Multi-domain]  Cd Length: 35  Bit Score: 72.19  E-value: 1.29e-15
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 578837046  2132 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2166
Cdd:pfam09047    1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
30-205 3.71e-11

Tetratricopeptide (TPR) repeat [General function prediction only];


:

Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 65.41  E-value: 3.71e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457     2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457    62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
                         170
                  ....*....|....*.
gi 578837046  190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457   142 DDADALYNLGIALEKL 157
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1890-2133 4.18e-11

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 4.18e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1890 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1962
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1963 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2039
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2040 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2118
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
                         250
                  ....*....|....*
gi 578837046 2119 kfPPEITVTPPTPTL 2133
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
 
Name Accession Description Interval E-value
MEF2_binding pfam09047
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ...
2132-2166 1.29e-15

MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.


Pssm-ID: 370261 [Multi-domain]  Cd Length: 35  Bit Score: 72.19  E-value: 1.29e-15
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 578837046  2132 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2166
Cdd:pfam09047    1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
MEF2_binding cd13839
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ...
2132-2166 6.82e-14

Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the mycocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.


Pssm-ID: 260103 [Multi-domain]  Cd Length: 35  Bit Score: 67.41  E-value: 6.82e-14
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 578837046 2132 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2166
Cdd:cd13839     1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
30-205 3.71e-11

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 65.41  E-value: 3.71e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457     2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457    62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
                         170
                  ....*....|....*.
gi 578837046  190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457   142 DDADALYNLGIALEKL 157
PHA03247 PHA03247
large tegument protein UL36; Provisional
1890-2133 4.18e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 4.18e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1890 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1962
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1963 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2039
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2040 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2118
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
                         250
                  ....*....|....*
gi 578837046 2119 kfPPEITVTPPTPTL 2133
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1890-2159 2.31e-10

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 65.75  E-value: 2.31e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1890 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1965
Cdd:pfam17823  120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1966 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2043
Cdd:pfam17823  199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  2044 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2115
Cdd:pfam17823  266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 578837046  2116 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2159
Cdd:pfam17823  346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1851-2131 6.20e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 48.23  E-value: 6.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1851 RVERIMSETYMLIKQHLPV--KVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvp 1928
Cdd:NF033839  229 QIVALIKELDELKKQALSEidNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP-- 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1929 adsvqrpsdaHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEP 2008
Cdd:NF033839  298 ----------GMQPSPQPEKK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLE 348
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2009 RHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP- 2087
Cdd:NF033839  349 TPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPe 419
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 578837046 2088 --PEGHPGKPE--PSRAKS----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2131
Cdd:NF033839  420 vkPQPEKPKPEvkPQPEKPkpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
1969-2109 1.70e-04

septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 46.30  E-value: 1.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1969 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2042
Cdd:NF040712  189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578837046 2043 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2109
Cdd:NF040712  269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
1889-2131 1.52e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 43.51  E-value: 1.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1889 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1964
Cdd:COG5180   152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1965 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2030
Cdd:COG5180   227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2031 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2104
Cdd:COG5180   307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
                         250       260
                  ....*....|....*....|....*..
gi 578837046 2105 LPNMPKLVIPSAATKFPPeiTVTPPTP 2131
Cdd:COG5180   382 APFQPPNGAPQPGLGRRG--APGPPMG 406
sucB TIGR01347
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ...
1991-2101 1.95e-03

2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]


Pssm-ID: 273565 [Multi-domain]  Cd Length: 403  Bit Score: 42.80  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1991 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2070
Cdd:TIGR01347   68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
                           90       100       110
                   ....*....|....*....|....*....|.
gi 578837046  2071 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2101
Cdd:TIGR01347  143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
TPR_12 pfam13424
Tetratricopeptide repeat;
36-119 2.07e-03

Tetratricopeptide repeat;


Pssm-ID: 315987 [Multi-domain]  Cd Length: 77  Bit Score: 38.91  E-value: 2.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046    36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424    3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70

                   ....
gi 578837046   116 AVML 119
Cdd:pfam13424   71 ALAL 74
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
1976-2119 4.43e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 40.06  E-value: 4.43e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1976 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2055
Cdd:cd21975    19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 578837046 2056 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2119
Cdd:cd21975    99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
 
Name Accession Description Interval E-value
MEF2_binding pfam09047
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ...
2132-2166 1.29e-15

MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.


Pssm-ID: 370261 [Multi-domain]  Cd Length: 35  Bit Score: 72.19  E-value: 1.29e-15
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 578837046  2132 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2166
Cdd:pfam09047    1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
MEF2_binding cd13839
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ...
2132-2166 6.82e-14

Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the mycocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.


Pssm-ID: 260103 [Multi-domain]  Cd Length: 35  Bit Score: 67.41  E-value: 6.82e-14
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 578837046 2132 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2166
Cdd:cd13839     1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
30-205 3.71e-11

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 65.41  E-value: 3.71e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457     2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457    62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
                         170
                  ....*....|....*.
gi 578837046  190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457   142 DDADALYNLGIALEKL 157
PHA03247 PHA03247
large tegument protein UL36; Provisional
1890-2133 4.18e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 4.18e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1890 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1962
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1963 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2039
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2040 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2118
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
                         250
                  ....*....|....*
gi 578837046 2119 kfPPEITVTPPTPTL 2133
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
PHA03247 PHA03247
large tegument protein UL36; Provisional
1888-2138 1.14e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.66  E-value: 1.14e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1888 LGAAAQRQASGDTPTTPKHPKDSRenffpVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQS 1967
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPA-----GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1968 KdPGPPRPhrpeATPSMASLGPEGEELARVAEGTSFPPqePRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTcsqegk 2047
Cdd:PHA03247 2798 L-PSPWDP----ADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSL-PLGGSVAPGGDVR------ 2863
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2048 lRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSgSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVT 2127
Cdd:PHA03247 2864 -RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTES-FALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
                         250
                  ....*....|.
gi 578837046 2128 PPTPTLLSPKG 2138
Cdd:PHA03247 2941 PPLAPTTDPAG 2951
PHA03247 PHA03247
large tegument protein UL36; Provisional
1901-2171 1.32e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 1.32e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1901 PTTPKHPKDSRENFFPVTVVPTAPDPVPADS-VQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPE 1979
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1980 ATPSMASLGPEGEELARVAEGTSFPPqeprhsPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqeGKLRPEPRRDGEAQ 2059
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPA------LPAAPAPPAVPAGPATPGGPARPARPPTTA----GPPAPAPPAAPAAG 2778
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2060 EAASETQPLSSPPTAASSKAPS-SGSAQPPEGHPGKPEPSRAKSRPLPNMPKlviPSAATKFPPEITVTPPTPTL----- 2133
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSpWDPADPPAAVLAPAAALPPAASPAGPLPP---PTSAQPTAPPPPPGPPPPSLplggs 2855
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 578837046 2134 LSPKGSISEETKQKLKSAILSAQSAANVRkeSLCQPAL 2171
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVR--RLARPAV 2891
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1896-2162 1.77e-10

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 66.73  E-value: 1.77e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1896 ASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATT--IITCPPSASA----STLDQSKD 1969
Cdd:PHA03307  124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPeeTARAPSSPPAepppSTPPAAAS 203
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1970 PGPPRPHRPEATPSM--ASLGPEGEELARVAEGTSFPPQEPRHSP--QVKMAPTSSPA--EPHCWPAEAALGTGAEPTCS 2043
Cdd:PHA03307  204 PRPPRRSSPISASASspAPAPGRSAADDAGASSSDSSSSESSGCGwgPENECPLPRPApiTLPTRIWEASGWNGPSSRPG 283
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2044 QEGKLRPEPRRDGEAQEAASETQPLSSPPT----------AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNmpklvi 2113
Cdd:PHA03307  284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRasssssssreSSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP------ 357
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 578837046 2114 PSAATKFPPE-ITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVR 2162
Cdd:PHA03307  358 PPADPSSPRKrPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRF 407
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1890-2159 2.31e-10

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 65.75  E-value: 2.31e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1890 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1965
Cdd:pfam17823  120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1966 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2043
Cdd:pfam17823  199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  2044 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2115
Cdd:pfam17823  266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 578837046  2116 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2159
Cdd:pfam17823  346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
34-199 7.02e-10

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 61.95  E-value: 7.02e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   34 AEAFALYHKALDLQkhdrfEESAKAYHELleASLLREAvssGDEKEGLKH--------PGLIlkySTYKNLAQLAAQRED 105
Cdd:COG0457    25 EEAIEDYEKALELD-----PDDAEALYNL--GLAYLRL---GRYEEALADyeqaleldPDDA---EALNNLGLALQALGR 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  106 LETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKAL 185
Cdd:COG0457    92 YEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKLGRYEEALELLEKLE 171
                         170
                  ....*....|....
gi 578837046  186 EKDCRYSKGLVLKE 199
Cdd:COG0457   172 AAALAALLAAALGE 185
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
30-188 2.30e-09

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 62.70  E-value: 2.30e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAsllreavssgdekeglkHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG3914    72 AALLLLAALLELAALLLQALGRYEEALALYRRALAL-----------------NPDN---AEALFNLGNLLLALGRLEEA 131
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578837046  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG3914   132 LAALRRALALNPDFAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLEEAIAAYRRALELD 210
PHA03247 PHA03247
large tegument protein UL36; Provisional
1897-2170 5.63e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.88  E-value: 5.63e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1897 SGDTPttPKHPKDSRENFFPVTVVPTAPDPVPADSVQRpSDAHTKPRPALAAATTIITCPPSASASTLDQSkdPGPPRPH 1976
Cdd:PHA03247 2548 AGDPP--PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVT-SRARRPDAPPQSARPRAPVDDRGDPRGPAPPS--PLPPDTH 2622
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1977 RPEATPSMASlgPEGEELARVAEGTSFPPQEPRHSPQVK-----------------MAPTSSPAEPHCWPAEAALGTGAE 2039
Cdd:PHA03247 2623 APDPPPPSPS--PAANEPDPHPPPTVPPPERPRDDPAPGrvsrprrarrlgraaqaSSPPQRPRRRAARPTVGSLTSLAD 2700
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2040 PTcsqegklrPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKP----EPSRAKSRPLPNMPklviPS 2115
Cdd:PHA03247 2701 PP--------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpgGPARPARPPTTAGP----PA 2768
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 578837046 2116 AAtkfPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPA 2170
Cdd:PHA03247 2769 PA---PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
30-205 1.13e-08

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 58.59  E-value: 1.13e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAS---------LLREAVSSGDEKEGLKHPGLILKYS-----TYKN 95
Cdd:COG2956    70 ERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDpddaealrlLAEIYEQEGDWEKAIEVLERLLKLGpenahAYCE 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   96 LAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYT 175
Cdd:COG2956   150 LAELYLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPE 229
                         170       180       190
                  ....*....|....*....|....*....|
gi 578837046  176 TCLYFICKALEKDCRYSKGLVLKEKIFEEQ 205
Cdd:COG2956   230 EALELLRKALELDPSDDLLLALADLLERKE 259
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
33-188 6.89e-08

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 53.66  E-value: 6.89e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   33 EAEAFALYHKALDLQKHDRFEESAKAYHELLEASllreavssGDEKEGlkhpglilkystYKNLAQLAAQREDLETAMEF 112
Cdd:COG4783     1 AACAEALYALAQALLLAGDYDEAEALLEKALELD--------PDNPEA------------FALLGEILLQLGDLDEAIVL 60
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578837046  113 YLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG4783    61 LHEALELDPDEPEARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALGRPDEAIAALEKALELD 136
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1866-2159 7.20e-08

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 58.00  E-value: 7.20e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1866 HLPVKVDEEAALEQAVKFCQVHLGAAAQrQASGDTPTTPK-HPKDS-RENFFPVTVVPTAPDPVPADSVQRPSDAHTKPR 1943
Cdd:pfam05109  453 HVPTNLTAPASTGPTVSTADVTSPTPAG-TTSGASPVTPSpSPRDNgTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPT 531
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1944 PALAAATTIITCPPSASASTLDQSKDPGPP-RPHRPEAT-PSMASLGPEGEELARVAEGTSfpPQEPRHSPQVK-----M 2016
Cdd:pfam05109  532 PNATSPTLGKTSPTSAVTTPTPNATSPTPAvTTPTPNATiPTLGKTSPTSAVTTPTPNATS--PTVGETSPQANttnhtL 609
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  2017 APTSSPAEPHCWP--AEAALGTGAEPTCSQEG---KLRPEPRRDG---EAQEAASETQPL--SSPPTAA---SSKAPSSG 2083
Cdd:pfam05109  610 GGTSSTPVVTSPPknATSAVTTGQHNITSSSTssmSLRPSSISETlspSTSDNSTSHMPLltSAHPTGGeniTQVTPAST 689
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578837046  2084 SAQPPEGHPGKPEPSRAKSRPLPNMpklviPSAATKfPPEITVTPPTPtllsPKGSISEETKQKLKSAILSAQSAA 2159
Cdd:pfam05109  690 STHHVSTSSPAPRPGTTSQASGPGN-----SSTSTK-PGEVNVTKGTP----PKNATSPQAPSGQKTAVPTVTSTG 755
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1921-2160 2.40e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 56.40  E-value: 2.40e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1921 PTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTldqskdpgPPRPHRPEATPSMASLGPEGEELARVAEG 2000
Cdd:PRK07003  374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA--------AAAATRAEAPPAAPAPPATADRGDDAADG 445
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2001 TSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG-AEPTCSQEgklrPEPRRDGEAQEAASETQPLSSPPTAASSKA 2079
Cdd:PRK07003  446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASdAPPDAAFE----PAPRAAAPSAATPAAVPDARAPAAASREDA 521
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2080 PSSGSAQPPEGHPGKP----EPSRA--------------------KSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLS 2135
Cdd:PRK07003  522 PAAAAPPAPEARPPTPaaaaPAARAggaaaaldvlrnagmrvssdRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRA 601
                         250       260
                  ....*....|....*....|....*
gi 578837046 2136 PKGSiseeTKQKLKSAILSAQSAAN 2160
Cdd:PRK07003  602 RAAT----GDAPPNGAARAEQAAES 622
PHA03378 PHA03378
EBNA-3B; Provisional
1899-2172 8.99e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 54.30  E-value: 8.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1899 DTPTTPKHP---KDSRENFFPVTVVPTAP---DPVPADSVQRPSdAHTKPRPALAAATTIITCPPSASAstldQSKDPGP 1972
Cdd:PHA03378  607 EPPTTQSHIpetSAPRQWPMPLRPIPMRPlrmQPITFNVLVFPT-PHQPPQVEITPYKPTWTQIGHIPY----QPSPTGA 681
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1973 PRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPaephcwpaeaalgtgaeptcsqeGKLRPep 2052
Cdd:PHA03378  682 NTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAP-----------------------GRARP-- 736
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2053 rrdgeAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSrpLPNMPKLVIPSAATKFPPeiTVTPPTPT 2132
Cdd:PHA03378  737 -----PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQ--APPAPQQRPRGAPTPQPP--PQAGPTSM 807
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 578837046 2133 LLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPALE 2172
Cdd:PHA03378  808 QLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALE 847
PHA03378 PHA03378
EBNA-3B; Provisional
1896-2146 1.07e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 54.30  E-value: 1.07e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1896 ASGDTPTTP--KHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIitcPPSASAstldqskdPGPP 1973
Cdd:PHA03378  646 LVFPTPHQPpqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM---RPPAAP--------PGRA 714
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1974 RPhrPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEGKLRPEPR 2053
Cdd:PHA03378  715 QR--PAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG-----RARPPAAAPGAPTPQPPPQAPPAPQ 787
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2054 RdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRplpNMPKLVIPSAATKFPPEItvtpPTPtl 2133
Cdd:PHA03378  788 Q--RPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR---GRPSLKKPAALERQAAAG----PTP-- 856
                         250
                  ....*....|...
gi 578837046 2134 lSPKGSISEETKQ 2146
Cdd:PHA03378  857 -SPGSGTSDKIVQ 868
dnaA PRK14086
chromosomal replication initiator protein DnaA;
1918-2137 1.19e-06

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 53.68  E-value: 1.19e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1918 TVVPTAPDPVPADSVQRPSDAHTKPRPAlaaattiitcpPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARV 1997
Cdd:PRK14086   91 SAGEPAPPPPHARRTSEPELPRPGRRPY-----------EGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWP 159
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1998 AEGTSFPPQEPRHSPqvkmaptsSPAEPHCWPAEAAlgTGAEPTCSQEGKLRPE---PRRDGEAQEaasetqPLSSPPTA 2074
Cdd:PRK14086  160 RAADDYGWQQQRLGF--------PPRAPYASPASYA--PEQERDREPYDAGRPEydqRRRDYDHPR------PDWDRPRR 223
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578837046 2075 ASSKAP--SSGSAQPPEGHPGKPEPSRAKSRPlpnmpklVIPSAATKFP--PEITVTPPTPTL-LSPK 2137
Cdd:PRK14086  224 DRTDRPepPPGAGHVHRGGPGPPERDDAPVVP-------IRPSAPGPLAaqPAPAPGPGEPTArLNPK 284
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1895-2121 1.34e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 53.73  E-value: 1.34e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1895 QASGDT-PTTPKHPKDSRENffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA-STLDQSKDPGP 1972
Cdd:PRK12323  367 QSGGGAgPATAAAAPVAQPA--PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlAAARQASARGP 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1973 PRPHRPEATPSMAslgPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEP 2052
Cdd:PRK12323  445 GGAPAPAPAPAAA---PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578837046 2053 RRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRpLPNMPKLVIPSAATKFP 2121
Cdd:PRK12323  522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG-LPDMFDGDWPALAARLP 589
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1884-2109 1.67e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.45  E-value: 1.67e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1884 CQVHLGAAAQRQASGDTPTTPKHPKDSRENffpvTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAST 1963
Cdd:PRK07764  582 WQVEAVVGPAPGAAGGEGPPAPASSGPPEE----AARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHV 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1964 LDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcWPAEAALGTGAEPTCS 2043
Cdd:PRK07764  658 AVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQ-PPQAAQGASAPSPAAD 736
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578837046 2044 QEGKLRPEPRRDGEAQEAasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMP 2109
Cdd:PRK07764  737 DPVPLPPEPDDPPDPAGA-----PAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1922-2129 2.47e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 52.96  E-value: 2.47e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1922 TAPDPVPADSVQRPSDAHTKPRPALAAATTiitcPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGT 2001
Cdd:PRK12323  372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAA----PPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGA 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2002 SFPPQEPRHSPQVKMAPTSSPAEPhcwpaEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASEtqpLSSPPTAASSKAPs 2081
Cdd:PRK12323  448 PAPAPAPAAAPAAAARPAAAGPRP-----VAAAAAAAPARAAPAAAPAPADDDPPPWEELPPE---FASPAPAQPDAAP- 518
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 578837046 2082 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPP 2129
Cdd:PRK12323  519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1889-2109 2.75e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.87  E-value: 2.75e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1889 GAAAQRQASGDTPTTPKHPKDSREnfFPVTVVPTAPDPVPADSVQRPSDahtkprPALAAATtiitcPPSASASTLDQSK 1968
Cdd:PHA03307   76 GTEAPANESRSTPTWSLSTLAPAS--PAREGSPTPPGPSSPDPPPPTPP------PASPPPS-----PAPDLSEMLRPVG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1969 DPGPPRPHRPEATPSMASLGPEGEE-------LARVAEGTSFPPQEPRHSPQVKMAP---TSSPAEPHCWPAEAALGTGA 2038
Cdd:PHA03307  143 SPGPPPAASPPAAGASPAAVASDAAssrqaalPLSSPEETARAPSSPPAEPPPSTPPaaaSPRPPRRSSPISASASSPAP 222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2039 EPTCSQEGKLR--------PEPRRDGEAQE-------AASETQP----LSSPPTAASSKAPSSGSAQPPEGHPGKPEPSR 2099
Cdd:PHA03307  223 APGRSAADDAGasssdsssSESSGCGWGPEnecplprPAPITLPtriwEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302
                         250
                  ....*....|
gi 578837046 2100 AKSRPLPNMP 2109
Cdd:PHA03307  303 PGSGPAPSSP 312
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
38-188 3.34e-06

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 49.19  E-value: 3.34e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   38 ALYHKALDLQKHDRFEESAKAYHELLEASLLREAVSSGDEKEGLKHPGLILKYSTYKNLAQLAAQREDLETAMEFYLEAV 117
Cdd:COG5010     2 RALEGFDRLPLYLLLLTKLRTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLEQAL 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578837046  118 MLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG5010    82 QLDPNNPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTS 152
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1932-2137 3.88e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.46  E-value: 3.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1932 VQRPSDAHTKPRPALAAATTIITCPPSASASTLD-QSKDPGPPRPHRPEATPSMASLGPEGEELarvaegtsFPPQEPrh 2010
Cdd:pfam03154  174 LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpQGSPATSQPPNQTQSTAAPHTLIQQTPTL--------HPQRLP-- 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  2011 SPQVKMAPTSSPAEPHCWPAEAAlgtgAEPTCSQEGKLRPEPRRDGEAQ-EAASETQPLSSPPTAASSKAPSSGSAQ--- 2086
Cdd:pfam03154  244 SPHPPLQPMTQPPPPSQVSPQPL----PQPSLHGQMPPMPHSLQTGPSHmQHPVPPQPFPLTPQSSQSQVPPGPSPAapg 319
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  2087 ---------PPEGHPGKPEPSRakSRPLPNMPkLVIPSAAtkfPPEITVTPPTPTLLSPK 2137
Cdd:pfam03154  320 qsqqrihtpPSQSQLQSQQPPR--EQPLPPAP-LSMPHIK---PPPTTPIPQLPNPQSHK 373
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1889-2151 5.04e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.08  E-value: 5.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1889 GAAAQRQASGDTPTTPKHPKDSRENffpvtvvPTAPDPVPADSVQRPSdahTKPRPALAAATTIiTCPPSASASTLDQSK 1968
Cdd:pfam03154  319 GQSQQRIHTPPSQSQLQSQQPPREQ-------PLPPAPLSMPHIKPPP---TTPIPQLPNPQSH-KHPPHLSGPSPFQMN 387
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1969 DPGPPRP-----------HRPEATPSMASLGPEGEELARvaegtsfPPQEPRHSPQVKMAPTSSPAEPHcwpaeaalGTG 2037
Cdd:pfam03154  388 SNLPPPPalkplsslsthHPPSAHPPPLQLMPQSQQLPP-------PPAQPPVLTQSQSLPPPAASHPP--------TSG 452
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  2038 AEPTCSQEgklrPEPRRDGEAQEAASETQPlSSPPTAASSKAPSSgsaQPPEghpgkpEPSRAKSRPLPNMPKLVIPSAA 2117
Cdd:pfam03154  453 LHQVPSQS----PFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGI---QPPS------SASVSSSGPVPAAVSCPLPPVQ 518
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 578837046  2118 TKFPP-----EITVTPPTPTLLSPKGSISEETKQKLKSA 2151
Cdd:pfam03154  519 IKEEAldeaeEPESPPPPPRSPSPEPTVVNTPSHASQSA 557
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
99-188 5.13e-06

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 46.70  E-value: 5.13e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   99 LAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARhAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCL 178
Cdd:COG3063     1 LYLKLGDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAI-ALEKALKLDPNNAEALLNLAELLLELGDYDEAL 79
                          90
                  ....*....|
gi 578837046  179 YFICKALEKD 188
Cdd:COG3063    80 AYLERALELD 89
PHA03379 PHA03379
EBNA-3A; Provisional
1895-2140 9.40e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 51.21  E-value: 9.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1895 QASGDTPTTPKHPKDSrenffPVTVVP----TAPDPVPADSVQRPSDAHTKPRPaLAAATTIITCP-------PSASAST 1963
Cdd:PHA03379  407 KASEPTYGTPRPPVEK-----PRPEVPqsleTATSHGSAQVPEPPPVHDLEPGP-LHDQHSMAPCPvaqlppgPLQDLEP 480
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1964 LDQskDPGPPRPHRPEATPSMASLGP---EGEELARVAEGTSFPPQEPRHSP-QVKMAPTSSPAEPHC-WPAEAALGTGA 2038
Cdd:PHA03379  481 GDQ--LPGVVQDGRPACAPVPAPAGPivrPWEASLSQVPGVAFAPVMPQPMPvEPVPVPTVALERPVCpAPPLIAMQGPG 558
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2039 EPTCSQEGKLR---------------PEPRRDGEAQ---EAASETQPLSSPP---TAASSKAPSSGSAQPPEG-HPGKPE 2096
Cdd:PHA03379  559 ETSGIVRVRERwrpapwtpnpprspsQMSVRDRLARlraEAQPYQASVEVQPpqlTQVSPQQPMEYPLEPEQQmFPGSPF 638
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....
gi 578837046 2097 PSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI 2140
Cdd:PHA03379  639 SQVADVMRAGGVPAMQPQYFDLPLQQPISQGAPLAPLRASMGPV 682
NlpI COG4785
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];
29-174 1.66e-05

Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443815 [Multi-domain]  Cd Length: 223  Bit Score: 48.37  E-value: 1.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   29 KEAQEAEAFALYHKALDLQKHDRF-----EESAKAYHELLEASLLREAVSSGDEK--EGLKHPGLIlkySTYKNLAQLAA 101
Cdd:COG4785     8 LLLALALAAAAASKAAILLAALLFaavlaLAIALADLALALAAAALAAAALAAERidRALALPDLA---QLYYERGVAYD 84
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578837046  102 QREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4785    85 SLGDYDLAIADFDQALELDPDLAEAYNNRGLAYLLLGDYDAALEDFDRALELDPDYAYAYLNRGIALYYLGRY 157
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1890-2143 1.76e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 1.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1890 AAAQRQASGDTPTT-PKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSasastldqsk 1968
Cdd:PHA03307   60 AACDRFEPPTGPPPgPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPS---------- 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1969 dPGPPRPH-----RPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAAlGTGAEPTCs 2043
Cdd:PHA03307  130 -PAPDLSEmlrpvGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP-PAAASPRP- 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2044 qegklrpePRRDGEAQEAASETQPlSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPpe 2123
Cdd:PHA03307  207 --------PRRSSPISASASSPAP-APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW-- 275
                         250       260
                  ....*....|....*....|
gi 578837046 2124 iTVTPPTPTLLSPKGSISEE 2143
Cdd:PHA03307  276 -NGPSSRPGPASSSSSPRER 294
NrfG COG4235
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ...
95-174 2.35e-05

Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443378 [Multi-domain]  Cd Length: 131  Bit Score: 46.15  E-value: 2.35e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   95 NLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4235    22 LLGRAYLRLGRYDEALAAYEKALRLDPDNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYLLGLAAFQQGDY 101
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1917-2133 2.71e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 2.71e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1917 VTVVPtAPDPVPADSVQRPSDAHTKPRPALAAATtiitcPPSASASTlDQSKDPGPPRPHRPEATPSMASLGPEGEELAR 1996
Cdd:PRK07764  584 VEAVV-GPAPGAAGGEGPPAPASSGPPEEAARPA-----APAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKH 656
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1997 VAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwpAEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASETQplsSPPTAAS 2076
Cdd:PRK07764  657 VAVPDASDGGDGWPAKAGGAAPAAPPPAP----APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP---QAAQGAS 729
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 578837046 2077 SKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTL 2133
Cdd:PRK07764  730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1890-2062 3.08e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.21  E-value: 3.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1890 AAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPaDSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1969
Cdd:PRK07764  622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP-DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1970 PGPPRPHRP------EATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCS 2043
Cdd:PRK07764  701 PAPAPAATPpagqadDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
                         170       180
                  ....*....|....*....|..
gi 578837046 2044 QEGKLRPEPRR---DGEAQEAA 2062
Cdd:PRK07764  781 EEEEMAEDDAPsmdDEDRRDAE 802
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1890-2109 3.90e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 49.08  E-value: 3.90e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1890 AAAQRQASGDTPTTPKHPKdsrenffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1969
Cdd:PRK07003  395 AVPAVTAVTGAAGAALAPK-------AAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERD 467
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1970 PGPPRPHRPEATPSMASLGPEGEELA--------------RVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC-WPAEAAL 2034
Cdd:PRK07003  468 AQPPADSGSASAPASDAPPDAAFEPApraaapsaatpaavPDARAPAAASREDAPAAAAPPAPEARPPTPAAaAPAARAG 547
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2035 GTGAEPTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSK---------APSSGSAQPPEghPGKPEPSRAKSR-- 2103
Cdd:PRK07003  548 GAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRvavqvptprARAATGDAPPN--GAARAEQAAESRga 625

                  ....*...
gi 578837046 2104 --PLPNMP 2109
Cdd:PRK07003  626 ppPWEDIP 633
PHA02682 PHA02682
ORF080 virion core protein; Provisional
1916-2022 4.12e-05

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 47.93  E-value: 4.12e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1916 PVTVVPTAPDP-VPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHR--PEAT------PSMAS 1986
Cdd:PHA02682   76 PSGQSPLAPSPaCAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPacPPSTrqcppaPPLPT 155
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 578837046 1987 LGPEGEELARVAEGTSFPPQEPRHS-PQVKMAPTSSP 2022
Cdd:PHA02682  156 PKPAPAAKPIFLHNQLPPPDYPAAScPTIETAPAASP 192
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1916-2104 5.07e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.63  E-value: 5.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1916 PVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAA-----TTIITCPPSASASTLDQSKDPGPpRPHRPEATPSMASLGPE 1990
Cdd:PHA03307   25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGaaacdRFEPPTGPPPGPGTEAPANESRS-TPTWSLSTLAPASPARE 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1991 GEELARVAEGTSFPPQ-EPRHSPqvkmAPTSSPAEPHCWPAEAALGTGAEPtcsqegklRPEPRRDGEAQEAASETQPLS 2069
Cdd:PHA03307  104 GSPTPPGPSSPDPPPPtPPPASP----PPSPAPDLSEMLRPVGSPGPPPAA--------SPPAAGASPAAVASDAASSRQ 171
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 578837046 2070 SPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP 2104
Cdd:PHA03307  172 AALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP 206
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
1956-2169 5.34e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 48.38  E-value: 5.34e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1956 PPSASASTldQSKDPGPPRPHRPEATPSMASLGPEGEELAR---VAEGTSF----PPQEPRHSPQVKMAPTSSPAEPHCW 2028
Cdd:PLN03209  329 PPKESDAA--DGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVprpLSPYTAYedlkPPTSPIPTPPSSSPASSKSVDAVAK 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2029 PAEAALGTGAEPTCS-QEGKLRPEPR---RDGEAQEAASETQPLSSP-PTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR 2103
Cdd:PLN03209  407 PAEPDVVPSPGSASNvPEVEPAQVEAkktRPLSPYARYEDLKPPTSPsPTAPTGVSPSVSSTSSVPAVPDTAPATAATDA 486
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578837046 2104 PLPNMPKlviPSAATKFPPEITVTPPT-PTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQP 2169
Cdd:PLN03209  487 AAPPPAN---MRPLSPYAVYDDLKPPTsPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
1-155 5.98e-05

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 45.34  E-value: 5.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046    1 MIRIAALNASSTIEDDHEGSFKSHKTQTKEAQEAEAFALYHKALDLQKhdRFEESAKAYHELLEAsllreavssgdekeg 80
Cdd:COG5010    21 RTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLG--DFEESLALLEQALQL--------------- 83
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578837046   81 lkHPGlilKYSTYKNLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNP 155
Cdd:COG5010    84 --DPN---NPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTSP 153
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1851-2131 6.20e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 48.23  E-value: 6.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1851 RVERIMSETYMLIKQHLPV--KVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvp 1928
Cdd:NF033839  229 QIVALIKELDELKKQALSEidNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP-- 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1929 adsvqrpsdaHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEP 2008
Cdd:NF033839  298 ----------GMQPSPQPEKK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLE 348
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2009 RHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP- 2087
Cdd:NF033839  349 TPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPe 419
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 578837046 2088 --PEGHPGKPE--PSRAKS----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2131
Cdd:NF033839  420 vkPQPEKPKPEvkPQPEKPkpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
PHA03378 PHA03378
EBNA-3B; Provisional
1901-2172 8.25e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.14  E-value: 8.25e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1901 PTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPHRPEA 1980
Cdd:PHA03378  676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA------RPPAAAPGRARPPAAAPGRARPPAA 749
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1981 TPSMA---SLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPA---EPHCWPAEAALGTGAEPTCSQEGKLRPEPRR 2054
Cdd:PHA03378  750 APGRArppAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTpqpPPQAGPTSMQLMPRAAPGQQGPTKQILRQLL 829
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2055 DGEAQEA-ASETQPLSSPPTAASSKAPSSGSA------QPPEGHPGKPEPSRAKSRplPNMPKLVIPSAATKFPPEIT-- 2125
Cdd:PHA03378  830 TGGVKRGrPSLKKPAALERQAAAGPTPSPGSGtsdkivQAPVFYPPVLQPIQVMRQ--LGSVRAAAASTVTQAPTEYTge 907
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 578837046 2126 ---VTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLC---QPALE 2172
Cdd:PHA03378  908 rrgVGPMHPTDIPPSKRAKTDAYVESQPPHGGQSHSFSVIWENVSqgqQQTLE 960
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
1969-2109 1.70e-04

septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 46.30  E-value: 1.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1969 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2042
Cdd:NF040712  189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578837046 2043 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2109
Cdd:NF040712  269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
PHA03291 PHA03291
envelope glycoprotein I; Provisional
1914-2137 2.32e-04

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 46.10  E-value: 2.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1914 FFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiiTCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEgee 1993
Cdd:PHA03291  203 FVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST--TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPE--- 277
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1994 larvaegtsfppqEPRHspQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRdgeaqeaASETQPLSSPPT 2073
Cdd:PHA03291  278 -------------ASRY--ELTVTQIIQIAIPASIIACVFLGSCACCLHRRCRRRRRRPAR-------IYRPPSPVAPSI 335
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578837046 2074 AASSKAPSSGSAQPPEGHPGKPePSRAKSRPLPN-MPKLVIPSAATKFP--PEITVTPPTPTLLSPK 2137
Cdd:PHA03291  336 SAVNEAALARLGDELKRHPPES-PRRSKRRSSQTmVPSLTAISEESEAPavVELSRSPRRPGGPTAR 401
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1979-2136 2.99e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.02  E-value: 2.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1979 EATPSMASLGPEGEELARVAEGTSFPPqeprhspqvkmAPTSSPAEPHCWPAEAAlgtGAEPTCSQEGKLRPEPRRDGEA 2058
Cdd:PRK12323  371 GAGPATAAAAPVAQPAPAAAAPAAAAP-----------APAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAA 436
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578837046 2059 QEAASETQPLSSPPTAASSKAPSsgSAQPPEGHPGKPEPSRAKSRPLPNMPKLViPSAATKFPPEITVTPPTPTLLSP 2136
Cdd:PRK12323  437 RQASARGPGGAPAPAPAPAAAPA--AAARPAAAGPRPVAAAAAAAPARAAPAAA-PAPADDDPPPWEELPPEFASPAP 511
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1897-2106 3.30e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 3.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1897 SGDTPTTPKHPK-DSRENFFPVT--VVPTAPDPVPADSVQRPSDAHTkPRPALAAATTIITCPpsasasTLDQSKDPGPp 1973
Cdd:PRK10263  295 SGNRATQPEYDEyDPLLNGAPITepVAVAAAATTATQSWAAPVEPVT-QTPPVASVDVPPAQP------TVAWQPVPGP- 366
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1974 rpHRPEatPSMASlGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPR 2053
Cdd:PRK10263  367 --QTGE--PVIAP-APEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQP 441
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 578837046 2054 RDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLP 2106
Cdd:PRK10263  442 VAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEP 494
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1934-2160 4.02e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 4.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1934 RPSDAHTKPRPALAAATTIITCPPSASASTldqskdPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEP----R 2009
Cdd:PRK12323  364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPA------AAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaaR 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2010 HSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEgkLRPEPRrdgeaqeAASETQPLSSPPTAAsskAPSSGSAQPPE 2089
Cdd:PRK12323  438 QASARGPGGAPAPA-----PAPAAAPAAAARPAAAG--PRPVAA-------AAAAAPARAAPAAAP---APADDDPPPWE 500
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2090 GHPGK-PEPSRAKSRPLPNM--------PKLVIPSAATKFPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAAN 2160
Cdd:PRK12323  501 ELPPEfASPAPAQPDAAPAGwvaesipdPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
PHA03381 PHA03381
tegument protein VP22; Provisional
1907-2047 4.63e-04

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 44.62  E-value: 4.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1907 PKDSRENFFPVTVVPTAPDPVPAD-SVQRPSDAHTKPRPALAAAT----------TIITCPPSASASTLDQSKDPGPPRP 1975
Cdd:PHA03381   11 PHGTDEVEADVYYDFISPDASPARvSFEEPADRARRGAGQARGRSqaerrfhhydEARADYPYYTGSSSEDERPADPRPS 90
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578837046 1976 HRPEATPSM----ASLGPEGEELARVAEGTSFPPqEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTCSQEGK 2047
Cdd:PHA03381   91 RRPHAQPEAsgpgPARGARGPAGSRGRGRRAESP-SPRDPPNPKGASAPRGRKSAC-ADSAALLDAPAPAAPKRQK 164
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
92-158 6.75e-04

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 40.92  E-value: 6.75e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578837046   92 TYKNLAQLAAQREDLETAMEFyLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHW 158
Cdd:COG3063    28 ALNNLGLLLLEQGRYDEAIAL-EKALKLDPNNAEALLNLAELLLELGDYDEALAYLERALELDPSAL 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
1925-2136 7.27e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 7.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1925 DPVPADSV-QRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGP-------------- 1989
Cdd:PHA03247 2452 DPFFARTIlGAPFSLSLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPailpdepvgepvhp 2531
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1990 ------EG-EELARVAEGTSFPPQEPRHSPQV--KMAPTSSPAePHcwPAEAALGTGAE----PTCSQEGKLRPEPRRDG 2056
Cdd:PHA03247 2532 rmltwiRGlEELASDDAGDPPPPLPPAAPPAApdRSVPPPRPA-PR--PSEPAVTSRARrpdaPPQSARPRAPVDDRGDP 2608
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2057 EAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHP-GKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTP--PTPTL 2133
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPP-SPSPAANEPDPHPPpTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrPRRRA 2687

                  ...
gi 578837046 2134 LSP 2136
Cdd:PHA03247 2688 ARP 2690
PHA03325 PHA03325
nuclear-egress-membrane-like protein; Provisional
1959-2128 9.07e-04

nuclear-egress-membrane-like protein; Provisional


Pssm-ID: 223044  Cd Length: 418  Bit Score: 44.10  E-value: 9.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1959 ASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQ-----VKMAPTSSPAEPhcwPAEAA 2033
Cdd:PHA03325  259 SSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLPPppvrrPRVKHPEAGKEE---PDGAR 335
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2034 LGTGAEPTCSQEGKLRPeprrdgeAQEAASETQPLSSPPTAASSKApSSGSAQPPEGHPGKPEPSRAKSRPLPnmpklvi 2113
Cdd:PHA03325  336 NAEAKEPAQPATSTSSK-------GSSSAQNKDSGSTGPGSSLAAA-SSFLEDDDFGSPPLDLTTSLRHMPSP------- 400
                         170
                  ....*....|....*
gi 578837046 2114 PSAATKFPPEITVTP 2128
Cdd:PHA03325  401 SVTSAPEPPSIPLTY 415
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1970-2107 1.05e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 1.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1970 PGPPRPHRPEATPSMASLGPEGEelarvaegtsfPPQEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPtcsqegklr 2049
Cdd:PRK07764  396 AAAPSAAAAAPAAAPAPAAAAPA-----------AAAAPAPAAAPQPAPAPAPAPAPP-SPAGNAPAGGAP--------- 454
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 578837046 2050 pePRRDGEAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHPGKPEPSRAKSRPLPN 2107
Cdd:PRK07764  455 --SPPPAAAPSAQPAPAPAAAPEPTAAP-APAPPAAPAPAAAPAAPAAPAAPAGADDA 509
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
1904-2136 1.10e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 44.15  E-value: 1.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1904 PKHPkDSRENFFPVTVVPTAPD-PVPADSVQRPSDAHTKPRPA--------LAAATTIITCPPS---ASASTLDQSKDPG 1971
Cdd:PLN03209  330 PKES-DAADGPKPVPTKPVTPEaPSPPIEEEPPQPKAVVPRPLspytayedLKPPTSPIPTPPSsspASSKSVDAVAKPA 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1972 PPRPH-RPEATPSMASLGPEGEELARVAEGTSF-------PPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCS 2043
Cdd:PLN03209  409 EPDVVpSPGSASNVPEVEPAQVEAKKTRPLSPYaryedlkPPTSPSPTAPTGVSPSVSST-----SSVPAVPDTAPATAA 483
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2044 QEGKLRPEPRrdgeaqeaaseTQPLSSPPTAASSKAPSSGSaqppeghPGKPEPSRAKSRPlPNMPKLVIPSAATKFPPE 2123
Cdd:PLN03209  484 TDAAAPPPAN-----------MRPLSPYAVYDDLKPPTSPS-------PAAPVGKVAPSST-NEVVKVGNSAPPTALADE 544
                         250
                  ....*....|...
gi 578837046 2124 ITVTPPTPTLLSP 2136
Cdd:PLN03209  545 QHHAQPKPRPLSP 557
PHA03247 PHA03247
large tegument protein UL36; Provisional
1891-2133 1.20e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 1.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1891 AAQRQASGDTPT-TPKHPKDSRENFFPV---------TVVPTAPDPVPADSVQRPSDAHTKPRpalaAATTIITCP-PSA 1959
Cdd:PHA03247  270 ETARGATGPPPPpEAAAPNGAAAPPDGVwgaalagapLALPAPPDPPPPAPAGDAEEEDDEDG----AMEVVSPLPrPRQ 345
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1960 SASTldqskdpGPPRPHRPEATP--SMASLGpEGEELARVAEgtsfPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG 2037
Cdd:PHA03247  346 HYPL-------GFPKRRRPTWTPpsSLEDLS-AGRHHPKRAS----LPTRKRRSARHAATPFARGPGGDDQTRPAAPVPA 413
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2038 AEPTCSQEGKLRPEPrrdgeaqeaasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKlVIPSAA 2117
Cdd:PHA03247  414 SVPTPAPTPVPASAP--------------PPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRK-ALDALR 478
                         250
                  ....*....|....*.
gi 578837046 2118 TKFPPEitvtPPTPTL 2133
Cdd:PHA03247  479 ERRPPE----PPGADL 490
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1868-2105 1.50e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.71  E-value: 1.50e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1868 PVKVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPkhpkdsrenffpvtvvptAPDPVPADSVQRPSDAHTKPRPALA 1947
Cdd:PRK12323  392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP------------------APEALAAARQASARGPGGAPAPAPA 453
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1948 -AATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLG-PEGEELarvaegtsfpPQEPrhspqvkmaPTSSPAEP 2025
Cdd:PRK12323  454 pAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpPPWEEL----------PPEF---------ASPAPAQP 514
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2026 HCWPAEAALGTGAEPTCSQEGKLRPEPRrdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKpEPSRAKSRPL 2105
Cdd:PRK12323  515 DAAPAGWVAESIPDPATADPDDAFETLA---PAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD-WPALAARLPV 590
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
1889-2131 1.52e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 43.51  E-value: 1.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1889 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1964
Cdd:COG5180   152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1965 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2030
Cdd:COG5180   227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2031 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2104
Cdd:COG5180   307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
                         250       260
                  ....*....|....*....|....*..
gi 578837046 2105 LPNMPKLVIPSAATKFPPeiTVTPPTP 2131
Cdd:COG5180   382 APFQPPNGAPQPGLGRRG--APGPPMG 406
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1970-2088 1.71e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 1.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1970 PGPPRPHRPEATPSMASLGPEgeelarvAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwPAEAALGTGAEPTCSQEGKLR 2049
Cdd:PRK07764  394 PAAAAPSAAAAAPAAAPAPAA-------AAPAAAAAPAPAAAPQPAPAPAPAPAPP---SPAGNAPAGGAPSPPPAAAPS 463
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 578837046 2050 PEPRR---DGEAQEAASETQPLSSPPTAASSKAPSSGSAQPP 2088
Cdd:PRK07764  464 AQPAPapaAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
sucB TIGR01347
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ...
1991-2101 1.95e-03

2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]


Pssm-ID: 273565 [Multi-domain]  Cd Length: 403  Bit Score: 42.80  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1991 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2070
Cdd:TIGR01347   68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
                           90       100       110
                   ....*....|....*....|....*....|.
gi 578837046  2071 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2101
Cdd:TIGR01347  143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
1928-2162 1.98e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 43.14  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  1928 PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARvaEGTSFPPQE 2007
Cdd:pfam03546  168 DSESSSEESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSE--ESSDSEEEA 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  2008 PRHSPQVKMAPTSSPAEPHCWPAEaalGTGAEPTCSQEGKLR---PEPRRDGEAQEAASetqpLSSPPTAASSKAP---S 2081
Cdd:pfam03546  246 PAAATPAQAKPALKTPQTKASPRK---GTPITPTSAKVPPVRvgtPAPWKAGTVTSPAC----ASSPAVARGAQRPeedS 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046  2082 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI-----------SEETKQKLKS 2150
Cdd:pfam03546  319 SSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAVAQvkaeaqedsesSEEESDSEEA 398
                          250
                   ....*....|..
gi 578837046  2151 AILSAQSAANVR 2162
Cdd:pfam03546  399 AATPAQVKASGK 410
TPR_12 pfam13424
Tetratricopeptide repeat;
36-119 2.07e-03

Tetratricopeptide repeat;


Pssm-ID: 315987 [Multi-domain]  Cd Length: 77  Bit Score: 38.91  E-value: 2.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046    36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424    3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70

                   ....
gi 578837046   116 AVML 119
Cdd:pfam13424   71 ALAL 74
PHA03321 PHA03321
tegument protein VP11/12; Provisional
1894-2138 2.10e-03

tegument protein VP11/12; Provisional


Pssm-ID: 223041 [Multi-domain]  Cd Length: 694  Bit Score: 43.41  E-value: 2.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1894 RQASGDTPTTPKHPKDSRENFFPVT-----------VVPTAPDPVPAdSVQRPSDAHTK---PRPAlaaattiitcPPSA 1959
Cdd:PHA03321  447 RARPGSTPACARRARAQRARDAGPEyvdplgalrrlPAGAAPPPEPA-AAPSPATYYTRmggGPPR----------LPPR 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1960 SASTLDQSKDPGPPRPHRPEATPSmASLGPEGEELARVAEGTSFPPQEPRHSPqvkmAPTSSPaephcwPAEaALGTGAE 2039
Cdd:PHA03321  516 NRATETLRPDWGPPAAAPPEQMED-PYLEPDDDRFDRRDGAAAAATSHPREAP----APDDDP------IYE-GVSDSEE 583
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2040 PTCSQegklRPEPR----RDGEAQEAASETQPLSSPptaassKAPSSGSAQPPEGHPGKP--EPSRAKSRPLPnmpklvi 2113
Cdd:PHA03321  584 PVYEE----IPTPRvyqnPLPRPMEGAGEPPDLDAP------TSPWVEEENPIYGWGDSPlfSPPPAARFPPP------- 646
                         250       260
                  ....*....|....*....|....*
gi 578837046 2114 PSAATKFPPEITVTPPTPTLLSPKG 2138
Cdd:PHA03321  647 DPALSPEPPALPAHRPRPGALAPDG 671
TPR_21 pfam09976
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.
48-151 3.55e-03

Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.


Pssm-ID: 430959 [Multi-domain]  Cd Length: 194  Bit Score: 41.03  E-value: 3.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046    48 KHDRFEESAKAYHELLEAsllreaVSSGDEKEGL--------KHPGlilkySTYKNLAQL-----AAQREDLETAMEfYL 114
Cdd:pfam09976   32 QRSQAEEASALYQQLLEA------VAAGDAAKAQaaaaqlkdEYGG-----TGYAALAALllakaAVEAGDLAAAKA-QL 99
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 578837046   115 EAVMLDSTDVNLwykiGHVA-LRLIRIPLARHAFEEGL 151
Cdd:pfam09976  100 EWVADNAKDEAL----KALArLRLARVLLAQGKYDEAL 133
PHA03369 PHA03369
capsid maturational protease; Provisional
1983-2170 3.87e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 42.29  E-value: 3.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1983 SMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALgTGAEPTCSQEGKLRPEPRRDGEAQEAA 2062
Cdd:PHA03369  349 KTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPM-TAYPPVPQFCGDPGLVSPYNPQSPGTS 427
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2063 SETQPLSS-PPT-AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLP-NMPKLVIPSAATKFPPEITVTPPTPTLLSPKGS 2139
Cdd:PHA03369  428 YGPEPVGPvPPQpTNPYVMPISMANMVYPGHPQEHGHERKRKRGGElKEELIETLKLVKKLKEEQESLAKELEATAHKSE 507
                         170       180       190
                  ....*....|....*....|....*....|.
gi 578837046 2140 ISEETKQKLKSAILSAQSAANVRKESLCQPA 2170
Cdd:PHA03369  508 IKKIAESEFKNAGAKTAAANIEPNCSADAAA 538
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
1917-2131 4.31e-03

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 42.28  E-value: 4.31e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1917 VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKdpgpprPHRPEATPSMASLGpegeelAR 1996
Cdd:PRK12727   62 TPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMA------LRQPVSVPRQAPAA------AP 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1997 VAEGTSFPPQEPRHSPQVKMapTSSPAEPHCWPAEAALGTGAEPTCSQegklRPEPRRDGEAQEAASETqPLSSPPTAAS 2076
Cdd:PRK12727  130 VRAASIPSPAAQALAHAAAV--RTAPRQEHALSAVPEQLFADFLTTAP----VPRAPVQAPVVAAPAPV-PAIAAALAAH 202
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 578837046 2077 SKAPSSGSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVTPPTP 2131
Cdd:PRK12727  203 AAYAQDDDEQLDDDGFDLDDAL-PQILPPAALPPIVVAPAAPAALAAVAAAAPAP 256
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
34-174 4.32e-03

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 41.25  E-value: 4.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046   34 AEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLIlkySTYKNLAQLAAQREDLETAMEFY 113
Cdd:COG2956     6 AAALGWYFKGLNYLLNGQPDKAIDLLEEALE-----------------LDPETV---EAHLALGNLYRRRGEYDRAIRIH 65
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578837046  114 LEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG2956    66 QKLLERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDPDDAEALRLLAEIYEQEGDW 126
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
1976-2119 4.43e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 40.06  E-value: 4.43e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1976 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2055
Cdd:cd21975    19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 578837046 2056 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2119
Cdd:cd21975    99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1892-2024 4.44e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.07  E-value: 4.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1892 AQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA---------- 1961
Cdd:PRK14971  360 AQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVdppaavpvnp 439
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578837046 1962 -STLDQSKDPGPPRPHRPEATPSMASLGPegeelarvaeGTSFPPQEPRHSPQ--VKMAPTSSPAE 2024
Cdd:PRK14971  440 pSTAPQAVRPAQFKEEKKIPVSKVSSLGP----------STLRPIQEKAEQATgnIKEAPTGTQKE 495
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1974-2165 9.64e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.00  E-value: 9.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 1974 RPHRPEATPSM---ASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEgklRP 2050
Cdd:PRK07994  360 HPAAPLPEPEVppqSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQG---AT 436
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837046 2051 EPRRDGEAqeAASETQPLSSPPTAASSKAPssgSAQPPEGHPGKPEPSRAKSR-PLPNMPKLVIPSAATKFPPEITVTPP 2129
Cdd:PRK07994  437 KAKKSEPA--AASRARPVNSALERLASVRP---APSALEKAPAKKEAYRWKATnPVEVKKEPVATPKALKKALEHEKTPE 511
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 578837046 2130 TPTLLSPKGSISE---ETKQKLKSAILSAQSAANVRKES 2165
Cdd:PRK07994  512 LAAKLAAEAIERDpwaALVSQLGLPGLVEQLALNAWKEE 550
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH