|
Name |
Accession |
Description |
Interval |
E-value |
| MEF2_binding |
pfam09047 |
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ... |
2126-2160 |
1.05e-15 |
|
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.
Pssm-ID: 370261 [Multi-domain] Cd Length: 35 Bit Score: 72.58 E-value: 1.05e-15
10 20 30
....*....|....*....|....*....|....*
gi 2217339257 2126 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2160
Cdd:pfam09047 1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
|
|
| MEF2_binding |
cd13839 |
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ... |
2126-2160 |
6.11e-14 |
|
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the mycocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.
Pssm-ID: 260103 [Multi-domain] Cd Length: 35 Bit Score: 67.41 E-value: 6.11e-14
10 20 30
....*....|....*....|....*....|....*
gi 2217339257 2126 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2160
Cdd:cd13839 1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
30-205 |
2.26e-11 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 66.18 E-value: 2.26e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457 2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457 62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
|
170
....*....|....*.
gi 2217339257 190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457 142 DDADALYNLGIALEKL 157
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1884-2127 |
5.41e-11 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 68.43 E-value: 5.41e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1884 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1956
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1957 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2033
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2034 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2112
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
|
250
....*....|....*
gi 2217339257 2113 kfPPEITVTPPTPTL 2127
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1884-2153 |
4.04e-10 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 64.60 E-value: 4.04e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1884 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1959
Cdd:pfam17823 120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1960 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2037
Cdd:pfam17823 199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2038 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2109
Cdd:pfam17823 266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 2217339257 2110 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2153
Cdd:pfam17823 346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1845-2125 |
6.85e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.84 E-value: 6.85e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1845 RVERIMSETYMLIKQHLPV--KVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvp 1922
Cdd:NF033839 229 QIVALIKELDELKKQALSEidNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP-- 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1923 adsvqrpsdaHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEP 2002
Cdd:NF033839 298 ----------GMQPSPQPEKK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLE 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2003 RHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP- 2081
Cdd:NF033839 349 TPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPe 419
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 2217339257 2082 --PEGHPGKPE--PSRAKS----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2125
Cdd:NF033839 420 vkPQPEKPKPEvkPQPEKPkpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ... |
1963-2103 |
1.78e-04 |
|
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 46.30 E-value: 1.78e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1963 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2036
Cdd:NF040712 189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217339257 2037 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2103
Cdd:NF040712 269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
|
|
| TPR_12 |
pfam13424 |
Tetratricopeptide repeat; |
36-119 |
1.60e-03 |
|
Tetratricopeptide repeat;
Pssm-ID: 315987 [Multi-domain] Cd Length: 77 Bit Score: 39.29 E-value: 1.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424 3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70
|
....
gi 2217339257 116 AVML 119
Cdd:pfam13424 71 ALAL 74
|
|
| sucB |
TIGR01347 |
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ... |
1985-2095 |
1.95e-03 |
|
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]
Pssm-ID: 273565 [Multi-domain] Cd Length: 403 Bit Score: 42.80 E-value: 1.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1985 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2064
Cdd:TIGR01347 68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
|
90 100 110
....*....|....*....|....*....|.
gi 2217339257 2065 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2095
Cdd:TIGR01347 143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
1883-2125 |
2.07e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 43.13 E-value: 2.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1883 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1958
Cdd:COG5180 152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1959 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2024
Cdd:COG5180 227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2025 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2098
Cdd:COG5180 307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
|
250 260
....*....|....*....|....*..
gi 2217339257 2099 LPNMPKLVIPSAATKFPPeiTVTPPTP 2125
Cdd:COG5180 382 APFQPPNGAPQPGLGRRG--APGPPMG 406
|
|
| KLF9_13_N-like |
cd21975 |
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ... |
1970-2113 |
5.69e-03 |
|
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.
Pssm-ID: 409240 [Multi-domain] Cd Length: 163 Bit Score: 39.67 E-value: 5.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1970 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2049
Cdd:cd21975 19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217339257 2050 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2113
Cdd:cd21975 99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| MEF2_binding |
pfam09047 |
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ... |
2126-2160 |
1.05e-15 |
|
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.
Pssm-ID: 370261 [Multi-domain] Cd Length: 35 Bit Score: 72.58 E-value: 1.05e-15
10 20 30
....*....|....*....|....*....|....*
gi 2217339257 2126 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2160
Cdd:pfam09047 1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
|
|
| MEF2_binding |
cd13839 |
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ... |
2126-2160 |
6.11e-14 |
|
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the mycocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.
Pssm-ID: 260103 [Multi-domain] Cd Length: 35 Bit Score: 67.41 E-value: 6.11e-14
10 20 30
....*....|....*....|....*....|....*
gi 2217339257 2126 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2160
Cdd:cd13839 1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
30-205 |
2.26e-11 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 66.18 E-value: 2.26e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457 2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457 62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
|
170
....*....|....*.
gi 2217339257 190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457 142 DDADALYNLGIALEKL 157
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1884-2127 |
5.41e-11 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 68.43 E-value: 5.41e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1884 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1956
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1957 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2033
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2034 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2112
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
|
250
....*....|....*
gi 2217339257 2113 kfPPEITVTPPTPTL 2127
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1882-2132 |
1.47e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 67.27 E-value: 1.47e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1882 LGAAAQRQASGDTPTTPKHPKDSRenffpVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQS 1961
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPA-----GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1962 KdPGPPRPhrpeATPSMASLGPEGEELARVAEGTSFPPqePRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTcsqegk 2041
Cdd:PHA03247 2798 L-PSPWDP----ADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSL-PLGGSVAPGGDVR------ 2863
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2042 lRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSgSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVT 2121
Cdd:PHA03247 2864 -RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTES-FALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
|
250
....*....|.
gi 2217339257 2122 PPTPTLLSPKG 2132
Cdd:PHA03247 2941 PPLAPTTDPAG 2951
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1895-2165 |
1.72e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 66.89 E-value: 1.72e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1895 PTTPKHPKDSRENFFPVTVVPTAPDPVPADS-VQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPE 1973
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1974 ATPSMASLGPEGEELARVAEGTSFPPqeprhsPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqeGKLRPEPRRDGEAQ 2053
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPA------LPAAPAPPAVPAGPATPGGPARPARPPTTA----GPPAPAPPAAPAAG 2778
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2054 EAASETQPLSSPPTAASSKAPS-SGSAQPPEGHPGKPEPSRAKSRPLPNMPKlviPSAATKFPPEITVTPPTPTL----- 2127
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSpWDPADPPAAVLAPAAALPPAASPAGPLPP---PTSAQPTAPPPPPGPPPPSLplggs 2855
|
250 260 270
....*....|....*....|....*....|....*...
gi 2217339257 2128 LSPKGSISEETKQKLKSAILSAQSAANVRkeSLCQPAL 2165
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVR--RLARPAV 2891
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1890-2156 |
2.37e-10 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 66.35 E-value: 2.37e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1890 ASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATT--IITCPPSASA----STLDQSKD 1963
Cdd:PHA03307 124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPeeTARAPSSPPAepppSTPPAAAS 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1964 PGPPRPHRPEATPSM--ASLGPEGEELARVAEGTSFPPQEPRHSP--QVKMAPTSSPA--EPHCWPAEAALGTGAEPTCS 2037
Cdd:PHA03307 204 PRPPRRSSPISASASspAPAPGRSAADDAGASSSDSSSSESSGCGwgPENECPLPRPApiTLPTRIWEASGWNGPSSRPG 283
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2038 QEGKLRPEPRRDGEAQEAASETQPLSSPPT----------AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNmpklvi 2107
Cdd:PHA03307 284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRasssssssreSSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP------ 357
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2108 PSAATKFPPE-ITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVR 2156
Cdd:PHA03307 358 PPADPSSPRKrPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRF 407
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1884-2153 |
4.04e-10 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 64.60 E-value: 4.04e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1884 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1959
Cdd:pfam17823 120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1960 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2037
Cdd:pfam17823 199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2038 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2109
Cdd:pfam17823 266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 2217339257 2110 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2153
Cdd:pfam17823 346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
34-199 |
4.36e-10 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 62.33 E-value: 4.36e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 34 AEAFALYHKALDLQkhdrfEESAKAYHELleASLLREAvssGDEKEGLKH--------PGLIlkySTYKNLAQLAAQRED 105
Cdd:COG0457 25 EEAIEDYEKALELD-----PDDAEALYNL--GLAYLRL---GRYEEALADyeqaleldPDDA---EALNNLGLALQALGR 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 106 LETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKAL 185
Cdd:COG0457 92 YEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKLGRYEEALELLEKLE 171
|
170
....*....|....
gi 2217339257 186 EKDCRYSKGLVLKE 199
Cdd:COG0457 172 AAALAALLAAALGE 185
|
|
| Spy |
COG3914 |
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ... |
30-188 |
2.22e-09 |
|
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443119 [Multi-domain] Cd Length: 658 Bit Score: 62.70 E-value: 2.22e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAsllreavssgdekeglkHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG3914 72 AALLLLAALLELAALLLQALGRYEEALALYRRALAL-----------------NPDN---AEALFNLGNLLLALGRLEEA 131
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217339257 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG3914 132 LAALRRALALNPDFAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLEEAIAAYRRALELD 210
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1891-2164 |
7.16e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.49 E-value: 7.16e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1891 SGDTPttPKHPKDSRENFFPVTVVPTAPDPVPADSVQRpSDAHTKPRPALAAATTIITCPPSASASTLDQSkdPGPPRPH 1970
Cdd:PHA03247 2548 AGDPP--PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVT-SRARRPDAPPQSARPRAPVDDRGDPRGPAPPS--PLPPDTH 2622
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1971 RPEATPSMASlgPEGEELARVAEGTSFPPQEPRHSPQVK-----------------MAPTSSPAEPHCWPAEAALGTGAE 2033
Cdd:PHA03247 2623 APDPPPPSPS--PAANEPDPHPPPTVPPPERPRDDPAPGrvsrprrarrlgraaqaSSPPQRPRRRAARPTVGSLTSLAD 2700
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2034 PTcsqegklrPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKP----EPSRAKSRPLPNMPklviPS 2109
Cdd:PHA03247 2701 PP--------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpgGPARPARPPTTAGP----PA 2768
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2217339257 2110 AAtkfPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPA 2164
Cdd:PHA03247 2769 PA---PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
30-205 |
7.93e-09 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 58.97 E-value: 7.93e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAS---------LLREAVSSGDEKEGLKHPGLILKYS-----TYKN 95
Cdd:COG2956 70 ERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDpddaealrlLAEIYEQEGDWEKAIEVLERLLKLGpenahAYCE 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 96 LAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYT 175
Cdd:COG2956 150 LAELYLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPE 229
|
170 180 190
....*....|....*....|....*....|
gi 2217339257 176 TCLYFICKALEKDCRYSKGLVLKEKIFEEQ 205
Cdd:COG2956 230 EALELLRKALELDPSDDLLLALADLLERKE 259
|
|
| BepA |
COG4783 |
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ... |
33-188 |
4.81e-08 |
|
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443813 [Multi-domain] Cd Length: 139 Bit Score: 54.04 E-value: 4.81e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 33 EAEAFALYHKALDLQKHDRFEESAKAYHELLEASllreavssGDEKEGlkhpglilkystYKNLAQLAAQREDLETAMEF 112
Cdd:COG4783 1 AACAEALYALAQALLLAGDYDEAEALLEKALELD--------PDNPEA------------FALLGEILLQLGDLDEAIVL 60
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217339257 113 YLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG4783 61 LHEALELDPDEPEARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALGRPDEAIAALEKALELD 136
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1860-2153 |
7.18e-08 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 58.00 E-value: 7.18e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1860 HLPVKVDEEAALEQAVKFCQVHLGAAAQrQASGDTPTTPK-HPKDS-RENFFPVTVVPTAPDPVPADSVQRPSDAHTKPR 1937
Cdd:pfam05109 453 HVPTNLTAPASTGPTVSTADVTSPTPAG-TTSGASPVTPSpSPRDNgTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPT 531
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1938 PALAAATTIITCPPSASASTLDQSKDPGPP-RPHRPEAT-PSMASLGPEGEELARVAEGTSfpPQEPRHSPQVK-----M 2010
Cdd:pfam05109 532 PNATSPTLGKTSPTSAVTTPTPNATSPTPAvTTPTPNATiPTLGKTSPTSAVTTPTPNATS--PTVGETSPQANttnhtL 609
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2011 APTSSPAEPHCWP--AEAALGTGAEPTCSQEG---KLRPEPRRDG---EAQEAASETQPL--SSPPTAA---SSKAPSSG 2077
Cdd:pfam05109 610 GGTSSTPVVTSPPknATSAVTTGQHNITSSSTssmSLRPSSISETlspSTSDNSTSHMPLltSAHPTGGeniTQVTPAST 689
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217339257 2078 SAQPPEGHPGKPEPSRAKSRPLPNMpklviPSAATKfPPEITVTPPTPtllsPKGSISEETKQKLKSAILSAQSAA 2153
Cdd:pfam05109 690 STHHVSTSSPAPRPGTTSQASGPGN-----SSTSTK-PGEVNVTKGTP----PKNATSPQAPSGQKTAVPTVTSTG 755
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1915-2154 |
2.76e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 56.01 E-value: 2.76e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1915 PTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTldqskdpgPPRPHRPEATPSMASLGPEGEELARVAEG 1994
Cdd:PRK07003 374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA--------AAAATRAEAPPAAPAPPATADRGDDAADG 445
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1995 TSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG-AEPTCSQEgklrPEPRRDGEAQEAASETQPLSSPPTAASSKA 2073
Cdd:PRK07003 446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASdAPPDAAFE----PAPRAAAPSAATPAAVPDARAPAAASREDA 521
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2074 PSSGSAQPPEGHPGKP----EPSRA--------------------KSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLS 2129
Cdd:PRK07003 522 PAAAAPPAPEARPPTPaaaaPAARAggaaaaldvlrnagmrvssdRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRA 601
|
250 260
....*....|....*....|....*
gi 2217339257 2130 PKGSiseeTKQKLKSAILSAQSAAN 2154
Cdd:PRK07003 602 RAAT----GDAPPNGAARAEQAAES 622
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1893-2166 |
1.16e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.92 E-value: 1.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1893 DTPTTPKHP---KDSRENFFPVTVVPTAP---DPVPADSVQRPSdAHTKPRPALAAATTIITCPPSASAstldQSKDPGP 1966
Cdd:PHA03378 607 EPPTTQSHIpetSAPRQWPMPLRPIPMRPlrmQPITFNVLVFPT-PHQPPQVEITPYKPTWTQIGHIPY----QPSPTGA 681
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1967 PRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPaephcwpaeaalgtgaeptcsqeGKLRPep 2046
Cdd:PHA03378 682 NTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAP-----------------------GRARP-- 736
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2047 rrdgeAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSrpLPNMPKLVIPSAATKFPPeiTVTPPTPT 2126
Cdd:PHA03378 737 -----PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQ--APPAPQQRPRGAPTPQPP--PQAGPTSM 807
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 2217339257 2127 LLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPALE 2166
Cdd:PHA03378 808 QLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALE 847
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
1912-2131 |
1.29e-06 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 53.68 E-value: 1.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1912 TVVPTAPDPVPADSVQRPSDAHTKPRPAlaaattiitcpPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARV 1991
Cdd:PRK14086 91 SAGEPAPPPPHARRTSEPELPRPGRRPY-----------EGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWP 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1992 AEGTSFPPQEPRHSPqvkmaptsSPAEPHCWPAEAAlgTGAEPTCSQEGKLRPE---PRRDGEAQEaasetqPLSSPPTA 2068
Cdd:PRK14086 160 RAADDYGWQQQRLGF--------PPRAPYASPASYA--PEQERDREPYDAGRPEydqRRRDYDHPR------PDWDRPRR 223
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217339257 2069 ASSKAP--SSGSAQPPEGHPGKPEPSRAKSRPlpnmpklVIPSAATKFP--PEITVTPPTPTL-LSPK 2131
Cdd:PRK14086 224 DRTDRPepPPGAGHVHRGGPGPPERDDAPVVP-------IRPSAPGPLAaqPAPAPGPGEPTArLNPK 284
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1890-2140 |
1.31e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.92 E-value: 1.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1890 ASGDTPTTP--KHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIitcPPSASASTLDQSKDPGPP 1967
Cdd:PHA03378 646 LVFPTPHQPpqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM---RPPAAPPGRAQRPAAATG 722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1968 RPHRPEATPSMaslgpegeelARVAEGTSFPPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEGKLRPEPR 2047
Cdd:PHA03378 723 RARPPAAAPGR----------ARPPAAAPGRARPPAAAPGRARPPAAAPG-----RARPPAAAPGAPTPQPPPQAPPAPQ 787
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2048 RdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRplpNMPKLVIPSAATKFPPEItvtpPTPtl 2127
Cdd:PHA03378 788 Q--RPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR---GRPSLKKPAALERQAAAG----PTP-- 856
|
250
....*....|...
gi 2217339257 2128 lSPKGSISEETKQ 2140
Cdd:PHA03378 857 -SPGSGTSDKIVQ 868
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1889-2115 |
1.39e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 53.73 E-value: 1.39e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1889 QASGDT-PTTPKHPKDSRENffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA-STLDQSKDPGP 1966
Cdd:PRK12323 367 QSGGGAgPATAAAAPVAQPA--PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlAAARQASARGP 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1967 PRPHRPEATPSMAslgPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEP 2046
Cdd:PRK12323 445 GGAPAPAPAPAAA---PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217339257 2047 RRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRpLPNMPKLVIPSAATKFP 2115
Cdd:PRK12323 522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG-LPDMFDGDWPALAARLP 589
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1878-2103 |
1.74e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.45 E-value: 1.74e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1878 CQVHLGAAAQRQASGDTPTTPKHPKDSRENffpvTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAST 1957
Cdd:PRK07764 582 WQVEAVVGPAPGAAGGEGPPAPASSGPPEE----AARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHV 657
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1958 LDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcWPAEAALGTGAEPTCS 2037
Cdd:PRK07764 658 AVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQ-PPQAAQGASAPSPAAD 736
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217339257 2038 QEGKLRPEPRRDGEAQEAasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMP 2103
Cdd:PRK07764 737 DPVPLPPEPDDPPDPAGA-----PAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
|
|
| TadD |
COG5010 |
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ... |
38-188 |
2.43e-06 |
|
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444034 [Multi-domain] Cd Length: 155 Bit Score: 49.57 E-value: 2.43e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 38 ALYHKALDLQKHDRFEESAKAYHELLEASLLREAVSSGDEKEGLKHPGLILKYSTYKNLAQLAAQREDLETAMEFYLEAV 117
Cdd:COG5010 2 RALEGFDRLPLYLLLLTKLRTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLEQAL 81
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217339257 118 MLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG5010 82 QLDPNNPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTS 152
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1916-2123 |
2.61e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 52.57 E-value: 2.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1916 TAPDPVPADSVQRPSDAHTKPRPALAAATTiitcPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGT 1995
Cdd:PRK12323 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAA----PPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGA 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1996 SFPPQEPRHSPQVKMAPTSSPAEPhcwpaEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASEtqpLSSPPTAASSKAPs 2075
Cdd:PRK12323 448 PAPAPAPAAAPAAAARPAAAGPRP-----VAAAAAAAPARAAPAAAPAPADDDPPPWEELPPE---FASPAPAQPDAAP- 518
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 2217339257 2076 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPP 2123
Cdd:PRK12323 519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1883-2103 |
3.62e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 52.48 E-value: 3.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1883 GAAAQRQASGDTPTTPKHPKDSREnfFPVTVVPTAPDPVPADSVQRPSDahtkprPALAAATtiitcPPSASASTLDQSK 1962
Cdd:PHA03307 76 GTEAPANESRSTPTWSLSTLAPAS--PAREGSPTPPGPSSPDPPPPTPP------PASPPPS-----PAPDLSEMLRPVG 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1963 DPGPPRPHRPEATPSMASLGPEGEE-------LARVAEGTSFPPQEPRHSPQVKMAP---TSSPAEPHCWPAEAALGTGA 2032
Cdd:PHA03307 143 SPGPPPAASPPAAGASPAAVASDAAssrqaalPLSSPEETARAPSSPPAEPPPSTPPaaaSPRPPRRSSPISASASSPAP 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2033 EPTCSQEGKLR--------PEPRRDGEAQE-------AASETQP----LSSPPTAASSKAPSSGSAQPPEGHPGKPEPSR 2093
Cdd:PHA03307 223 APGRSAADDAGasssdsssSESSGCGWGPEnecplprPAPITLPtriwEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302
|
250
....*....|
gi 2217339257 2094 AKSRPLPNMP 2103
Cdd:PHA03307 303 PGSGPAPSSP 312
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
99-188 |
3.97e-06 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 47.09 E-value: 3.97e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 99 LAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARhAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCL 178
Cdd:COG3063 1 LYLKLGDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAI-ALEKALKLDPNNAEALLNLAELLLELGDYDEAL 79
|
90
....*....|
gi 2217339257 179 YFICKALEKD 188
Cdd:COG3063 80 AYLERALELD 89
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1926-2131 |
5.47e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.69 E-value: 5.47e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1926 VQRPSDAHTKPRPALAAATTIITCPPSASASTLD-QSKDPGPPRPHRPEATPSMASLGPEGEELarvaegtsFPPQEPrh 2004
Cdd:pfam03154 174 LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpQGSPATSQPPNQTQSTAAPHTLIQQTPTL--------HPQRLP-- 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2005 SPQVKMAPTSSPAEPHCWPAEAAlgtgAEPTCSQEGKLRPEPRRDGEAQ-EAASETQPLSSPPTAASSKAPSSGSAQ--- 2080
Cdd:pfam03154 244 SPHPPLQPMTQPPPPSQVSPQPL----PQPSLHGQMPPMPHSLQTGPSHmQHPVPPQPFPLTPQSSQSQVPPGPSPAapg 319
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2081 ---------PPEGHPGKPEPSRakSRPLPNMPkLVIPSAAtkfPPEITVTPPTPTLLSPK 2131
Cdd:pfam03154 320 qsqqrihtpPSQSQLQSQQPPR--EQPLPPAP-LSMPHIK---PPPTTPIPQLPNPQSHK 373
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1883-2145 |
7.66e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.31 E-value: 7.66e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1883 GAAAQRQASGDTPTTPKHPKDSRENffpvtvvPTAPDPVPADSVQRPSdahTKPRPALAAATTIiTCPPSASASTLDQSK 1962
Cdd:pfam03154 319 GQSQQRIHTPPSQSQLQSQQPPREQ-------PLPPAPLSMPHIKPPP---TTPIPQLPNPQSH-KHPPHLSGPSPFQMN 387
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1963 DPGPPRP-----------HRPEATPSMASLGPEGEELARvaegtsfPPQEPRHSPQVKMAPTSSPAEPHcwpaeaalGTG 2031
Cdd:pfam03154 388 SNLPPPPalkplsslsthHPPSAHPPPLQLMPQSQQLPP-------PPAQPPVLTQSQSLPPPAASHPP--------TSG 452
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2032 AEPTCSQEgklrPEPRRDGEAQEAASETQPlSSPPTAASSKAPSSgsaQPPEghpgkpEPSRAKSRPLPNMPKLVIPSAA 2111
Cdd:pfam03154 453 LHQVPSQS----PFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGI---QPPS------SASVSSSGPVPAAVSCPLPPVQ 518
|
250 260 270
....*....|....*....|....*....|....*....
gi 2217339257 2112 TKFPP-----EITVTPPTPTLLSPKGSISEETKQKLKSA 2145
Cdd:pfam03154 519 IKEEAldeaeEPESPPPPPRSPSPEPTVVNTPSHASQSA 557
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
1889-2134 |
1.18e-05 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 50.83 E-value: 1.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1889 QASGDTPTTPKHPKDSrenffPVTVVP----TAPDPVPADSVQRPSDAHTKPRPaLAAATTIITCP-------PSASAST 1957
Cdd:PHA03379 407 KASEPTYGTPRPPVEK-----PRPEVPqsleTATSHGSAQVPEPPPVHDLEPGP-LHDQHSMAPCPvaqlppgPLQDLEP 480
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1958 LDQskDPGPPRPHRPEATPSMASLGP---EGEELARVAEGTSFPPQEPRHSP-QVKMAPTSSPAEPHC-WPAEAALGTGA 2032
Cdd:PHA03379 481 GDQ--LPGVVQDGRPACAPVPAPAGPivrPWEASLSQVPGVAFAPVMPQPMPvEPVPVPTVALERPVCpAPPLIAMQGPG 558
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2033 EPTCSQEGKLR---------------PEPRRDGEAQ---EAASETQPLSSPP---TAASSKAPSSGSAQPPEG-HPGKPE 2090
Cdd:PHA03379 559 ETSGIVRVRERwrpapwtpnpprspsQMSVRDRLARlraEAQPYQASVEVQPpqlTQVSPQQPMEYPLEPEQQmFPGSPF 638
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 2217339257 2091 PSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI 2134
Cdd:PHA03379 639 SQVADVMRAGGVPAMQPQYFDLPLQQPISQGAPLAPLRASMGPV 682
|
|
| NlpI |
COG4785 |
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis]; |
29-174 |
1.33e-05 |
|
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 443815 [Multi-domain] Cd Length: 223 Bit Score: 48.76 E-value: 1.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 29 KEAQEAEAFALYHKALDLQKHDRF-----EESAKAYHELLEASLLREAVSSGDEK--EGLKHPGLIlkySTYKNLAQLAA 101
Cdd:COG4785 8 LLLALALAAAAASKAAILLAALLFaavlaLAIALADLALALAAAALAAAALAAERidRALALPDLA---QLYYERGVAYD 84
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217339257 102 QREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4785 85 SLGDYDLAIADFDQALELDPDLAEAYNNRGLAYLLLGDYDAALEDFDRALELDPDYAYAYLNRGIALYYLGRY 157
|
|
| NrfG |
COG4235 |
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ... |
95-174 |
1.97e-05 |
|
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443378 [Multi-domain] Cd Length: 131 Bit Score: 46.15 E-value: 1.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 95 NLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4235 22 LLGRAYLRLGRYDEALAAYEKALRLDPDNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYLLGLAAFQQGDY 101
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1884-2137 |
2.22e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 49.78 E-value: 2.22e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1884 AAAQRQASGDTPTT-PKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPalaaattiitcPPSASASTLDQSK 1962
Cdd:PHA03307 60 AACDRFEPPTGPPPgPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPD-----------PPPPTPPPASPPP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1963 DPGPPRPH-----RPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAAlGTGAEPTCs 2037
Cdd:PHA03307 129 SPAPDLSEmlrpvGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP-PAAASPRP- 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2038 qegklrpePRRDGEAQEAASETQPlSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPpe 2117
Cdd:PHA03307 207 --------PRRSSPISASASSPAP-APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW-- 275
|
250 260
....*....|....*....|
gi 2217339257 2118 iTVTPPTPTLLSPKGSISEE 2137
Cdd:PHA03307 276 -NGPSSRPGPASSSSSPRER 294
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1911-2127 |
2.96e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.21 E-value: 2.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1911 VTVVPtAPDPVPADSVQRPSDAHTKPRPALAAATtiitcPPSASASTlDQSKDPGPPRPHRPEATPSMASLGPEGEELAR 1990
Cdd:PRK07764 584 VEAVV-GPAPGAAGGEGPPAPASSGPPEEAARPA-----APAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKH 656
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1991 VAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwpAEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASETQplsSPPTAAS 2070
Cdd:PRK07764 657 VAVPDASDGGDGWPAKAGGAAPAAPPPAP----APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP---QAAQGAS 729
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 2217339257 2071 SKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTL 2127
Cdd:PRK07764 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1884-2056 |
3.45e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.21 E-value: 3.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1884 AAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPaDSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1963
Cdd:PRK07764 622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP-DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1964 PGPPRPHRP------EATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCS 2037
Cdd:PRK07764 701 PAPAPAATPpagqadDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
|
170 180
....*....|....*....|..
gi 2217339257 2038 QEGKLRPEPRR---DGEAQEAA 2056
Cdd:PRK07764 781 EEEEMAEDDAPsmdDEDRRDAE 802
|
|
| TadD |
COG5010 |
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ... |
1-155 |
4.04e-05 |
|
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444034 [Multi-domain] Cd Length: 155 Bit Score: 46.11 E-value: 4.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1 MIRIAALNASSTIEDDHEGSFKSHKTQTKEAQEAEAFALYHKALDLQKhdRFEESAKAYHELLEAsllreavssgdekeg 80
Cdd:COG5010 21 RTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLG--DFEESLALLEQALQL--------------- 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217339257 81 lkHPGlilKYSTYKNLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNP 155
Cdd:COG5010 84 --DPN---NPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTSP 153
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1884-2103 |
4.12e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 49.08 E-value: 4.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1884 AAAQRQASGDTPTTPKHPKdsrenffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1963
Cdd:PRK07003 395 AVPAVTAVTGAAGAALAPK-------AAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERD 467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1964 PGPPRPHRPEATPSMASLGPEGEELA--------------RVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC-WPAEAAL 2028
Cdd:PRK07003 468 AQPPADSGSASAPASDAPPDAAFEPApraaapsaatpaavPDARAPAAASREDAPAAAAPPAPEARPPTPAAaAPAARAG 547
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2029 GTGAEPTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSK---------APSSGSAQPPEghPGKPEPSRAKSR-- 2097
Cdd:PRK07003 548 GAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRvavqvptprARAATGDAPPN--GAARAEQAAESRga 625
|
....*...
gi 2217339257 2098 --PLPNMP 2103
Cdd:PRK07003 626 ppPWEDIP 633
|
|
| PHA02682 |
PHA02682 |
ORF080 virion core protein; Provisional |
1910-2016 |
4.61e-05 |
|
ORF080 virion core protein; Provisional
Pssm-ID: 177464 [Multi-domain] Cd Length: 280 Bit Score: 47.55 E-value: 4.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1910 PVTVVPTAPDP-VPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHR--PEAT------PSMAS 1980
Cdd:PHA02682 76 PSGQSPLAPSPaCAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPacPPSTrqcppaPPLPT 155
|
90 100 110
....*....|....*....|....*....|....*..
gi 2217339257 1981 LGPEGEELARVAEGTSFPPQEPRHS-PQVKMAPTSSP 2016
Cdd:PHA02682 156 PKPAPAAKPIFLHNQLPPPDYPAAScPTIETAPAASP 192
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1910-2098 |
5.89e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 5.89e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1910 PVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAA-----TTIITCPPSASASTLDQSKDPGPpRPHRPEATPSMASLGPE 1984
Cdd:PHA03307 25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGaaacdRFEPPTGPPPGPGTEAPANESRS-TPTWSLSTLAPASPARE 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1985 GEELARVAEGTSFPPQ-EPRHSPqvkmAPTSSPAEPHCWPAEAALGTGAEPtcsqegklRPEPRRDGEAQEAASETQPLS 2063
Cdd:PHA03307 104 GSPTPPGPSSPDPPPPtPPPASP----PPSPAPDLSEMLRPVGSPGPPPAA--------SPPAAGASPAAVASDAASSRQ 171
|
170 180 190
....*....|....*....|....*....|....*
gi 2217339257 2064 SPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP 2098
Cdd:PHA03307 172 AALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP 206
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1950-2163 |
6.15e-05 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 48.00 E-value: 6.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1950 PPSASASTldQSKDPGPPRPHRPEATPSMASLGPEGEELAR---VAEGTSF----PPQEPRHSPQVKMAPTSSPAEPHCW 2022
Cdd:PLN03209 329 PPKESDAA--DGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVprpLSPYTAYedlkPPTSPIPTPPSSSPASSKSVDAVAK 406
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2023 PAEAALGTGAEPTCS-QEGKLRPEPR---RDGEAQEAASETQPLSSP-PTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR 2097
Cdd:PLN03209 407 PAEPDVVPSPGSASNvPEVEPAQVEAkktRPLSPYARYEDLKPPTSPsPTAPTGVSPSVSSTSSVPAVPDTAPATAATDA 486
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217339257 2098 PLPNMPKlviPSAATKFPPEITVTPPT-PTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQP 2163
Cdd:PLN03209 487 AAPPPAN---MRPLSPYAVYDDLKPPTsPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1845-2125 |
6.85e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.84 E-value: 6.85e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1845 RVERIMSETYMLIKQHLPV--KVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvp 1922
Cdd:NF033839 229 QIVALIKELDELKKQALSEidNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP-- 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1923 adsvqrpsdaHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEP 2002
Cdd:NF033839 298 ----------GMQPSPQPEKK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLE 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2003 RHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP- 2081
Cdd:NF033839 349 TPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPe 419
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 2217339257 2082 --PEGHPGKPE--PSRAKS----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2125
Cdd:NF033839 420 vkPQPEKPKPEvkPQPEKPkpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1895-2166 |
9.34e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 47.75 E-value: 9.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1895 PTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPHRPEA 1974
Cdd:PHA03378 676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA------RPPAAAPGRARPPAAAPGRARPPAA 749
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1975 TPSMA---SLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPA---EPHCWPAEAALGTGAEPTCSQEGKLRPEPRR 2048
Cdd:PHA03378 750 APGRArppAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTpqpPPQAGPTSMQLMPRAAPGQQGPTKQILRQLL 829
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2049 DGEAQEA-ASETQPLSSPPTAASSKAPSSGSA------QPPEGHPGKPEPSRAKSRplPNMPKLVIPSAATKFPPEIT-- 2119
Cdd:PHA03378 830 TGGVKRGrPSLKKPAALERQAAAGPTPSPGSGtsdkivQAPVFYPPVLQPIQVMRQ--LGSVRAAAASTVTQAPTEYTge 907
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 2217339257 2120 ---VTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLC---QPALE 2166
Cdd:PHA03378 908 rrgVGPMHPTDIPPSKRAKTDAYVESQPPHGGQSHSFSVIWENVSqgqQQTLE 960
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ... |
1963-2103 |
1.78e-04 |
|
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 46.30 E-value: 1.78e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1963 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2036
Cdd:NF040712 189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217339257 2037 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2103
Cdd:NF040712 269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
1908-2131 |
3.10e-04 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 45.72 E-value: 3.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1908 FFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiiTCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEgee 1987
Cdd:PHA03291 203 FVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST--TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPE--- 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1988 larvaegtsfppqEPRHspQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRdgeaqeaASETQPLSSPPT 2067
Cdd:PHA03291 278 -------------ASRY--ELTVTQIIQIAIPASIIACVFLGSCACCLHRRCRRRRRRPAR-------IYRPPSPVAPSI 335
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217339257 2068 AASSKAPSSGSAQPPEGHPGKPePSRAKSRPLPN-MPKLVIPSAATKFP--PEITVTPPTPTLLSPK 2131
Cdd:PHA03291 336 SAVNEAALARLGDELKRHPPES-PRRSKRRSSQTmVPSLTAISEESEAPavVELSRSPRRPGGPTAR 401
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1973-2130 |
3.33e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.02 E-value: 3.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1973 EATPSMASLGPEGEELARVAEGTSFPPqeprhspqvkmAPTSSPAEPHCWPAEAAlgtGAEPTCSQEGKLRPEPRRDGEA 2052
Cdd:PRK12323 371 GAGPATAAAAPVAQPAPAAAAPAAAAP-----------APAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAA 436
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217339257 2053 QEAASETQPLSSPPTAASSKAPSsgSAQPPEGHPGKPEPSRAKSRPLPNMPKLViPSAATKFPPEITVTPPTPTLLSP 2130
Cdd:PRK12323 437 RQASARGPGGAPAPAPAPAAAPA--AAARPAAAGPRPVAAAAAAAPARAAPAAA-PAPADDDPPPWEELPPEFASPAP 511
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
1891-2100 |
3.58e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.85 E-value: 3.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1891 SGDTPTTPKHPK-DSRENFFPVT--VVPTAPDPVPADSVQRPSDAHTkPRPALAAATTIITCPpsasasTLDQSKDPGPp 1967
Cdd:PRK10263 295 SGNRATQPEYDEyDPLLNGAPITepVAVAAAATTATQSWAAPVEPVT-QTPPVASVDVPPAQP------TVAWQPVPGP- 366
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1968 rpHRPEatPSMASlGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPR 2047
Cdd:PRK10263 367 --QTGE--PVIAP-APEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQP 441
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 2217339257 2048 RDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLP 2100
Cdd:PRK10263 442 VAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEP 494
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1928-2154 |
4.36e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 45.64 E-value: 4.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1928 RPSDAHTKPRPALAAATTIITCPPSASASTldqskdPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEP----R 2003
Cdd:PRK12323 364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPA------AAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaaR 437
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2004 HSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEgkLRPEPRrdgeaqeAASETQPLSSPPTAAsskAPSSGSAQPPE 2083
Cdd:PRK12323 438 QASARGPGGAPAPA-----PAPAAAPAAAARPAAAG--PRPVAA-------AAAAAPARAAPAAAP---APADDDPPPWE 500
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2084 GHPGK-PEPSRAKSRPLPNM--------PKLVIPSAATKFPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAAN 2154
Cdd:PRK12323 501 ELPPEfASPAPAQPDAAPAGwvaesipdPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
|
|
| PHA03381 |
PHA03381 |
tegument protein VP22; Provisional |
1901-2041 |
5.28e-04 |
|
tegument protein VP22; Provisional
Pssm-ID: 177618 [Multi-domain] Cd Length: 290 Bit Score: 44.23 E-value: 5.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1901 PKDSRENFFPVTVVPTAPDPVPAD-SVQRPSDAHTKPRPALAAAT----------TIITCPPSASASTLDQSKDPGPPRP 1969
Cdd:PHA03381 11 PHGTDEVEADVYYDFISPDASPARvSFEEPADRARRGAGQARGRSqaerrfhhydEARADYPYYTGSSSEDERPADPRPS 90
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217339257 1970 HRPEATPSM----ASLGPEGEELARVAEGTSFPPqEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTCSQEGK 2041
Cdd:PHA03381 91 RRPHAQPEAsgpgPARGARGPAGSRGRGRRAESP-SPRDPPNPKGASAPRGRKSAC-ADSAALLDAPAPAAPKRQK 164
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
92-158 |
5.70e-04 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 40.92 E-value: 5.70e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217339257 92 TYKNLAQLAAQREDLETAMEFyLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHW 158
Cdd:COG3063 28 ALNNLGLLLLEQGRYDEAIAL-EKALKLDPNNAEALLNLAELLLELGDYDEALAYLERALELDPSAL 93
|
|
| PHA03325 |
PHA03325 |
nuclear-egress-membrane-like protein; Provisional |
1953-2122 |
8.96e-04 |
|
nuclear-egress-membrane-like protein; Provisional
Pssm-ID: 223044 Cd Length: 418 Bit Score: 44.10 E-value: 8.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1953 ASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQ-----VKMAPTSSPAEPhcwPAEAA 2027
Cdd:PHA03325 259 SSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLPPppvrrPRVKHPEAGKEE---PDGAR 335
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2028 LGTGAEPTCSQEGKLRPeprrdgeAQEAASETQPLSSPPTAASSKApSSGSAQPPEGHPGKPEPSRAKSRPLPnmpklvi 2107
Cdd:PHA03325 336 NAEAKEPAQPATSTSSK-------GSSSAQNKDSGSTGPGSSLAAA-SSFLEDDDFGSPPLDLTTSLRHMPSP------- 400
|
170
....*....|....*
gi 2217339257 2108 PSAATKFPPEITVTP 2122
Cdd:PHA03325 401 SVTSAPEPPSIPLTY 415
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1919-2130 |
9.25e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 9.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1919 DPVPADSV-QRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGP-------------- 1983
Cdd:PHA03247 2452 DPFFARTIlGAPFSLSLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPailpdepvgepvhp 2531
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1984 ------EG-EELARVAEGTSFPPQEPRHSPQV--KMAPTSSPAePHcwPAEAALGTGAE----PTCSQEGKLRPEPRRDG 2050
Cdd:PHA03247 2532 rmltwiRGlEELASDDAGDPPPPLPPAAPPAApdRSVPPPRPA-PR--PSEPAVTSRARrpdaPPQSARPRAPVDDRGDP 2608
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2051 EAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHP-GKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTP--PTPTL 2127
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPP-SPSPAANEPDPHPPpTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrPRRRA 2687
|
...
gi 2217339257 2128 LSP 2130
Cdd:PHA03247 2688 ARP 2690
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1964-2101 |
1.17e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.21 E-value: 1.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1964 PGPPRPHRPEATPSMASLGPEGEelarvaegtsfPPQEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPtcsqegklr 2043
Cdd:PRK07764 396 AAAPSAAAAAPAAAPAPAAAAPA-----------AAAAPAPAAAPQPAPAPAPAPAPP-SPAGNAPAGGAP--------- 454
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 2217339257 2044 pePRRDGEAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHPGKPEPSRAKSRPLPN 2101
Cdd:PRK07764 455 --SPPPAAAPSAQPAPAPAAAPEPTAAP-APAPPAAPAPAAAPAAPAAPAAPAGADDA 509
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1898-2130 |
1.37e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 43.76 E-value: 1.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1898 PKHPkDSRENFFPVTVVPTAPD-PVPADSVQRPSDAHTKPRPA--------LAAATTIITCPPS---ASASTLDQSKDPG 1965
Cdd:PLN03209 330 PKES-DAADGPKPVPTKPVTPEaPSPPIEEEPPQPKAVVPRPLspytayedLKPPTSPIPTPPSsspASSKSVDAVAKPA 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1966 PPRPH-RPEATPSMASLGPEGEELARVAEGTSF-------PPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCS 2037
Cdd:PLN03209 409 EPDVVpSPGSASNVPEVEPAQVEAKKTRPLSPYaryedlkPPTSPSPTAPTGVSPSVSST-----SSVPAVPDTAPATAA 483
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2038 QEGKLRPEPRrdgeaqeaaseTQPLSSPPTAASSKAPSSGSaqppeghPGKPEPSRAKSRPlPNMPKLVIPSAATKFPPE 2117
Cdd:PLN03209 484 TDAAAPPPAN-----------MRPLSPYAVYDDLKPPTSPS-------PAAPVGKVAPSST-NEVVKVGNSAPPTALADE 544
|
250
....*....|...
gi 2217339257 2118 ITVTPPTPTLLSP 2130
Cdd:PLN03209 545 QHHAQPKPRPLSP 557
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1862-2099 |
1.56e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 43.71 E-value: 1.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1862 PVKVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPkhpkdsrenffpvtvvptAPDPVPADSVQRPSDAHTKPRPALA 1941
Cdd:PRK12323 392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP------------------APEALAAARQASARGPGGAPAPAPA 453
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1942 -AATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLG-PEGEELarvaegtsfpPQEPrhspqvkmaPTSSPAEP 2019
Cdd:PRK12323 454 pAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpPPWEEL----------PPEF---------ASPAPAQP 514
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2020 HCWPAEAALGTGAEPTCSQEGKLRPEPRrdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKpEPSRAKSRPL 2099
Cdd:PRK12323 515 DAAPAGWVAESIPDPATADPDDAFETLA---PAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD-WPALAARLPV 590
|
|
| TPR_12 |
pfam13424 |
Tetratricopeptide repeat; |
36-119 |
1.60e-03 |
|
Tetratricopeptide repeat;
Pssm-ID: 315987 [Multi-domain] Cd Length: 77 Bit Score: 39.29 E-value: 1.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424 3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70
|
....
gi 2217339257 116 AVML 119
Cdd:pfam13424 71 ALAL 74
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1885-2127 |
1.64e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 1.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1885 AAQRQASGDTPT-TPKHPKDSRENFFPV---------TVVPTAPDPVPADSVQRPSDAHTKPRpalaAATTIITCP-PSA 1953
Cdd:PHA03247 270 ETARGATGPPPPpEAAAPNGAAAPPDGVwgaalagapLALPAPPDPPPPAPAGDAEEEDDEDG----AMEVVSPLPrPRQ 345
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1954 SASTldqskdpGPPRPHRPEATP--SMASLGpEGEELARVAEgtsfPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG 2031
Cdd:PHA03247 346 HYPL-------GFPKRRRPTWTPpsSLEDLS-AGRHHPKRAS----LPTRKRRSARHAATPFARGPGGDDQTRPAAPVPA 413
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2032 AEPTCSQEGKLRPEPrrdgeaqeaasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKlVIPSAA 2111
Cdd:PHA03247 414 SVPTPAPTPVPASAP--------------PPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRK-ALDALR 478
|
250
....*....|....*.
gi 2217339257 2112 TKFPPEitvtPPTPTL 2127
Cdd:PHA03247 479 ERRPPE----PPGADL 490
|
|
| sucB |
TIGR01347 |
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ... |
1985-2095 |
1.95e-03 |
|
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]
Pssm-ID: 273565 [Multi-domain] Cd Length: 403 Bit Score: 42.80 E-value: 1.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1985 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2064
Cdd:TIGR01347 68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
|
90 100 110
....*....|....*....|....*....|.
gi 2217339257 2065 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2095
Cdd:TIGR01347 143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1964-2082 |
1.97e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.44 E-value: 1.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1964 PGPPRPHRPEATPSMASLGPEgeelarvAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwPAEAALGTGAEPTCSQEGKLR 2043
Cdd:PRK07764 394 PAAAAPSAAAAAPAAAPAPAA-------AAPAAAAAPAPAAAPQPAPAPAPAPAPP---SPAGNAPAGGAPSPPPAAAPS 463
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 2217339257 2044 PEPRR---DGEAQEAASETQPLSSPPTAASSKAPSSGSAQPP 2082
Cdd:PRK07764 464 AQPAPapaAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
1883-2125 |
2.07e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 43.13 E-value: 2.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1883 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1958
Cdd:COG5180 152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1959 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2024
Cdd:COG5180 227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2025 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2098
Cdd:COG5180 307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
|
250 260
....*....|....*....|....*..
gi 2217339257 2099 LPNMPKLVIPSAATKFPPeiTVTPPTP 2125
Cdd:COG5180 382 APFQPPNGAPQPGLGRRG--APGPPMG 406
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
1922-2156 |
2.26e-03 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 43.14 E-value: 2.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1922 PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARvaEGTSFPPQE 2001
Cdd:pfam03546 168 DSESSSEESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSE--ESSDSEEEA 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2002 PRHSPQVKMAPTSSPAEPHCWPAEaalGTGAEPTCSQEGKLR---PEPRRDGEAQEAASetqpLSSPPTAASSKAP---S 2075
Cdd:pfam03546 246 PAAATPAQAKPALKTPQTKASPRK---GTPITPTSAKVPPVRvgtPAPWKAGTVTSPAC----ASSPAVARGAQRPeedS 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2076 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI-----------SEETKQKLKS 2144
Cdd:pfam03546 319 SSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAVAQvkaeaqedsesSEEESDSEEA 398
|
250
....*....|..
gi 2217339257 2145 AILSAQSAANVR 2156
Cdd:pfam03546 399 AATPAQVKASGK 410
|
|
| PHA03321 |
PHA03321 |
tegument protein VP11/12; Provisional |
1888-2132 |
2.64e-03 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 223041 [Multi-domain] Cd Length: 694 Bit Score: 43.02 E-value: 2.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1888 RQASGDTPTTPKHPKDSRENFFPVT-----------VVPTAPDPVPAdSVQRPSDAHTK---PRPAlaaattiitcPPSA 1953
Cdd:PHA03321 447 RARPGSTPACARRARAQRARDAGPEyvdplgalrrlPAGAAPPPEPA-AAPSPATYYTRmggGPPR----------LPPR 515
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1954 SASTLDQSKDPGPPRPHRPEATPSmASLGPEGEELARVAEGTSFPPQEPRHSPqvkmAPTSSPaephcwPAEaALGTGAE 2033
Cdd:PHA03321 516 NRATETLRPDWGPPAAAPPEQMED-PYLEPDDDRFDRRDGAAAAATSHPREAP----APDDDP------IYE-GVSDSEE 583
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2034 PTCSQegklRPEPR----RDGEAQEAASETQPLSSPptaassKAPSSGSAQPPEGHPGKP--EPSRAKSRPLPnmpklvi 2107
Cdd:PHA03321 584 PVYEE----IPTPRvyqnPLPRPMEGAGEPPDLDAP------TSPWVEEENPIYGWGDSPlfSPPPAARFPPP------- 646
|
250 260
....*....|....*....|....*
gi 2217339257 2108 PSAATKFPPEITVTPPTPTLLSPKG 2132
Cdd:PHA03321 647 DPALSPEPPALPAHRPRPGALAPDG 671
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
34-174 |
3.30e-03 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 41.64 E-value: 3.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 34 AEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLIlkySTYKNLAQLAAQREDLETAMEFY 113
Cdd:COG2956 6 AAALGWYFKGLNYLLNGQPDKAIDLLEEALE-----------------LDPETV---EAHLALGNLYRRRGEYDRAIRIH 65
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217339257 114 LEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG2956 66 QKLLERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDPDDAEALRLLAEIYEQEGDW 126
|
|
| TPR_21 |
pfam09976 |
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat. |
48-151 |
3.54e-03 |
|
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.
Pssm-ID: 430959 [Multi-domain] Cd Length: 194 Bit Score: 41.03 E-value: 3.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 48 KHDRFEESAKAYHELLEAsllreaVSSGDEKEGL--------KHPGlilkySTYKNLAQL-----AAQREDLETAMEfYL 114
Cdd:pfam09976 32 QRSQAEEASALYQQLLEA------VAAGDAAKAQaaaaqlkdEYGG-----TGYAALAALllakaAVEAGDLAAAKA-QL 99
|
90 100 110
....*....|....*....|....*....|....*...
gi 2217339257 115 EAVMLDSTDVNLwykiGHVA-LRLIRIPLARHAFEEGL 151
Cdd:pfam09976 100 EWVADNAKDEAL----KALArLRLARVLLAQGKYDEAL 133
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
1977-2164 |
4.38e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 42.29 E-value: 4.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1977 SMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALgTGAEPTCSQEGKLRPEPRRDGEAQEAA 2056
Cdd:PHA03369 349 KTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPM-TAYPPVPQFCGDPGLVSPYNPQSPGTS 427
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2057 SETQPLSS-PPT-AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLP-NMPKLVIPSAATKFPPEITVTPPTPTLLSPKGS 2133
Cdd:PHA03369 428 YGPEPVGPvPPQpTNPYVMPISMANMVYPGHPQEHGHERKRKRGGElKEELIETLKLVKKLKEEQESLAKELEATAHKSE 507
|
170 180 190
....*....|....*....|....*....|.
gi 2217339257 2134 ISEETKQKLKSAILSAQSAANVRKESLCQPA 2164
Cdd:PHA03369 508 IKKIAESEFKNAGAKTAAANIEPNCSADAAA 538
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
1886-2018 |
4.43e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 42.07 E-value: 4.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1886 AQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA---------- 1955
Cdd:PRK14971 360 AQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVdppaavpvnp 439
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217339257 1956 -STLDQSKDPGPPRPHRPEATPSMASLGPegeelarvaeGTSFPPQEPRHSPQ--VKMAPTSSPAE 2018
Cdd:PRK14971 440 pSTAPQAVRPAQFKEEKKIPVSKVSSLGP----------STLRPIQEKAEQATgnIKEAPTGTQKE 495
|
|
| PRK12727 |
PRK12727 |
flagellar biosynthesis protein FlhF; |
1911-2125 |
4.64e-03 |
|
flagellar biosynthesis protein FlhF;
Pssm-ID: 237182 [Multi-domain] Cd Length: 559 Bit Score: 41.90 E-value: 4.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1911 VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKdpgpprPHRPEATPSMASLGpegeelAR 1990
Cdd:PRK12727 62 TPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMA------LRQPVSVPRQAPAA------AP 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1991 VAEGTSFPPQEPRHSPQVKMapTSSPAEPHCWPAEAALGTGAEPTCSQegklRPEPRRDGEAQEAASETqPLSSPPTAAS 2070
Cdd:PRK12727 130 VRAASIPSPAAQALAHAAAV--RTAPRQEHALSAVPEQLFADFLTTAP----VPRAPVQAPVVAAPAPV-PAIAAALAAH 202
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 2217339257 2071 SKAPSSGSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVTPPTP 2125
Cdd:PRK12727 203 AAYAQDDDEQLDDDGFDLDDAL-PQILPPAALPPIVVAPAAPAALAAVAAAAPAP 256
|
|
| KLF9_13_N-like |
cd21975 |
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ... |
1970-2113 |
5.69e-03 |
|
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.
Pssm-ID: 409240 [Multi-domain] Cd Length: 163 Bit Score: 39.67 E-value: 5.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1970 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2049
Cdd:cd21975 19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217339257 2050 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2113
Cdd:cd21975 99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
1968-2159 |
9.13e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 41.00 E-value: 9.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 1968 RPHRPEATPSM---ASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEgklRP 2044
Cdd:PRK07994 360 HPAAPLPEPEVppqSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQG---AT 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217339257 2045 EPRRDGEAqeAASETQPLSSPPTAASSKAPssgSAQPPEGHPGKPEPSRAKSR-PLPNMPKLVIPSAATKFPPEITVTPP 2123
Cdd:PRK07994 437 KAKKSEPA--AASRARPVNSALERLASVRP---APSALEKAPAKKEAYRWKATnPVEVKKEPVATPKALKKALEHEKTPE 511
|
170 180 190
....*....|....*....|....*....|....*....
gi 2217339257 2124 TPTLLSPKGSISE---ETKQKLKSAILSAQSAANVRKES 2159
Cdd:PRK07994 512 LAAKLAAEAIERDpwaALVSQLGLPGLVEQLALNAWKEE 550
|
|
|